Cloudera Educational Services

Upcoming Sessions

See All Upcoming Sessions

This course helps customers use Cloudera Data Platform to address data governance tasks, motivated by the need for compliance with regulations such as the European Union's General Data Protection Regulation (GDPR) and the United State's Health Insurance Portability and Accountability Act (HIPAA). What you'll learn Through instructor-led discussion, demonstrations, and hands-on exercises, you will learn how to: Identify which tools in Cloudera Data Platform (CDP) to use for key data governance activities Organize data objects using classifications and business glossary terms Find access history for data objects and policies Use Data Catalog Profilers in CDP to assist in organizing data objects Use Data Catalog to foster collaboration with colleagues View and interpret a data object's lineage Create and apply resource- and tag-based access control policies Create policies for data masking and row-level filtering What to expect This course is best suited for data stewards and others who are responsible for, or have an interest in, implementing regulatory compliance or performing typical data governance activities using the Cloudera Data Platform. Familiarity with basic data governance concepts is helpful, but not required. [DATE: October TBD 2024] Virtual Classroom, [APAC] 9:00 - 17:00 (Australia EAST TIMEZONE) Read more

About This Course Join Director, Product Management Tejnadh Reddy Paila for a full 1.5-hour session and demo on Cloudera's new Observability product to monitor and optimize Cloudera deployments across hybrid cloud.  Read more

About This Course In this video you will learn how Cloudera enables key hybrid capabilities like application portability, and data replication in order to quickly move workloads and data to the cloud. Let's explore how Cloudera Data Platform (CDP) excels at following hybrid use cases, through data pipeline replication and data pipeline migration exercises to the public cloud. Develop Once and Run Anywhere De-risk Cloud Migration This Course contains demonstrations showing how to add Private Base clusters as Classic Clusters and use CDP replication Manager to migrate HDFS data,Hive tables to the cloud Provider of your choice. This course includes 30 minutes of video content including demonstrations. Audience and Prerequisites This OnDemand course is suitable for CDP Administrators, data administrators, and data operators. Read more

This three-day hands-on training course delivers the key concepts and expertise developers need to improve the performance of their Apache Spark applications. During the course, participants will learn how to identify common sources of poor performance in Spark applications, techniques for avoiding or solving them, and best practices for Spark application monitoring. Apache Spark Application Performance Tuning presents the architecture and concepts behind Apache Spark and underlying data platform, then builds on this foundational understanding by teaching students how to tune Spark application code. The course format emphasizes instructor-led demonstrations illustrate both performance issues and the techniques that address them, followed by hands-on exercises that give students an opportunity to practice what they've learned through an interactive notebook environment. The course applies to Spark 2.4, but also introduces the Spark 3.0 Adaptive Query Execution framework.   Read more

This four-day hands-on training course delivers the key concepts and knowledge developers need to use Apache Spark to develop high-performance, parallel applications on the Cloudera Data Platform (CDP).  Hands-on exercises allow students to practice writing Spark applications that integrate with CDP core components. Participants will learn how to use Spark SQL to query structured data, how to use Hive features to ingest and denormalize data, and how to work with “big data” stored in a distributed file system. After taking this course, participants will be prepared to face real-world challenges and build applications to execute faster decisions, better decisions, and interactive analysis, applied to a wide variety of use cases, architectures, and industries. Download full course description  What you'll learn During this course, you will learn how to: Distribute, store, and process data in a CDP cluster Write, configure, and deploy Apache Spark applications Use the Spark interpreters and Spark applications to explore, process, and analyze distributed data Query data using Spark SQL, DataFrames, and Hive tables Deploy a Spark application on the Data Engineering Service What to expect This course is designed for developers and data engineers. All students are expected to have basic Linux experience, and basic proficiency with either Python or Scala programming languages. Basic knowledge of SQL is helpful.  Prior knowledge of Spark and Hadoop is not required. 5th-8th nov 2024 Virtual Classroom EMEA 9:00 - 17:00 CST Read more

DO NOT START THIS CERTIFICATION EXAM HERE! Once you have been enrolled, you will receive an email with additional instructions to schedule your exam. Read more

Shopping Cart

Your cart is empty