Cloudera Educational Services

Upcoming Sessions

This course introduces Apache Iceberg, a high-performance open table format for organizing petabyte-scale analytic datasets on a file system or object store, available on Cloudera Data Warehouse and Cloudera Data Engineering on both Private and Public Cloud. Combined with Cloudera Data Platform, Iceberg can enable users to build an open data lakehouse architecture for multi-function analytics and to deploy large-scale end-to-end pipelines. This course covers various aspects of Apache Iceberg, such as benefits, architecture, internal operation, read and write operations, and advanced functions, all while drawing comparisons to Hive and building on the students’ existing knowledge and experience.
[DATE: March 11-14, 2025] Virtual Classroom, [APAC] 9:00 - 17:00 (SGP TIMEZONE)

One of the most critical functions of a data-driven enterprise is the ability to manage ingest and data flow across complex ecosystems. Does your team have the tools and skill sets to succeed at this? Apache NiFi and this four-day course provide the fundamental concepts and experience necessary to automate the ingress, flow, transformation, and egress of data using NiFi. The course also covers tuning, troubleshooting, and monitoring the dataflow process, as well as how to integrate a dataflow within the Cloudera CDP Hybrid ecosystem and with external systems.
What you'll learn
During this course, you will learn how to:
- Define, configure, organize, and manage dataflows
- Transform and trace data as it flows to its destination
- Track changes to dataflows with NiFi Registry
- Use the NiFi Expression Language to control dataflows
- Optimize dataflows for better performance and maintainability
- Connect dataflows with other systems, such as Apache Kafka, Apache Hive, and HDFS
- Utilize the Data Flow Service
What to expect
This course is designed for developers, data engineers, administrators, and others with an interest in learning NiFi’s innovative no-code, graphical approach to data ingest. Although programming experience is not required, basic experience with Linux is presumed, and previous exposure to big data concepts and applications is helpful.
[DATE: February 24-27, 2025] Virtual Classroom, [AMER] 9:00 - 17:00 (CST TIMEZONE)

This four-day hands-on training course delivers the key concepts and knowledge developers need to use Apache Spark to develop high-performance, parallel applications on the Cloudera Data Platform (CDP). Hands-on exercises allow students to practice writing Spark applications that integrate with CDP core components. Participants will learn how to use Spark SQL to query structured data, how to use Hive features to ingest and denormalize data, and how to work with “big data” stored in a distributed file system. After taking this course, participants will be prepared to face real-world challenges and build applications that support faster decisions, better decisions, and interactive analysis across a wide variety of use cases, architectures, and industries.
What you'll learn
During this course, you will learn how to:
- Distribute, store, and process data in a CDP cluster
- Write, configure, and deploy Apache Spark applications
- Use the Spark interpreters and Spark applications to explore, process, and analyze distributed data
- Query data using Spark SQL, DataFrames, and Hive tables
- Deploy a Spark application on the Data Engineering Service
What to expect
This course is designed for developers and data engineers. All students are expected to have basic Linux experience and basic proficiency with either the Python or Scala programming language. Basic knowledge of SQL is helpful. Prior knowledge of Spark and Hadoop is not required.
[DATE: March 10-13, 2025] Virtual Classroom, [AMER] 9:00 - 17:00 (PST TIMEZONE)

[DATE: October 7-10, 2025] Virtual Classroom, [APAC] 9:00 - 17:00 (Beijing TIMEZONE)

[DATE: July 15-18, 2025] Virtual Classroom, [APAC] 9:00 - 17:00 (Beijing TIMEZONE)

[DATE: March 18-21, 2025] Virtual Classroom, [APAC] 9:00 - 17:00 (Beijing TIMEZONE)
