Upcoming Sessions
-
April
15
ILT - Cloudera Custom Training - CHAMP-138: Championing Kubernetes - 4935388 - AMER 4
Starting:2026/04/15 @ 09:00 AM Central Time (US & Canada)Ending:2026/04/15 @ 05:00 PM Central Time (US & Canada) -
April
20
ILT - DSCI-273: Generative AI on Cloudera - 4983641 - public APAC - PCS
Starting:2026/04/20 @ 09:30 AM SingaporeEnding:2026/04/21 @ 05:30 PM Singapore
See All Upcoming Sessions
DATE: May 12, 2026 9:00 - 17:00 (SGT TIMEZONE) Virtual Classroom, APAC Read more
DATE: June 25, 2026 9:00 - 17:00 (CEST TIMEZONE) Virtual Classroom, EMEA Read more
DATE: May 7, 2026 9:00 - 17:00 (CEST TIMEZONE) Virtual Classroom, EMEA Read more
DO NOT START THIS CERTIFICATION EXAM HERE! Once you have been enrolled, you will receive an email with additional instructions to schedule your exam. If you click through this course and complete it you will not receive the email. Read more
This four-day hands-on training course delivers the key concepts and knowledge developers need to use Apache Spark to develop high-performance, parallel applications on the Cloudera Data Platform. Hands-on exercises allow students to practice writing Spark applications that integrate with Cloudera Data Platform core components. Participants will learn how to use Spark SQL to query structured data, how to use Hive features to ingest and denormalize data, and how to work with “big data” stored in a distributed file system. After taking this course, participants will be prepared to face real-world challenges and build applications to execute faster decisions, better decisions, and interactive analysis, applied to a wide variety of use cases, architectures, and industries. Download full course description What you'll learn During this course, you will learn how to: Distribute, store, and process data in a cluster Write, configure, and deploy Apache Spark applications Use the Spark interpreters and Spark applications to explore, process, and analyze distributed data Query data using Spark SQL, DataFrames, and Hive tables Deploy a Spark application on the Data Engineering Service What to expect This course is designed for developers and data engineers. All students are expected to have basic Linux experience, and basic proficiency with either Python or Scala programming languages. Basic knowledge of SQL is helpful. Prior knowledge of Spark and Hadoop is not required. DATE: May 11-14, 2026 9:00 - 17:00 (CEST TIMEZONE) Virtual Classroom, EMEA Read more
This three-day hands-on training course delivers the key concepts and expertise developers need to optimize the performance of their Apache Spark applications. During the course, participants will learn how to identify common sources of poor performance in Spark applications, techniques for avoiding or solving them, and best practices for Spark application monitoring. Optimizing Apache Spark Applications presents the architecture and concepts behind Apache Spark and underlying data platform, then builds on this foundational understanding by teaching students how to tune Spark application code. The course format emphasizes instructor-led demonstrations illustrate both performance issues and the techniques that address them, followed by hands-on exercises that give students an opportunity to practice what they've learned through an interactive notebook environment. Download full course description This course is designed for software developers, engineers, and data scientists who have experience developing Spark applications and want to learn how to improve the performance of their code. This is not an introduction to Spark. Spark examples and hands-on exercises are presented in Python and the ability to program in this language is required. Basic familiarity with the Linux command line is assumed. Basic knowledge of SQL is helpful. DATE: April 22-24, 2026 9:30 - 17:30 (SGT TIMEZONE) Virtual Classroom, APAC Read more
Shopping Cart
Your cart is empty