Upcoming Sessions
-
March
31
ILT - ADMIN-335: Administering Data Services on premises - 4899024 - public
Starting:2026/03/31 @ 09:30 AM SingaporeEnding:2026/04/03 @ 05:30 PM Singapore -
April
1
ILT - Cloudera Custom Training - CHAMP-138: Championing Kubernetes - 4935383 - AMER 3
Starting:2026/04/01 @ 09:00 AM Central Time (US & Canada)Ending:2026/04/01 @ 05:00 PM Central Time (US & Canada)
See All Upcoming Sessions
Cloudera is a fully integrated edge to AI product set. Cloudera Manager is purposely built as the DevOps tooling for building and managing the Cloudera platform. This four-day hands-on course presents detailed explanation, comprehensive theory, key skills, and recommended practices for successful platform administration. Upon completion of this course a Cloudera Administrator will learn the full range of functionality and capability of Cloudera Manager. DATE: April 21-24, 2026 9:00 - 17:00 (Beijing TIMEZONE) Virtual Classroom, APAC - in Mandarin language Read more
Overview This three-day hands-on training course delivers the key concepts and expertise developers need to optimize the performance of their Apache Spark applications. During the course, participants will learn how to identify common sources of poor performance in Spark applications, techniques for avoiding or solving them, and best practices for Spark application monitoring. Optimizing Apache Spark Applications presents the architecture and concepts behind Apache Spark and underlying data platform, then builds on this foundational understanding by teaching students how to tune Spark application code. The course format emphasizes instructor-led demonstrations illustrate both performance issues and the techniques that address them, followed by hands-on exercises that give students an opportunity to practice what they've learned through an interactive notebook environment. Download full course description What You'll Learn Students who successfully complete this course will be able to: Understand Apache Spark's architecture, job execution, and how techniques such as lazy execution and pipelining can improve runtime performance Evaluate the performance characteristics of core data structures such as RDD and DataFrames Select the file formats that will provide the best performance for your application Identify and resolve performance problems caused by data skew Use partitioning, bucketing, and join optimizations to improve SparkSQL performance Understand the performance overhead of Python-based RDDs, DataFrames, and user-defined functions Take advantage of caching for better application performance Understand how the Catalyst and Tungsten optimizers work Understand how Workload XM can help troubleshoot and proactively monitor Spark applications performance Learn how the Adaptive Query Execution engine improves performance What to Expect This course is designed for software developers, engineers, and data scientists who have experience developing Spark applications and want to learn how to improve the performance of their code. This is not an introduction to Spark. Spark examples and hands-on exercises are presented in Python and the ability to program in this language is required. Basic familiarity with the Linux command line is assumed. Basic knowledge of SQL is helpful. DATE: May 26-28, 2026 9:00 - 17:00 (CEST TIMEZONE) Virtual Classroom, EMEA Read more
Cloudera is a fully integrated edge to AI product set. Cloudera Manager is purposely built as the DevOps tooling for building and managing the Cloudera platform. This four-day hands-on course presents detailed explanation, comprehensive theory, key skills, and recommended practices for successful platform administration. Upon completion of this course a Cloudera Administrator will learn the full range of functionality and capability of Cloudera Manager. DATE: April 20-23, 2026 9:00 - 17:00 (EST TIMEZONE) Virtual Classroom, AMER Read more
DATES: 19 May/26 May/2 June/9 June/16 June/23 June/30 June/7 July/14 July, 2026 9:00 - 17:00 (IST TIMEZONE) Virtual Classroom, APAC Read more
This three-day course provides participants with a comprehensive understanding of the Cloudera platform and its integrated services, including Cloudera Data Warehouse, Cloudera Data Engineering, Cloudera Data Flow, and Cloudera AI. Participants will gain hands-on experience in designing, implementing, and optimizing data workflows and analytics solutions within the Cloudera ecosystem. The course emphasizes practical strategies for building scalable, secure, and efficient data-driven solutions tailored to enterprise needs. Key topics include data ingestion and processing, stream management, query optimization, machine learning integration, and managing resource performance in production environments. Download full course description DATE: May 19-21, 2026 9:30 - 17:30 (SGT TIMEZONE) Virtual Classroom, APAC Read more
This course introduces Apache Iceberg, a high-performance open table format for organizing petabyte-scale analytic datasets on a file system or object store, available on Cloudera Data Warehouse and Cloudera Data Engineering on both Private and Public Cloud. Combined with Cloudera Data Platform, Iceberg can enable users to build an open data lakehouse architecture for multi-function analytics and to deploy large-scale end-to-end pipelines. This course covers various aspects of Apache Iceberg, such as benefits, architecture, internal operation, read and write operations, and advanced functions, all while drawing comparisons to Hive and building on the students’ existing knowledge and experience. DATE: May 12-15, 2026 9:30 - 17:30 (SGT TIMEZONE) Virtual Classroom, APAC Read more
Shopping Cart
Your cart is empty