Upcoming Sessions
-
June
23
ILT - DENG-254: Preparing with Cloudera Data Engineering and Apache Spark - 5063293 - public Chinese
Starting:2026/06/23 @ 09:00 AM BeijingEnding:2026/06/26 @ 05:00 PM Beijing -
June
23
ILT - ADMIN-332: Securing Cloudera on premises - 5025813 - public APAC
Starting:2026/06/23 @ 09:30 AM SingaporeEnding:2026/06/26 @ 05:30 PM Singapore
See All Upcoming Sessions
About This Training Generative AI (GenAI) and Large Language Models (LLMs) are extremely powerful new tools that are changing every industry. To fully take advantage of GenAI and LLMs, these new capabilities need to be combined with your existing enterprise data. This two-day course teaches how to use Cloudera AI to train, augment, fine tune, and host LLMs to create powerful enterprise AI solutions. What Skills You Will Gain Through lecture and Hands-On exercises, you will learn how to: Select the right LLM model for a use case Configure a Prompt for an LLM Use Retrieval Augmented Generation (RAG) Fine Tune an LLM Model with Enterprise Data Use the AI Model Registry and host an LLM Create an AI Agent with Crew AI Who Should Take This Course This course is designed for data scientists and machine learning engineers who need to understand how to utilize Cloudera AI to leverage the full power of their enterprise data, generative AI, and Large Language Models and deliver powerful business solutions. DATE: July 7-8, 2026 9:30 - 17:30 (GMT+8 TIMEZONE) Virtual Classroom, APAC Read more
DO NOT START THIS CERTIFICATION EXAM HERE! Once you have been enrolled, you will receive an email with additional instructions to schedule your exam. If you click through this course and complete it you will not receive the email. Read more
This four-day instructor-led course begins by introducing Apache Kafka, explaining its key concepts and architecture, and discussing several common use cases. Building on this foundation, you will learn how to plan a Kafka deployment, and then gain hands-on experience by installing and configuring your own cloud-based, multi-node cluster running Kafka on the Cloudera Data Platform (CDP). You will then use this cluster during more than 20 hands-on exercises that follow, covering a range of essential skills, starting with how to create Kafka topics, producers, and consumers, then continuing through progressively more challenging aspects of Kafka operations and development, such as those related to scalability, reliability, and performance problems. Throughout the course, you will learn and use Cloudera’s recommended tools for working with Kafka, including Cloudera Manager, Schema Registry, Streams Messaging Manager, and Cruise Control. DATE: August 17-20, 2026 9:00 - 17:00 (GMT+2 TIMEZONE) Virtual Classroom, EMEA Read more
This course helps customers use Cloudera Data Platform to address data governance tasks, motivated by the need for compliance with regulations such as the European Union's General Data Protection Regulation (GDPR) and the United State's Health Insurance Portability and Accountability Act (HIPAA). What you'll learn Through instructor-led discussion, demonstrations, and hands-on exercises, you will learn how to: Identify which tools in Cloudera Data Platform (CDP) to use for key data governance activities Organize data objects using classifications and business glossary terms Find access history for data objects and policies Use Data Catalog Profilers in CDP to assist in organizing data objects Use Data Catalog to foster collaboration with colleagues View and interpret a data object's lineage Create and apply resource- and tag-based access control policies Create policies for data masking and row-level filtering What to expect This course is best suited for data stewards and others who are responsible for, or have an interest in, implementing regulatory compliance or performing typical data governance activities using the Cloudera Data Platform. Familiarity with basic data governance concepts is helpful, but not required. DATE: September 8-9, 2026 9:00 - 17:00 (GMT+2 TIMEZONE) Virtual Classroom, EMEA Read more
This course introduces Apache Iceberg, a high-performance open table format for organizing petabyte-scale analytic datasets on a file system or object store, available on Cloudera Data Warehouse and Cloudera Data Engineering on both Private and Public Cloud. Combined with Cloudera Data Platform, Iceberg can enable users to build an open data lakehouse architecture for multi-function analytics and to deploy large-scale end-to-end pipelines. This course covers various aspects of Apache Iceberg, such as benefits, architecture, internal operation, read and write operations, and advanced functions, all while drawing comparisons to Hive and building on the students’ existing knowledge and experience. DATE: September 8-11, 2026 9:00 - 17:00 (GMT+2 TIMEZONE) Virtual Classroom, EMEA Read more
Apache Ozone is the next-generation hybrid storage service offering versatility and out-of-the-box compatibility. Ozone is an object storage format exceeding the limitations of HDFS. This course teaches architecture, internal operations, installation, file system usage, best practices, security, maintenance, monitoring, tuning and testing. Download full course description What you'll learn This course teaches the Ozone internal architecture and how to install, use, maintain, monitor, tune, integrate, and test the the Ozone service in a secure environment. Participants will gain the following skills: Understanding the Benefits of Using Ozone Installing and Configuring Secure Ozone Managing Files and Objects in Ozone Performance Tuning and Doing Baseline Tests Controlling Replication and Understanding Failover and Recovery Performing Maintenance Tasks Monitoring Ozone Using Recon Service Integrating Hive, Impala, Spark, Nifi, and Flink with Ozone Migrating Data from HDFS to Ozone What to expect This advanced course is for administrators who are currently using CDP Private Cloud Base. The course will appeal to technicians, such as data engineers and applications developers, who are migrating data and applications to Apache Ozone. Prior experience of Cloudera Data Platform, to include HDFS, YARN, and Hive, is expected. Students must have access to the Internet to reach the classroom environments, which are located on Amazon Web Services. DATE: August 24-27, 2026 9:00 - 17:00 (GMT+2 TIMEZONE) Virtual Classroom, EMEA Read more
Shopping Cart
Your cart is empty