Upcoming Sessions
-
April
15
ILT - ADMIN-230: Administrating Cloudera Data Platform - 4308489
Starting:2025/04/15 @ 01:00 AM (GMT+00:00) UTCEnding:2025/04/18 @ 09:00 AM (GMT+00:00) UTCType:Multi-day Session -
April
22
ILT - ADMIN-230: Administrating Cloudera Data Platform - 4306325
Starting:2025/04/22 @ 01:00 AM (GMT+00:00) UTCEnding:2025/04/25 @ 09:00 AM (GMT+00:00) UTCType:Multi-day Session
See All Upcoming Sessions

Designing Edge to AI Applications is a 4-day learning event that addresses advanced big data architecture topics for building edge to AI applications to cover streaming, operational data processing, analytics, and machine learning. The workshop brings together technical contributors into a group setting to design and architect solutions to a challenging business problem. The workshop addresses big data architecture problems in general, and then applies them to the design of a challenging system. Throughout the highly interactive workshop, participants apply concepts to real-world examples resulting in detailed synergistic discussions. The workshop is conducive for participants to learn techniques for architecting big data systems, not only from Cloudera’s experience but also from the experiences of fellow participants. More specifically, this workshop addresses advanced big data architecture topics, including, data formats, transformation, transactions, real-time, batch and machine learning processing, scalability, fault tolerance, security, and privacy, minimizing the risk of an unsound architecture and technology selection. What you'll learn Cloudera Data Platform Big Data Architecture Building Scalable applications Building Fault Tolerant Solutions Security and Privacy Deployment on Public, Private, and Hybrid Cloud What to expect Participants should mainly be architects, developer team leads, big data developers, data engineers, senior analysts, dev ops admins and machine learning developers who are working on big data or streaming applications and have an interest in how to design and develop such applications on CDP. To gain the most from the workshop, participants should have working knowledge of popular Big Data and streaming technologies such as HDFS, Spark, Kafka, Hive/Impala, Data Formats, and relational database management systems. Detailed API level knowledge is not needed, as there will not be any programming activities and instead the focus will be on architecture design. The workshop will be divided into small groups to discuss the problems, develop solutions, and present their solutions. September 8-11, 2025 Virtual Classroom, EMEA 9:00 - 17:00 (GMT+2 TIMEZONE) Read more

This course helps customers use Cloudera Data Platform to address data governance tasks, motivated by the need for compliance with regulations such as the European Union's General Data Protection Regulation (GDPR) and the United State's Health Insurance Portability and Accountability Act (HIPAA). What you'll learn Through instructor-led discussion, demonstrations, and hands-on exercises, you will learn how to: Identify which tools in Cloudera Data Platform (CDP) to use for key data governance activities Organize data objects using classifications and business glossary terms Find access history for data objects and policies Use Data Catalog Profilers in CDP to assist in organizing data objects Use Data Catalog to foster collaboration with colleagues View and interpret a data object's lineage Create and apply resource- and tag-based access control policies Create policies for data masking and row-level filtering What to expect This course is best suited for data stewards and others who are responsible for, or have an interest in, implementing regulatory compliance or performing typical data governance activities using the Cloudera Data Platform. Familiarity with basic data governance concepts is helpful, but not required. June 23-24, 2025 Virtual Classroom, EMEA 9:00 - 17:00 (GMT+2 TIMEZONE) Read more

Apache Ozone is the next-generation hybrid storage service offering versatility and out-of-the-box compatibility. Ozone is an object storage format exceeding the limitations of HDFS. This course teaches architecture, internal operations, installation, file system usage, best practices, security, maintenance, monitoring, tuning and testing. Download full course description What you'll learn This course teaches the Ozone internal architecture and how to install, use, maintain, monitor, tune, integrate, and test the the Ozone service in a secure environment. Participants will gain the following skills: Understanding the Benefits of Using Ozone Installing and Configuring Secure Ozone Managing Files and Objects in Ozone Performance Tuning and Doing Baseline Tests Controlling Replication and Understanding Failover and Recovery Performing Maintenance Tasks Monitoring Ozone Using Recon Service Integrating Hive, Impala, Spark, Nifi, and Flink with Ozone Migrating Data from HDFS to Ozone What to expect This advanced course is for administrators who are currently using CDP Private Cloud Base. The course will appeal to technicians, such as data engineers and applications developers, who are migrating data and applications to Apache Ozone. Prior experience of Cloudera Data Platform, to include HDFS, YARN, and Hive, is expected. Students must have access to the Internet to reach the classroom environments, which are located on Amazon Web Services. June 23-26, 2025 Virtual Classroom, EMEA 9:00 - 17:00 (GMT+2 TIMEZONE) Read more

December 8-10, 2025 Virtual Classroom, EMEA 9:00 - 17:00 (GMT+2 TIMEZONE) Read more

September 22-24, 2025 Virtual Classroom, EMEA 9:00 - 17:00 (GMT+2 TIMEZONE) Read more

This four-day hands-on training course delivers the key concepts and knowledge developers need to use Apache Spark to develop high-performance, parallel applications on the Cloudera Data Platform (CDP). Hands-on exercises allow students to practice writing Spark applications that integrate with CDP core components. Participants will learn how to use Spark SQL to query structured data, how to use Hive features to ingest and denormalize data, and how to work with “big data” stored in a distributed file system. After taking this course, participants will be prepared to face real-world challenges and build applications to execute faster decisions, better decisions, and interactive analysis, applied to a wide variety of use cases, architectures, and industries. Download full course description What you'll learn During this course, you will learn how to: Distribute, store, and process data in a CDP cluster Write, configure, and deploy Apache Spark applications Use the Spark interpreters and Spark applications to explore, process, and analyze distributed data Query data using Spark SQL, DataFrames, and Hive tables Deploy a Spark application on the Data Engineering Service What to expect This course is designed for developers and data engineers. All students are expected to have basic Linux experience, and basic proficiency with either Python or Scala programming languages. Basic knowledge of SQL is helpful. Prior knowledge of Spark and Hadoop is not required. November 17-20, 2025 Virtual Classroom, EMEA 9:00 - 17:00 (GMT+2 TIMEZONE) Read more
Shopping Cart
Your cart is empty