Cloudera Education

Upcoming Sessions

August

10

ILT - DENG-254: Preparing with Cloudera Data Engineering and Apache Spark - 5114313 - public APAC

Starting:
2026/08/10 @ 09:30 AM Singapore

Ending:
2026/08/13 @ 05:30 PM Singapore
August

10

ILT - DENG-254: Preparing with Cloudera Data Engineering and Apache Spark - 5118228 - public EMEA

Starting:
2026/08/10 @ 09:00 AM Budapest

Ending:
2026/08/13 @ 05:00 PM Budapest

See All Upcoming Sessions

ILT - DANA-262: Analyzing with Cloudera Data Warehouse - 5163139

This four-day Analyzing with Data Warehouse course will teach you to apply traditional data analytics and business intelligence skills to big data. This course presents the tools data professionals need to access, manipulate, transform, and analyze complex data sets using SQL and familiar scripting languages. Download full course description What you'll learn Through instructor-led discussion and interactive, hands-on exercises, participants will navigate the ecosystem, learning how to: Use Apache Hive and Apache Impala to access data through queries Identify distinctions between Hive and Impala, such as differences in syntax, data formats, and supported features Write and execute queries that use functions, aggregate functions, and subqueries Use joins and unions to combine datasets Create, modify, and delete tables, views, and databases Load data into tables and store query results Select file formats and develop partitioning schemes for better performance Use analytic and windowing functions to gain insight into their data Store and query complex or nested data structures Process and analyze semi-structured and unstructured data Optimize and extend the capabilities of Hive and Impala Determine whether Hive, Impala, an RDBMS, or a mix of these is the best choice for a given task Utilize the benefits of CDP Public Cloud Data Warehouse What to expect This course is designed for data analysts, business intelligence specialists, developers, system architects, and database administrators. Some knowledge of SQL is assumed, as is basic Linux command-line familiarity. DATE: November 23-26, 2026 9:00 - 17:00 (CET TIMEZONE) Virtual Classroom, EMEA Read more

ILT - Cloudera Training for Apache HBase - 5160908

Overview Take your knowledge to the next level with Cloudera Training for Apache HBase. Cloudera Educational Services’ three-day training course enables participants to store and access massive quantities of multi-structured data and perform hundreds of thousands of operations per second. Download full course description Hands-on Hadoop Through instructor-led discussion and interactive, hands-on exercises, participants will navigate the Hadoop ecosystem, learning topics such as: The use cases and usage occasions for HBase, Hadoop, and RDBMS Using the HBase shell to directly manipulate HBase tables Designing optimal HBase schemas for efficient data storage and recovery How to connect to HBase using the Java API to insert and retrieve data in real time Best practices for identifying and resolving performance bottlenecks Audience and prerequisites This course is appropriate for developers and administrators who intend to use HBase. Prior experience with databases and data modeling is helpful, but not required. Knowledge of Java is assumed. Prior knowledge of Hadoop is not required, but Cloudera Developer Training for Spark and Hadoop provides an excellent foundation for this course. DATE: October 7-9, 2026 9:30 - 17:30 (SGT TIMEZONE) Virtual Classroom, APAC Read more

ILT - ADMIN-234: Administering Apache HBase - 5160903

About This Training Apache HBase is a distributed, scalable, NoSQL database designed for real-time read/write access to large datasets. Built on top of HDFS, it brings low-latency random access to Hadoop-scale data. This course covers HBase architecture, data modeling, read/write internals, deployment, high availability, tuning, security, troubleshooting, and advanced topics like Phoenix, HBCK2, and YCSB benchmarking. What Skills You Will Gain Understanding HBase architecture and its role in the Cloudera Operational Database Deploying and configuring HBase clusters for high availability Designing effective HBase schemas for scalable, real-time workloads Analyzing and optimizing HBase write and read paths Tuning HBase performance through memory management, caching, and compaction Securing HBase with authorization policies in Ranger Monitoring and troubleshooting clusters using HBCK2 and diagnostic tools Performing data backup, recovery, and cluster migration Querying HBase tables using Apache Phoenix and its advanced features Benchmarking performance with the YCSB tool Managing medium-sized objects (MOBs) efficiently Who Should Take This Course? This course is designed for administrators and data engineers who manage or support Apache HBase deployments in production environments. It is also valuable for DevOps professionals involved in performance tuning, monitoring, and troubleshooting databases. Prior experience with HDFS and ZooKeeper is recommended. Students must have Internet access to connect to the hands-on lab environments. ___________________________________________________________________ DATE: September 2-4, 2026 9:30 - 17:30 (SGT TIMEZONE) Virtual Classroom, APAC Read more

ILT - ADMIN-335: Administering Data Services on premises - 5160901

This four-day course teaches the architecture, deployment, and configuration of Cloudera Data Services on Embedded Containerized Services (ECS). Cloudera Data Services provide a state-of-the-art, low- code platform that unifies the entire data lifecycle, reducing development costs and accelerating the development and deployment of use cases. The course starts by covering best practices for managing Docker images and containers. Students will then build a Docker private registry. This Docker private registry will be used to deploy a Data Services cluster on ECS. Students will install, configure, and validate Cloudera Data Engineering, Cloudera Data Warehouse, and Cloudera Machine Learning. Through hands-on exercises, students will gain experience with Kubernetes, install a Private Cloud Embedded Container Service (ECS), and deploy Cloudera Data Services. Additionally, the course covers networking and hardware requirements and explains how Kubernetes pods dynamically scale to support Cloudera Data Services. Who should take this course This immersive course is designed for Cloudera Administrators transitioning to managing Cloudera Data Services on premises. Students should have at least 3 to 5 years of system administration experience. Students must have proficiency in the Linux Command Line Interface and knowledge of Identity Management, including Transport Layer Security and Kerberos. Familiarity with SQL select statements is recommended. Prior experience with Cloudera products is required. Students need reliable internet access to connect to the Amazon Web Services environment used in this course. Recommended prerequisite courses • ADMIN-230: Administering Cloudera on premises • ADMIN-332: Securing Cloudera on premises DATE: September 15-18, 2026 9:30 - 17:30 (SGT TIMEZONE) Virtual Classroom, APAC Read more

ILT - DSCI-272: Predicting with MLOps on Cloudera AI - 5160850

Enterprise data science teams need collaborative access to business data, tools, and computing resources required to develop and deploy machine learning workflows. Cloudera AI, part of the Cloudera platform, provides the solution, giving data science teams the required resources. This four-day course covers machine learning workflows and operations using Cloudera AI. Participants explore, visualize, and analyze data. You will also train, evaluate, and deploy machine learning models. The course walks through an end-to-end data science and machine learning workflow based on realistic scenarios and datasets from a fictitious technology company. The demonstrations and exercises are conducted in Python (with PySpark) using Cloudera AI. Download full course description DATE: August 25-28, 2026 9:30 - 17:30 (SGT TIMEZONE) Virtual Classroom, APAC Read more

ILT - DENG-255: Building an Open Data Lakehouse using Apache Iceberg - 5151317

This course introduces Apache Iceberg, a high-performance open table format for organizing petabyte-scale analytic datasets on a file system or object store, available on Cloudera Data Warehouse and Cloudera Data Engineering on both Private and Public Cloud. Combined with Cloudera Data Platform, Iceberg can enable users to build an open data lakehouse architecture for multi-function analytics and to deploy large-scale end-to-end pipelines. This course covers various aspects of Apache Iceberg, such as benefits, architecture, internal operation, read and write operations, and advanced functions, all while drawing comparisons to Hive and building on the students’ existing knowledge and experience. DATE: August 17-20, 2026 9:00 - 17:00 (CEST TIMEZONE) Virtual Classroom, EMEA Read more

Your cart is empty

Upcoming Sessions

10

ILT - DENG-254: Preparing with Cloudera Data Engineering and Apache Spark - 5114313 - public APAC

10

ILT - DENG-254: Preparing with Cloudera Data Engineering and Apache Spark - 5118228 - public EMEA

See All Upcoming Sessions

Shopping Cart