Cloudera Educational Services

Upcoming Sessions

See All Upcoming Sessions

This three-day hands-on training course delivers the key concepts and expertise developers need to optimize the performance of their Apache Spark applications. During the course, participants will learn how to identify common sources of poor performance in Spark applications, techniques for avoiding or solving them, and best practices for Spark application monitoring. Optimizing Apache Spark Applications presents the architecture and concepts behind Apache Spark and underlying data platform, then builds on this foundational understanding by teaching students how to tune Spark application code. The course format emphasizes instructor-led demonstrations illustrate both performance issues and the techniques that address them, followed by hands-on exercises that give students an opportunity to practice what they've learned through an interactive notebook environment. Download full course description   This course is designed for software developers, engineers, and data scientists who have experience developing Spark applications and want to learn how to improve the performance of their code. This is not an introduction to Spark. Spark examples and hands-on exercises are presented in Python and the ability to program in this language is required. Basic familiarity with the Linux command line is assumed. Basic knowledge of SQL is helpful.   DATE: April 27-29, 2026 Virtual Classroom, EMEA 9:00 - 17:00 (GMT+2 TIMEZONE) Read more

This four-day course teaches the architecture, deployment, and configuration of Cloudera Data Services on Embedded Containerized Services (ECS). Cloudera Data Services provide a state-of-the-art, low- code platform that unifies the entire data lifecycle, reducing development costs and accelerating the development and deployment of use cases.   The course starts by covering best practices for managing Docker images and containers. Students will then build a Docker private registry. This Docker private registry will be used to deploy a Data Services cluster on ECS. Students will install, configure, and validate Cloudera Data Engineering, Cloudera Data Warehouse, and Cloudera Machine Learning. Through hands-on exercises, students will gain experience with Kubernetes, install a Private Cloud Embedded Container Service (ECS), and deploy Cloudera Data Services. Additionally, the course covers networking and hardware requirements and explains how Kubernetes pods dynamically scale to support Cloudera Data Services. Who should take this course This immersive course is designed for Cloudera Administrators transitioning to managing Cloudera Data Services on premises. Students should have at least 3 to 5 years of system administration experience. Students must have proficiency in the Linux Command Line Interface and knowledge of Identity Management, including Transport Layer Security and Kerberos. Familiarity with SQL select statements is recommended. Prior experience with Cloudera products is required. Students need reliable internet access to connect to the Amazon Web Services environment used in this course. Recommended prerequisite courses • ADMIN-230: Administering Cloudera on premises • ADMIN-332: Securing Cloudera on premises   DATE: March 30 - April 2, 2026 Virtual Classroom, EMEA 9:00 - 17:00 (GMT+2 TIMEZONE) Read more

This four-day instructor-led course begins by introducing Apache Kafka, explaining its key concepts and architecture, and discussing several common use cases. Building on this foundation, you will learn how to plan a Kafka deployment, and then gain hands-on experience by installing and configuring your own cloud-based, multi-node cluster running Kafka on the Cloudera Data Platform (CDP). You will then use this cluster during more than 20 hands-on exercises that follow, covering a range of essential skills, starting with how to create Kafka topics, producers, and consumers, then continuing through progressively more challenging aspects of Kafka operations and development, such as those related to scalability, reliability, and performance problems. Throughout the course, you will learn and use Cloudera’s recommended tools for working with Kafka, including Cloudera Manager, Schema Registry, Streams Messaging Manager, and Cruise Control. DATE: March 30 - April 2, 2026 Virtual Classroom, EMEA 9:00 - 17:00 (GMT+3 TIMEZONE) Read more

About This Training Generative AI (GenAI) and Large Language Models (LLMs) are extremely powerful new tools that are changing every industry. To fully take advantage of GenAI and LLMs, these new capabilities need to be combined with your existing enterprise data. This two-day course teaches how to use Cloudera AI to train, augment, fine tune, and host LLMs to create powerful enterprise AI solutions. What Skills You Will Gain Through lecture and Hands-On exercises, you will learn how to: Select the right LLM model for a use case Configure a Prompt for an LLM Use Retrieval Augmented Generation (RAG) Fine Tune an LLM Model with Enterprise Data Use the AI Model Registry and host an LLM Create an AI Agent with Crew AI Who Should Take This Course This course is designed for data scientists and machine learning engineers who need to understand how to utilize Cloudera AI to leverage the full power of their enterprise data, generative AI, and Large Language Models and deliver powerful business solutions.   DATE: March 24-25, 2026 Virtual Classroom, EMEA 9:00 - 17:00 (GMT+2 TIMEZONE) Read more

Welcome to our Introduction Series for Cloudera Education. The following contains excerpts from Admin:230 Administering Cloudera on premises, that is part of our full OnDemand Training Library.  The complete library is available for purchase. Upon completion you will see a path to continue your Cloudera training journey. Many courses include hands-on labs and the OnDemand library comes with 100 hands-on lab hours to practice the concepts and exercises taught. Please enjoy this section of your selected course to help you on your data journey. Disclaimer - The following descriptions and objectives are for the full course. Overview Lab environment is included with this course. Starting in lesson 04 you will be able to launch your environment. This course presents detailed explanation, comprehensive theory, key skills, and recommended practices for successful platform administration. Upon completion of this course a Cloudera Administrator will learn the full range of functionality and capability of Cloudera Manager. This course provides an in-depth explanation and skills to become highly productive with Cloudera Manager and the Cloudera platform. Cloudera Manager is a full featured and mature DevOps tool. It is used to install, configure, operate, troubleshoot, report, and upgrade Cloudera. Many Cloudera Administrators only use a fraction of the capabilities built into Cloudera Manager. This course teaches the architecture, deployment, configuration, logging, reporting, REST API, and much more. The course provides references for architecture and recommended practices used by enterprises around the globe. What to expect While this course is an entry point for aspiring Cloudera Administrators this course is detailed enough for more senior Cloudera Administrators to discover new functionality and capabilities. This course is intended for Linux Administrators who are taking up roles as Platform Administrators. We recommend a minimum of 2 years of system administration experience in industry. Students must have proficiency in Linux. Knowledge of Directory Services, Transport Layer Security, Kerberos, and SQL select statements is helpful. Students must have access to the Internet to reach Amazon Web Services. Read more

Welcome to our Introduction Series for Cloudera Education. The following contains excerpts from DSCI:272 - Predicting with MLOps on Cloudera AI, that is part of our full OnDemand Training Library. The complete library is available for purchase. Upon completion you will see a path to continue your Cloudera training journey. Many courses include hands-on labs and the OnDemand library comes with 100 hands-on lab hours to practice the concepts and exercises taught. Please enjoy this section of your selected course to help you on your data journey. Disclaimer - The following descriptions and objectives are for the full course.   Overview Enterprise data science teams need collaborative access to business data, tools, and computing resources required to develop and deploy machine learning workflows. Cloudera AI, part of the Cloudera platform, provides the solution, giving data science teams the required resources. This course covers machine learning workflows and operations using Cloudera AI. Participants explore, visualize, and analyze data. You will also train, evaluate, and deploy machine learning models. The course walks through an end-to-end data science and machine learning workflow based on realistic scenarios and datasets from a fictitious technology company. The demonstrations and exercises are conducted in Python (with PySpark) using Cloudera AI. ​ Course Length This course includes approximately 8.5 hours of video lectures and demonstrations.  You will need your own environment to work on the labs. The labs will take approximately 7 hours to complete.   What to expect The course is designed for data scientists who need to understand how to utilize Cloudera AI and the Cloudera platform to achieve faster model development and deliver production machine learning at scale. Data engineers, developers, and solution architects who collaborate with data scientists will also find this course valuable. Read more

Shopping Cart

Your cart is empty