Upcoming Sessions
-
February
24
ILT - ADMIN-230: Administrating Cloudera Data Platform - 4216825
Starting:2025/02/24 @ 06:00 AM (GMT+00:00) UTCEnding:2025/02/27 @ 02:00 PM (GMT+00:00) UTCType:Multi-day Session -
February
24
ILT - DOPS-242: Ingesting with Cloudera DataFlow - 4241109
Starting:2025/02/24 @ 03:00 PM (GMT+00:00) UTCEnding:2025/02/27 @ 11:00 PM (GMT+00:00) UTCType:Multi-day Session
See All Upcoming Sessions

DESCRIPTION [DATE: March 18-21, 2025] Virtual Classroom, [AMER] 9:00 - 17:00 (MST TIMEZONE) Read more

One of the most critical functions of a data-driven enterprise is the ability to manage ingest and data flow across complex ecosystems. Does your team have the tools and skill sets to succeed at this? Apache NiFi and this four-day course provides the fundamental concepts and experience necessary to automate the ingress, flow, transformation, and egress of data using NiFi. The course also covers tuning, troubleshooting, and monitoring the dataflow process as well as how to integrate a dataflow within the Cloudera CDP Hybrid ecosystem and external systems. Download full course description What you'll learn During this course, you learn how to: Define, configure, organize, and manage dataflows Transform and trace data as it flows to its destination Track changes to dataflows with NiFi Registry Use the NiFi Expression Language to control dataflows Optimize dataflows for better performance and maintainability Connect dataflows with other systems, such as Apache Kafka, Apache Hive, and HDFS Utilize the Data Flow Service What to expect This course is designed for developers, data engineers, administrators, and others with an interest in learning NiFi’s innovative no-code, graphical approach to data ingest. Although programming experience is not required, basic experience with Linux is presumed, and previous exposure to big data concepts and applications is helpful. [DATE: March 4-7, 2025] Virtual Classroom, [AMER] 9:00 - 17:00 (MST TIMEZONE) Read more

This four-day Analyzing with Data Warehouse course will teach you to apply traditional data analytics and business intelligence skills to big data. This course presents the tools data professionals need to access, manipulate, transform, and analyze complex data sets using SQL and familiar scripting languages. Download full course description What you'll learn Through instructor-led discussion and interactive, hands-on exercises, participants will navigate the ecosystem, learning how to: Use Apache Hive and Apache Impala to access data through queries Identify distinctions between Hive and Impala, such as differences in syntax, data formats, and supported features Write and execute queries that use functions, aggregate functions, and subqueries Use joins and unions to combine datasets Create, modify, and delete tables, views, and databases Load data into tables and store query results Select file formats and develop partitioning schemes for better performance Use analytic and windowing functions to gain insight into their data Store and query complex or nested data structures Process and analyze semi-structured and unstructured data Optimize and extend the capabilities of Hive and Impala Determine whether Hive, Impala, an RDBMS, or a mix of these is the best choice for a given task Utilize the benefits of CDP Public Cloud Data Warehouse What to expect This course is designed for data analysts, business intelligence specialists, developers, system architects, and database administrators. Some knowledge of SQL is assumed, as is basic Linux command-line familiarity. [DATE: March 18-21, 2025] Virtual Classroom, [EMEA] 9:00 - 17:00 (CET TIMEZONE) Read more

This course introduces Apache Iceberg, a high-performance open table format for organizing petabyte-scale analytic datasets on a file system or object store, available on Cloudera Data Warehouse and Cloudera Data Engineering on both Private and Public Cloud. Combined with Cloudera Data Platform, Iceberg can enable users to build an open data lakehouse architecture for multi-function analytics and to deploy large-scale end-to-end pipelines. This course covers various aspects of Apache Iceberg, such as benefits, architecture, internal operation, read and write operations, and advanced functions, all while drawing comparisons to Hive and building on the students’ existing knowledge and experience. [DATE: March 11-14, 2025] Virtual Classroom, [APAC] 9:00 - 17:00 (SGP TIMEZONE) Read more

One of the most critical functions of a data-driven enterprise is the ability to manage ingest and data flow across complex ecosystems. Does your team have the tools and skill sets to succeed at this? Apache NiFi and this four-day course provides the fundamental concepts and experience necessary to automate the ingress, flow, transformation, and egress of data using NiFi. The course also covers tuning, troubleshooting, and monitoring the dataflow process as well as how to integrate a dataflow within the Cloudera CDP Hybrid ecosystem and external systems. Download full course description What you'll learn During this course, you learn how to: Define, configure, organize, and manage dataflows Transform and trace data as it flows to its destination Track changes to dataflows with NiFi Registry Use the NiFi Expression Language to control dataflows Optimize dataflows for better performance and maintainability Connect dataflows with other systems, such as Apache Kafka, Apache Hive, and HDFS Utilize the Data Flow Service What to expect This course is designed for developers, data engineers, administrators, and others with an interest in learning NiFi’s innovative no-code, graphical approach to data ingest. Although programming experience is not required, basic experience with Linux is presumed, and previous exposure to big data concepts and applications is helpful. [DATE: February 24-27, 2025] Virtual Classroom, [AMER] 9:00 - 17:00 (Central TIMEZONE) Read more

This four-day hands-on training course delivers the key concepts and knowledge developers need to use Apache Spark to develop high-performance, parallel applications on the Cloudera Data Platform (CDP). Hands-on exercises allow students to practice writing Spark applications that integrate with CDP core components. Participants will learn how to use Spark SQL to query structured data, how to use Hive features to ingest and denormalize data, and how to work with “big data” stored in a distributed file system. After taking this course, participants will be prepared to face real-world challenges and build applications to execute faster decisions, better decisions, and interactive analysis, applied to a wide variety of use cases, architectures, and industries. Download full course description What you'll learn During this course, you will learn how to: Distribute, store, and process data in a CDP cluster Write, configure, and deploy Apache Spark applications Use the Spark interpreters and Spark applications to explore, process, and analyze distributed data Query data using Spark SQL, DataFrames, and Hive tables Deploy a Spark application on the Data Engineering Service What to expect This course is designed for developers and data engineers. All students are expected to have basic Linux experience, and basic proficiency with either Python or Scala programming languages. Basic knowledge of SQL is helpful. Prior knowledge of Spark and Hadoop is not required. [DATE: March 10-13, 2025] Virtual Classroom, [AMER] 9:00 - 17:00 (Central TIMEZONE) Read more
Shopping Cart
Your cart is empty