Cloudera Educational Services

Upcoming Sessions

See All Upcoming Sessions

By purchasing and enrolling in this course, you will have access for one year to all our OnDemand training content. That includes the following courses, along with any new courses we add during your subscription. Before you purchase, you can find descriptions of these courses in our store. Please note: You might not be able to access your courses immediately; the system can take up to  30 minutes after purchase is complete to process the new subscription. Please be patient for this process. Administrator Training: CDP Private Cloud Base Apache NiFi Anti-Patterns AWS Fundamentals for CDP Public Cloud CDP Data Governance with SDX: Implementing Regulatory Compliance on the Cloudera Data Platform CDP Data Visualization Training CDP Public Cloud Administration Cloudera Data Analyst Training Cloudera Data Platform: CDP for CDH Administrators Cloudera Data Platform: CDP for HDP Administrators Cloudera Data Science Workbench Training Cloudera DataFlow: Flow Management with Apache NiFi Cloudera Essentials for CDP Cloudera Machine Learning Training Cloudera Operational Database Fundamentals Cloudera Search Training Cloudera Training for Apache HBase Data Warehousing in Cloudera Data Platform Developer Training for Apache Spark and Hadoop Introduction to Apache Kudu Introduction to Cloudera Data Warehouse: Self-Service Analytics in the Cloud with CDP Introduction to Cloudera Machine Learning Just Enough Git Just Enough Python Just Enough Scala Streaming Processing, Management, and Analytics with CDF, including: Cloudera DataFlow: Data-in-Motion Overview Apache Kafka Basics Developing Apache Kafka Client Applications with Java   Cloudera Streams Messaging Manager Cloudera Schema Registry Managing Apache Kafka Clusters with Cloudera Manager Cloudera Kafka Security Apache Kafka Connect Read More

Note: Enrolling here will not give you access to the actual course. The course is available by purchasing the Full OnDemand Library subscription.  About This Course Cloudera's OnDemand training course for CDP Private Cloud Base provides the fundamental knowledge necessary to carry out the planning, provisioning, configuration, monitoring, and management tasks required of an administrator for the Cloudera Data Platform (CDP) as a bare metal deployment or as the base for a Private Cloud deployment. Course Length This course includes over 7 hours of video content. Hands-on exercises will take approximately 9.5 hours.  Audience and Prerequisites This course is best suited to systems administrators who have at least basic Linux experience. Prior knowledge of CDP, nor earlier platforms such as Cloudera’s CDH or Hortonworks HDP, is not required. Note: If you do have experience with CDH or HDP as an administrator, you might prefer Cloudera Data Platform: CDP for CDH Administrators or Cloudera Data Platform: CDP for HDP Administrators. Note: Enrolling here will not give you access to the actual course. The course is available by purchasing the Full OnDemand Library subscription.  Read More

Note: Enrolling here will not give you access to the actual module. The module is available by purchasing the Full OnDemand Library subscription.  About This Module This module is an introduction to Apache Kafka. For more about Kafka, see Streaming Processing, Management, and Analytics with CDF. This module is included as part of that learning path.  Course Length This module includes 40 minutes of video content. Hands-on exercises will take approximately 15 minutes.  Audience and Prerequisites This module is designed for Data Engineers, Administrators, and others who want to understand stream processing administration, configuration, and applications within CDF.  Though programming experience is not required, code samples are provided in Java, and basic experience with Linux is presumed. Exposure to big data concepts and applications is helpful. Note: Enrolling here will not give you access to the actual module. The module is available by purchasing the Full OnDemand Library subscription.  Read More

Note: Enrolling here will not give you access to the actual module. The module is available by purchasing the Full OnDemand Library subscription.  About This Module This module provides training on Kafka Connect, which enables streaming integration between Kafka and other systems. Course Length This module includes 45 minutes of video content.  Audience and Prerequisites This module is designed for Data Engineers, Administrators, and others who want to understand stream processing administration, configuration, and applications within CDF.  Though programming experience is not required, code samples are provided in Java, and basic experience with Linux is presumed. Exposure to big data concepts and applications is helpful. Note: Enrolling here will not give you access to the actual module. The module is available by purchasing the Full OnDemand Library subscription.  Read More

About This Module During this series, Mark Payne, a Principal Software Engineer at Cloudera and co-creator of Apache NiFi, will explain several common ways that people use NiFi incorrectly or inefficiently. After explaining the weaknesses of each approach, Mark then shows how to improve those flows to make better use of NiFi's design and architecture.   Part 1: Flows Overview examines a flow that splits and rejoins data, treats structured/semi-structured data as unstructured text, and blurs the line between FlowFile content and attributes. Part 2: Flow Layout illustrates how a disorganized dataflow can make it difficult to understand and maintain. Mark shares tips for laying out the dataflow to make it clean, simple, and easy for others to follow. Part 3: Load Balancing explains how to make your dataflows more scalable by balancing the load across a cluster of nodes. Mark also references his Cloudera technical blog post that shows how NiFi can process more than one billion events per second. Part 4: Scheduling covers scheduling and concurrency anti-patterns. Mark discusses common problems related to thread pools, scheduling processors, and how to configure settings for best performance. Part 5: Primary Node Only looks at the primary node and how it is sometimes misused. Please note: This course does not award a course completion certificate. Module Length This course includes 1 hour of video content.   Read More

About This Course The cloud has transformed the way that organizations manage their infrastructure. The revolutionary new Cloudera Data Platform (CDP) natively supports major cloud providers such as Amazon Web Services (AWS) and Microsoft Azure, enabling our customers to effortlessly deploy and scale workloads while protecting data and keeping costs under control. Despite the many benefits the cloud makes possible, getting started with the cloud can be challenging. Not only are there multiple cloud providers, the fact that AWS alone offers more than 175 distinct services illustrates how simply finding the relevant services can be an overwhelming task. This course was designed to solve exactly this problem. During this course, you'll learn the key concepts behind cloud infrastructure, including the benefits, tradeoffs, and costs associated with running your workloads in the cloud. You'll also learn about the specific services, such as EC2 and S3, that are relevant to running Cloudera's products and services on AWS. Please note: This course does not award a certificate of completion. Course Length This course includes 2.5 hours of video content. Hands-on exercises will require approximately 2.5 hours to complete. Audience and Prerequisites This course is best suited to system administrators and architects with basic Linux command line knowledge and an understanding of Cloudera cluster deployment. Read More

Shopping Cart

Your cart is empty