Cloudera Educational Services

Upcoming Sessions

Note: Enrolling here will not give you access to the actual course. The course is available by purchasing the Full OnDemand Library subscription.

About This Course
Whether you’re building big data applications, developing data pipelines, or working on machine learning projects, it’s essential to manage changes to your code. Although developers and data scientists have employed a variety of tools for this over the years, an open source version control system called Git has emerged as the standard tool for thousands of organizations around the world. This course introduces students to the Git version control system through a series of lectures, demonstrations, and hands-on exercises.

Course Length
This module includes over an hour of video content. Hands-on exercises may take an additional 3 hours.

Audience and Prerequisites
This course is best suited to developers and data scientists who feel comfortable performing basic operations from the Linux command line. No prior experience with Git or other revision control systems is necessary.

Note: Enrolling here will not give you access to the actual course. The course is available by purchasing the Full OnDemand Library subscription.

About This Course
This course provides the fundamental concepts and experience necessary to automate the ingest, flow, transformation, and egress of data using Apache NiFi. Along with gaining a grasp of the key features, concepts, and benefits of NiFi, participants will create and run NiFi dataflows for a variety of scenarios. Students will gain expertise using processors, connections, and process groups, and will use NiFi Expression Language to control the flow of data from various sources to multiple destinations. Participants will monitor dataflows, examine progress of data through a dataflow, and connect dataflows to external systems such as Kafka and HDFS. After taking this course, participants will have key knowledge and expertise for configuring and managing data ingestion, movement, and transformation scenarios for the enterprise.

Course Length
This module includes 4 hours of video content. Hands-on exercises will take approximately 9 hours.

Audience and Prerequisites
This course is designed for Developers, Data Engineers, Data Scientists, and Data Stewards. It provides a no-code, graphical approach to configuring real-time data streaming, ingestion, and management solutions for a variety of use cases. Though programming experience is not required, basic experience with Linux is presumed. Exposure to big data concepts and applications is helpful.

Note: Enrolling here will not give you access to the actual course. The course is available by purchasing the Full OnDemand Library subscription.

About This Course
This course is an introduction to CDP Data Visualization, a fully integrated visualization layer across CDP experiences and form factors. DataViz is built for business users to provide an easy-access, out-of-the-box, self-service ability to quickly draw insights and take action from fast-moving and vast historic data sets. Through recorded demonstrations and instructional information, the course guides you through the complete workflow of using CDP DataViz, from connecting to data to sharing interactive applications with a wider audience. This course includes hands-on exercises, but an exercise environment is not provided. You must have your own access to CDP (Public Cloud, Private Cloud, or the on-premises Private Cloud Base) to complete the exercises.

Course Length
This module includes 1 hour, 40 minutes of video content. Hands-on exercises will take approximately 1.5 hours.

Audience and Prerequisites
This module is designed for business analysts, data analysts, data scientists, line-of-business users, and other data enthusiasts. The course has no formal prerequisites, but it is assumed that learners have a basic familiarity with tabular data and related concepts. Learners will benefit from having some experience with SQL. To follow along with the demonstrations and to complete the suggested exercises, the learner must have access to a CDP environment equipped to run CDP Data Visualization in either of its form factors (inside CDW or inside CML) or access to a CDSW instance in which they can run CDP Data Visualization.

Note: Enrolling here will not give you access to the actual course. The course is available by purchasing the Full OnDemand Library subscription.

About This Course
This module is an introduction to the modern data warehouse with big data, including an overview of the data warehousing functions of Cloudera Data Platform. It provides a foundation for more detailed learning in how to use those functions.

Course Length
This module includes 60 minutes of video content. Hands-on exercises will take approximately 45 minutes.

Audience and Prerequisites
This module is designed for database administrators, data architects, and data engineers. The module design assumes knowledge of traditional relational databases, but this is not strictly required.

Note: Enrolling here will not give you access to the actual course. The course is available by purchasing the Full OnDemand Library subscription.

About This Module
This module addresses how to manage security concerns when using Apache Kafka. For more about Kafka, see Streaming Processing, Management, and Analytics with CDF. This module is included as part of that learning path.

Course Length
This module includes 40 minutes of video content. Hands-on exercises will take approximately 2.25 hours.

Audience and Prerequisites
This module is designed for Data Engineers, Administrators, and others who want to understand stream processing administration, configuration, and applications within CDF. Though programming experience is not required, code samples are provided in Java, and basic experience with Linux is presumed. Exposure to big data concepts and applications is helpful.

Note: Enrolling here will not give you access to the actual course. The course is available by purchasing the Full OnDemand Library subscription.

About This Module
This module introduces Schema Registry and shows you how to use it with Apache NiFi. For more about streaming, see Streaming Processing, Management, and Analytics with CDF. This module is included as part of that learning path.

Course Length
This module includes 1.25 hours of video content. Hands-on exercises will take approximately 1 hour.

Audience and Prerequisites
This module is designed for Data Engineers, Administrators, and others who want to understand stream processing administration, configuration, and applications within CDF. Though programming experience is not required, code samples are provided in Java, and basic experience with Linux is presumed. Exposure to big data concepts and applications is helpful.
