Cloudera Educational Services

You searched free. There are 29 items matching your criteria. Reset search

Welcome to our Introduction Series for Cloudera Education. The following contains excerpts from a course that is part of our full OnDemand Training Library. The complete library is available for purchase. Upon completion you will see a path to continue your Cloudera training journey. Many courses include hands-on labs and the OnDemand library comes with 100 hands-on lab hours to practice the concepts and exercises taught. Please enjoy this section of your selected course to help you on your data journey. Disclaimer - The following descriptions and objectives are for the full course. About This Course This course provides the fundamental concepts and experience necessary to automate the ingest, flow, transformation, and egress of data using Apache NiFi. Along with gaining a grasp of the key features, concepts, and benefits of NiFi, participants will create and run NiFi dataflows for a variety of scenarios. Students will gain expertise using processors, connections, and process groups, and will use NiFi Expression Language to control the flow of data from various sources to multiple destinations. Participants will monitor dataflows, examine progress of data through a dataflow, and connect dataflows to external systems such as Kafka and HDFS. After taking this course, participants will have key knowledge and expertise for configuring and managing data ingestion, movement, and transformation scenarios for the enterprise. Course Length This module includes 4 hours of video content. Hands-on exercises will take approximately 9 hours. Audience and Prerequisites This course is designed for Developers, Data Engineers, Data Scientists, and Data Stewards. It provides a no-code, graphical approach to configuring real-time data streaming, ingestion, and management solutions for a variety of use cases. Though programming experience is not required, basic experience with Linux is presumed. Exposure to big data concepts and applications is helpful. Read more

Welcome to our Introduction Series for Cloudera Education. The following contains excerpts from a course that is part of our full OnDemand Training Library. The complete library is available for purchase. Upon completion you will see a path to continue your Cloudera training journey. Many courses include hands-on labs and the OnDemand library comes with 100 hands-on lab hours to practice the concepts and exercises taught. Please enjoy this section of your selected course to help you on your data journey. Disclaimer - The following descriptions and objectives are for the full course. About This Course Cloudera University’s Data Analyst Training course will teach you to apply traditional data analytics and business intelligence skills to big data. This course presents the tools data professionals need to access, manipulate, transform, and analyze complex data sets using SQL and familiar scripting languages. Apache Hive makes transformation and analysis of complex, multi-structured data scalable in Cloudera environments. Apache Impala enables real-time interactive analysis of the data stored in Hadoop using a native SQL environment. Together, they make multi-structured data accessible to analysts, database administrators, and others without Java programming expertise. Course Length This course includes 7 hours of video content, plus 2 hours of exercise review. Hands-on exercises will take approximately 10.5 hours. Audience and Prerequisites This course is designed for data analysts, business intelligence specialists, developers, system architects, and database administrators. Some knowledge of SQL is assumed, as is basic Linux command-line familiarity. Prior knowledge of Apache Hadoop is not required. Read more

Welcome to our Introduction Series for Cloudera Education. The following contains excerpts from a course that is part of our full OnDemand Training Library. The complete library is available for purchase. Upon completion you will see a path to continue your Cloudera training journey. Many courses include hands-on labs and the OnDemand library comes with 100 hands-on lab hours to practice the concepts and exercises taught. Please enjoy this section of your selected course to help you on your data journey. Disclaimer - The following descriptions and objectives are for the full course. About This Course This course delivers the key concepts and knowledge developers need to use Apache Spark to develop high-performance, parallel applications on the Cloudera Data Platform (CDP). Practice writing spark applications that integrate with CDP core components. Participants will learn how to use Spark SQL to query structured data, how to use Hive features to ingest and denormalize data, and how to work with big data stored in a distributed file system. Course Length This course includes approximately 9.5 hours of video lectures, demonstrations, and exercises. In order to complete the self-paced exercises for this course, students must have access to CDP through their organization. Audience and Prerequisites This course is designed for developers and data engineers. Students are expected to have basic Linux experience, and basic proficiency with either Python or Scala programming languages. Basic knowledge of SQL is helpful. Prior knowledge of Spark and Hadoop is not required. Read more

About This Course This video demonstration consists of building a serverless website where one can upload a receipt picture. Following the upload to S3, an AWS Lambda function is triggered and a NiFi slow is executed to process the image and extract useful information using AWS Textract. The information is then sent to a database to serve an expense report solution.  Audience and Prerequisites This OnDemand course is suitable for data engineers and data analysts.   Read more

About This Course This course is part of the Skillup series. Learn how typical AI-Centric' challenges are addressed with CDP Machine Learning. This course includes 40 minutes of video content including a demonstration on customer churn. Audience and Prerequisites This OnDemand course is suitable for data engineers, data analysts, developers, and data scientists    Read more

About This Course DataGen by Francois Risch. In this course, we will give a walkthrough of installation and how to use the DataGen, a tool that generates data on all services provided by Cloudera (HDFS, Hive, HBase, Ozone, Kafka, Kudu, SolR, Local files), in any format (CSV, JSON, Avro, Parquet, ORC). Course Length This course includes 1 hour of video content. Audience and Prerequisites This OnDemand course is suitable for administrators that need to generate data.       Read more

Shopping Cart

Your cart is empty