Introducing - Cloudera DataFlow: Flow Management with Apache NiFi

Difficulty
Basic

Rating

Course Length
40 mins

Instructor
OnDemand Moderation

Price
Free

Description

Welcome to our Introduction Series for Cloudera Education. The following contains excerpts from a course that is part of our full OnDemand Training Library. The complete library is available for purchase. Upon completion you will see a path to continue your Cloudera training journey. Many courses include hands-on labs and the OnDemand library comes with 100 hands-on lab hours to practice the concepts and exercises taught. Please enjoy this section of your selected course to help you on your data journey.

Disclaimer - The following descriptions and objectives are for the full course.

About This Course

This course provides the fundamental concepts and experience necessary to automate the ingest, flow, transformation, and egress of data using Apache NiFi.

Along with gaining a grasp of the key features, concepts, and benefits of NiFi, participants will create and run NiFi dataflows for a variety of scenarios. Students will gain expertise using processors, connections, and process groups, and will use NiFi Expression Language to control the flow of data from various sources to multiple destinations. Participants will monitor dataflows, examine progress of data through a dataflow, and connect dataflows to external systems such as Kafka and HDFS. After taking this course, participants will have key knowledge and expertise for configuring and managing data ingestion, movement, and transformation scenarios for the enterprise.

Course Length

This module includes 4 hours of video content. Hands-on exercises will take approximately 9 hours.

Audience and Prerequisites

This course is designed for Developers, Data Engineers, Data Scientists, and Data Stewards. It provides a no-code, graphical approach to configuring real-time data streaming, ingestion, and management solutions for a variety of use cases. Though programming experience is not required, basic experience with Linux is presumed. Exposure to big data concepts and applications is helpful.

Objectives

Students who successfully complete this course will be able to:

  • Understand the role of Apache NiFi and MiNiFi in the Cloudera DataFlow platform
  • Describe NiFi’s architecture, including standalone and clustered configurations
  • Use key features, including FlowFiles, processors, process groups, controllers, and connections, to define a NiFi dataflow
  • Navigate, configure dataflows, and use dataflow information with the NiFi User Interface
  • Trace the life of data, its origin, transformation, and destination, using data provenance
  • Organize and simplify dataflows
  • Manage dataflow versions using the NiFi Registry
  • Use the NiFi Expression Language to control dataflows
  • Implement dataflow optimization methods and available monitoring and reporting features
  • Techniques for optimizing Hive and Impala queries
  • Connect dataflows with other systems, such as Kafka and HDFS
  • Describe aspects of NiFi security

 
Added 9 days ago, by Auruba
 
Added 10 days ago, by María Alejandra
 
Added 15 days ago, by Anonymous
It would be extremely interesting to have access to Cloudera's own "Templates" such as those available on the Internet, for example in Apache, or to delve into examples of each processor such as available documentation.
 
Added 28 days ago, by Ninnette
 
Added about 1 month ago, by Jose
 
Added about 1 month ago, by sergio
 
Added 2 months ago, by Anonymous
 
Added 3 months ago, by Billy
 
Added 3 months ago, by Trung
This course helps me a lots in basic learning related to Apache Nifi
 
Added 3 months ago, by Pawel

Shopping Cart

Your cart is empty