Upcoming Sessions
-
December
10
ILT - Apache Spark Application Performance Tuning Workshop - 3704807
Starting:2024/12/10 @ 08:00 AM (GMT+00:00) UTCEnding:2024/12/12 @ 04:00 PM (GMT+00:00) UTCType:Multi-day Session
See All Upcoming Sessions
Welcome to our Introduction Series for Cloudera Education. The following contains excerpts from a course that is part of our full OnDemand Training Library. The complete library is available for purchase. Upon completion you will see a path to continue your Cloudera training journey. Many courses include hands-on labs and the OnDemand library comes with 100 hands-on lab hours to practice the concepts and exercises taught. Please enjoy this section of your selected course to help you on your data journey. Disclaimer - The following descriptions and objectives are for the full course. About This Course Cloudera's OnDemand training course for CDP Private Cloud Base provides the fundamental knowledge necessary to carry out the planning, provisioning, configuration, monitoring, and management tasks required of an administrator for the Cloudera Data Platform (CDP) as a bare metal deployment or as the base for a Private Cloud deployment. Course Length This course includes over 7 hours of video content. Hands-on exercises will take approximately 9.5 hours. Audience and Prerequisites This course is best suited to systems administrators who have at least basic Linux experience. Prior knowledge of CDP, nor earlier platforms such as Cloudera’s CDH or Hortonworks HDP, is not required. Note: If you do have experience with CDH or HDP as an administrator, you might prefer Cloudera Data Platform: CDP for CDH Administrators or Cloudera Data Platform: CDP for HDP Administrators. Read more
Welcome to our Introduction Series for Cloudera Education. The following contains excerpts from a course that is part of our full OnDemand Training Library. The complete library is available for purchase. Upon completion you will see a path to continue your Cloudera training journey. Many courses include hands-on labs and the OnDemand library comes with 100 hands-on lab hours to practice the concepts and exercises taught. Please enjoy this section of your selected course to help you on your data journey. Disclaimer - The following descriptions and objectives are for the full course. About This Course Cloudera's OnDemand training course for CDP Public Cloud provides the fundamental knowledge necessary to carry out the planning, provisioning, configuration, monitoring, and management tasks required of an administrator for the Cloudera Data Platform (CDP) Public Cloud deployment. This course uses the CDP web interface extensively and also provides information about the use of the CDP Command Line Interface (CLI). Course Length This course includes 12 hours of video content. Audience and Prerequisites This course is best suited to systems administrators. Students should have experience working in a Linux environment with standard Linux system commands. Students should be able to read and execute basic Linux shell scripts and have some experience with the JSON data format. In addition, it is recommended for students to have some operational experience with cloud computing practices and exposure to big data concepts and applications. Read more
Welcome to our Introduction Series for Cloudera Education. The following contains excerpts from a course that is part of our full OnDemand Training Library. The complete library is available for purchase. Upon completion you will see a path to continue your Cloudera training journey. Many courses include hands-on labs and the OnDemand library comes with 100 hands-on lab hours to practice the concepts and exercises taught. Please enjoy this section of your selected course to help you on your data journey. Disclaimer - The following descriptions and objectives are for the full course. About This Course This course for Cloudera Data Platform (CDP) administrators teaches the skills and practices needed to configure solutions that meet the most demanding technical audit standards. The course is built around a recommended project plan for CDP administrators. The first project stage is implementation of Perimeter Security by installing host level security and Kerberos. The second project stage protects data by implementing Transport Layer Security via Auto-TLS and data encryption using Key Management System and Key Trustee Server (KMS/KTS). The third project stage controls access for users and data using Ranger and Atlas. The fourth stage teaches visibility practices for auditing of systems, users, and data usage. The final project stage analyzes applications in terms of vulnerabilities and introduces CDP practices for risk management in a fully secured Cloudera Data Platform. Course Length This module includes 6 hours of video content. In order to complete the self-paced exercises for this course, students must have access to CDP through their organization. Audience and Prerequisites This module is designed for Cloudera Data Platform (CDP) administrators. Access to a working Cloudera Data Platform is required in order to follow along with the exercises. Prerequisites for this course include CDP Admin for Private Cloud Intermediate skills with Linux, including file system command-line interface (CLI) and command line text editors (vi) Working knowledge of Transport Layer Security (TLS), Kerberos, and encryption ciphers Operational experience with administrating a Hadoop cluster Read more
Welcome to our Introduction Series for Cloudera Education. The following contains excerpts from a course that is part of our full OnDemand Training Library. The complete library is available for purchase. Upon completion you will see a path to continue your Cloudera training journey. Many courses include hands-on labs and the OnDemand library comes with 100 hands-on lab hours to practice the concepts and exercises taught. Please enjoy this section of your selected course to help you on your data journey. Disclaimer - The following descriptions and objectives are for the full course. About This Course This course provides the fundamental concepts and experience necessary to automate the ingest, flow, transformation, and egress of data using Apache NiFi. Along with gaining a grasp of the key features, concepts, and benefits of NiFi, participants will create and run NiFi dataflows for a variety of scenarios. Students will gain expertise using processors, connections, and process groups, and will use NiFi Expression Language to control the flow of data from various sources to multiple destinations. Participants will monitor dataflows, examine progress of data through a dataflow, and connect dataflows to external systems such as Kafka and HDFS. After taking this course, participants will have key knowledge and expertise for configuring and managing data ingestion, movement, and transformation scenarios for the enterprise. Course Length This module includes 4 hours of video content. Hands-on exercises will take approximately 9 hours. Audience and Prerequisites This course is designed for Developers, Data Engineers, Data Scientists, and Data Stewards. It provides a no-code, graphical approach to configuring real-time data streaming, ingestion, and management solutions for a variety of use cases. Though programming experience is not required, basic experience with Linux is presumed. Exposure to big data concepts and applications is helpful. Read more
Welcome to our Introduction Series for Cloudera Education. The following contains excerpts from a course that is part of our full OnDemand Training Library. The complete library is available for purchase. Upon completion you will see a path to continue your Cloudera training journey. Many courses include hands-on labs and the OnDemand library comes with 100 hands-on lab hours to practice the concepts and exercises taught. Please enjoy this section of your selected course to help you on your data journey. Disclaimer - The following descriptions and objectives are for the full course. About This Course Cloudera University’s Data Analyst Training course will teach you to apply traditional data analytics and business intelligence skills to big data. This course presents the tools data professionals need to access, manipulate, transform, and analyze complex data sets using SQL and familiar scripting languages. Apache Hive makes transformation and analysis of complex, multi-structured data scalable in Cloudera environments. Apache Impala enables real-time interactive analysis of the data stored in Hadoop using a native SQL environment. Together, they make multi-structured data accessible to analysts, database administrators, and others without Java programming expertise. Course Length This course includes 7 hours of video content, plus 2 hours of exercise review. Hands-on exercises will take approximately 10.5 hours. Audience and Prerequisites This course is designed for data analysts, business intelligence specialists, developers, system architects, and database administrators. Some knowledge of SQL is assumed, as is basic Linux command-line familiarity. Prior knowledge of Apache Hadoop is not required. Read more
Welcome to our Introduction Series for Cloudera Education. The following contains excerpts from a course that is part of our full OnDemand Training Library. The complete library is available for purchase. Upon completion you will see a path to continue your Cloudera training journey. Many courses include hands-on labs and the OnDemand library comes with 100 hands-on lab hours to practice the concepts and exercises taught. Please enjoy this section of your selected course to help you on your data journey. Disclaimer - The following descriptions and objectives are for the full course. About This Course This course delivers the key concepts and knowledge developers need to use Apache Spark to develop high-performance, parallel applications on the Cloudera Data Platform (CDP). Practice writing spark applications that integrate with CDP core components. Participants will learn how to use Spark SQL to query structured data, how to use Hive features to ingest and denormalize data, and how to work with big data stored in a distributed file system. Course Length This course includes approximately 9.5 hours of video lectures, demonstrations, and exercises. In order to complete the self-paced exercises for this course, students must have access to CDP through their organization. Audience and Prerequisites This course is designed for developers and data engineers. Students are expected to have basic Linux experience, and basic proficiency with either Python or Scala programming languages. Basic knowledge of SQL is helpful. Prior knowledge of Spark and Hadoop is not required. Read more
Shopping Cart
Your cart is empty