This four-day workshop covers enterprise data science and machine learning using Apache Spark in Cloudera Data Science Workbench (CDSW). Participants use Spark SQL to load, explore, cleanse, join, and analyze data, and Spark MLlib to specify, train, evaluate, tune, and deploy machine learning pipelines. They study the foundations of the Spark architecture and execution model needed to configure, monitor, and tune their Spark applications effectively. Participants also learn how Spark integrates with key components of the Cloudera platform, such as HDFS, YARN, Hive, Impala, and Hue, as well as with their favorite Python or R packages.
[DATE: Month START - END, YEAR]
Virtual Classroom, [APAC, EMEA, AMER]
9:00 - 17:00 (TIMEZONE)
Upon completing or passing this course, you will receive the Cloudera ILT Course Completion Certificate.