Offered By: LightBend
Spark Overview for Scala Analytics
The “Spark Overview for Scala Analytics” course will cover the history of Spark and how it came to be, how to build applications with Spark, establish an understanding of RDDs and DataFrames, and other advanced Spark topics. Apache Spark™ is a fast and general engine for large-scale data processing, with built-in modules for streaming, SQL, machine learning and graph processing. Having finished this class, a student would be prepared to leverage the core RDD and DataFrame APIs to perform analytics on datasets.
Continue readingCourse
Scala
1.73k+ EnrolledAt a Glance
The “Spark Overview for Scala Analytics” course will cover the history of Spark and how it came to be, how to build applications with Spark, establish an understanding of RDDs and DataFrames, and other advanced Spark topics. Apache Spark™ is a fast and general engine for large-scale data processing, with built-in modules for streaming, SQL, machine learning and graph processing. Having finished this class, a student would be prepared to leverage the core RDD and DataFrame APIs to perform analytics on datasets.
There are 5 modules to this course.
1. What is Spark
2. Introduction to RDDs
3. Introduction to DataFrames
4. Advanced Spark Topics
5. Introduction to Spark MLlib
Requirements
2. No previous Spark knowledge is required
3. No previous experience with Data Science concepts is required. These concepts will be explained as needed
Course Staff
Jamie Allen
Frequently Asked Questions
What web browser should I use?
Estimated Effort
8 Hours
Level
Beginner
Skills You Will Learn
Big Data, Scala
Language
English
Course Code
SC0103EN