🏆 Take the free Top-Rated Session from TechXchange in Las Vegas and Build Your First GenAI Application the Right Way! Learn more

Offered By: LightBend

Spark Overview for Scala Analytics

The “Spark Overview for Scala Analytics” course will cover the history of Spark and how it came to be, how to build applications with Spark, establish an understanding of RDDs and DataFrames, and other advanced Spark topics. Apache Spark™ is a fast and general engine for large-scale data processing, with built-in modules for streaming, SQL, machine learning and graph processing. Having finished this class, a student would be prepared to leverage the core RDD and DataFrame APIs to perform analytics on datasets.

Continue reading

Course

Scala

1.74k+ Enrolled
4.6
(44 Reviews)

At a Glance

The “Spark Overview for Scala Analytics” course will cover the history of Spark and how it came to be, how to build applications with Spark, establish an understanding of RDDs and DataFrames, and other advanced Spark topics. Apache Spark™ is a fast and general engine for large-scale data processing, with built-in modules for streaming, SQL, machine learning and graph processing. Having finished this class, a student would be prepared to leverage the core RDD and DataFrame APIs to perform analytics on datasets.

The “Spark Overview for Scala Analytics” course will cover the history of Spark and how it came to be, how to build applications with Spark, establish an understanding of RDDs and DataFrames, and other advanced Spark topics. Apache Spark™ is a fast and general engine for large-scale data processing, with built-in modules for streaming, SQL, machine learning and graph processing. Having finished this class, a student would be prepared to leverage the core RDD and DataFrame APIs to perform analytics on datasets.

This course is meant to be an overview of Spark and its associated ecosystem.
There are 5 modules to this course.
1. What is Spark
2. Introduction to RDDs
3. Introduction to DataFrames
4. Advanced Spark Topics
5. Introduction to Spark MLlib

Requirements

1. Experience with Java (preferred), Python, or another object oriented language
2. No previous Spark knowledge is required
3. No previous experience with Data Science concepts is required. These concepts will be explained as needed


Course Staff

Jamie Allen

Jamie has worked in consulting since 1994, with top firms including Price Waterhouse and Chariot Solutions. He has a long track record of working closely with clients to build high­ quality, mission critical systems that scale to meet the needs of their businesses, and has worked in myriad industries including automotive, retail, pharmaceuticals, telecommunications and more. Jamie has been coding in Scala and actor based systems since 2009, and is the author of "Effective Akka" book from O'Reilly.


Frequently Asked Questions


What web browser should I use?

This course works best with current versions of Chrome, Firefox or Safari, or Edge.


Estimated Effort

8 Hours

Level

Beginner

Skills You Will Learn

Big Data, Scala

Language

English

Course Code

SC0103EN

Tell Your Friends!

Saved this page to your clipboard!

Have questions or need support? Chat with me 😊