Cognitive Class

Machine learning with Apache SystemML

Apache SystemML is a declarative-style language designed for large-scale machine learning. It automatically generates optimized runtime plans, ranging from single-node, in-memory computations to distributed computations on Apache Hadoop and Apache Spark. SystemML algorithms are expressed in an R-like or Python-like syntax that includes linear algebra primitives, statistical functions, and ML-specific constructs.

About This Course

Whether you are a data scientist, an engineer, or simply someone interested in machine learning, SystemML can increase your productivity: you keep the flexibility to express custom analytics without worrying about the underlying optimization engine, because SystemML handles scalability and optimization automatically. This course explains how the optimizer works and provides hands-on examples of ML algorithms and how to run them.
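
To give a feel for what "expressing an algorithm with linear algebra primitives" means, here is a minimal sketch of least-squares linear regression written as a handful of matrix operations. It is plain Python, not DML, and it does not use SystemML or any of its APIs; the helper names (transpose, matmul, solve, linear_regression) and the toy data are assumptions made up for this illustration.

```python
# Plain-Python illustration only: this is NOT DML and does not call SystemML.
# All function names and the toy data below are hypothetical.

def transpose(m):
    """Return the transpose of a matrix stored as a list of rows."""
    return [list(col) for col in zip(*m)]

def matmul(a, b):
    """Multiply two matrices stored as lists of rows."""
    bt = transpose(b)
    return [[sum(x * y for x, y in zip(row, col)) for col in bt] for row in a]

def solve(a, b):
    """Solve a @ x = b (b is an n x 1 column) by Gauss-Jordan elimination."""
    n = len(a)
    # Augmented matrix [a | b].
    aug = [list(a[i]) + [b[i][0]] for i in range(n)]
    for col in range(n):
        # Partial pivoting: move the row with the largest entry into place.
        pivot = max(range(col, n), key=lambda r: abs(aug[r][col]))
        aug[col], aug[pivot] = aug[pivot], aug[col]
        # Eliminate this column from every other row.
        for r in range(n):
            if r != col:
                factor = aug[r][col] / aug[col][col]
                aug[r] = [v - factor * p for v, p in zip(aug[r], aug[col])]
    return [[aug[i][n] / aug[i][i]] for i in range(n)]

def linear_regression(X, y):
    """Least-squares weights from the normal equations (X^T X) w = X^T y."""
    Xt = transpose(X)
    return solve(matmul(Xt, X), matmul(Xt, y))

# Toy data generated from y = 1 + 2x; the first column of X is the intercept.
X = [[1.0, 0.0], [1.0, 1.0], [1.0, 2.0], [1.0, 3.0]]
y = [[1.0], [3.0], [5.0], [7.0]]
weights = linear_regression(X, y)
print(weights)
```

On this toy data the fitted weights come out close to an intercept of 1 and a slope of 2. The point of the sketch is the shape of linear_regression: the algorithm is stated as a few matrix operations, which is the level at which the description above says SystemML algorithms are written, with execution planning left to the system.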

Course Syllabus

  • Module 1 - What is SystemML?
  1. Explain the purpose and the origin of SystemML
  2. List the alternatives to SystemML
  3. Compare the performance of SystemML with that of the alternatives
  • Module 2 - SystemML and the Spark MLContext
  1. Use MLContext to interact with SystemML (in Scala)
  • Module 3 - SystemML Algorithms
  1. Describe and use a number of SystemML algorithms
  • Module 4 - The DML Language
  1. Explain the purpose of DML
  2. Describe the DML language
  3. List some of the built-in functions
  • Module 5 - The SystemML Optimizer
  1. Describe the optimizer stack
  2. Explain how SystemML knows when it is better to run on one machine
  3. Explain why SystemML is so much faster than single-node R

General Information

  • This course is free.
  • It is self-paced.
  • It can be taken at any time.
  • It can be audited as many times as you wish.

Recommended skills prior to taking this course

Requirements

  • None

Course Staff

Henry L. Quach

Henry L. Quach is the Technical Curriculum Developer Lead for Big Data. He has been with IBM for 9 years, focusing on education development. Henry likes to dabble in a number of things; among them, he was part of the original team that developed and designed the concept for the IBM Open Badges program. He has a Bachelor of Science in Computer Science and a Master of Science in Software Engineering from San Jose State University.
