Hadoop Programming

Ever wonder how to program a pig and an elephant to work together? Start with dedication, a couple of tricks up your sleeve, and instructions that the beasts understand. This learning path addresses these programming requirements by filtering and sorting what you need to know and how you need to convey your message.

About this learning path

They say that it is the journey, not the destination, that matters. By the time you complete this learning path, concepts such as how parallelism is performed on a cluster will be second nature. Learning to program in both high-level and low-level languages will be a highlight along the way. This learning path incorporates MapReduce and YARN, an introduction to Apache Pig, and simplifying data pipelines with Apache Kafka.

Come along and start your journey to receiving the following badges: Hadoop Programming – Level 1 and Hadoop Programming – Level 2.


MapReduce and YARN

Effort: 5 hours
Level: Intermediate
Available in: English

About the course

String together your understanding of Yet Another Resource Negotiator (YARN) by gaining exposure to MapReduce1, the tool-sets that start the processing of Big Data.

Apache Pig 101

Effort: 5 hours
Level: Beginner
Available in: English

About the course

What did the third little pig do with his time after the big bad wolf left the brick house deflated? We know. Instead of building, he started writing mapper and reducer programs so that you wouldn't have to. Let the Pig help you outsmart the next big bad wolf!
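
For a sense of what those hand-written mapper and reducer programs involve, here is a minimal word-count sketch in plain Python. It is an illustration only, not Hadoop or Pig code: it runs in memory on one machine, and the names mapper, shuffle, reducer, and word_count are hypothetical helpers chosen for this example, with shuffle standing in for the grouping step that the framework normally performs between the two phases.

```python
from collections import defaultdict


def mapper(line):
    """Map step: emit a (word, 1) pair for every word in one input line."""
    return [(word.lower(), 1) for word in line.split()]


def shuffle(pairs):
    """Group values by key, standing in for the framework's shuffle-and-sort."""
    groups = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return dict(sorted(groups.items()))


def reducer(word, counts):
    """Reduce step: sum the counts collected for one word."""
    return word, sum(counts)


def word_count(lines):
    """Run map, shuffle, and reduce in sequence over a list of lines."""
    mapped = [pair for line in lines for pair in mapper(line)]
    grouped = shuffle(mapped)
    return dict(reducer(word, counts) for word, counts in grouped.items())


if __name__ == "__main__":
    print(word_count(["the pig and the elephant", "the wolf"]))
```

On a real cluster, the map and reduce steps run in parallel across many machines. Pig Latin expresses the same kind of job in a handful of statements such as LOAD, GROUP, and FOREACH ... GENERATE, and Pig turns them into MapReduce jobs for you.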

Complete the Hadoop Programming learning path

Our learning paths are designed so that each course builds on the concepts covered in the courses before it. We recommend completing the courses in the order outlined in this learning path to get the most out of your investment of time. If you like what you see here, come and discover other learning paths and browse our course catalog.