Taming Big Data with MapReduce and Hadoop - Hands On!

Master MapReduce and big data analysis with hands-on examples using Python and Amazon's Elastic MapReduce. Learn key technologies quickly and effectively.

Brief Summary

This course teaches you MapReduce and Hadoop through practical examples, using Python and real cloud services. You'll learn to analyze big data in an easy, hands-on way while having fun, all guided by an experienced instructor.

Key Points

  • Learn MapReduce fast with Python and MRJob.
  • Hands-on examples to build real skills.
  • Scale data analysis using Amazon's Elastic MapReduce.
  • Understand Hadoop and related technologies such as Hive and Spark.
  • Analyze big data in minutes and have fun doing it.

Learning Outcomes

  • Master the concepts of MapReduce.
  • Run your own MapReduce jobs using Python and MRJob.
  • Scale data analysis using Amazon's Elastic MapReduce.
  • Understand how Hadoop manages data across clusters.
  • Explore other Hadoop technologies like Hive, Pig, and Spark.

About This Course

Learn MapReduce fast by building over 10 real examples, using Python, MRJob, and Amazon's Elastic MapReduce Service.

"Big data" analysis is a hot and highly valuable skill – and this course will quickly teach you two technologies fundamental to big data: MapReduce and Hadoop. Ever wonder how Google manages to analyze the entire Internet on a continual basis? You'll learn those same techniques, using your own Windows system right at home.

Through more than 10 hands-on examples, you'll learn to frame data analysis problems as MapReduce problems, and then scale them up to run on cloud computing services. You'll be learning from a former engineer and senior manager at Amazon and IMDb.

  • Learn the concepts of MapReduce

  • Run MapReduce jobs quickly using Python and MRJob

  • Translate complex analysis problems into multi-stage MapReduce jobs

  • Scale up to larger data sets using Amazon's Elastic MapReduce service

  • Understand how Hadoop distributes MapReduce across computing clusters

  • Learn about other Hadoop technologies, like Hive, Pig, and Spark

By the end of this course, you'll be running code that analyzes gigabytes worth of information – in the cloud – in a matter of minutes.

We'll have some fun along the way. You'll get warmed up with some simple examples of using MapReduce to analyze movie ratings data and text in a book. Once you've got the basics under your belt, we'll move to some more complex and interesting tasks. We'll use a million movie ratings to find movies that are similar to each other, and you may even discover some new movies you like in the process! We'll analyze a social graph of superheroes, and learn who the most "popular" superhero is – and develop a system to find "degrees of separation" between superheroes. Are all Marvel superheroes within a few degrees of being connected to The Incredible Hulk? You'll find the answer.
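To give a feel for the kind of warm-up described above, here is a minimal sketch of counting how often each rating value appears in a movie ratings file. It is not the course's own code and it does not use MRJob: it imitates the map, shuffle, and reduce phases in plain Python so it runs on any system with no extra packages. The record layout (tab-separated user ID, movie ID, rating, timestamp), the sample rows, and the names `mapper`, `reducer`, and `run_job` are all assumptions made up for illustration.

```python
from collections import defaultdict


def mapper(line):
    """Map phase: emit (rating, 1) for one tab-separated ratings record.

    Assumed layout (hypothetical): user_id, movie_id, rating, timestamp.
    """
    user_id, movie_id, rating, timestamp = line.split("\t")
    yield rating, 1


def reducer(rating, counts):
    """Reduce phase: sum the counts collected for one rating value."""
    yield rating, sum(counts)


def run_job(lines):
    """Imitate map, shuffle, and reduce in a single local process."""
    # Shuffle: group every value emitted by the mappers under its key.
    grouped = defaultdict(list)
    for line in lines:
        for key, value in mapper(line):
            grouped[key].append(value)

    # Reduce: call the reducer once per key, in sorted key order.
    results = {}
    for key in sorted(grouped):
        for out_key, out_value in reducer(key, grouped[key]):
            results[out_key] = out_value
    return results


# Made-up sample records, only here to exercise the sketch.
SAMPLE = [
    "1\t10\t3\t1000",
    "1\t11\t5\t1001",
    "2\t10\t3\t1002",
    "2\t12\t4\t1003",
    "3\t11\t5\t1004",
    "3\t12\t3\t1005",
]

if __name__ == "__main__":
    for rating, count in run_job(SAMPLE).items():
        print(rating, count)
```

On a real cluster the framework handles the grouping step and spreads the mapper and reducer calls across machines; the point of the sketch is only that the problem reduces to choosing a key (the rating) and a value (a count of 1).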

This course is very hands-on; you'll spend most of your time following along with the instructor as we write, analyze, and run real code together – both on your own system, and in the cloud using Amazon's Elastic MapReduce service. Over 5 hours of video content is included, with over 10 real examples of increasing complexity you can build, run and study yourself. Move through them at your own pace, on your own schedule. The course wraps up with an overview of other Hadoop-based technologies, including Hive, Pig, and the very hot Spark framework – complete with a working example in Spark.

Don't take my word for it - check out some of our unsolicited reviews from real students:

"I have gone through many courses on map reduce; this is undoubtedly the best, way at the top."

"This is one of the best courses I have ever seen since 4 years passed I am using Udemy for courses."

"The best hands on course on MapReduce and Python. I really like the run it yourself approach in this course. Everything is well organized, and the lecturer is top notch."

  • Understand how MapReduce can be used to analyze big data sets

  • Write your own MapReduce jobs using Python and MRJob

  • Run MapReduce jobs on Hadoop clusters using Amazon Elastic MapReduce

Course Curriculum

1 Lecture

Instructors

Sundog Education by Frank Kane

Sundog Education's mission is to make highly valuable career skills in data engineering, data science, generative AI, AWS, and machine learning accessible to everyone in the world. Our consortium of expert instructors shares our knowledge in these emerging fields with you, at prices anyone can afford. Sundog Education is led by Frank Kane and owned by Frank's company, Sundog Software...

Frank Kane

Frank spent 9 years at Amazon and IMDb, developing and managing the technology that automatically delivers product and movie recommendations to hundreds of millions of customers, all the time. As an Amazon “bar raiser,” he held veto authority over hiring decisions across the company, interviewed over 1,000 candidates, and hired and managed hundreds. He holds 17 issued patents in the...

Sundog Education Team

Our mission is to make highly valuable skills in machine learning, big data, AI, and data science accessible at prices anyone in the world can afford. Our current online courses have reached over 500,000 students worldwide. Sundog Education CEO, Frank Kane, spent 9 years at Amazon and IMDb, developing and managing the technology that automatically delivers product and movie recommendations...

Review
4.9 course rating
4K ratings
Sarvesh S.
4.5
1 year ago

It was a decent course for gaining knowledge of MapReduce and some introduction on Hadoop.

Marta S.
5.0
1 year ago

Valuable knowledge.

Kapil V.
3.0
1 year ago

good course

Harsh P.
5.0
1 year ago

Great course as usual from Frank. He is really the genius in this field. His explanation of each topics along with proper examples makes the concept look more easy. Can't Thank enough. Keep doing this great job and makes course on Hive, Pig and other Big data Technologies as a whole complete course. Thanks Again!

Kenny R.
4.0
2 years ago

All good. mrjob does a lot of the magic though. I wish we'd touched a bit of hadoop itself without the abstraction

Anthony D. K.
5.0
5 years ago

This course would be better if it had some exercises that require to be solved and verified on your side for graduation. You really learn the material when your solving problems.

Radhakrishnan I.
5.0
5 years ago

Cover examples really well.
Makes sure that the student takes in mulitple examples.
all the slides and every other material provided is to the mark.

William C.
4.5
5 years ago

Very good. The instructor is engaging and very clear. The only negative comment is when he is introducing basic concepts he gets very repetitive, repeating the exact same statements often 3 times (or maybe more!). If we don't get something, we can just replay the video.

Anonymized U.
3.5
5 years ago

The instructor was awesome! I would prefer to learn more about mapreduce and mrjob than passing very quickly through other technologies

Francis L.
5.0
5 years ago

Amazing course...my interest on spark and Hive is greatly increase. Looking forward to do more courses on Hive and spark..thanks.
