Intro to Hadoop and MapReduce

Course Cover
compare button icon

Course Features

icon

Duration

1 month

icon

Delivery Method

Online

icon

Available on

Limited Access

icon

Accessibility

Desktop, Laptop

icon

Language

English

icon

Subtitles

English

icon

Level

Intermediate

icon

Teaching Type

Self Paced

Course Description

The Apachea Hadoop project develops open source software for reliable scalable distributed computing Learn the fundamental principles behind it and how you can use its power to make sense of your Big Data

Course Overview

projects-img

International Faculty

projects-img

Post Course Interactions

projects-img

Hands-On Training,Instructor-Moderated Discussions

Skills You Will Gain

Prerequisites/Requirements

Lesson 1 does not have technical prerequisites and is a good overview of Hadoop and MapReduce for managersTo get the most out of the class, however, you need basic programming skills in Python on a level provided by introductory courses like our Introduct

What You Will Learn

Big DataWhat is Big Data?The problems big data createsHow Apache Hadoop addresses these problems

HDFS and MapReduceDiscover how HDFS distributes data over multiple computersLearn how MapReduce enables analyzing datasets in parallel across multiple machines

MapReduce codeWrite your own MapReduce code

MapReduce Design PatternsUse common patterns for MapReduce programs to analyze Udacity forum data

Course Instructors

Sarah Sproehnle

Instructor

Instructor

Ian Wrigley

Instructor

Instructor

Gundega Dekena

Instructor

Instructor
Course Cover