Artificial Intelligence & Data Science
Star icon
Most Popular
Hands on Training icon
Hands On Training
Star icon
Hands on Training icon

Apache Spark Fundamentals

Course Cover
compare button icon

Course Features

icon

Duration

255 minutes

icon

Delivery Method

Online

icon

Available on

Downloadable Courses

icon

Accessibility

Mobile, Desktop, Laptop

icon

Language

English

icon

Subtitles

English

icon

Level

Intermediate

icon

Teaching Type

Self Paced

icon

Video Content

255 minutes

Course Description

The ever-connected world we live in is creating more data than Moore's Law can keep up with. This means that we need to make better decisions about how to analyze the data. We used Hadoop's MapReduce framework to batch process, but the modern demands for big data processing have outgrown that framework. Apache Spark is the answer. It boasts speeds up to 10-100x faster that Hadoop and holds the world record for large-scale sorting. Spark's general abstraction allows it to do more than batch processing. It can also perform iterative algorithms at lightning speed and streaming semantics at once. This course will teach you Spark from the beginning. You'll start with its history and create a Wikipedia analysis app to help you learn a large portion of its core API. This core knowledge will allow you to explore Spark's other libraries such as the streaming or SQL APIs. You'll also learn how to avoid some common pitfalls in Spark. You will leave this course with a tool belt capable of creating your own performance-maximized Spark application.

Course Overview

projects-img

International Faculty

projects-img

Post Course Interactions

projects-img

Instructor-Moderated Discussions

projects-img

Case Studies, Captstone Projects

Skills You Will Gain

What You Will Learn

You'll learn Spark from the ground up, starting with its history before creating a Wikipedia analysis application as one of the means for learning a wide scope of its core API

That core knowledge will make it easier to look into Spark's other libraries, such as the streaming and SQL APIs

Finally, you'll learn how to avoid a few commonly encountered rough edges of Spark

You will leave this course with a tool belt capable of creating your own performance-maximized Spark application

Course Instructors

Author Image

Justin Pihony

Instructor

Justin is a software journeyman, continuously learning and honing his skills. Most of his early professional career was spent in C# and MSSQL, but he loves learning about many different languages, es...
Course Cover