Information Technology
Hands on Training icon
Hands On Training
Hands on Training icon

Introduction to PySpark

Course Cover

5

(3)

compare button icon

Course Features

icon

Duration

4 hours

icon

Delivery Method

Online

icon

Available on

Limited Access

icon

Accessibility

Mobile, Desktop, Laptop

icon

Language

English

icon

Subtitles

English

icon

Level

Beginner

icon

Teaching Type

Self Paced

icon

Video Content

4 hours

Course Description

This course will show you how to use Spark with Python. Spark allows you to perform parallel computations using large data sets. It is easy to integrate into Python. PySpark, the Python package that makes all of this magic possible, is responsible. This package allows you to access data on flights between Portland, Washington and Seattle. This package will show you how to manage the data and build a machine-learning pipeline that predicts if flights will be delayed. To get into high-performance machine learning, you can spark your Python code!

Course Overview

projects-img

Virtual Labs

projects-img

International Faculty

projects-img

Post Course Interactions

projects-img

Hands-On Training,Instructor-Moderated Discussions

Skills You Will Gain

Prerequisites/Requirements

Introduction to Python

What You Will Learn

Learn to implement distributed data management and machine learning in Spark using the PySpark package

In this course, you'll learn how to use Spark from Python!

You'll use this package to work with data about flights from Portland and Seattle

You'll learn to wrangle this data and build a whole machine learning pipeline to predict whether or not flights will be delayed

Course Instructors

Author Image

Nick Solomon

Data Scientist

Nick has a degree in mathematics with a concentration in statistics from Reed College. He's worked on many data science projects in the past, doing everything from mapping crime data to developing ne...
Author Image

Lore Dirick

Director of Data Science Education at Flatiron School

Lore is a data scientist with expertise in applied finance. She obtained her PhD in Business Economics and Statistics at KU Leuven, Belgium. During her PhD, she collaborated with several banks workin...

Course Reviews

Average Rating Based on 3 reviews

5.0

100%

Course Cover