Information Technology
Hands on Training icon
Hands On Training
Hands on Training icon

Feature Engineering with PySpark

Course Cover

5

(3)

compare button icon

Course Features

icon

Duration

4 hours

icon

Delivery Method

Online

icon

Available on

Limited Access

icon

Accessibility

Mobile, Desktop, Laptop

icon

Language

English

icon

Subtitles

English

icon

Level

Intermediate

icon

Teaching Type

Self Paced

icon

Video Content

4 hours

Course Description

Your job is to find the meaning in chaos. Careful curation is required to create toys datasets like MTCars and Iris. The data must be transformed in order to make them useful for machine-learning algorithms that can predict, extract, classify, cluster, etc. This course will cover the details that data scientists spend between 70 and 80% of their time dealing, such as feature engineering and data wrangling. Let's use PySpark Big Data to reduce these datasets which are becoming increasingly complex.

Course Overview

projects-img

Virtual Labs

projects-img

International Faculty

projects-img

Post Course Interactions

projects-img

Hands-On Training,Instructor-Moderated Discussions

Skills You Will Gain

Prerequisites/Requirements

Supervised Learning with scikit-learn

Introduction to PySpark

What You Will Learn

This course will cover the gritty details that data scientists are spending 70-80% of their time on; data wrangling and feature engineering

In this chapter learn to remove unneeded information, handle missing values and add additional data to your analysis

In this chapter learn how to create new features for your machine learning model to learn from

In this chapter we'll learn how to choose which type of model we want

Course Instructors

Author Image

John Hogue

Lead Data Scientist, General Mills

I have a strong drive for innovation and giving back. Through my work I enjoy building out a career path and center of excellence for those in data science at General Mills. I have a passion for taki...

Course Reviews

Average Rating Based on 3 reviews

5.0

100%

Course Cover