Information Technology
Star icon
Most Popular
Hands on Training icon
Hands On Training
Star icon
Hands on Training icon

Scalable Data Processing in R

Course Cover

5

(3)

compare button icon

Course Features

icon

Duration

4 hours

icon

Delivery Method

Online

icon

Available on

Limited Access

icon

Accessibility

Mobile, Desktop, Laptop

icon

Language

English

icon

Subtitles

English

icon

Level

Intermediate

icon

Teaching Type

Self Paced

icon

Video Content

4 hours

Course Description

R programmers have to deal with problems when data sets exceed the RAM. All variables are defaulty stored in memory. Learn how to extract, analyze, and process data from the disk. Split-apply - combine will be used. Also, you'll learn how to create scalable codes with the bigmemory and iotools packages. Federal Housing Finance Agency data will be used in this course. This data set is publicly available and records all mortgages that were held or securitized in the period 2009 to 2015.

Course Overview

projects-img

Virtual Labs

projects-img

International Faculty

projects-img

Case Based Learning

projects-img

Post Course Interactions

projects-img

Case Studies,Hands-On Training,Instructor-Moderated Discussions

projects-img

Case Studies, Captstone Projects

Skills You Will Gain

Prerequisites/Requirements

Writing Efficient R Code

What You Will Learn

You’ll learn tools for processing, exploring, and analyzing data directly from disk

You’ll also implement the split-apply-combine approach and learn how to write scalable code using the bigmemory and iotools packages

In this course, you'll make use of the Federal Housing Finance Agency's data

Course Instructors

Author Image

Michael Kane

Assistant Professor at Yale University

Michael Kane is an Assistant Professor at Yale University. His research is in the area of scalable statistical/machine learning and applied probability.
Author Image

Simon Urbanek

Member of the R-Core; Lead Inventive Scientist at AT&T Labs Research

Simon Urbanek is a member of the R-Core and Lead Inventive Scientist at AT T Labs Research. His research is in the areas of R, statistical computing, visualization, and interactive graphics.

Course Reviews

Average Rating Based on 3 reviews

5.0

100%

Course Cover