
Learning PySpark
Where to Watch Learning PySpark
49.
Repartitioning Data
2018-02-26
In this video, we will learn how to repartition the data.
Watch Learning PySpark Season 1 Episode 49 Now
48.
Pitfalls of UDFs
2018-02-26
In this video, we will discuss the pitfalls of using pure Python user defined functions.
Watch Learning PySpark Season 1 Episode 48 Now
45.
Presenting Data
2018-02-26
In this video, we will learn how to present data.
Watch Learning PySpark Season 1 Episode 45 Now
44.
Transforming Data
2018-02-26
In this video, we will learn how to transform data.
Watch Learning PySpark Season 1 Episode 44 Now
43.
Selecting Data
2018-02-26
In this video, we will learn how to select data from a DataFrame.
Watch Learning PySpark Season 1 Episode 43 Now
42.
Aggregating Data
2018-02-26
In this video, we will learn how to aggregate data.
Watch Learning PySpark Season 1 Episode 42 Now
41.
Filtering Data
2018-02-26
In this video, we will learn how to filter data.
Watch Learning PySpark Season 1 Episode 41 Now
40.
Schema Changes
2018-02-26
In this video, we will learn how to drop, rename, and handle missing observations.
Watch Learning PySpark Season 1 Episode 40 Now
39.
The .distinct(.) Transformation
2018-02-26
In this video, we will how to retrieve distinct values from a DataFrame.
Watch Learning PySpark Season 1 Episode 39 Now
38.
Performing Statistical Transformations
2018-02-26
In this video, we will learn how to calculate descriptive statistics in DataFrames.
Watch Learning PySpark Season 1 Episode 38 Now
37.
Joining Two DataFrames
2018-02-26
In this video, we will learn how to join two DataFrames.
Watch Learning PySpark Season 1 Episode 37 Now
36.
Creating Temporary Tables
2018-02-26
In this video, we will learn how to create temporary views over a DataFrame.
Watch Learning PySpark Season 1 Episode 36 Now
35.
The .sql(...) Transformation
2018-02-26
In this video, we will learn how to use the .sql(...) transformation to interact with the data in a DataFrame.
Watch Learning PySpark Season 1 Episode 35 Now
33.
Interacting with DataFrames
2018-02-26
In this video, we will discuss different ways of interacting with DataFrames.
Watch Learning PySpark Season 1 Episode 33 Now
32.
Specifying Schema of a DataFrame
2018-02-26
In this video, we will learn how to specify schema of a DataFrame.
Watch Learning PySpark Season 1 Episode 32 Now
31.
Creating DataFrames
2018-02-26
In this video, we will learn how to create DataFrames.
Watch Learning PySpark Season 1 Episode 31 Now
30.
Introduction
2018-02-26
In this video, we will provide a brief introduction to Spark DataFrames.
Watch Learning PySpark Season 1 Episode 30 Now
29.
Introducing Actions - Descriptive Statistics
2018-02-26
In this video, we will explore some basic descriptive statistics.
Watch Learning PySpark Season 1 Episode 29 Now
28.
Introducing Actions - Saving Data
2018-02-26
In this video, we will explore how to save data from an RDD.
Watch Learning PySpark Season 1 Episode 28 Now
27.
Introducing Actions - .sortBy(...)
2018-02-26
In this video, we will learn how to sort data within an RDD.
Watch Learning PySpark Season 1 Episode 27 Now
26.
Introducing Actions - .histogram(.)
2018-02-26
In this video, we will learn how to bin data into buckets.
Watch Learning PySpark Season 1 Episode 26 Now
25.
Introducing Actions - .combineByKey(...)
2018-02-26
In this video, we will learn about the most flexible data reduction action.
Watch Learning PySpark Season 1 Episode 25 Now
24.
Introducing Actions - .coalesce(.)
2018-02-26
In this video, we will learn when and why to use the .coalesce(...) method instead of the .repartition(...).
Watch Learning PySpark Season 1 Episode 24 Now
23.
Introducing Actions - .aggregate(...) and .aggregateByKey(...)
2018-02-26
In this video, we will explore how to aggregate the data within each partition first before collecting the results on the driver for the final aggregation.
Watch Learning PySpark Season 1 Episode 23 Now
22.
Introducing Actions - .foreach(.)
2018-02-26
In this video, we will learn how to execute an action on each element of an RDD in each of its partitions.
Watch Learning PySpark Season 1 Episode 22 Now
21.
Introducing Actions - .count()
2018-02-26
In this video, we will learn how to count the number of records in an RDD.
Watch Learning PySpark Season 1 Episode 21 Now
20.
Introducing Actions - .reduce(.) and .reduceByKey(.)
2018-02-26
In this video, we will learn another fundamental method from the Map-Reduce paradigm - the .reduce(...
Watch Learning PySpark Season 1 Episode 20 Now
19.
Introducing Actions - .collect(...)
2018-02-26
In this video, we will learn when to use the .collect(...) action and when to avoid it.
Watch Learning PySpark Season 1 Episode 19 Now
18.
Introducing Actions - .take(...)
2018-02-26
In this video, we will focus on one of the most fundamental tools any data scientist can use: the .take(...
Watch Learning PySpark Season 1 Episode 18 Now
17.
Introducing Transformations - .repartition(...)
2018-02-26
In this video, we will explore how to effectively use repartitioning.
Watch Learning PySpark Season 1 Episode 17 Now
16.
Introducing Transformations - .join(...)
2018-02-26
In this video, we will learn how to join two RDDs.
Watch Learning PySpark Season 1 Episode 16 Now
15.
Introducing Transformations - .sample(...)
2018-02-26
In this video, we will learn how to sample data from RDDs.
Watch Learning PySpark Season 1 Episode 15 Now
14.
Introducing Transformations - .distinct(...)
2018-02-26
In this video, we will explore what the .distinct(...) transformation does.
Watch Learning PySpark Season 1 Episode 14 Now
13.
Introducing Transformations - .flatMap(...)
2018-02-26
In this video, we will explain the difference between .flatMap(...) and .map(...) transformations and we will learn to use it to filter malformed records.
Watch Learning PySpark Season 1 Episode 13 Now
12.
Introducing Transformations - .filter(.)
2018-02-26
In this video, we will learn how to filter data from RDDs.
Watch Learning PySpark Season 1 Episode 12 Now
11.
Introducing Transformations - .map(...)
2018-02-26
In this video, we will introduce lambdas and the .map(...) transformation.
Watch Learning PySpark Season 1 Episode 11 Now
10.
Understanding Lazy Execution
2018-02-26
Spark is lazy to process data. In this video we will learn why this is an advantage.
Watch Learning PySpark Season 1 Episode 10 Now
9.
Schema of an RDD
2018-02-26
In this video, we explore the advantages and disadvantages of RDD's lack of schema.
Watch Learning PySpark Season 1 Episode 9 Now
8.
Creating RDDs
2018-02-26
In this video, we will learn how to create RDDs in many different ways.
Watch Learning PySpark Season 1 Episode 8 Now
7.
Brief Introduction to RDDs
2018-02-26
In this video, we will provide a brief overview of one of the fundamental data structures of Spark - the RDDs.
Watch Learning PySpark Season 1 Episode 7 Now
6.
Cloning GitHub Repository
2018-02-26
The aim of this video is to clone the GitHub repository for the course. Doing this will set everything we need for the following videos.
Watch Learning PySpark Season 1 Episode 6 Now
5.
Newest Capabilities of PySpark 2.0+
2018-02-26
The aim of this video is to briefly review the newest features of Spark 2.0+.
Watch Learning PySpark Season 1 Episode 5 Now
4.
Spark Execution Process
2018-02-26
The aim of this video is to briefly review the execution process.
Watch Learning PySpark Season 1 Episode 4 Now
3.
Apache Spark Stack
2018-02-26
The aim of this video is to provide a brief overview of Apache Spark stack components.
Watch Learning PySpark Season 1 Episode 3 Now
2.
Brief Introduction to Spark
2018-02-26
The aim of the video is to explain Spark and its Python interface.
Watch Learning PySpark Season 1 Episode 2 Now
1.
The Course Overview
2018-02-26
This video gives an overview of the entire course.
Watch Learning PySpark Season 1 Episode 1 Now

Learning PySpark is a series categorized as a new series. Spanning 1 seasons with a total of 46 episodes, the show debuted on 2018. The series has earned a no reviews from both critics and viewers. The IMDb score stands at undefined.
How to Watch Learning PySpark
How can I watch Learning PySpark online? Learning PySpark is available on Packt Publishing with seasons and full episodes. You can also watch Learning PySpark on demand at Amazon online.
Genres
Channel
Packt Publishing
Stream shows like Learning PySpark
Top TV Shows
Also on Packt Publishing