Skip to content
  • Home
  • About
  • Write For US
|       { One stop for all Spark Examples }
Spark By {Examples}
Sign-in
  • Spark
    • Spark RDD
    • Spark DataFrame
    • Spark SQL Functions
    • What’s New in Spark 3.0?
    • Spark Streaming
    • Apache Spark Interview Questions
  • PySpark
  • Pandas
  • R
    • R Programming
    • R Data Frame
    • R dplyr Tutorial
    • R Data Frame
    • R Vector
    • R dplyr Tutorial
  • Snowflake
  • Hive
  • Int Q
    • Spark Interview Questions
    • MongoDB Interview Questions
    • Machine Learning Interview Questions
  • More
    • Python
    • MongoDB
    • Apache Kafka
    • H2O.ai
    • Apache Hadoop
    • NumPy
    • Apache HBase
    • Apache Cassandra
    • H2O Sparkling Water
    • Scala Language
Menu Close
  • Spark
    • Spark RDD
    • Spark DataFrame
    • Spark SQL Functions
    • What’s New in Spark 3.0?
    • Spark Streaming
    • Apache Spark Interview Questions
  • PySpark
  • Pandas
  • R
    • R Programming
    • R Data Frame
    • R dplyr Tutorial
    • R Data Frame
    • R Vector
    • R dplyr Tutorial
  • Snowflake
  • Hive
  • Int Q
    • Spark Interview Questions
    • MongoDB Interview Questions
    • Machine Learning Interview Questions
  • More
    • Python
    • MongoDB
    • Apache Kafka
    • H2O.ai
    • Apache Hadoop
    • NumPy
    • Apache HBase
    • Apache Cassandra
    • H2O Sparkling Water
    • Scala Language
  • Home
  • About
  • Write For US
Read more about the article Spark Load CSV File into RDD
Apache Spark / Apache Spark RDD

Spark Load CSV File into RDD

In this tutorial, I will explain how to load a CSV file into Spark RDD using a Scala example. Using the textFile() the method in SparkContext class we can read…

6 Comments
November 24, 2019
Read more about the article Spark – Read multiple text files into single RDD?
Apache Spark / Apache Spark RDD

Spark – Read multiple text files into single RDD?

Spark core provides textFile() & wholeTextFiles() methods in SparkContext class which is used to read single and multiple text or csv files into a single Spark RDD. Using this method…

13 Comments
November 23, 2019
Read more about the article Spark – How to create an empty RDD?
Apache Spark / Apache Spark RDD

Spark – How to create an empty RDD?

We often need to create empty RDD in Spark, and empty RDD can be created in several ways, for example, with partition, without partition, and with pair RDD. In this…

3 Comments
November 21, 2019
Read more about the article Convert Spark RDD to DataFrame | Dataset
Apache Spark / Apache Spark RDD

Convert Spark RDD to DataFrame | Dataset

While working in Apache Spark with Scala, we often need to Convert Spark RDD to DataFrame and Dataset as these provide more advantages over RDD. For instance, DataFrame is a…

4 Comments
August 22, 2019
Read more about the article Different ways to create Spark RDD
Photo by Deva Darshan on Unsplash
Apache Spark / Apache Spark RDD

Different ways to create Spark RDD

Spark RDD can be created in several ways using Scala & Pyspark languages, for example, It can be created by using sparkContext.parallelize(), from text file, from another RDD, DataFrame, and…

2 Comments
February 4, 2019
Read more about the article Create a Spark RDD using Parallelize
Apache Spark / Apache Spark RDD

Create a Spark RDD using Parallelize

Let's see how to create Spark RDD using parallelize with sparkContext.parallelize() method and using Spark shell and Scala example. Before we start let me explain what is RDD, Resilient Distributed…

27 Comments
December 4, 2018
  • Go to the previous page
  • 1
  • 2
  • 3

Spark RDD Tutorial

  • Spark RDD – Parallelize
  • Spark RDD – Read text file
  • Spark RDD – Read CSV
  • Spark RDD – Create RDD
  • Spark RDD – Actions
  • Spark RDD – Pair Functions
  • Spark RDD – Repartition and Coalesce
  • Spark RDD – Shuffle Partitions
  • Spark RDD – Cache vs Persist
  • Spark RDD – Persistance Storage Levels
  • Spark RDD – Broadcast Variables
  • Spark RDD – Accumulator Variables
  • Spark RDD – Convert RDD to DataFrame

Spark RDD Transformation & Actions

  • Spark RDD – filter()
  • Spark RDD – map()
  • Spark RDD – flatMap()
  • Spark RDD – fold()
  • Spark RDD – aggregate()
  • Spark RDD – reduce()
  • Spark RDD – reduceByKey()
  • Spark RDD – sortByKey()

Spark SQL Functions

  • Spark SQL String Functions
  • Spark SQL Date and Timestamp Functions
  • Spark SQL Array Functions
  • Spark SQL Map Functions
  • Spark SQL Sort Functions
  • Spark SQL Aggregate Functions
  • Spark SQL Window Functions
  • Spark SQL JSON Functions

Top Tutorials

  • Apache Spark Tutorial
  • PySpark Tutorial
  • Python Pandas Tutorial
  • R Programming Tutorial
  • Python NumPy Tutorial
  • Apache Hive Tutorial
  • Apache HBase Tutorial
  • Apache Cassandra Tutorial
  • Apache Kafka Tutorial
  • Snowflake Data Warehouse Tutorial
  • H2O Sparkling Water Tutorial

Categories

  • Apache Spark
  • PySpark
  • Pandas
  • R Programming
  • Snowflake Database
  • NumPy
  • Apache Hive
  • Apache HBase
  • Apache Kafka
  • Apache Cassandra
  • H2O Sparkling Water

Legal

  • Privacy Policy

About SparkByExamples.com

SparkByExamples.com is a Big Data and Spark examples community page, all examples are simple and easy to understand, and well tested in our development environment Read more ..
  • Opens in a new tab
  • Opens in a new tab
  • Opens in a new tab
  • Opens in a new tab
  • Opens in a new tab
[email protected]
+1 (949) 345-0676
Desert Bloom
Irvine, CA 92618
USA
Copyright sparkbyexamples.com
x
x