  • Home
  • About
  • Write For US
{ One stop for all Spark Examples }
Spark by {Examples}
  • Spark
    • Spark RDD
    • Spark DataFrame
    • Spark SQL Functions
    • What’s New in Spark 3.0?
    • Spark Streaming
    • Apache Spark Interview Questions
  • PySpark
  • Pandas
  • Hive
  • Kafka
  • H2O.ai
  • Interview Q
    • Spark Interview Questions
  • More
    • Apache Hadoop
    • Apache HBase
    • Apache Cassandra
    • Snowflake Database
    • H2O Sparkling Water
    • Scala Language
    • Python NumPy
    • R Language
PySpark

PySpark Select First Row of Each Group?

In PySpark, you can select the first row of each group in a DataFrame by partitioning the data with the window partitionBy() function and running the row_number() function over each window partition.…

0 Comments
April 3, 2021
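
To make the idea in the teaser above concrete, here is a minimal plain-Python sketch of the same three steps: partition the rows, number them within each partition, and keep row number 1. It does not call PySpark; the sample rows and the helper name `first_row_per_group` are hypothetical and exist only for illustration. In PySpark the partitioning and numbering steps correspond to Window.partitionBy() and row_number().

```python
from itertools import groupby
from operator import itemgetter

# Hypothetical sample rows: (employee, department, salary)
rows = [
    ("James", "Sales", 3000),
    ("Maria", "Finance", 3900),
    ("Robert", "Sales", 4100),
    ("Jen", "Finance", 3300),
    ("Kumar", "Marketing", 2000),
]


def first_row_per_group(rows, key, order):
    """Return the first row of each group, ordered by `order` within the group."""
    result = []
    # Sorting and grouping by the key plays the role of partitionBy().
    for _, group in groupby(sorted(rows, key=key), key=key):
        # Ordering inside the group and numbering from 1 mimics row_number().
        numbered = enumerate(sorted(group, key=order), start=1)
        # Keeping only row number 1 is the final filter step.
        result.extend(row for n, row in numbered if n == 1)
    return result


# Lowest-paid employee in each department
lowest_paid = first_row_per_group(rows, key=itemgetter(1), order=itemgetter(2))
print(lowest_paid)
```

Sorting by salary in descending order instead (for example `order=lambda r: -r[2]`) would return the highest-paid employee of each department.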
Apache Spark / Apache Spark RDD

Spark RDD fold() function example

In this tutorial, you will learn the syntax and usage of the Spark RDD fold() function and how to use it to calculate the min, max, and total of the elements with…

0 Comments
December 7, 2019
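
As a rough illustration of the fold() behaviour mentioned in the teaser above, the sketch below emulates it in plain Python rather than calling Spark. The helper `fold` and the nested lists standing in for partitions are assumptions made for this example. The point it shows is that the zero value is applied once per partition and once more when the partition results are merged, so it should be neutral for the operation.

```python
from functools import reduce


def fold(partitions, zero, op):
    """Emulate fold: fold each partition from `zero`, then fold the partial results."""
    per_partition = [reduce(op, part, zero) for part in partitions]
    return reduce(op, per_partition, zero)


# Hypothetical data split into three partitions
partitions = [[1, 2, 3], [4, 5], [6]]

total = fold(partitions, 0, lambda a, b: a + b)
smallest = fold(partitions, float("inf"), min)
largest = fold(partitions, float("-inf"), max)

# A non-neutral zero value is added once per partition plus once at the merge.
skewed_total = fold(partitions, 1, lambda a, b: a + b)

print(total, smallest, largest, skewed_total)
```

With three partitions, a zero value of 1 is counted four times, which is why `skewed_total` exceeds `total` by 4.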
Apache Spark / Apache Spark RDD

Spark RDD reduce() function example

The Spark RDD reduce() aggregate action is used to calculate the min, max, and total of the elements in a dataset. In this tutorial, I will explain the RDD reduce function syntax and…

0 Comments
December 7, 2019
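
The reduce() idea from the teaser above can be sketched with Python's built-in functools.reduce, which applies a two-argument function pairwise across a collection. This is a local stand-in, not the Spark API, and the sample list is made up. Because a distributed reduce may combine elements in any order, the function passed in should be commutative and associative, as the three below are.

```python
from functools import reduce

# Hypothetical dataset
numbers = [5, 2, 9, 1, 7]

total = reduce(lambda a, b: a + b, numbers)
smallest = reduce(lambda a, b: a if a < b else b, numbers)
largest = reduce(lambda a, b: a if a > b else b, numbers)

print(total, smallest, largest)
```

Unlike fold(), there is no zero value here, so the collection must not be empty.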
Apache Spark / Apache Spark RDD

Spark RDD aggregate() operation example

In this tutorial, you will learn how to use the Spark RDD aggregate() action to calculate the min, max, total, and count of RDD elements in Scala, and the…

2 Comments
December 2, 2019
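
To show how an aggregate-style operation can produce min, max, total, and count in one pass, as the teaser above describes, here is a plain-Python emulation. It is a sketch under assumptions: the helper `aggregate`, the names `seq_op` and `comb_op`, and the nested lists standing in for partitions are all illustrative and are not Spark calls. One function folds elements into an accumulator inside a partition, and a second function merges the per-partition accumulators.

```python
from functools import reduce


def aggregate(partitions, zero, seq_op, comb_op):
    """Emulate aggregate: seq_op within each partition, comb_op across partitions."""
    per_partition = [reduce(seq_op, part, zero) for part in partitions]
    return reduce(comb_op, per_partition, zero)


# Accumulator layout: (min, max, total, count)
zero = (float("inf"), float("-inf"), 0, 0)


def seq_op(acc, x):
    # Fold one element into the accumulator.
    return (min(acc[0], x), max(acc[1], x), acc[2] + x, acc[3] + 1)


def comb_op(a, b):
    # Merge two accumulators produced by different partitions.
    return (min(a[0], b[0]), max(a[1], b[1]), a[2] + b[2], a[3] + b[3])


# Hypothetical data split into two partitions
stats = aggregate([[3, 8], [1, 6, 4]], zero, seq_op, comb_op)
print(stats)
```

The accumulator type (a 4-tuple) differs from the element type (a number), which is the main thing an aggregate-style operation allows that reduce and fold do not.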
Apache Spark

Spark DataFrame Select First Row of Each Group?

In this Spark article, I've explained how to select the first row, the minimum, and the maximum of each group in a DataFrame using Spark SQL window functions, with a Scala example. Though…

5 Comments
September 26, 2019

Categories

  • Apache Hadoop
  • Apache Spark
  • Apache Spark Streaming
  • Apache Kafka
  • Apache HBase
  • Apache Cassandra
  • Snowflake Database
  • H2O Sparkling Water
  • PySpark

Recent Posts

  • R dplyr filter() – Subset DataFrame Rows
  • R Subset Data Frame by Column Value & Name
  • R Subset data frame with Examples
  • R subset() Function – Get Rows & Columns
  • R Select Rows with Examples
  • Select Rows with Row Names in R
  • Select Rows based on Column Value in R
  • Select Rows by Index in R with Examples
  • R Select Rows by Condition with Examples

About SparkByExamples.com

SparkByExamples.com is a Big Data and Spark examples community page. All examples are simple, easy to understand, and well tested in our development environment.
Copyright sparkbyexamples.com