About SparkByExamples.com

Hello, Spark enthusiast! Welcome to SparkByExamples.com.

SparkByExamples.com is a Big Data, Machine Learning, and Cloud community page created to share the knowledge I come across in my real-time projects. It started with tutorials on Apache Spark and PySpark and later expanded to Big Data ecosystem tools and machine learning. All the examples explained on this site are simple, easy to understand, and well tested in our development environment.

How Does It Help You?

You can use SparkByExamples.com to:

  1. Learn Big Data ecosystem tools like Hadoop, Apache Spark, PySpark, Hive, HBase, Snowflake, and many more.
  2. Prepare for interviews.
  3. Prepare for certifications.
  4. Get technical help with any challenges you come across during your project.

How Can You Help?

If you like SparkByExamples.com and the articles explained here, you can support its growth in the following ways.

  1. Keep visiting the site to learn and get help.
  2. Recommend it to your friends and provide backlinks.
  3. Leave a comment if you see any errors in the articles.
  4. Share your knowledge by writing guest articles.

If you have any comments, feedback, questions, or recommendations, you can contact me using the form below.

This Post Has 11 Comments

  1. Bikash

    Great website with to-the-point information. How do I connect with you?

  2. Sandro Jorge

    This site is excellent! Thanks all of you! It helps a lot!!!

  3. Srinivas

    Excellent explanation with simple examples. I really appreciate your hard work in keeping all of this on the blog. Thank you.

  4. Astra

    Thank you so much for all these wonderful PySpark resources. My PySpark learning style is to start with hands-on practical projects rather than in-depth features. When it comes to data engineering, it’s really hard to find compact, concise, straight-to-the-point code snippets with detailed explanations, and this website does exactly that! Wishing you all the best, and I hope you post more topics here in the future.

    If you have a Patreon or there’s another way to support you with money, let me know. I don’t mind paying a monthly subscription to support you.

  5. ozan uzun

    First of all, thank you for the examples. I’m studying Spark on this website. I’m using Jupyter Notebook and taking simple notes in Turkish. If you allow it, I would like to share them on my GitHub and LinkedIn accounts. They contain your examples.
    Thank you.

    1. NNK

      Hi Ozan Uzun,

      Thanks for your reply. I appreciate that you want to share, but all examples used here are proprietary to SparkByExamples.com, and copying them to GitHub, LinkedIn, or any other platform is not acceptable.

      PS: We are already working on publishing these examples as Jupyter notebooks so that they can be used in Databricks as well. If you are interested, you can contribute to the sparkbyexamples GitHub project.


  6. Anonymous

    I just wanted to say THANK YOU!

  7. Anonymous

    Can you please add some very simple projects for beginners?

  8. sandeep


    I am new to Scala. Can you please help with my requirement? I have a CSV file that has a header. I need to skip the header when writing this file to another file in Parquet format. How can I achieve this in Scala using a DataFrame? Please help.

    1. NNK

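      Hi Sandeep. I assume you mean the header row that holds the column names. If so, set the header option when reading the CSV file so that the first line is treated as column names rather than data, and then write the DataFrame as a Parquet file.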
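
      A minimal sketch of this approach in Scala is shown below. The file paths, the application name, and the local master setting are assumptions for illustration only; adjust them to your environment. The inferSchema option is optional and is included here only so that the columns are not all written as strings.

      ```scala
      import org.apache.spark.sql.SparkSession

      object CsvToParquet {
        def main(args: Array[String]): Unit = {
          // Local session for illustration; on a cluster the master is usually set by spark-submit.
          val spark = SparkSession.builder()
            .appName("CsvToParquet")   // hypothetical application name
            .master("local[*]")
            .getOrCreate()

          // header=true makes Spark use the first line as column names instead of a data row.
          val df = spark.read
            .option("header", "true")
            .option("inferSchema", "true")   // optional: infer column types instead of all strings
            .csv("/tmp/input.csv")           // hypothetical input path

          // Parquet stores the column names in its schema, so no header row ends up in the data.
          df.write
            .mode("overwrite")
            .parquet("/tmp/output.parquet")  // hypothetical output path

          spark.stop()
        }
      }
      ```

      Because the column names travel with the Parquet schema, reading the output back with spark.read.parquet should return the same columns without a stray header row in the data.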