About SparkByExamples.com

Hello Spark Enthusiast !! Welcome to SparkByExamples.com

SparkByExamples.com is a BigData, Machine Learning, and Cloud platform community page with the intent to share the knowledge that I come across in my real-time projects. It initially started providing tutorials on Apache Spark & Pyspark and later extended to Bigdata ecosystem tools, machine learning. All examples I have explained on this site are simple, easy to understand, and well tested in our development environment.

How It Helps You?

You can use SparkByExamples.com to

  1. Learn Big Data echo system tools like Hadoop, Apache Spark, PySpark, Hive, HBase, Snowflake and many more.
  2. Prepare for Interviews.
  3. Prepare for Certificates.
  4. Finally get technical help for any challenges you come across during your project.

How You Could Help?

If you like SparkByExamples.com and the articles explained here, you can support its growth in the following ways.

  1. You can support this site by keep visiting to learn and get help.
  2. Recommend to your friends and reference SparkByExamples.com on your sites.
  3. I tried my best not to have any errors in the examples, but we are humans and we make mistakes. So, if you see any errors in the articles, please suggest the corrections by commenting.
  4. Finally, support sharing your knowledge by Writing Guest articles.

If you have any comments, feedback, questions, or recommendations you can contact me using the below form.

PS: If you like the articles and the way I explained them, please provide your feedback or testimonials in the comment section below. Your few words motivate me to write more and more good articles !!

Thank you & Happy Coding !!


Leave a Reply

This Post Has 23 Comments

  1. Ak

    Hi Team,

    I cannot thankyou enough for all the learnings that I am getting from here.

    Lots of Respect!!

    Many Thanks Again!!

  2. Dev

    Love the work you are doing!
    I learned a lot in PySpark. Thank you! đŸ˜€

  3. Emil

    Hi Team, I hope you are doing well. I would tike to share with you my deep appreciation for what you did.
    Best Regards, from Azerbaijan đŸ‡¦đŸ‡¿

    1. NNK

      Hi Emil, Thank you for your kind words!!

  4. Praveen Kumar

    Hi Team, You are doing an awesome job. Your ways of explanation on each and every topic in sparks gives me a clear understanding. Thank you. Praveen

  5. AlixaProDev

    I would say you guys are life sever. You guys have put a lot of effort in here. Thank you dear.

  6. Mahesh Kumar

    This website is just amazing. I wanted to quit my career in pyspark since i had very little knowledge in pyspark, one day apparently i visited this website and now things has changed. I enjoy working in pyspark because of this website only. for any query i directly come over here and search for the concept. The best thing about this website is it has very basic example yet powerful which helps in understanding the concept easily. Thank you for providing such content. I admire you efforts. Please keep it up.

    1. NNK

      Thank you for your comment. I am glad SparkByExamples.com is helping the community !!

  7. Anonymous

    Hi . Just an appreciation post. you are doing great , this blog of Spark by examples is good for learning and look at the coding part & examples in Spark-Scala , which i did not find anywhere else. Keep it up.

  8. Raman

    Really Great tutorial with scala .i have cleared my interview by following this tutorial.
    it would be great if you can make this tutorial as a PDF ,so that people can use this as a reference .

  9. Doniv

    Great website with concise explanations on Spark tutorials, also great examples.
    Thanks a lot.

  10. Vishnu

    This site is brilliant. Thank You very much!!

  11. Bikash

    Great website and to the point information, how do I connect with you?

  12. Sandro Jorge

    This site is excellent! Thanks all of you! It helps a lot!!!

  13. Srinivas

    Excellent explanation with simple examples, really appreciate it for your hard work for keeping all in this blog, thank you

  14. Astra

    Thank you so much for all these wonderful PySpark resources, my PySpark learning style is more about hands on practical project first rather than in-depth features right at the start, but when it comes to data engineering it’s really hard to find compact, concise, straight to the point code snippet example with detail explanation, and this website does exactly that! Wish you all the best and hope you post more here in the future with more topics

    If you have Patreon or if there’s other way to support you with money let me know, I don’t mind paying monthly subscription to support you

  15. ozan uzun

    First of all thank you for examples.I’m studing spark this website, I’m using jupyter notebook and take simple turkish notes. if you let, I want to share them in my github and linkedln account. They are have your example in them.
    thank you.

    1. NNK

      Hi Ozan UZun,

      Thanks for your reply. I appreciate you wanted to share but all examples used here are proprietary to SparkByExamples.com and copying these to Github, LinkedIn, or in any form is not acceptable.

      PS: We are already working on publishing these examples for the Jupyter notebook so that they can be used in Databricks as well. If you are interested you can contribute to sparkbyexamples github project.


  16. Anonymous

    I just wanted to say THANK YOU!

  17. Anonymous

    can you please add some very very simple projects for beginners?

  18. sandeep


    I am new to scala. can you please help with my requirement. I have csv file that has header.I need to skip the header when writing this file into another file with parquet format. How can i acheive this in scala using data frame. Please help

    1. NNK

      Hi Sandeep. You mean header as column names. If so, you can skip the header while reading a CSV file and write the DataFrame as parquet file.