About SparkByExamples.com

Hello Spark Enthusiast !! Welcome to SparkByExamples.com

SparkByExamples.com is a BigData, Machine Learning, and Cloud platform community page with the intent to share the knowledge that I come across in my real-time projects. It initially started providing tutorials on Apache Spark & Pyspark and later extended to Bigdata ecosystem tools, machine learning. All examples I have explained on this site are simple, easy to understand, and well tested in our development environment.

How It Helps You?

You can use SparkByExamples.com to

  1. Learn Big Data echo system tools like Hadoop, Apache Spark, PySpark, Hive, HBase, Snowflake and many more.
  2. Prepare for Interviews.
  3. Prepare for Certificates.
  4. Finally get technical help for any challenges you come across during your project.

How You Could Help?

If you like SparkByExamples.com and the articles explained here, you can support its growth in the following ways.

  1. You can support this site by keep visiting to learn and get help.
  2. Recommend to your friends and reference SparkByExamples.com on your sites.
  3. I tried my best not to have any errors in the examples, but we are humans and we make mistakes. So, if you see any errors in the articles, please suggest the corrections by commenting.
  4. Finally, support sharing your knowledge by Writing Guest articles.

If you have any comments, feedback, questions, or recommendations you can contact me using the below form.

PS: If you like the articles and the way I explained them, please provide your feedback or testimonials in the comment section below. Your few words motivate me to write more and more good articles !!

Thank you & Happy Coding !!


Leave a Reply

This Post Has 39 Comments

  1. sandeep


    I am new to scala. can you please help with my requirement. I have csv file that has header.I need to skip the header when writing this file into another file with parquet format. How can i acheive this in scala using data frame. Please help

    1. NNK

      Hi Sandeep. You mean header as column names. If so, you can skip the header while reading a CSV file and write the DataFrame as parquet file.

  2. Anonymous

    can you please add some very very simple projects for beginners?

  3. Anonymous

    I just wanted to say THANK YOU!

  4. ozan uzun

    First of all thank you for examples.I’m studing spark this website, I’m using jupyter notebook and take simple turkish notes. if you let, I want to share them in my github and linkedln account. They are have your example in them.
    thank you.

    1. NNK

      Hi Ozan UZun,

      Thanks for your reply. I appreciate you wanted to share but all examples used here are proprietary to SparkByExamples.com and copying these to Github, LinkedIn, or in any form is not acceptable.

      PS: We are already working on publishing these examples for the Jupyter notebook so that they can be used in Databricks as well. If you are interested you can contribute to sparkbyexamples github project.


  5. Astra

    Thank you so much for all these wonderful PySpark resources, my PySpark learning style is more about hands on practical project first rather than in-depth features right at the start, but when it comes to data engineering it’s really hard to find compact, concise, straight to the point code snippet example with detail explanation, and this website does exactly that! Wish you all the best and hope you post more here in the future with more topics

    If you have Patreon or if there’s other way to support you with money let me know, I don’t mind paying monthly subscription to support you

  6. Srinivas

    Excellent explanation with simple examples, really appreciate it for your hard work for keeping all in this blog, thank you

  7. Sandro Jorge

    This site is excellent! Thanks all of you! It helps a lot!!!

  8. Bikash

    Great website and to the point information, how do I connect with you?

  9. Vishnu

    This site is brilliant. Thank You very much!!

  10. Doniv

    Great website with concise explanations on Spark tutorials, also great examples.
    Thanks a lot.

  11. Raman

    Really Great tutorial with scala .i have cleared my interview by following this tutorial.
    it would be great if you can make this tutorial as a PDF ,so that people can use this as a reference .

  12. Anonymous

    Hi . Just an appreciation post. you are doing great , this blog of Spark by examples is good for learning and look at the coding part & examples in Spark-Scala , which i did not find anywhere else. Keep it up.

  13. Mahesh Kumar

    This website is just amazing. I wanted to quit my career in pyspark since i had very little knowledge in pyspark, one day apparently i visited this website and now things has changed. I enjoy working in pyspark because of this website only. for any query i directly come over here and search for the concept. The best thing about this website is it has very basic example yet powerful which helps in understanding the concept easily. Thank you for providing such content. I admire you efforts. Please keep it up.

    1. NNK

      Thank you for your comment. I am glad SparkByExamples.com is helping the community !!

  14. AlixaProDev

    I would say you guys are life sever. You guys have put a lot of effort in here. Thank you dear.

  15. Praveen Kumar

    Hi Team, You are doing an awesome job. Your ways of explanation on each and every topic in sparks gives me a clear understanding. Thank you. Praveen

  16. Emil

    Hi Team, I hope you are doing well. I would tike to share with you my deep appreciation for what you did.
    Best Regards, from Azerbaijan 🇦🇿

    1. NNK

      Hi Emil, Thank you for your kind words!!

  17. Dev

    Love the work you are doing!
    I learned a lot in PySpark. Thank you! 😀

  18. Ak

    Hi Team,

    I cannot thankyou enough for all the learnings that I am getting from here.

    Lots of Respect!!

    Many Thanks Again!!

  19. Gabi

    I highly appreciate your help and service here. Much respect guys

  20. Denise

    Very helpful. Thank you

  21. Meenakshi

    I have never seen website like this,very well explained with simple example. Thanks for your great effort.

    1. NNK

      Thanks, Meenakshi for your wonderful words.

  22. Anonymous

    Awesome site. Thanks a lot. I can find here almost any question I want to know about Spark.

  23. Anonymous

    I really like the way you have written the article, mainly the examples!! Truly helpful!!

  24. Manasa

    very good explanation.Deepest Gratitude for your work Thank you:)

  25. MrDk

    thanks alot , go ahed !!

  26. Srinu

    Good Articles with Nice explanations!. Great Job.

  27. Sri Ganti

    Hi Naveen, THANK YOU SO MUCH. all the topics are very well organized and easy to learn.

  28. Ansari

    neat and clean explaination with good examples

  29. Kamesh

    Great Website.
    How can i signup or avoid adds ?

  30. Lohith

    Your Teaching style by example is awesome while taking out all the complexity and learning Spark is enjoyable.

  31. Luiz Geraldo

    Hello everyone.
    I started learning Numpy in this tutorial.
    The way you explain the subject is clear and simple; the examples are relevant and finally, the site is very good.
    Congratulations for the team.

  32. Navaneethan

    Thank you for creating this – it has been extremely useful!