PySpark parallelize() – Create an RDD from a List
PySpark parallelize() is a method of SparkContext that creates an RDD from a list collection. In this article, I will explain the usage of parallelize to create…
August 13, 2020
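As a minimal sketch of what parallelize() does, the snippet below distributes a small Python list as an RDD. The helper name `list_to_rdd`, the `demo` function, the sample data, the app name, and the `local[2]` master are all assumptions made for illustration and are not taken from the article; only `SparkContext.parallelize(c, numSlices)`, `getNumPartitions()`, and `collect()` are actual PySpark calls.

```python
from typing import Any, List


def list_to_rdd(sc: Any, data: List[int], num_slices: int = 2):
    """Distribute a local Python list as an RDD (hypothetical helper)."""
    # numSlices sets how many partitions the list is split into.
    return sc.parallelize(data, numSlices=num_slices)


def demo() -> None:
    """Run the helper against a local Spark session (illustrative only)."""
    # Imported here so the helper above can be read without Spark installed.
    from pyspark.sql import SparkSession

    spark = (
        SparkSession.builder
        .master("local[2]")             # assumption: local mode, 2 threads
        .appName("parallelize-sketch")  # hypothetical app name
        .getOrCreate()
    )
    try:
        rdd = list_to_rdd(spark.sparkContext, [1, 2, 3, 4, 5])
        print(rdd.getNumPartitions())  # number of partitions requested
        print(rdd.collect())           # brings the elements back to the driver
    finally:
        spark.stop()
```

Calling `demo()` in an environment with PySpark and a Java runtime installed starts a local session, builds the RDD, and prints its partition count and contents. Note that `collect()` pulls every element back to the driver, so it is only suitable for small RDDs like this one.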