sparkbyexamples.com
PySpark Random Sample with Example
PySpark provides a pyspark.sql.DataFrame.sample(), pyspark.sql.DataFrame.sampleBy(), RDD.sample(), and RDD.takeSample() methods to get the random sampling
Naveen Nelamali