Spark – Working with collect_list() and collect_set() functions
Spark SQL collect_list() and collect_set() functions are used to create an array (ArrayType) column on DataFrame by merging rows, typically after group by or window partitions. In this article, I…
4 Comments
August 27, 2020