Apache Spark with Scala - cheatsheet (1) (1)
Apache Spark with Scala - cheatsheet (1) (1)
3. DataFrame Operations
4. Aggregation Functions
5. Join Operations
6. RDD Operations
8. Data Partitioning
● Pivoting Data:
df.groupBy("column").pivot("pivotColumn").agg(sum("value"))
● Explode Array Column: df.withColumn("exploded", explode($"arrayColumn"))
● Rollup: df.rollup("col1", "col2").agg(sum("value"))
● Cube: df.cube("col1", "col2").agg(sum("value"))