pyspark/groupby

Group By

Group DataFrame by column(s)

pyspark
dataframe
aggregate

Command

df.groupBy("col")

Explanation

Used for aggregation on grouped data.

Examples

Count rows per department

df.groupBy("dept").count().show()