Select specific columns from DataFrame
Drop column(s) from DataFrame
Display rows of DataFrame
Create DataFrame from list or RDD
spark.createDataFrame()
A PySpark DataFrame is a distributed collection of data organized into named columns.
Create DataFrame from list
# Assumes an active SparkSession bound to the name `spark`
df = spark.createDataFrame([(1, "A"), (2, "B")], ["id", "name"])
Print schema of DataFrame
Filter DataFrame based on condition
Group DataFrame by column(s)