pyspark.sql.functions.coalesce#
- pyspark.sql.functions.coalesce(*cols)[source]#
- Returns the first column that is not null. - New in version 1.4.0. - Changed in version 3.4.0: Supports Spark Connect. - Parameters
- colsColumnor str
- list of columns to work on. 
 
- cols
- Returns
- Column
- value of the first column that is not null. 
 
 - Examples - >>> cDf = spark.createDataFrame([(None, None), (1, None), (None, 2)], ("a", "b")) >>> cDf.show() +----+----+ | a| b| +----+----+ |NULL|NULL| | 1|NULL| |NULL| 2| +----+----+ - >>> cDf.select(coalesce(cDf["a"], cDf["b"])).show() +--------------+ |coalesce(a, b)| +--------------+ | NULL| | 1| | 2| +--------------+ - >>> cDf.select('*', coalesce(cDf["a"], lit(0.0))).show() +----+----+----------------+ | a| b|coalesce(a, 0.0)| +----+----+----------------+ |NULL|NULL| 0.0| | 1|NULL| 1.0| |NULL| 2| 0.0| +----+----+----------------+