DIFFERENCE BETWEEN COLASCE AND REPARTITION
Sigiloso
Coalesce -> Reduces the partition without shuffling data -> No guarantee for uniform data distribution. -> Efficient when reducing the partition. Re-partition -> Re-distributes the data evenly. -> Used for increasing the partition. Re-partition can be used to increase the parallelism coalesce can be used to decrease the partition without shuffling the data . used when after applying transformation like filter.