Pergunta de entrevista da empresa Altimetrik

DIFFERENCE BETWEEN COLASCE AND REPARTITION

Resposta da entrevista

Sigiloso

30 de jan. de 2025

Coalesce -> Reduces the partition without shuffling data -> No guarantee for uniform data distribution. -> Efficient when reducing the partition. Re-partition -> Re-distributes the data evenly. -> Used for increasing the partition. Re-partition can be used to increase the parallelism coalesce can be used to decrease the partition without shuffling the data . used when after applying transformation like filter.