Pergunta de entrevista da empresa Capgemini

Difference between coalesce and repartition

Resposta da entrevista

Sigiloso

18 de set. de 2024

Coalesce is used to reduce the number of partitions. Here data shuffling is very less. Repatriation is used to either increase or decrease the number of partitions. Data distribution is equal here as data is shuffled. This an expensive operations when compared to coalces.