Interview question from Tata Consultancy Services

Difference between coalesce and repartition? What is the lit function used for? What is Hive vectorization? How to reverse a string in Python? How to check whether a JSON file with 5000 lines has matching numbers of opening and closing braces? How do you submit a job using spark-submit? How to delete duplicates in SQL but keep one copy? Difference between reduceByKey() and groupByKey()? Why is data shuffling expensive? Difference between cache() and persist()? What does echo $0 output in Bash? Kafka: retention period, consumer groups, how to get the latest message, what is an offset?
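
For the two Python questions (reversing a string and checking the braces in a JSON file), one possible answer is sketched below. The function names `reverse_string` and `braces_balanced` are made up for illustration, and the brace check assumes the file is small enough to read into memory, which a 5000-line file normally is.

```python
def reverse_string(s: str) -> str:
    """Reverse a string using a slice with a step of -1."""
    return s[::-1]


def braces_balanced(text: str) -> bool:
    """Return True if every '{' is closed by a later '}'.

    Braces that appear inside JSON string literals are ignored,
    and backslash escapes inside strings are skipped.
    """
    depth = 0          # number of currently open braces
    in_string = False  # True while scanning inside a "..." literal
    escaped = False    # True right after a backslash inside a string
    for ch in text:
        if in_string:
            if escaped:
                escaped = False
            elif ch == "\\":
                escaped = True
            elif ch == '"':
                in_string = False
        elif ch == '"':
            in_string = True
        elif ch == "{":
            depth += 1
        elif ch == "}":
            depth -= 1
            if depth < 0:
                # A closing brace appeared before its opening brace.
                return False
    return depth == 0
```

For a file, the text can be read with `open(path).read()` and passed to `braces_balanced`. Simply counting `{` and `}` would miss wrong ordering such as `}{` and would miscount braces inside string values, which is why the sketch tracks nesting depth and string state. If the goal is to validate the whole document rather than only its braces, `json.load` raises `json.JSONDecodeError` on malformed input.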