Pergunta de entrevista da empresa Barclays

What is Apache Spark, and how does it differ from Hadoop MapReduce? Explain the core components of Apache Spark. Spark Core, Spark SQL, Spark Streaming, MLlib, GraphX. What is the difference between RDD, DataFrame, and Dataset? When would you use each? How does Spark handle fault tolerance? Explain lineage and DAG (Directed Acyclic Graph). What are transformations and actions in Spark? Provide examples of each