i) Spark architecture
ii) Differences between RDDs, DataFrames, and Datasets
iii) What is fault tolerance, and how does Spark handle it?
iv) Memory management and garbage collection in Spark
v) One Spark SQL-based question
vi) One PySpark-based question
vii) ETL implementation in AWS Glue