1.Explain your project architecture 2.What re services running on the edge node 3.To filter out the Error logs from log table using pyspark. 4.Few python code as I am comfortable with python. 5.Sqoop related questions. 6.Hive questions 7.File formats 8. And few architecture based questions.