ML: What is the difference between supervised and unsupervised learning? Give an example of each. How does the SVM algorithm work (general description)? How do you make sure you are not overfitting while training a model? How do you measure accuracy or error rates?
Spark: What is an RDD? Why would you ever cache an RDD?
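
For the ML questions on overfitting and on measuring accuracy or error rates, one common approach is to hold out part of the data and compare the score on the training set with the score on the held-out set. The sketch below uses only the Python standard library. The toy threshold model, the synthetic data, and all function names are assumptions made for illustration and do not come from any particular library.

```python
import random


def train_test_split(rows, test_fraction=0.25, seed=0):
    """Shuffle a copy of rows and split it into (train, test)."""
    rng = random.Random(seed)
    shuffled = rows[:]
    rng.shuffle(shuffled)
    n_test = int(len(shuffled) * test_fraction)
    return shuffled[n_test:], shuffled[:n_test]


def accuracy(y_true, y_pred):
    """Fraction of predictions that match the true labels."""
    correct = sum(1 for t, p in zip(y_true, y_pred) if t == p)
    return correct / len(y_true)


def fit_threshold(train):
    """Hypothetical toy model: choose the cut on x with the best training accuracy."""
    labels = [y for _, y in train]
    best_t, best_acc = None, -1.0
    for t, _ in train:
        preds = [1 if x >= t else 0 for x, _ in train]
        acc = accuracy(labels, preds)
        if acc > best_acc:
            best_t, best_acc = t, acc
    return best_t


def predict(threshold, rows):
    """Predict label 1 when x is at or above the learned threshold."""
    return [1 if x >= threshold else 0 for x, _ in rows]


# Synthetic, noise-free data (an assumption): label is 1 when x >= 5.
data = [(x, 1 if x >= 5 else 0) for x in range(20)]
train, test = train_test_split(data)

threshold = fit_threshold(train)
train_acc = accuracy([y for _, y in train], predict(threshold, train))
test_acc = accuracy([y for _, y in test], predict(threshold, test))
error_rate = 1 - test_acc  # error rate is the complement of accuracy

print(f"train accuracy: {train_acc:.2f}")
print(f"test accuracy:  {test_acc:.2f}")
print(f"test error:     {error_rate:.2f}")
```

A training accuracy that is much higher than the held-out accuracy is a typical sign of overfitting. In practice, k-fold cross-validation repeats this split several times to get a more stable estimate.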