Pergunta de entrevista da empresa Booking.com

Why did you use Random Forests instead of Clustering?

Respostas da entrevista

Sigiloso

21 de jun. de 2017

Clustering is unsupervised learning whereas Random Forest is supervised learning ... depending upon the problem and data you would choose what to use

8

Sigiloso

26 de set. de 2016

When choosing between decision trees and clustering, remember that decision trees are themselves a clustering method. The leaves of a decision tree contain clusters of records that are similar to one another and dissimilar from records in other leaves. The difference between the clusters found with a decision tree and the clusters found using other methods such as K-means, agglomerative algorithms, or self-organizing maps is that decision trees are directed while the other techniques I mentioned are undirected. Decision trees are appropriate when there is a target variable for which all records in a cluster should have a similar value. Records in a cluster will also be similar in other ways since they are all described by the same set of rules, but the target variable drives the process.

5