Pergunta de entrevista da empresa Capital One

How do you deal with unbalanced classification problem?

Respostas da entrevista

Sigiloso

25 de fev. de 2020

Sampling technique like oversampling and downsampling. Models that adapt to unbalanced data like Ada-boosting. Or use anomaly detection methods.

Sigiloso

12 de out. de 2021

The basic idea is sampling (oversampling like resampling) which is not a good option when dealing with huge imbalance. Another option is to set weights for class labels (larger weights for classes with fewer samples). A really good, but tough and many times not feasible to implement, is to do oversampling by using generative networks (i.e. GANs).

Sigiloso

24 de abr. de 2022

over/undersampling, GANs (I believe they require lots of data), and class weights (sometimes refer to class weights)