How do you deal with unbalanced classification problem?
Respostas da entrevista
Sigiloso
25 de fev. de 2020
Sampling technique like oversampling and downsampling. Models that adapt to unbalanced data like Ada-boosting. Or use anomaly detection methods.
Sigiloso
12 de out. de 2021
The basic idea is sampling (oversampling like resampling) which is not a good option when dealing with huge imbalance. Another option is to set weights for class labels (larger weights for classes with fewer samples). A really good, but tough and many times not feasible to implement, is to do oversampling by using generative networks (i.e. GANs).
Sigiloso
24 de abr. de 2022
over/undersampling, GANs (I believe they require lots of data), and class weights (sometimes refer to class weights)