Interview question from Coditation Systems

Why use batch normalization? Why is cross-entropy loss used? What are the assumptions of linear regression? How do you reduce high variance? What is an attention network? What is the Gini index? What are vanishing and exploding gradients?
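
For the Gini index question, a small worked example may help. This is a minimal sketch, assuming the question refers to the Gini impurity used for decision-tree splits (1 minus the sum of squared class proportions) and not the economic Gini coefficient. The function name `gini_impurity` and the sample labels are hypothetical, chosen only for illustration.

```python
from collections import Counter


def gini_impurity(labels):
    """Return the Gini impurity 1 - sum(p_k ** 2) of a list of class labels.

    Assumption: "Gini index" means the decision-tree impurity measure.
    """
    n = len(labels)
    if n == 0:
        # An empty node has no impurity by convention in this sketch.
        return 0.0
    counts = Counter(labels)
    return 1.0 - sum((count / n) ** 2 for count in counts.values())


# A pure node has impurity 0; an evenly split binary node has impurity 0.5.
pure = gini_impurity(["yes", "yes", "yes", "yes"])
mixed = gini_impurity(["yes", "yes", "no", "no"])
print(pure, mixed)
```

Lower values mean a node is closer to containing a single class, which is why a tree-building procedure would prefer splits that reduce this quantity.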