SQL: joins, window functions
Python: new column assignment, string and list manipulations
How does XGBoost work? Explain the step-by-step process with an example.
How to represent a categorical column with high cardinality? How can embeddings be generated? How might an encoder-decoder help with this?
How to find the presence of multicollinearity in data?
What is the chi-square test of independence?
How to handle imbalanced classification? Why is PR AUC better suited to such cases than ROC AUC?
How to handle a large number of classes in multiclass classification? What is negative sampling, and how does it help in such scenarios?
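
For the chi-square test of independence question, a minimal sketch of the statistic may help when preparing an answer. It is only an illustration: the function name `chi_square_independence` and the 2x2 counts are made up for this example, and it assumes every expected cell count is non-zero. It computes the test statistic and the degrees of freedom only; a p-value would need a chi-square distribution from a statistics library.

```python
def chi_square_independence(table):
    """Return (statistic, degrees_of_freedom) for a contingency table.

    `table` is a list of rows of observed counts (hypothetical helper,
    assumes no expected count is zero).
    """
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]
    grand_total = sum(row_totals)

    statistic = 0.0
    for i, row in enumerate(table):
        for j, observed in enumerate(row):
            # Expected count if the row and column variables were independent.
            expected = row_totals[i] * col_totals[j] / grand_total
            statistic += (observed - expected) ** 2 / expected

    dof = (len(row_totals) - 1) * (len(col_totals) - 1)
    return statistic, dof


# Illustrative counts only: rows = two groups, columns = two outcomes.
observed = [[30, 10], [20, 40]]
stat, dof = chi_square_independence(observed)
print(f"chi-square = {stat:.4f}, degrees of freedom = {dof}")
```

With one degree of freedom, the 5% critical value is about 3.841, so a statistic above that would lead to rejecting independence at that level.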