Meta interview question

Describe how you would detect and mitigate data leakage in a model pipeline.

Interview answer

Confidential

Oct 12, 2025

I explained the difference between target leakage and train/test contamination, and recommended strict temporal splits, feature provenance checks, cross-validation strategies, and a separate holdout set. As an example, I described a case where a future-derived feature would leak, and how I removed it and reran validation.
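
To make the temporal split and the feature provenance check concrete, here is a minimal sketch in plain Python. Everything in it is an assumption made for illustration and not part of the original answer: the toy rows, the field names `predict_on` and `feature_available_on`, the feature names `past_purchases` and `refund_issued`, and the helpers `temporal_split`, `leaky_features`, and `shared_ids`. It assumes each feature value carries the date on which it became available, so a feature known only after prediction time can be flagged as future-derived.

```python
from datetime import date

# Hypothetical toy data: each row records when the prediction is made and,
# per feature, the date on which that feature's value became available.
rows = [
    {"id": 1, "predict_on": date(2025, 1, 10),
     "feature_available_on": {"past_purchases": date(2025, 1, 9),
                              "refund_issued": date(2025, 1, 20)}},
    {"id": 2, "predict_on": date(2025, 1, 25),
     "feature_available_on": {"past_purchases": date(2025, 1, 24),
                              "refund_issued": date(2025, 2, 3)}},
    {"id": 3, "predict_on": date(2025, 2, 5),
     "feature_available_on": {"past_purchases": date(2025, 2, 4),
                              "refund_issued": date(2025, 2, 15)}},
    {"id": 4, "predict_on": date(2025, 2, 18),
     "feature_available_on": {"past_purchases": date(2025, 2, 17),
                              "refund_issued": date(2025, 3, 1)}},
]


def temporal_split(rows, cutoff):
    """Train on rows before the cutoff date, test on rows at or after it."""
    train = [r for r in rows if r["predict_on"] < cutoff]
    test = [r for r in rows if r["predict_on"] >= cutoff]
    return train, test


def leaky_features(rows):
    """Names of features that, in any row, became available after prediction time."""
    leaky = set()
    for r in rows:
        for name, available_on in r["feature_available_on"].items():
            if available_on > r["predict_on"]:
                leaky.add(name)
    return sorted(leaky)


def shared_ids(train, test):
    """IDs present in both splits, a simple train/test contamination check."""
    return {r["id"] for r in train} & {r["id"] for r in test}


train, test = temporal_split(rows, cutoff=date(2025, 2, 1))
print("future-derived features:", leaky_features(rows))
print("ids in both splits:", shared_ids(train, test))
```

In a real pipeline, the features flagged by a check like this would be dropped or recomputed as of prediction time, and validation rerun on the temporal split to see how much of the original score depended on them.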