ByteDance interview question

How would you design an audio-visual AI system that marks the sound source in a video?