Pergunta de entrevista da empresa Microsoft

Devise an algorithm/model that accepts text as an input (social media posts) and needs to determine whether it is fake or not. The dataset at your disposal, which should be used for training, has additional information about each post (like the number of comments). In some cases, some of the data is missing, How would you handle something like this?