The document summarizes a weekly meeting of an informatics student group discussing their work on Twitter Transparency Project datasets. They found some datasets used different languages and hashtags to spread content. Most fake COVID news was early 2020 with few false stories since on Chinese social media. They cleaned data by changing separators and removing line breaks and special characters, processing over 50GB of data. Sentiment analysis tools worked on English data but showed abnormal results on Chinese data. They have datasets on real, fake and disinformation COVID news in Chinese and English and will use existing data to train judgment models.