Yun Yuan is developing a web app to analyze how opinions evolve over time in Reddit comments. The app will organize comments for each post in timestamp order and identify hot comments. Reddit comment and post data will be ingested from JSON files into Hadoop and analyzed using Spark to identify trends. The results will then be presented on a Flask web app. Challenges include handling null values in the JSON data and varying comment trends. The project uses distributed computing clusters with Hadoop, Kafka, Cassandra and Spark to process the large Reddit data at a cost of around $400 per month.