Be the first to like this
The advent of Web 2.0 gave birth to a new kind of applications where content is generated through the collaborative contribution of many different users. This form of content generation is believed to generate data of higher quality since the “wisdom of the crowds” makes its way into the data. However, as it is generally the case in real life, there are many issues for which there is no generally accepted opinion. These issues are characterised as controversial. Knowing these issues when reading the user generated content is of major importance in understanding the quality of the data and the trust that should be given to them. In this work we describe a technique that finds these controversial issues by analyzing the edits that have been performed on the data over time. We apply our technique on Wikipedia, the world’s largest known collaboratively generated database and we report our findings.