DMI’S WIKIPEDIA TOOLS




Erik Borra

Digital Methods Initiative
University of Amsterdam

25 March 2009
Wiki Analytics Workshop

  1. DMI’S WIKIPEDIA TOOLS
     Erik Borra
     Digital Methods Initiative, University of Amsterdam
     25 March 2009
  2. Digital Methods Initiative
     How can the internet be made to show what is happening in society?
     How to collect and analyze data and distill trends from the Web?
     Follow the medium as opposed to importing standard methods from social sciences.
  3. tools @ dmi wiki
     http://wiki.digitalmethods.net/Dmi/ToolDatabase?cat=DeviceCentric&subcat=Wikipedia
  4. wikipedia bot edits
     S. Niederer and J. Van Dijck (2010). “The case of Wikipedia: Wisdom of the crowd or technicity of content?” New Media and Society
     Short version @ http://wiki.digitalmethods.net/Dmi/NetworkedContent
  5. wikipedia bot edits scraper
     How?
     • Enter the link to an article
     • Scraper retrieves all edit logs for an article
     • Filters out all mentions of ‘bot’ and ‘using’
     • Returns permalink, date, time, user, permalink, comment
     Why? To find out how much the upkeep of an article depends on bots.
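The filtering step can be sketched as follows. This is a minimal illustration, not the DMI scraper itself: it assumes each edit is a dict with `user` and `comment` keys, and it interprets the slide's filter as "flag an edit when the user name mentions 'bot' or the comment mentions 'using'" (as in "using" an automated editing tool). The function names `is_bot_edit` and `bot_share` are hypothetical.

```python
import re
from typing import Dict, List

# Assumed heuristic, based on the slide's two keywords.
# A plain substring match on "bot" will also flag human users such as
# "Abbott", so results would need manual checking.
BOT_USER = re.compile(r"bot", re.IGNORECASE)
TOOL_COMMENT = re.compile(r"\busing\b", re.IGNORECASE)


def is_bot_edit(edit: Dict[str, str]) -> bool:
    """Return True if the edit looks bot-made or tool-assisted."""
    user = edit.get("user", "")
    comment = edit.get("comment", "")
    return bool(BOT_USER.search(user) or TOOL_COMMENT.search(comment))


def bot_share(edits: List[Dict[str, str]]) -> float:
    """Fraction of edits flagged by is_bot_edit (0.0 for an empty log)."""
    if not edits:
        return 0.0
    flagged = sum(1 for edit in edits if is_bot_edit(edit))
    return flagged / len(edits)
```

The share returned by `bot_share` is one rough way to express how much of an article's edit history is attributable to bots and tools; the remaining rows can be kept for separate analysis of human edits.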
  6. two examples
     http://wiki.digitalmethods.net/Dmi/DebottingWikipedia
     • Dependence of climate change articles on bots
     • Anti-vandalism bot activity within a disputed article
  7. wikipedia edits scraper and ip localizer
     How?
     • Enter the link to an article
     • Scraper retrieves all edit logs for an article
     • When an IP address is encountered instead of a username, MaxMind's IP-to-geo database is queried for geo information
     • Returns permalink, date, time, user (or IP), permalink, comment, (city, country, lat, lon)
     Why? Edit-history analysis, scandal research, places of edits.
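A minimal sketch of the "IP instead of username" step, under stated assumptions: each edit is a dict with a `user` key, and the geo lookup is passed in as a callable that returns a dict with `city`, `country`, `lat` and `lon` (or `None` when nothing is found). The names `is_ip`, `localize` and `lookup` are hypothetical; in practice `lookup` could wrap a query against a MaxMind database, which is not shown here.

```python
import ipaddress
from typing import Callable, Dict, Optional

GEO_FIELDS = ("city", "country", "lat", "lon")


def is_ip(user: str) -> bool:
    """Return True if the user field is an IPv4 or IPv6 address."""
    try:
        ipaddress.ip_address(user)
        return True
    except ValueError:
        return False


def localize(
    edit: Dict[str, object],
    lookup: Callable[[str], Optional[Dict[str, object]]],
) -> Dict[str, object]:
    """Copy the edit and add geo columns; they stay None for named users."""
    row = dict(edit)
    user = str(edit.get("user", ""))
    geo = lookup(user) if is_ip(user) else None
    for field in GEO_FIELDS:
        row[field] = geo.get(field) if geo else None
    return row
```

Passing the lookup in as a parameter keeps the IP detection testable without a geo database, and lets the same row format be reused with a different geolocation source.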
  8. ip to geo cases
     Scandal research
       WikiScanner (http://wikiscanner.virgil.gr)
     Places of edits
       http://mastersofmedia.hum.uva.nl/2007/10/07/repurposing-the-wikiscanner-comparing-dutch-universities-edits-on-wikipedia/
  9. wikipedia network analysis
     How?
     • Enter the link to an article
     • Scraper retrieves all bidirectional links to the article, from within Wikipedia
     • Scraper parses those articles and retrieves all their links
     • (repeat the previous step until a certain depth is reached)
     • List links in a table (link from -> to)
     • Visualize
     Why? Article network ecology.
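The crawl described above can be sketched as a breadth-first traversal that records a "from -> to" edge list. This is an illustration under assumptions, not the DMI tool: fetching and parsing pages is abstracted into a hypothetical `get_links` callable that returns the titles a page links to, and "bidirectional" is read as "the two articles link to each other". The names `bidirectional_neighbors` and `crawl_links` are likewise hypothetical.

```python
from collections import deque
from typing import Callable, Iterable, List, Set, Tuple


def bidirectional_neighbors(
    outgoing: Iterable[str], incoming: Iterable[str]
) -> List[str]:
    """Pages that both link to the article and are linked from it."""
    return sorted(set(outgoing) & set(incoming))


def crawl_links(
    seed: str,
    get_links: Callable[[str], Iterable[str]],
    max_depth: int,
) -> List[Tuple[str, str]]:
    """Breadth-first crawl; returns (from, to) edges found up to max_depth."""
    edges: List[Tuple[str, str]] = []
    seen: Set[str] = {seed}
    queue = deque([(seed, 0)])
    while queue:
        page, depth = queue.popleft()
        if depth >= max_depth:
            continue  # pages at the depth limit are listed but not expanded
        for target in get_links(page):
            edges.append((page, target))
            if target not in seen:
                seen.add(target)
                queue.append((target, depth + 1))
    return edges
```

The resulting edge list maps directly onto the "link from -> to" table and can be loaded into a graph visualization tool. The `seen` set ensures that each page is expanded only once, even when articles link back to each other.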
  11. wip: controversy generator
      Wikipedia can be seen as a controversy-defusing device, as it strives for a neutral point of view (NPOV) and well-balanced articles. What if one disentangles the consensus and lays bare the controversies? How would one do that?
  12. wip: controversy generator, possible ways forward
      • analyze traces in the system
        • edit-histories
        • protected pages
        • number of followers
        • forkings / splits
        • article length
        • bot edits
        • templates (detecting controversy types)
        • ...
