SlideShare a Scribd company logo
DMI’S WIKIPEDIA TOOLS




Erik Borra

Digital Methods Initiative
University of Amsterdam

25 March 2009
Digital Methods Initiative


How can the internet be made to show what is happening in
society?

How to collect and analyze data and distill trends from the Web?

Follow the medium as opposed to importing standard methods
from social sciences.
tools @ dmi wiki




http://wiki.digitalmethods.net/Dmi/ToolDatabase?
cat=DeviceCentric&subcat=Wikipedia
wikipedia bot edits


S. Niederer and J. Van Dijck (2010). “The case of Wikipedia:
Wisdom of the crowd or technicity of content?” New Media and
Society

Short version @ http://wiki.digitalmethods.net/Dmi/
NetworkedContent
wikipedia bot edits scraper


How?
• Enter the link to an article
• Scraper retrieves all edit logs for an article
• Filters out all mentions of ‘bot’ and ‘using’
• Returns permalink, date, time, user, permalink, comment
Why?
to find out dependency of article upkeep by bots
two examples




http://wiki.digitalmethods.net/Dmi/DebottingWikipedia

Dependence of climate change articles on bots

Anti-vandalism bot activity within a disputed article
wikipedia edits scraper and ip localizer

How?
• Enter the link to an article
• Scraper retrieves all edit logs for an article
• When an IP is encountered instead of a username, MaxMinds
  IP-to-GEO database will be queried for geo information
• Returns permalink, date, time, user (or IP), permalink,
  comment, (city, country, lat, lon)

Why? Edit-history analysis, scandal research, places of edits.
ip to geo cases



Scandal research
  WikiScanner (http://wikiscanner.virgil.gr)

Places of edits
   http://mastersofmedia.hum.uva.nl/2007/10/07/
   repurposing-the-wikiscanner-comparing-dutch-universities-
   edits-on-wikipedia/
wikipedia network analysis

How?
• Enter the link to an article
• Scraper retrieves all bidirectional links to the article, from
  within Wikipedia
• Scraper parses those articles and retrieves all their links
  • (reiterate previous step until certain depth)
• List links in table (link from -> to)
• Visualize
Why? Article network ecology.
Body Text




Body text
wip: controversy generator




Wikipedia can be seen as a controversy-defusing device as it
strives to NPOV but well-balanced articles.

What if one disentangles the consensus and lays bare
controversies? How would one do that?
wip: controversy generator, possible ways forward

• analyze traces in the system
  • edit-histories
  • protected pages
  • amount of followers
  • forkings / splits
  • article length
  • bot edits
  • templates (detecting controversy types)
  • ...

More Related Content

What's hot

IPTC Rights Statements For News
IPTC Rights Statements For NewsIPTC Rights Statements For News
IPTC Rights Statements For News
Stuart Myles
 
Open Access is Just the Beginning: Disrupting Publishing
Open Access is Just the Beginning: Disrupting PublishingOpen Access is Just the Beginning: Disrupting Publishing
Open Access is Just the Beginning: Disrupting Publishing
Brian Hole
 
News Innovation Lightning Talk
News Innovation Lightning TalkNews Innovation Lightning Talk
News Innovation Lightning Talk
Leigh Dodds
 
Building a scalable, sustainable service with OJS
Building a scalable, sustainable service with OJSBuilding a scalable, sustainable service with OJS
Building a scalable, sustainable service with OJS
Brian Hole
 
IPTC New Taxonomies Ideas
IPTC New Taxonomies IdeasIPTC New Taxonomies Ideas
IPTC New Taxonomies Ideas
Stuart Myles
 
The Web’s Rich Tapestry
The Web’s Rich TapestryThe Web’s Rich Tapestry
The Web’s Rich Tapestry
Leigh Dodds
 

What's hot (6)

IPTC Rights Statements For News
IPTC Rights Statements For NewsIPTC Rights Statements For News
IPTC Rights Statements For News
 
Open Access is Just the Beginning: Disrupting Publishing
Open Access is Just the Beginning: Disrupting PublishingOpen Access is Just the Beginning: Disrupting Publishing
Open Access is Just the Beginning: Disrupting Publishing
 
News Innovation Lightning Talk
News Innovation Lightning TalkNews Innovation Lightning Talk
News Innovation Lightning Talk
 
Building a scalable, sustainable service with OJS
Building a scalable, sustainable service with OJSBuilding a scalable, sustainable service with OJS
Building a scalable, sustainable service with OJS
 
IPTC New Taxonomies Ideas
IPTC New Taxonomies IdeasIPTC New Taxonomies Ideas
IPTC New Taxonomies Ideas
 
The Web’s Rich Tapestry
The Web’s Rich TapestryThe Web’s Rich Tapestry
The Web’s Rich Tapestry
 

Viewers also liked

Digital Methods Summer School 2015 Tool Medley
Digital Methods Summer School 2015 Tool MedleyDigital Methods Summer School 2015 Tool Medley
Digital Methods Summer School 2015 Tool MedleyDigital Methods Initiative
 
Rogers digitalmethods archived_website_30_nov10_optimized
Rogers digitalmethods archived_website_30_nov10_optimizedRogers digitalmethods archived_website_30_nov10_optimized
Rogers digitalmethods archived_website_30_nov10_optimizedDigital Methods Initiative
 
Rogers data days_2014_slides_opti
Rogers data days_2014_slides_optiRogers data days_2014_slides_opti
Rogers data days_2014_slides_opti
Digital Methods Initiative
 
Digital Methods Tool Medley
Digital Methods Tool MedleyDigital Methods Tool Medley
Digital Methods Tool Medley
Digital Methods Initiative
 
Using My Own Work - Felt making
Using My Own Work - Felt makingUsing My Own Work - Felt making
Using My Own Work - Felt makingnfrowotham
 
Rogers studyingpoliticalissues mar2014_optimized_ii_
Rogers studyingpoliticalissues mar2014_optimized_ii_Rogers studyingpoliticalissues mar2014_optimized_ii_
Rogers studyingpoliticalissues mar2014_optimized_ii_
Digital Methods Initiative
 
Repurposing Wikipedia: Wikipedia as data set and analytical device
Repurposing Wikipedia: Wikipedia as data set and analytical deviceRepurposing Wikipedia: Wikipedia as data set and analytical device
Repurposing Wikipedia: Wikipedia as data set and analytical device
Digital Methods Initiative
 
Digital Methods Summer School 2014 Tool Medley
Digital Methods Summer School 2014 Tool MedleyDigital Methods Summer School 2014 Tool Medley
Digital Methods Summer School 2014 Tool Medley
Digital Methods Initiative
 
Richard Rogers, Otherwise Engaged: Critical Analytics and the New Meanings of...
Richard Rogers, Otherwise Engaged: Critical Analytics and the New Meanings of...Richard Rogers, Otherwise Engaged: Critical Analytics and the New Meanings of...
Richard Rogers, Otherwise Engaged: Critical Analytics and the New Meanings of...
Digital Methods Initiative
 

Viewers also liked (9)

Digital Methods Summer School 2015 Tool Medley
Digital Methods Summer School 2015 Tool MedleyDigital Methods Summer School 2015 Tool Medley
Digital Methods Summer School 2015 Tool Medley
 
Rogers digitalmethods archived_website_30_nov10_optimized
Rogers digitalmethods archived_website_30_nov10_optimizedRogers digitalmethods archived_website_30_nov10_optimized
Rogers digitalmethods archived_website_30_nov10_optimized
 
Rogers data days_2014_slides_opti
Rogers data days_2014_slides_optiRogers data days_2014_slides_opti
Rogers data days_2014_slides_opti
 
Digital Methods Tool Medley
Digital Methods Tool MedleyDigital Methods Tool Medley
Digital Methods Tool Medley
 
Using My Own Work - Felt making
Using My Own Work - Felt makingUsing My Own Work - Felt making
Using My Own Work - Felt making
 
Rogers studyingpoliticalissues mar2014_optimized_ii_
Rogers studyingpoliticalissues mar2014_optimized_ii_Rogers studyingpoliticalissues mar2014_optimized_ii_
Rogers studyingpoliticalissues mar2014_optimized_ii_
 
Repurposing Wikipedia: Wikipedia as data set and analytical device
Repurposing Wikipedia: Wikipedia as data set and analytical deviceRepurposing Wikipedia: Wikipedia as data set and analytical device
Repurposing Wikipedia: Wikipedia as data set and analytical device
 
Digital Methods Summer School 2014 Tool Medley
Digital Methods Summer School 2014 Tool MedleyDigital Methods Summer School 2014 Tool Medley
Digital Methods Summer School 2014 Tool Medley
 
Richard Rogers, Otherwise Engaged: Critical Analytics and the New Meanings of...
Richard Rogers, Otherwise Engaged: Critical Analytics and the New Meanings of...Richard Rogers, Otherwise Engaged: Critical Analytics and the New Meanings of...
Richard Rogers, Otherwise Engaged: Critical Analytics and the New Meanings of...
 

Similar to Wiki Analytics Workshop

Aswc2009 Smw Tutorial Part 1 Intro And Examples
Aswc2009 Smw Tutorial Part 1 Intro And ExamplesAswc2009 Smw Tutorial Part 1 Intro And Examples
Aswc2009 Smw Tutorial Part 1 Intro And Examples
Jesse Wang
 
Exploring Article Networks on Wikipedia with NodeXL
Exploring Article Networks on Wikipedia with NodeXLExploring Article Networks on Wikipedia with NodeXL
Exploring Article Networks on Wikipedia with NodeXL
Shalin Hai-Jew
 
Semantic Wiki: Social Semantic Web in Use
Semantic Wiki: Social Semantic Web in UseSemantic Wiki: Social Semantic Web in Use
Semantic Wiki: Social Semantic Web in Use
Jesse Wang
 
Technologie Proche: Imagining the Archival Systems of Tomorrow With the Tools...
Technologie Proche: Imagining the Archival Systems of Tomorrow With the Tools...Technologie Proche: Imagining the Archival Systems of Tomorrow With the Tools...
Technologie Proche: Imagining the Archival Systems of Tomorrow With the Tools...
Artefactual Systems - AtoM
 
Web20 Intro Naj Shaik
Web20 Intro Naj ShaikWeb20 Intro Naj Shaik
Web20 Intro Naj Shaik
Karen Vignare
 
Interlinking Online Communities and Enriching Social Software with the Semant...
Interlinking Online Communities and Enriching Social Software with the Semant...Interlinking Online Communities and Enriching Social Software with the Semant...
Interlinking Online Communities and Enriching Social Software with the Semant...
John Breslin
 
Intranet 2.0: Using Wikis
Intranet 2.0: Using WikisIntranet 2.0: Using Wikis
Intranet 2.0: Using Wikis
Nicole C. Engard
 
Web2.0 2012 - lesson 7 - technologies and mashups
Web2.0 2012 - lesson 7 - technologies and mashups Web2.0 2012 - lesson 7 - technologies and mashups
Web2.0 2012 - lesson 7 - technologies and mashups
Carlo Vaccari
 
Semantic search on heterogeneous wiki systems - Wikimania 2010
Semantic search on heterogeneous wiki systems - Wikimania 2010Semantic search on heterogeneous wiki systems - Wikimania 2010
Semantic search on heterogeneous wiki systems - Wikimania 2010
Fabrizio Orlandi
 
The Social Semantic Web
The Social Semantic WebThe Social Semantic Web
The Social Semantic Web
John Breslin
 
Mphil Computational Biology Seminar Series Presentation (20201111)
Mphil Computational Biology Seminar Series Presentation (20201111)Mphil Computational Biology Seminar Series Presentation (20201111)
Mphil Computational Biology Seminar Series Presentation (20201111)
ShweataNHegde
 
Power to the Users (and Librarians)
Power to the Users (and Librarians)Power to the Users (and Librarians)
Power to the Users (and Librarians)
Guus van den Brekel
 
Distributed wikis
Distributed wikisDistributed wikis
Distributed wikis
Brianna Laugher
 
Jist tutorial semantic wikis and applications
Jist tutorial   semantic wikis and applicationsJist tutorial   semantic wikis and applications
Jist tutorial semantic wikis and applications
Jesse Wang
 
Enabling cross-wikis integration by extending the SIOC ontology
Enabling cross-wikis integration by extending the SIOC ontologyEnabling cross-wikis integration by extending the SIOC ontology
Enabling cross-wikis integration by extending the SIOC ontology
Fabrizio Orlandi
 
Semantic Wikis - Social Semantic Web in Action
Semantic Wikis - Social Semantic Web in ActionSemantic Wikis - Social Semantic Web in Action
Semantic Wikis - Social Semantic Web in Action
Jesse Wang
 
A Survey of the Landscape and State-of-Art in Semantic Wiki
A Survey of the Landscape and State-of-Art in Semantic WikiA Survey of the Landscape and State-of-Art in Semantic Wiki
A Survey of the Landscape and State-of-Art in Semantic Wiki
Max Völkel
 
Wiki on Library Perspective
Wiki on Library PerspectiveWiki on Library Perspective
Wiki on Library Perspective
Champeswar Mishra
 
Web 2.0: characteristics and tools (2010 eng)
Web 2.0: characteristics and tools (2010 eng)Web 2.0: characteristics and tools (2010 eng)
Web 2.0: characteristics and tools (2010 eng)
Carlo Vaccari
 

Similar to Wiki Analytics Workshop (20)

Aswc2009 Smw Tutorial Part 1 Intro And Examples
Aswc2009 Smw Tutorial Part 1 Intro And ExamplesAswc2009 Smw Tutorial Part 1 Intro And Examples
Aswc2009 Smw Tutorial Part 1 Intro And Examples
 
Exploring Article Networks on Wikipedia with NodeXL
Exploring Article Networks on Wikipedia with NodeXLExploring Article Networks on Wikipedia with NodeXL
Exploring Article Networks on Wikipedia with NodeXL
 
Semantic Wiki: Social Semantic Web in Use
Semantic Wiki: Social Semantic Web in UseSemantic Wiki: Social Semantic Web in Use
Semantic Wiki: Social Semantic Web in Use
 
Technologie Proche: Imagining the Archival Systems of Tomorrow With the Tools...
Technologie Proche: Imagining the Archival Systems of Tomorrow With the Tools...Technologie Proche: Imagining the Archival Systems of Tomorrow With the Tools...
Technologie Proche: Imagining the Archival Systems of Tomorrow With the Tools...
 
Web20 Intro Naj Shaik
Web20 Intro Naj ShaikWeb20 Intro Naj Shaik
Web20 Intro Naj Shaik
 
Interlinking Online Communities and Enriching Social Software with the Semant...
Interlinking Online Communities and Enriching Social Software with the Semant...Interlinking Online Communities and Enriching Social Software with the Semant...
Interlinking Online Communities and Enriching Social Software with the Semant...
 
Intranet 2.0: Using Wikis
Intranet 2.0: Using WikisIntranet 2.0: Using Wikis
Intranet 2.0: Using Wikis
 
Tet200 Wikis Sp10
Tet200 Wikis Sp10Tet200 Wikis Sp10
Tet200 Wikis Sp10
 
Web2.0 2012 - lesson 7 - technologies and mashups
Web2.0 2012 - lesson 7 - technologies and mashups Web2.0 2012 - lesson 7 - technologies and mashups
Web2.0 2012 - lesson 7 - technologies and mashups
 
Semantic search on heterogeneous wiki systems - Wikimania 2010
Semantic search on heterogeneous wiki systems - Wikimania 2010Semantic search on heterogeneous wiki systems - Wikimania 2010
Semantic search on heterogeneous wiki systems - Wikimania 2010
 
The Social Semantic Web
The Social Semantic WebThe Social Semantic Web
The Social Semantic Web
 
Mphil Computational Biology Seminar Series Presentation (20201111)
Mphil Computational Biology Seminar Series Presentation (20201111)Mphil Computational Biology Seminar Series Presentation (20201111)
Mphil Computational Biology Seminar Series Presentation (20201111)
 
Power to the Users (and Librarians)
Power to the Users (and Librarians)Power to the Users (and Librarians)
Power to the Users (and Librarians)
 
Distributed wikis
Distributed wikisDistributed wikis
Distributed wikis
 
Jist tutorial semantic wikis and applications
Jist tutorial   semantic wikis and applicationsJist tutorial   semantic wikis and applications
Jist tutorial semantic wikis and applications
 
Enabling cross-wikis integration by extending the SIOC ontology
Enabling cross-wikis integration by extending the SIOC ontologyEnabling cross-wikis integration by extending the SIOC ontology
Enabling cross-wikis integration by extending the SIOC ontology
 
Semantic Wikis - Social Semantic Web in Action
Semantic Wikis - Social Semantic Web in ActionSemantic Wikis - Social Semantic Web in Action
Semantic Wikis - Social Semantic Web in Action
 
A Survey of the Landscape and State-of-Art in Semantic Wiki
A Survey of the Landscape and State-of-Art in Semantic WikiA Survey of the Landscape and State-of-Art in Semantic Wiki
A Survey of the Landscape and State-of-Art in Semantic Wiki
 
Wiki on Library Perspective
Wiki on Library PerspectiveWiki on Library Perspective
Wiki on Library Perspective
 
Web 2.0: characteristics and tools (2010 eng)
Web 2.0: characteristics and tools (2010 eng)Web 2.0: characteristics and tools (2010 eng)
Web 2.0: characteristics and tools (2010 eng)
 

More from Digital Methods Initiative

Query Design for Digital Methods by Richard Rogers
Query Design for Digital Methods by Richard RogersQuery Design for Digital Methods by Richard Rogers
Query Design for Digital Methods by Richard Rogers
Digital Methods Initiative
 
Digital Methods by Richard Rogers
Digital Methods by Richard RogersDigital Methods by Richard Rogers
Digital Methods by Richard Rogers
Digital Methods Initiative
 
Rogers digitalmethodsaftersocialmedia nov2013_optimized_
Rogers digitalmethodsaftersocialmedia nov2013_optimized_Rogers digitalmethodsaftersocialmedia nov2013_optimized_
Rogers digitalmethodsaftersocialmedia nov2013_optimized_
Digital Methods Initiative
 
The Birth of Social Media Methods
The Birth of Social Media MethodsThe Birth of Social Media Methods
The Birth of Social Media Methods
Digital Methods Initiative
 
Interactive visualization and exploration of network data with Gephi
Interactive visualization and exploration of network data with GephiInteractive visualization and exploration of network data with Gephi
Interactive visualization and exploration of network data with Gephi
Digital Methods Initiative
 
National Tracking Ecologies - Digital Methods Summer School 2013
National Tracking Ecologies - Digital Methods Summer School 2013National Tracking Ecologies - Digital Methods Summer School 2013
National Tracking Ecologies - Digital Methods Summer School 2013Digital Methods Initiative
 
Cross-Platform Profiling tutorial at the Digital Methods Summer School 2013
Cross-Platform Profiling tutorial at the Digital Methods Summer School 2013Cross-Platform Profiling tutorial at the Digital Methods Summer School 2013
Cross-Platform Profiling tutorial at the Digital Methods Summer School 2013Digital Methods Initiative
 
Tracking the Trackers tutorial at the Digital Methods Summer School 2013
Tracking the Trackers tutorial at the Digital Methods Summer School 2013Tracking the Trackers tutorial at the Digital Methods Summer School 2013
Tracking the Trackers tutorial at the Digital Methods Summer School 2013Digital Methods Initiative
 
Crawling and Scraping tutorial at the Digital Methods Summer School 2013
Crawling and Scraping tutorial at the Digital Methods Summer School 2013Crawling and Scraping tutorial at the Digital Methods Summer School 2013
Crawling and Scraping tutorial at the Digital Methods Summer School 2013Digital Methods Initiative
 
Studying Facebook via Data Extraction: a Netvizz tutorial at the Digital Meth...
Studying Facebook via Data Extraction: a Netvizz tutorial at the Digital Meth...Studying Facebook via Data Extraction: a Netvizz tutorial at the Digital Meth...
Studying Facebook via Data Extraction: a Netvizz tutorial at the Digital Meth...Digital Methods Initiative
 
Digital Methods Summer School 2013 Tool Medley
Digital Methods Summer School 2013 Tool MedleyDigital Methods Summer School 2013 Tool Medley
Digital Methods Summer School 2013 Tool Medley
Digital Methods Initiative
 
Hashtag lifelines
Hashtag lifelinesHashtag lifelines
Hashtag lifelines
Digital Methods Initiative
 
Traces of the Trackers. Tracking the Trackers: A historical analysis using th...
Traces of the Trackers. Tracking the Trackers: A historical analysis using th...Traces of the Trackers. Tracking the Trackers: A historical analysis using th...
Traces of the Trackers. Tracking the Trackers: A historical analysis using th...Digital Methods Initiative
 
Post-social methods? Issues in live research, by Noortje Marres and Esther We...
Post-social methods? Issues in live research, by Noortje Marres and Esther We...Post-social methods? Issues in live research, by Noortje Marres and Esther We...
Post-social methods? Issues in live research, by Noortje Marres and Esther We...
Digital Methods Initiative
 
Web Flags Summer School 2012
Web Flags Summer School 2012Web Flags Summer School 2012
Web Flags Summer School 2012
Digital Methods Initiative
 
Dmi12 workshops - crawling and scraping
Dmi12   workshops - crawling and scrapingDmi12   workshops - crawling and scraping
Dmi12 workshops - crawling and scraping
Digital Methods Initiative
 
Digital Methods Tool Medley. Digital Methods Summer School 2012
Digital Methods Tool Medley. Digital Methods Summer School 2012Digital Methods Tool Medley. Digital Methods Summer School 2012
Digital Methods Tool Medley. Digital Methods Summer School 2012Digital Methods Initiative
 
Digital Methods Winterschool 2012: API - Interfaces to the Cloud
Digital Methods Winterschool 2012: API - Interfaces to the CloudDigital Methods Winterschool 2012: API - Interfaces to the Cloud
Digital Methods Winterschool 2012: API - Interfaces to the CloudDigital Methods Initiative
 

More from Digital Methods Initiative (20)

Query Design for Digital Methods by Richard Rogers
Query Design for Digital Methods by Richard RogersQuery Design for Digital Methods by Richard Rogers
Query Design for Digital Methods by Richard Rogers
 
Digital Methods by Richard Rogers
Digital Methods by Richard RogersDigital Methods by Richard Rogers
Digital Methods by Richard Rogers
 
Rogers digitalmethodsaftersocialmedia nov2013_optimized_
Rogers digitalmethodsaftersocialmedia nov2013_optimized_Rogers digitalmethodsaftersocialmedia nov2013_optimized_
Rogers digitalmethodsaftersocialmedia nov2013_optimized_
 
The Birth of Social Media Methods
The Birth of Social Media MethodsThe Birth of Social Media Methods
The Birth of Social Media Methods
 
Interactive visualization and exploration of network data with Gephi
Interactive visualization and exploration of network data with GephiInteractive visualization and exploration of network data with Gephi
Interactive visualization and exploration of network data with Gephi
 
National Tracking Ecologies - Digital Methods Summer School 2013
National Tracking Ecologies - Digital Methods Summer School 2013National Tracking Ecologies - Digital Methods Summer School 2013
National Tracking Ecologies - Digital Methods Summer School 2013
 
Cross-Platform Profiling tutorial at the Digital Methods Summer School 2013
Cross-Platform Profiling tutorial at the Digital Methods Summer School 2013Cross-Platform Profiling tutorial at the Digital Methods Summer School 2013
Cross-Platform Profiling tutorial at the Digital Methods Summer School 2013
 
Tracking the Trackers tutorial at the Digital Methods Summer School 2013
Tracking the Trackers tutorial at the Digital Methods Summer School 2013Tracking the Trackers tutorial at the Digital Methods Summer School 2013
Tracking the Trackers tutorial at the Digital Methods Summer School 2013
 
Crawling and Scraping tutorial at the Digital Methods Summer School 2013
Crawling and Scraping tutorial at the Digital Methods Summer School 2013Crawling and Scraping tutorial at the Digital Methods Summer School 2013
Crawling and Scraping tutorial at the Digital Methods Summer School 2013
 
Studying Facebook via Data Extraction: a Netvizz tutorial at the Digital Meth...
Studying Facebook via Data Extraction: a Netvizz tutorial at the Digital Meth...Studying Facebook via Data Extraction: a Netvizz tutorial at the Digital Meth...
Studying Facebook via Data Extraction: a Netvizz tutorial at the Digital Meth...
 
Digital Methods Summer School 2013 Tool Medley
Digital Methods Summer School 2013 Tool MedleyDigital Methods Summer School 2013 Tool Medley
Digital Methods Summer School 2013 Tool Medley
 
Hashtag lifelines
Hashtag lifelinesHashtag lifelines
Hashtag lifelines
 
Traces of the Trackers. Tracking the Trackers: A historical analysis using th...
Traces of the Trackers. Tracking the Trackers: A historical analysis using th...Traces of the Trackers. Tracking the Trackers: A historical analysis using th...
Traces of the Trackers. Tracking the Trackers: A historical analysis using th...
 
Post-social methods? Issues in live research, by Noortje Marres and Esther We...
Post-social methods? Issues in live research, by Noortje Marres and Esther We...Post-social methods? Issues in live research, by Noortje Marres and Esther We...
Post-social methods? Issues in live research, by Noortje Marres and Esther We...
 
Web Flags Summer School 2012
Web Flags Summer School 2012Web Flags Summer School 2012
Web Flags Summer School 2012
 
Dmi12 workshops - crawling and scraping
Dmi12   workshops - crawling and scrapingDmi12   workshops - crawling and scraping
Dmi12 workshops - crawling and scraping
 
Digital Methods Tool Medley. Digital Methods Summer School 2012
Digital Methods Tool Medley. Digital Methods Summer School 2012Digital Methods Tool Medley. Digital Methods Summer School 2012
Digital Methods Tool Medley. Digital Methods Summer School 2012
 
Digital Methods Winterschool 2012: API - Interfaces to the Cloud
Digital Methods Winterschool 2012: API - Interfaces to the CloudDigital Methods Winterschool 2012: API - Interfaces to the Cloud
Digital Methods Winterschool 2012: API - Interfaces to the Cloud
 
DMI Workshop: When Search Becomes Research
DMI Workshop: When Search Becomes ResearchDMI Workshop: When Search Becomes Research
DMI Workshop: When Search Becomes Research
 
DMI Workshop: Crawling and Scraping
DMI Workshop: Crawling and Scraping DMI Workshop: Crawling and Scraping
DMI Workshop: Crawling and Scraping
 

Recently uploaded

Best Digital Marketing Institute In NOIDA
Best Digital Marketing Institute In NOIDABest Digital Marketing Institute In NOIDA
Best Digital Marketing Institute In NOIDA
deeptiverma2406
 
Welcome to TechSoup New Member Orientation and Q&A (May 2024).pdf
Welcome to TechSoup   New Member Orientation and Q&A (May 2024).pdfWelcome to TechSoup   New Member Orientation and Q&A (May 2024).pdf
Welcome to TechSoup New Member Orientation and Q&A (May 2024).pdf
TechSoup
 
Marketing internship report file for MBA
Marketing internship report file for MBAMarketing internship report file for MBA
Marketing internship report file for MBA
gb193092
 
special B.ed 2nd year old paper_20240531.pdf
special B.ed 2nd year old paper_20240531.pdfspecial B.ed 2nd year old paper_20240531.pdf
special B.ed 2nd year old paper_20240531.pdf
Special education needs
 
2024.06.01 Introducing a competency framework for languag learning materials ...
2024.06.01 Introducing a competency framework for languag learning materials ...2024.06.01 Introducing a competency framework for languag learning materials ...
2024.06.01 Introducing a competency framework for languag learning materials ...
Sandy Millin
 
"Protectable subject matters, Protection in biotechnology, Protection of othe...
"Protectable subject matters, Protection in biotechnology, Protection of othe..."Protectable subject matters, Protection in biotechnology, Protection of othe...
"Protectable subject matters, Protection in biotechnology, Protection of othe...
SACHIN R KONDAGURI
 
Introduction to AI for Nonprofits with Tapp Network
Introduction to AI for Nonprofits with Tapp NetworkIntroduction to AI for Nonprofits with Tapp Network
Introduction to AI for Nonprofits with Tapp Network
TechSoup
 
The Accursed House by Émile Gaboriau.pptx
The Accursed House by Émile Gaboriau.pptxThe Accursed House by Émile Gaboriau.pptx
The Accursed House by Émile Gaboriau.pptx
DhatriParmar
 
Lapbook sobre os Regimes Totalitários.pdf
Lapbook sobre os Regimes Totalitários.pdfLapbook sobre os Regimes Totalitários.pdf
Lapbook sobre os Regimes Totalitários.pdf
Jean Carlos Nunes Paixão
 
How to Make a Field invisible in Odoo 17
How to Make a Field invisible in Odoo 17How to Make a Field invisible in Odoo 17
How to Make a Field invisible in Odoo 17
Celine George
 
Chapter 3 - Islamic Banking Products and Services.pptx
Chapter 3 - Islamic Banking Products and Services.pptxChapter 3 - Islamic Banking Products and Services.pptx
Chapter 3 - Islamic Banking Products and Services.pptx
Mohd Adib Abd Muin, Senior Lecturer at Universiti Utara Malaysia
 
Honest Reviews of Tim Han LMA Course Program.pptx
Honest Reviews of Tim Han LMA Course Program.pptxHonest Reviews of Tim Han LMA Course Program.pptx
Honest Reviews of Tim Han LMA Course Program.pptx
timhan337
 
Unit 8 - Information and Communication Technology (Paper I).pdf
Unit 8 - Information and Communication Technology (Paper I).pdfUnit 8 - Information and Communication Technology (Paper I).pdf
Unit 8 - Information and Communication Technology (Paper I).pdf
Thiyagu K
 
aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa
aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa
aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa
siemaillard
 
Azure Interview Questions and Answers PDF By ScholarHat
Azure Interview Questions and Answers PDF By ScholarHatAzure Interview Questions and Answers PDF By ScholarHat
Azure Interview Questions and Answers PDF By ScholarHat
Scholarhat
 
Multithreading_in_C++ - std::thread, race condition
Multithreading_in_C++ - std::thread, race conditionMultithreading_in_C++ - std::thread, race condition
Multithreading_in_C++ - std::thread, race condition
Mohammed Sikander
 
1.4 modern child centered education - mahatma gandhi-2.pptx
1.4 modern child centered education - mahatma gandhi-2.pptx1.4 modern child centered education - mahatma gandhi-2.pptx
1.4 modern child centered education - mahatma gandhi-2.pptx
JosvitaDsouza2
 
Digital Tools and AI for Teaching Learning and Research
Digital Tools and AI for Teaching Learning and ResearchDigital Tools and AI for Teaching Learning and Research
Digital Tools and AI for Teaching Learning and Research
Vikramjit Singh
 
TESDA TM1 REVIEWER FOR NATIONAL ASSESSMENT WRITTEN AND ORAL QUESTIONS WITH A...
TESDA TM1 REVIEWER  FOR NATIONAL ASSESSMENT WRITTEN AND ORAL QUESTIONS WITH A...TESDA TM1 REVIEWER  FOR NATIONAL ASSESSMENT WRITTEN AND ORAL QUESTIONS WITH A...
TESDA TM1 REVIEWER FOR NATIONAL ASSESSMENT WRITTEN AND ORAL QUESTIONS WITH A...
EugeneSaldivar
 
Thesis Statement for students diagnonsed withADHD.ppt
Thesis Statement for students diagnonsed withADHD.pptThesis Statement for students diagnonsed withADHD.ppt
Thesis Statement for students diagnonsed withADHD.ppt
EverAndrsGuerraGuerr
 

Recently uploaded (20)

Best Digital Marketing Institute In NOIDA
Best Digital Marketing Institute In NOIDABest Digital Marketing Institute In NOIDA
Best Digital Marketing Institute In NOIDA
 
Welcome to TechSoup New Member Orientation and Q&A (May 2024).pdf
Welcome to TechSoup   New Member Orientation and Q&A (May 2024).pdfWelcome to TechSoup   New Member Orientation and Q&A (May 2024).pdf
Welcome to TechSoup New Member Orientation and Q&A (May 2024).pdf
 
Marketing internship report file for MBA
Marketing internship report file for MBAMarketing internship report file for MBA
Marketing internship report file for MBA
 
special B.ed 2nd year old paper_20240531.pdf
special B.ed 2nd year old paper_20240531.pdfspecial B.ed 2nd year old paper_20240531.pdf
special B.ed 2nd year old paper_20240531.pdf
 
2024.06.01 Introducing a competency framework for languag learning materials ...
2024.06.01 Introducing a competency framework for languag learning materials ...2024.06.01 Introducing a competency framework for languag learning materials ...
2024.06.01 Introducing a competency framework for languag learning materials ...
 
"Protectable subject matters, Protection in biotechnology, Protection of othe...
"Protectable subject matters, Protection in biotechnology, Protection of othe..."Protectable subject matters, Protection in biotechnology, Protection of othe...
"Protectable subject matters, Protection in biotechnology, Protection of othe...
 
Introduction to AI for Nonprofits with Tapp Network
Introduction to AI for Nonprofits with Tapp NetworkIntroduction to AI for Nonprofits with Tapp Network
Introduction to AI for Nonprofits with Tapp Network
 
The Accursed House by Émile Gaboriau.pptx
The Accursed House by Émile Gaboriau.pptxThe Accursed House by Émile Gaboriau.pptx
The Accursed House by Émile Gaboriau.pptx
 
Lapbook sobre os Regimes Totalitários.pdf
Lapbook sobre os Regimes Totalitários.pdfLapbook sobre os Regimes Totalitários.pdf
Lapbook sobre os Regimes Totalitários.pdf
 
How to Make a Field invisible in Odoo 17
How to Make a Field invisible in Odoo 17How to Make a Field invisible in Odoo 17
How to Make a Field invisible in Odoo 17
 
Chapter 3 - Islamic Banking Products and Services.pptx
Chapter 3 - Islamic Banking Products and Services.pptxChapter 3 - Islamic Banking Products and Services.pptx
Chapter 3 - Islamic Banking Products and Services.pptx
 
Honest Reviews of Tim Han LMA Course Program.pptx
Honest Reviews of Tim Han LMA Course Program.pptxHonest Reviews of Tim Han LMA Course Program.pptx
Honest Reviews of Tim Han LMA Course Program.pptx
 
Unit 8 - Information and Communication Technology (Paper I).pdf
Unit 8 - Information and Communication Technology (Paper I).pdfUnit 8 - Information and Communication Technology (Paper I).pdf
Unit 8 - Information and Communication Technology (Paper I).pdf
 
aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa
aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa
aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa
 
Azure Interview Questions and Answers PDF By ScholarHat
Azure Interview Questions and Answers PDF By ScholarHatAzure Interview Questions and Answers PDF By ScholarHat
Azure Interview Questions and Answers PDF By ScholarHat
 
Multithreading_in_C++ - std::thread, race condition
Multithreading_in_C++ - std::thread, race conditionMultithreading_in_C++ - std::thread, race condition
Multithreading_in_C++ - std::thread, race condition
 
1.4 modern child centered education - mahatma gandhi-2.pptx
1.4 modern child centered education - mahatma gandhi-2.pptx1.4 modern child centered education - mahatma gandhi-2.pptx
1.4 modern child centered education - mahatma gandhi-2.pptx
 
Digital Tools and AI for Teaching Learning and Research
Digital Tools and AI for Teaching Learning and ResearchDigital Tools and AI for Teaching Learning and Research
Digital Tools and AI for Teaching Learning and Research
 
TESDA TM1 REVIEWER FOR NATIONAL ASSESSMENT WRITTEN AND ORAL QUESTIONS WITH A...
TESDA TM1 REVIEWER  FOR NATIONAL ASSESSMENT WRITTEN AND ORAL QUESTIONS WITH A...TESDA TM1 REVIEWER  FOR NATIONAL ASSESSMENT WRITTEN AND ORAL QUESTIONS WITH A...
TESDA TM1 REVIEWER FOR NATIONAL ASSESSMENT WRITTEN AND ORAL QUESTIONS WITH A...
 
Thesis Statement for students diagnonsed withADHD.ppt
Thesis Statement for students diagnonsed withADHD.pptThesis Statement for students diagnonsed withADHD.ppt
Thesis Statement for students diagnonsed withADHD.ppt
 

Wiki Analytics Workshop

  • 1. DMI’S WIKIPEDIA TOOLS Erik Borra Digital Methods Initiative University of Amsterdam 25 March 2009
  • 2. Digital Methods Initiative How can the internet be made to show what is happening in society? How to collect and analyze data and distill trends from the Web? Follow the medium as opposed to importing standard methods from social sciences.
  • 3. tools @ dmi wiki http://wiki.digitalmethods.net/Dmi/ToolDatabase? cat=DeviceCentric&subcat=Wikipedia
  • 4. wikipedia bot edits S. Niederer and J. Van Dijck (2010). “The case of Wikipedia: Wisdom of the crowd or technicity of content?” New Media and Society Short version @ http://wiki.digitalmethods.net/Dmi/ NetworkedContent
  • 5. wikipedia bot edits scraper How? • Enter the link to an article • Scraper retrieves all edit logs for an article • Filters out all mentions of ‘bot’ and ‘using’ • Returns permalink, date, time, user, permalink, comment Why? to find out dependency of article upkeep by bots
  • 6. two examples http://wiki.digitalmethods.net/Dmi/DebottingWikipedia Dependence of climate change articles on bots Anti-vandalism bot activity within a disputed article
  • 7. wikipedia edits scraper and ip localizer How? • Enter the link to an article • Scraper retrieves all edit logs for an article • When an IP is encountered instead of a username, MaxMinds IP-to-GEO database will be queried for geo information • Returns permalink, date, time, user (or IP), permalink, comment, (city, country, lat, lon) Why? Edit-history analysis, scandal research, places of edits.
  • 8. ip to geo cases Scandal research WikiScanner (http://wikiscanner.virgil.gr) Places of edits http://mastersofmedia.hum.uva.nl/2007/10/07/ repurposing-the-wikiscanner-comparing-dutch-universities- edits-on-wikipedia/
  • 9. wikipedia network analysis How? • Enter the link to an article • Scraper retrieves all bidirectional links to the article, from within Wikipedia • Scraper parses those articles and retrieves all their links • (reiterate previous step until certain depth) • List links in table (link from -> to) • Visualize Why? Article network ecology.
  • 11. wip: controversy generator Wikipedia can be seen as a controversy-defusing device as it strives to NPOV but well-balanced articles. What if one disentangles the consensus and lays bare controversies? How would one do that?
  • 12. wip: controversy generator, possible ways forward • analyze traces in the system • edit-histories • protected pages • amount of followers • forkings / splits • article length • bot edits • templates (detecting controversy types) • ...