SlideShare a Scribd company logo
Wikipedia for Researchers




          Andrew Gray – Wikipedian in Residence

              andrew.gray@bl.uk / @generalising
About Wikipedia & Wikimedia



   Wikimedia
      Movement and charitable body
      80,000 contributors in 280 languages and
        eleven core projects
      Image repository, dictionary, news site…
      …read by 7% of the world!



   Wikipedia
      19,000,000 articles, 4,000,000 in English
      6,500 articles and 235,000 edits per day

         (…and ten years ago, this was all fields…)



                                                      2
…so what is Wikipedia?



   …an encyclopedia

   …written neutrally and verifiably

   …using previously published information

   …free to use, distribute, or reuse

   …a collaborative community

   …with no firm rules




                                              3
Internal processes



   All edits are visible through watchlists and page histories
      About 7% are vandalism or malicious; processes to detect
         these
      Median time to correction < 2 minutes… but some stay much
         longer

   Individual discussion pages for all articles – “talk”

   Quality review and assessment process

   Specialised “wikiproject” working groups and central noticeboards
      eg/ content topics; style; dispute resolution; copyright; etc.




                                                            4
Quality of Wikipedia



   On average… it’s not bad
      In 2005 four errors per article, versus three in Britannica
      In 2011, in English, Spanish & Arabic:
            “…the Wikipedia articles in this sample scored higher overall than the
            comparison articles with respect to accuracy, references, style/
            readability and overall judgment…”

   Millions of articles – so many are, individually, problematic
      Various ways of identifying “signs” of quality
      Markers for quality are both obvious and subtle



   Very effective “springboard” tool



                                                                  5
Looking for quality



   Corner icons
        - article locked down in some way
           - featured or “good” quality

   Problem tags



   Article talk pages and histories



   Style
      Badly written or formatted articles = often neglected


                                                       6
Accessing other content



   Structured categories and navigational templates




   “What links here”




                                                       7
Moving on to other content



   Other languages – not translations, and may have more content

   Mousing over footnote markers

   Within the references:
      Links through DOIs and other identifiers
      ISBNs go to a special landing page
           …and then out to libraries, booksellers, etc
      ISSNs go to WorldCat
      If an author, look for authority control links:




                                                           8
Preferences



   Available to logged in users

   Two particularly useful options:
      New window for external links (Gadgets > Browsing)


        Quality assessment in headers (Gadgets > Appearance)




        Many others - mostly editor-oriented tools




                                                      9
Looking for sets of material



   Some tools available – http://www.toolserver.org
      Complex to use, but rewarding




   CatScan: look for intersection of categories
      “all physicists born in 1912” – 51 in English, 34 in German




   Full dumps of all data available – http://dumps.wikipedia.org




                                                        10
Research about Wikipedia



   Thriving research around Wikipedia community & content
      by mid-2011, 2100 peer-reviewed articles and 38 PhD theses
      Active research committee and WMF support

   Regular report - http://meta.wikimedia.org/wiki/Research:Newsletter
      also @wikiresearch



   Major themes include:
      Community and content creation
      Reading and researching by users
      Quality of content
      Technical research



                                                           11
Research on communities



   Research on the Wikipedia communities:


        Dynamics of community conflict, discussions, collaboration,
         voting, contribution, mentoring…
        Demographics, motivation and specialisms of contributors
        Patterns of growth and content creation/deletion
        Effect of central programs on volunteer activity
        Cross-cultural interaction




                                                       12
Research on users



   Research on usage of Wikipedia:


        Specific searching behaviour
        Patterns of usage (yearly, daily)
        Tracking external events (eg swine flu) through Wikipedia
        Search engine rankings
        Change in usage by students
        Effect of Wikipedia publication on wider literature




                                                       13
Research on content



   Research on the content of Wikipedia:


        Evolution of content
        Accuracy, coverage and quality
        Biases – geographic, cultural, gender
        Linguistic analysis
        Visualisations of content
        Effect of external publications on Wikipedia




                                                        14
Research on technical aspects



   Research on the technical side of Wikipedia:


      Extensive work on scaling open-content services
      Tools for detecting and handling vandalism
      Algorithmic detection and identification of bias, spam
      Practical research on uses of wikis




                                                       15
Research example – visualising art history




                                  http://commons.wikimedia.org/wiki/File:Wikiarthistory.png
                                                                16
Research example – visualising editing patterns




                                                                             17
                      http://commons.wikimedia.org/wiki/File:WikiTrip_egyptian_revolution_screenshot.png
Research example – editor activity




                        http://commons.wikimedia.org/wiki/File:Effect_of_barnstars_on_productivity.png
                                                                            18

More Related Content

Viewers also liked

Trusting wikipedia
Trusting wikipediaTrusting wikipedia
Trusting wikipedia
Su-Laine Yeo Brodsky
 
Lecture 25: Wikipedia and Reliability
Lecture 25: Wikipedia and ReliabilityLecture 25: Wikipedia and Reliability
Lecture 25: Wikipedia and Reliability
dul_e
 
Wikipedia and Medicine
Wikipedia and MedicineWikipedia and Medicine
Wikipedia and Medicine
Jake Orlowitz
 
The Wikipedia Model
The Wikipedia ModelThe Wikipedia Model
The Wikipedia Model
Frieda Brioschi
 
Wikipedia basics
Wikipedia basicsWikipedia basics
Wikipedia basics
pwcom.co.uk Ltd
 
FirstWorkshopOnWikipediaResearch
FirstWorkshopOnWikipediaResearchFirstWorkshopOnWikipediaResearch
FirstWorkshopOnWikipediaResearch
webuploader
 

Viewers also liked (6)

Trusting wikipedia
Trusting wikipediaTrusting wikipedia
Trusting wikipedia
 
Lecture 25: Wikipedia and Reliability
Lecture 25: Wikipedia and ReliabilityLecture 25: Wikipedia and Reliability
Lecture 25: Wikipedia and Reliability
 
Wikipedia and Medicine
Wikipedia and MedicineWikipedia and Medicine
Wikipedia and Medicine
 
The Wikipedia Model
The Wikipedia ModelThe Wikipedia Model
The Wikipedia Model
 
Wikipedia basics
Wikipedia basicsWikipedia basics
Wikipedia basics
 
FirstWorkshopOnWikipediaResearch
FirstWorkshopOnWikipediaResearchFirstWorkshopOnWikipediaResearch
FirstWorkshopOnWikipediaResearch
 

Similar to Wikipedia for Researchers

Using wikis in library liaison work: overview & trends
Using wikis in library liaison work: overview & trendsUsing wikis in library liaison work: overview & trends
Using wikis in library liaison work: overview & trends
Molly Knapp
 
Wiki case study - Review year 1
Wiki case study  - Review year 1Wiki case study  - Review year 1
Wiki case study - Review year 1
RENDER project
 
Render Review: Wikipedia Case Study, Year 1
Render Review: Wikipedia Case Study, Year 1Render Review: Wikipedia Case Study, Year 1
Render Review: Wikipedia Case Study, Year 1
RENDER project
 
Mediawiki and Wiki As a Medium
Mediawiki and Wiki As a MediumMediawiki and Wiki As a Medium
Mediawiki and Wiki As a Medium
Randy Thornton
 
Wrangling Wikipedia
Wrangling WikipediaWrangling Wikipedia
Wrangling Wikipedia
moniquekclark
 
Wiki Webinar
Wiki WebinarWiki Webinar
Wiki Webinar
pinctripod
 
Wikimedia Presentation for Schools
Wikimedia Presentation for SchoolsWikimedia Presentation for Schools
Wikimedia Presentation for Schools
Craig Franklin
 
From Frenemies to Friends: Embracing Wikipedia
From Frenemies to Friends: Embracing WikipediaFrom Frenemies to Friends: Embracing Wikipedia
From Frenemies to Friends: Embracing Wikipedia
Rebekah Cummings
 
Chapter6 McHaney
Chapter6 McHaneyChapter6 McHaney
Chapter6 McHaney
Roger McHaney
 
ALIA Wikipedia and libraries
ALIA Wikipedia and librariesALIA Wikipedia and libraries
ALIA Wikipedia and libraries
Pru Mitchell
 
Wiserpku Lecture@Life Science School Pku
Wiserpku Lecture@Life Science School PkuWiserpku Lecture@Life Science School Pku
Wiserpku Lecture@Life Science School Pku
wiser pku
 
Wiser Pku Lecture@Life Science School Pku
Wiser Pku Lecture@Life Science School PkuWiser Pku Lecture@Life Science School Pku
Wiser Pku Lecture@Life Science School Pku
guest8ed46d
 
Student to Author: Using Wikipedia to Improve Undergraduate Research & Writing
Student to Author: Using Wikipedia to Improve Undergraduate Research & WritingStudent to Author: Using Wikipedia to Improve Undergraduate Research & Writing
Student to Author: Using Wikipedia to Improve Undergraduate Research & Writing
Margot
 
E Write Intro To Web 2
E Write   Intro To Web 2E Write   Intro To Web 2
E Write Intro To Web 2
LeslieOflahavan
 
Wikipedia for GLAMS_by_jentzsch_&_ockerbloom
Wikipedia for GLAMS_by_jentzsch_&_ockerbloomWikipedia for GLAMS_by_jentzsch_&_ockerbloom
Wikipedia for GLAMS_by_jentzsch_&_ockerbloom
Tracy Jentzsch
 
The public library and wikipedia
The public library and wikipediaThe public library and wikipedia
The public library and wikipedia
dorohoward
 
Wikipedia & Cultural Heritage Institutions: Opportunities for Partnership
Wikipedia & Cultural Heritage Institutions: Opportunities for PartnershipWikipedia & Cultural Heritage Institutions: Opportunities for Partnership
Wikipedia & Cultural Heritage Institutions: Opportunities for Partnership
dorohoward
 
Chapter6 McHaney 2nd edition
Chapter6 McHaney 2nd editionChapter6 McHaney 2nd edition
Chapter6 McHaney 2nd edition
Roger McHaney
 
Wikipedia and Libraries: Increasing your Library’s Visibilityi
Wikipedia and Libraries: Increasing your Library’s VisibilityiWikipedia and Libraries: Increasing your Library’s Visibilityi
Wikipedia and Libraries: Increasing your Library’s Visibilityi
Jake Orlowitz
 
Contributing to the global commons: Repositories and Wikimedia
Contributing to the global commons: Repositories and WikimediaContributing to the global commons: Repositories and Wikimedia
Contributing to the global commons: Repositories and Wikimedia
Nick Sheppard
 

Similar to Wikipedia for Researchers (20)

Using wikis in library liaison work: overview & trends
Using wikis in library liaison work: overview & trendsUsing wikis in library liaison work: overview & trends
Using wikis in library liaison work: overview & trends
 
Wiki case study - Review year 1
Wiki case study  - Review year 1Wiki case study  - Review year 1
Wiki case study - Review year 1
 
Render Review: Wikipedia Case Study, Year 1
Render Review: Wikipedia Case Study, Year 1Render Review: Wikipedia Case Study, Year 1
Render Review: Wikipedia Case Study, Year 1
 
Mediawiki and Wiki As a Medium
Mediawiki and Wiki As a MediumMediawiki and Wiki As a Medium
Mediawiki and Wiki As a Medium
 
Wrangling Wikipedia
Wrangling WikipediaWrangling Wikipedia
Wrangling Wikipedia
 
Wiki Webinar
Wiki WebinarWiki Webinar
Wiki Webinar
 
Wikimedia Presentation for Schools
Wikimedia Presentation for SchoolsWikimedia Presentation for Schools
Wikimedia Presentation for Schools
 
From Frenemies to Friends: Embracing Wikipedia
From Frenemies to Friends: Embracing WikipediaFrom Frenemies to Friends: Embracing Wikipedia
From Frenemies to Friends: Embracing Wikipedia
 
Chapter6 McHaney
Chapter6 McHaneyChapter6 McHaney
Chapter6 McHaney
 
ALIA Wikipedia and libraries
ALIA Wikipedia and librariesALIA Wikipedia and libraries
ALIA Wikipedia and libraries
 
Wiserpku Lecture@Life Science School Pku
Wiserpku Lecture@Life Science School PkuWiserpku Lecture@Life Science School Pku
Wiserpku Lecture@Life Science School Pku
 
Wiser Pku Lecture@Life Science School Pku
Wiser Pku Lecture@Life Science School PkuWiser Pku Lecture@Life Science School Pku
Wiser Pku Lecture@Life Science School Pku
 
Student to Author: Using Wikipedia to Improve Undergraduate Research & Writing
Student to Author: Using Wikipedia to Improve Undergraduate Research & WritingStudent to Author: Using Wikipedia to Improve Undergraduate Research & Writing
Student to Author: Using Wikipedia to Improve Undergraduate Research & Writing
 
E Write Intro To Web 2
E Write   Intro To Web 2E Write   Intro To Web 2
E Write Intro To Web 2
 
Wikipedia for GLAMS_by_jentzsch_&_ockerbloom
Wikipedia for GLAMS_by_jentzsch_&_ockerbloomWikipedia for GLAMS_by_jentzsch_&_ockerbloom
Wikipedia for GLAMS_by_jentzsch_&_ockerbloom
 
The public library and wikipedia
The public library and wikipediaThe public library and wikipedia
The public library and wikipedia
 
Wikipedia & Cultural Heritage Institutions: Opportunities for Partnership
Wikipedia & Cultural Heritage Institutions: Opportunities for PartnershipWikipedia & Cultural Heritage Institutions: Opportunities for Partnership
Wikipedia & Cultural Heritage Institutions: Opportunities for Partnership
 
Chapter6 McHaney 2nd edition
Chapter6 McHaney 2nd editionChapter6 McHaney 2nd edition
Chapter6 McHaney 2nd edition
 
Wikipedia and Libraries: Increasing your Library’s Visibilityi
Wikipedia and Libraries: Increasing your Library’s VisibilityiWikipedia and Libraries: Increasing your Library’s Visibilityi
Wikipedia and Libraries: Increasing your Library’s Visibilityi
 
Contributing to the global commons: Repositories and Wikimedia
Contributing to the global commons: Repositories and WikimediaContributing to the global commons: Repositories and Wikimedia
Contributing to the global commons: Repositories and Wikimedia
 

More from Andrew Gray

Wikipedia and information literacy - LILAC 2014
Wikipedia and information literacy - LILAC 2014Wikipedia and information literacy - LILAC 2014
Wikipedia and information literacy - LILAC 2014
Andrew Gray
 
Wikipedia in the Library - The European Library, Amsterdam 2013
Wikipedia in the Library - The European Library, Amsterdam 2013Wikipedia in the Library - The European Library, Amsterdam 2013
Wikipedia in the Library - The European Library, Amsterdam 2013
Andrew Gray
 
Community communications slides
Community communications slidesCommunity communications slides
Community communications slides
Andrew Gray
 
Wikipedia in the Library Wikimania Hong Kong
Wikipedia in the Library   Wikimania Hong KongWikipedia in the Library   Wikimania Hong Kong
Wikipedia in the Library Wikimania Hong Kong
Andrew Gray
 
Introduction to Wikidata
Introduction to WikidataIntroduction to Wikidata
Introduction to Wikidata
Andrew Gray
 
Social Media at the British Library - Royal Manuscripts
Social Media at the British Library - Royal ManuscriptsSocial Media at the British Library - Royal Manuscripts
Social Media at the British Library - Royal Manuscripts
Andrew Gray
 
AHRC Wikipedian in Residence Report
AHRC Wikipedian in Residence ReportAHRC Wikipedian in Residence Report
AHRC Wikipedian in Residence Report
Andrew Gray
 
Wikipedia Workshop presentation
Wikipedia Workshop presentationWikipedia Workshop presentation
Wikipedia Workshop presentation
Andrew Gray
 

More from Andrew Gray (8)

Wikipedia and information literacy - LILAC 2014
Wikipedia and information literacy - LILAC 2014Wikipedia and information literacy - LILAC 2014
Wikipedia and information literacy - LILAC 2014
 
Wikipedia in the Library - The European Library, Amsterdam 2013
Wikipedia in the Library - The European Library, Amsterdam 2013Wikipedia in the Library - The European Library, Amsterdam 2013
Wikipedia in the Library - The European Library, Amsterdam 2013
 
Community communications slides
Community communications slidesCommunity communications slides
Community communications slides
 
Wikipedia in the Library Wikimania Hong Kong
Wikipedia in the Library   Wikimania Hong KongWikipedia in the Library   Wikimania Hong Kong
Wikipedia in the Library Wikimania Hong Kong
 
Introduction to Wikidata
Introduction to WikidataIntroduction to Wikidata
Introduction to Wikidata
 
Social Media at the British Library - Royal Manuscripts
Social Media at the British Library - Royal ManuscriptsSocial Media at the British Library - Royal Manuscripts
Social Media at the British Library - Royal Manuscripts
 
AHRC Wikipedian in Residence Report
AHRC Wikipedian in Residence ReportAHRC Wikipedian in Residence Report
AHRC Wikipedian in Residence Report
 
Wikipedia Workshop presentation
Wikipedia Workshop presentationWikipedia Workshop presentation
Wikipedia Workshop presentation
 

Recently uploaded

Programming Foundation Models with DSPy - Meetup Slides
Programming Foundation Models with DSPy - Meetup SlidesProgramming Foundation Models with DSPy - Meetup Slides
Programming Foundation Models with DSPy - Meetup Slides
Zilliz
 
Generating privacy-protected synthetic data using Secludy and Milvus
Generating privacy-protected synthetic data using Secludy and MilvusGenerating privacy-protected synthetic data using Secludy and Milvus
Generating privacy-protected synthetic data using Secludy and Milvus
Zilliz
 
みなさんこんにちはこれ何文字まで入るの?40文字以下不可とか本当に意味わからないけどこれ限界文字数書いてないからマジでやばい文字数いけるんじゃないの?えこ...
みなさんこんにちはこれ何文字まで入るの?40文字以下不可とか本当に意味わからないけどこれ限界文字数書いてないからマジでやばい文字数いけるんじゃないの?えこ...みなさんこんにちはこれ何文字まで入るの?40文字以下不可とか本当に意味わからないけどこれ限界文字数書いてないからマジでやばい文字数いけるんじゃないの?えこ...
みなさんこんにちはこれ何文字まで入るの?40文字以下不可とか本当に意味わからないけどこれ限界文字数書いてないからマジでやばい文字数いけるんじゃないの?えこ...
名前 です男
 
20240609 QFM020 Irresponsible AI Reading List May 2024
20240609 QFM020 Irresponsible AI Reading List May 202420240609 QFM020 Irresponsible AI Reading List May 2024
20240609 QFM020 Irresponsible AI Reading List May 2024
Matthew Sinclair
 
Best 20 SEO Techniques To Improve Website Visibility In SERP
Best 20 SEO Techniques To Improve Website Visibility In SERPBest 20 SEO Techniques To Improve Website Visibility In SERP
Best 20 SEO Techniques To Improve Website Visibility In SERP
Pixlogix Infotech
 
OpenID AuthZEN Interop Read Out - Authorization
OpenID AuthZEN Interop Read Out - AuthorizationOpenID AuthZEN Interop Read Out - Authorization
OpenID AuthZEN Interop Read Out - Authorization
David Brossard
 
Nordic Marketo Engage User Group_June 13_ 2024.pptx
Nordic Marketo Engage User Group_June 13_ 2024.pptxNordic Marketo Engage User Group_June 13_ 2024.pptx
Nordic Marketo Engage User Group_June 13_ 2024.pptx
MichaelKnudsen27
 
Taking AI to the Next Level in Manufacturing.pdf
Taking AI to the Next Level in Manufacturing.pdfTaking AI to the Next Level in Manufacturing.pdf
Taking AI to the Next Level in Manufacturing.pdf
ssuserfac0301
 
Monitoring and Managing Anomaly Detection on OpenShift.pdf
Monitoring and Managing Anomaly Detection on OpenShift.pdfMonitoring and Managing Anomaly Detection on OpenShift.pdf
Monitoring and Managing Anomaly Detection on OpenShift.pdf
Tosin Akinosho
 
Building Production Ready Search Pipelines with Spark and Milvus
Building Production Ready Search Pipelines with Spark and MilvusBuilding Production Ready Search Pipelines with Spark and Milvus
Building Production Ready Search Pipelines with Spark and Milvus
Zilliz
 
Presentation of the OECD Artificial Intelligence Review of Germany
Presentation of the OECD Artificial Intelligence Review of GermanyPresentation of the OECD Artificial Intelligence Review of Germany
Presentation of the OECD Artificial Intelligence Review of Germany
innovationoecd
 
How to Get CNIC Information System with Paksim Ga.pptx
How to Get CNIC Information System with Paksim Ga.pptxHow to Get CNIC Information System with Paksim Ga.pptx
How to Get CNIC Information System with Paksim Ga.pptx
danishmna97
 
Recommendation System using RAG Architecture
Recommendation System using RAG ArchitectureRecommendation System using RAG Architecture
Recommendation System using RAG Architecture
fredae14
 
Ocean lotus Threat actors project by John Sitima 2024 (1).pptx
Ocean lotus Threat actors project by John Sitima 2024 (1).pptxOcean lotus Threat actors project by John Sitima 2024 (1).pptx
Ocean lotus Threat actors project by John Sitima 2024 (1).pptx
SitimaJohn
 
Artificial Intelligence for XMLDevelopment
Artificial Intelligence for XMLDevelopmentArtificial Intelligence for XMLDevelopment
Artificial Intelligence for XMLDevelopment
Octavian Nadolu
 
Digital Marketing Trends in 2024 | Guide for Staying Ahead
Digital Marketing Trends in 2024 | Guide for Staying AheadDigital Marketing Trends in 2024 | Guide for Staying Ahead
Digital Marketing Trends in 2024 | Guide for Staying Ahead
Wask
 
UiPath Test Automation using UiPath Test Suite series, part 6
UiPath Test Automation using UiPath Test Suite series, part 6UiPath Test Automation using UiPath Test Suite series, part 6
UiPath Test Automation using UiPath Test Suite series, part 6
DianaGray10
 
Your One-Stop Shop for Python Success: Top 10 US Python Development Providers
Your One-Stop Shop for Python Success: Top 10 US Python Development ProvidersYour One-Stop Shop for Python Success: Top 10 US Python Development Providers
Your One-Stop Shop for Python Success: Top 10 US Python Development Providers
akankshawande
 
How to Interpret Trends in the Kalyan Rajdhani Mix Chart.pdf
How to Interpret Trends in the Kalyan Rajdhani Mix Chart.pdfHow to Interpret Trends in the Kalyan Rajdhani Mix Chart.pdf
How to Interpret Trends in the Kalyan Rajdhani Mix Chart.pdf
Chart Kalyan
 
Main news related to the CCS TSI 2023 (2023/1695)
Main news related to the CCS TSI 2023 (2023/1695)Main news related to the CCS TSI 2023 (2023/1695)
Main news related to the CCS TSI 2023 (2023/1695)
Jakub Marek
 

Recently uploaded (20)

Programming Foundation Models with DSPy - Meetup Slides
Programming Foundation Models with DSPy - Meetup SlidesProgramming Foundation Models with DSPy - Meetup Slides
Programming Foundation Models with DSPy - Meetup Slides
 
Generating privacy-protected synthetic data using Secludy and Milvus
Generating privacy-protected synthetic data using Secludy and MilvusGenerating privacy-protected synthetic data using Secludy and Milvus
Generating privacy-protected synthetic data using Secludy and Milvus
 
みなさんこんにちはこれ何文字まで入るの?40文字以下不可とか本当に意味わからないけどこれ限界文字数書いてないからマジでやばい文字数いけるんじゃないの?えこ...
みなさんこんにちはこれ何文字まで入るの?40文字以下不可とか本当に意味わからないけどこれ限界文字数書いてないからマジでやばい文字数いけるんじゃないの?えこ...みなさんこんにちはこれ何文字まで入るの?40文字以下不可とか本当に意味わからないけどこれ限界文字数書いてないからマジでやばい文字数いけるんじゃないの?えこ...
みなさんこんにちはこれ何文字まで入るの?40文字以下不可とか本当に意味わからないけどこれ限界文字数書いてないからマジでやばい文字数いけるんじゃないの?えこ...
 
20240609 QFM020 Irresponsible AI Reading List May 2024
20240609 QFM020 Irresponsible AI Reading List May 202420240609 QFM020 Irresponsible AI Reading List May 2024
20240609 QFM020 Irresponsible AI Reading List May 2024
 
Best 20 SEO Techniques To Improve Website Visibility In SERP
Best 20 SEO Techniques To Improve Website Visibility In SERPBest 20 SEO Techniques To Improve Website Visibility In SERP
Best 20 SEO Techniques To Improve Website Visibility In SERP
 
OpenID AuthZEN Interop Read Out - Authorization
OpenID AuthZEN Interop Read Out - AuthorizationOpenID AuthZEN Interop Read Out - Authorization
OpenID AuthZEN Interop Read Out - Authorization
 
Nordic Marketo Engage User Group_June 13_ 2024.pptx
Nordic Marketo Engage User Group_June 13_ 2024.pptxNordic Marketo Engage User Group_June 13_ 2024.pptx
Nordic Marketo Engage User Group_June 13_ 2024.pptx
 
Taking AI to the Next Level in Manufacturing.pdf
Taking AI to the Next Level in Manufacturing.pdfTaking AI to the Next Level in Manufacturing.pdf
Taking AI to the Next Level in Manufacturing.pdf
 
Monitoring and Managing Anomaly Detection on OpenShift.pdf
Monitoring and Managing Anomaly Detection on OpenShift.pdfMonitoring and Managing Anomaly Detection on OpenShift.pdf
Monitoring and Managing Anomaly Detection on OpenShift.pdf
 
Building Production Ready Search Pipelines with Spark and Milvus
Building Production Ready Search Pipelines with Spark and MilvusBuilding Production Ready Search Pipelines with Spark and Milvus
Building Production Ready Search Pipelines with Spark and Milvus
 
Presentation of the OECD Artificial Intelligence Review of Germany
Presentation of the OECD Artificial Intelligence Review of GermanyPresentation of the OECD Artificial Intelligence Review of Germany
Presentation of the OECD Artificial Intelligence Review of Germany
 
How to Get CNIC Information System with Paksim Ga.pptx
How to Get CNIC Information System with Paksim Ga.pptxHow to Get CNIC Information System with Paksim Ga.pptx
How to Get CNIC Information System with Paksim Ga.pptx
 
Recommendation System using RAG Architecture
Recommendation System using RAG ArchitectureRecommendation System using RAG Architecture
Recommendation System using RAG Architecture
 
Ocean lotus Threat actors project by John Sitima 2024 (1).pptx
Ocean lotus Threat actors project by John Sitima 2024 (1).pptxOcean lotus Threat actors project by John Sitima 2024 (1).pptx
Ocean lotus Threat actors project by John Sitima 2024 (1).pptx
 
Artificial Intelligence for XMLDevelopment
Artificial Intelligence for XMLDevelopmentArtificial Intelligence for XMLDevelopment
Artificial Intelligence for XMLDevelopment
 
Digital Marketing Trends in 2024 | Guide for Staying Ahead
Digital Marketing Trends in 2024 | Guide for Staying AheadDigital Marketing Trends in 2024 | Guide for Staying Ahead
Digital Marketing Trends in 2024 | Guide for Staying Ahead
 
UiPath Test Automation using UiPath Test Suite series, part 6
UiPath Test Automation using UiPath Test Suite series, part 6UiPath Test Automation using UiPath Test Suite series, part 6
UiPath Test Automation using UiPath Test Suite series, part 6
 
Your One-Stop Shop for Python Success: Top 10 US Python Development Providers
Your One-Stop Shop for Python Success: Top 10 US Python Development ProvidersYour One-Stop Shop for Python Success: Top 10 US Python Development Providers
Your One-Stop Shop for Python Success: Top 10 US Python Development Providers
 
How to Interpret Trends in the Kalyan Rajdhani Mix Chart.pdf
How to Interpret Trends in the Kalyan Rajdhani Mix Chart.pdfHow to Interpret Trends in the Kalyan Rajdhani Mix Chart.pdf
How to Interpret Trends in the Kalyan Rajdhani Mix Chart.pdf
 
Main news related to the CCS TSI 2023 (2023/1695)
Main news related to the CCS TSI 2023 (2023/1695)Main news related to the CCS TSI 2023 (2023/1695)
Main news related to the CCS TSI 2023 (2023/1695)
 

Wikipedia for Researchers

  • 1. Wikipedia for Researchers Andrew Gray – Wikipedian in Residence andrew.gray@bl.uk / @generalising
  • 2. About Wikipedia & Wikimedia  Wikimedia  Movement and charitable body  80,000 contributors in 280 languages and eleven core projects  Image repository, dictionary, news site…  …read by 7% of the world!  Wikipedia  19,000,000 articles, 4,000,000 in English  6,500 articles and 235,000 edits per day (…and ten years ago, this was all fields…) 2
  • 3. …so what is Wikipedia?  …an encyclopedia  …written neutrally and verifiably  …using previously published information  …free to use, distribute, or reuse  …a collaborative community  …with no firm rules 3
  • 4. Internal processes  All edits are visible through watchlists and page histories  About 7% are vandalism or malicious; processes to detect these  Median time to correction < 2 minutes… but some stay much longer  Individual discussion pages for all articles – “talk”  Quality review and assessment process  Specialised “wikiproject” working groups and central noticeboards  eg/ content topics; style; dispute resolution; copyright; etc. 4
  • 5. Quality of Wikipedia  On average… it’s not bad  In 2005 four errors per article, versus three in Britannica  In 2011, in English, Spanish & Arabic: “…the Wikipedia articles in this sample scored higher overall than the comparison articles with respect to accuracy, references, style/ readability and overall judgment…”  Millions of articles – so many are, individually, problematic  Various ways of identifying “signs” of quality  Markers for quality are both obvious and subtle  Very effective “springboard” tool 5
  • 6. Looking for quality  Corner icons  - article locked down in some way  - featured or “good” quality  Problem tags  Article talk pages and histories  Style  Badly written or formatted articles = often neglected 6
  • 7. Accessing other content  Structured categories and navigational templates  “What links here” 7
  • 8. Moving on to other content  Other languages – not translations, and may have more content  Mousing over footnote markers  Within the references:  Links through DOIs and other identifiers  ISBNs go to a special landing page  …and then out to libraries, booksellers, etc  ISSNs go to WorldCat  If an author, look for authority control links: 8
  • 9. Preferences  Available to logged in users  Two particularly useful options:  New window for external links (Gadgets > Browsing)  Quality assessment in headers (Gadgets > Appearance)  Many others - mostly editor-oriented tools 9
  • 10. Looking for sets of material  Some tools available – http://www.toolserver.org  Complex to use, but rewarding  CatScan: look for intersection of categories  “all physicists born in 1912” – 51 in English, 34 in German  Full dumps of all data available – http://dumps.wikipedia.org 10
  • 11. Research about Wikipedia  Thriving research around Wikipedia community & content  by mid-2011, 2100 peer-reviewed articles and 38 PhD theses  Active research committee and WMF support  Regular report - http://meta.wikimedia.org/wiki/Research:Newsletter  also @wikiresearch  Major themes include:  Community and content creation  Reading and researching by users  Quality of content  Technical research 11
  • 12. Research on communities  Research on the Wikipedia communities:  Dynamics of community conflict, discussions, collaboration, voting, contribution, mentoring…  Demographics, motivation and specialisms of contributors  Patterns of growth and content creation/deletion  Effect of central programs on volunteer activity  Cross-cultural interaction 12
  • 13. Research on users  Research on usage of Wikipedia:  Specific searching behaviour  Patterns of usage (yearly, daily)  Tracking external events (eg swine flu) through Wikipedia  Search engine rankings  Change in usage by students  Effect of Wikipedia publication on wider literature 13
  • 14. Research on content  Research on the content of Wikipedia:  Evolution of content  Accuracy, coverage and quality  Biases – geographic, cultural, gender  Linguistic analysis  Visualisations of content  Effect of external publications on Wikipedia 14
  • 15. Research on technical aspects  Research on the technical side of Wikipedia:  Extensive work on scaling open-content services  Tools for detecting and handling vandalism  Algorithmic detection and identification of bias, spam  Practical research on uses of wikis 15
  • 16. Research example – visualising art history http://commons.wikimedia.org/wiki/File:Wikiarthistory.png 16
  • 17. Research example – visualising editing patterns 17 http://commons.wikimedia.org/wiki/File:WikiTrip_egyptian_revolution_screenshot.png
  • 18. Research example – editor activity http://commons.wikimedia.org/wiki/File:Effect_of_barnstars_on_productivity.png 18