SlideShare a Scribd company logo
1 of 29
Download to read offline
Crowdsourcing Cultural Heritage
UCL's Transcribe Bentham Project




Dr Melissa Terras
Senior Lecturer in Electronic Communication, UCL Dept of Information Studies
Deputy Director, UCL Centre for Digital Humanities
m.terras@ucl.ac.uk
Crowdsourcing Cultural Heritage



• Bentham and UCL
• Crowdsourcing
  – History and Ideas
  – Heritage and Culture
  – Features and Issues
• Transcribe Bentham
• Potentials and Problems
Jeremy Bentham (1748-1832)
                 •Jurist, philosopher, and legal and
                 social reformer
                 •Leading theorist in Anglo-American
                 philosophy of law
                 •Influenced the development of
                 welfarism
                 •Advocated utilitarianism
                 •Animal rights,
                 •Work on the “panopticon”

                 •Not founder of UCL, but...
                 •60,000 folios in UCL Sp. Collections
                 •Auto-icon
The Bentham Project


             • http://www.ucl.ac.uk/Bentham-Project/
             • Since 1959
             • “aims to produce a new scholarly
               edition of the works and
               correspondence of Jeremy Bentham”
             • twenty six volumes of the new
               Collected Works have been published
             • Previous AHRC grant catalogued the
               manuscripts
                – http://www.benthampapers.ucl.ac.uk/
First 80 hours: 20,000 volunteers, 170,000 pages read.
Currently: 26, 717 volunteers, 220,965 pages read. 237,867 to go
Crowdsourcing



• neologistic portmanteau of “crowd” and
  “outsourcing”
• coined by Jeff Howe in a June 2006 Wired
  magazine article “The Rise of Crowdsourcing”
  – Group intelligence
  – Cheap computers + large crowds = useful
  – “It’s not outsourcing; it’s crowdsourcing.”
Technology and crowd-based research
• Often those outside established institutions that
  have taken the lead in exploiting new technologies
   – Science in the 19th century
   – Classics, maths, black studies, astrophysics,
     oral history, women’s studies, contemporary
     history… all started outside established
     curricula
• Prizes for technological innovation
• Metal detectors/archaeology
• Binoculars/ ornithological fieldwork
• Cassette Recorders/ life history, oral history,
  language
• Telescopes/ astronomical research
Crowdsourcing tasks



•The harnessing of online activity to aid in large
scale projects that require human cognition
•Basic to complex tasks
   • Is this round or square? (yes/no)
   • Is this tag correct for this image?
   • Can you correct the OCR on this page?
Crowdsourcing: Potentials for heritage institutions

•   Achieving goals even with limited resources
•   Achieving goals faster
•   Build new virtual communities and user groups
•   Involve and engage the user community with collections
•   Utilising the knowledge, expertise and interest of the community
•   Improving the quality of data/resource (e.g. corrections), more accurate
    searching
•   Adding value to data (e.g. by addition of comments, tags, ratings, reviews).
•   Making data discoverable in different ways f (e.g. by tagging).
•   Gain insight on user desires by asking and then listening to the crowd.
•   Demonstrating the value and relevance of the institution in the community
•   Strengthen and builditrust and loyalty of collection users
•   Encourage a sense of public ownership and responsibility
•   Holley, R. (2010) “Crowdsourcing: How and Why Should Libraries Do It?” D-
    Lib Magazine http://www.dlib.org/dlib/march10/holley/03holley.html
Galaxy Zoo http://www.galaxyzoo.org/



• Online collaborative astronomy project
• Public assist in classifying millions of galaxies
  from digital photos taken by robots
• Released July 2007
• By August 2007 80,000 volunteers had classified
  10 million galaxies
• To date, more than 60 million galaxies classified
Australian Newspapers Digitisation Program
http://www.nla.gov.au/ndp/


• In 2007 The National Library of Australia began to
  digitise out of copyright newspapers
• However the OCR quality of newsprint is poor
• Opened up the text to allow users to correct
  mistakes in the OCR
• 9000+ members of the public have so far
  corrected 12.5 million lines of newspaper text
Victoria and Albert Museum Crowdsourcing
http://collections.vam.ac.uk/crowdsourcing/


• Search the collections contains 140,000 images,
  selected automatically from the database
• Many images not the best view of an object
• Asking users to help find best crops of images
• 28375 images done in a year
Crowd sourced projects
• Picture Australia, National Library of Australia
   – http://www.pictureaustralia.org/
• Family Search Indexing
   – http://www.familysearch.org/eng/indexing/frameset_indexing.asp
• Free BMD
   – http://www.freebmd.org.uk/
• Distributed Proofreaders (Project Gutenberg)
   – http://www.pgdp.net/c/
• Papyri
   – Project at Oxford to use Galaxy Zoo software to help in classification of
     documentary fragments
• Wikipedia
  – http://www.wikipedia.org/
What do we know of Volunteers?
• Majority of work done by 10% of users
• Clay Shirky describes activity as 'cognitive surplus' time for
  social endeavours, rather than watching TV
• Personal interest
• Personal reward
• Community aspect
• Lot of interest from retirement community, and disabled
  and terminally ill individuals
• Many build up IT expertise as they volunteer
• “addictive”
• Help achieve group goal
• Like to be rewarded
Successful Crowdsourcing




Rose Holley's checklist for crowdsourcing:
http://www.dlib.org/dlib/march10/holley/03holley.html
Enter Transcribe Bentham

• 10,000 images of Bentham’s manuscripts
• Ask user community to transcribe these
  – Provide plain text
  – Or “Markup” in rudimentary TEI
     • Underline, deletions, insertions
• Generate a “Knowledge Bank” of ideas from the
  transcripts
• Link with existing catalogue and transcripts
• Make material more accessible to scholars
Plan



•   Soft launch end of June
•   Full launch early July
•   In process of user testing and creation of system
•   Two full time RAs working on this
    – One for user testing and promotion
    – One for user testing and technical aspects
• http://www.ucl.ac.uk/transcribe-bentham/
User Interaction



• Involving users in the design process is key
• Currently recruiting for testers
• Will be working one to one with users
  – Established textual scholars from DH community
  – Members of the public
• Will open to Beta testing to find bugs
• Then onto full launch
Issues and Outcomes



• Worst Case Scenario?
• Best Case Scenario?
• Is this task suitable to crowd sourcing?
  – Complex
• How can we gauge success?
  – Monitor and log user interaction
  – Report back on initiatives
• How can we reach a user community?
Conclude



• Latest fad?
• Should provide input into cultural and heritage
  institutions, research, and projects
• Longer term outcomes
  – Sustainability
• Good to try these things!
• http://www.ucl.ac.uk/transcribe-bentham/

More Related Content

Viewers also liked

Curso Técnico Especializado: SIGA
Curso Técnico Especializado: SIGACurso Técnico Especializado: SIGA
Curso Técnico Especializado: SIGARC Consulting
 
The "New" Citizen Scientist, Crowdsourcing at the Smithsonian
The "New" Citizen Scientist, Crowdsourcing at the SmithsonianThe "New" Citizen Scientist, Crowdsourcing at the Smithsonian
The "New" Citizen Scientist, Crowdsourcing at the SmithsonianDan Davis
 
Crowdsourcing as productive engagement with cultural heritage
Crowdsourcing as productive engagement with cultural heritageCrowdsourcing as productive engagement with cultural heritage
Crowdsourcing as productive engagement with cultural heritageMia
 
Crowdsourcing lecture pres
Crowdsourcing lecture presCrowdsourcing lecture pres
Crowdsourcing lecture presOonagh Murphy
 
Linked Data for Digital History presentation for VU symposium "Connecting Dat...
Linked Data for Digital History presentation for VU symposium "Connecting Dat...Linked Data for Digital History presentation for VU symposium "Connecting Dat...
Linked Data for Digital History presentation for VU symposium "Connecting Dat...Victor de Boer
 
Transcribing between the lines: crowd-sourcing historic data collection
Transcribing between the lines: crowd-sourcing historic data collectionTranscribing between the lines: crowd-sourcing historic data collection
Transcribing between the lines: crowd-sourcing historic data collectionNicole Kearney
 
Design for Crowdsourcing
Design for CrowdsourcingDesign for Crowdsourcing
Design for Crowdsourcingmlascarides
 
Everyone wins: crowdsourcing games and museums
Everyone wins: crowdsourcing games and museumsEveryone wins: crowdsourcing games and museums
Everyone wins: crowdsourcing games and museumsMia
 
Crowdsourcing as Public Engagement
Crowdsourcing as Public  EngagementCrowdsourcing as Public  Engagement
Crowdsourcing as Public EngagementAlastair Dunning
 
Changing contexts: museums, audiences and technology
Changing contexts: museums, audiences and technologyChanging contexts: museums, audiences and technology
Changing contexts: museums, audiences and technologyMia
 
The crowd and the library
The crowd and the libraryThe crowd and the library
The crowd and the libraryTrevor Owens
 
دليل المعلمة لغتي للصف الثالث ابتدائي الفصل الدراسي الأول
دليل المعلمة لغتي للصف الثالث ابتدائي الفصل الدراسي الأولدليل المعلمة لغتي للصف الثالث ابتدائي الفصل الدراسي الأول
دليل المعلمة لغتي للصف الثالث ابتدائي الفصل الدراسي الأولwedad111
 
Digital History Presentation
Digital History PresentationDigital History Presentation
Digital History PresentationEdward Iglesias
 
Crowdsourcing Strategies for Archives, Nov 2010
Crowdsourcing Strategies for Archives, Nov 2010Crowdsourcing Strategies for Archives, Nov 2010
Crowdsourcing Strategies for Archives, Nov 2010Rose Holley
 
The IMLS National Digital Platform & Your Library: Tools You Can Use
The IMLS National Digital Platform & Your Library: Tools You Can UseThe IMLS National Digital Platform & Your Library: Tools You Can Use
The IMLS National Digital Platform & Your Library: Tools You Can UseTrevor Owens
 
Reaching out: museums, crowdsourcing and participatory heritage
Reaching out: museums, crowdsourcing and participatory heritageReaching out: museums, crowdsourcing and participatory heritage
Reaching out: museums, crowdsourcing and participatory heritageMia
 

Viewers also liked (17)

Curso Técnico Especializado: SIGA
Curso Técnico Especializado: SIGACurso Técnico Especializado: SIGA
Curso Técnico Especializado: SIGA
 
The "New" Citizen Scientist, Crowdsourcing at the Smithsonian
The "New" Citizen Scientist, Crowdsourcing at the SmithsonianThe "New" Citizen Scientist, Crowdsourcing at the Smithsonian
The "New" Citizen Scientist, Crowdsourcing at the Smithsonian
 
Crowdsourcing as productive engagement with cultural heritage
Crowdsourcing as productive engagement with cultural heritageCrowdsourcing as productive engagement with cultural heritage
Crowdsourcing as productive engagement with cultural heritage
 
Crowdsourcing lecture pres
Crowdsourcing lecture presCrowdsourcing lecture pres
Crowdsourcing lecture pres
 
Linked Data for Digital History presentation for VU symposium "Connecting Dat...
Linked Data for Digital History presentation for VU symposium "Connecting Dat...Linked Data for Digital History presentation for VU symposium "Connecting Dat...
Linked Data for Digital History presentation for VU symposium "Connecting Dat...
 
Transcribing between the lines: crowd-sourcing historic data collection
Transcribing between the lines: crowd-sourcing historic data collectionTranscribing between the lines: crowd-sourcing historic data collection
Transcribing between the lines: crowd-sourcing historic data collection
 
Design for Crowdsourcing
Design for CrowdsourcingDesign for Crowdsourcing
Design for Crowdsourcing
 
Everyone wins: crowdsourcing games and museums
Everyone wins: crowdsourcing games and museumsEveryone wins: crowdsourcing games and museums
Everyone wins: crowdsourcing games and museums
 
Crowdsourcing digital humanities
Crowdsourcing digital humanitiesCrowdsourcing digital humanities
Crowdsourcing digital humanities
 
Crowdsourcing as Public Engagement
Crowdsourcing as Public  EngagementCrowdsourcing as Public  Engagement
Crowdsourcing as Public Engagement
 
Changing contexts: museums, audiences and technology
Changing contexts: museums, audiences and technologyChanging contexts: museums, audiences and technology
Changing contexts: museums, audiences and technology
 
The crowd and the library
The crowd and the libraryThe crowd and the library
The crowd and the library
 
دليل المعلمة لغتي للصف الثالث ابتدائي الفصل الدراسي الأول
دليل المعلمة لغتي للصف الثالث ابتدائي الفصل الدراسي الأولدليل المعلمة لغتي للصف الثالث ابتدائي الفصل الدراسي الأول
دليل المعلمة لغتي للصف الثالث ابتدائي الفصل الدراسي الأول
 
Digital History Presentation
Digital History PresentationDigital History Presentation
Digital History Presentation
 
Crowdsourcing Strategies for Archives, Nov 2010
Crowdsourcing Strategies for Archives, Nov 2010Crowdsourcing Strategies for Archives, Nov 2010
Crowdsourcing Strategies for Archives, Nov 2010
 
The IMLS National Digital Platform & Your Library: Tools You Can Use
The IMLS National Digital Platform & Your Library: Tools You Can UseThe IMLS National Digital Platform & Your Library: Tools You Can Use
The IMLS National Digital Platform & Your Library: Tools You Can Use
 
Reaching out: museums, crowdsourcing and participatory heritage
Reaching out: museums, crowdsourcing and participatory heritageReaching out: museums, crowdsourcing and participatory heritage
Reaching out: museums, crowdsourcing and participatory heritage
 

Recently uploaded

Proudly South Africa powerpoint Thorisha.pptx
Proudly South Africa powerpoint Thorisha.pptxProudly South Africa powerpoint Thorisha.pptx
Proudly South Africa powerpoint Thorisha.pptxthorishapillay1
 
What is Model Inheritance in Odoo 17 ERP
What is Model Inheritance in Odoo 17 ERPWhat is Model Inheritance in Odoo 17 ERP
What is Model Inheritance in Odoo 17 ERPCeline George
 
USPS® Forced Meter Migration - How to Know if Your Postage Meter Will Soon be...
USPS® Forced Meter Migration - How to Know if Your Postage Meter Will Soon be...USPS® Forced Meter Migration - How to Know if Your Postage Meter Will Soon be...
USPS® Forced Meter Migration - How to Know if Your Postage Meter Will Soon be...Postal Advocate Inc.
 
How to do quick user assign in kanban in Odoo 17 ERP
How to do quick user assign in kanban in Odoo 17 ERPHow to do quick user assign in kanban in Odoo 17 ERP
How to do quick user assign in kanban in Odoo 17 ERPCeline George
 
ECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptx
ECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptxECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptx
ECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptxiammrhaywood
 
Visit to a blind student's school🧑‍🦯🧑‍🦯(community medicine)
Visit to a blind student's school🧑‍🦯🧑‍🦯(community medicine)Visit to a blind student's school🧑‍🦯🧑‍🦯(community medicine)
Visit to a blind student's school🧑‍🦯🧑‍🦯(community medicine)lakshayb543
 
4.18.24 Movement Legacies, Reflection, and Review.pptx
4.18.24 Movement Legacies, Reflection, and Review.pptx4.18.24 Movement Legacies, Reflection, and Review.pptx
4.18.24 Movement Legacies, Reflection, and Review.pptxmary850239
 
THEORIES OF ORGANIZATION-PUBLIC ADMINISTRATION
THEORIES OF ORGANIZATION-PUBLIC ADMINISTRATIONTHEORIES OF ORGANIZATION-PUBLIC ADMINISTRATION
THEORIES OF ORGANIZATION-PUBLIC ADMINISTRATIONHumphrey A Beña
 
AMERICAN LANGUAGE HUB_Level2_Student'sBook_Answerkey.pdf
AMERICAN LANGUAGE HUB_Level2_Student'sBook_Answerkey.pdfAMERICAN LANGUAGE HUB_Level2_Student'sBook_Answerkey.pdf
AMERICAN LANGUAGE HUB_Level2_Student'sBook_Answerkey.pdfphamnguyenenglishnb
 
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️9953056974 Low Rate Call Girls In Saket, Delhi NCR
 
Science 7 Quarter 4 Module 2: Natural Resources.pptx
Science 7 Quarter 4 Module 2: Natural Resources.pptxScience 7 Quarter 4 Module 2: Natural Resources.pptx
Science 7 Quarter 4 Module 2: Natural Resources.pptxMaryGraceBautista27
 
HỌC TỐT TIẾNG ANH 11 THEO CHƯƠNG TRÌNH GLOBAL SUCCESS ĐÁP ÁN CHI TIẾT - CẢ NĂ...
HỌC TỐT TIẾNG ANH 11 THEO CHƯƠNG TRÌNH GLOBAL SUCCESS ĐÁP ÁN CHI TIẾT - CẢ NĂ...HỌC TỐT TIẾNG ANH 11 THEO CHƯƠNG TRÌNH GLOBAL SUCCESS ĐÁP ÁN CHI TIẾT - CẢ NĂ...
HỌC TỐT TIẾNG ANH 11 THEO CHƯƠNG TRÌNH GLOBAL SUCCESS ĐÁP ÁN CHI TIẾT - CẢ NĂ...Nguyen Thanh Tu Collection
 
Keynote by Prof. Wurzer at Nordex about IP-design
Keynote by Prof. Wurzer at Nordex about IP-designKeynote by Prof. Wurzer at Nordex about IP-design
Keynote by Prof. Wurzer at Nordex about IP-designMIPLM
 
Procuring digital preservation CAN be quick and painless with our new dynamic...
Procuring digital preservation CAN be quick and painless with our new dynamic...Procuring digital preservation CAN be quick and painless with our new dynamic...
Procuring digital preservation CAN be quick and painless with our new dynamic...Jisc
 
Judging the Relevance and worth of ideas part 2.pptx
Judging the Relevance  and worth of ideas part 2.pptxJudging the Relevance  and worth of ideas part 2.pptx
Judging the Relevance and worth of ideas part 2.pptxSherlyMaeNeri
 
Gas measurement O2,Co2,& ph) 04/2024.pptx
Gas measurement O2,Co2,& ph) 04/2024.pptxGas measurement O2,Co2,& ph) 04/2024.pptx
Gas measurement O2,Co2,& ph) 04/2024.pptxDr.Ibrahim Hassaan
 
ENGLISH6-Q4-W3.pptxqurter our high choom
ENGLISH6-Q4-W3.pptxqurter our high choomENGLISH6-Q4-W3.pptxqurter our high choom
ENGLISH6-Q4-W3.pptxqurter our high choomnelietumpap1
 
Earth Day Presentation wow hello nice great
Earth Day Presentation wow hello nice greatEarth Day Presentation wow hello nice great
Earth Day Presentation wow hello nice greatYousafMalik24
 
Barangay Council for the Protection of Children (BCPC) Orientation.pptx
Barangay Council for the Protection of Children (BCPC) Orientation.pptxBarangay Council for the Protection of Children (BCPC) Orientation.pptx
Barangay Council for the Protection of Children (BCPC) Orientation.pptxCarlos105
 

Recently uploaded (20)

Proudly South Africa powerpoint Thorisha.pptx
Proudly South Africa powerpoint Thorisha.pptxProudly South Africa powerpoint Thorisha.pptx
Proudly South Africa powerpoint Thorisha.pptx
 
What is Model Inheritance in Odoo 17 ERP
What is Model Inheritance in Odoo 17 ERPWhat is Model Inheritance in Odoo 17 ERP
What is Model Inheritance in Odoo 17 ERP
 
USPS® Forced Meter Migration - How to Know if Your Postage Meter Will Soon be...
USPS® Forced Meter Migration - How to Know if Your Postage Meter Will Soon be...USPS® Forced Meter Migration - How to Know if Your Postage Meter Will Soon be...
USPS® Forced Meter Migration - How to Know if Your Postage Meter Will Soon be...
 
How to do quick user assign in kanban in Odoo 17 ERP
How to do quick user assign in kanban in Odoo 17 ERPHow to do quick user assign in kanban in Odoo 17 ERP
How to do quick user assign in kanban in Odoo 17 ERP
 
ECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptx
ECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptxECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptx
ECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptx
 
Visit to a blind student's school🧑‍🦯🧑‍🦯(community medicine)
Visit to a blind student's school🧑‍🦯🧑‍🦯(community medicine)Visit to a blind student's school🧑‍🦯🧑‍🦯(community medicine)
Visit to a blind student's school🧑‍🦯🧑‍🦯(community medicine)
 
4.18.24 Movement Legacies, Reflection, and Review.pptx
4.18.24 Movement Legacies, Reflection, and Review.pptx4.18.24 Movement Legacies, Reflection, and Review.pptx
4.18.24 Movement Legacies, Reflection, and Review.pptx
 
THEORIES OF ORGANIZATION-PUBLIC ADMINISTRATION
THEORIES OF ORGANIZATION-PUBLIC ADMINISTRATIONTHEORIES OF ORGANIZATION-PUBLIC ADMINISTRATION
THEORIES OF ORGANIZATION-PUBLIC ADMINISTRATION
 
AMERICAN LANGUAGE HUB_Level2_Student'sBook_Answerkey.pdf
AMERICAN LANGUAGE HUB_Level2_Student'sBook_Answerkey.pdfAMERICAN LANGUAGE HUB_Level2_Student'sBook_Answerkey.pdf
AMERICAN LANGUAGE HUB_Level2_Student'sBook_Answerkey.pdf
 
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
 
YOUVE_GOT_EMAIL_PRELIMS_EL_DORADO_2024.pptx
YOUVE_GOT_EMAIL_PRELIMS_EL_DORADO_2024.pptxYOUVE_GOT_EMAIL_PRELIMS_EL_DORADO_2024.pptx
YOUVE_GOT_EMAIL_PRELIMS_EL_DORADO_2024.pptx
 
Science 7 Quarter 4 Module 2: Natural Resources.pptx
Science 7 Quarter 4 Module 2: Natural Resources.pptxScience 7 Quarter 4 Module 2: Natural Resources.pptx
Science 7 Quarter 4 Module 2: Natural Resources.pptx
 
HỌC TỐT TIẾNG ANH 11 THEO CHƯƠNG TRÌNH GLOBAL SUCCESS ĐÁP ÁN CHI TIẾT - CẢ NĂ...
HỌC TỐT TIẾNG ANH 11 THEO CHƯƠNG TRÌNH GLOBAL SUCCESS ĐÁP ÁN CHI TIẾT - CẢ NĂ...HỌC TỐT TIẾNG ANH 11 THEO CHƯƠNG TRÌNH GLOBAL SUCCESS ĐÁP ÁN CHI TIẾT - CẢ NĂ...
HỌC TỐT TIẾNG ANH 11 THEO CHƯƠNG TRÌNH GLOBAL SUCCESS ĐÁP ÁN CHI TIẾT - CẢ NĂ...
 
Keynote by Prof. Wurzer at Nordex about IP-design
Keynote by Prof. Wurzer at Nordex about IP-designKeynote by Prof. Wurzer at Nordex about IP-design
Keynote by Prof. Wurzer at Nordex about IP-design
 
Procuring digital preservation CAN be quick and painless with our new dynamic...
Procuring digital preservation CAN be quick and painless with our new dynamic...Procuring digital preservation CAN be quick and painless with our new dynamic...
Procuring digital preservation CAN be quick and painless with our new dynamic...
 
Judging the Relevance and worth of ideas part 2.pptx
Judging the Relevance  and worth of ideas part 2.pptxJudging the Relevance  and worth of ideas part 2.pptx
Judging the Relevance and worth of ideas part 2.pptx
 
Gas measurement O2,Co2,& ph) 04/2024.pptx
Gas measurement O2,Co2,& ph) 04/2024.pptxGas measurement O2,Co2,& ph) 04/2024.pptx
Gas measurement O2,Co2,& ph) 04/2024.pptx
 
ENGLISH6-Q4-W3.pptxqurter our high choom
ENGLISH6-Q4-W3.pptxqurter our high choomENGLISH6-Q4-W3.pptxqurter our high choom
ENGLISH6-Q4-W3.pptxqurter our high choom
 
Earth Day Presentation wow hello nice great
Earth Day Presentation wow hello nice greatEarth Day Presentation wow hello nice great
Earth Day Presentation wow hello nice great
 
Barangay Council for the Protection of Children (BCPC) Orientation.pptx
Barangay Council for the Protection of Children (BCPC) Orientation.pptxBarangay Council for the Protection of Children (BCPC) Orientation.pptx
Barangay Council for the Protection of Children (BCPC) Orientation.pptx
 

Mterras 09 jun2010

  • 1. Crowdsourcing Cultural Heritage UCL's Transcribe Bentham Project Dr Melissa Terras Senior Lecturer in Electronic Communication, UCL Dept of Information Studies Deputy Director, UCL Centre for Digital Humanities m.terras@ucl.ac.uk
  • 2. Crowdsourcing Cultural Heritage • Bentham and UCL • Crowdsourcing – History and Ideas – Heritage and Culture – Features and Issues • Transcribe Bentham • Potentials and Problems
  • 3. Jeremy Bentham (1748-1832) •Jurist, philosopher, and legal and social reformer •Leading theorist in Anglo-American philosophy of law •Influenced the development of welfarism •Advocated utilitarianism •Animal rights, •Work on the “panopticon” •Not founder of UCL, but... •60,000 folios in UCL Sp. Collections •Auto-icon
  • 4. The Bentham Project • http://www.ucl.ac.uk/Bentham-Project/ • Since 1959 • “aims to produce a new scholarly edition of the works and correspondence of Jeremy Bentham” • twenty six volumes of the new Collected Works have been published • Previous AHRC grant catalogued the manuscripts – http://www.benthampapers.ucl.ac.uk/
  • 5.
  • 6. First 80 hours: 20,000 volunteers, 170,000 pages read. Currently: 26, 717 volunteers, 220,965 pages read. 237,867 to go
  • 7. Crowdsourcing • neologistic portmanteau of “crowd” and “outsourcing” • coined by Jeff Howe in a June 2006 Wired magazine article “The Rise of Crowdsourcing” – Group intelligence – Cheap computers + large crowds = useful – “It’s not outsourcing; it’s crowdsourcing.”
  • 8. Technology and crowd-based research • Often those outside established institutions that have taken the lead in exploiting new technologies – Science in the 19th century – Classics, maths, black studies, astrophysics, oral history, women’s studies, contemporary history… all started outside established curricula • Prizes for technological innovation • Metal detectors/archaeology • Binoculars/ ornithological fieldwork • Cassette Recorders/ life history, oral history, language • Telescopes/ astronomical research
  • 9. Crowdsourcing tasks •The harnessing of online activity to aid in large scale projects that require human cognition •Basic to complex tasks • Is this round or square? (yes/no) • Is this tag correct for this image? • Can you correct the OCR on this page?
  • 10. Crowdsourcing: Potentials for heritage institutions • Achieving goals even with limited resources • Achieving goals faster • Build new virtual communities and user groups • Involve and engage the user community with collections • Utilising the knowledge, expertise and interest of the community • Improving the quality of data/resource (e.g. corrections), more accurate searching • Adding value to data (e.g. by addition of comments, tags, ratings, reviews). • Making data discoverable in different ways f (e.g. by tagging). • Gain insight on user desires by asking and then listening to the crowd. • Demonstrating the value and relevance of the institution in the community • Strengthen and builditrust and loyalty of collection users • Encourage a sense of public ownership and responsibility • Holley, R. (2010) “Crowdsourcing: How and Why Should Libraries Do It?” D- Lib Magazine http://www.dlib.org/dlib/march10/holley/03holley.html
  • 11. Galaxy Zoo http://www.galaxyzoo.org/ • Online collaborative astronomy project • Public assist in classifying millions of galaxies from digital photos taken by robots • Released July 2007 • By August 2007 80,000 volunteers had classified 10 million galaxies • To date, more than 60 million galaxies classified
  • 12.
  • 13. Australian Newspapers Digitisation Program http://www.nla.gov.au/ndp/ • In 2007 The National Library of Australia began to digitise out of copyright newspapers • However the OCR quality of newsprint is poor • Opened up the text to allow users to correct mistakes in the OCR • 9000+ members of the public have so far corrected 12.5 million lines of newspaper text
  • 14.
  • 15. Victoria and Albert Museum Crowdsourcing http://collections.vam.ac.uk/crowdsourcing/ • Search the collections contains 140,000 images, selected automatically from the database • Many images not the best view of an object • Asking users to help find best crops of images • 28375 images done in a year
  • 16.
  • 17. Crowd sourced projects • Picture Australia, National Library of Australia – http://www.pictureaustralia.org/ • Family Search Indexing – http://www.familysearch.org/eng/indexing/frameset_indexing.asp • Free BMD – http://www.freebmd.org.uk/ • Distributed Proofreaders (Project Gutenberg) – http://www.pgdp.net/c/ • Papyri – Project at Oxford to use Galaxy Zoo software to help in classification of documentary fragments • Wikipedia – http://www.wikipedia.org/
  • 18. What do we know of Volunteers? • Majority of work done by 10% of users • Clay Shirky describes activity as 'cognitive surplus' time for social endeavours, rather than watching TV • Personal interest • Personal reward • Community aspect • Lot of interest from retirement community, and disabled and terminally ill individuals • Many build up IT expertise as they volunteer • “addictive” • Help achieve group goal • Like to be rewarded
  • 19. Successful Crowdsourcing Rose Holley's checklist for crowdsourcing: http://www.dlib.org/dlib/march10/holley/03holley.html
  • 20. Enter Transcribe Bentham • 10,000 images of Bentham’s manuscripts • Ask user community to transcribe these – Provide plain text – Or “Markup” in rudimentary TEI • Underline, deletions, insertions • Generate a “Knowledge Bank” of ideas from the transcripts • Link with existing catalogue and transcripts • Make material more accessible to scholars
  • 21.
  • 22. Plan • Soft launch end of June • Full launch early July • In process of user testing and creation of system • Two full time RAs working on this – One for user testing and promotion – One for user testing and technical aspects • http://www.ucl.ac.uk/transcribe-bentham/
  • 23. User Interaction • Involving users in the design process is key • Currently recruiting for testers • Will be working one to one with users – Established textual scholars from DH community – Members of the public • Will open to Beta testing to find bugs • Then onto full launch
  • 24.
  • 25.
  • 26.
  • 27.
  • 28. Issues and Outcomes • Worst Case Scenario? • Best Case Scenario? • Is this task suitable to crowd sourcing? – Complex • How can we gauge success? – Monitor and log user interaction – Report back on initiatives • How can we reach a user community?
  • 29. Conclude • Latest fad? • Should provide input into cultural and heritage institutions, research, and projects • Longer term outcomes – Sustainability • Good to try these things! • http://www.ucl.ac.uk/transcribe-bentham/