Crowdsourcing Transcription: Who, Why, What, and How Ben Brumfield, Perian Sully
Why? <ul><li>You can’t OCR cursive! </li></ul><ul><ul><li>1 in 3 letters correctly recognized at best </li></ul></ul><ul><...
Who? <ul><li>Genealogy Community </li></ul>
 
 
Who? <ul><li>Genealogy Community </li></ul><ul><li>Natural Sciences </li></ul>
 
 
Who? <ul><li>Genealogy Community </li></ul><ul><li>Natural Sciences </li></ul><ul><li>Open Source/Creative Commons </li></ul>
 
Who? <ul><li>Genealogy Community </li></ul><ul><li>Natural Sciences </li></ul><ul><li>Open Source/Creative Commons </li></...
 
 
 
Who? <ul><li>Genealogy Community </li></ul><ul><li>Natural Sciences </li></ul><ul><li>Open Source/Creative Commons </li></...
 
 
Who? <ul><li>Genealogy Community </li></ul><ul><li>Natural Sciences </li></ul><ul><li>Open Source/Creative Commons </li></...
(I’m saving this for Perian.)
Why? <ul><li>Sense of Purpose </li></ul>
 
Why? <ul><li>Sense of Purpose </li></ul><ul><li>Love of the Subject </li></ul>
Why? <ul><li>Sense of Purpose </li></ul><ul><li>Love of the Subject </li></ul><ul><li>Immersion in the Text </li></ul>
 
Why? <ul><li>Sense of Purpose </li></ul><ul><li>Love of the Subject </li></ul><ul><li>Immersion in the Text </li></ul><ul>...
 
Why? <ul><li>Sense of Purpose </li></ul><ul><li>Love of the Subject </li></ul><ul><li>Immersion in the Text </li></ul><ul>...
 
 
 
 
 
Why? <ul><li>Sense of Purpose </li></ul><ul><li>Love of the Subject </li></ul><ul><li>Immersion in the Text </li></ul><ul>...
 
 
How? <ul><li>Choose your material: </li></ul><ul><ul><li>Homogenous format. </li></ul></ul><ul><li>Decide on uses for the ...
 
 
 
 
How? <ul><li>Choose your material: </li></ul><ul><ul><li>Homogenous format. </li></ul></ul><ul><li>Decide on uses for the ...
Which Tool? <ul><li>Structured data or free-form text? </li></ul><ul><li>CMS integration or stand-alone? </li></ul><ul><li...
Which Tool? <ul><li>Free/Open-Source Options: </li></ul><ul><ul><li>Wikisource (MediaWiki+ProofreadPage) </li></ul></ul><u...
Thanks! <ul><li>Slides and links at </li></ul><ul><ul><li>http://manuscripttranscription.blogspot.com </li></ul></ul>
Upcoming SlideShare
Loading in …5
×

MCN2011 Crowdsourcing Transcription

3,687 views

Published on

The Who, What, Why and How of Crowdsourcing transcription.

0 Comments
1 Like
Statistics
Notes
  • Be the first to comment

No Downloads
Views
Total views
3,687
On SlideShare
0
From Embeds
0
Number of Embeds
2,609
Actions
Shares
0
Downloads
11
Comments
0
Likes
1
Embeds 0
No embeds

No notes for slide

MCN2011 Crowdsourcing Transcription

  1. 1. Crowdsourcing Transcription: Who, Why, What, and How Ben Brumfield, Perian Sully
  2. 2. Why? <ul><li>You can’t OCR cursive! </li></ul><ul><ul><li>1 in 3 letters correctly recognized at best </li></ul></ul><ul><li>Engagement/Outreach </li></ul>
  3. 3. Who? <ul><li>Genealogy Community </li></ul>
  4. 6. Who? <ul><li>Genealogy Community </li></ul><ul><li>Natural Sciences </li></ul>
  5. 9. Who? <ul><li>Genealogy Community </li></ul><ul><li>Natural Sciences </li></ul><ul><li>Open Source/Creative Commons </li></ul>
  6. 11. Who? <ul><li>Genealogy Community </li></ul><ul><li>Natural Sciences </li></ul><ul><li>Open Source/Creative Commons </li></ul><ul><li>Libraries </li></ul>
  7. 15. Who? <ul><li>Genealogy Community </li></ul><ul><li>Natural Sciences </li></ul><ul><li>Open Source/Creative Commons </li></ul><ul><li>Libraries </li></ul><ul><li>Archives </li></ul>
  8. 18. Who? <ul><li>Genealogy Community </li></ul><ul><li>Natural Sciences </li></ul><ul><li>Open Source/Creative Commons </li></ul><ul><li>Libraries </li></ul><ul><li>Archives </li></ul><ul><li>Museums! </li></ul>
  9. 19. (I’m saving this for Perian.)
  10. 20. Why? <ul><li>Sense of Purpose </li></ul>
  11. 22. Why? <ul><li>Sense of Purpose </li></ul><ul><li>Love of the Subject </li></ul>
  12. 23. Why? <ul><li>Sense of Purpose </li></ul><ul><li>Love of the Subject </li></ul><ul><li>Immersion in the Text </li></ul>
  13. 25. Why? <ul><li>Sense of Purpose </li></ul><ul><li>Love of the Subject </li></ul><ul><li>Immersion in the Text </li></ul><ul><li>It’s Fun! ( Gamification ) </li></ul>
  14. 27. Why? <ul><li>Sense of Purpose </li></ul><ul><li>Love of the Subject </li></ul><ul><li>Immersion in the Text </li></ul><ul><li>It’s Fun! ( Gamification ) </li></ul><ul><li>Balance of Motivations </li></ul>
  15. 33. Why? <ul><li>Sense of Purpose </li></ul><ul><li>Love of the Subject </li></ul><ul><li>Immersion in the Text </li></ul><ul><li>It’s Fun! ( Gamification ) </li></ul><ul><li>Balance of Motivations </li></ul><ul><li>Money </li></ul>
  16. 36. How? <ul><li>Choose your material: </li></ul><ul><ul><li>Homogenous format. </li></ul></ul><ul><li>Decide on uses for the transcription. </li></ul>
  17. 41. How? <ul><li>Choose your material: </li></ul><ul><ul><li>Homogenous format. </li></ul></ul><ul><li>Decide on uses for the transcription. </li></ul><ul><li>Find sources of volunteers. </li></ul><ul><li>Choose a tool. </li></ul>
  18. 42. Which Tool? <ul><li>Structured data or free-form text? </li></ul><ul><li>CMS integration or stand-alone? </li></ul><ul><li>What kind of mark-up? </li></ul><ul><li>What underlying technology? </li></ul>
  19. 43. Which Tool? <ul><li>Free/Open-Source Options: </li></ul><ul><ul><li>Wikisource (MediaWiki+ProofreadPage) </li></ul></ul><ul><ul><li>FromThePage </li></ul></ul><ul><ul><li>Scripto (War Department Papers) </li></ul></ul><ul><ul><li>Bentham Transcription Desk </li></ul></ul><ul><ul><li>Scribe (Zooniverse/OldWeather) </li></ul></ul><ul><ul><li>OpenScribe (PROV) </li></ul></ul><ul><li>Build your own </li></ul><ul><ul><li>(But please share it with the rest of us!) </li></ul></ul>
  20. 44. Thanks! <ul><li>Slides and links at </li></ul><ul><ul><li>http://manuscripttranscription.blogspot.com </li></ul></ul>

×