MAKING LINKS IN THE BHL 
Primary Source Materials as a 
Window to a Scientist’s Methods 
Constance Rinaldo, Librarian of the Ernst Mayr Library, MCZ, Harvard 
TDWG Annual Meeting 2014, Jonkoping, Sweden
Connecting Content: Field Notes, Specimens & 
Published Literature 
• Digitize 
• Deposit 
• Link 
• Repurpose
Why Field Notes? 
• Archival materials fill in the documentation of the full 
research cycle & are primary source material 
• Field notes provide unpublished observations, 
sketches, weather reports and species lists 
• Accessibility & adaptation for today’s tools and 
researchers 
• We chose William Brewster, an ornithologist who 
worked during the late 19th and early 20th centuries 
• Test case to connect old and current data: Brewster 
species lists & current EOL data 
• Connect content from multiple sources to advance 
scientific and educational pursuits = open science
Life Cycle Completed 
• Image digitized for BHL 
• Observations in notes, later digitized for BHL 
• Original specimen record 
• Publication of species description, digitized for BHL 
• Full digital specimen record with links to digitized material
Purposeful Gaming 
• Digitize horticultural catalogs 
• Select tool for transcription of handwritten & 
multi-column formatted BHL content 
• Transcribe field notes & catalogs (each page 
twice) 
• Crowdsource transcription 
• Compare digital outputs 
• Extract problem words for game 
• Build BHL technical framework for classifying, 
comparing & managing multiple OCR outputs
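The compare-and-extract steps above could be sketched as follows. This is a minimal illustration, not the project's actual tooling: it assumes two plain-text transcriptions of the same page, aligns them word by word with Python's standard `difflib` module, and reports the spans where they disagree. The function name `problem_words` and the second sample reading are hypothetical.

```python
import difflib


def problem_words(transcription_a, transcription_b):
    """Return a list of (words_a, words_b) pairs where two transcriptions differ.

    Each transcription is split on whitespace and the word sequences are
    aligned; every non-matching span is a candidate "problem word" group.
    """
    a = transcription_a.split()
    b = transcription_b.split()
    # autojunk=False keeps the matcher from discarding frequent words on long pages.
    matcher = difflib.SequenceMatcher(a=a, b=b, autojunk=False)
    disagreements = []
    for tag, i1, i2, j1, j2 in matcher.get_opcodes():
        if tag != "equal":
            disagreements.append((a[i1:i2], b[j1:j2]))
    return disagreements


# The first reading is adapted from the Brewster page; the second is an invented variant.
first = "four Tree Sparrows but nothing else save a single Robin"
second = "four Tree Sparrow but nothing else save a single Robin"
for words_a, words_b in problem_words(first, second):
    print(" ".join(words_a), "<->", " ".join(words_b))
```

Word-level alignment is only one option; a real pipeline might also normalize case and punctuation before comparing, or align at the character level for short words.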
Transcription Tool Criteria 
• Open source 
• Crowdsourcing capability 
• User-friendly 
• Allow administrative oversight and editing (i.e., reviewing, 
correcting, and validating transcriptions) 
• Provide transcription file exports that can be efficiently 
formatted for use by the game(s) 
• Sustainable (tool selected will hopefully be used 
permanently for BHL) 
• Code easy to install, manage, and troubleshoot 
• Technical support 
• Multiple transcriptions of a single page
Transcription Tools 
• FromThePage & DigiVol 
• Selected 2 tools to fulfill the need for 2 
transcriptions of each page 
• Built-in community of volunteers with DigiVol
"4058841","Jessica Mitchell","Joseph deVeer","JournalsWilliam00Brew_0013.jpg","Fully 
transcribed by Jessica Mitchell. Exported on 21-Oct-2014 from DigiVol 
(http://volunteer.ala.org.au)","05-Jun-2014 02:17:15","11-Jun-2014 
23:02:51","0","MCZ","1888\nMarch 20\nRevere Beach, Massachusetts.\n Cloudy with 
occasional light showers; warm.\n To revere Beach with Chadbourne by 9 a.m. train.\nLeft 
the cars at Point of Pines and first inspected'\nthe pines behind the large hotel in hopes of 
finding\nCrossbills there. There were English Sparrows in\nabundance and four Tree 
Sparrows (S. monticola) but\nnothing else save a single Robin. In the bushy thickets\naround 
the outskirts of the grove Song Sparrows\nswarming as usual at this season and, 
despite\nthe gloomy weather, singing freely. We saw none\nelsewhere along the beach 
although they used to\nbe numerous during migration time at several\nplaces, especially 
Oak Island.\n[margin]S.monticola[/margin]\n Near the extreme end of the Point we came 
on\na flock of about 15 Pine Linnets feeding among\nweeds on the side of a dyke 
embankment. Firing\ntwo barrels into these killed 
eight.\n[margin]Chrysomitris\npinus[/margin]\n Retracing our steps to the station & 
crossing the\nrailroad we next tried the marshes. There were no\nsmall birds there but we 
saw a flock of about\n30 Crows (evidently migrants), about as many\nGolden-eye Duck 
feeding in the river, and numerous\nHerring Gulls.\n The rest of the way to Oak Island we 
kept along\nthe beach ridge. Pine Linnets are exceedingly\nnumerous the entire distance, in 
flocks of 5 to 15 birds\neach. We shot nine more specimens. I made one\ncapital shot at a 
single bird passing very swiftly\nbefore the strong S. E. wind.\n Besides the Linnets we saw a 
single Snow Bunting,\n& many English Sparrows, the latter feeding on the\nwet beach in 
flocks. Returned to the city at 12 n.","13"
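A record like the one above could be read back into structured form along these lines. This is a rough sketch under stated assumptions, not DigiVol's documented export format: it assumes the image file name is the fourth field and the transcription the tenth (as in the record shown), that line breaks inside the transcription are encoded as the two-character sequence `\n`, and that marginal notes are wrapped in `[margin]...[/margin]`. The names `parse_record`, `IMAGE_FIELD` and `TEXT_FIELD` are hypothetical, and the sample is a shortened copy of the record.

```python
import csv
import io
import re

# Assumed field positions, inferred from the single record shown (no header row is given).
IMAGE_FIELD = 3
TEXT_FIELD = 9

# Marginal notes as they appear in the transcription text.
MARGIN = re.compile(r"\[margin\](.*?)\[/margin\]", re.DOTALL)

# Shortened copy of the exported record; "\\n" is a literal backslash followed by "n".
SAMPLE = (
    '"4058841","Jessica Mitchell","Joseph deVeer","JournalsWilliam00Brew_0013.jpg",'
    '"Fully transcribed by Jessica Mitchell.","05-Jun-2014 02:17:15",'
    '"11-Jun-2014 23:02:51","0","MCZ",'
    '"1888\\nMarch 20\\nRevere Beach, Massachusetts.\\n'
    '[margin]S.monticola[/margin]\\n Near the extreme end","13"'
)


def parse_record(line):
    """Split one CSV record into image name, text lines and marginal notes."""
    row = next(csv.reader(io.StringIO(line)))
    # Turn the encoded line breaks back into real newlines.
    text = row[TEXT_FIELD].replace("\\n", "\n")
    # Collect marginal notes, joining any that were split across lines.
    margins = [m.replace("\n", " ").strip() for m in MARGIN.findall(text)]
    # Drop the marginal notes from the running text and discard empty lines.
    body = MARGIN.sub("", text)
    lines = [ln.strip() for ln in body.split("\n") if ln.strip()]
    return {"image": row[IMAGE_FIELD], "lines": lines, "margins": margins}


record = parse_record(SAMPLE)
print(record["image"])
print(record["margins"])
```

Keeping the marginal notes separate from the running text may be useful, since in this page they carry the scientific names that could be linked to other data sources.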
http://www.tiltfactor.org/the-lab/
Access to Digitized Texts 
• Improved OCR from crowdsourcing & gaming 
• Technical infrastructure to manage & compare 
multiple text sources
Next steps 
• Social media campaign: transcription 
• Release games/more social media 
• Operationalize crowdsourcing of OCR 
improvements: data mining possibilities
More to come

Editor's Notes

  1. The story begins: Cal Acad, along with MOBOT, HUBot, HUEML, NYBG and AMNH, in association with the Smithsonian Fieldbook project. Ernst Mayr Library project: William Brewster, ornithologist, journals & diaries, 1865-1919.
  2. Open science resources, tools and applications are accelerating the rate at which historical and current biodiversity information can be mobilized, customized and turned into participatory activities. The data can be presented in new formats on the web and on mobile devices and are broadly available. Here we demonstrate some ways in which historical checklists and current knowledge can be melded using tools that support ecological research, management and educational activities. Brewster’s field notes make it possible to track species changes by comparing his checklists from 1892 to current checklists. By linking these varied data sources and tools, the data life cycle can be completed. William Brewster was an ornithologist who worked during the late 19th and early 20th centuries. This poster shows how his field notes, digitized and deposited in the Biodiversity Heritage Library (BHL), can be linked with current data in the Encyclopedia of Life (EOL). This case example demonstrates how open science projects can be used to connect content from multiple sources to advance scientific and educational pursuits.
  3. William Brewster, ornithologist, 1851-1919 He was 15 years old when he made these notes, 14 when he began to jot down his observations.
  4. Specimen digitization as part of CC
  5. Hand-transcribed species lists for March and November 1892.
  6. Repurposing content and building relevance to the present. Build a Brewster field guide for Cambridge in March and November in EOL (Marie/Tracy) so that comparisons can be made to current observations in Cambridge. Mention the iNaturalist tool for current info (still under development).
  7. At least for a couple of pages! The next step is the Biocaching App (under development): “what’s in my neighborhood”, based on Global Biodiversity Information Facility (GBIF) specimen maps with links to field notes. Curated observations can then be shared with GBIF. So what next? Transcriptions! Crowdsourcing! Making it fun. EOL connections.
  8. Purposeful Gaming & BHL: Clearly we need a better way to get transcriptions done and added to the BHL (a prototype is in progress). MOBOT leads, with HUEML, Cornell and NYBG.
  9. The problem: handwriting and multi-column layouts.
  10. 93,000 pages of seed catalogs to be added by Cornell and NYBG. Also ingesting content from the National Agricultural Library (a recently added BHL affiliate).
  11. Tools investigated in addition to DigiVol and FromThePage:
      • Transcribe Bentham (http://www.transcribe-bentham.da.ulcc.ac.uk/td/Transcribe_Bentham): eliminated because the code is difficult to install and it cannot export structured data
      • T-PEN (http://t-pen.org/TPEN/): eliminated because it was the least user-friendly of all tools reviewed (steep learning curve) and could not do bulk uploads of images
      • Transcribr (http://www.archives.gov/citizen-archivist/transcribe/): eliminated because image import/data export functionality was lacking
      • Smithsonian Transcription Tool (https://transcription.si.edu/): we wanted to use this tool, but the code was not available, and it was problematic to have the Smithsonian host our content
      • Scripto (http://scripto.org/): eliminated for various technical reasons, e.g. it relies on zoom.it, which is not well supported and didn’t work when tried. Harvard Library eliminated this tool as it wasn’t the most user-friendly
  12. No tools supported multiple transcriptions of a single page, so it was decided to implement the two top tools to provide two transcriptions per page as required for the game. DigiVol came with an existing community of transcribers. Also, looking at two tools gives us the opportunity to evaluate and compare them as we think about selecting a permanent tool for transcription in the BHL portal.
  13. Field notes page
  14. A transcription (2,000 pages of Brewster field books digitized twice to provide material for the game).
  15. Tiltfactor was selected to develop games to reconcile differing transcriptions: two games, one for gamers and one for non-gamers.