SlideShare a Scribd company logo
“Choir attempted that beautiful
anthem “Oh, Radiant Morn” –
made a hash of it”
Making a hash of the Adkin Diary transcriptions
Adrian Kingston
Collections Information Manager, Digital Assets and Development
Museum of New Zealand Te Papa Tongarewa
@adriankingston
Crowdsourcing for the Digital Humanities and Cultural Heritage Sector
Victoria University of Wellington, 23 April 2013
Wed. Apr. 23.
Worked at Swamp–Cow p[addock] fence. Bulliman took
48 heavy fat ewes at 15/-. In evening Father + I drove
down to Levin No L[icense] Democratic Vote Campaign
committee meeting. Father voted to chair + self
appointed secretary. Discussed campaign.
Background
 George Leslie Adkin; Farmer, photographer, geologist, explorer,
archaeologist, ethnologist.
 1 man, 41 diaries, 59 years, Over 21000 days
 Thousands of negatives and prints, some albums
 Initial deadline, launch of @life100yearsago ,a project of
WW100
 Did everything ourselves. We resourced most of this project
with a curator (Kirstie Ross) and a monkey with a keyboard
 Figure out process (imaging, cropping, loading, transcription
guidelines),
 Figure out content (data structure, quirks of Adkin, glossaries
etc.)
 Project? What project?
 Very early days.
Process
 Assess album condition
 Photograph album pages
 Crop pages to days
 Create narrative for day
 Load “day” images to EMu “day” narrative
 Transcribe
 Add associated subjects, people, places (from authority files
and controlled vocabularies)
 Add context to narrative entries for month
 Some parts semi-automated, some completely manual; some
need no special skills, others do
Received a letter + referee’s report from Dr
Chilton, Editor “Trans[actions of the] NZ
Inst[itute], on my paper on Tararuas = “my
theories based on too slender evidence and
debatable evidence + also in part erroneous (?
GLA). I decided to withdraw the paper as it is
evidently unsuitable for publication in
“Transactions”
http://collections.tepapa.govt.nz/theme.aspx?irn=4294
Framework
 Using existing framework; EMu, Collections Online
 CIDOC CRM for building and expressing relationships
 Days are conceptual entities, not physical. Framework allows
for this
 Links to physical entities, diaries, photographs, albums
 Links to people, places, topics
 However, scale of content of really starting to highlight issues of
display in Collections Online.
What we’ve learnt
 So much content, so much data
 More than just one man’s story, a huge data source on NZ life
 So much potential for a number of fields of research
 Our existing data structure works really well
 Transcription only one part
 To get most out of the content, need the links, need the rich
conceptual model
 Context needed, or at least useful, for the reader
 Existing display not so hot
 Enlivens the collection, a step beyond just digitisation and
transcription
Issues
 Size of the project is daunting, but the transcription seems
manageable to do through crowdsourcing
 There are a number of existing platforms that look great, but
how to deal with matching to our structure, vocabularies,
authorities?
 Could use automated in text authority mining, but would need
to then match back to authorities and structure
 Beyond scope of crowdsourcing? But does that diminish the
value of the “data”?
 Could come later though, are we getting too hung up on
quality?
Our potential crowd
 By starting it ourselves, we have some content available to
promote the crowdsourcing.
 Already had unsolicited volunteers
 The content is interesting: NZ history, early 20th Century
courtship, farming, geology, religion, war, politics, weather…
 Horowhenua locals interested in local history, and one of their
famous sons
 History students and educators
 Bring students closer to primary material, work with cursive
handwriting, highlight the importance of accuracy in relation to
data, personal biography
 Learning history through a first hand account
 Plan B is do war years with interns
We decided to go into town to lunch so I piloted
the party to Kirkcaldie + Stains where we had a
good dinner… Will wanted to know if one could
have all the courses for 2/-. I told him it was not
customary to indulge in more than six but that if
he wanted to tackle the lot we would have to
leave him at it. Olive ordered dishes she did not
want + Alice also got a bit mixed up.
http://collections.tepapa.govt.nz/theme.aspx?irn=4095
Where to
 Can’t do with existing (human) resource
 Transcription only one part of the project
 Need to figure what parts need to be crowdsourced, what can’t
 Transcription will enable the adding the contextual and semantic
relationships and links to other sources
 Options for automating the above
 Or, with a focussed crowd and a finite project, maybe we don’t need
a new platform, could provide training and use existing tools
 Can’t crowdsource the display platform. Or can we? Crowdfund it?
 Make data available for analysis, visualisation, research, fun
 Need to formalise the project
 Lots to figure out
In evening rode down to see Maud – showed
her some books but there seemed to be a lack
of sympathy between us + the evening was a
failure.
http://collections.tepapa.govt.nz/theme.aspx?irn=4080
See
 Adkin diaries of Collections Online
 @adkin_diary on Twitter
 @life100yearsago on Twitter
Questions?
 Kirstie Ross, Curator Modern New Zealand
 Adrian Kingston, Collections Information Manager
 Philip Edgar, Manager Digital Collections and Access

More Related Content

Similar to Making a hash of the Adkin Diary transcriptions

Data+Design
Data+DesignData+Design
Data+Design
Amanda Makulec
 
Repackaging research AASL 2013
Repackaging research AASL 2013Repackaging research AASL 2013
Repackaging research AASL 2013
Paige Jaeger
 
Get An Overall Idea of Digital Humanities
Get An Overall Idea of Digital HumanitiesGet An Overall Idea of Digital Humanities
Get An Overall Idea of Digital Humanities
India Assignment India
 
Essay Simple Life. Online assignment writing service.
Essay Simple Life. Online assignment writing service.Essay Simple Life. Online assignment writing service.
Essay Simple Life. Online assignment writing service.
Becky Smith
 
Dh presentation 2018
Dh presentation 2018Dh presentation 2018
Dh presentation 2018
University of Cape Town
 
SAWS Rouche Presentation HERA Event Feb 2015
SAWS Rouche Presentation HERA Event Feb 2015SAWS Rouche Presentation HERA Event Feb 2015
SAWS Rouche Presentation HERA Event Feb 2015
Arts and Humanities Research Council (AHRC)
 
NCompass Live: Library Lockdown - How to Build an Escape Room in Your Library
NCompass Live: Library Lockdown - How to Build an Escape Room in Your LibraryNCompass Live: Library Lockdown - How to Build an Escape Room in Your Library
NCompass Live: Library Lockdown - How to Build an Escape Room in Your Library
Nebraska Library Commission
 
Dh presentation 2019
Dh presentation 2019Dh presentation 2019
Dh presentation 2019
University of Cape Town
 
Session5 01.rutger vankoert
Session5 01.rutger vankoertSession5 01.rutger vankoert
Session5 01.rutger vankoert
IMPACT Centre of Competence
 
How to Craft and Deliver Winning Presentations
How to Craft and Deliver Winning PresentationsHow to Craft and Deliver Winning Presentations
How to Craft and Deliver Winning Presentations
Joaquim Jorge
 
Unit 6: Collective Learning (Part 1)
Unit 6: Collective Learning (Part 1)Unit 6: Collective Learning (Part 1)
Unit 6: Collective Learning (Part 1)
Big History Project
 
Future of semantic apps
Future of semantic appsFuture of semantic apps
Future of semantic apps
Anthony (Tony) Sarris
 
Knowledge = Information + Context
Knowledge = Information + ContextKnowledge = Information + Context
Knowledge = Information + Context
Stefan Gradmann
 
Putnam "This is Today's Metadata Quality"
Putnam "This is Today's Metadata Quality"Putnam "This is Today's Metadata Quality"
Putnam "This is Today's Metadata Quality"
National Information Standards Organization (NISO)
 
Steve Knight by Design
Steve Knight by DesignSteve Knight by Design
Steve Knight by Design
Future Perfect 2012
 
Joining forces with Wikipedia reasons, experiences and impact - Sharing is Ca...
Joining forces with Wikipedia reasons, experiences and impact - Sharing is Ca...Joining forces with Wikipedia reasons, experiences and impact - Sharing is Ca...
Joining forces with Wikipedia reasons, experiences and impact - Sharing is Ca...
PACKED vzw
 
Joining forces with Wikipedia reasons, experiences and impact - Sharing is Ca...
Joining forces with Wikipedia reasons, experiences and impact - Sharing is Ca...Joining forces with Wikipedia reasons, experiences and impact - Sharing is Ca...
Joining forces with Wikipedia reasons, experiences and impact - Sharing is Ca...
Olaf Janssen
 
Tribal libraries and archives panel session - NWILL, September 2021
Tribal libraries and archives  panel session - NWILL, September 2021Tribal libraries and archives  panel session - NWILL, September 2021
Tribal libraries and archives panel session - NWILL, September 2021
Manisha Khetarpal
 
Digital Fluencies: A Story of Trials & Triumph
Digital Fluencies: A Story of Trials & TriumphDigital Fluencies: A Story of Trials & Triumph
Digital Fluencies: A Story of Trials & Triumph
Kimberly Eke
 
Computationally Tracing Concepts Through Time and Space
Computationally Tracing Concepts Through Time and SpaceComputationally Tracing Concepts Through Time and Space
Computationally Tracing Concepts Through Time and Space
Marieke van Erp
 

Similar to Making a hash of the Adkin Diary transcriptions (20)

Data+Design
Data+DesignData+Design
Data+Design
 
Repackaging research AASL 2013
Repackaging research AASL 2013Repackaging research AASL 2013
Repackaging research AASL 2013
 
Get An Overall Idea of Digital Humanities
Get An Overall Idea of Digital HumanitiesGet An Overall Idea of Digital Humanities
Get An Overall Idea of Digital Humanities
 
Essay Simple Life. Online assignment writing service.
Essay Simple Life. Online assignment writing service.Essay Simple Life. Online assignment writing service.
Essay Simple Life. Online assignment writing service.
 
Dh presentation 2018
Dh presentation 2018Dh presentation 2018
Dh presentation 2018
 
SAWS Rouche Presentation HERA Event Feb 2015
SAWS Rouche Presentation HERA Event Feb 2015SAWS Rouche Presentation HERA Event Feb 2015
SAWS Rouche Presentation HERA Event Feb 2015
 
NCompass Live: Library Lockdown - How to Build an Escape Room in Your Library
NCompass Live: Library Lockdown - How to Build an Escape Room in Your LibraryNCompass Live: Library Lockdown - How to Build an Escape Room in Your Library
NCompass Live: Library Lockdown - How to Build an Escape Room in Your Library
 
Dh presentation 2019
Dh presentation 2019Dh presentation 2019
Dh presentation 2019
 
Session5 01.rutger vankoert
Session5 01.rutger vankoertSession5 01.rutger vankoert
Session5 01.rutger vankoert
 
How to Craft and Deliver Winning Presentations
How to Craft and Deliver Winning PresentationsHow to Craft and Deliver Winning Presentations
How to Craft and Deliver Winning Presentations
 
Unit 6: Collective Learning (Part 1)
Unit 6: Collective Learning (Part 1)Unit 6: Collective Learning (Part 1)
Unit 6: Collective Learning (Part 1)
 
Future of semantic apps
Future of semantic appsFuture of semantic apps
Future of semantic apps
 
Knowledge = Information + Context
Knowledge = Information + ContextKnowledge = Information + Context
Knowledge = Information + Context
 
Putnam "This is Today's Metadata Quality"
Putnam "This is Today's Metadata Quality"Putnam "This is Today's Metadata Quality"
Putnam "This is Today's Metadata Quality"
 
Steve Knight by Design
Steve Knight by DesignSteve Knight by Design
Steve Knight by Design
 
Joining forces with Wikipedia reasons, experiences and impact - Sharing is Ca...
Joining forces with Wikipedia reasons, experiences and impact - Sharing is Ca...Joining forces with Wikipedia reasons, experiences and impact - Sharing is Ca...
Joining forces with Wikipedia reasons, experiences and impact - Sharing is Ca...
 
Joining forces with Wikipedia reasons, experiences and impact - Sharing is Ca...
Joining forces with Wikipedia reasons, experiences and impact - Sharing is Ca...Joining forces with Wikipedia reasons, experiences and impact - Sharing is Ca...
Joining forces with Wikipedia reasons, experiences and impact - Sharing is Ca...
 
Tribal libraries and archives panel session - NWILL, September 2021
Tribal libraries and archives  panel session - NWILL, September 2021Tribal libraries and archives  panel session - NWILL, September 2021
Tribal libraries and archives panel session - NWILL, September 2021
 
Digital Fluencies: A Story of Trials & Triumph
Digital Fluencies: A Story of Trials & TriumphDigital Fluencies: A Story of Trials & Triumph
Digital Fluencies: A Story of Trials & Triumph
 
Computationally Tracing Concepts Through Time and Space
Computationally Tracing Concepts Through Time and SpaceComputationally Tracing Concepts Through Time and Space
Computationally Tracing Concepts Through Time and Space
 

More from donellemckinley

McKinley NDF2013 crowdsourcing
McKinley NDF2013 crowdsourcingMcKinley NDF2013 crowdsourcing
McKinley NDF2013 crowdsourcing
donellemckinley
 
McLean-letters
McLean-lettersMcLean-letters
McLean-letters
donellemckinley
 
PhD proposal: Specialized heuristics for crowdsourcing website design
PhD proposal: Specialized heuristics for crowdsourcing website designPhD proposal: Specialized heuristics for crowdsourcing website design
PhD proposal: Specialized heuristics for crowdsourcing website design
donellemckinley
 
Evaluating crowdsourcing websites
Evaluating crowdsourcing websitesEvaluating crowdsourcing websites
Evaluating crowdsourcing websites
donellemckinley
 
Crowdsourcing or bust: The Indexer, Archives NZ
Crowdsourcing or bust: The Indexer, Archives NZ Crowdsourcing or bust: The Indexer, Archives NZ
Crowdsourcing or bust: The Indexer, Archives NZ
donellemckinley
 
This is not a penis: User-generated tags
This is not a penis: User-generated tagsThis is not a penis: User-generated tags
This is not a penis: User-generated tags
donellemckinley
 
Crowd in the Cloud: Collaborative Frameworks for Virtual DH Projects
Crowd in the Cloud:  Collaborative Frameworks for Virtual DH ProjectsCrowd in the Cloud:  Collaborative Frameworks for Virtual DH Projects
Crowd in the Cloud: Collaborative Frameworks for Virtual DH Projects
donellemckinley
 
UC CEISMIC: some thoughts on crowd-sourcing earthquake content
UC CEISMIC: some thoughts on crowd-sourcing earthquake contentUC CEISMIC: some thoughts on crowd-sourcing earthquake content
UC CEISMIC: some thoughts on crowd-sourcing earthquake content
donellemckinley
 
Factors that influence an organization’s decision to adopt crowdsourcing: A r...
Factors that influence an organization’s decision to adopt crowdsourcing: A r...Factors that influence an organization’s decision to adopt crowdsourcing: A r...
Factors that influence an organization’s decision to adopt crowdsourcing: A r...
donellemckinley
 
Lessons from Transcribe Bentham
Lessons from Transcribe BenthamLessons from Transcribe Bentham
Lessons from Transcribe Bentham
donellemckinley
 
Crowdsourcing workshop quiz (answers)
Crowdsourcing workshop quiz (answers)Crowdsourcing workshop quiz (answers)
Crowdsourcing workshop quiz (answers)
donellemckinley
 
Optimizing crowdsourcing websites for volunteer participation-Donelle-McKinle...
Optimizing crowdsourcing websites for volunteer participation-Donelle-McKinle...Optimizing crowdsourcing websites for volunteer participation-Donelle-McKinle...
Optimizing crowdsourcing websites for volunteer participation-Donelle-McKinle...
donellemckinley
 

More from donellemckinley (12)

McKinley NDF2013 crowdsourcing
McKinley NDF2013 crowdsourcingMcKinley NDF2013 crowdsourcing
McKinley NDF2013 crowdsourcing
 
McLean-letters
McLean-lettersMcLean-letters
McLean-letters
 
PhD proposal: Specialized heuristics for crowdsourcing website design
PhD proposal: Specialized heuristics for crowdsourcing website designPhD proposal: Specialized heuristics for crowdsourcing website design
PhD proposal: Specialized heuristics for crowdsourcing website design
 
Evaluating crowdsourcing websites
Evaluating crowdsourcing websitesEvaluating crowdsourcing websites
Evaluating crowdsourcing websites
 
Crowdsourcing or bust: The Indexer, Archives NZ
Crowdsourcing or bust: The Indexer, Archives NZ Crowdsourcing or bust: The Indexer, Archives NZ
Crowdsourcing or bust: The Indexer, Archives NZ
 
This is not a penis: User-generated tags
This is not a penis: User-generated tagsThis is not a penis: User-generated tags
This is not a penis: User-generated tags
 
Crowd in the Cloud: Collaborative Frameworks for Virtual DH Projects
Crowd in the Cloud:  Collaborative Frameworks for Virtual DH ProjectsCrowd in the Cloud:  Collaborative Frameworks for Virtual DH Projects
Crowd in the Cloud: Collaborative Frameworks for Virtual DH Projects
 
UC CEISMIC: some thoughts on crowd-sourcing earthquake content
UC CEISMIC: some thoughts on crowd-sourcing earthquake contentUC CEISMIC: some thoughts on crowd-sourcing earthquake content
UC CEISMIC: some thoughts on crowd-sourcing earthquake content
 
Factors that influence an organization’s decision to adopt crowdsourcing: A r...
Factors that influence an organization’s decision to adopt crowdsourcing: A r...Factors that influence an organization’s decision to adopt crowdsourcing: A r...
Factors that influence an organization’s decision to adopt crowdsourcing: A r...
 
Lessons from Transcribe Bentham
Lessons from Transcribe BenthamLessons from Transcribe Bentham
Lessons from Transcribe Bentham
 
Crowdsourcing workshop quiz (answers)
Crowdsourcing workshop quiz (answers)Crowdsourcing workshop quiz (answers)
Crowdsourcing workshop quiz (answers)
 
Optimizing crowdsourcing websites for volunteer participation-Donelle-McKinle...
Optimizing crowdsourcing websites for volunteer participation-Donelle-McKinle...Optimizing crowdsourcing websites for volunteer participation-Donelle-McKinle...
Optimizing crowdsourcing websites for volunteer participation-Donelle-McKinle...
 

Recently uploaded

TrustArc Webinar - 2024 Global Privacy Survey
TrustArc Webinar - 2024 Global Privacy SurveyTrustArc Webinar - 2024 Global Privacy Survey
TrustArc Webinar - 2024 Global Privacy Survey
TrustArc
 
Communications Mining Series - Zero to Hero - Session 1
Communications Mining Series - Zero to Hero - Session 1Communications Mining Series - Zero to Hero - Session 1
Communications Mining Series - Zero to Hero - Session 1
DianaGray10
 
Microsoft - Power Platform_G.Aspiotis.pdf
Microsoft - Power Platform_G.Aspiotis.pdfMicrosoft - Power Platform_G.Aspiotis.pdf
Microsoft - Power Platform_G.Aspiotis.pdf
Uni Systems S.M.S.A.
 
GraphSummit Singapore | Enhancing Changi Airport Group's Passenger Experience...
GraphSummit Singapore | Enhancing Changi Airport Group's Passenger Experience...GraphSummit Singapore | Enhancing Changi Airport Group's Passenger Experience...
GraphSummit Singapore | Enhancing Changi Airport Group's Passenger Experience...
Neo4j
 
Infrastructure Challenges in Scaling RAG with Custom AI models
Infrastructure Challenges in Scaling RAG with Custom AI modelsInfrastructure Challenges in Scaling RAG with Custom AI models
Infrastructure Challenges in Scaling RAG with Custom AI models
Zilliz
 
GraphSummit Singapore | The Art of the Possible with Graph - Q2 2024
GraphSummit Singapore | The Art of the  Possible with Graph - Q2 2024GraphSummit Singapore | The Art of the  Possible with Graph - Q2 2024
GraphSummit Singapore | The Art of the Possible with Graph - Q2 2024
Neo4j
 
“I’m still / I’m still / Chaining from the Block”
“I’m still / I’m still / Chaining from the Block”“I’m still / I’m still / Chaining from the Block”
“I’m still / I’m still / Chaining from the Block”
Claudio Di Ciccio
 
Essentials of Automations: The Art of Triggers and Actions in FME
Essentials of Automations: The Art of Triggers and Actions in FMEEssentials of Automations: The Art of Triggers and Actions in FME
Essentials of Automations: The Art of Triggers and Actions in FME
Safe Software
 
Full-RAG: A modern architecture for hyper-personalization
Full-RAG: A modern architecture for hyper-personalizationFull-RAG: A modern architecture for hyper-personalization
Full-RAG: A modern architecture for hyper-personalization
Zilliz
 
みなさんこんにちはこれ何文字まで入るの?40文字以下不可とか本当に意味わからないけどこれ限界文字数書いてないからマジでやばい文字数いけるんじゃないの?えこ...
みなさんこんにちはこれ何文字まで入るの?40文字以下不可とか本当に意味わからないけどこれ限界文字数書いてないからマジでやばい文字数いけるんじゃないの?えこ...みなさんこんにちはこれ何文字まで入るの?40文字以下不可とか本当に意味わからないけどこれ限界文字数書いてないからマジでやばい文字数いけるんじゃないの?えこ...
みなさんこんにちはこれ何文字まで入るの?40文字以下不可とか本当に意味わからないけどこれ限界文字数書いてないからマジでやばい文字数いけるんじゃないの?えこ...
名前 です男
 
20240605 QFM017 Machine Intelligence Reading List May 2024
20240605 QFM017 Machine Intelligence Reading List May 202420240605 QFM017 Machine Intelligence Reading List May 2024
20240605 QFM017 Machine Intelligence Reading List May 2024
Matthew Sinclair
 
HCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAU
HCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAUHCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAU
HCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAU
panagenda
 
HCL Notes and Domino License Cost Reduction in the World of DLAU
HCL Notes and Domino License Cost Reduction in the World of DLAUHCL Notes and Domino License Cost Reduction in the World of DLAU
HCL Notes and Domino License Cost Reduction in the World of DLAU
panagenda
 
Goodbye Windows 11: Make Way for Nitrux Linux 3.5.0!
Goodbye Windows 11: Make Way for Nitrux Linux 3.5.0!Goodbye Windows 11: Make Way for Nitrux Linux 3.5.0!
Goodbye Windows 11: Make Way for Nitrux Linux 3.5.0!
SOFTTECHHUB
 
20240609 QFM020 Irresponsible AI Reading List May 2024
20240609 QFM020 Irresponsible AI Reading List May 202420240609 QFM020 Irresponsible AI Reading List May 2024
20240609 QFM020 Irresponsible AI Reading List May 2024
Matthew Sinclair
 
Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...
Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...
Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...
SOFTTECHHUB
 
Uni Systems Copilot event_05062024_C.Vlachos.pdf
Uni Systems Copilot event_05062024_C.Vlachos.pdfUni Systems Copilot event_05062024_C.Vlachos.pdf
Uni Systems Copilot event_05062024_C.Vlachos.pdf
Uni Systems S.M.S.A.
 
Climate Impact of Software Testing at Nordic Testing Days
Climate Impact of Software Testing at Nordic Testing DaysClimate Impact of Software Testing at Nordic Testing Days
Climate Impact of Software Testing at Nordic Testing Days
Kari Kakkonen
 
Programming Foundation Models with DSPy - Meetup Slides
Programming Foundation Models with DSPy - Meetup SlidesProgramming Foundation Models with DSPy - Meetup Slides
Programming Foundation Models with DSPy - Meetup Slides
Zilliz
 
Removing Uninteresting Bytes in Software Fuzzing
Removing Uninteresting Bytes in Software FuzzingRemoving Uninteresting Bytes in Software Fuzzing
Removing Uninteresting Bytes in Software Fuzzing
Aftab Hussain
 

Recently uploaded (20)

TrustArc Webinar - 2024 Global Privacy Survey
TrustArc Webinar - 2024 Global Privacy SurveyTrustArc Webinar - 2024 Global Privacy Survey
TrustArc Webinar - 2024 Global Privacy Survey
 
Communications Mining Series - Zero to Hero - Session 1
Communications Mining Series - Zero to Hero - Session 1Communications Mining Series - Zero to Hero - Session 1
Communications Mining Series - Zero to Hero - Session 1
 
Microsoft - Power Platform_G.Aspiotis.pdf
Microsoft - Power Platform_G.Aspiotis.pdfMicrosoft - Power Platform_G.Aspiotis.pdf
Microsoft - Power Platform_G.Aspiotis.pdf
 
GraphSummit Singapore | Enhancing Changi Airport Group's Passenger Experience...
GraphSummit Singapore | Enhancing Changi Airport Group's Passenger Experience...GraphSummit Singapore | Enhancing Changi Airport Group's Passenger Experience...
GraphSummit Singapore | Enhancing Changi Airport Group's Passenger Experience...
 
Infrastructure Challenges in Scaling RAG with Custom AI models
Infrastructure Challenges in Scaling RAG with Custom AI modelsInfrastructure Challenges in Scaling RAG with Custom AI models
Infrastructure Challenges in Scaling RAG with Custom AI models
 
GraphSummit Singapore | The Art of the Possible with Graph - Q2 2024
GraphSummit Singapore | The Art of the  Possible with Graph - Q2 2024GraphSummit Singapore | The Art of the  Possible with Graph - Q2 2024
GraphSummit Singapore | The Art of the Possible with Graph - Q2 2024
 
“I’m still / I’m still / Chaining from the Block”
“I’m still / I’m still / Chaining from the Block”“I’m still / I’m still / Chaining from the Block”
“I’m still / I’m still / Chaining from the Block”
 
Essentials of Automations: The Art of Triggers and Actions in FME
Essentials of Automations: The Art of Triggers and Actions in FMEEssentials of Automations: The Art of Triggers and Actions in FME
Essentials of Automations: The Art of Triggers and Actions in FME
 
Full-RAG: A modern architecture for hyper-personalization
Full-RAG: A modern architecture for hyper-personalizationFull-RAG: A modern architecture for hyper-personalization
Full-RAG: A modern architecture for hyper-personalization
 
みなさんこんにちはこれ何文字まで入るの?40文字以下不可とか本当に意味わからないけどこれ限界文字数書いてないからマジでやばい文字数いけるんじゃないの?えこ...
みなさんこんにちはこれ何文字まで入るの?40文字以下不可とか本当に意味わからないけどこれ限界文字数書いてないからマジでやばい文字数いけるんじゃないの?えこ...みなさんこんにちはこれ何文字まで入るの?40文字以下不可とか本当に意味わからないけどこれ限界文字数書いてないからマジでやばい文字数いけるんじゃないの?えこ...
みなさんこんにちはこれ何文字まで入るの?40文字以下不可とか本当に意味わからないけどこれ限界文字数書いてないからマジでやばい文字数いけるんじゃないの?えこ...
 
20240605 QFM017 Machine Intelligence Reading List May 2024
20240605 QFM017 Machine Intelligence Reading List May 202420240605 QFM017 Machine Intelligence Reading List May 2024
20240605 QFM017 Machine Intelligence Reading List May 2024
 
HCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAU
HCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAUHCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAU
HCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAU
 
HCL Notes and Domino License Cost Reduction in the World of DLAU
HCL Notes and Domino License Cost Reduction in the World of DLAUHCL Notes and Domino License Cost Reduction in the World of DLAU
HCL Notes and Domino License Cost Reduction in the World of DLAU
 
Goodbye Windows 11: Make Way for Nitrux Linux 3.5.0!
Goodbye Windows 11: Make Way for Nitrux Linux 3.5.0!Goodbye Windows 11: Make Way for Nitrux Linux 3.5.0!
Goodbye Windows 11: Make Way for Nitrux Linux 3.5.0!
 
20240609 QFM020 Irresponsible AI Reading List May 2024
20240609 QFM020 Irresponsible AI Reading List May 202420240609 QFM020 Irresponsible AI Reading List May 2024
20240609 QFM020 Irresponsible AI Reading List May 2024
 
Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...
Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...
Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...
 
Uni Systems Copilot event_05062024_C.Vlachos.pdf
Uni Systems Copilot event_05062024_C.Vlachos.pdfUni Systems Copilot event_05062024_C.Vlachos.pdf
Uni Systems Copilot event_05062024_C.Vlachos.pdf
 
Climate Impact of Software Testing at Nordic Testing Days
Climate Impact of Software Testing at Nordic Testing DaysClimate Impact of Software Testing at Nordic Testing Days
Climate Impact of Software Testing at Nordic Testing Days
 
Programming Foundation Models with DSPy - Meetup Slides
Programming Foundation Models with DSPy - Meetup SlidesProgramming Foundation Models with DSPy - Meetup Slides
Programming Foundation Models with DSPy - Meetup Slides
 
Removing Uninteresting Bytes in Software Fuzzing
Removing Uninteresting Bytes in Software FuzzingRemoving Uninteresting Bytes in Software Fuzzing
Removing Uninteresting Bytes in Software Fuzzing
 

Making a hash of the Adkin Diary transcriptions

  • 1. “Choir attempted that beautiful anthem “Oh, Radiant Morn” – made a hash of it” Making a hash of the Adkin Diary transcriptions Adrian Kingston Collections Information Manager, Digital Assets and Development Museum of New Zealand Te Papa Tongarewa @adriankingston Crowdsourcing for the Digital Humanities and Cultural Heritage Sector Victoria University of Wellington, 23 April 2013
  • 2.
  • 3. Wed. Apr. 23. Worked at Swamp–Cow p[addock] fence. Bulliman took 48 heavy fat ewes at 15/-. In evening Father + I drove down to Levin No L[icense] Democratic Vote Campaign committee meeting. Father voted to chair + self appointed secretary. Discussed campaign.
  • 4. Background  George Leslie Adkin; Farmer, photographer, geologist, explorer, archaeologist, ethnologist.  1 man, 41 diaries, 59 years, Over 21000 days  Thousands of negatives and prints, some albums  Initial deadline, launch of @life100yearsago ,a project of WW100  Did everything ourselves. We resourced most of this project with a curator (Kirstie Ross) and a monkey with a keyboard  Figure out process (imaging, cropping, loading, transcription guidelines),  Figure out content (data structure, quirks of Adkin, glossaries etc.)  Project? What project?  Very early days.
  • 5. Process  Assess album condition  Photograph album pages  Crop pages to days  Create narrative for day  Load “day” images to EMu “day” narrative  Transcribe  Add associated subjects, people, places (from authority files and controlled vocabularies)  Add context to narrative entries for month  Some parts semi-automated, some completely manual; some need no special skills, others do
  • 6. Received a letter + referee’s report from Dr Chilton, Editor “Trans[actions of the] NZ Inst[itute], on my paper on Tararuas = “my theories based on too slender evidence and debatable evidence + also in part erroneous (? GLA). I decided to withdraw the paper as it is evidently unsuitable for publication in “Transactions” http://collections.tepapa.govt.nz/theme.aspx?irn=4294
  • 7. Framework  Using existing framework; EMu, Collections Online  CIDOC CRM for building and expressing relationships  Days are conceptual entities, not physical. Framework allows for this  Links to physical entities, diaries, photographs, albums  Links to people, places, topics  However, scale of content of really starting to highlight issues of display in Collections Online.
  • 8.
  • 9.
  • 10. What we’ve learnt  So much content, so much data  More than just one man’s story, a huge data source on NZ life  So much potential for a number of fields of research  Our existing data structure works really well  Transcription only one part  To get most out of the content, need the links, need the rich conceptual model  Context needed, or at least useful, for the reader  Existing display not so hot  Enlivens the collection, a step beyond just digitisation and transcription
  • 11.
  • 12. Issues  Size of the project is daunting, but the transcription seems manageable to do through crowdsourcing  There are a number of existing platforms that look great, but how to deal with matching to our structure, vocabularies, authorities?  Could use automated in text authority mining, but would need to then match back to authorities and structure  Beyond scope of crowdsourcing? But does that diminish the value of the “data”?  Could come later though, are we getting too hung up on quality?
  • 13.
  • 14. Our potential crowd  By starting it ourselves, we have some content available to promote the crowdsourcing.  Already had unsolicited volunteers  The content is interesting: NZ history, early 20th Century courtship, farming, geology, religion, war, politics, weather…  Horowhenua locals interested in local history, and one of their famous sons  History students and educators  Bring students closer to primary material, work with cursive handwriting, highlight the importance of accuracy in relation to data, personal biography  Learning history through a first hand account  Plan B is do war years with interns
  • 15. We decided to go into town to lunch so I piloted the party to Kirkcaldie + Stains where we had a good dinner… Will wanted to know if one could have all the courses for 2/-. I told him it was not customary to indulge in more than six but that if he wanted to tackle the lot we would have to leave him at it. Olive ordered dishes she did not want + Alice also got a bit mixed up. http://collections.tepapa.govt.nz/theme.aspx?irn=4095
  • 16. Where to  Can’t do with existing (human) resource  Transcription only one part of the project  Need to figure what parts need to be crowdsourced, what can’t  Transcription will enable the adding the contextual and semantic relationships and links to other sources  Options for automating the above  Or, with a focussed crowd and a finite project, maybe we don’t need a new platform, could provide training and use existing tools  Can’t crowdsource the display platform. Or can we? Crowdfund it?  Make data available for analysis, visualisation, research, fun  Need to formalise the project  Lots to figure out
  • 17. In evening rode down to see Maud – showed her some books but there seemed to be a lack of sympathy between us + the evening was a failure. http://collections.tepapa.govt.nz/theme.aspx?irn=4080
  • 18. See  Adkin diaries of Collections Online  @adkin_diary on Twitter  @life100yearsago on Twitter Questions?  Kirstie Ross, Curator Modern New Zealand  Adrian Kingston, Collections Information Manager  Philip Edgar, Manager Digital Collections and Access