The domain as unifier, how focusing on social
history can bring technical fields together
Marieke van Erp

marieke.van.erp@vu.nl
About me
• Researcher in the Computational Lexicology & Terminology Lab at Vrije Universiteit Amsterdam
• Language Technology + Semantic Web
• Collaborations with humanities, cultural heritage & information professionals in CATCH, EU FP7 & CLARIAH projects
image source: http://www.bsbstaalbouw.nl/previews/2010/11/9/media_210_49423_media_210_49423_w600.jpg
Domains
(Social) History
Language Technology
Semantic Web
Language Technology
• aims to research & develop tools to extract information from text
• information retrieval, machine translation, deep reading
• majority of the datasets in the field are ‘current’ newspaper texts
• researchers are interested in finding out how their tool behaves in a different domain
Semantic Web
• aims to create a machine-readable Web
• knowledge modelling, formats, knowledge representation, data sharing
• Linked Open Data cloud provides entry point to many structured data sources
• many more users could benefit from Semantic Web technology
(Social) History
• interested in:
  • people
  • events
• many historians are interested in dealing with:
  • larger text corpora
  • quantitative methods
image source: https://upload.wikimedia.org/wikipedia/commons/7/74/York_Pioneers'_social_re-union_St_George's_Hall,_Toronto,_March_3,_1911_(HS85-10-23694).jpg
Components
[Diagram: (Social) History, Language Technology and Semantic Web, with the labels: knowledge modelling & representation; knowledge; information extraction; event extraction; named entity recognition and linking; vocabularies; entity graphs; standardisation; people & events; statistics; structured data]
• Goal of the project: interlink Rijksmuseum and Sound and Vision collections through events
• Digital Hermeneutics (History)
• Recognise events and participants in object descriptions (Language Technology)
• Model events and narratives (Semantic Web)
• Van Den Akker, C., Legêne, S., Van Erp, M., Aroyo, L., Segers, R., van Der Meij, L., Van Ossenbruggen, J., Schreiber, G., Wielinga, B., Oomen, J. and Jacobs, G., 2011, June. Digital hermeneutics: Agora and the online understanding of cultural heritage. In Proceedings of the 3rd International Web Science Conference (p. 10). ACM.
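As a rough illustration of what interlinking two collections through events could look like, the sketch below indexes objects by the event mentioned in their descriptions and keeps only the events that connect more than one collection. All records, identifiers and function names are invented for this example; it is a minimal sketch under those assumptions, not the project's actual implementation.

```python
from collections import defaultdict

# Invented records: (collection, object id, event label found in its description).
# In practice the event label would come from an event extraction step.
MENTIONS = [
    ("museum", "obj-1", "coronation"),
    ("archive", "clip-7", "coronation"),
    ("museum", "obj-2", "flood"),
]


def link_by_event(mentions):
    """Map each event label to the objects that mention it, per collection."""
    index = defaultdict(lambda: defaultdict(list))
    for collection, object_id, event in mentions:
        index[event][collection].append(object_id)
    return index


def cross_collection_links(index):
    """Keep only the events that connect objects from more than one collection."""
    return {
        event: dict(per_collection)
        for event, per_collection in index.items()
        if len(per_collection) > 1
    }


links = cross_collection_links(link_by_event(MENTIONS))
print(links)
```

Here only the "coronation" event survives, because it is the only one mentioned in both toy collections.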
Components
[Diagram: (Social) History, Language Technology and Semantic Web, with the labels: knowledge modelling & representation; event extraction; people & events]
Not only useful for historians
• http://www.newsreader-project.eu
• http://www.understandinglanguagebymachines.org/stories-and-world-views-as-a-key-to-understanding-language/
• http://www.cltl.nl/projects/current-projects/visualizing-uncertainty-and-perspectives/
• How can computational tools help in analysing digitised biographies? (History)
• Extract person names & information about persons from text (Language Technology)
• Model relationships between them (Semantic Web)
A Prosopography of Dutch Ministers (1575-1815)
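The two technical steps above (extracting persons and facts about them, then modelling relationships) can be sketched in a few lines. This is a toy example under stated assumptions: the person names and sentences are invented, the predicate names `studiedIn` and `ministerIn` are hypothetical, and a regular expression stands in for a real named entity recognition and relation extraction pipeline.

```python
import re

# Toy input: invented persons, not taken from any real biography corpus.
BIOGRAPHY = (
    "Jan Jansen studied in Leiden. "
    "Jan Jansen was minister in Delft. "
    "Pieter Smit was minister in Delft."
)

# Hypothetical mapping from surface phrases to predicate names.
RELATIONS = {
    "studied in": "studiedIn",
    "was minister in": "ministerIn",
}

# A capitalised multi-word name, one of the known phrases, a capitalised place.
PATTERN = re.compile(
    r"(?P<person>[A-Z][a-z]+(?: [A-Z][a-z]+)+) "
    r"(?P<phrase>" + "|".join(re.escape(p) for p in RELATIONS) + r") "
    r"(?P<place>[A-Z][a-z]+)"
)


def extract_triples(text):
    """Return (person, predicate, place) triples found by the pattern."""
    return [
        (m.group("person"), RELATIONS[m.group("phrase")], m.group("place"))
        for m in PATTERN.finditer(text)
    ]


def ministers_by_place(triples):
    """Group persons by the place where they served as minister."""
    by_place = {}
    for person, predicate, place in triples:
        if predicate == "ministerIn":
            by_place.setdefault(place, []).append(person)
    return by_place


triples = extract_triples(BIOGRAPHY)
print(triples)
print(ministers_by_place(triples))
```

Storing the output as subject, predicate, object triples keeps it close to the graph-shaped data that Semantic Web tools work with, so relationships such as "served in the same place" become simple lookups.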
Components
[Diagram: (Social) History, Language Technology and Semantic Web, with the labels: knowledge modelling & representation; named entity recognition; relationship extraction; people & what they did; WP3]
Components
[Diagram: (Social) History, Language Technology and Semantic Web, with the labels: knowledge; knowledge modelling; information extraction; event extraction; people & events; entity graphs; vocabularies]
How to make this happen?
image source: https://static.pexels.com/photos/7096/people-woman-coffee-meeting.jpg
Going forward
• What questions would you like to answer with Language Technology & Semantic Web?
• What awesome tools & skills do you have?
• What datasets do you have?
• How do you like your coffee?
image source: http://www.independent.ie/incoming/article31308951.ece/ALTERNATES/h342/tea.jpg
http://mariekevanerp.com
Thank you

Powerful Google developer tools for immediate impact! (2023-24 C)
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
Developing An App To Navigate The Roads of Brazil
Developing An App To Navigate The Roads of BrazilDeveloping An App To Navigate The Roads of Brazil
Developing An App To Navigate The Roads of Brazil
 
HTML Injection Attacks: Impact and Mitigation Strategies
HTML Injection Attacks: Impact and Mitigation StrategiesHTML Injection Attacks: Impact and Mitigation Strategies
HTML Injection Attacks: Impact and Mitigation Strategies
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
 
Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
Tech Trends Report 2024 Future Today Institute.pdf
Tech Trends Report 2024 Future Today Institute.pdfTech Trends Report 2024 Future Today Institute.pdf
Tech Trends Report 2024 Future Today Institute.pdf
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
 
What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivity
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
 

The domain as unifier, how focusing on social history can bring technical fields together

  • 1. The domain as unifier, how focusing on social history can bring technical fields together Marieke van Erp marieke.van.erp@vu.nl
  • 2. About me • Researcher in the Computational Lexicology & Terminology Lab at Vrije Universiteit Amsterdam • Language Technology + Semantic Web • Collaborations with humanities, cultural heritage & information professionals in CATCH, EU FP7 & CLARIAH projects image source: http://www.bsbstaalbouw.nl/previews/2010/11/9/media_210_49423_media_210_49423_w600.jpg
  • 4. Language Technology • aims to research & develop tools to extract information from text • information retrieval, machine translation, deep reading • majority of the datasets in the field are ‘current’ newspaper texts • researchers are interested in finding out how their tool behaves in a different domain
  • 5. Semantic Web • aims to create a machine readable Web • knowledge modelling, formats, knowledge representation, data sharing • Linked Open Data cloud provides entry point to many structured data sources • many more users could benefit from Semantic Web technology
  • 6. (Social) History • interested in: • people • events • many historians are interested in dealing with: • larger text corpora • quantitative methods image source: https://upload.wikimedia.org/wikipedia/commons/7/74/York_Pioneers'_social_re-union_St_George's_Hall,_Toronto,_March_3,_1911_(HS85-10-23694).jpg
  • 7. Components • Domains: (Social) History, Language Technology, Semantic Web • Diagram labels: knowledge modelling & representation; knowledge (twice); information extraction; event extraction; named entity recognition and linking; vocabularies (twice); entity graphs; standardisation; people & events; statistics; structured data (twice)
  • 8. • Goal of the project: interlink Rijksmuseum and Sound and Vision collections through events • Digital Hermeneutics (History) • Recognise events and participants in object descriptions (Language Technology) • Model events and narratives (Semantic Web) • Van den Akker, C., Legêne, S., van Erp, M., Aroyo, L., Segers, R., van der Meij, L., van Ossenbruggen, J., Schreiber, G., Wielinga, B., Oomen, J. and Jacobs, G., 2011, June. Digital hermeneutics: Agora and the online understanding of cultural heritage. In Proceedings of the 3rd International Web Science Conference (p. 10). ACM.
  • 9.
  • 10.
  • 11. Components • Domains: (Social) History, Language Technology, Semantic Web • Diagram labels: knowledge modelling & representation; event extraction; people & events
  • 12. Not only useful for historians • http://www.newsreader-project.eu • http://www.understandinglanguagebymachines.org/stories-and-world-views-as-a-key-to-understanding-language/ • http://www.cltl.nl/projects/current-projects/visualizing-uncertainty-and-perspectives/
  • 13. • How can computational tools help in analysing digitised biographies? (History) • Extract person names & information about persons from text (Language Technology) • Model relationships between them (Semantic Web)
  • 14. A Prosopography of Dutch Ministers (1575-1815)
  • 15. Components • Domains: (Social) History, Language Technology, Semantic Web • Diagram labels: knowledge modelling & representation; named entity recognition; people & what they did; relationship extraction
  • 16. WP3
  • 17. WP3
  • 18. Components • Domains: (Social) History, Language Technology, Semantic Web • Diagram labels: knowledge; knowledge modelling; information extraction; people & events; entity graphs; event extraction; vocabularies
  • 19. How to make this happen?
  • 21. Going forward • What questions would you like to answer with Language Technology & Semantic Web? • What awesome tools & skills do you have? • What datasets do you have? • How do you like your coffee? image source: http://www.independent.ie/incoming/article31308951.ece/ALTERNATES/h342/tea.jpg
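
The two technical steps named on slide 13 (extract person names from text, then model relationships between them) could be sketched roughly as follows. This is only an illustration under loud assumptions, not the pipeline used in the projects described: the regular expression, the function names and the `mentionedWith` predicate are all hypothetical, and a real system would use a trained named entity recogniser and an established vocabulary rather than a pattern match.

```python
import re
from itertools import combinations

# Hypothetical pattern (an assumption, not from the slides): a capitalised
# token, optional Dutch particles or further capitalised tokens, and a final
# capitalised token. It only handles ASCII letters and will over-match
# capitalised sentence openers, so it is a toy stand-in for real NER.
NAME = re.compile(
    r"[A-Z][a-z]+(?: (?:van|de|der|den|[A-Z][a-z]+))* [A-Z][a-z]+"
)


def extract_persons(text):
    """Return candidate person names in order of first appearance."""
    seen = []
    for match in NAME.finditer(text):
        name = match.group(0)
        if name not in seen:
            seen.append(name)
    return seen


def co_mention_triples(sentences):
    """Build (subject, predicate, object) triples for persons that are
    mentioned in the same sentence. 'mentionedWith' is a made-up predicate."""
    triples = set()
    for sentence in sentences:
        persons = sorted(extract_persons(sentence))
        for a, b in combinations(persons, 2):
            triples.add((a, "mentionedWith", b))
    return sorted(triples)


if __name__ == "__main__":
    # Invented example sentence, not taken from any biography collection.
    text = "Johannes van der Meer succeeded Pieter de Groot in Leiden."
    print(extract_persons(text))
    print(co_mention_triples([text]))
```

Co-mention is a deliberately weak relation; it only shows where the output of the language technology step becomes input for the knowledge modelling step.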