SlideShare a Scribd company logo
1 of 14
ARIADNE is funded by the European Commission's Seventh Framework Programme
Integrating Data for ArchaeologyIntegrating Data for Archaeology
Dimitris Gavrilis, Eleni Afiontzi, Johan Fihn, Olof Olsson,
Achille Felicetti, Franco Nicollucci, Sebastian Cuy
Dimitris Gavrilis, Eleni Afiontzi, Johan Fihn, Olof Olsson,
Achille Felicetti, Franco Nicollucci, Sebastian Cuy
Introduction
• Traditional projects in Archaeology focused on aggregating
data into one single format / system
– Provide users with a unified interface
– Improve search and retrieval
– Improve retrieval semantics through specialized metadata schemas
• ARIADNE goes one step further : data integration
– Try to model the domain information (ARIADNE Catalog Data Model)
– Use a curation aware aggregator to enrich information using the
above model
– Improve user experience through more substantial and powerful
queries
• Traditional projects in Archaeology focused on aggregating
data into one single format / system
– Provide users with a unified interface
– Improve search and retrieval
– Improve retrieval semantics through specialized metadata schemas
• ARIADNE goes one step further : data integration
– Try to model the domain information (ARIADNE Catalog Data Model)
– Use a curation aware aggregator to enrich information using the
above model
– Improve user experience through more substantial and powerful
queries
Innovation
• Why hasn’t anyone done this before ?
– Complexity
– Performance
– Domain knowledge
• Standard aggregation systems / architectures are
insufficient.
 ARIADNE Infrastructure
• Why hasn’t anyone done this before ?
– Complexity
– Performance
– Domain knowledge
• Standard aggregation systems / architectures are
insufficient.
 ARIADNE Infrastructure
ARIADNE Infrastructure
• Flexibility
– Ingest diverse and heterogeneous data
• XML, RDF, Excel, CSV, …
– Handle each datastream independently and according to
it’s requirements
• Adapting aggregation, validation, enrichment workflows
– Add new curation services easily and on demand
• Flexibility
– Ingest diverse and heterogeneous data
• XML, RDF, Excel, CSV, …
– Handle each datastream independently and according to
it’s requirements
• Adapting aggregation, validation, enrichment workflows
– Add new curation services easily and on demand
ARIADNE Infrastructure
• Complexity
– De-couple services complexity through a micro-service
oriented architecture
– Use loosely connecting services in a highly scalable
environment.
• Performance
– Scalable technologies
• Complexity
– De-couple services complexity through a micro-service
oriented architecture
– Use loosely connecting services in a highly scalable
environment.
• Performance
– Scalable technologies
ARIADNE Infrastructure
• Domain knowledge
– Integrate the domain model (ACDM) into the
infrastructure
– Make extensive use of domain thesauri (e.g. AAT) and
label every resource accordingly
– Create specialized micro-services for curating content
according to the domain needs
• Domain knowledge
– Integrate the domain model (ACDM) into the
infrastructure
– Make extensive use of domain thesauri (e.g. AAT) and
label every resource accordingly
– Create specialized micro-services for curating content
according to the domain needs
Data Integration Overall Architecture
RepositoryRepository
Excel SheetExcel Sheet
ARIADNE
Registry
ARIADNE
Registry
ValidationValidation
CleaningCleaning
EnrichmentEnrichment
IntegrationIntegration
RDF Store
(RDF)
RDF Store
(RDF)
Elastic
Search
Elastic
Search
RDF Store
(CRM)
RDF Store
(CRM)
ArchiveArchive
ARIADNE
Portal
ARIADNE
Portal
Integration
Experiments
Integration
Experiments
Use of RDF
• Every resource is assigned a unique and persistent
identifier that is resolved through a URI
• Every resource has an RDF representation according to
the ACDM schema
• Every resource is assigned a unique and persistent
identifier that is resolved through a URI
• Every resource has an RDF representation according to
the ACDM schema
Data Curation
• Use of curation micro-services for enriching content
– Geo-normalization (identify, extract and normalize places and
coordinates)
– Geo-coding (e.g. Geo-names)
– Thesauri mappings (map native subject terms to a common thesauri :
AAT)
– Temporal normalization (identify, extract and normalize dates)
– Gazetteers (e.g. DAI Gazetteer)
– Historical & Ancient place names identification (Pelagios & Pleiades)
– Temporal information mappings (Perio.do)
• Use of curation micro-services for enriching content
– Geo-normalization (identify, extract and normalize places and
coordinates)
– Geo-coding (e.g. Geo-names)
– Thesauri mappings (map native subject terms to a common thesauri :
AAT)
– Temporal normalization (identify, extract and normalize dates)
– Gazetteers (e.g. DAI Gazetteer)
– Historical & Ancient place names identification (Pelagios & Pleiades)
– Temporal information mappings (Perio.do)
Data Integration
• Data Integration is based on a 3+1 dimensions
– Subject
– Space
– Time
– Resource type
• Data Integration is based on a 3+1 dimensions
– Subject
– Space
– Time
– Resource type
Identify & Link together Resource Types
• Model individual information resource types (e.g.
collections, bibliographic reports, databases, datasets,
etc).
• Identify each resources type during ingestion
• Link / group different resource types
– E.g. put all related heterogeneous resource types
(reports, datasets,…) under the same collections
• Model individual information resource types (e.g.
collections, bibliographic reports, databases, datasets,
etc).
• Identify each resources type during ingestion
• Link / group different resource types
– E.g. put all related heterogeneous resource types
(reports, datasets,…) under the same collections
Thematic integration
• ARIADNE uses the AAT thesaurus to semantically label
ALL aggregated information.
• AAT terms act as a glue and when combined with spatial
and temporal information can produce great results
• Semantic expansion of terms is extensively being used in
order to improve retrieval.
• Expansion of multi-lingual terms facilitates cross-
language search without requiring automatic
translation.
• ARIADNE uses the AAT thesaurus to semantically label
ALL aggregated information.
• AAT terms act as a glue and when combined with spatial
and temporal information can produce great results
• Semantic expansion of terms is extensively being used in
order to improve retrieval.
• Expansion of multi-lingual terms facilitates cross-
language search without requiring automatic
translation.
Spatial & Temporal
• All resources with spatial information
– Are assigned WGS84 projected coordinates
• All resources with temporal information
– Are normalized according the ACDM dates (that takes into
account periods, period names and supports ISO date
format).
• All resources with spatial information
– Are assigned WGS84 projected coordinates
• All resources with temporal information
– Are normalized according the ACDM dates (that takes into
account periods, period names and supports ISO date
format).
Subject Terms Curation Lifecycle
Native SubjectsNative Subjects
Vocabulary
Mapping Tool
Vocabulary
Mapping Tool MOReMORe
mappings
*nativeSubjects
Provider Native
Repository
Provider Native
Repository
Excel Sheet
XML Files
Excel Sheet
XML Files RegistryRegistry
*nativeSubjects
*providedSubjects
*nativeSubjects
*providedSubjects
AATAAT
Elastic SearchElastic SearchARIADNE PortalARIADNE Portal
ACDM / Subjects (JSON)
**providedSubjects
**derivedSubjects
**broaderGenericSubjects
*nativeSubjects
ACDM / Subjects (JSON)
**providedSubjects
**derivedSubjects
**broaderGenericSubjects
*nativeSubjects
*mono-lingual (prefLabel only)
** multi-lingual (prefLabel & altLabel)

More Related Content

What's hot

What's hot (20)

Innovative methods for data integration: Linked Data and NLP
Innovative methods for data integration: Linked Data and NLPInnovative methods for data integration: Linked Data and NLP
Innovative methods for data integration: Linked Data and NLP
 
The ARIADNE interoperability framework, component architecture and registry s...
The ARIADNE interoperability framework, component architecture and registry s...The ARIADNE interoperability framework, component architecture and registry s...
The ARIADNE interoperability framework, component architecture and registry s...
 
EAA2014 Istanbul - Barriers and Opportunities for Linked Open Data use in Arc...
EAA2014 Istanbul - Barriers and Opportunities for Linked Open Data use in Arc...EAA2014 Istanbul - Barriers and Opportunities for Linked Open Data use in Arc...
EAA2014 Istanbul - Barriers and Opportunities for Linked Open Data use in Arc...
 
Open Access of Research Data - The Present and Future Situation in Germany
Open Access of Research Data - The Present and Future Situation in GermanyOpen Access of Research Data - The Present and Future Situation in Germany
Open Access of Research Data - The Present and Future Situation in Germany
 
Providing services search and beyond
Providing services  search and beyondProviding services  search and beyond
Providing services search and beyond
 
Ariadne Booklet 2016: Building a research infrastructure for Digital Archaeol...
Ariadne Booklet 2016: Building a research infrastructure for Digital Archaeol...Ariadne Booklet 2016: Building a research infrastructure for Digital Archaeol...
Ariadne Booklet 2016: Building a research infrastructure for Digital Archaeol...
 
Ariadne: Interoperability
Ariadne: InteroperabilityAriadne: Interoperability
Ariadne: Interoperability
 
Ariadne introduction
Ariadne introductionAriadne introduction
Ariadne introduction
 
Ireland: ARIADNE - Success stories from partners and the research community
Ireland: ARIADNE - Success stories from partners and the research communityIreland: ARIADNE - Success stories from partners and the research community
Ireland: ARIADNE - Success stories from partners and the research community
 
Integrating archaeological data: The ARIADNE Infrastructure, Achille Felicett...
Integrating archaeological data: The ARIADNE Infrastructure, Achille Felicett...Integrating archaeological data: The ARIADNE Infrastructure, Achille Felicett...
Integrating archaeological data: The ARIADNE Infrastructure, Achille Felicett...
 
The OAIS reference model and archaeological data
The OAIS reference model and archaeological dataThe OAIS reference model and archaeological data
The OAIS reference model and archaeological data
 
Good data management and online access
Good data management and online accessGood data management and online access
Good data management and online access
 
Ariadne overview
Ariadne overviewAriadne overview
Ariadne overview
 
Sweden: ARIADNE - Success stories from partners and the research community
Sweden: ARIADNE - Success stories from partners and the research communitySweden: ARIADNE - Success stories from partners and the research community
Sweden: ARIADNE - Success stories from partners and the research community
 
ARIADNE Registry - towards interoperability
ARIADNE Registry - towards interoperabilityARIADNE Registry - towards interoperability
ARIADNE Registry - towards interoperability
 
Ariadne Services
Ariadne ServicesAriadne Services
Ariadne Services
 
Fasti Online at the International Association of Classical Archaeology (AIAC)
Fasti Online at the International Association of Classical Archaeology (AIAC)Fasti Online at the International Association of Classical Archaeology (AIAC)
Fasti Online at the International Association of Classical Archaeology (AIAC)
 
Archaeology Data Service (ADS)
Archaeology Data Service (ADS)Archaeology Data Service (ADS)
Archaeology Data Service (ADS)
 
Comparing and matching archaeological excavation data for integration in onto...
Comparing and matching archaeological excavation data for integration in onto...Comparing and matching archaeological excavation data for integration in onto...
Comparing and matching archaeological excavation data for integration in onto...
 
Achille Felicetti "Introduction to the Ariadne winter school and to the ARIAD...
Achille Felicetti "Introduction to the Ariadne winter school and to the ARIAD...Achille Felicetti "Introduction to the Ariadne winter school and to the ARIAD...
Achille Felicetti "Introduction to the Ariadne winter school and to the ARIAD...
 

Viewers also liked

Viewers also liked (18)

Antiquarians in the 21st Century: Opening up our data
Antiquarians in the 21st Century: Opening up our dataAntiquarians in the 21st Century: Opening up our data
Antiquarians in the 21st Century: Opening up our data
 
Linked Open Data Approaches within the ARIADNE Project
Linked Open Data Approaches within the ARIADNE ProjectLinked Open Data Approaches within the ARIADNE Project
Linked Open Data Approaches within the ARIADNE Project
 
The Effect of ARIADNE: A Success Story Why ARIADNE Counts
The Effect of ARIADNE: A Success Story Why ARIADNE Counts The Effect of ARIADNE: A Success Story Why ARIADNE Counts
The Effect of ARIADNE: A Success Story Why ARIADNE Counts
 
Legacy data and archaeological archives in Europe and North Africa
Legacy data and archaeological archives in Europe and North AfricaLegacy data and archaeological archives in Europe and North Africa
Legacy data and archaeological archives in Europe and North Africa
 
Beyond the Pale: grey literature as a method of publication
Beyond the Pale: grey literature as a method of publicationBeyond the Pale: grey literature as a method of publication
Beyond the Pale: grey literature as a method of publication
 
The Ariadne Project
The Ariadne ProjectThe Ariadne Project
The Ariadne Project
 
Digital preservation and access in a European perspective: Introducing ARIADNE
Digital preservation and access in a European perspective: Introducing ARIADNEDigital preservation and access in a European perspective: Introducing ARIADNE
Digital preservation and access in a European perspective: Introducing ARIADNE
 
Bulgaria: ARIADNE - Success stories from partners and the research community
Bulgaria: ARIADNE - Success stories from partners and the research communityBulgaria: ARIADNE - Success stories from partners and the research community
Bulgaria: ARIADNE - Success stories from partners and the research community
 
France: ARIADNE - Success stories from partners and the research community
France: ARIADNE - Success stories from partners and the research communityFrance: ARIADNE - Success stories from partners and the research community
France: ARIADNE - Success stories from partners and the research community
 
Czech Republic: ARIADNE - Success stories from partners and the research comm...
Czech Republic: ARIADNE - Success stories from partners and the research comm...Czech Republic: ARIADNE - Success stories from partners and the research comm...
Czech Republic: ARIADNE - Success stories from partners and the research comm...
 
Italy: ARIADNE - Success stories from partners and the research community
Italy: ARIADNE - Success stories from partners and the research communityItaly: ARIADNE - Success stories from partners and the research community
Italy: ARIADNE - Success stories from partners and the research community
 
Austria: ARIADNE - Success stories from partners and the research community
Austria: ARIADNE - Success stories from partners and the research communityAustria: ARIADNE - Success stories from partners and the research community
Austria: ARIADNE - Success stories from partners and the research community
 
Introduction: Long-term preservation and access: Where is an archive for my ...
 Introduction: Long-term preservation and access: Where is an archive for my ... Introduction: Long-term preservation and access: Where is an archive for my ...
Introduction: Long-term preservation and access: Where is an archive for my ...
 
Publishing Open Access isn’t the End of the Story
Publishing Open Access isn’t the End of the StoryPublishing Open Access isn’t the End of the Story
Publishing Open Access isn’t the End of the Story
 
inkedARC.net: Accessing the Benefits of Open Data Practice within Archaeology
inkedARC.net: Accessing the Benefits of Open Data Practice within ArchaeologyinkedARC.net: Accessing the Benefits of Open Data Practice within Archaeology
inkedARC.net: Accessing the Benefits of Open Data Practice within Archaeology
 
Old and New Adventures in Open Access
Old and New Adventures in Open AccessOld and New Adventures in Open Access
Old and New Adventures in Open Access
 
Open Data Publication - Requirements, Good practices, and Benefits
Open Data Publication - Requirements, Good practices, and BenefitsOpen Data Publication - Requirements, Good practices, and Benefits
Open Data Publication - Requirements, Good practices, and Benefits
 
Eaa2014 Opportunities and Challenges with Open Access and Open Data in the UK
Eaa2014 Opportunities and Challenges with Open Access and Open Data in the UKEaa2014 Opportunities and Challenges with Open Access and Open Data in the UK
Eaa2014 Opportunities and Challenges with Open Access and Open Data in the UK
 

Similar to Integrating Data for Archaeology

E-ARK-iPRES2016-Bern-October-2016
E-ARK-iPRES2016-Bern-October-2016E-ARK-iPRES2016-Bern-October-2016
E-ARK-iPRES2016-Bern-October-2016
Sven Schlarb
 

Similar to Integrating Data for Archaeology (20)

E-ARK-iPRES2016-Bern-October-2016
E-ARK-iPRES2016-Bern-October-2016E-ARK-iPRES2016-Bern-October-2016
E-ARK-iPRES2016-Bern-October-2016
 
Rdap12 wrap up reagan moore
Rdap12 wrap up reagan mooreRdap12 wrap up reagan moore
Rdap12 wrap up reagan moore
 
Steve Mc Eachern Australian Data Archive
Steve Mc Eachern Australian Data ArchiveSteve Mc Eachern Australian Data Archive
Steve Mc Eachern Australian Data Archive
 
Do MORe with your data
Do MORe with your dataDo MORe with your data
Do MORe with your data
 
Harvard Hypermap: An Open Source Framework for Making the World’s Geospatial ...
Harvard Hypermap: An Open Source Framework for Making the World’s Geospatial ...Harvard Hypermap: An Open Source Framework for Making the World’s Geospatial ...
Harvard Hypermap: An Open Source Framework for Making the World’s Geospatial ...
 
Scaling up Linked Data
Scaling up Linked DataScaling up Linked Data
Scaling up Linked Data
 
USING E-INFRASTRUCTURES FOR BIODIVERSITY CONSERVATION - Module 2
USING E-INFRASTRUCTURES FOR BIODIVERSITY CONSERVATION - Module 2USING E-INFRASTRUCTURES FOR BIODIVERSITY CONSERVATION - Module 2
USING E-INFRASTRUCTURES FOR BIODIVERSITY CONSERVATION - Module 2
 
European databases in cultural heritage: making connections
European databases in cultural heritage: making connectionsEuropean databases in cultural heritage: making connections
European databases in cultural heritage: making connections
 
Linked Open Data and The Digital Archaeological Workflow at the Swedish Natio...
Linked Open Data and The Digital Archaeological Workflow at the Swedish Natio...Linked Open Data and The Digital Archaeological Workflow at the Swedish Natio...
Linked Open Data and The Digital Archaeological Workflow at the Swedish Natio...
 
Integrating an electronic lab notebook with a data repository; American Chemi...
Integrating an electronic lab notebook with a data repository; American Chemi...Integrating an electronic lab notebook with a data repository; American Chemi...
Integrating an electronic lab notebook with a data repository; American Chemi...
 
Elns and repositories, American Chemical Society, Dallas, March 2014
Elns and repositories, American Chemical Society, Dallas, March 2014Elns and repositories, American Chemical Society, Dallas, March 2014
Elns and repositories, American Chemical Society, Dallas, March 2014
 
From Ambition to Go Live SWIB.pdf
From Ambition to Go Live SWIB.pdfFrom Ambition to Go Live SWIB.pdf
From Ambition to Go Live SWIB.pdf
 
From Ambition to Go Live
From Ambition to Go LiveFrom Ambition to Go Live
From Ambition to Go Live
 
Evolving NASA’s Data and Information Systems for Earth Science
Evolving NASA’s Data and Information Systems for Earth ScienceEvolving NASA’s Data and Information Systems for Earth Science
Evolving NASA’s Data and Information Systems for Earth Science
 
Collections.ed – Launching the University Collections Online, Ianthe Sutherla...
Collections.ed – Launching the University Collections Online, Ianthe Sutherla...Collections.ed – Launching the University Collections Online, Ianthe Sutherla...
Collections.ed – Launching the University Collections Online, Ianthe Sutherla...
 
Methods for analyzing and mapping temporal data
Methods for analyzing and mapping temporal dataMethods for analyzing and mapping temporal data
Methods for analyzing and mapping temporal data
 
Semantics-enhanced Geoscience Interoperability, Analytics, and Applications
Semantics-enhanced Geoscience Interoperability, Analytics, and ApplicationsSemantics-enhanced Geoscience Interoperability, Analytics, and Applications
Semantics-enhanced Geoscience Interoperability, Analytics, and Applications
 
Managing provenance in the Social Sciences: the Data Documentation Initiative...
Managing provenance in the Social Sciences: the Data Documentation Initiative...Managing provenance in the Social Sciences: the Data Documentation Initiative...
Managing provenance in the Social Sciences: the Data Documentation Initiative...
 
Geoservices Activities at EDINA
Geoservices Activities at EDINAGeoservices Activities at EDINA
Geoservices Activities at EDINA
 
Easter JISC metadata May25 DT
Easter JISC metadata May25 DTEaster JISC metadata May25 DT
Easter JISC metadata May25 DT
 

More from ariadnenetwork

Eaa2021 476 norwegian_unimus
Eaa2021 476 norwegian_unimusEaa2021 476 norwegian_unimus
Eaa2021 476 norwegian_unimus
ariadnenetwork
 
Eaa2021 476 ways and capacity in archaeological data management in serbia
Eaa2021 476 ways and capacity in archaeological data management in serbiaEaa2021 476 ways and capacity in archaeological data management in serbia
Eaa2021 476 ways and capacity in archaeological data management in serbia
ariadnenetwork
 
Eaa2021 476 izeta cattaneo idacordig and suquia
 Eaa2021 476 izeta cattaneo idacordig and suquia Eaa2021 476 izeta cattaneo idacordig and suquia
Eaa2021 476 izeta cattaneo idacordig and suquia
ariadnenetwork
 
Eaa2021 476 preserving historic building documentation pakistan
Eaa2021 476 preserving historic building documentation  pakistanEaa2021 476 preserving historic building documentation  pakistan
Eaa2021 476 preserving historic building documentation pakistan
ariadnenetwork
 
Eaa2021 s476 ariadne-seadda
Eaa2021 s476 ariadne-seaddaEaa2021 s476 ariadne-seadda
Eaa2021 s476 ariadne-seadda
ariadnenetwork
 

More from ariadnenetwork (20)

ARIADNE plus - vms workshop.pdf
ARIADNE plus - vms workshop.pdfARIADNE plus - vms workshop.pdf
ARIADNE plus - vms workshop.pdf
 
DANS Data Trail Data Management Tools for Archaeologists
DANS Data Trail Data Management Tools for ArchaeologistsDANS Data Trail Data Management Tools for Archaeologists
DANS Data Trail Data Management Tools for Archaeologists
 
Eaa2021 476 natália botica - from 2_archis to datarepositorium2
Eaa2021 476 natália botica - from 2_archis to datarepositorium2Eaa2021 476 natália botica - from 2_archis to datarepositorium2
Eaa2021 476 natália botica - from 2_archis to datarepositorium2
 
Eaa2021 476 kecheva_nekhrizov_bulgaria
Eaa2021 476 kecheva_nekhrizov_bulgariaEaa2021 476 kecheva_nekhrizov_bulgaria
Eaa2021 476 kecheva_nekhrizov_bulgaria
 
Eaa2021 476 norwegian_unimus
Eaa2021 476 norwegian_unimusEaa2021 476 norwegian_unimus
Eaa2021 476 norwegian_unimus
 
Eaa2021 session 476 abstracts
Eaa2021 session 476 abstractsEaa2021 session 476 abstracts
Eaa2021 session 476 abstracts
 
Eaa2021 476 ways and capacity in archaeological data management in serbia
Eaa2021 476 ways and capacity in archaeological data management in serbiaEaa2021 476 ways and capacity in archaeological data management in serbia
Eaa2021 476 ways and capacity in archaeological data management in serbia
 
Eaa2021 476 izeta cattaneo idacordig and suquia
 Eaa2021 476 izeta cattaneo idacordig and suquia Eaa2021 476 izeta cattaneo idacordig and suquia
Eaa2021 476 izeta cattaneo idacordig and suquia
 
Eaa2021 476 preserving historic building documentation pakistan
Eaa2021 476 preserving historic building documentation  pakistanEaa2021 476 preserving historic building documentation  pakistan
Eaa2021 476 preserving historic building documentation pakistan
 
Eaa2021 s476 ariadne-seadda
Eaa2021 s476 ariadne-seaddaEaa2021 s476 ariadne-seadda
Eaa2021 s476 ariadne-seadda
 
Preferred Formats = Pre-FAIRed Formats
Preferred Formats = Pre-FAIRed FormatsPreferred Formats = Pre-FAIRed Formats
Preferred Formats = Pre-FAIRed Formats
 
Heeren pan-seadda-leiden-17mrt2020
Heeren pan-seadda-leiden-17mrt2020Heeren pan-seadda-leiden-17mrt2020
Heeren pan-seadda-leiden-17mrt2020
 
D6.1 initial report-innovation-strategy-and-targeted-activities
D6.1 initial report-innovation-strategy-and-targeted-activitiesD6.1 initial report-innovation-strategy-and-targeted-activities
D6.1 initial report-innovation-strategy-and-targeted-activities
 
ARIADNEplus Community Needs Survey - Key Results
ARIADNEplus Community Needs Survey - Key ResultsARIADNEplus Community Needs Survey - Key Results
ARIADNEplus Community Needs Survey - Key Results
 
ARIADNEplus survey-2019-report
ARIADNEplus survey-2019-reportARIADNEplus survey-2019-report
ARIADNEplus survey-2019-report
 
05 caa hasil_novak
05 caa hasil_novak05 caa hasil_novak
05 caa hasil_novak
 
04 ariadn eplus_caa2019_cnrs_open_archaeo_20190424
04 ariadn eplus_caa2019_cnrs_open_archaeo_2019042404 ariadn eplus_caa2019_cnrs_open_archaeo_20190424
04 ariadn eplus_caa2019_cnrs_open_archaeo_20190424
 
03 ariadn eplus_caa_2019_inrap
03 ariadn eplus_caa_2019_inrap03 ariadn eplus_caa_2019_inrap
03 ariadn eplus_caa_2019_inrap
 
02 2019 caa_krakowvg
02 2019 caa_krakowvg02 2019 caa_krakowvg
02 2019 caa_krakowvg
 
01 caa2019 ariadn_eplus_snd_uj_krakow 20190425
01 caa2019 ariadn_eplus_snd_uj_krakow 2019042501 caa2019 ariadn_eplus_snd_uj_krakow 20190425
01 caa2019 ariadn_eplus_snd_uj_krakow 20190425
 

Recently uploaded

PLE-statistics document for primary schs
PLE-statistics document for primary schsPLE-statistics document for primary schs
PLE-statistics document for primary schs
cnajjemba
 
Top profile Call Girls In dimapur [ 7014168258 ] Call Me For Genuine Models W...
Top profile Call Girls In dimapur [ 7014168258 ] Call Me For Genuine Models W...Top profile Call Girls In dimapur [ 7014168258 ] Call Me For Genuine Models W...
Top profile Call Girls In dimapur [ 7014168258 ] Call Me For Genuine Models W...
gajnagarg
 
Cytotec in Jeddah+966572737505) get unwanted pregnancy kit Riyadh
Cytotec in Jeddah+966572737505) get unwanted pregnancy kit RiyadhCytotec in Jeddah+966572737505) get unwanted pregnancy kit Riyadh
Cytotec in Jeddah+966572737505) get unwanted pregnancy kit Riyadh
Abortion pills in Riyadh +966572737505 get cytotec
 
Abortion pills in Jeddah | +966572737505 | Get Cytotec
Abortion pills in Jeddah | +966572737505 | Get CytotecAbortion pills in Jeddah | +966572737505 | Get Cytotec
Abortion pills in Jeddah | +966572737505 | Get Cytotec
Abortion pills in Riyadh +966572737505 get cytotec
 
Jual obat aborsi Bandung ( 085657271886 ) Cytote pil telat bulan penggugur ka...
Jual obat aborsi Bandung ( 085657271886 ) Cytote pil telat bulan penggugur ka...Jual obat aborsi Bandung ( 085657271886 ) Cytote pil telat bulan penggugur ka...
Jual obat aborsi Bandung ( 085657271886 ) Cytote pil telat bulan penggugur ka...
Klinik kandungan
 
Top profile Call Girls In Tumkur [ 7014168258 ] Call Me For Genuine Models We...
Top profile Call Girls In Tumkur [ 7014168258 ] Call Me For Genuine Models We...Top profile Call Girls In Tumkur [ 7014168258 ] Call Me For Genuine Models We...
Top profile Call Girls In Tumkur [ 7014168258 ] Call Me For Genuine Models We...
nirzagarg
 
怎样办理伦敦大学毕业证(UoL毕业证书)成绩单学校原版复制
怎样办理伦敦大学毕业证(UoL毕业证书)成绩单学校原版复制怎样办理伦敦大学毕业证(UoL毕业证书)成绩单学校原版复制
怎样办理伦敦大学毕业证(UoL毕业证书)成绩单学校原版复制
vexqp
 
Top profile Call Girls In Chandrapur [ 7014168258 ] Call Me For Genuine Model...
Top profile Call Girls In Chandrapur [ 7014168258 ] Call Me For Genuine Model...Top profile Call Girls In Chandrapur [ 7014168258 ] Call Me For Genuine Model...
Top profile Call Girls In Chandrapur [ 7014168258 ] Call Me For Genuine Model...
gajnagarg
 
+97470301568>>weed for sale in qatar ,weed for sale in dubai,weed for sale in...
+97470301568>>weed for sale in qatar ,weed for sale in dubai,weed for sale in...+97470301568>>weed for sale in qatar ,weed for sale in dubai,weed for sale in...
+97470301568>>weed for sale in qatar ,weed for sale in dubai,weed for sale in...
Health
 
Top profile Call Girls In bhavnagar [ 7014168258 ] Call Me For Genuine Models...
Top profile Call Girls In bhavnagar [ 7014168258 ] Call Me For Genuine Models...Top profile Call Girls In bhavnagar [ 7014168258 ] Call Me For Genuine Models...
Top profile Call Girls In bhavnagar [ 7014168258 ] Call Me For Genuine Models...
gajnagarg
 
Jual Cytotec Asli Obat Aborsi No. 1 Paling Manjur
Jual Cytotec Asli Obat Aborsi No. 1 Paling ManjurJual Cytotec Asli Obat Aborsi No. 1 Paling Manjur
Jual Cytotec Asli Obat Aborsi No. 1 Paling Manjur
ptikerjasaptiker
 
一比一原版(UCD毕业证书)加州大学戴维斯分校毕业证成绩单原件一模一样
一比一原版(UCD毕业证书)加州大学戴维斯分校毕业证成绩单原件一模一样一比一原版(UCD毕业证书)加州大学戴维斯分校毕业证成绩单原件一模一样
一比一原版(UCD毕业证书)加州大学戴维斯分校毕业证成绩单原件一模一样
wsppdmt
 
一比一原版(曼大毕业证书)曼尼托巴大学毕业证成绩单留信学历认证一手价格
一比一原版(曼大毕业证书)曼尼托巴大学毕业证成绩单留信学历认证一手价格一比一原版(曼大毕业证书)曼尼托巴大学毕业证成绩单留信学历认证一手价格
一比一原版(曼大毕业证书)曼尼托巴大学毕业证成绩单留信学历认证一手价格
q6pzkpark
 

Recently uploaded (20)

PLE-statistics document for primary schs
PLE-statistics document for primary schsPLE-statistics document for primary schs
PLE-statistics document for primary schs
 
Harnessing the Power of GenAI for BI and Reporting.pptx
Harnessing the Power of GenAI for BI and Reporting.pptxHarnessing the Power of GenAI for BI and Reporting.pptx
Harnessing the Power of GenAI for BI and Reporting.pptx
 
Top profile Call Girls In dimapur [ 7014168258 ] Call Me For Genuine Models W...
Top profile Call Girls In dimapur [ 7014168258 ] Call Me For Genuine Models W...Top profile Call Girls In dimapur [ 7014168258 ] Call Me For Genuine Models W...
Top profile Call Girls In dimapur [ 7014168258 ] Call Me For Genuine Models W...
 
7. Epi of Chronic respiratory diseases.ppt
7. Epi of Chronic respiratory diseases.ppt7. Epi of Chronic respiratory diseases.ppt
7. Epi of Chronic respiratory diseases.ppt
 
Cytotec in Jeddah+966572737505) get unwanted pregnancy kit Riyadh
Cytotec in Jeddah+966572737505) get unwanted pregnancy kit RiyadhCytotec in Jeddah+966572737505) get unwanted pregnancy kit Riyadh
Cytotec in Jeddah+966572737505) get unwanted pregnancy kit Riyadh
 
SAC 25 Final National, Regional & Local Angel Group Investing Insights 2024 0...
SAC 25 Final National, Regional & Local Angel Group Investing Insights 2024 0...SAC 25 Final National, Regional & Local Angel Group Investing Insights 2024 0...
SAC 25 Final National, Regional & Local Angel Group Investing Insights 2024 0...
 
Predicting HDB Resale Prices - Conducting Linear Regression Analysis With Orange
Predicting HDB Resale Prices - Conducting Linear Regression Analysis With OrangePredicting HDB Resale Prices - Conducting Linear Regression Analysis With Orange
Predicting HDB Resale Prices - Conducting Linear Regression Analysis With Orange
 
Abortion pills in Jeddah | +966572737505 | Get Cytotec
Abortion pills in Jeddah | +966572737505 | Get CytotecAbortion pills in Jeddah | +966572737505 | Get Cytotec
Abortion pills in Jeddah | +966572737505 | Get Cytotec
 
Jual obat aborsi Bandung ( 085657271886 ) Cytote pil telat bulan penggugur ka...
Jual obat aborsi Bandung ( 085657271886 ) Cytote pil telat bulan penggugur ka...Jual obat aborsi Bandung ( 085657271886 ) Cytote pil telat bulan penggugur ka...
Jual obat aborsi Bandung ( 085657271886 ) Cytote pil telat bulan penggugur ka...
 
Top profile Call Girls In Tumkur [ 7014168258 ] Call Me For Genuine Models We...
Top profile Call Girls In Tumkur [ 7014168258 ] Call Me For Genuine Models We...Top profile Call Girls In Tumkur [ 7014168258 ] Call Me For Genuine Models We...
Top profile Call Girls In Tumkur [ 7014168258 ] Call Me For Genuine Models We...
 
怎样办理伦敦大学毕业证(UoL毕业证书)成绩单学校原版复制
怎样办理伦敦大学毕业证(UoL毕业证书)成绩单学校原版复制怎样办理伦敦大学毕业证(UoL毕业证书)成绩单学校原版复制
怎样办理伦敦大学毕业证(UoL毕业证书)成绩单学校原版复制
 
Top profile Call Girls In Chandrapur [ 7014168258 ] Call Me For Genuine Model...
Top profile Call Girls In Chandrapur [ 7014168258 ] Call Me For Genuine Model...Top profile Call Girls In Chandrapur [ 7014168258 ] Call Me For Genuine Model...
Top profile Call Girls In Chandrapur [ 7014168258 ] Call Me For Genuine Model...
 
+97470301568>>weed for sale in qatar ,weed for sale in dubai,weed for sale in...
+97470301568>>weed for sale in qatar ,weed for sale in dubai,weed for sale in...+97470301568>>weed for sale in qatar ,weed for sale in dubai,weed for sale in...
+97470301568>>weed for sale in qatar ,weed for sale in dubai,weed for sale in...
 
Top profile Call Girls In bhavnagar [ 7014168258 ] Call Me For Genuine Models...
Top profile Call Girls In bhavnagar [ 7014168258 ] Call Me For Genuine Models...Top profile Call Girls In bhavnagar [ 7014168258 ] Call Me For Genuine Models...
Top profile Call Girls In bhavnagar [ 7014168258 ] Call Me For Genuine Models...
 
SR-101-01012024-EN.docx Federal Constitution of the Swiss Confederation
SR-101-01012024-EN.docx  Federal Constitution  of the Swiss ConfederationSR-101-01012024-EN.docx  Federal Constitution  of the Swiss Confederation
SR-101-01012024-EN.docx Federal Constitution of the Swiss Confederation
 
Aspirational Block Program Block Syaldey District - Almora
Aspirational Block Program Block Syaldey District - AlmoraAspirational Block Program Block Syaldey District - Almora
Aspirational Block Program Block Syaldey District - Almora
 
Jual Cytotec Asli Obat Aborsi No. 1 Paling Manjur
Jual Cytotec Asli Obat Aborsi No. 1 Paling ManjurJual Cytotec Asli Obat Aborsi No. 1 Paling Manjur
Jual Cytotec Asli Obat Aborsi No. 1 Paling Manjur
 
一比一原版(UCD毕业证书)加州大学戴维斯分校毕业证成绩单原件一模一样
一比一原版(UCD毕业证书)加州大学戴维斯分校毕业证成绩单原件一模一样一比一原版(UCD毕业证书)加州大学戴维斯分校毕业证成绩单原件一模一样
一比一原版(UCD毕业证书)加州大学戴维斯分校毕业证成绩单原件一模一样
 
5CL-ADBA,5cladba, Chinese supplier, safety is guaranteed
5CL-ADBA,5cladba, Chinese supplier, safety is guaranteed5CL-ADBA,5cladba, Chinese supplier, safety is guaranteed
5CL-ADBA,5cladba, Chinese supplier, safety is guaranteed
 
一比一原版(曼大毕业证书)曼尼托巴大学毕业证成绩单留信学历认证一手价格
一比一原版(曼大毕业证书)曼尼托巴大学毕业证成绩单留信学历认证一手价格一比一原版(曼大毕业证书)曼尼托巴大学毕业证成绩单留信学历认证一手价格
一比一原版(曼大毕业证书)曼尼托巴大学毕业证成绩单留信学历认证一手价格
 

Integrating Data for Archaeology

  • 1. ARIADNE is funded by the European Commission's Seventh Framework Programme Integrating Data for ArchaeologyIntegrating Data for Archaeology Dimitris Gavrilis, Eleni Afiontzi, Johan Fihn, Olof Olsson, Achille Felicetti, Franco Nicollucci, Sebastian Cuy Dimitris Gavrilis, Eleni Afiontzi, Johan Fihn, Olof Olsson, Achille Felicetti, Franco Nicollucci, Sebastian Cuy
  • 2. Introduction • Traditional projects in Archaeology focused on aggregating data into one single format / system – Provide users with a unified interface – Improve search and retrieval – Improve retrieval semantics through specialized metadata schemas • ARIADNE goes one step further : data integration – Try to model the domain information (ARIADNE Catalog Data Model) – Use a curation aware aggregator to enrich information using the above model – Improve user experience through more substantial and powerful queries • Traditional projects in Archaeology focused on aggregating data into one single format / system – Provide users with a unified interface – Improve search and retrieval – Improve retrieval semantics through specialized metadata schemas • ARIADNE goes one step further : data integration – Try to model the domain information (ARIADNE Catalog Data Model) – Use a curation aware aggregator to enrich information using the above model – Improve user experience through more substantial and powerful queries
  • 3. Innovation • Why hasn’t anyone done this before ? – Complexity – Performance – Domain knowledge • Standard aggregation systems / architectures are insufficient.  ARIADNE Infrastructure • Why hasn’t anyone done this before ? – Complexity – Performance – Domain knowledge • Standard aggregation systems / architectures are insufficient.  ARIADNE Infrastructure
  • 4. ARIADNE Infrastructure • Flexibility – Ingest diverse and heterogeneous data • XML, RDF, Excel, CSV, … – Handle each datastream independently and according to it’s requirements • Adapting aggregation, validation, enrichment workflows – Add new curation services easily and on demand • Flexibility – Ingest diverse and heterogeneous data • XML, RDF, Excel, CSV, … – Handle each datastream independently and according to it’s requirements • Adapting aggregation, validation, enrichment workflows – Add new curation services easily and on demand
  • 5. ARIADNE Infrastructure • Complexity – De-couple services complexity through a micro-service oriented architecture – Use loosely connecting services in a highly scalable environment. • Performance – Scalable technologies • Complexity – De-couple services complexity through a micro-service oriented architecture – Use loosely connecting services in a highly scalable environment. • Performance – Scalable technologies
  • 6. ARIADNE Infrastructure • Domain knowledge – Integrate the domain model (ACDM) into the infrastructure – Make extensive use of domain thesauri (e.g. AAT) and label every resource accordingly – Create specialized micro-services for curating content according to the domain needs • Domain knowledge – Integrate the domain model (ACDM) into the infrastructure – Make extensive use of domain thesauri (e.g. AAT) and label every resource accordingly – Create specialized micro-services for curating content according to the domain needs
  • 7. Data Integration Overall Architecture RepositoryRepository Excel SheetExcel Sheet ARIADNE Registry ARIADNE Registry ValidationValidation CleaningCleaning EnrichmentEnrichment IntegrationIntegration RDF Store (RDF) RDF Store (RDF) Elastic Search Elastic Search RDF Store (CRM) RDF Store (CRM) ArchiveArchive ARIADNE Portal ARIADNE Portal Integration Experiments Integration Experiments
  • 8. Use of RDF • Every resource is assigned a unique and persistent identifier that is resolved through a URI • Every resource has an RDF representation according to the ACDM schema • Every resource is assigned a unique and persistent identifier that is resolved through a URI • Every resource has an RDF representation according to the ACDM schema
  • 9. Data Curation • Use of curation micro-services for enriching content – Geo-normalization (identify, extract and normalize places and coordinates) – Geo-coding (e.g. Geo-names) – Thesauri mappings (map native subject terms to a common thesauri : AAT) – Temporal normalization (identify, extract and normalize dates) – Gazetteers (e.g. DAI Gazetteer) – Historical & Ancient place names identification (Pelagios & Pleiades) – Temporal information mappings (Perio.do) • Use of curation micro-services for enriching content – Geo-normalization (identify, extract and normalize places and coordinates) – Geo-coding (e.g. Geo-names) – Thesauri mappings (map native subject terms to a common thesauri : AAT) – Temporal normalization (identify, extract and normalize dates) – Gazetteers (e.g. DAI Gazetteer) – Historical & Ancient place names identification (Pelagios & Pleiades) – Temporal information mappings (Perio.do)
  • 10. Data Integration • Data Integration is based on a 3+1 dimensions – Subject – Space – Time – Resource type • Data Integration is based on a 3+1 dimensions – Subject – Space – Time – Resource type
  • 11. Identify & Link together Resource Types • Model individual information resource types (e.g. collections, bibliographic reports, databases, datasets, etc). • Identify each resources type during ingestion • Link / group different resource types – E.g. put all related heterogeneous resource types (reports, datasets,…) under the same collections • Model individual information resource types (e.g. collections, bibliographic reports, databases, datasets, etc). • Identify each resources type during ingestion • Link / group different resource types – E.g. put all related heterogeneous resource types (reports, datasets,…) under the same collections
  • 12. Thematic integration • ARIADNE uses the AAT thesaurus to semantically label ALL aggregated information. • AAT terms act as a glue and when combined with spatial and temporal information can produce great results • Semantic expansion of terms is extensively being used in order to improve retrieval. • Expansion of multi-lingual terms facilitates cross- language search without requiring automatic translation. • ARIADNE uses the AAT thesaurus to semantically label ALL aggregated information. • AAT terms act as a glue and when combined with spatial and temporal information can produce great results • Semantic expansion of terms is extensively being used in order to improve retrieval. • Expansion of multi-lingual terms facilitates cross- language search without requiring automatic translation.
  • 13. Spatial & Temporal • All resources with spatial information – Are assigned WGS84 projected coordinates • All resources with temporal information – Are normalized according the ACDM dates (that takes into account periods, period names and supports ISO date format). • All resources with spatial information – Are assigned WGS84 projected coordinates • All resources with temporal information – Are normalized according the ACDM dates (that takes into account periods, period names and supports ISO date format).
  • 14. Subject Terms Curation Lifecycle Native SubjectsNative Subjects Vocabulary Mapping Tool Vocabulary Mapping Tool MOReMORe mappings *nativeSubjects Provider Native Repository Provider Native Repository Excel Sheet XML Files Excel Sheet XML Files RegistryRegistry *nativeSubjects *providedSubjects *nativeSubjects *providedSubjects AATAAT Elastic SearchElastic SearchARIADNE PortalARIADNE Portal ACDM / Subjects (JSON) **providedSubjects **derivedSubjects **broaderGenericSubjects *nativeSubjects ACDM / Subjects (JSON) **providedSubjects **derivedSubjects **broaderGenericSubjects *nativeSubjects *mono-lingual (prefLabel only) ** multi-lingual (prefLabel & altLabel)