SlideShare a Scribd company logo
06 | xx   population
14 HELLENIC DIGITAL LIBRARIES
1. Pandektis - National Documentation Center of Greece
2. Medusa - Veria Central Public Library
3. The Historical Archives of the American Farm School of
   Thessaloniki
4. Technical Chamber of Greece Regional Department of
   Corfu
5. Central Library of NTUA
6. Music Library - Lilian Voudouri
7. Corgialenios Digital Library
14 HELLENIC DIGITAL LIBRARIES
8. University of Athens - Pergamos
9. Hellenic Ministry of Education - Educational
   Television
10.Anatolia College - Digital Archives & Special
   Collections
11.Technical Chamber of Greece - Library
12.Serres Central Public Library
13.Levadia Central Public Library
14.Athos Memory
http://aggregator.libver.gr
http://aggregator.libver.gr
http://aggregator.libver.gr
HOW EUROPEANA WORKS

‘Digitisation and online accessibility of
  European cultural material is essential in
  order to highlight that heritage, to inspire
  the creation of content and to encourage
  new online services to emerge.’
           Council of the European Union, May 2010
EUROPEANA is based on Digital
      Library Interoperability
• Enables aggregation and unified metadata-
  driven search of content
• More focused and accurate than web search
  engines (e.g., Google)
  – Unified retrieval of data for re-use in other
    applications
     • Common value-added services
  – Unified browsing / visualisation
  – Data cleaning
  – Data mining
EUROPEANA CONTENT AGGREGATION
       Horizontal Aggregators                                 Vertical Aggregators
          Archives                                            National Aggregators

                                                                    Culture Grid



               Archives Portal Europe

       Libraries
                   The European Library                                              MLAs

                                                              Regional Aggregators
                                           Dark Aggregators       Flanders museums
                                          ATHENA     ELocal
                     Film archives

European Film Gateway




                                                                   MLAs
                               Museums              MLAs
population
Hellenic Aggregator Architecture
Hellenic Aggregator Metadata
            Aggregation
• Guide the digital libraries about technical
  specifications and features that they must support
• Aggregate metadata
• Validate metadata, detect problems and suggest
  solutions
• Encode metadata according to Europeana
  standards
• Communicate with Europeana and transmit all
  metadata
Activities except from submitting
              metadata
• Disseminating the vision and objectives of Euro-
  peana to their network of institutions in order to
  increase support for and involvement with
  Europeana.
• Providing valuable feedback about the issues and
  discussions from their field.
• Promoting and implementing standards further
  along the content provision chain.
• Providing domain specific expertise and skills to
  institutions and Europeana.
Registering a new library to the
        Hellenic Aggregator
1. The digital library web site is examined by an
   expert who concludes whether it contains
   content suitable for Europeana.
2. If the digital library supports OAI-PMH,
   metadata tests are conducted, problems are
   identified and solutions are suggested.
3. If the digital library does not support OAI-PMH,
   DEiXTo software is used to harvest the required
   metadata from the target HTML pages.
Registering a new library to the
        Hellenic Aggregator
4. As soon as the digital library's metadata
   comply with the Europeana standards, it is
   registered in the Hellenic Aggregator.
5. Content Provider Agreement is signed by the
   digital library director.
6. The digital library content is published in
   Europeana.
openarchivesengine.com
The Hellenic Aggregator Software Platform
• Our special software capable of metadata aggregation,
  management and dissemination via OAI-PMH.
• Developed using Open source technologies
   •   PHP, cakePHP framework
   •   Mysql
   •   Sphinx Search
   •   Nginx web server
• Very scalable, has been tested with 150 libraries and 4
  million records ( http://www.libsearch.com )
• Also powers http://openarchives.gr
• In development and production since 2006
openarchivesengine.com
The Hellenic Aggregator Software Platform
• OAI-PMH Client - Retrieve and manage metadata from
  any digital library supporting OAI-PMH (e.g.. DSpace,
  eprints, fedora, CDS Invenio, OpenJournalSystems).
• Validate metadata according to standards (Europeana
  and other)
• Support Dublin Core, Europeana Semantic Elements
  and able to support more if required.
• Capable of normalizing metadata & fixing problems in
  order to be compliant with Europeana
• OAI-PMH Server - publish content via OAI-PMH + ESE
  to Europeana and other interested 3rd parties.
Hellenic Aggregator Architecture
OAIPMH.com features
• Validation of OAI-PMH enabled digital library
  in real time. Easily detect errors in all OAI-
  PMH commands and results.
• Metadata extraction from multiple libraries
  via OAI-PMH in XML rapidly and easily, thus
  enabling easy inspection, evaluation and other
  potential uses.
OAIPMH.com benefits
• Strict DC and ESE compliance is necessary.
• Checking the OAI-PMH support of a library is
  difficult especially when dealing with a large
  number of libraries.
• Automates and improves validation of new and
  existing OAI-PMH enabled libraries.
• Administrators are able to evaluate digital
  libraries using a quick and intuitive tool.
• Free access to all.
Current users and future work
• Regular users of OAIPMH.com include:
  – The Hellenic Aggregator
  – Openarchives.gr - Greek digital libraries search engine
  – Many users from Spain, Bulgaria and Cyprus
• Future work:
  – Add more validation rules
  – Support more metadata formats (such as Europeana
    Data Model)
  – Create a public API to encourage third-party usage
Dspace support
• Dspace is the most common digital library
  software in Greece (and abroad)
• We have developed 2 dspace plugins:
  1. Automated ESE schema & fields addition plugin
     (batch insert of ESE fields in existing DC records)
  2. Dspace ESE Crosswalk plugin
• We have developed a PHP script to batch
  insert ESE elements to Europeana
Dspace ESE support quick guide
1. Use the Europeana XML Namespace
   http://europeana.eu/schemas/ese/ and
   augment existing systems’ configuration in
   order to support ESE
2. Populate repository records with ESE
   metadata (optionally use the plugin)
3. Use the DSpace Crosswalks Plugin to support
   OAI-PMH ESE, freely available at
   http://vbanos.gr/?p=189
More info: http://blog.libver.gr/edlocal/
DEiXTo web content data extraction
• DEiXTo is a powerful web data
  extraction tool that is based on the
  W3C Document Object Model
  (DOM). It allows users to create
  highly accurate "extraction rules"
  (wrappers) that describe what pieces
  of data to scrape from a website.
DEiXTo Architecture

                          Web Pages                   DB


                ΔEiXTo

 ΙΕ parser &             executor        Extracted
                                        Information
render engine                                                  Published Data



model builder      extraction rules




                   extraction rules
                                            ΔEiXToBots
                                      (customized executors)
DEiXTo features
• Powerful web data extraction tool
  – Freeware GUI tool (built with Turbo Delphi, Windows-
    only)
  – Free, cross-platform Command Line Executor (in Perl)
  – DEiXToBot agent (implemented in Perl)
• W3C Document Object Model (DOM)
  – DOM-based extraction rules (wrappers).
• Extracted data can be exported to a wide variety
  of formats (tab delimited, XML, RSS, etc).
DEiXTo Corgialenios Library use case
VANGELIS BANOS
Email: vbanos@gmail.com
Web: http://vbanos.gr

Useful pages:
• http://aggregator.libver.gr
• http://blog.libver.gr/edlocal/
• http://openarchivesengine.com
• http://oaipmh.com
• http://www.deixto.com
                                   QUESTIONS?

More Related Content

What's hot

LoCloud: Local Cultural Heritage Online and in the Cloud
LoCloud: Local Cultural Heritage Online and in the CloudLoCloud: Local Cultural Heritage Online and in the Cloud
LoCloud: Local Cultural Heritage Online and in the Cloud
locloud
 
On The European (Digital) Library, 03-04-2007, Library of Congress, Washingto...
On The European (Digital) Library, 03-04-2007, Library of Congress, Washingto...On The European (Digital) Library, 03-04-2007, Library of Congress, Washingto...
On The European (Digital) Library, 03-04-2007, Library of Congress, Washingto...
Olaf Janssen
 
LoCloud: Local Content in a Europeana Cloud
LoCloud: Local Content in a Europeana CloudLoCloud: Local Content in a Europeana Cloud
LoCloud: Local Content in a Europeana Cloud
locloud
 
The LoCloud lightweight digital library and alternative content sources, Adam...
The LoCloud lightweight digital library and alternative content sources, Adam...The LoCloud lightweight digital library and alternative content sources, Adam...
The LoCloud lightweight digital library and alternative content sources, Adam...
locloud
 
Limo for the LIBIS network
Limo for the LIBIS networkLimo for the LIBIS network
Limo for the LIBIS networkveerlek
 
Increasing Visibility of Cultural Heritage Objects: A Case of Turkish Conten...
Increasing Visibility of Cultural Heritage Objects:  A Case of Turkish Conten...Increasing Visibility of Cultural Heritage Objects:  A Case of Turkish Conten...
Increasing Visibility of Cultural Heritage Objects: A Case of Turkish Conten...
locloud
 
ALIADA Project. AtCult
ALIADA Project. AtCultALIADA Project. AtCult
ALIADA Project. AtCult
aliada project
 
Local content in a Europeana cloud for small & medium content providers
Local content in a Europeana cloud for small & medium content providersLocal content in a Europeana cloud for small & medium content providers
Local content in a Europeana cloud for small & medium content providers
locloud
 
Session 3: Vocabulary enrichment, Gerda Koch
Session 3: Vocabulary enrichment, Gerda KochSession 3: Vocabulary enrichment, Gerda Koch
Session 3: Vocabulary enrichment, Gerda Koch
locloud
 
Entity Management at Europeana - DCMI 2021
Entity Management at Europeana - DCMI 2021Entity Management at Europeana - DCMI 2021
Entity Management at Europeana - DCMI 2021
Antoine Isaac
 
Workshop: Concluding Remarks
Workshop: Concluding RemarksWorkshop: Concluding Remarks
Workshop: Concluding Remarks
locloud
 
EDM - American Art Collaborative LOD Meeting
EDM - American Art Collaborative LOD MeetingEDM - American Art Collaborative LOD Meeting
EDM - American Art Collaborative LOD Meeting
Antoine Isaac
 
Validation of Europeana data: application profile, OWL ontology, or else?
Validation of Europeana data: application profile, OWL ontology, or else?Validation of Europeana data: application profile, OWL ontology, or else?
Validation of Europeana data: application profile, OWL ontology, or else?
Antoine Isaac
 

What's hot (14)

LoCloud: Local Cultural Heritage Online and in the Cloud
LoCloud: Local Cultural Heritage Online and in the CloudLoCloud: Local Cultural Heritage Online and in the Cloud
LoCloud: Local Cultural Heritage Online and in the Cloud
 
On The European (Digital) Library, 03-04-2007, Library of Congress, Washingto...
On The European (Digital) Library, 03-04-2007, Library of Congress, Washingto...On The European (Digital) Library, 03-04-2007, Library of Congress, Washingto...
On The European (Digital) Library, 03-04-2007, Library of Congress, Washingto...
 
LoCloud: Local Content in a Europeana Cloud
LoCloud: Local Content in a Europeana CloudLoCloud: Local Content in a Europeana Cloud
LoCloud: Local Content in a Europeana Cloud
 
The LoCloud lightweight digital library and alternative content sources, Adam...
The LoCloud lightweight digital library and alternative content sources, Adam...The LoCloud lightweight digital library and alternative content sources, Adam...
The LoCloud lightweight digital library and alternative content sources, Adam...
 
Limo for the LIBIS network
Limo for the LIBIS networkLimo for the LIBIS network
Limo for the LIBIS network
 
Increasing Visibility of Cultural Heritage Objects: A Case of Turkish Conten...
Increasing Visibility of Cultural Heritage Objects:  A Case of Turkish Conten...Increasing Visibility of Cultural Heritage Objects:  A Case of Turkish Conten...
Increasing Visibility of Cultural Heritage Objects: A Case of Turkish Conten...
 
ALIADA Project. AtCult
ALIADA Project. AtCultALIADA Project. AtCult
ALIADA Project. AtCult
 
Local content in a Europeana cloud for small & medium content providers
Local content in a Europeana cloud for small & medium content providersLocal content in a Europeana cloud for small & medium content providers
Local content in a Europeana cloud for small & medium content providers
 
Session 3: Vocabulary enrichment, Gerda Koch
Session 3: Vocabulary enrichment, Gerda KochSession 3: Vocabulary enrichment, Gerda Koch
Session 3: Vocabulary enrichment, Gerda Koch
 
All WP Meeting Athens - Europeana Inside - Gordon McKenna
All WP Meeting Athens - Europeana Inside - Gordon McKennaAll WP Meeting Athens - Europeana Inside - Gordon McKenna
All WP Meeting Athens - Europeana Inside - Gordon McKenna
 
Entity Management at Europeana - DCMI 2021
Entity Management at Europeana - DCMI 2021Entity Management at Europeana - DCMI 2021
Entity Management at Europeana - DCMI 2021
 
Workshop: Concluding Remarks
Workshop: Concluding RemarksWorkshop: Concluding Remarks
Workshop: Concluding Remarks
 
EDM - American Art Collaborative LOD Meeting
EDM - American Art Collaborative LOD MeetingEDM - American Art Collaborative LOD Meeting
EDM - American Art Collaborative LOD Meeting
 
Validation of Europeana data: application profile, OWL ontology, or else?
Validation of Europeana data: application profile, OWL ontology, or else?Validation of Europeana data: application profile, OWL ontology, or else?
Validation of Europeana data: application profile, OWL ontology, or else?
 

Similar to The Hellenic Aggregator - Overview, procedures & the cooperation with Europeana

Europeana Cloud Aggregator Forum 2014
Europeana Cloud Aggregator Forum 2014Europeana Cloud Aggregator Forum 2014
Europeana Cloud Aggregator Forum 2014
Europeana
 
European databases in cultural heritage: making connections
European databases in cultural heritage: making connectionsEuropean databases in cultural heritage: making connections
European databases in cultural heritage: making connections
CARARE
 
Europeana as a Linked Data (Quality) case
Europeana as a Linked Data (Quality) caseEuropeana as a Linked Data (Quality) case
Europeana as a Linked Data (Quality) case
Antoine Isaac
 
Designing a multilingual knowledge graph - DCMI2018
Designing a multilingual knowledge graph - DCMI2018Designing a multilingual knowledge graph - DCMI2018
Designing a multilingual knowledge graph - DCMI2018
Antoine Isaac
 
LoCloud: overview of LoCloud Services
LoCloud: overview of LoCloud ServicesLoCloud: overview of LoCloud Services
LoCloud: overview of LoCloud Services
locloud
 
Metadata Aggregation: Assessing the Application of IIIF and Sitemaps within C...
Metadata Aggregation: Assessing the Application of IIIF and Sitemaps within C...Metadata Aggregation: Assessing the Application of IIIF and Sitemaps within C...
Metadata Aggregation: Assessing the Application of IIIF and Sitemaps within C...
Nuno Freire
 
The ABES Discovery Study
The ABES Discovery StudyThe ABES Discovery Study
The ABES Discovery Study
ABES
 
Technion IR: Institutional Repository with DSpace
Technion IR: Institutional Repository with DSpaceTechnion IR: Institutional Repository with DSpace
Technion IR: Institutional Repository with DSpace
Elena Yaroshenko
 
Shared Shelf: Media Management Software that Facilitates Access to Your Colle...
Shared Shelf: Media Management Software that Facilitates Access to Your Colle...Shared Shelf: Media Management Software that Facilitates Access to Your Colle...
Shared Shelf: Media Management Software that Facilitates Access to Your Colle...
ARTstor-Shared_Shelf
 
December 2, 2015: NISO/NFAIS Virtual Conference: Semantic Web: What's New and...
December 2, 2015: NISO/NFAIS Virtual Conference: Semantic Web: What's New and...December 2, 2015: NISO/NFAIS Virtual Conference: Semantic Web: What's New and...
December 2, 2015: NISO/NFAIS Virtual Conference: Semantic Web: What's New and...
DeVonne Parks, CEM
 
Rio Info 2009 - Europeana - Bram van der Werf
Rio Info 2009 - Europeana - Bram van der WerfRio Info 2009 - Europeana - Bram van der Werf
Rio Info 2009 - Europeana - Bram van der Werf
Rio Info
 
Introduction to Europeana Inside
Introduction to Europeana InsideIntroduction to Europeana Inside
Introduction to Europeana Inside
Nicholas Poole
 
EuropeanaConnect - Enhancing User Access to European Digital Heritage
EuropeanaConnect - Enhancing User Access to European Digital HeritageEuropeanaConnect - Enhancing User Access to European Digital Heritage
EuropeanaConnect - Enhancing User Access to European Digital HeritageMax Kaiser
 
AAC Education Session
AAC Education Session AAC Education Session
AAC Education Session
Antoine Isaac
 
Documents, services, and data on the web
Documents, services, and data on the webDocuments, services, and data on the web
Documents, services, and data on the web
Chiara Del Vescovo
 
Breeding 1
Breeding 1Breeding 1
Sharing Cultural Heritage Online with LoCloud: workshop
Sharing Cultural Heritage Online with LoCloud: workshopSharing Cultural Heritage Online with LoCloud: workshop
Sharing Cultural Heritage Online with LoCloud: workshop
locloud
 
Europeana vision - Web as Literature 2013
Europeana vision - Web as Literature 2013Europeana vision - Web as Literature 2013
Europeana vision - Web as Literature 2013
Antoine Isaac
 
Archiving the French Web: the BnF web archiving workflow. Sara Aubry
Archiving the French Web: the BnF web archiving workflow. Sara AubryArchiving the French Web: the BnF web archiving workflow. Sara Aubry
Archiving the French Web: the BnF web archiving workflow. Sara Aubry
Biblioteca Nacional de España
 

Similar to The Hellenic Aggregator - Overview, procedures & the cooperation with Europeana (20)

Europeana Cloud Aggregator Forum 2014
Europeana Cloud Aggregator Forum 2014Europeana Cloud Aggregator Forum 2014
Europeana Cloud Aggregator Forum 2014
 
European databases in cultural heritage: making connections
European databases in cultural heritage: making connectionsEuropean databases in cultural heritage: making connections
European databases in cultural heritage: making connections
 
Europeana as a Linked Data (Quality) case
Europeana as a Linked Data (Quality) caseEuropeana as a Linked Data (Quality) case
Europeana as a Linked Data (Quality) case
 
Designing a multilingual knowledge graph - DCMI2018
Designing a multilingual knowledge graph - DCMI2018Designing a multilingual knowledge graph - DCMI2018
Designing a multilingual knowledge graph - DCMI2018
 
LoCloud: overview of LoCloud Services
LoCloud: overview of LoCloud ServicesLoCloud: overview of LoCloud Services
LoCloud: overview of LoCloud Services
 
Europeana datainaction nov2012
Europeana datainaction nov2012Europeana datainaction nov2012
Europeana datainaction nov2012
 
Metadata Aggregation: Assessing the Application of IIIF and Sitemaps within C...
Metadata Aggregation: Assessing the Application of IIIF and Sitemaps within C...Metadata Aggregation: Assessing the Application of IIIF and Sitemaps within C...
Metadata Aggregation: Assessing the Application of IIIF and Sitemaps within C...
 
The ABES Discovery Study
The ABES Discovery StudyThe ABES Discovery Study
The ABES Discovery Study
 
Technion IR: Institutional Repository with DSpace
Technion IR: Institutional Repository with DSpaceTechnion IR: Institutional Repository with DSpace
Technion IR: Institutional Repository with DSpace
 
Shared Shelf: Media Management Software that Facilitates Access to Your Colle...
Shared Shelf: Media Management Software that Facilitates Access to Your Colle...Shared Shelf: Media Management Software that Facilitates Access to Your Colle...
Shared Shelf: Media Management Software that Facilitates Access to Your Colle...
 
December 2, 2015: NISO/NFAIS Virtual Conference: Semantic Web: What's New and...
December 2, 2015: NISO/NFAIS Virtual Conference: Semantic Web: What's New and...December 2, 2015: NISO/NFAIS Virtual Conference: Semantic Web: What's New and...
December 2, 2015: NISO/NFAIS Virtual Conference: Semantic Web: What's New and...
 
Rio Info 2009 - Europeana - Bram van der Werf
Rio Info 2009 - Europeana - Bram van der WerfRio Info 2009 - Europeana - Bram van der Werf
Rio Info 2009 - Europeana - Bram van der Werf
 
Introduction to Europeana Inside
Introduction to Europeana InsideIntroduction to Europeana Inside
Introduction to Europeana Inside
 
EuropeanaConnect - Enhancing User Access to European Digital Heritage
EuropeanaConnect - Enhancing User Access to European Digital HeritageEuropeanaConnect - Enhancing User Access to European Digital Heritage
EuropeanaConnect - Enhancing User Access to European Digital Heritage
 
AAC Education Session
AAC Education Session AAC Education Session
AAC Education Session
 
Documents, services, and data on the web
Documents, services, and data on the webDocuments, services, and data on the web
Documents, services, and data on the web
 
Breeding 1
Breeding 1Breeding 1
Breeding 1
 
Sharing Cultural Heritage Online with LoCloud: workshop
Sharing Cultural Heritage Online with LoCloud: workshopSharing Cultural Heritage Online with LoCloud: workshop
Sharing Cultural Heritage Online with LoCloud: workshop
 
Europeana vision - Web as Literature 2013
Europeana vision - Web as Literature 2013Europeana vision - Web as Literature 2013
Europeana vision - Web as Literature 2013
 
Archiving the French Web: the BnF web archiving workflow. Sara Aubry
Archiving the French Web: the BnF web archiving workflow. Sara AubryArchiving the French Web: the BnF web archiving workflow. Sara Aubry
Archiving the French Web: the BnF web archiving workflow. Sara Aubry
 

More from Vangelis Banos

Website Archivability - Library of Congress NDIIPP Presentation 2015/06/03
Website Archivability - Library of Congress NDIIPP Presentation 2015/06/03Website Archivability - Library of Congress NDIIPP Presentation 2015/06/03
Website Archivability - Library of Congress NDIIPP Presentation 2015/06/03
Vangelis Banos
 
Υπερδιαύγεια - Αναζήτηση στα δημόσια δεδομένα
Υπερδιαύγεια - Αναζήτηση στα δημόσια δεδομέναΥπερδιαύγεια - Αναζήτηση στα δημόσια δεδομένα
Υπερδιαύγεια - Αναζήτηση στα δημόσια δεδομένα
Vangelis Banos
 
BlogForever Crawler: Techniques and algorithms to harvest modern weblogs Pres...
BlogForever Crawler: Techniques and algorithms to harvest modern weblogs Pres...BlogForever Crawler: Techniques and algorithms to harvest modern weblogs Pres...
BlogForever Crawler: Techniques and algorithms to harvest modern weblogs Pres...
Vangelis Banos
 
Can you save the web? Web Archiving!
Can you save the web? Web Archiving!Can you save the web? Web Archiving!
Can you save the web? Web Archiving!
Vangelis Banos
 
Αποθηκεύεται το διαδίκτυο; Web Archiving!
Αποθηκεύεται το διαδίκτυο; Web Archiving!Αποθηκεύεται το διαδίκτυο; Web Archiving!
Αποθηκεύεται το διαδίκτυο; Web Archiving!
Vangelis Banos
 
The theory and practice of Website Archivability
The theory and practice of Website ArchivabilityThe theory and practice of Website Archivability
The theory and practice of Website ArchivabilityVangelis Banos
 
CLEAR: a Credible Live Evaluation Method of Website Archivability, iPRES2013
CLEAR: a Credible Live Evaluation Method of Website Archivability, iPRES2013CLEAR: a Credible Live Evaluation Method of Website Archivability, iPRES2013
CLEAR: a Credible Live Evaluation Method of Website Archivability, iPRES2013
Vangelis Banos
 
ΥπερΔιαύγεια
ΥπερΔιαύγειαΥπερΔιαύγεια
ΥπερΔιαύγεια
Vangelis Banos
 
Η Ιστορία της Μετρολογίας
Η Ιστορία της ΜετρολογίαςΗ Ιστορία της Μετρολογίας
Η Ιστορία της Μετρολογίας
Vangelis Banos
 
Ο κόσμος των μικρών & των μεγάλων μέσα από το βλέμμα της κας Μετρολογίας
Ο κόσμος των μικρών & των μεγάλων μέσα από το βλέμμα της κας ΜετρολογίαςΟ κόσμος των μικρών & των μεγάλων μέσα από το βλέμμα της κας Μετρολογίας
Ο κόσμος των μικρών & των μεγάλων μέσα από το βλέμμα της κας Μετρολογίας
Vangelis Banos
 
Heterogeneity in european digital libraries, the europeana challenge
Heterogeneity in european digital libraries, the europeana challengeHeterogeneity in european digital libraries, the europeana challenge
Heterogeneity in european digital libraries, the europeana challenge
Vangelis Banos
 
Επιτυχημένα παραδείγματα διαλειτουργικότητας σε ελληνικά αποθετήρια και σχε...
Επιτυχημένα παραδείγματα διαλειτουργικότητας  σε ελληνικά αποθετήρια  και σχε...Επιτυχημένα παραδείγματα διαλειτουργικότητας  σε ελληνικά αποθετήρια  και σχε...
Επιτυχημένα παραδείγματα διαλειτουργικότητας σε ελληνικά αποθετήρια και σχε...
Vangelis Banos
 
Η τεχνική υποδομή του εθνικού συσσωρευτή
Η τεχνική υποδομή του εθνικού συσσωρευτήΗ τεχνική υποδομή του εθνικού συσσωρευτή
Η τεχνική υποδομή του εθνικού συσσωρευτήVangelis Banos
 

More from Vangelis Banos (13)

Website Archivability - Library of Congress NDIIPP Presentation 2015/06/03
Website Archivability - Library of Congress NDIIPP Presentation 2015/06/03Website Archivability - Library of Congress NDIIPP Presentation 2015/06/03
Website Archivability - Library of Congress NDIIPP Presentation 2015/06/03
 
Υπερδιαύγεια - Αναζήτηση στα δημόσια δεδομένα
Υπερδιαύγεια - Αναζήτηση στα δημόσια δεδομέναΥπερδιαύγεια - Αναζήτηση στα δημόσια δεδομένα
Υπερδιαύγεια - Αναζήτηση στα δημόσια δεδομένα
 
BlogForever Crawler: Techniques and algorithms to harvest modern weblogs Pres...
BlogForever Crawler: Techniques and algorithms to harvest modern weblogs Pres...BlogForever Crawler: Techniques and algorithms to harvest modern weblogs Pres...
BlogForever Crawler: Techniques and algorithms to harvest modern weblogs Pres...
 
Can you save the web? Web Archiving!
Can you save the web? Web Archiving!Can you save the web? Web Archiving!
Can you save the web? Web Archiving!
 
Αποθηκεύεται το διαδίκτυο; Web Archiving!
Αποθηκεύεται το διαδίκτυο; Web Archiving!Αποθηκεύεται το διαδίκτυο; Web Archiving!
Αποθηκεύεται το διαδίκτυο; Web Archiving!
 
The theory and practice of Website Archivability
The theory and practice of Website ArchivabilityThe theory and practice of Website Archivability
The theory and practice of Website Archivability
 
CLEAR: a Credible Live Evaluation Method of Website Archivability, iPRES2013
CLEAR: a Credible Live Evaluation Method of Website Archivability, iPRES2013CLEAR: a Credible Live Evaluation Method of Website Archivability, iPRES2013
CLEAR: a Credible Live Evaluation Method of Website Archivability, iPRES2013
 
ΥπερΔιαύγεια
ΥπερΔιαύγειαΥπερΔιαύγεια
ΥπερΔιαύγεια
 
Η Ιστορία της Μετρολογίας
Η Ιστορία της ΜετρολογίαςΗ Ιστορία της Μετρολογίας
Η Ιστορία της Μετρολογίας
 
Ο κόσμος των μικρών & των μεγάλων μέσα από το βλέμμα της κας Μετρολογίας
Ο κόσμος των μικρών & των μεγάλων μέσα από το βλέμμα της κας ΜετρολογίαςΟ κόσμος των μικρών & των μεγάλων μέσα από το βλέμμα της κας Μετρολογίας
Ο κόσμος των μικρών & των μεγάλων μέσα από το βλέμμα της κας Μετρολογίας
 
Heterogeneity in european digital libraries, the europeana challenge
Heterogeneity in european digital libraries, the europeana challengeHeterogeneity in european digital libraries, the europeana challenge
Heterogeneity in european digital libraries, the europeana challenge
 
Επιτυχημένα παραδείγματα διαλειτουργικότητας σε ελληνικά αποθετήρια και σχε...
Επιτυχημένα παραδείγματα διαλειτουργικότητας  σε ελληνικά αποθετήρια  και σχε...Επιτυχημένα παραδείγματα διαλειτουργικότητας  σε ελληνικά αποθετήρια  και σχε...
Επιτυχημένα παραδείγματα διαλειτουργικότητας σε ελληνικά αποθετήρια και σχε...
 
Η τεχνική υποδομή του εθνικού συσσωρευτή
Η τεχνική υποδομή του εθνικού συσσωρευτήΗ τεχνική υποδομή του εθνικού συσσωρευτή
Η τεχνική υποδομή του εθνικού συσσωρευτή
 

Recently uploaded

Encryption in Microsoft 365 - ExpertsLive Netherlands 2024
Encryption in Microsoft 365 - ExpertsLive Netherlands 2024Encryption in Microsoft 365 - ExpertsLive Netherlands 2024
Encryption in Microsoft 365 - ExpertsLive Netherlands 2024
Albert Hoitingh
 
National Security Agency - NSA mobile device best practices
National Security Agency - NSA mobile device best practicesNational Security Agency - NSA mobile device best practices
National Security Agency - NSA mobile device best practices
Quotidiano Piemontese
 
Introduction to CHERI technology - Cybersecurity
Introduction to CHERI technology - CybersecurityIntroduction to CHERI technology - Cybersecurity
Introduction to CHERI technology - Cybersecurity
mikeeftimakis1
 
Goodbye Windows 11: Make Way for Nitrux Linux 3.5.0!
Goodbye Windows 11: Make Way for Nitrux Linux 3.5.0!Goodbye Windows 11: Make Way for Nitrux Linux 3.5.0!
Goodbye Windows 11: Make Way for Nitrux Linux 3.5.0!
SOFTTECHHUB
 
Essentials of Automations: The Art of Triggers and Actions in FME
Essentials of Automations: The Art of Triggers and Actions in FMEEssentials of Automations: The Art of Triggers and Actions in FME
Essentials of Automations: The Art of Triggers and Actions in FME
Safe Software
 
Enchancing adoption of Open Source Libraries. A case study on Albumentations.AI
Enchancing adoption of Open Source Libraries. A case study on Albumentations.AIEnchancing adoption of Open Source Libraries. A case study on Albumentations.AI
Enchancing adoption of Open Source Libraries. A case study on Albumentations.AI
Vladimir Iglovikov, Ph.D.
 
By Design, not by Accident - Agile Venture Bolzano 2024
By Design, not by Accident - Agile Venture Bolzano 2024By Design, not by Accident - Agile Venture Bolzano 2024
By Design, not by Accident - Agile Venture Bolzano 2024
Pierluigi Pugliese
 
GraphRAG is All You need? LLM & Knowledge Graph
GraphRAG is All You need? LLM & Knowledge GraphGraphRAG is All You need? LLM & Knowledge Graph
GraphRAG is All You need? LLM & Knowledge Graph
Guy Korland
 
The Art of the Pitch: WordPress Relationships and Sales
The Art of the Pitch: WordPress Relationships and SalesThe Art of the Pitch: WordPress Relationships and Sales
The Art of the Pitch: WordPress Relationships and Sales
Laura Byrne
 
Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...
Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...
Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...
SOFTTECHHUB
 
Removing Uninteresting Bytes in Software Fuzzing
Removing Uninteresting Bytes in Software FuzzingRemoving Uninteresting Bytes in Software Fuzzing
Removing Uninteresting Bytes in Software Fuzzing
Aftab Hussain
 
20240609 QFM020 Irresponsible AI Reading List May 2024
20240609 QFM020 Irresponsible AI Reading List May 202420240609 QFM020 Irresponsible AI Reading List May 2024
20240609 QFM020 Irresponsible AI Reading List May 2024
Matthew Sinclair
 
Alt. GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using ...
Alt. GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using ...Alt. GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using ...
Alt. GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using ...
James Anderson
 
Secstrike : Reverse Engineering & Pwnable tools for CTF.pptx
Secstrike : Reverse Engineering & Pwnable tools for CTF.pptxSecstrike : Reverse Engineering & Pwnable tools for CTF.pptx
Secstrike : Reverse Engineering & Pwnable tools for CTF.pptx
nkrafacyberclub
 
UiPath Test Automation using UiPath Test Suite series, part 5
UiPath Test Automation using UiPath Test Suite series, part 5UiPath Test Automation using UiPath Test Suite series, part 5
UiPath Test Automation using UiPath Test Suite series, part 5
DianaGray10
 
Mind map of terminologies used in context of Generative AI
Mind map of terminologies used in context of Generative AIMind map of terminologies used in context of Generative AI
Mind map of terminologies used in context of Generative AI
Kumud Singh
 
Artificial Intelligence for XMLDevelopment
Artificial Intelligence for XMLDevelopmentArtificial Intelligence for XMLDevelopment
Artificial Intelligence for XMLDevelopment
Octavian Nadolu
 
GraphSummit Singapore | The Art of the Possible with Graph - Q2 2024
GraphSummit Singapore | The Art of the  Possible with Graph - Q2 2024GraphSummit Singapore | The Art of the  Possible with Graph - Q2 2024
GraphSummit Singapore | The Art of the Possible with Graph - Q2 2024
Neo4j
 
PCI PIN Basics Webinar from the Controlcase Team
PCI PIN Basics Webinar from the Controlcase TeamPCI PIN Basics Webinar from the Controlcase Team
PCI PIN Basics Webinar from the Controlcase Team
ControlCase
 
UiPath Test Automation using UiPath Test Suite series, part 6
UiPath Test Automation using UiPath Test Suite series, part 6UiPath Test Automation using UiPath Test Suite series, part 6
UiPath Test Automation using UiPath Test Suite series, part 6
DianaGray10
 

Recently uploaded (20)

Encryption in Microsoft 365 - ExpertsLive Netherlands 2024
Encryption in Microsoft 365 - ExpertsLive Netherlands 2024Encryption in Microsoft 365 - ExpertsLive Netherlands 2024
Encryption in Microsoft 365 - ExpertsLive Netherlands 2024
 
National Security Agency - NSA mobile device best practices
National Security Agency - NSA mobile device best practicesNational Security Agency - NSA mobile device best practices
National Security Agency - NSA mobile device best practices
 
Introduction to CHERI technology - Cybersecurity
Introduction to CHERI technology - CybersecurityIntroduction to CHERI technology - Cybersecurity
Introduction to CHERI technology - Cybersecurity
 
Goodbye Windows 11: Make Way for Nitrux Linux 3.5.0!
Goodbye Windows 11: Make Way for Nitrux Linux 3.5.0!Goodbye Windows 11: Make Way for Nitrux Linux 3.5.0!
Goodbye Windows 11: Make Way for Nitrux Linux 3.5.0!
 
Essentials of Automations: The Art of Triggers and Actions in FME
Essentials of Automations: The Art of Triggers and Actions in FMEEssentials of Automations: The Art of Triggers and Actions in FME
Essentials of Automations: The Art of Triggers and Actions in FME
 
Enchancing adoption of Open Source Libraries. A case study on Albumentations.AI
Enchancing adoption of Open Source Libraries. A case study on Albumentations.AIEnchancing adoption of Open Source Libraries. A case study on Albumentations.AI
Enchancing adoption of Open Source Libraries. A case study on Albumentations.AI
 
By Design, not by Accident - Agile Venture Bolzano 2024
By Design, not by Accident - Agile Venture Bolzano 2024By Design, not by Accident - Agile Venture Bolzano 2024
By Design, not by Accident - Agile Venture Bolzano 2024
 
GraphRAG is All You need? LLM & Knowledge Graph
GraphRAG is All You need? LLM & Knowledge GraphGraphRAG is All You need? LLM & Knowledge Graph
GraphRAG is All You need? LLM & Knowledge Graph
 
The Art of the Pitch: WordPress Relationships and Sales
The Art of the Pitch: WordPress Relationships and SalesThe Art of the Pitch: WordPress Relationships and Sales
The Art of the Pitch: WordPress Relationships and Sales
 
Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...
Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...
Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...
 
Removing Uninteresting Bytes in Software Fuzzing
Removing Uninteresting Bytes in Software FuzzingRemoving Uninteresting Bytes in Software Fuzzing
Removing Uninteresting Bytes in Software Fuzzing
 
20240609 QFM020 Irresponsible AI Reading List May 2024
20240609 QFM020 Irresponsible AI Reading List May 202420240609 QFM020 Irresponsible AI Reading List May 2024
20240609 QFM020 Irresponsible AI Reading List May 2024
 
Alt. GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using ...
Alt. GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using ...Alt. GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using ...
Alt. GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using ...
 
Secstrike : Reverse Engineering & Pwnable tools for CTF.pptx
Secstrike : Reverse Engineering & Pwnable tools for CTF.pptxSecstrike : Reverse Engineering & Pwnable tools for CTF.pptx
Secstrike : Reverse Engineering & Pwnable tools for CTF.pptx
 
UiPath Test Automation using UiPath Test Suite series, part 5
UiPath Test Automation using UiPath Test Suite series, part 5UiPath Test Automation using UiPath Test Suite series, part 5
UiPath Test Automation using UiPath Test Suite series, part 5
 
Mind map of terminologies used in context of Generative AI
Mind map of terminologies used in context of Generative AIMind map of terminologies used in context of Generative AI
Mind map of terminologies used in context of Generative AI
 
Artificial Intelligence for XMLDevelopment
Artificial Intelligence for XMLDevelopmentArtificial Intelligence for XMLDevelopment
Artificial Intelligence for XMLDevelopment
 
GraphSummit Singapore | The Art of the Possible with Graph - Q2 2024
GraphSummit Singapore | The Art of the  Possible with Graph - Q2 2024GraphSummit Singapore | The Art of the  Possible with Graph - Q2 2024
GraphSummit Singapore | The Art of the Possible with Graph - Q2 2024
 
PCI PIN Basics Webinar from the Controlcase Team
PCI PIN Basics Webinar from the Controlcase TeamPCI PIN Basics Webinar from the Controlcase Team
PCI PIN Basics Webinar from the Controlcase Team
 
UiPath Test Automation using UiPath Test Suite series, part 6
UiPath Test Automation using UiPath Test Suite series, part 6UiPath Test Automation using UiPath Test Suite series, part 6
UiPath Test Automation using UiPath Test Suite series, part 6
 

The Hellenic Aggregator - Overview, procedures & the cooperation with Europeana

  • 1.
  • 2. 06 | xx population
  • 3. 14 HELLENIC DIGITAL LIBRARIES 1. Pandektis - National Documentation Center of Greece 2. Medusa - Veria Central Public Library 3. The Historical Archives of the American Farm School of Thessaloniki 4. Technical Chamber of Greece Regional Department of Corfu 5. Central Library of NTUA 6. Music Library - Lilian Voudouri 7. Corgialenios Digital Library
  • 4. 14 HELLENIC DIGITAL LIBRARIES 8. University of Athens - Pergamos 9. Hellenic Ministry of Education - Educational Television 10.Anatolia College - Digital Archives & Special Collections 11.Technical Chamber of Greece - Library 12.Serres Central Public Library 13.Levadia Central Public Library 14.Athos Memory
  • 8. HOW EUROPEANA WORKS ‘Digitisation and online accessibility of European cultural material is essential in order to highlight that heritage, to inspire the creation of content and to encourage new online services to emerge.’ Council of the European Union, May 2010
  • 9. EUROPEANA is based on Digital Library Interoperability • Enables aggregation and unified metadata- driven search of content • More focused and accurate than web search engines (e.g., Google) – Unified retrieval of data for re-use in other applications • Common value-added services – Unified browsing / visualisation – Data cleaning – Data mining
  • 10. EUROPEANA CONTENT AGGREGATION Horizontal Aggregators Vertical Aggregators Archives National Aggregators Culture Grid Archives Portal Europe Libraries The European Library MLAs Regional Aggregators Dark Aggregators Flanders museums ATHENA ELocal Film archives European Film Gateway MLAs Museums MLAs
  • 13. Hellenic Aggregator Metadata Aggregation • Guide the digital libraries about technical specifications and features that they must support • Aggregate metadata • Validate metadata, detect problems and suggest solutions • Encode metadata according to Europeana standards • Communicate with Europeana and transmit all metadata
  • 14. Activities except from submitting metadata • Disseminating the vision and objectives of Euro- peana to their network of institutions in order to increase support for and involvement with Europeana. • Providing valuable feedback about the issues and discussions from their field. • Promoting and implementing standards further along the content provision chain. • Providing domain specific expertise and skills to institutions and Europeana.
  • 15. Registering a new library to the Hellenic Aggregator 1. The digital library web site is examined by an expert who concludes whether it contains content suitable for Europeana. 2. If the digital library supports OAI-PMH, metadata tests are conducted, problems are identified and solutions are suggested. 3. If the digital library does not support OAI-PMH, DEiXTo software is used to harvest the required metadata from the target HTML pages.
  • 16. Registering a new library to the Hellenic Aggregator 4. As soon as the digital library's metadata comply with the Europeana standards, it is registered in the Hellenic Aggregator. 5. Content Provider Agreement is signed by the digital library director. 6. The digital library content is published in Europeana.
  • 17. openarchivesengine.com The Hellenic Aggregator Software Platform • Our special software capable of metadata aggregation, management and dissemination via OAI-PMH. • Developed using Open source technologies • PHP, cakePHP framework • Mysql • Sphinx Search • Nginx web server • Very scalable, has been tested with 150 libraries and 4 million records ( http://www.libsearch.com ) • Also powers http://openarchives.gr • In development and production since 2006
  • 18. openarchivesengine.com The Hellenic Aggregator Software Platform • OAI-PMH Client - Retrieve and manage metadata from any digital library supporting OAI-PMH (e.g.. DSpace, eprints, fedora, CDS Invenio, OpenJournalSystems). • Validate metadata according to standards (Europeana and other) • Support Dublin Core, Europeana Semantic Elements and able to support more if required. • Capable of normalizing metadata & fixing problems in order to be compliant with Europeana • OAI-PMH Server - publish content via OAI-PMH + ESE to Europeana and other interested 3rd parties.
  • 20.
  • 21. OAIPMH.com features • Validation of OAI-PMH enabled digital library in real time. Easily detect errors in all OAI- PMH commands and results. • Metadata extraction from multiple libraries via OAI-PMH in XML rapidly and easily, thus enabling easy inspection, evaluation and other potential uses.
  • 22. OAIPMH.com benefits • Strict DC and ESE compliance is necessary. • Checking the OAI-PMH support of a library is difficult especially when dealing with a large number of libraries. • Automates and improves validation of new and existing OAI-PMH enabled libraries. • Administrators are able to evaluate digital libraries using a quick and intuitive tool. • Free access to all.
  • 23.
  • 24. Current users and future work • Regular users of OAIPMH.com include: – The Hellenic Aggregator – Openarchives.gr - Greek digital libraries search engine – Many users from Spain, Bulgaria and Cyprus • Future work: – Add more validation rules – Support more metadata formats (such as Europeana Data Model) – Create a public API to encourage third-party usage
  • 25.
  • 26. Dspace support • Dspace is the most common digital library software in Greece (and abroad) • We have developed 2 dspace plugins: 1. Automated ESE schema & fields addition plugin (batch insert of ESE fields in existing DC records) 2. Dspace ESE Crosswalk plugin • We have developed a PHP script to batch insert ESE elements to Europeana
  • 27. Dspace ESE support quick guide 1. Use the Europeana XML Namespace http://europeana.eu/schemas/ese/ and augment existing systems’ configuration in order to support ESE 2. Populate repository records with ESE metadata (optionally use the plugin) 3. Use the DSpace Crosswalks Plugin to support OAI-PMH ESE, freely available at http://vbanos.gr/?p=189 More info: http://blog.libver.gr/edlocal/
  • 28.
  • 29. DEiXTo web content data extraction • DEiXTo is a powerful web data extraction tool that is based on the W3C Document Object Model (DOM). It allows users to create highly accurate "extraction rules" (wrappers) that describe what pieces of data to scrape from a website.
  • 30. DEiXTo Architecture Web Pages DB ΔEiXTo ΙΕ parser & executor Extracted Information render engine Published Data model builder extraction rules extraction rules ΔEiXToBots (customized executors)
  • 31. DEiXTo features • Powerful web data extraction tool – Freeware GUI tool (built with Turbo Delphi, Windows- only) – Free, cross-platform Command Line Executor (in Perl) – DEiXToBot agent (implemented in Perl) • W3C Document Object Model (DOM) – DOM-based extraction rules (wrappers). • Extracted data can be exported to a wide variety of formats (tab delimited, XML, RSS, etc).
  • 33. VANGELIS BANOS Email: vbanos@gmail.com Web: http://vbanos.gr Useful pages: • http://aggregator.libver.gr • http://blog.libver.gr/edlocal/ • http://openarchivesengine.com • http://oaipmh.com • http://www.deixto.com QUESTIONS?

Editor's Notes

  1. The academic blog portal, a managed and publicly available listing of blogs by scholars.It is fairly extensive; at the time of source selection, over 1700 blogs were listed to the portal.It also includes explicit inclusion criteria. For a blog to be listed, it should meet those criteria.
  2. The academic blog portal, a managed and publicly available listing of blogs by scholars.It is fairly extensive; at the time of source selection, over 1700 blogs were listed to the portal.It also includes explicit inclusion criteria. For a blog to be listed, it should meet those criteria.