SlideShare a Scribd company logo
1 of 19
Biblio-transformation-engine:
An open source framework
and use cases
in the digital libraries domain
7th International Conference on Open Repositories (OR2012), Edinburgh, 9-13 July 2012
Kostas Stamatis, Nikolaos Konstantinou, Anastasia Manta,
Christina Paschou and Nikos Houssos
National Documentation Centre / National Hellenic Research Foundation, Greece
Agenda
• Introduction
• Motivation - the recurring need for data
transformations
• The proposed solution
• Use cases / experience reports
• Summary – conclusions – future work
7th International Conference on Open Repositories (OR2012), Edinburgh, 9-13 July 2012
Motivation
• Data transformations are needed everywhere
in digital libraries / scholarly communication
systems
• Painful and tedious procedure
• Many sub-tasks of the entire procedure reoccur
and could be reused
• Need for systematic framework for data
transformations to accelerate the process,
reduce errors and facilitate reuse
7th International Conference on Open Repositories (OR2012), Edinburgh, 9-13 July 2012
Analysis – basis steps
in data transformations
• Retrieve source data records
• Apply processing:
– (Optionally) Remove data records
– (Optionally) Add/modify/delete field values within records
– Transform data source to output format (implement the
corresponding mapping)
• Generate desired output
– Export to a file and/or directly update databases / external
systems
• Need for incremental / selective data loading ->
processing and output conditions may require repeated
execution of the loading/processing cycle
7th International Conference on Open Repositories (OR2012), Edinburgh, 9-13 July 2012
Design goals
• Customisable, non-intrusive, easy to use, integrate and
extend (e.g. support a variety of data source types)
• Separation of concerns in development – e.g.
development of transformation logic independent of
data sources
– Example: No need to be aware of MARC to develop a
function to harmonise encoding of dates
• Support for recurring execution of the data
loading/processing cycle according to specific criteria
(e.g. useful for OAI-PMH)
7th International Conference on Open Repositories (OR2012), Edinburgh, 9-13 July 2012
The biblio transformation engine
7th International Conference on Open Repositories (OR2012), Edinburgh, 9-13 July 2012
Components of the engine
• Data Loader: Retrieves data from input source(s)
according to DataLoading Spec
• ProcessingStep: Transforms input in some way
– Filters: removes records according to specific criteria
– Modifier: updates records according to specific criteria
– Initializer: initializes data in processing steps (e.g. load
author names to Filter)
• Output Generator: Creates the desired output (e.g.
export file, direct update of database)
• Record abstraction: simple common interface for
all types of records that allows complex
transformation functions
7th International Conference on Open Repositories (OR2012), Edinburgh, 9-13 July 2012
Processing workflow
• Load data – transform input to records
• If processing conditions are met, begin
processing – sequential execution of Filters
and Modifiers
• If output conditions are met, begin output –
execution of OutputGenerator(s)
7th International Conference on Open Repositories (OR2012), Edinburgh, 9-13 July 2012
Processing workflow
7th International Conference on Open Repositories (OR2012), Edinburgh, 9-13 July 2012
Load source data
Processing
conditions
OK
Generate output
Output
conditions
OK
Apply Filters &
Modifiers
Modify
LoadingSpec
YES
NO
NO
YES
The transformation engine – data model
7th International Conference on Open Repositories (OR2012), Edinburgh, 9-13 July 2012
Implementation
• FLOSS library developed in Java (maven used
as a build tool)
• Configuration outside the code - dependency
injection mechanisms of the Spring framework
core container
– Specification of Data Loader, Processing Steps,
Conditions, OutputGenerator
– Mapping from source to target format (for one-to-
one field mappings)
7th International Conference on Open Repositories (OR2012), Edinburgh, 9-13 July 2012
Example of mapping configuration
7th International Conference on Open Repositories (OR2012), Edinburgh, 9-13 July 2012
FLOSS library
• Available at
http://code.google.com/p/biblio-transformation-engine/
• European Union Public License
• Feel free to download and use it!
• Looking forward to feedback, questions,…
(contributions also welcome )
7th International Conference on Open Repositories (OR2012), Edinburgh, 9-13 July 2012
Use case 1 – Generate
Linked Open Data
• Sources: Repository records, legacy cultural
material records, research information in CERIF
• Corresponding data loaders reused
• Filters/Modifiers can be totally agnostic of RDF
and input formats
• Use Jena RDF library to generate RDF triples
• Add/generate appropriate identifiers/URI for each
entity (either at the modifier or output generator
level)
7th International Conference on Open Repositories (OR2012), Edinburgh, 9-13 July 2012
Use case 2 – Import/export
data/export to/from repositories
• Source record formats: EndNote, RIS, Bibtex,
UNIMARC
• Developed data loaders for each format, re-
used output generator for DSpace
• Export to different formats and reference styles
based on repository records
– Implemented for DSpace
– For reference styles uses the citeproc-js library and
the Citation Style Language (CSL)
7th International Conference on Open Repositories (OR2012), Edinburgh, 9-13 July 2012
Use case 3 – Feed
the VOA3R aggregator
• Get records of the Hellenic National Archive of Doctoral
Dissertation (HEDI – didaktorika.gr) to the VOA3R
aggregator (Virtual Open Access Agriculture &
Aquaculture Repository)
• Developed subject-based filter and injected it into an
enhanced OAI-PMH server using the library.
• ~1070 of approximately 23.500 records, needed to
apply techniques to cater for the distribution sparsity of
“suitable” records combined with resumption token
• Seamless on-the-fly deployment and co-existence with
sets targeted to other aggregators (DART,
openarchives.gr)
7th International Conference on Open Repositories (OR2012), Edinburgh, 9-13 July 2012
Use case 4 – Feed Europeana
• Include in Europeana content from the Technical
Chamber of Greece (TEE)
• Records in TEE library catalog (UNIMARC),
available through a Z39.50 interface
• Developed Z39.50 data loader, appropriate filters
and modifiers (independent of UNIMARC)
• Mapping to ESE implemented through a modifier
• ~6800 from the TEE records sent to Europeana
• Repeatable, automated procedure through an
enhanced OAI-PMH server using the library
International Conference on Theory and Practice of Digital Libraries (TPDL 2011), Berlin, 26-28 Sept ember 2011
Future work
• Support more types of data transformations
(contributions welcome )
• Extend declarative specification of mappings to
cover more sophisticated cases
• Configurable support for common data model
to facilitate reuse of Filter and Modifier
implementations
• Systematically study the user experience,
identify and implement potential improvements
7th International Conference on Open Repositories (OR2012), Edinburgh, 9-13 July 2012
Thank you for your attention!
• More info:
http://code.google.com/p/biblio-transformation-engine/
kstamatis AT ekt.gr
nkons AT ekt.gr
amanta AT ekt.gr
cpaschou AT ekt.gr
nhoussos AT ekt.gr
7th International Conference on Open Repositories (OR2012), Edinburgh, 9-13 July 2012

More Related Content

What's hot

Mobyle 2 - Mobyle Workshop - September 28, 2012
Mobyle 2 - Mobyle Workshop - September 28, 2012Mobyle 2 - Mobyle Workshop - September 28, 2012
Mobyle 2 - Mobyle Workshop - September 28, 2012Hervé Ménager
 
OpenAIRE guidelines for CRIS managers : supporting interoperability of open r...
OpenAIRE guidelines for CRIS managers : supporting interoperability of open r...OpenAIRE guidelines for CRIS managers : supporting interoperability of open r...
OpenAIRE guidelines for CRIS managers : supporting interoperability of open r...OpenAIRE
 
d:swarm - A Library Data Management Platform Based on a Linked Open Data Appr...
d:swarm - A Library Data Management Platform Based on a Linked Open Data Appr...d:swarm - A Library Data Management Platform Based on a Linked Open Data Appr...
d:swarm - A Library Data Management Platform Based on a Linked Open Data Appr...Jens Mittelbach
 
Geant4 Model Testing Framework: From PAW to ROOT
Geant4 Model Testing Framework:  From PAW to ROOTGeant4 Model Testing Framework:  From PAW to ROOT
Geant4 Model Testing Framework: From PAW to ROOTRoman Atachiants
 
Coherance in dissemination- Msis 2007
Coherance in dissemination- Msis 2007Coherance in dissemination- Msis 2007
Coherance in dissemination- Msis 2007annegrete
 
DBpedia+ / DBpedia meeting in Dublin
DBpedia+ / DBpedia meeting in DublinDBpedia+ / DBpedia meeting in Dublin
DBpedia+ / DBpedia meeting in DublinDimitris Kontokostas
 
Linked Open Projects (DCMI Library Community)
Linked Open Projects (DCMI Library Community)Linked Open Projects (DCMI Library Community)
Linked Open Projects (DCMI Library Community)Kai Eckert
 
Open Cloud Computing Interface
Open Cloud Computing InterfaceOpen Cloud Computing Interface
Open Cloud Computing InterfaceJean Parpaillon
 
20141030 LinDA Workshop echallenges2014 - State of the art in open data infra...
20141030 LinDA Workshop echallenges2014 - State of the art in open data infra...20141030 LinDA Workshop echallenges2014 - State of the art in open data infra...
20141030 LinDA Workshop echallenges2014 - State of the art in open data infra...LinDa_FP7
 
Co-operation for digital preservation and curation: collaboration for collect...
Co-operation for digital preservation and curation: collaboration for collect...Co-operation for digital preservation and curation: collaboration for collect...
Co-operation for digital preservation and curation: collaboration for collect...Michael Day
 
The ARIADNE interoperability framework, component architecture and registry s...
The ARIADNE interoperability framework, component architecture and registry s...The ARIADNE interoperability framework, component architecture and registry s...
The ARIADNE interoperability framework, component architecture and registry s...ariadnenetwork
 
Constantly Under Construction: STW Thesaurus for Economics Linked Data Maint...
Constantly Under Construction: STW Thesaurus for Economics Linked Data Maint...Constantly Under Construction: STW Thesaurus for Economics Linked Data Maint...
Constantly Under Construction: STW Thesaurus for Economics Linked Data Maint...Joachim Neubert
 
The IGeLU Linked Open Data Special Interest Working Group
The IGeLU Linked Open Data Special Interest Working GroupThe IGeLU Linked Open Data Special Interest Working Group
The IGeLU Linked Open Data Special Interest Working GroupLukas Koster
 
Data workshop preso
Data workshop preso Data workshop preso
Data workshop preso Doug Moncur
 

What's hot (20)

Mobyle 2 - Mobyle Workshop - September 28, 2012
Mobyle 2 - Mobyle Workshop - September 28, 2012Mobyle 2 - Mobyle Workshop - September 28, 2012
Mobyle 2 - Mobyle Workshop - September 28, 2012
 
Methodology for the publication of Linked Open Data from small and medium siz...
Methodology for the publication of Linked Open Data from small and medium siz...Methodology for the publication of Linked Open Data from small and medium siz...
Methodology for the publication of Linked Open Data from small and medium siz...
 
OpenAIRE guidelines for CRIS managers : supporting interoperability of open r...
OpenAIRE guidelines for CRIS managers : supporting interoperability of open r...OpenAIRE guidelines for CRIS managers : supporting interoperability of open r...
OpenAIRE guidelines for CRIS managers : supporting interoperability of open r...
 
d:swarm - A Library Data Management Platform Based on a Linked Open Data Appr...
d:swarm - A Library Data Management Platform Based on a Linked Open Data Appr...d:swarm - A Library Data Management Platform Based on a Linked Open Data Appr...
d:swarm - A Library Data Management Platform Based on a Linked Open Data Appr...
 
Geant4 Model Testing Framework: From PAW to ROOT
Geant4 Model Testing Framework:  From PAW to ROOTGeant4 Model Testing Framework:  From PAW to ROOT
Geant4 Model Testing Framework: From PAW to ROOT
 
Coherance in dissemination- Msis 2007
Coherance in dissemination- Msis 2007Coherance in dissemination- Msis 2007
Coherance in dissemination- Msis 2007
 
Bologna
BolognaBologna
Bologna
 
DBpedia+ / DBpedia meeting in Dublin
DBpedia+ / DBpedia meeting in DublinDBpedia+ / DBpedia meeting in Dublin
DBpedia+ / DBpedia meeting in Dublin
 
Linked Open Projects (DCMI Library Community)
Linked Open Projects (DCMI Library Community)Linked Open Projects (DCMI Library Community)
Linked Open Projects (DCMI Library Community)
 
Open Cloud Computing Interface
Open Cloud Computing InterfaceOpen Cloud Computing Interface
Open Cloud Computing Interface
 
20141030 LinDA Workshop echallenges2014 - State of the art in open data infra...
20141030 LinDA Workshop echallenges2014 - State of the art in open data infra...20141030 LinDA Workshop echallenges2014 - State of the art in open data infra...
20141030 LinDA Workshop echallenges2014 - State of the art in open data infra...
 
XML Data Using Oracle
XML Data Using OracleXML Data Using Oracle
XML Data Using Oracle
 
Co-operation for digital preservation and curation: collaboration for collect...
Co-operation for digital preservation and curation: collaboration for collect...Co-operation for digital preservation and curation: collaboration for collect...
Co-operation for digital preservation and curation: collaboration for collect...
 
The ARIADNE interoperability framework, component architecture and registry s...
The ARIADNE interoperability framework, component architecture and registry s...The ARIADNE interoperability framework, component architecture and registry s...
The ARIADNE interoperability framework, component architecture and registry s...
 
Work Package 2 - Month 6 by Hannes Mühleisen
Work Package 2 - Month 6 by Hannes MühleisenWork Package 2 - Month 6 by Hannes Mühleisen
Work Package 2 - Month 6 by Hannes Mühleisen
 
Towards a Linked Data Publishing Methodology
Towards a Linked Data Publishing MethodologyTowards a Linked Data Publishing Methodology
Towards a Linked Data Publishing Methodology
 
Constantly Under Construction: STW Thesaurus for Economics Linked Data Maint...
Constantly Under Construction: STW Thesaurus for Economics Linked Data Maint...Constantly Under Construction: STW Thesaurus for Economics Linked Data Maint...
Constantly Under Construction: STW Thesaurus for Economics Linked Data Maint...
 
The IGeLU Linked Open Data Special Interest Working Group
The IGeLU Linked Open Data Special Interest Working GroupThe IGeLU Linked Open Data Special Interest Working Group
The IGeLU Linked Open Data Special Interest Working Group
 
Manchesterjan2015
Manchesterjan2015Manchesterjan2015
Manchesterjan2015
 
Data workshop preso
Data workshop preso Data workshop preso
Data workshop preso
 

Similar to OR2012 Biblio-transformation-engine

Biblio-transformation-engine slides in Open Repositories 2012
Biblio-transformation-engine slides in Open Repositories 2012Biblio-transformation-engine slides in Open Repositories 2012
Biblio-transformation-engine slides in Open Repositories 2012Nikos Houssos
 
Presentation of the EDULLL repository by EKT at Open Repositories 2012 DSpace...
Presentation of the EDULLL repository by EKT at Open Repositories 2012 DSpace...Presentation of the EDULLL repository by EKT at Open Repositories 2012 DSpace...
Presentation of the EDULLL repository by EKT at Open Repositories 2012 DSpace...Nikos Houssos
 
Integrating an electronic lab notebook with a data repository; American Chemi...
Integrating an electronic lab notebook with a data repository; American Chemi...Integrating an electronic lab notebook with a data repository; American Chemi...
Integrating an electronic lab notebook with a data repository; American Chemi...rmacneil88
 
Elns and repositories, American Chemical Society, Dallas, March 2014
Elns and repositories, American Chemical Society, Dallas, March 2014Elns and repositories, American Chemical Society, Dallas, March 2014
Elns and repositories, American Chemical Society, Dallas, March 2014ResearchSpace
 
Open Infrastructure for Cultural Heritage Digital Content
Open Infrastructure for Cultural Heritage Digital ContentOpen Infrastructure for Cultural Heritage Digital Content
Open Infrastructure for Cultural Heritage Digital ContentNikos Houssos
 
Open Science Days 2014 - Becker - Repositories and Linked Data
Open Science Days 2014 - Becker - Repositories and Linked DataOpen Science Days 2014 - Becker - Repositories and Linked Data
Open Science Days 2014 - Becker - Repositories and Linked DataPascal-Nicolas Becker
 
CRIS 2014 - OpenAIRE Guidelines: supporting interoperability for Literature R...
CRIS 2014 - OpenAIRE Guidelines: supporting interoperability for Literature R...CRIS 2014 - OpenAIRE Guidelines: supporting interoperability for Literature R...
CRIS 2014 - OpenAIRE Guidelines: supporting interoperability for Literature R...Pedro Príncipe
 
OpenAIRE guidelines : supporting interoperability for literature repositories...
OpenAIRE guidelines : supporting interoperability for literature repositories...OpenAIRE guidelines : supporting interoperability for literature repositories...
OpenAIRE guidelines : supporting interoperability for literature repositories...OpenAIRE
 
Harvesting Repositories: DPLA, Europeana, & Other Case Studies
Harvesting Repositories:  DPLA, Europeana, & Other Case StudiesHarvesting Repositories:  DPLA, Europeana, & Other Case Studies
Harvesting Repositories: DPLA, Europeana, & Other Case Studieseohallor
 
WEBINAR: "How to manage your data to make them open and fair"
WEBINAR:  "How to manage your data to make them open and fair"  WEBINAR:  "How to manage your data to make them open and fair"
WEBINAR: "How to manage your data to make them open and fair" OpenAIRE
 
OpenAIRE and the case of Irish Repositories, by Jochen Schirrwagen (RIAN Work...
OpenAIRE and the case of Irish Repositories, by Jochen Schirrwagen (RIAN Work...OpenAIRE and the case of Irish Repositories, by Jochen Schirrwagen (RIAN Work...
OpenAIRE and the case of Irish Repositories, by Jochen Schirrwagen (RIAN Work...OpenAIRE
 
OpenAIRE and the Case of Irish Repositories
OpenAIRE and the Case of Irish RepositoriesOpenAIRE and the Case of Irish Repositories
OpenAIRE and the Case of Irish RepositoriesRIANIreland
 
FAIRDOM - FAIR Asset management and sharing experiences in Systems and Synthe...
FAIRDOM - FAIR Asset management and sharing experiences in Systems and Synthe...FAIRDOM - FAIR Asset management and sharing experiences in Systems and Synthe...
FAIRDOM - FAIR Asset management and sharing experiences in Systems and Synthe...Carole Goble
 
Integrating an electronic lab notebook with a university it environment rdmf ...
Integrating an electronic lab notebook with a university it environment rdmf ...Integrating an electronic lab notebook with a university it environment rdmf ...
Integrating an electronic lab notebook with a university it environment rdmf ...rmacneil88
 
Technical integration of data repositories status and challenges
Technical integration of data repositories status and challengesTechnical integration of data repositories status and challenges
Technical integration of data repositories status and challengesvty
 
Comsode tools - pushing data to open ecosystem
Comsode tools - pushing data to open ecosystemComsode tools - pushing data to open ecosystem
Comsode tools - pushing data to open ecosystemComsode - FP7 project
 

Similar to OR2012 Biblio-transformation-engine (20)

Biblio-transformation-engine slides in Open Repositories 2012
Biblio-transformation-engine slides in Open Repositories 2012Biblio-transformation-engine slides in Open Repositories 2012
Biblio-transformation-engine slides in Open Repositories 2012
 
Presentation of the EDULLL repository by EKT at Open Repositories 2012 DSpace...
Presentation of the EDULLL repository by EKT at Open Repositories 2012 DSpace...Presentation of the EDULLL repository by EKT at Open Repositories 2012 DSpace...
Presentation of the EDULLL repository by EKT at Open Repositories 2012 DSpace...
 
Scholze liber 2015-06-25_final
Scholze liber 2015-06-25_finalScholze liber 2015-06-25_final
Scholze liber 2015-06-25_final
 
Integrating an electronic lab notebook with a data repository; American Chemi...
Integrating an electronic lab notebook with a data repository; American Chemi...Integrating an electronic lab notebook with a data repository; American Chemi...
Integrating an electronic lab notebook with a data repository; American Chemi...
 
Elns and repositories, American Chemical Society, Dallas, March 2014
Elns and repositories, American Chemical Society, Dallas, March 2014Elns and repositories, American Chemical Society, Dallas, March 2014
Elns and repositories, American Chemical Society, Dallas, March 2014
 
RDM@Edinburgh_interoperation_IDCC2015
RDM@Edinburgh_interoperation_IDCC2015RDM@Edinburgh_interoperation_IDCC2015
RDM@Edinburgh_interoperation_IDCC2015
 
Service Integration to Enhance RDM
Service Integration to Enhance RDMService Integration to Enhance RDM
Service Integration to Enhance RDM
 
Open Infrastructure for Cultural Heritage Digital Content
Open Infrastructure for Cultural Heritage Digital ContentOpen Infrastructure for Cultural Heritage Digital Content
Open Infrastructure for Cultural Heritage Digital Content
 
Open Science Days 2014 - Becker - Repositories and Linked Data
Open Science Days 2014 - Becker - Repositories and Linked DataOpen Science Days 2014 - Becker - Repositories and Linked Data
Open Science Days 2014 - Becker - Repositories and Linked Data
 
CRIS 2014 - OpenAIRE Guidelines: supporting interoperability for Literature R...
CRIS 2014 - OpenAIRE Guidelines: supporting interoperability for Literature R...CRIS 2014 - OpenAIRE Guidelines: supporting interoperability for Literature R...
CRIS 2014 - OpenAIRE Guidelines: supporting interoperability for Literature R...
 
OpenAIRE guidelines : supporting interoperability for literature repositories...
OpenAIRE guidelines : supporting interoperability for literature repositories...OpenAIRE guidelines : supporting interoperability for literature repositories...
OpenAIRE guidelines : supporting interoperability for literature repositories...
 
Harvesting Repositories: DPLA, Europeana, & Other Case Studies
Harvesting Repositories:  DPLA, Europeana, & Other Case StudiesHarvesting Repositories:  DPLA, Europeana, & Other Case Studies
Harvesting Repositories: DPLA, Europeana, & Other Case Studies
 
WEBINAR: "How to manage your data to make them open and fair"
WEBINAR:  "How to manage your data to make them open and fair"  WEBINAR:  "How to manage your data to make them open and fair"
WEBINAR: "How to manage your data to make them open and fair"
 
OpenAIRE and the case of Irish Repositories, by Jochen Schirrwagen (RIAN Work...
OpenAIRE and the case of Irish Repositories, by Jochen Schirrwagen (RIAN Work...OpenAIRE and the case of Irish Repositories, by Jochen Schirrwagen (RIAN Work...
OpenAIRE and the case of Irish Repositories, by Jochen Schirrwagen (RIAN Work...
 
OpenAIRE and the Case of Irish Repositories
OpenAIRE and the Case of Irish RepositoriesOpenAIRE and the Case of Irish Repositories
OpenAIRE and the Case of Irish Repositories
 
FAIRDOM - FAIR Asset management and sharing experiences in Systems and Synthe...
FAIRDOM - FAIR Asset management and sharing experiences in Systems and Synthe...FAIRDOM - FAIR Asset management and sharing experiences in Systems and Synthe...
FAIRDOM - FAIR Asset management and sharing experiences in Systems and Synthe...
 
Integrating an electronic lab notebook with a university it environment rdmf ...
Integrating an electronic lab notebook with a university it environment rdmf ...Integrating an electronic lab notebook with a university it environment rdmf ...
Integrating an electronic lab notebook with a university it environment rdmf ...
 
Technical integration of data repositories status and challenges
Technical integration of data repositories status and challengesTechnical integration of data repositories status and challenges
Technical integration of data repositories status and challenges
 
RDM Programme @ Edinburgh - Service Interoperation
RDM Programme @ Edinburgh - Service InteroperationRDM Programme @ Edinburgh - Service Interoperation
RDM Programme @ Edinburgh - Service Interoperation
 
Comsode tools - pushing data to open ecosystem
Comsode tools - pushing data to open ecosystemComsode tools - pushing data to open ecosystem
Comsode tools - pushing data to open ecosystem
 

More from Nikolaos Konstantinou

An Approach for the Incremental Export of Relational Databases into RDF Graphs
An Approach for the Incremental Export of Relational Databases into RDF GraphsAn Approach for the Incremental Export of Relational Databases into RDF Graphs
An Approach for the Incremental Export of Relational Databases into RDF GraphsNikolaos Konstantinou
 
Materializing the Web of Linked Data
Materializing the Web of Linked DataMaterializing the Web of Linked Data
Materializing the Web of Linked DataNikolaos Konstantinou
 
Introduction: Linked Data and the Semantic Web
Introduction: Linked Data and the Semantic WebIntroduction: Linked Data and the Semantic Web
Introduction: Linked Data and the Semantic WebNikolaos Konstantinou
 
Deploying Linked Open Data: Methodologies and Software Tools
Deploying Linked Open Data: Methodologies and Software ToolsDeploying Linked Open Data: Methodologies and Software Tools
Deploying Linked Open Data: Methodologies and Software ToolsNikolaos Konstantinou
 
Creating Linked Data from Relational Databases
Creating Linked Data from Relational DatabasesCreating Linked Data from Relational Databases
Creating Linked Data from Relational DatabasesNikolaos Konstantinou
 
Generating Linked Data in Real-time from Sensor Data Streams
Generating Linked Data in Real-time from Sensor Data StreamsGenerating Linked Data in Real-time from Sensor Data Streams
Generating Linked Data in Real-time from Sensor Data StreamsNikolaos Konstantinou
 
Incremental Export of Relational Database Contents into RDF Graphs
Incremental Export of Relational Database Contents into RDF GraphsIncremental Export of Relational Database Contents into RDF Graphs
Incremental Export of Relational Database Contents into RDF GraphsNikolaos Konstantinou
 
Transient and persistent RDF views over relational databases in the context o...
Transient and persistent RDF views over relational databases in the context o...Transient and persistent RDF views over relational databases in the context o...
Transient and persistent RDF views over relational databases in the context o...Nikolaos Konstantinou
 
Exposing Bibliographic Information as Linked Open Data using Standards-based ...
Exposing Bibliographic Information as Linked Open Data using Standards-based ...Exposing Bibliographic Information as Linked Open Data using Standards-based ...
Exposing Bibliographic Information as Linked Open Data using Standards-based ...Nikolaos Konstantinou
 
Priamos: A Middleware Architecture for Real-Time Semantic Annotation of Conte...
Priamos: A Middleware Architecture for Real-Time Semantic Annotation of Conte...Priamos: A Middleware Architecture for Real-Time Semantic Annotation of Conte...
Priamos: A Middleware Architecture for Real-Time Semantic Annotation of Conte...Nikolaos Konstantinou
 
Διαχείριση Ψηφιακού Περιεχομένου με το DSpace: Λειτουργία και τεχνικά ζητήματα
Διαχείριση Ψηφιακού Περιεχομένου με το DSpace: Λειτουργία και τεχνικά ζητήματαΔιαχείριση Ψηφιακού Περιεχομένου με το DSpace: Λειτουργία και τεχνικά ζητήματα
Διαχείριση Ψηφιακού Περιεχομένου με το DSpace: Λειτουργία και τεχνικά ζητήματαNikolaos Konstantinou
 
Publishing Data Using Semantic Web Technologies
Publishing Data Using Semantic Web TechnologiesPublishing Data Using Semantic Web Technologies
Publishing Data Using Semantic Web TechnologiesNikolaos Konstantinou
 
From Sensor Data to Triples: Information Flow in Semantic Sensor Networks
From Sensor Data to Triples: Information Flow in Semantic Sensor NetworksFrom Sensor Data to Triples: Information Flow in Semantic Sensor Networks
From Sensor Data to Triples: Information Flow in Semantic Sensor NetworksNikolaos Konstantinou
 
A rule-based approach for the real-time semantic annotation in context-aware ...
A rule-based approach for the real-time semantic annotation in context-aware ...A rule-based approach for the real-time semantic annotation in context-aware ...
A rule-based approach for the real-time semantic annotation in context-aware ...Nikolaos Konstantinou
 
VisAVis: An Approach to an Intermediate Layer between Ontologies and Relation...
VisAVis: An Approach to an Intermediate Layer between Ontologies and Relation...VisAVis: An Approach to an Intermediate Layer between Ontologies and Relation...
VisAVis: An Approach to an Intermediate Layer between Ontologies and Relation...Nikolaos Konstantinou
 

More from Nikolaos Konstantinou (17)

An Approach for the Incremental Export of Relational Databases into RDF Graphs
An Approach for the Incremental Export of Relational Databases into RDF GraphsAn Approach for the Incremental Export of Relational Databases into RDF Graphs
An Approach for the Incremental Export of Relational Databases into RDF Graphs
 
Materializing the Web of Linked Data
Materializing the Web of Linked DataMaterializing the Web of Linked Data
Materializing the Web of Linked Data
 
Introduction: Linked Data and the Semantic Web
Introduction: Linked Data and the Semantic WebIntroduction: Linked Data and the Semantic Web
Introduction: Linked Data and the Semantic Web
 
Technical Background
Technical BackgroundTechnical Background
Technical Background
 
Deploying Linked Open Data: Methodologies and Software Tools
Deploying Linked Open Data: Methodologies and Software ToolsDeploying Linked Open Data: Methodologies and Software Tools
Deploying Linked Open Data: Methodologies and Software Tools
 
Creating Linked Data from Relational Databases
Creating Linked Data from Relational DatabasesCreating Linked Data from Relational Databases
Creating Linked Data from Relational Databases
 
Generating Linked Data in Real-time from Sensor Data Streams
Generating Linked Data in Real-time from Sensor Data StreamsGenerating Linked Data in Real-time from Sensor Data Streams
Generating Linked Data in Real-time from Sensor Data Streams
 
Conclusions: Summary and Outlook
Conclusions: Summary and OutlookConclusions: Summary and Outlook
Conclusions: Summary and Outlook
 
Incremental Export of Relational Database Contents into RDF Graphs
Incremental Export of Relational Database Contents into RDF GraphsIncremental Export of Relational Database Contents into RDF Graphs
Incremental Export of Relational Database Contents into RDF Graphs
 
Transient and persistent RDF views over relational databases in the context o...
Transient and persistent RDF views over relational databases in the context o...Transient and persistent RDF views over relational databases in the context o...
Transient and persistent RDF views over relational databases in the context o...
 
Exposing Bibliographic Information as Linked Open Data using Standards-based ...
Exposing Bibliographic Information as Linked Open Data using Standards-based ...Exposing Bibliographic Information as Linked Open Data using Standards-based ...
Exposing Bibliographic Information as Linked Open Data using Standards-based ...
 
Priamos: A Middleware Architecture for Real-Time Semantic Annotation of Conte...
Priamos: A Middleware Architecture for Real-Time Semantic Annotation of Conte...Priamos: A Middleware Architecture for Real-Time Semantic Annotation of Conte...
Priamos: A Middleware Architecture for Real-Time Semantic Annotation of Conte...
 
Διαχείριση Ψηφιακού Περιεχομένου με το DSpace: Λειτουργία και τεχνικά ζητήματα
Διαχείριση Ψηφιακού Περιεχομένου με το DSpace: Λειτουργία και τεχνικά ζητήματαΔιαχείριση Ψηφιακού Περιεχομένου με το DSpace: Λειτουργία και τεχνικά ζητήματα
Διαχείριση Ψηφιακού Περιεχομένου με το DSpace: Λειτουργία και τεχνικά ζητήματα
 
Publishing Data Using Semantic Web Technologies
Publishing Data Using Semantic Web TechnologiesPublishing Data Using Semantic Web Technologies
Publishing Data Using Semantic Web Technologies
 
From Sensor Data to Triples: Information Flow in Semantic Sensor Networks
From Sensor Data to Triples: Information Flow in Semantic Sensor NetworksFrom Sensor Data to Triples: Information Flow in Semantic Sensor Networks
From Sensor Data to Triples: Information Flow in Semantic Sensor Networks
 
A rule-based approach for the real-time semantic annotation in context-aware ...
A rule-based approach for the real-time semantic annotation in context-aware ...A rule-based approach for the real-time semantic annotation in context-aware ...
A rule-based approach for the real-time semantic annotation in context-aware ...
 
VisAVis: An Approach to an Intermediate Layer between Ontologies and Relation...
VisAVis: An Approach to an Intermediate Layer between Ontologies and Relation...VisAVis: An Approach to an Intermediate Layer between Ontologies and Relation...
VisAVis: An Approach to an Intermediate Layer between Ontologies and Relation...
 

Recently uploaded

Transcript: New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024BookNet Canada
 
SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024Scott Keck-Warren
 
Key Features Of Token Development (1).pptx
Key  Features Of Token  Development (1).pptxKey  Features Of Token  Development (1).pptx
Key Features Of Token Development (1).pptxLBM Solutions
 
New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024BookNet Canada
 
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersEnhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersThousandEyes
 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationRidwan Fadjar
 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubKalema Edgar
 
Pigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food ManufacturingPigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food ManufacturingPigging Solutions
 
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...shyamraj55
 
Bluetooth Controlled Car with Arduino.pdf
Bluetooth Controlled Car with Arduino.pdfBluetooth Controlled Car with Arduino.pdf
Bluetooth Controlled Car with Arduino.pdfngoud9212
 
Making_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptx
Making_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptxMaking_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptx
Making_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptxnull - The Open Security Community
 
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsKotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsAndrey Dotsenko
 
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)Wonjun Hwang
 
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsKotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticscarlostorres15106
 
CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):comworks
 
costume and set research powerpoint presentation
costume and set research powerpoint presentationcostume and set research powerpoint presentation
costume and set research powerpoint presentationphoebematthew05
 
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024BookNet Canada
 
Unblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesUnblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesSinan KOZAK
 

Recently uploaded (20)

Transcript: New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
 
SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024
 
Key Features Of Token Development (1).pptx
Key  Features Of Token  Development (1).pptxKey  Features Of Token  Development (1).pptx
Key Features Of Token Development (1).pptx
 
New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
 
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersEnhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 Presentation
 
DMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special EditionDMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special Edition
 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding Club
 
Pigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food ManufacturingPigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food Manufacturing
 
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
 
Bluetooth Controlled Car with Arduino.pdf
Bluetooth Controlled Car with Arduino.pdfBluetooth Controlled Car with Arduino.pdf
Bluetooth Controlled Car with Arduino.pdf
 
Making_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptx
Making_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptxMaking_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptx
Making_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptx
 
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsKotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
 
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
 
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsKotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
 
CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):
 
costume and set research powerpoint presentation
costume and set research powerpoint presentationcostume and set research powerpoint presentation
costume and set research powerpoint presentation
 
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptxE-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
 
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
 
Unblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesUnblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen Frames
 

OR2012 Biblio-transformation-engine

  • 1. Biblio-transformation-engine: An open source framework and use cases in the digital libraries domain 7th International Conference on Open Repositories (OR2012), Edinburgh, 9-13 July 2012 Kostas Stamatis, Nikolaos Konstantinou, Anastasia Manta, Christina Paschou and Nikos Houssos National Documentation Centre / National Hellenic Research Foundation, Greece
  • 2. Agenda • Introduction • Motivation - the recurring need for data transformations • The proposed solution • Use cases / experience reports • Summary – conclusions – future work 7th International Conference on Open Repositories (OR2012), Edinburgh, 9-13 July 2012
  • 3. Motivation • Data transformations are needed everywhere in digital libraries / scholarly communication systems • Painful and tedious procedure • Many sub-tasks of the entire procedure reoccur and could be reused • Need for systematic framework for data transformations to accelerate the process, reduce errors and facilitate reuse 7th International Conference on Open Repositories (OR2012), Edinburgh, 9-13 July 2012
  • 4. Analysis – basis steps in data transformations • Retrieve source data records • Apply processing: – (Optionally) Remove data records – (Optionally) Add/modify/delete field values within records – Transform data source to output format (implement the corresponding mapping) • Generate desired output – Export to a file and/or directly update databases / external systems • Need for incremental / selective data loading -> processing and output conditions may require repeated execution of the loading/processing cycle 7th International Conference on Open Repositories (OR2012), Edinburgh, 9-13 July 2012
  • 5. Design goals • Customisable, non-intrusive, easy to use, integrate and extend (e.g. support a variety of data source types) • Separation of concerns in development – e.g. development of transformation logic independent of data sources – Example: No need to be aware of MARC to develop a function to harmonise encoding of dates • Support for recurring execution of the data loading/processing cycle according to specific criteria (e.g. useful for OAI-PMH) 7th International Conference on Open Repositories (OR2012), Edinburgh, 9-13 July 2012
  • 6. The biblio transformation engine 7th International Conference on Open Repositories (OR2012), Edinburgh, 9-13 July 2012
  • 7. Components of the engine • Data Loader: Retrieves data from input source(s) according to DataLoading Spec • ProcessingStep: Transforms input in some way – Filters: removes records according to specific criteria – Modifier: updates records according to specific criteria – Initializer: initializes data in processing steps (e.g. load author names to Filter) • Output Generator: Creates the desired output (e.g. export file, direct update of database) • Record abstraction: simple common interface for all types of records that allows complex transformation functions 7th International Conference on Open Repositories (OR2012), Edinburgh, 9-13 July 2012
  • 8. Processing workflow • Load data – transform input to records • If processing conditions are met, begin processing – sequential execution of Filters and Modifiers • If output conditions are met, begin output – execution of OutputGenerator(s) 7th International Conference on Open Repositories (OR2012), Edinburgh, 9-13 July 2012
  • 9. Processing workflow 7th International Conference on Open Repositories (OR2012), Edinburgh, 9-13 July 2012 Load source data Processing conditions OK Generate output Output conditions OK Apply Filters & Modifiers Modify LoadingSpec YES NO NO YES
  • 10. The transformation engine – data model 7th International Conference on Open Repositories (OR2012), Edinburgh, 9-13 July 2012
  • 11. Implementation • FLOSS library developed in Java (maven used as a build tool) • Configuration outside the code - dependency injection mechanisms of the Spring framework core container – Specification of Data Loader, Processing Steps, Conditions, OutputGenerator – Mapping from source to target format (for one-to- one field mappings) 7th International Conference on Open Repositories (OR2012), Edinburgh, 9-13 July 2012
  • 12. Example of mapping configuration 7th International Conference on Open Repositories (OR2012), Edinburgh, 9-13 July 2012
  • 13. FLOSS library • Available at http://code.google.com/p/biblio-transformation-engine/ • European Union Public License • Feel free to download and use it! • Looking forward to feedback, questions,… (contributions also welcome ) 7th International Conference on Open Repositories (OR2012), Edinburgh, 9-13 July 2012
  • 14. Use case 1 – Generate Linked Open Data • Sources: Repository records, legacy cultural material records, research information in CERIF • Corresponding data loaders reused • Filters/Modifiers can be totally agnostic of RDF and input formats • Use Jena RDF library to generate RDF triples • Add/generate appropriate identifiers/URI for each entity (either at the modifier or output generator level) 7th International Conference on Open Repositories (OR2012), Edinburgh, 9-13 July 2012
  • 15. Use case 2 – Import/export data/export to/from repositories • Source record formats: EndNote, RIS, Bibtex, UNIMARC • Developed data loaders for each format, re- used output generator for DSpace • Export to different formats and reference styles based on repository records – Implemented for DSpace – For reference styles uses the citeproc-js library and the Citation Style Language (CSL) 7th International Conference on Open Repositories (OR2012), Edinburgh, 9-13 July 2012
  • 16. Use case 3 – Feed the VOA3R aggregator • Get records of the Hellenic National Archive of Doctoral Dissertation (HEDI – didaktorika.gr) to the VOA3R aggregator (Virtual Open Access Agriculture & Aquaculture Repository) • Developed subject-based filter and injected it into an enhanced OAI-PMH server using the library. • ~1070 of approximately 23.500 records, needed to apply techniques to cater for the distribution sparsity of “suitable” records combined with resumption token • Seamless on-the-fly deployment and co-existence with sets targeted to other aggregators (DART, openarchives.gr) 7th International Conference on Open Repositories (OR2012), Edinburgh, 9-13 July 2012
  • 17. Use case 4 – Feed Europeana • Include in Europeana content from the Technical Chamber of Greece (TEE) • Records in TEE library catalog (UNIMARC), available through a Z39.50 interface • Developed Z39.50 data loader, appropriate filters and modifiers (independent of UNIMARC) • Mapping to ESE implemented through a modifier • ~6800 from the TEE records sent to Europeana • Repeatable, automated procedure through an enhanced OAI-PMH server using the library International Conference on Theory and Practice of Digital Libraries (TPDL 2011), Berlin, 26-28 Sept ember 2011
  • 18. Future work • Support more types of data transformations (contributions welcome ) • Extend declarative specification of mappings to cover more sophisticated cases • Configurable support for common data model to facilitate reuse of Filter and Modifier implementations • Systematically study the user experience, identify and implement potential improvements 7th International Conference on Open Repositories (OR2012), Edinburgh, 9-13 July 2012
  • 19. Thank you for your attention! • More info: http://code.google.com/p/biblio-transformation-engine/ kstamatis AT ekt.gr nkons AT ekt.gr amanta AT ekt.gr cpaschou AT ekt.gr nhoussos AT ekt.gr 7th International Conference on Open Repositories (OR2012), Edinburgh, 9-13 July 2012