Scalable and sustainable - OCR & document image analysis in the cloud
New Trends in Humanities Computing. HPC Cloud day, 4 October 2011, Amsterdam, Netherlands.
Experimental Workflow Development in Digitisationcneudecker
The document discusses the IMPACT project, which is developing collaborative workflows for digitizing historical books and newspapers printed before 1900. The project is coordinated by the National Library of the Netherlands and involves 26 partner libraries and research institutes. It aims to create a workflow development platform that allows technical staff to provide tools for library staff to design digitization workflows through a community-driven process. The platform will incorporate modular, transparent, flexible, and extensible architectural principles.
Presentation given by Hildelies Balk during the 2nd LIBER-EBLIDA Workshop on Digitisation of Library Material in Europe (19-21 October 2009, The Hague, the Netherlands)
The document discusses the IMPACT project, which is supported by the European Community and coordinated by the National Library of the Netherlands. It proposes establishing a Centre of Competence after IMPACT to support ongoing work in digitization through tools, resources, training, and community support. The Centre would benefit content holders, researchers, and service providers working in digitization.
[Sommer] [7 into 1. Integration and Collaboration: The new library for Humani...Diane Koen
Presentation made by [Dorothea Sommer] at the IFLA Library Buildings and Equipment Satellite Meeting. Illinois Institute of Technology, Chicago, Aug.10-11, 2016.
Nico Verplancke - Digital archiving at the Waalse KrookiMinds conference
The document discusses plans for establishing a Flemish Institute for Archives and Access to Audiovisual Heritage (VIAA). It notes that a large portion of existing audiovisual material is at risk of being lost, and that the institute would work to preserve Flemish audiovisual heritage for current and future generations. Several past and ongoing projects are mentioned that researched archiving solutions. The document discusses recommendations for the institute's scope and business model, including its role in long-term archiving, digitization, access policies, and whether it should have a physical public space like at 'De Krook' in Ghent. Establishing the institute is seen as an important project that learns from past expertise and requires input from various stakeholders.
The GEN6 project aims to pilot IPv6 upgrades of eGovernment services across Europe. It has a budget of 6 million euros and involves organizations from 8 countries. The project will provide guidelines for IPv6 deployment and document national pilot projects upgrading networks, services, and applications in areas like eGovernment, secure clouds, education, and emergency response. Outcomes will include technical documentation of the pilots and estimates of transition costs and benefits to support broader IPv6 adoption. Dissemination efforts will spread the results to public administrations and other stakeholders.
Experimental Workflow Development in Digitisationcneudecker
The document discusses the IMPACT project, which is developing collaborative workflows for digitizing historical books and newspapers printed before 1900. The project is coordinated by the National Library of the Netherlands and involves 26 partner libraries and research institutes. It aims to create a workflow development platform that allows technical staff to provide tools for library staff to design digitization workflows through a community-driven process. The platform will incorporate modular, transparent, flexible, and extensible architectural principles.
Presentation given by Hildelies Balk during the 2nd LIBER-EBLIDA Workshop on Digitisation of Library Material in Europe (19-21 October 2009, The Hague, the Netherlands)
The document discusses the IMPACT project, which is supported by the European Community and coordinated by the National Library of the Netherlands. It proposes establishing a Centre of Competence after IMPACT to support ongoing work in digitization through tools, resources, training, and community support. The Centre would benefit content holders, researchers, and service providers working in digitization.
[Sommer] [7 into 1. Integration and Collaboration: The new library for Humani...Diane Koen
Presentation made by [Dorothea Sommer] at the IFLA Library Buildings and Equipment Satellite Meeting. Illinois Institute of Technology, Chicago, Aug.10-11, 2016.
Nico Verplancke - Digital archiving at the Waalse KrookiMinds conference
The document discusses plans for establishing a Flemish Institute for Archives and Access to Audiovisual Heritage (VIAA). It notes that a large portion of existing audiovisual material is at risk of being lost, and that the institute would work to preserve Flemish audiovisual heritage for current and future generations. Several past and ongoing projects are mentioned that researched archiving solutions. The document discusses recommendations for the institute's scope and business model, including its role in long-term archiving, digitization, access policies, and whether it should have a physical public space like at 'De Krook' in Ghent. Establishing the institute is seen as an important project that learns from past expertise and requires input from various stakeholders.
The GEN6 project aims to pilot IPv6 upgrades of eGovernment services across Europe. It has a budget of 6 million euros and involves organizations from 8 countries. The project will provide guidelines for IPv6 deployment and document national pilot projects upgrading networks, services, and applications in areas like eGovernment, secure clouds, education, and emergency response. Outcomes will include technical documentation of the pilots and estimates of transition costs and benefits to support broader IPv6 adoption. Dissemination efforts will spread the results to public administrations and other stakeholders.
A presentation of the EuropeanaTech network, http://pro.europeana.eu/europeana-tech, at the Europeana Project Group Meeting, Sept. 2012: http://pro.europeana.eu/pro-blog/-/blogs/breaking-out-of-the-bubble%3A-the-europeana-project-group-meeting
Project "The Digital City Revives, A Case Study of Web Archaeology"Tjarda de Haan
Presentation at the iPRES 2016, 13th International Conference on Digital Preservation. Bern, October 3-6, 2016
By Tjarda de Haan, guest e-curator & web archaeologist at the Amsterdam Museum
Partners:
National Coalition Digital Preservation, Netherlands Institute for Sound and Vision, Old inhabitants, (ex) DDS employees and DDS affiliated web-archeologists, UvA Faculty of Science and Waag Society
Visit:
http://www.dpconline.org/newsroom/latest-news/1777-qthe-digital-city-revivesq-a-case-study-of-web-archaeology
http://hart.amsterdammuseum.nl/re-dds
http://www.bitsandbytesunited.com/?portfolio=publication-the-reconstruction-of-the-digital-city-a-case-study-of-web-archaeology
Digital Cultural Heritage and the new EU Framework Programmelocloud
2nd LoCloud CY Awareness Event at the Ministry of Education and Culture.
Presentation delivered by Marinos Ioannides, Cyprus University of Technology
Cyprus
5 March 2014
The Europeana meeting under the Romanian Presidency, Exposing Online the Euro...Europeana
1. The document discusses common practices among national aggregators that provide access to cultural heritage objects. It covers areas like mission, domains, communication services, staffing, data, and technical infrastructure.
2. Key activities of national aggregators include giving free and high quality access to cultural heritage objects through a single point of access, as well as promoting their country's cultural resources and setting quality standards.
3. The document provides details on common approaches to areas like modules development, hardware infrastructure, metadata mapping and processing, and cooperation with Europeana. It also discusses future trends and makes recommendations around developing a national strategy and framework.
16,40 16,55 h. open aire eblida-naple conferenceFESABID
The document discusses the EU's open access policies and the OpenAIRE project.
The EU requires publications and data from publicly funded research to be made openly accessible. The OpenAIRE project aims to deliver an infrastructure to identify, deposit and provide access to publications from EU-funded projects. It links publications to research projects and provides a repository for "homeless" publications not housed elsewhere.
National Stakeholders Coordination Group Meeting 1- Invitation and agenda v4.0Michael Hübner
Invitation and agenda for the first regular meeting of the European National Stakeholders Coordination Group on Smart Networks for the Energy Transition on March 28 in Brussels.
LoCloud: Local Content in a Europeana Cloudlocloud
IMCW 2013 Conference
Presentation on LoCloud by B. Yılmaz, Ö. Külcü, Y. Ünal & T. Çakmak, Hacettepe University, Turkey
4-6 September 2013
Limerick, Ireland.
The LoCloud lightweight digital library and alternative content sources, Adam...locloud
The document discusses user stories and requirements for a proposed lightweight digital library system called LoCloud L3D. It provides examples of how smaller libraries and archives could use such a system to digitize and share their collections without specialized IT expertise. Key requirements identified include easy creation of metadata, support for multiple content types, customizable interfaces, and the ability to migrate from other digital library systems. Open issues discussed include prioritizing content types and features.
The Presto4U project supports the establishment of the PrestoCentre Foundation to facilitate collaboration and knowledge sharing around digital audiovisual preservation. PrestoCentre brings together archives, commercial partners, and researchers to enhance long-term access to cultural heritage through communities of practice, tools, services, and policy recommendations. It has over 100 archive and affiliate members and 20 commercial members internationally.
EDF2013: Language Technology Panel, Hans-Ulrich von Freyberg: Language Infras...European Data Forum
Language Technology Panel talk of Hans-Ulrich von Freyberg, at the European Data Forum 2013, 10 April 2013 in Dublin, Ireland: Language Infrastructures
Europeana Network Association Members Council Meeting, Copenhagen by Max KaiserEuropeana
The document summarizes the report from the IIIF Task Force on their work to promote the adoption of IIIF within the Europeana network. The key points are:
1. The task force identified trends in IIIF adoption, provided recommendations to Europeana, and proposed establishing a working group to support further adoption by content providers.
2. The recommendations cover raising awareness of IIIF, community involvement, and technical implementation strategies, including two approaches - upgrading digital asset systems or using a service-oriented strategy.
3. Establishing a permanent working group within Europeana is recommended to provide ongoing support and advocacy for IIIF adoption.
Clare Lanigan - Presentation to IES Studentsdri_ireland
Presentation given by Clare Lanigan, DRI Education and Outreach Manager, to students of the School of Information and Library Science, University of North Carolina, at the Institute for the International Education of Students (IES) Abroad centre in Rathmines, Dublin, on 1 June 2017.
The value of EOSC from a user perspective: Key themes and actions from Day 1EOSCpilot .eu
This presentation was held at the 1st EOSC Stakeholder Forum 28-29/11/2017 in Brussels.
For more information on the 1st EOSC Stakeholder Forum visit: https://eoscpilot.eu/eosc-stakeholder-forum-shaping-future-eosc
Follow EOSCpilot on Twitter: https://twitter.com/eoscpilot
and LinkedIn: https://uk.linkedin.com/in/eoscpiloteu
CAPDM Ltd is an expert in XML markup, information architecture, and e-publishing for education providers. They helped design and deliver the EBS eMBA program, Europe's largest distance learning MBA with over 10,000 graduates. The program integrates 800 deliverables across 37 courses in print and online formats in English, Spanish, Chinese, and Arabic. CAPDM believes in well-designed pedagogy and content that can be delivered to any learning management system using XML single source publishing.
Reducing Infrastructure and Service Fragmentation EOSCpilot .eu
This presentation was held at the 1st EOSC Stakeholder Forum 28-29/11/2017 in Brussels.
The presentation gives a detailed overview of 5 Science Demonstrators, namely FusionHPC, Photon-Neutron, TextCrowd, PanCancer.
For more information on the EOSCpilot Science Demonstrators visit: https://eoscpilot.eu/science-demonstrators
For more information on the 1st EOSC Stakeholder Forum visit: https://eoscpilot.eu/eosc-stakeholder-forum-shaping-future-eosc
Follow EOSCpilot on Twitter: https://twitter.com/eoscpilot
and LinkedIn: https://uk.linkedin.com/in/eoscpiloteu
OpenAIRE Advance: A trusted eInfrastructure for the EOSCEOSCpilot .eu
This presentation was held at the 1st EOSC Stakeholder Forum 28-29/11/2017 in Brussels by Yannis Ioannidis, Athena Research and Innovation Centre.
For more information on the 1st EOSC Stakeholder Forum visit: https://eoscpilot.eu/eosc-stakeholder-forum-shaping-future-eosc
Follow EOSCpilot on Twitter: https://twitter.com/eoscpilot
and LinkedIn: https://uk.linkedin.com/in/eoscpiloteu
The document summarizes the Web@rchive Austria project. It discusses how the Austrian National Library archives websites on the internet, including major domains in Austria and important websites that change regularly. It notes some of the challenges in web archiving like the short lifespan of webpages and how content selection must be careful. Examples of archived websites are provided from different time periods to show the project history.
A presentation of the EuropeanaTech network, http://pro.europeana.eu/europeana-tech, at the Europeana Project Group Meeting, Sept. 2012: http://pro.europeana.eu/pro-blog/-/blogs/breaking-out-of-the-bubble%3A-the-europeana-project-group-meeting
Project "The Digital City Revives, A Case Study of Web Archaeology"Tjarda de Haan
Presentation at the iPRES 2016, 13th International Conference on Digital Preservation. Bern, October 3-6, 2016
By Tjarda de Haan, guest e-curator & web archaeologist at the Amsterdam Museum
Partners:
National Coalition Digital Preservation, Netherlands Institute for Sound and Vision, Old inhabitants, (ex) DDS employees and DDS affiliated web-archeologists, UvA Faculty of Science and Waag Society
Visit:
http://www.dpconline.org/newsroom/latest-news/1777-qthe-digital-city-revivesq-a-case-study-of-web-archaeology
http://hart.amsterdammuseum.nl/re-dds
http://www.bitsandbytesunited.com/?portfolio=publication-the-reconstruction-of-the-digital-city-a-case-study-of-web-archaeology
Digital Cultural Heritage and the new EU Framework Programmelocloud
2nd LoCloud CY Awareness Event at the Ministry of Education and Culture.
Presentation delivered by Marinos Ioannides, Cyprus University of Technology
Cyprus
5 March 2014
The Europeana meeting under the Romanian Presidency, Exposing Online the Euro...Europeana
1. The document discusses common practices among national aggregators that provide access to cultural heritage objects. It covers areas like mission, domains, communication services, staffing, data, and technical infrastructure.
2. Key activities of national aggregators include giving free and high quality access to cultural heritage objects through a single point of access, as well as promoting their country's cultural resources and setting quality standards.
3. The document provides details on common approaches to areas like modules development, hardware infrastructure, metadata mapping and processing, and cooperation with Europeana. It also discusses future trends and makes recommendations around developing a national strategy and framework.
16,40 16,55 h. open aire eblida-naple conferenceFESABID
The document discusses the EU's open access policies and the OpenAIRE project.
The EU requires publications and data from publicly funded research to be made openly accessible. The OpenAIRE project aims to deliver an infrastructure to identify, deposit and provide access to publications from EU-funded projects. It links publications to research projects and provides a repository for "homeless" publications not housed elsewhere.
National Stakeholders Coordination Group Meeting 1- Invitation and agenda v4.0Michael Hübner
Invitation and agenda for the first regular meeting of the European National Stakeholders Coordination Group on Smart Networks for the Energy Transition on March 28 in Brussels.
LoCloud: Local Content in a Europeana Cloudlocloud
IMCW 2013 Conference
Presentation on LoCloud by B. Yılmaz, Ö. Külcü, Y. Ünal & T. Çakmak, Hacettepe University, Turkey
4-6 September 2013
Limerick, Ireland.
The LoCloud lightweight digital library and alternative content sources, Adam...locloud
The document discusses user stories and requirements for a proposed lightweight digital library system called LoCloud L3D. It provides examples of how smaller libraries and archives could use such a system to digitize and share their collections without specialized IT expertise. Key requirements identified include easy creation of metadata, support for multiple content types, customizable interfaces, and the ability to migrate from other digital library systems. Open issues discussed include prioritizing content types and features.
The Presto4U project supports the establishment of the PrestoCentre Foundation to facilitate collaboration and knowledge sharing around digital audiovisual preservation. PrestoCentre brings together archives, commercial partners, and researchers to enhance long-term access to cultural heritage through communities of practice, tools, services, and policy recommendations. It has over 100 archive and affiliate members and 20 commercial members internationally.
EDF2013: Language Technology Panel, Hans-Ulrich von Freyberg: Language Infras...European Data Forum
Language Technology Panel talk of Hans-Ulrich von Freyberg, at the European Data Forum 2013, 10 April 2013 in Dublin, Ireland: Language Infrastructures
Europeana Network Association Members Council Meeting, Copenhagen by Max KaiserEuropeana
The document summarizes the report from the IIIF Task Force on their work to promote the adoption of IIIF within the Europeana network. The key points are:
1. The task force identified trends in IIIF adoption, provided recommendations to Europeana, and proposed establishing a working group to support further adoption by content providers.
2. The recommendations cover raising awareness of IIIF, community involvement, and technical implementation strategies, including two approaches - upgrading digital asset systems or using a service-oriented strategy.
3. Establishing a permanent working group within Europeana is recommended to provide ongoing support and advocacy for IIIF adoption.
Clare Lanigan - Presentation to IES Studentsdri_ireland
Presentation given by Clare Lanigan, DRI Education and Outreach Manager, to students of the School of Information and Library Science, University of North Carolina, at the Institute for the International Education of Students (IES) Abroad centre in Rathmines, Dublin, on 1 June 2017.
The value of EOSC from a user perspective: Key themes and actions from Day 1EOSCpilot .eu
This presentation was held at the 1st EOSC Stakeholder Forum 28-29/11/2017 in Brussels.
For more information on the 1st EOSC Stakeholder Forum visit: https://eoscpilot.eu/eosc-stakeholder-forum-shaping-future-eosc
Follow EOSCpilot on Twitter: https://twitter.com/eoscpilot
and LinkedIn: https://uk.linkedin.com/in/eoscpiloteu
CAPDM Ltd is an expert in XML markup, information architecture, and e-publishing for education providers. They helped design and deliver the EBS eMBA program, Europe's largest distance learning MBA with over 10,000 graduates. The program integrates 800 deliverables across 37 courses in print and online formats in English, Spanish, Chinese, and Arabic. CAPDM believes in well-designed pedagogy and content that can be delivered to any learning management system using XML single source publishing.
Reducing Infrastructure and Service Fragmentation EOSCpilot .eu
This presentation was held at the 1st EOSC Stakeholder Forum 28-29/11/2017 in Brussels.
The presentation gives a detailed overview of 5 Science Demonstrators, namely FusionHPC, Photon-Neutron, TextCrowd, PanCancer.
For more information on the EOSCpilot Science Demonstrators visit: https://eoscpilot.eu/science-demonstrators
For more information on the 1st EOSC Stakeholder Forum visit: https://eoscpilot.eu/eosc-stakeholder-forum-shaping-future-eosc
Follow EOSCpilot on Twitter: https://twitter.com/eoscpilot
and LinkedIn: https://uk.linkedin.com/in/eoscpiloteu
OpenAIRE Advance: A trusted eInfrastructure for the EOSCEOSCpilot .eu
This presentation was held at the 1st EOSC Stakeholder Forum 28-29/11/2017 in Brussels by Yannis Ioannidis, Athena Research and Innovation Centre.
For more information on the 1st EOSC Stakeholder Forum visit: https://eoscpilot.eu/eosc-stakeholder-forum-shaping-future-eosc
Follow EOSCpilot on Twitter: https://twitter.com/eoscpilot
and LinkedIn: https://uk.linkedin.com/in/eoscpiloteu
The document summarizes the Web@rchive Austria project. It discusses how the Austrian National Library archives websites on the internet, including major domains in Austria and important websites that change regularly. It notes some of the challenges in web archiving like the short lifespan of webpages and how content selection must be careful. Examples of archived websites are provided from different time periods to show the project history.
Europeana Newspapers - the Gateway to European Newspapers Onlinecneudecker
Europeana Newspapers - the Gateway to European Newspapers Online
IFLA 2013 Satellite Meeting on Newspaper & Genloc Sections, Science Centre Singapore, 14-15 August 2013, Singapore.
Refinement
Europeana Newspapers Workshop: A Gateway to European Newspapers Online. Research Information Infrastructures and the Future Role of Libraries.
LIBER 2013 Annual Conference, Bavarian State Library, 26-29 June 2013, Munich, Germany.
Climbing the Tower of Babel: Challenges and Opportunities in Multilingual Dat...cneudecker
LIDER Workshop: Datos Enlazados y Multilingüismo para la Lingüística y las Humanidades Digitales, 20 October 2015, Madrid, Spain, http://lider-project.eu/workshopMadrid/
OCR challenges in historic documents and the contribution of IMPACTcneudecker
OCR challenges in historic documents and the contribution of IMPACT
IFLA 2010 Satellite Meeting "New Techniques for Old Documents", 16-18 August 2010, Uppsala, Sweden.
The IMPACT Interoperability Framework - Workflows for OCR and beyondcneudecker
The IMPACT Interoperability Framework - Workflows for OCR and beyond
Better, faster, cheaper. Solutions of the IMPACT Centre of Competence and future challenges, The British Library, 24-25 October 2011, London, United Kingdom.
Collaborative Workflow Development and Experimentation in the Digital Humanitiescneudecker
A Service-Oriented-Architecture for Collaborative Workflow Development and Experimentation in the Digital Humanities
2012 Leipzig eHumanities Seminar, 10 October 2012, Leipzig, Germany.
The Elephant in the Library - Integrating Hadoopcneudecker
This document discusses integrating Hadoop into libraries to help scale up their digitization efforts of cultural heritage materials. It provides background on two libraries' digitization projects and data volumes. It then outlines challenges of scaling up like use cases exploring document recognition, file format migration, and web archiving using Hadoop. Scenarios demonstrate running analytics on book metadata and page images and web archives stored in Hadoop.
Who cares about yesterday's news? Use cases and requirements for newspaper digitization. Presentation held at IFLA News Media Conference 2016, 20-22 April, Hamburg, Germany.
This document discusses the use of scientific workflows and Taverna, a workflow management system, for digital preservation processes. It provides background on scientific workflows and why they are useful for automating repetitive tasks and chaining components. Taverna allows linking of various services and tools for file format identification, migration, validation, quality assurance, and other preservation uses. Examples of preservation workflows in Taverna are provided, including validation of file formats using web services and applying duplicate detection to book page images. Scaling workflows to large datasets using Hadoop is also discussed.
An Experimental Workflow Development Platform for Historical Document Digitis...cneudecker
An Experimental Workflow Development Platform for Historical Document Digitisation and Analysis
International Workshop on Historical Document Imaging and Processing (HIP).
ICDAR 2011, 16-17 September 2011, Beijing, China.
The document discusses IMPACT, a project supported by the European Community to develop a uniform technical framework for end users to work with digital library tools and applications. The framework is built on open source components and standards and uses a service-oriented architecture. It allows tools to be transformed into web services and combined into workflows for tasks like optical character recognition. The project is coordinated by the National Library of the Netherlands and evaluates workflows using datasets and ground truths.
The document discusses a project called IMPACT that is supported by the European Community under its FP7 ICT Work Programme. The project is coordinated by the National Library of the Netherlands. It focuses on developing computer lexicons to help with optical character recognition (OCR) and information retrieval of historical documents.
The Europeana Cloud project aims to build a shared digital infrastructure for cultural heritage institutions in Europe. The project has 35 partner institutions and will develop tools and services for researchers. It will ingest 2.4 million metadata records and 5 million content items. Work Package 1 focuses on engaging with humanities and social sciences researchers to understand their needs and inform the development of the cloud infrastructure and services. Activities include forming an advisory board, conducting surveys, and holding expert forums to help define a research content strategy and user requirements. The goal is to better support research through aggregation of content in the cloud.
Europeana Cloud Work Package 1: Assessing Researchers' Needs in the CloudTU Delft, Netherlands
A presentation given about Work Package 1 of the Europeana Cloud project http://pro.europeana.eu/web/europeana-cloud
By Agiatis Bernadou and Alastair Dunning
Given at http://dighumlab.dk/news/single-news/artikel/cfp-cultural-heritage-creative-tools-and-archives-workshop/, June 2013
This document discusses optical character recognition (OCR) of historical newspapers. It describes the digitization process, which includes image capturing, text and structure recognition, natural language processing, and content representation. OCR accuracy can be improved through layout analysis, structural metadata extraction, and identifying different content units like articles, advertisements, and entertainment sections. The goal is to make the content and knowledge within digitized newspapers accessible beyond the scanned text.
Targeted Language Resources for the Digitisation of Historical CollectionsEmma Huber
The document discusses the IMPACT project, which is supported by the European Community under the FP7 ICT Work Programme. The project aims to develop targeted language resources for digitizing historical collections. It is coordinated by the National Library of the Netherlands. The document outlines some of the challenges of optical character recognition (OCR) and information retrieval (IR) on historical texts due to factors like image quality, historical language variants, and unknown words. It proposes the development of specialized language resources like lexicons and language models to help address these challenges.
The document discusses the IMPACT project, which is supported by the European Community to improve optical character recognition (OCR). It is coordinated by the National Library of the Netherlands. The document outlines the OCR process of binarization, segmentation, and pattern matching. It highlights improvements made by the IMPACT project to these steps. Specifically, it details how IMPACT developed better algorithms for binarization, segmentation of text blocks and lines, and recognition of languages like Fraktur.
The IMPACT project is supported by the European Community under its FP7 ICT Work Programme and coordinated by the National Library of the Netherlands. It aims to address challenges in digitizing historical materials through technical solutions for tasks like document warping correction, OCR, named entity recognition and collaborative correction. The project involves 13 universities and research centers and 2 industry partners.
Science Demonstrator Session: Social and Earth SciencesEOSCpilot .eu
The main focus of Science Demonstrator sessions is to provide feedback to the EOSC community on the first experience of science demonstrators in the practical use of the emerging EOSC ecosystem.
Each panel will consist of a representative of a Science Demonstrator that will provide an overview of their experiences in the use of emerging EOSC services.
These sessions will help members of the scientific communities understanding the current state of maturity of the EOSC ecosystem and what is obtainable in a field of scientific research. It is also valuable to prospective Service Providers who wish to discover what are the challenges and opportunities that user communities might have to deal with, as a result of the adoption of their services.
This session will focus on Social and Earth Sciences.
ECLAP White paper, social network for Cultural Heritage on Peforming artsPaolo Nesi
the experience of a new generation digital content service is presented, namely ECLAP (European Collected Library of Artistic Performance, http://www.eclap.eu). ECLAP is a live lab in which several new technologies and solutions in the area of semantic computing and social media have been developed and put under trial of the final users and institutions. On this regard, ECLAP is open for both content and results experimentations, and presently comprises more than 35 prestigious international institutions; ECLAP provides services and tools for automated content ingestion, adaptation, metadata ingestion and editing, semantic information extraction, indexing and distribution by exploiting the most innovative and consolidated technologies. ECLAP supports the institutions in all their activities: metadata selection and mapping, content ingestion, to the definition and management of permissions and licenses on contents, and finally managing their users on ECLAP services. According to ECLAP workflow, the obtained metadata are sent to Europeana only after that the metadata have been enriched and linked to a reachable digital resource and when the IPR details have been finalized, with needed quality level. An ECLAP IPR Model can be associated with each single content or collection. ECLAP also provides infrastructural connection for direct promotion of content towards a large number of social networks, including: Facebook, LinkedIn, Diggs, Twitter, etc. On ECLAP, each content provider may have its own distribution channel/group (including a forum and a blog in addition to the space for their content collections, and the groups can be open, moderated or private) with the possibility of customizing the group user interface according to their logo and colours. This multitenant modality permits at the institutions to see ECLAP as a non-intrusive service, to reinforce their brand and at the same time to exploit and experiment a number of innovative ECLAP tools, to accelerate the promotion exploiting ECLAP social media, LOD and Europeana channels, and ready to access new users for their content. ECLAP provides the unique videos, images and texts related to more than 50 years of activity of the Dario Fo and Franca Rame theatre company, featuring videos, photos, texts, drawings, paintings, sketches, posters, copies of contracts and of invoices, notes, books, articles. Other unique, irreplaceable material includes video, audio recordings and photos of performances, workshops, seminars, rehearsals of Jerzy Grotowski, Peter Brook, Gennadi Bogdanov, Anatolij Vasil’ev, Alberto Sordi, Carmelo Bene, Giorgio Strehler, Mimmo Cuticchio, Gian Maria Volonté, Judith Malina.
Europeana Cloud - Alastair Dunning - November 2013Europeana
What is the Europeana Cloud project doing and why? Find out in this presentation from Alastair Dunning, project coordinator, from The European Library.
Slides from Aly Conteh's introduction to IMPACT project at the British Library Demo-day on the 12th July 2011.
Making Text Digitisation - Faster, Better & Cheaper
Science Demonstrator Session: Physics and AstrophysicsEOSCpilot .eu
The main focus of Science Demonstrator sessions is to provide feedback to the EOSC community on the first experience of science demonstrators in the practical use of the emerging EOSC ecosystem.
Each panel will consist of a representative of a Science Demonstrator that will provide an overview of their experiences in the use of emerging EOSC services.
These sessions will help members of the scientific communities understanding the current state of maturity of the EOSC ecosystem and what is obtainable in a field of scientific research. It is also valuable to prospective Service Providers who wish to discover what are the challenges and opportunities that user communities might have to deal with, as a result of the adoption of their services.
This session will focus on Physics and Astrophysics.
EuropeanaTech x AI: Qurator.ai @ Berlin State Librarycneudecker
The EuropeanaTech Community and Europeana Foundation are delighted to introduce a new webinar series to explore the opportunities and challenges of working with Artificial Intelligence in the cultural heritage and arts sector.
Digitisation and Digital Humanities - what is the role of Libraries?cneudecker
The document discusses the role of libraries in digitization and digital humanities. It provides an overview of the Berlin State Library's digitization efforts including its in-house digitization center that produces 1.7M images annually. It also describes the library's digital collections portal containing over 180,000 digitized documents. Additionally, it outlines several projects involving newspaper digitization, optical character recognition improvement, named entity recognition, and developing an experimental space for digital research.
Multimodal Perspectives for Digitised Historical Newspaperscneudecker
This document discusses challenges and opportunities in analyzing digitized historical newspapers. It describes several projects aimed at improving OCR accuracy using deep learning models, extracting structural information using computer vision and heuristics, and establishing standards for metadata and evaluation. Key challenges include the need for more granular and representative ground truth newspaper data, methods that combine machine learning and domain knowledge, and community efforts around shared tasks, seminars, and an atlas of digitized newspapers to advance interdisciplinary research. The overall goal is to make cultural heritage collections more accessible online through improved digitization and analysis of newspapers.
OCR-D: An end-to-end open source OCR framework for historical printed documentscneudecker
OCR-D is an open source framework for optical character recognition (OCR) of historical printed documents. It consists of a coordination project and 8 module projects that develop technical solutions for challenges in OCR of historical prints. The goals are to standardize metadata, annotations, and formats to enable large-scale OCR of historical texts. OCR-D provides specifications, reference implementations, ground truth data, and scientific workflows to support development and evaluation of OCR tools and methods for historical documents.
Extrablatt: The Latest News on Newspaper Digitisation in Europecneudecker
This document summarizes recent developments in newspaper digitization projects across Europe. It discusses Germany's efforts to establish a national newspaper portal and increase availability of digitized newspapers through a DFG funding call. It also briefly outlines newspaper digitization work in other countries like the UK, Sweden, Denmark, and Switzerland. Finally, it provides an overview of the Europeana Newspapers project and efforts to find a new home for its 10TB of digitized newspaper data, as well as growing interest from digital humanities researchers in utilizing digitized historical newspapers.
The Europeana Newspapers project digitized over 1,000 newspaper titles containing 3.3 million issues from 12 European libraries in 40 languages from 1618-2016. The newspapers were run through optical character recognition to make 12 million pages searchable by keyword. Metadata and scans were made public domain and searchable through the TEL Historic Newspaper Browser, which allows browsing by newspaper, date, and other facets. Researchers have used the collection for various studies and it will relaunch in 2018 with improved search and an interface directly on Europeana, supporting further annotation and transcription of the newspapers.
OpenID AuthZEN Interop Read Out - AuthorizationDavid Brossard
During Identiverse 2024 and EIC 2024, members of the OpenID AuthZEN WG got together and demoed their authorization endpoints conforming to the AuthZEN API
Cosa hanno in comune un mattoncino Lego e la backdoor XZ?Speck&Tech
ABSTRACT: A prima vista, un mattoncino Lego e la backdoor XZ potrebbero avere in comune il fatto di essere entrambi blocchi di costruzione, o dipendenze di progetti creativi e software. La realtà è che un mattoncino Lego e il caso della backdoor XZ hanno molto di più di tutto ciò in comune.
Partecipate alla presentazione per immergervi in una storia di interoperabilità, standard e formati aperti, per poi discutere del ruolo importante che i contributori hanno in una comunità open source sostenibile.
BIO: Sostenitrice del software libero e dei formati standard e aperti. È stata un membro attivo dei progetti Fedora e openSUSE e ha co-fondato l'Associazione LibreItalia dove è stata coinvolta in diversi eventi, migrazioni e formazione relativi a LibreOffice. In precedenza ha lavorato a migrazioni e corsi di formazione su LibreOffice per diverse amministrazioni pubbliche e privati. Da gennaio 2020 lavora in SUSE come Software Release Engineer per Uyuni e SUSE Manager e quando non segue la sua passione per i computer e per Geeko coltiva la sua curiosità per l'astronomia (da cui deriva il suo nickname deneb_alpha).
Ocean lotus Threat actors project by John Sitima 2024 (1).pptxSitimaJohn
Ocean Lotus cyber threat actors represent a sophisticated, persistent, and politically motivated group that poses a significant risk to organizations and individuals in the Southeast Asian region. Their continuous evolution and adaptability underscore the need for robust cybersecurity measures and international cooperation to identify and mitigate the threats posed by such advanced persistent threat groups.
Programming Foundation Models with DSPy - Meetup SlidesZilliz
Prompting language models is hard, while programming language models is easy. In this talk, I will discuss the state-of-the-art framework DSPy for programming foundation models with its powerful optimizers and runtime constraint system.
In the rapidly evolving landscape of technologies, XML continues to play a vital role in structuring, storing, and transporting data across diverse systems. The recent advancements in artificial intelligence (AI) present new methodologies for enhancing XML development workflows, introducing efficiency, automation, and intelligent capabilities. This presentation will outline the scope and perspective of utilizing AI in XML development. The potential benefits and the possible pitfalls will be highlighted, providing a balanced view of the subject.
We will explore the capabilities of AI in understanding XML markup languages and autonomously creating structured XML content. Additionally, we will examine the capacity of AI to enrich plain text with appropriate XML markup. Practical examples and methodological guidelines will be provided to elucidate how AI can be effectively prompted to interpret and generate accurate XML markup.
Further emphasis will be placed on the role of AI in developing XSLT, or schemas such as XSD and Schematron. We will address the techniques and strategies adopted to create prompts for generating code, explaining code, or refactoring the code, and the results achieved.
The discussion will extend to how AI can be used to transform XML content. In particular, the focus will be on the use of AI XPath extension functions in XSLT, Schematron, Schematron Quick Fixes, or for XML content refactoring.
The presentation aims to deliver a comprehensive overview of AI usage in XML development, providing attendees with the necessary knowledge to make informed decisions. Whether you’re at the early stages of adopting AI or considering integrating it in advanced XML development, this presentation will cover all levels of expertise.
By highlighting the potential advantages and challenges of integrating AI with XML development tools and languages, the presentation seeks to inspire thoughtful conversation around the future of XML development. We’ll not only delve into the technical aspects of AI-powered XML development but also discuss practical implications and possible future directions.
Unlocking Productivity: Leveraging the Potential of Copilot in Microsoft 365, a presentation by Christoforos Vlachos, Senior Solutions Manager – Modern Workplace, Uni Systems
TrustArc Webinar - 2024 Global Privacy SurveyTrustArc
How does your privacy program stack up against your peers? What challenges are privacy teams tackling and prioritizing in 2024?
In the fifth annual Global Privacy Benchmarks Survey, we asked over 1,800 global privacy professionals and business executives to share their perspectives on the current state of privacy inside and outside of their organizations. This year’s report focused on emerging areas of importance for privacy and compliance professionals, including considerations and implications of Artificial Intelligence (AI) technologies, building brand trust, and different approaches for achieving higher privacy competence scores.
See how organizational priorities and strategic approaches to data security and privacy are evolving around the globe.
This webinar will review:
- The top 10 privacy insights from the fifth annual Global Privacy Benchmarks Survey
- The top challenges for privacy leaders, practitioners, and organizations in 2024
- Key themes to consider in developing and maintaining your privacy program
Full-RAG: A modern architecture for hyper-personalizationZilliz
Mike Del Balso, CEO & Co-Founder at Tecton, presents "Full RAG," a novel approach to AI recommendation systems, aiming to push beyond the limitations of traditional models through a deep integration of contextual insights and real-time data, leveraging the Retrieval-Augmented Generation architecture. This talk will outline Full RAG's potential to significantly enhance personalization, address engineering challenges such as data management and model training, and introduce data enrichment with reranking as a key solution. Attendees will gain crucial insights into the importance of hyperpersonalization in AI, the capabilities of Full RAG for advanced personalization, and strategies for managing complex data integrations for deploying cutting-edge AI solutions.
Monitoring and Managing Anomaly Detection on OpenShift.pdfTosin Akinosho
Monitoring and Managing Anomaly Detection on OpenShift
Overview
Dive into the world of anomaly detection on edge devices with our comprehensive hands-on tutorial. This SlideShare presentation will guide you through the entire process, from data collection and model training to edge deployment and real-time monitoring. Perfect for those looking to implement robust anomaly detection systems on resource-constrained IoT/edge devices.
Key Topics Covered
1. Introduction to Anomaly Detection
- Understand the fundamentals of anomaly detection and its importance in identifying unusual behavior or failures in systems.
2. Understanding Edge (IoT)
- Learn about edge computing and IoT, and how they enable real-time data processing and decision-making at the source.
3. What is ArgoCD?
- Discover ArgoCD, a declarative, GitOps continuous delivery tool for Kubernetes, and its role in deploying applications on edge devices.
4. Deployment Using ArgoCD for Edge Devices
- Step-by-step guide on deploying anomaly detection models on edge devices using ArgoCD.
5. Introduction to Apache Kafka and S3
- Explore Apache Kafka for real-time data streaming and Amazon S3 for scalable storage solutions.
6. Viewing Kafka Messages in the Data Lake
- Learn how to view and analyze Kafka messages stored in a data lake for better insights.
7. What is Prometheus?
- Get to know Prometheus, an open-source monitoring and alerting toolkit, and its application in monitoring edge devices.
8. Monitoring Application Metrics with Prometheus
- Detailed instructions on setting up Prometheus to monitor the performance and health of your anomaly detection system.
9. What is Camel K?
- Introduction to Camel K, a lightweight integration framework built on Apache Camel, designed for Kubernetes.
10. Configuring Camel K Integrations for Data Pipelines
- Learn how to configure Camel K for seamless data pipeline integrations in your anomaly detection workflow.
11. What is a Jupyter Notebook?
- Overview of Jupyter Notebooks, an open-source web application for creating and sharing documents with live code, equations, visualizations, and narrative text.
12. Jupyter Notebooks with Code Examples
- Hands-on examples and code snippets in Jupyter Notebooks to help you implement and test anomaly detection models.
Fueling AI with Great Data with Airbyte WebinarZilliz
This talk will focus on how to collect data from a variety of sources, leveraging this data for RAG and other GenAI use cases, and finally charting your course to productionalization.
HCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAUpanagenda
Webinar Recording: https://www.panagenda.com/webinars/hcl-notes-und-domino-lizenzkostenreduzierung-in-der-welt-von-dlau/
DLAU und die Lizenzen nach dem CCB- und CCX-Modell sind für viele in der HCL-Community seit letztem Jahr ein heißes Thema. Als Notes- oder Domino-Kunde haben Sie vielleicht mit unerwartet hohen Benutzerzahlen und Lizenzgebühren zu kämpfen. Sie fragen sich vielleicht, wie diese neue Art der Lizenzierung funktioniert und welchen Nutzen sie Ihnen bringt. Vor allem wollen Sie sicherlich Ihr Budget einhalten und Kosten sparen, wo immer möglich. Das verstehen wir und wir möchten Ihnen dabei helfen!
Wir erklären Ihnen, wie Sie häufige Konfigurationsprobleme lösen können, die dazu führen können, dass mehr Benutzer gezählt werden als nötig, und wie Sie überflüssige oder ungenutzte Konten identifizieren und entfernen können, um Geld zu sparen. Es gibt auch einige Ansätze, die zu unnötigen Ausgaben führen können, z. B. wenn ein Personendokument anstelle eines Mail-Ins für geteilte Mailboxen verwendet wird. Wir zeigen Ihnen solche Fälle und deren Lösungen. Und natürlich erklären wir Ihnen das neue Lizenzmodell.
Nehmen Sie an diesem Webinar teil, bei dem HCL-Ambassador Marc Thomas und Gastredner Franz Walder Ihnen diese neue Welt näherbringen. Es vermittelt Ihnen die Tools und das Know-how, um den Überblick zu bewahren. Sie werden in der Lage sein, Ihre Kosten durch eine optimierte Domino-Konfiguration zu reduzieren und auch in Zukunft gering zu halten.
Diese Themen werden behandelt
- Reduzierung der Lizenzkosten durch Auffinden und Beheben von Fehlkonfigurationen und überflüssigen Konten
- Wie funktionieren CCB- und CCX-Lizenzen wirklich?
- Verstehen des DLAU-Tools und wie man es am besten nutzt
- Tipps für häufige Problembereiche, wie z. B. Team-Postfächer, Funktions-/Testbenutzer usw.
- Praxisbeispiele und Best Practices zum sofortigen Umsetzen
Climate Impact of Software Testing at Nordic Testing DaysKari Kakkonen
My slides at Nordic Testing Days 6.6.2024
Climate impact / sustainability of software testing discussed on the talk. ICT and testing must carry their part of global responsibility to help with the climat warming. We can minimize the carbon footprint but we can also have a carbon handprint, a positive impact on the climate. Quality characteristics can be added with sustainability, and then measured continuously. Test environments can be used less, and in smaller scale and on demand. Test techniques can be used in optimizing or minimizing number of tests. Test automation can be used to speed up testing.
Taking AI to the Next Level in Manufacturing.pdfssuserfac0301
Read Taking AI to the Next Level in Manufacturing to gain insights on AI adoption in the manufacturing industry, such as:
1. How quickly AI is being implemented in manufacturing.
2. Which barriers stand in the way of AI adoption.
3. How data quality and governance form the backbone of AI.
4. Organizational processes and structures that may inhibit effective AI adoption.
6. Ideas and approaches to help build your organization's AI strategy.
Building Production Ready Search Pipelines with Spark and MilvusZilliz
Spark is the widely used ETL tool for processing, indexing and ingesting data to serving stack for search. Milvus is the production-ready open-source vector database. In this talk we will show how to use Spark to process unstructured data to extract vector representations, and push the vectors to Milvus vector database for search serving.
HCL Notes and Domino License Cost Reduction in the World of DLAUpanagenda
Webinar Recording: https://www.panagenda.com/webinars/hcl-notes-and-domino-license-cost-reduction-in-the-world-of-dlau/
The introduction of DLAU and the CCB & CCX licensing model caused quite a stir in the HCL community. As a Notes and Domino customer, you may have faced challenges with unexpected user counts and license costs. You probably have questions on how this new licensing approach works and how to benefit from it. Most importantly, you likely have budget constraints and want to save money where possible. Don’t worry, we can help with all of this!
We’ll show you how to fix common misconfigurations that cause higher-than-expected user counts, and how to identify accounts which you can deactivate to save money. There are also frequent patterns that can cause unnecessary cost, like using a person document instead of a mail-in for shared mailboxes. We’ll provide examples and solutions for those as well. And naturally we’ll explain the new licensing model.
Join HCL Ambassador Marc Thomas in this webinar with a special guest appearance from Franz Walder. It will give you the tools and know-how to stay on top of what is going on with Domino licensing. You will be able lower your cost through an optimized configuration and keep it low going forward.
These topics will be covered
- Reducing license cost by finding and fixing misconfigurations and superfluous accounts
- How do CCB and CCX licenses really work?
- Understanding the DLAU tool and how to best utilize it
- Tips for common problem areas, like team mailboxes, functional/test users, etc
- Practical examples and best practices to implement right away
GraphRAG for Life Science to increase LLM accuracyTomaz Bratanic
GraphRAG for life science domain, where you retriever information from biomedical knowledge graphs using LLMs to increase the accuracy and performance of generated answers
In his public lecture, Christian Timmerer provides insights into the fascinating history of video streaming, starting from its humble beginnings before YouTube to the groundbreaking technologies that now dominate platforms like Netflix and ORF ON. Timmerer also presents provocative contributions of his own that have significantly influenced the industry. He concludes by looking at future challenges and invites the audience to join in a discussion.
Removing Uninteresting Bytes in Software FuzzingAftab Hussain
Imagine a world where software fuzzing, the process of mutating bytes in test seeds to uncover hidden and erroneous program behaviors, becomes faster and more effective. A lot depends on the initial seeds, which can significantly dictate the trajectory of a fuzzing campaign, particularly in terms of how long it takes to uncover interesting behaviour in your code. We introduce DIAR, a technique designed to speedup fuzzing campaigns by pinpointing and eliminating those uninteresting bytes in the seeds. Picture this: instead of wasting valuable resources on meaningless mutations in large, bloated seeds, DIAR removes the unnecessary bytes, streamlining the entire process.
In this work, we equipped AFL, a popular fuzzer, with DIAR and examined two critical Linux libraries -- Libxml's xmllint, a tool for parsing xml documents, and Binutil's readelf, an essential debugging and security analysis command-line tool used to display detailed information about ELF (Executable and Linkable Format). Our preliminary results show that AFL+DIAR does not only discover new paths more quickly but also achieves higher coverage overall. This work thus showcases how starting with lean and optimized seeds can lead to faster, more comprehensive fuzzing campaigns -- and DIAR helps you find such seeds.
- These are slides of the talk given at IEEE International Conference on Software Testing Verification and Validation Workshop, ICSTW 2022.
1. IMPACT is supported by the European Community under the FP7 ICT Work Programme. The project is coordinated by the National Library of the Netherlands.
A sustainable infrastructure for
large scale document image analysis
HPC Cloud day – 4 October
2. IMPACT is supported by the European Community under the FP7 ICT Work Programme. The project is coordinated by the National Library of the Netherlands.
Background
IMPACT – Improving Access to Text (2008 – 2011)
Large-scale integrating research project, funded by the EC
– Consortium of 26 partners
– Coordinated by the National Library of the Netherlands (KB)
– EU funding: € 12 100 000 (FP7 ICT Work Programme)
– From 2012: sustainable Centre of Competence with alternative
2
resources
Main objectives:
- Innovate OCR technology
- Capacity building in mass-digitisation
3. IMPACT is supported by the European Community under the FP7 ICT Work Programme. The project is coordinated by the National Library of the Netherlands.
OCR: A multitude of challenges…
VVt Venetien den 1.Junij, Anno 1618.
DJgn i f paffato te S' aö'Jifeert mo?üen/bah .)etgi'uotbciraetail)i.r/JtmelchontDecht te /
sbnbe bele btr felbrr geiufttceert baer bnber eeniglje jprant o^fen/bie ftcb .met
beSpaenfcbeu enbeeemgljen bifet Cbeiiupcen berbonbru befe
3
4. IMPACT is supported by the European Community under the FP7 ICT Work Programme. The project is coordinated by the National Library of the Netherlands.
OCR: A multitude of challenges…
• I. OCR challenges (gothic fonts, bleed-through, warping, etc.)
5. IMPACT is supported by the European Community under the FP7 ICT Work Programme. The project is coordinated by the National Library of the Netherlands.
OCR: A multitude of challenges…
• II. Language challenges (spelling variants, inflection, and many
more!)
Example: historical variants of the Dutch word ‘wereld’ (world):
werelt weerelt wereld weerelds wereldt werelden weereld werrelts waerelds weerlyt
wereldts vveerelts waereld weerelden waerelden weerlt werlt werelds sweerels
zwerlys swarels swerelts werelts swerrels weirelts tsweerelds werret vverelt werlts
werrelt worreld werlden wareld weirelt weireld waerelt werreld werld vvereld weerelts
werlde tswerels werreldts weereldt wereldje waereldje weurlt wald weëled
6. IMPACT is supported by the European Community under the FP7 ICT Work Programme. The project is coordinated by the National Library of the Netherlands.
6
IMPACT Solutions
From a technical perspective:
> 20 software toolkits for solving different problems
Such as:
OCR (C++, C#),
Image Processing & Lexica (DLL),
Command Line Tools (Win/Linux),
Java, Ruby, PHP, Perl, etc.
IMPACT Interoperability Framework (IIF)
7. IMPACT is supported by the European Community under the FP7 ICT Work Programme. The project is coordinated by the National Library of the Netherlands.
7
Architecture
IMPACT Interoperability Framework: Technologies
- Java 6
- Generic Web Service Wrapper
- Apache Maven
- Apache Tomcat
- Apache Axis2
- Apache Synapse
- Taverna Workflow Engine
IMPACT Interoperability Framework: Dataset
- PHP/mySQL database, frontend for search
- approx. 5 TB raw data (images, text files, metadata) and growing
8. IMPACT is supported by the European Community under the FP7 ICT Work Programme. The project is coordinated by the National Library of the Netherlands.
How does it work?
1. Digitisation/OCR challenges registered and tagged in database
8
Warped text
2. Database contains 99,95% correct result: “ground truth”
9. IMPACT is supported by the European Community under the FP7 ICT Work Programme. The project is coordinated by the National Library of the Netherlands.
How does it work?
3. Researcher develops new method to tackle a problem
4. Research prototype is wrapped to a SOAP web service
9
10. IMPACT is supported by the European Community under the FP7 ICT Work Programme. The project is coordinated by the National Library of the Netherlands.
10
How does it work?
5. Web service is integrated as a workflow module
6. Workflow module can be evaluated, based on the ground truth
11. IMPACT is supported by the European Community under the FP7 ICT Work Programme. The project is coordinated by the National Library of the Netherlands.
11
Current setup
Enterprise Service Bus
receives requests from
users and distributes
the load to the available
worker nodes (= server
with all services installed)
Main effect:
Process parallelization,
Load distribution,
Fail over
Drawback:
Data is sent to worker nodes all around Europe =
huge amount of data needs to be sent over the net!
12. IMPACT is supported by the European Community under the FP7 ICT Work Programme. The project is coordinated by the National Library of the Netherlands.
Proposed setup
Set up worker nodes on the HPC cloud (same location)
Advantage:
- Improve speed and availability for concurrent users
- Remove constraints for large-scale processing
12
13. IMPACT is supported by the European Community under the FP7 ICT Work Programme. The project is coordinated by the National Library of the Netherlands.
Benefits
Scalable platform
Availability of resources to a large number of users
Enable research into scalable computing
Consolidation of support and maintenance
Various interfaces (web/local)
13
Editor's Notes
DDD: Krantentitel: Courante uyt Italien, Duytslandt, &c .Datum, editie: 14-06-1618, Dag Uitgever: Caspar van Hilten Plaats van Uitgave: Amsterdam. Actual selection from online database