Summary of the Deployment Scenarios
and Functional Requirements
Evangelos Motesnitsalis
Technical Coordinator
ARCHIVER Consolidation Event
5 June 2019
5 June 2019 Summary of the Deployment Scenarios and Functional Requirements 2
Contents
Recap
Common Characteristics
Service Layers Mapping
Testing plans
Summary and Next Steps
Recap of Deployment Scenarios
High Energy Physics Deployment Scenarios
The BaBar Experiment
In 2020 the BaBar Experiment infrastructure at SLAC will be decommissioned. As a result, the 2 PB
of BaBar data can no longer be stored at the host laboratory and alternative solutions need to be
found. Currently a copy of the data is being held by CERN IT. We want to ensure that a complete
copy of BaBar data will be retained for possible comparisons with data from other experiments
and sharing through the CERN Open Data Portal.
CERN Open Data Portal
The CERN Open Data Portal disseminates close to 2 PB of primary and derived datasets from
particle physics, as released by the LHC Collaborations, and is used for both education
and research purposes. The CERN Open Data Service Managers seek easy-to-use, easy-to-achieve,
independent archiving and backup for its holdings, based on SIPs [Submission Information
Packages], with intelligent and reliable disaster recovery mechanisms.
CERN Digital Memory
We want to archive the ~1.5 PB of CERN Digital Memory, containing digitized analog documents
produced by the institution in the 20th century as well as the digital production of the 21st
century, including new types like web sites, social media, emails, etc.
Life Sciences Deployment Scenarios
EMBL on FIRE
EMBL-EBI provides data archiving services to the global molecular biology community. These
data archives are currently based on an internal service (FIRE: FIle REplication) that stores the
files in two different systems: a distributed object store and tape.
FIRE currently holds 20 PB of data and is growing at 40% per year. We want to ensure that:
• FIRE can achieve cost-effective scaling via cloud-based storage solutions
• Data can effectively be distributed on cloud infrastructure, covering the increasing needs for cloud-hosted analysis
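To give a feel for what 40% annual growth on 20 PB implies, here is a minimal sketch. It assumes the growth rate stays constant and compounds yearly, which the slides do not state; the function name and the 5-year horizon are illustrative choices only.

```python
def project_volume_pb(current_pb: float, annual_growth: float, years: int) -> list[float]:
    """Return the projected volume in PB for year 0 through `years`.

    Assumes constant compound growth, which is a simplification.
    """
    return [current_pb * (1 + annual_growth) ** year for year in range(years + 1)]


if __name__ == "__main__":
    # 20 PB today, growing at 40% per year (figures from the FIRE scenario).
    for year, volume in enumerate(project_volume_pb(20.0, 0.40, 5)):
        print(f"year {year}: {volume:.1f} PB")
```

Under these assumptions the holdings roughly double every two years, so any storage solution sized for today's volume would be outgrown quickly.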
EMBL Cloud Data Caching
As research communities access more and more of the internal data from cloud services for their
data analysis, it makes sense to progressively cache data in the cloud, with the on-premises
data being replicated and discarded as required. Which data should be cached, how much, and
for how long will be a tradeoff between the cost of cloud storage and the cost of the network
capacity and latency incurred by downloading the data multiple times.
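The caching tradeoff can be sketched as a simple cost comparison. This is only an illustration: the cost model (a flat storage price per TB per month and a flat transfer price per TB), the parameter names, and all figures are assumptions, and latency is ignored entirely.

```python
def should_cache(
    size_tb: float,
    expected_reads: int,
    months: int,
    storage_cost_per_tb_month: float,
    transfer_cost_per_tb: float,
) -> bool:
    """Return True if caching in the cloud is cheaper than repeated downloads.

    Hypothetical model: caching costs one transfer plus storage for `months`;
    not caching costs one transfer per expected read.
    """
    cache_cost = size_tb * transfer_cost_per_tb + size_tb * storage_cost_per_tb_month * months
    refetch_cost = size_tb * transfer_cost_per_tb * expected_reads
    return cache_cost < refetch_cost


if __name__ == "__main__":
    # Illustrative prices only, not taken from any provider.
    print(should_cache(size_tb=10, expected_reads=5, months=6,
                       storage_cost_per_tb_month=2.0, transfer_cost_per_tb=20.0))
```

In this model a dataset that is read often over a short period is worth caching, while a rarely read dataset is cheaper to fetch on demand.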
Astronomy Deployment Scenarios
The MAGIC Cherenkov gamma-ray telescopes and the PAUcam camera for the William
Herschel Telescope are located at the Observatorio del Roque de los Muchachos, in the Canary
Islands, Spain. The first Large-Sized Telescope of the next-generation Cherenkov Telescope
Array (CTA) is also there. They produce about 0.3 PB of raw data per year, which is
automatically sent to PIC in Barcelona.
PIC Large File Storage
We want to replace the current in-house tape library storage. Each instance of the
service to be purchased is the 5-year safe-keeping of a yearly dataset from a single source.
PIC Mixed File Remote Storage
We also want to be able to archive the derived datasets from at most two sources,
which become part of the yearly dataset. In addition, at any time during the 4 years following the
creation of the data, additional versions of derived datasets may need to be uploaded.
PIC Data Distribution
We also want to replace the Hierarchical Storage Manager, disk storage and data
distribution service. Each instance of the service to be purchased is the 5-year safe-keeping
and data distribution of a yearly dataset and its derived datasets.
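As a rough sizing aid for the PIC scenarios, the sketch below converts the 0.3 PB of raw data per year into an average transfer rate and a retained volume under 5-year safe-keeping. It assumes decimal units (1 PB = 10^15 bytes), a 365-day year, and perfectly even data flow; real telescope output is bursty, so peak rates would be higher. Derived datasets are not counted.

```python
SECONDS_PER_YEAR = 365 * 24 * 3600
BYTES_PER_PB = 10**15  # decimal petabyte, an assumption


def average_rate_mb_per_s(pb_per_year: float) -> float:
    """Average sustained rate in MB/s needed to move `pb_per_year` in one year."""
    return pb_per_year * BYTES_PER_PB / SECONDS_PER_YEAR / 10**6


def retained_volume_pb(pb_per_year: float, retention_years: int) -> float:
    """Raw volume held at steady state when each yearly dataset is kept for `retention_years`."""
    return pb_per_year * retention_years


if __name__ == "__main__":
    print(f"average rate: {average_rate_mb_per_s(0.3):.2f} MB/s")
    print(f"retained raw volume: {retained_volume_pb(0.3, 5):.1f} PB")
```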
Photon Science Deployment Scenarios
PETRA III is the world's most brilliant storage-ring-based X-ray source for high-energy photons, with 22
beamlines distributed over three experimental halls and concurrently available to users. The European
XFEL is the world's largest X-ray laser, generating 27 000 ultrashort X-ray flashes per second with a brilliance that
is a billion times higher than that of the best conventional X-ray radiation sources.
PETRA III / EuXFEL – Individual Scientist
Individual scientists at DESY need a service to create archives for their experiment data as well as their
publications, with specific capabilities such as data ingestion via browser or third-party copies.
PETRA III / EuXFEL – Manual Data Archiving
Experiment managers want to be able to create, manage, and delete archives via APIs/CLIs based on accepted
data policies, supporting a wide range of options for cloud and on-prem storage, while being able to use
existing user credentials, authentication techniques and identification mechanisms.
PETRA III / EuXFEL – Integrated Data Archiving
Long-lived collaborations have a growing need to plan and execute archiving operations in a fully
automated, policy-based, certified, and documented way, based on APIs.
Common Characteristics
FAIR Principles
Findable
• Global, unique identifiers
• Rich metadata, indexes, search capabilities
Accessible
• Retrievable with free protocols
• Accessible metadata even after deletion
Interoperable
• Qualified reference to other data
• Formal, shared and broadly applicable knowledge representation standards
Re-Usable
• Accurate and relevant description
• Data usage license and detailed provenance
https://www.go-fair.org/
OAIS Reference Model
Common Characteristics
Scientific Data Storage in the PB Range
Solid needs for Federated AAI Services
Sustained Data Ingest Rates
Access to GEANT Network
Development under the OAIS Reference Model and FAIR Principles
Data Privacy and Compliance
Significant Monitoring Requirements
Sustainable Business Models and Costs
Service Layers Mapping
Service Layers and Deployment Scenarios Mappings
Layer 1: Storage/Basic Archiving/Secure backup
• Data integrity/security; cloud/hybrid deployment
• Data volume in the PB range; high, sustained ingest data rates
• ISO certification: 27000, 27040, 19086 and related standards
• Archives connected to the GEANT network
Layer 2: Preservation
• OAIS conformant services: data readability formats, normalization, obsolescence monitoring, file fixity, authenticity checks, etc.
• ISO 14721/16393, 26324 and related standards
Layer 3: Baseline user services
• User services: search, discover, share, indexing, data removal, etc.
• Access under Federated IAM
Layer 4: Advanced services
• High level services: visual representation of data (domain specific), reproducibility of scientific analyses, etc.
Deployment scenarios mapped to these layers:
EMBL1 – FIRE
PIC2 – Mixed File Remote Storage
DESY1 – PETRA III / EuXFEL
CERN3 – CERN Open Data
CERN2 – CERN Digital Memory
CERN1 – The BaBar Experiment
PIC3 – Data Distribution
EMBL2 – Cloud Caching
PIC1 – Large File Storage
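The preservation layer lists file fixity among the OAIS conformant services. A minimal sketch of what such a check could look like follows. It assumes a SHA-256 checksum was recorded when the file was ingested; the choice of algorithm and the function names are illustrative and not taken from the Functional Specifications.

```python
import hashlib
from pathlib import Path


def sha256_of(path: Path, chunk_size: int = 1 << 20) -> str:
    """Compute the SHA-256 hex digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with path.open("rb") as handle:
        # Read fixed-size chunks so large files never have to fit in memory.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def fixity_ok(path: Path, recorded_checksum: str) -> bool:
    """Return True if the file still matches the checksum recorded at ingest."""
    return sha256_of(path) == recorded_checksum.lower()
```

An archiving service would typically run such a comparison periodically and after every copy or migration, flagging any mismatch for repair from a replica.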
Testing Plans
The Buyers Group will request demo access to the current product offerings during
the Design Phase.
Testing will focus on Functionality for the Prototype Phase and Performance,
Scalability, and Reliability for the Pilot Phase.
The Buyers Group will provide a set of tests derived from the Buyers Group
deployment scenarios and the Functional Specifications.
The tests will have clear assessment criteria for pass/fail.
The Buyers Group expects to deploy tests only after a clear indication from the
contractor that the contractor has already run the tests successfully.
We plan to present the initial set of tests by the Design Phase Kick-off.
Assessment of the test results will have implications for the assessment of the
respective phase results and for the payments to be executed.
Basic Functionality Testing Examples
Ingestion:
ability to submit a particular dataset of X size to the Archiving Service within time Y
Access:
ability to recall a particular part of a file, file or dataset within time Y
Monitoring and Dashboard:
ability to access displayed information via a web browser and trigger basic management functions,
e.g. data deletion, fixity checks, etc.
Audit and Log:
ability to access detailed access logs for a particular file/dataset
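The "dataset of X size within time Y" examples suggest tests with a time budget and an unambiguous pass/fail outcome. The sketch below shows one possible shape for such a test. The `action` callable is a hypothetical stand-in for whatever call submits or recalls data from the Archiving Service; no real service API is implied.

```python
import time
from dataclasses import dataclass
from typing import Callable


@dataclass
class TimedResult:
    name: str
    passed: bool
    elapsed_s: float


def run_timed_test(name: str, action: Callable[[], bool], max_seconds: float) -> TimedResult:
    """Run `action` and pass only if it reports success within `max_seconds`."""
    start = time.monotonic()
    succeeded = action()
    elapsed = time.monotonic() - start
    return TimedResult(name, succeeded and elapsed <= max_seconds, elapsed)


if __name__ == "__main__":
    # Placeholder action: a real test would submit a dataset of size X here.
    result = run_timed_test("ingest dataset", lambda: True, max_seconds=5.0)
    print(result.name, "PASS" if result.passed else "FAIL")
```

Keeping the time limit as an explicit parameter lets the same test be reused with stricter limits in later phases.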
Summary
Overview
C3 – CERN Open Data
C1 – The BaBar Experiment
C2 – CERN Digital Memory
P1 – Large File Storage
P3 – Data Distribution
P2 – Mixed File Remote Storage
E1 – FIRE
E2 – Cloud Caching
D1 – PETRA III / EuXFEL
Summary and Next Steps
The primary goal for all the Deployment Scenarios is the preservation and long-term archiving of data in the PB
range with high sustained ingest rates for complex data types.
If this can be achieved easily, all the scenarios would benefit greatly from an added Software Reproducibility and Open
Data Distribution Layer on top of the archiving solution.
These deployment scenarios exhibit many similarities, such as the complex scientific data types, the need for
federated AAI services, the significant monitoring requirements, and the development under OAIS and FAIR.
We welcome your feedback on the draft of the “Functional Specifications” documents until 14 June.
The Buyers Group will co-design and co-develop a test plan with you:
• The plan will be based on the outcome of the Design Phase, the Functional Specifications document, and the needs of the
Deployment Scenarios
• The test assessment will be a deciding factor in qualifying solutions for the subsequent phases
• The tests will focus on basic functionality during the prototype phase and on performance, efficiency, and
scalability during the pilot phase

The Essentials of Digital Experience Monitoring_ A Comprehensive Guide.pdf
 
Der Spagat zwischen BIAS und FAIRNESS (2024)
Der Spagat zwischen BIAS und FAIRNESS (2024)Der Spagat zwischen BIAS und FAIRNESS (2024)
Der Spagat zwischen BIAS und FAIRNESS (2024)
 
Introduction to Decentralized Applications (dApps)
Introduction to Decentralized Applications (dApps)Introduction to Decentralized Applications (dApps)
Introduction to Decentralized Applications (dApps)
 
Advancing Engineering with AI through the Next Generation of Strategic Projec...
Advancing Engineering with AI through the Next Generation of Strategic Projec...Advancing Engineering with AI through the Next Generation of Strategic Projec...
Advancing Engineering with AI through the Next Generation of Strategic Projec...
 
Short Story: Unveiling the Reasoning Abilities of Large Language Models by Ke...
Short Story: Unveiling the Reasoning Abilities of Large Language Models by Ke...Short Story: Unveiling the Reasoning Abilities of Large Language Models by Ke...
Short Story: Unveiling the Reasoning Abilities of Large Language Models by Ke...
 
DNT_Corporate presentation know about us
DNT_Corporate presentation know about usDNT_Corporate presentation know about us
DNT_Corporate presentation know about us
 
Adobe Marketo Engage Deep Dives: Using Webhooks to Transfer Data
Adobe Marketo Engage Deep Dives: Using Webhooks to Transfer DataAdobe Marketo Engage Deep Dives: Using Webhooks to Transfer Data
Adobe Marketo Engage Deep Dives: Using Webhooks to Transfer Data
 
Alluxio Monthly Webinar | Cloud-Native Model Training on Distributed Data
Alluxio Monthly Webinar | Cloud-Native Model Training on Distributed DataAlluxio Monthly Webinar | Cloud-Native Model Training on Distributed Data
Alluxio Monthly Webinar | Cloud-Native Model Training on Distributed Data
 
Cloud Management Software Platforms: OpenStack
Cloud Management Software Platforms: OpenStackCloud Management Software Platforms: OpenStack
Cloud Management Software Platforms: OpenStack
 
Project Based Learning (A.I).pptx detail explanation
Project Based Learning (A.I).pptx detail explanationProject Based Learning (A.I).pptx detail explanation
Project Based Learning (A.I).pptx detail explanation
 
Unveiling the Tech Salsa of LAMs with Janus in Real-Time Applications
Unveiling the Tech Salsa of LAMs with Janus in Real-Time ApplicationsUnveiling the Tech Salsa of LAMs with Janus in Real-Time Applications
Unveiling the Tech Salsa of LAMs with Janus in Real-Time Applications
 
Russian Call Girls in Karol Bagh Aasnvi ➡️ 8264348440 💋📞 Independent Escort S...
Russian Call Girls in Karol Bagh Aasnvi ➡️ 8264348440 💋📞 Independent Escort S...Russian Call Girls in Karol Bagh Aasnvi ➡️ 8264348440 💋📞 Independent Escort S...
Russian Call Girls in Karol Bagh Aasnvi ➡️ 8264348440 💋📞 Independent Escort S...
 
5 Signs You Need a Fashion PLM Software.pdf
5 Signs You Need a Fashion PLM Software.pdf5 Signs You Need a Fashion PLM Software.pdf
5 Signs You Need a Fashion PLM Software.pdf
 
Reassessing the Bedrock of Clinical Function Models: An Examination of Large ...
Reassessing the Bedrock of Clinical Function Models: An Examination of Large ...Reassessing the Bedrock of Clinical Function Models: An Examination of Large ...
Reassessing the Bedrock of Clinical Function Models: An Examination of Large ...
 
Salesforce Certified Field Service Consultant
Salesforce Certified Field Service ConsultantSalesforce Certified Field Service Consultant
Salesforce Certified Field Service Consultant
 
Building a General PDE Solving Framework with Symbolic-Numeric Scientific Mac...
Building a General PDE Solving Framework with Symbolic-Numeric Scientific Mac...Building a General PDE Solving Framework with Symbolic-Numeric Scientific Mac...
Building a General PDE Solving Framework with Symbolic-Numeric Scientific Mac...
 
EY_Graph Database Powered Sustainability
EY_Graph Database Powered SustainabilityEY_Graph Database Powered Sustainability
EY_Graph Database Powered Sustainability
 

Summary of the Deployment Scenarios and Functional Requirements

  • 1. Summary of the Deployment Scenarios and Functional Requirements. Evangelos Motesnitsalis, Technical Coordinator. ARCHIVER Consolidation Event, 5 June 2019.
  • 2. Contents: Recap; Common Characteristics; Service Layers Mapping; Testing Plans; Summary and Next Steps
  • 4. High Energy Physics Deployment Scenarios
      The BaBar Experiment: In 2020 the BaBar Experiment infrastructure at SLAC will be decommissioned. As a result, the 2 PB of BaBar data can no longer be stored at the host laboratory and alternative solutions need to be found. CERN IT currently holds a copy of the data. We want to ensure that a complete copy of the BaBar data will be retained for possible comparisons with data from other experiments and for sharing through the CERN Open Data Portal.
      CERN Open Data Portal: The CERN Open Data Portal disseminates close to 2 PB of primary and derived datasets from particle physics, as released by the LHC collaborations, and is used for both education and research. The CERN Open Data service managers seek easy-to-use, easy-to-achieve independent archiving and backup for the portal's holdings, based on SIPs (Submission Information Packages), with intelligent and reliable disaster recovery mechanisms.
      CERN Digital Memory: We want to archive the ~1.5 PB of CERN Digital Memory, which contains digitized analog documents produced by the institution in the 20th century as well as its digital production of the 21st century, including new types such as websites, social media, and emails.
  • 5. Life Sciences Deployment Scenarios
      EMBL on FIRE: EMBL-EBI provides data archiving services to the global molecular biology community. These data archives are currently based on an internal service (FIRE: FIle REplication) that stores the files in two different systems: a distributed object store and tape. FIRE currently holds 20 PB of data and is growing at 40% per year. We want to ensure that:
        • FIRE can achieve cost-effective scaling via cloud-based storage solutions
        • Data can be distributed effectively on cloud infrastructure, covering the increasing need for cloud-hosted analysis
      EMBL Cloud Data Caching: As research communities access more and more internal data from cloud services for their data analysis, it makes sense to progressively cache data in the cloud, with the on-premises data being replicated and discarded as required. Which data should be cached, how much, and for how long will be a tradeoff between the cost of cloud storage and the cost of the network capacity and latency needed to download the data multiple times.
  • 6. Astronomy Deployment Scenarios
      The MAGIC Cherenkov gamma-ray telescopes and the PAUcam camera for the William Herschel Telescope are located at the Observatorio del Roque de los Muchachos, in the Canary Islands, Spain. The first Large Scale Telescope of the next-generation Cherenkov Telescope Array (CTA) is also there. Together they produce about 0.3 PB of raw data per year, which is automatically sent to PIC in Barcelona.
      PIC Large File Storage: We want to replace the current in-house tape library storage. Each instance of the service to be purchased is the 5-year safekeeping of a yearly dataset from a single source.
      PIC Mixed File Remote Storage: We also want to be able to archive the derived datasets from at most two sources, which become part of the yearly dataset. In addition, at any time during the 4 years following the creation of the data, additional versions of derived datasets may need to be uploaded.
      PIC Data Distribution: We also want to replace the Hierarchical Storage Manager, disk storage, and data distribution service. Each instance of the service to be purchased is the 5-year safekeeping and data distribution of a yearly dataset and its derived datasets.
  • 7. Photon Science Deployment Scenarios
      PETRA III is the world's most brilliant storage-ring-based X-ray source for high-energy photons, with 22 beamlines, distributed over three experimental halls, concurrently available to users. The European XFEL is the world's largest X-ray laser, generating 27 000 ultrashort X-ray flashes per second with a brilliance a billion times higher than that of the best conventional X-ray radiation sources.
      PETRA III / EuXFEL – Individual Scientist: Individual scientists at DESY need a service to create archives for their experiment data as well as their publications, with specific capabilities such as data ingestion via browser or third-party copies.
      PETRA III / EuXFEL – Manual Data Archiving: Experiment managers want to be able to create, manage, and delete archives via APIs and CLIs, based on accepted data policies and supporting a wide range of options for cloud and on-premises storage, while being able to use existing user credentials, authentication techniques, and identification mechanisms.
      PETRA III / EuXFEL – Integrated Data Archiving: Long-lived collaborations have a growing need to plan and execute archiving operations in a fully automated, policy-based, certified, and documented way, based on APIs.
  • 9. FAIR Principles (https://www.go-fair.org/)
      Findable: global, unique identifiers; rich metadata, indexes, search capabilities
      Accessible: retrievable with free protocols; accessible metadata even after deletion
      Interoperable: qualified reference to other data; formal, shared and broadly applicable knowledge representation standards
      Re-Usable: accurate and relevant description; data usage license and detailed provenance
  • 10. OAIS Reference Model
  • 11. Common Characteristics
        • Scientific Data Storage in the PB Range
        • Solid needs for Federated AAI Services
        • Sustained Data Ingest Rates
        • Access to GEANT Network
        • Development under the OAIS Reference Model and FAIR Principles
        • Data Privacy and Compliance
        • Significant Monitoring Requirements
        • Sustainable Business Models and Costs
  • 13. Service Layers and Deployment Scenarios Mappings
      Layer 1, Storage / Basic Archiving / Secure backup: data integrity/security; cloud/hybrid deployment; data volume in the PB range; high, sustained ingest data rates; ISO certification: 27000, 27040, 19086 and related standards; archives connected to the GEANT network
      Layer 2, Preservation: OAIS conformant services: data readability formats, normalization, obsolescence monitoring, file fixity, authenticity checks, etc.; ISO 14721/16393, 26324 and related standards
      Layer 3, Baseline user services: user services: search, discover, share, indexing, data removal, etc.; access under Federated IAM
      Layer 4, Advanced services: high-level services: visual representation of data (domain specific), reproducibility of scientific analyses, etc.
      Deployment scenarios mapped: EMBL1 – FIRE; EMBL2 – Cloud Caching; PIC1 – Large File Storage; PIC2 – Mixed File Remote Storage; PIC3 – Data Distribution; DESY1 – PETRA III / EUXFEL; CERN1 – The BaBar Experiment; CERN2 – CERN Digital Memory; CERN3 – CERN Open Data
  • 15. Testing Plans
        • The Buyers Group will request demo access to the current product offerings during the Design Phase.
        • Testing will focus on functionality in the Prototype Phase and on performance, scalability, and reliability in the Pilot Phase.
        • The Buyers Group will provide a set of tests derived from the Buyers Group deployment scenarios and the Functional Specifications. The tests will have clear pass/fail assessment criteria.
        • The Buyers Group expects to deploy tests only after a clear indication from the contractor that the contractor has already run them successfully.
        • We plan to present the initial set of tests by the Design Phase kick-off.
        • The assessment of the test results will affect the assessment of the respective phase results and the payments to be executed.
  • 16. Basic Functionality Testing Examples
        • Ingestion: ability to submit a particular dataset of size X to the Archiving Service within time Y
        • Access: ability to recall a particular part of a file, a file, or a dataset within time Y
        • Monitoring and Dashboard: ability to access the displayed information via a web browser and trigger basic management functions, e.g. data deletion, fixity checks, etc.
        • Audit and Log: ability to access detailed access logs for a particular file or dataset
  • 18. Overview
        • C1 – The BaBar Experiment
        • C2 – CERN Digital Memory
        • C3 – CERN Open Data
        • P1 – Large File Remote Storage
        • P2 – Mixed File Remote Storage
        • P3 – Data Distribution
        • E1 – FIRE
        • E2 – Cloud Caching
        • D1 – PETRA III / EUXFEL
  • 19. Summary and Next Steps
        • The primary goal for all the Deployment Scenarios is the preservation and long-term archiving of data in the PB range, with high sustained ingest rates for complex data types.
        • If this can be achieved easily, all the scenarios would benefit greatly from an added Software Reproducibility and Open Data Distribution Layer on top of the archiving solution.
        • These deployment scenarios exhibit many similarities, such as the complex scientific data types, the need for federated AAI services, the significant monitoring requirements, and the development under OAIS and FAIR.
        • We welcome your feedback on the draft of the “Functional Specifications” documents until 14 June.
        • The Buyers Group will co-design and co-develop a test plan with you:
            • The plan will be based on the outcome of the Design Phase, the Functional Specifications document, and the needs of the Deployment Scenarios
            • The test assessment will be a deciding factor in qualifying solutions for the subsequent phases
            • The tests will focus on basic functionality during the Prototype Phase and on performance, efficiency, and scalability during the Pilot Phase
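
The pass/fail style of the basic functionality tests (a dataset of size X submitted within time Y, plus fixity checks) could be expressed roughly as in the sketch below. It is illustrative only: the slides do not define any test harness, so the names `IngestResult`, `ingest_passes`, `sustained_rate_gbps`, and `fixity_ok` are hypothetical, the thresholds are placeholders, and SHA-256 is an assumed choice of checksum rather than one stated in the presentation.

```python
import hashlib
from dataclasses import dataclass


@dataclass
class IngestResult:
    """Measured outcome of one ingestion test run (hypothetical structure)."""
    dataset_bytes: int      # "X": size of the submitted dataset
    elapsed_seconds: float  # measured time to complete the submission


def ingest_passes(result: IngestResult, limit_seconds: float) -> bool:
    """Pass if the dataset was fully submitted within the time limit "Y"."""
    return result.elapsed_seconds <= limit_seconds


def sustained_rate_gbps(result: IngestResult) -> float:
    """Average ingest rate in gigabits per second (1 Gb = 10**9 bits)."""
    return result.dataset_bytes * 8 / result.elapsed_seconds / 1e9


def fixity_ok(payload: bytes, expected_sha256: str) -> bool:
    """Fixity check: recompute the checksum and compare it with the recorded one.

    SHA-256 is an assumption made for this sketch.
    """
    return hashlib.sha256(payload).hexdigest() == expected_sha256
```

For example, a 1 TB dataset (10**12 bytes) ingested in 1000 seconds corresponds to an average rate of 8 Gb/s; it would pass a 1200-second limit and fail a 900-second one.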