SlideShare a Scribd company logo
1 of 13
Download to read offline
Services for the EOSC:
Test Suite
João Fernandes (CERN)
Outline
• Cloud Validation and Testing: Why?
• Timeline & Context
• TestSuite & Scientific workload deployments
• EOSC Roadmap & Benefits for the Community
2
Cloud Validation & Testing: Why?
• Cloud Commodity Services? Perhaps, but they still need validation
for multiple domain research use cases
• Which type of GPUs you provide? Do you provide Quantum HW?
• Which Libraries do you offer for ML? Do you offer fast interconnects?
• At which scale you offer resources? Which regions are they available? At
what speed and path do you interconnect to the GÉANT network?
• Several research domains, several cloud platforms available
• Not feasible for manually run tests
• Validation needs to scale in type, cloud stack and number of services
3
Timeline & Context
4
2021
2020
2016
2019
CERN contribution - OCRE Test Suite
5
• Automated
• Deployment with open technologies: Ansible,
Terraform, Docker, K8s
• Heterogeneous
• Containerised tests to be deployed with all
dependencies, providing working examples to
researchers in how to deploy applications
• Central Repository
• Collects tests and validation results in a structured
manner (CERN S3 service)
• Results Dashboard
• Allows organisations to get a results overview,
dashboard consumes JSONs files from the CERN S3
bucket
Process to include tests
• Discussion with developers to include the use case
• Assessment of the work to be done and include requirements
• Collection of information
• Documentation, Contact Person and Applicable License
•Licensing established
• Test Suite: developed by CERN, FOSS under AGPL
• Test License: Responsibility of Test Owners
6
OCRE - Deployments
7
Benchmark / Test Description and details Run on Domain Covered Contributor
Data Repatriation
Simple Data Repatriation: exporting data from the public
cloud to Zenodo.
Single-node, 8 cores and ~30GB memory.
Accessibility and
Network
Connectivity
CERN
DODAS
Validation of generation of public cloud clusters on-demand
for batch workload execution.
Single-node, 8 cores and ~30GB memory. Compute (CPU) INFN
HEP CPU
Benchmarks
CPU benchmarking based on reduced versions of several
real-world physics workloads.
Single-node, 8 cores and ~30GB memory.
Default configurations.
Compute (CPU) CERN
Networking
performance tests
with perfSONAR
End to end network measurements using perfSONAR.
• Buyer-side endpoint was pse01-gva.cern.ch (used IPv4)
• Latency measured with ping
• Trace measured with traceroute
• Throughput measured with iperf3, single stream
Provider-side endpoint ran on a 8-core and
~30GB VM.
Networking and
Connectivity
ESnet, GÉANT,
Indiana U.,
Internet2, U. of
Michigan RNP,
CERN
Single node GAN
training (ProGAN)
Satellite image analysis and generation using Progressive
Growing GANs. Configurations:
• images_amount: 100
• kimg: 300
Single-node cluster with 1 NVIDIA V100
card.
Compute (GPU)
and machine
learning services
UNOSAT & CERN
Openlab
Distributed GAN
training (NNLO)
Distributed training of Generative Adversarial Networks.
Configurations:
• epochs: 10
• benchmark: nnlo
• datasetSize: 30
Six-node cluster with 1 NVIDIA V100 per
node, hence totalling to 6 GPUs.
Compute (GPU),
machine learning
services and
distributed
computing
CERN Openlab
COSBench and S3
validation
Cloud Object Storage Benchmarking and testing. Default
configurations.
No VM utilised in this case, COSBench’s
source server ran on the CERN Openstack
cloud.
Connectivity,
storage, and APIs
Intel
Additional Information: https://eosc-testsuite.readthedocs.io/en/latest/testsCatalog.html
OCRE - Providers validated by CERN
8
Validation categories defined in OCRE
9
Category 1
The cloud provider offers an extended range of mature services, integration of software, hardware and service options at
scale that can cover a wide number of research applications at scale. This includes, for example, Machine Learning, HPC,
and even Quantum Computing.
User interfaces are offered in different modes (Console, API, CLI), and are straightforward and intuitive for users ranging
from beginner to advanced.
Category 2
The cloud provider offers a range of mature services, with a good level of integration of software and hardware; in some
cases, supporting service options that can cover several research applications including, for example, Machine Learning.
User interfaces are offered in different modes (Console, API, CLI), and their configuration is straightforward and intuitive for
users ranging from beginner to advanced.
Category 3
The cloud provider offers a limited number of services, with very little integration of software and hardware.
User interfaces in different modes (Console, API, CLI) are not always present and require users to be familiar with aspects
of cloud architectures and resource-provisioning methods.
Category 4
The cloud provider offers a very limited number of services where only a small number of applications can be deployed with
virtually no integration across the hardware and software stacks.
User Interfaces in different modes (Console, API, CLI) are not always present and require users to be familiar with aspects
of cloud architectures and resource provisioning methods.
Category 5
The cloud provider offers a customised limited type of service where only specific applications can be deployed at a limited
scale.
Criteria: ease of access, service maturity, scale, integration of h/w & s/w stacks
Test Suite: Benefits for the community
10
Working examples of technical deployments
○ Raising awareness of cloud technology/interfaces/costing/optimization
○ Build skills to make informed choices about the best cloud solution to solve a given research problem
Accumulated practical technical experience with 20+ cloud providers
○ Ability to quickly run small samples of real representative workloads
○ Documented recommendations and guidance, based on the experience
○ Lessons learned from the testing and validation activity
○ types of resources supported, software stacks, network connectivity, etc.
Test Suite framework ready to use for cloud procurements
○ Adapted to be used during a market survey of a procurement exercise
○ Cloud offers are commodity, but vary in capabilities and technological implementation
○ Test Suite exposes those differences to research organisations for their benefit
References
11
● Repository
○ https://github.com/cern-it-efp/EOSC-Testsuite
● Documentation
○ https://eosc-testsuite.readthedocs.io/en/latest/
○ OCRE Deliverable D4.2 - Lessons learned, recommendations and guidance for
research organisations
● Recorded Demos of the Test Suite
○ EGI conference 2021
o https://www.youtube.com/watch?v=KENk4KnFmhs
○ EGI conference 2020
o https://www.youtube.com/watch?v=ZznFp9IlGR0
EOSC Service Operation Model Proposal
12
EOSC Compliance
Testing
Neutral validation and testing of cloud provider capabilities; Providing working examples of deployments;
To be used either before a procurement action, feeding realistic information to the tender exercise, or during contract
execution for effective contract monitoring;
Monitoring of cloud providers regions, verify the use of EC member states for data sovereignty reasons.
EOSC Test Suite interface shall follow the EOSC branding.
Support Roles
Breakdown of responsibilities between Test Suite team and Cloud vendors:
Test runs are executed by the Test Suite team, end researchers, or cloud vendor architects.
Validation results are to be accessed by researchers on all platforms. Each service provider can have access only to its
results.
Cloud service providers: direct support lines covering multiple time zones and are contractually responsible for ensuring that
services remain available to be validated, manageable and secure.
Documentation & Code
Repositories
Documentation is reviewed to allow the tool to be deployed independently.
Extensive documentation is maintained using the RST language, whose source code is available in the Test Suite’s GitHub
repository.
Code compiles into a set of HTML files, which are available online.
The repositories must remain accessible and ensuring the following, README files updated, Correct management of
branches and monitor issues and pull requests, as an open-source project.
Development &
Maintenance Effort
1.5 FTE Software Development or similar background
EOSC Procurement recently announced (35M): https://ted.europa.eu/udl?uri=TED:NOTICE:234660-2022:TEXT:EN:HTML
Thank You!

More Related Content

Similar to 01-06 OCRE Test Suite - Fernandes.pdf

Openstack_administration
Openstack_administrationOpenstack_administration
Openstack_administration
Ashish Sharma
 

Similar to 01-06 OCRE Test Suite - Fernandes.pdf (20)

Enabling the Deployment of Edge Services with the Open Network Edge Services ...
Enabling the Deployment of Edge Services with the Open Network Edge Services ...Enabling the Deployment of Edge Services with the Open Network Edge Services ...
Enabling the Deployment of Edge Services with the Open Network Edge Services ...
 
CloudLightning and the OPM-based Use Case
CloudLightning and the OPM-based Use CaseCloudLightning and the OPM-based Use Case
CloudLightning and the OPM-based Use Case
 
Summit 16: The Hitchhiker/Hacker's Guide to NFV Benchmarking
Summit 16: The Hitchhiker/Hacker's Guide to NFV BenchmarkingSummit 16: The Hitchhiker/Hacker's Guide to NFV Benchmarking
Summit 16: The Hitchhiker/Hacker's Guide to NFV Benchmarking
 
Learn more about the tremendous value Open Data Plane brings to NFV
Learn more about the tremendous value Open Data Plane brings to NFVLearn more about the tremendous value Open Data Plane brings to NFV
Learn more about the tremendous value Open Data Plane brings to NFV
 
Current & Future Use-Cases of OpenDaylight
Current & Future Use-Cases of OpenDaylightCurrent & Future Use-Cases of OpenDaylight
Current & Future Use-Cases of OpenDaylight
 
2307 - DevBCN - Otel 101_compressed.pdf
2307 - DevBCN - Otel 101_compressed.pdf2307 - DevBCN - Otel 101_compressed.pdf
2307 - DevBCN - Otel 101_compressed.pdf
 
Bridging Concepts and Practice in eScience via Simulation-driven Engineering
Bridging Concepts and Practice in eScience via Simulation-driven EngineeringBridging Concepts and Practice in eScience via Simulation-driven Engineering
Bridging Concepts and Practice in eScience via Simulation-driven Engineering
 
Openstack_administration
Openstack_administrationOpenstack_administration
Openstack_administration
 
HNSciCloud: Project Results and lessons learned
HNSciCloud: Project Results and lessons learnedHNSciCloud: Project Results and lessons learned
HNSciCloud: Project Results and lessons learned
 
OS for AI: Elastic Microservices & the Next Gen of ML
OS for AI: Elastic Microservices & the Next Gen of MLOS for AI: Elastic Microservices & the Next Gen of ML
OS for AI: Elastic Microservices & the Next Gen of ML
 
Democratizing machine learning on kubernetes
Democratizing machine learning on kubernetesDemocratizing machine learning on kubernetes
Democratizing machine learning on kubernetes
 
Feec telecom-nw-softwarization-aug-2015
Feec telecom-nw-softwarization-aug-2015Feec telecom-nw-softwarization-aug-2015
Feec telecom-nw-softwarization-aug-2015
 
NFV features in kubernetes
NFV features in kubernetesNFV features in kubernetes
NFV features in kubernetes
 
OIF Transport SDN Interop - ECOC 2016
OIF Transport SDN Interop - ECOC 2016OIF Transport SDN Interop - ECOC 2016
OIF Transport SDN Interop - ECOC 2016
 
Modernizing Testing as Apps Re-Architect
Modernizing Testing as Apps Re-ArchitectModernizing Testing as Apps Re-Architect
Modernizing Testing as Apps Re-Architect
 
Summit 16: Multi-site OPNFV Testing Challenges
Summit 16: Multi-site OPNFV Testing ChallengesSummit 16: Multi-site OPNFV Testing Challenges
Summit 16: Multi-site OPNFV Testing Challenges
 
Microservices.pdf
Microservices.pdfMicroservices.pdf
Microservices.pdf
 
Enabling Production Grade Containerized Applications through Policy Based Inf...
Enabling Production Grade Containerized Applications through Policy Based Inf...Enabling Production Grade Containerized Applications through Policy Based Inf...
Enabling Production Grade Containerized Applications through Policy Based Inf...
 
Future Internet: Managing Innovation and Testbed
Future Internet: Managing Innovation and TestbedFuture Internet: Managing Innovation and Testbed
Future Internet: Managing Innovation and Testbed
 
The RECAP Project: Large Scale Simulation Framework
The RECAP Project: Large Scale Simulation FrameworkThe RECAP Project: Large Scale Simulation Framework
The RECAP Project: Large Scale Simulation Framework
 

More from OCRE | Open Clouds for Research Environments

More from OCRE | Open Clouds for Research Environments (20)

01-15 Towards a Dutch Comprehensive Childhood Cancer Commons.pdf
01-15 Towards a Dutch Comprehensive Childhood Cancer Commons.pdf01-15 Towards a Dutch Comprehensive Childhood Cancer Commons.pdf
01-15 Towards a Dutch Comprehensive Childhood Cancer Commons.pdf
 
Sentinel-2 Semantic Data & Information Cube Austria
Sentinel-2 Semantic Data & Information Cube AustriaSentinel-2 Semantic Data & Information Cube Austria
Sentinel-2 Semantic Data & Information Cube Austria
 
presentation_OCRE_Demontis_Lisbona_2002_12_07.pdf
presentation_OCRE_Demontis_Lisbona_2002_12_07.pdfpresentation_OCRE_Demontis_Lisbona_2002_12_07.pdf
presentation_OCRE_Demontis_Lisbona_2002_12_07.pdf
 
02-01 Quantifying war damage in Ukraine - Yailymova.pdf
02-01 Quantifying war damage in Ukraine - Yailymova.pdf02-01 Quantifying war damage in Ukraine - Yailymova.pdf
02-01 Quantifying war damage in Ukraine - Yailymova.pdf
 
02-02 Assessment of coastline change over the last 38 years - Gafeira, Jones.pdf
02-02 Assessment of coastline change over the last 38 years - Gafeira, Jones.pdf02-02 Assessment of coastline change over the last 38 years - Gafeira, Jones.pdf
02-02 Assessment of coastline change over the last 38 years - Gafeira, Jones.pdf
 
01-05 EO Community Engagement.pdf
01-05 EO Community Engagement.pdf01-05 EO Community Engagement.pdf
01-05 EO Community Engagement.pdf
 
01-07 Adoption Funding and Procurement Mechanism.pdf
01-07 Adoption Funding and Procurement Mechanism.pdf01-07 Adoption Funding and Procurement Mechanism.pdf
01-07 Adoption Funding and Procurement Mechanism.pdf
 
01-10 Exploring new high potential 2D materials - Angioni.pdf
01-10 Exploring new high potential 2D materials - Angioni.pdf01-10 Exploring new high potential 2D materials - Angioni.pdf
01-10 Exploring new high potential 2D materials - Angioni.pdf
 
01-13 iCANDID - Roxanne Wyns_v01.00.pdf
01-13 iCANDID - Roxanne Wyns_v01.00.pdf01-13 iCANDID - Roxanne Wyns_v01.00.pdf
01-13 iCANDID - Roxanne Wyns_v01.00.pdf
 
01-14 Analysis of Liquid Biopsies - Ibrahim.pdf
01-14 Analysis of Liquid Biopsies - Ibrahim.pdf01-14 Analysis of Liquid Biopsies - Ibrahim.pdf
01-14 Analysis of Liquid Biopsies - Ibrahim.pdf
 
01-04 Working with the Community - Carrillo.pdf
01-04 Working with the Community - Carrillo.pdf01-04 Working with the Community - Carrillo.pdf
01-04 Working with the Community - Carrillo.pdf
 
01-09 The Virtual Aorta project - Hynes.pdf
01-09 The Virtual Aorta project - Hynes.pdf01-09 The Virtual Aorta project - Hynes.pdf
01-09 The Virtual Aorta project - Hynes.pdf
 
01-11 StreamAir - Donald.pdf
01-11 StreamAir - Donald.pdf01-11 StreamAir - Donald.pdf
01-11 StreamAir - Donald.pdf
 
Presentation_OCRE_Steegmans.pdf
Presentation_OCRE_Steegmans.pdfPresentation_OCRE_Steegmans.pdf
Presentation_OCRE_Steegmans.pdf
 
2022-12-06-OCRE-EOSC-overview - Siemen.pdf
2022-12-06-OCRE-EOSC-overview - Siemen.pdf2022-12-06-OCRE-EOSC-overview - Siemen.pdf
2022-12-06-OCRE-EOSC-overview - Siemen.pdf
 
01-16 OCRE CESPU Presentation Complete -Dinis, Magalhaes, Gomes.pdf
01-16 OCRE CESPU Presentation Complete -Dinis, Magalhaes, Gomes.pdf01-16 OCRE CESPU Presentation Complete -Dinis, Magalhaes, Gomes.pdf
01-16 OCRE CESPU Presentation Complete -Dinis, Magalhaes, Gomes.pdf
 
01-03 Opening - Heyns.pdf
01-03 Opening - Heyns.pdf01-03 Opening - Heyns.pdf
01-03 Opening - Heyns.pdf
 
OCRE webinar - April 14 - Jakob Tendel.pdf
OCRE webinar - April 14 - Jakob Tendel.pdfOCRE webinar - April 14 - Jakob Tendel.pdf
OCRE webinar - April 14 - Jakob Tendel.pdf
 
OCRE webinar - April 14 - Dave Heyns.pdf
OCRE webinar - April 14 - Dave Heyns.pdfOCRE webinar - April 14 - Dave Heyns.pdf
OCRE webinar - April 14 - Dave Heyns.pdf
 
Introduction Presentation for Webinar: Cloud and Earth Observation Procuremen...
Introduction Presentation for Webinar: Cloud and Earth Observation Procuremen...Introduction Presentation for Webinar: Cloud and Earth Observation Procuremen...
Introduction Presentation for Webinar: Cloud and Earth Observation Procuremen...
 

Recently uploaded

Phenolics: types, biosynthesis and functions.
Phenolics: types, biosynthesis and functions.Phenolics: types, biosynthesis and functions.
Phenolics: types, biosynthesis and functions.
Silpa
 
The Mariana Trench remarkable geological features on Earth.pptx
The Mariana Trench remarkable geological features on Earth.pptxThe Mariana Trench remarkable geological features on Earth.pptx
The Mariana Trench remarkable geological features on Earth.pptx
seri bangash
 
THE ROLE OF BIOTECHNOLOGY IN THE ECONOMIC UPLIFT.pptx
THE ROLE OF BIOTECHNOLOGY IN THE ECONOMIC UPLIFT.pptxTHE ROLE OF BIOTECHNOLOGY IN THE ECONOMIC UPLIFT.pptx
THE ROLE OF BIOTECHNOLOGY IN THE ECONOMIC UPLIFT.pptx
ANSARKHAN96
 
development of diagnostic enzyme assay to detect leuser virus
development of diagnostic enzyme assay to detect leuser virusdevelopment of diagnostic enzyme assay to detect leuser virus
development of diagnostic enzyme assay to detect leuser virus
NazaninKarimi6
 
Human genetics..........................pptx
Human genetics..........................pptxHuman genetics..........................pptx
Human genetics..........................pptx
Silpa
 
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune WaterworldsBiogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
Sérgio Sacani
 
(May 9, 2024) Enhanced Ultrafast Vector Flow Imaging (VFI) Using Multi-Angle ...
(May 9, 2024) Enhanced Ultrafast Vector Flow Imaging (VFI) Using Multi-Angle ...(May 9, 2024) Enhanced Ultrafast Vector Flow Imaging (VFI) Using Multi-Angle ...
(May 9, 2024) Enhanced Ultrafast Vector Flow Imaging (VFI) Using Multi-Angle ...
Scintica Instrumentation
 

Recently uploaded (20)

Phenolics: types, biosynthesis and functions.
Phenolics: types, biosynthesis and functions.Phenolics: types, biosynthesis and functions.
Phenolics: types, biosynthesis and functions.
 
Molecular markers- RFLP, RAPD, AFLP, SNP etc.
Molecular markers- RFLP, RAPD, AFLP, SNP etc.Molecular markers- RFLP, RAPD, AFLP, SNP etc.
Molecular markers- RFLP, RAPD, AFLP, SNP etc.
 
Cyanide resistant respiration pathway.pptx
Cyanide resistant respiration pathway.pptxCyanide resistant respiration pathway.pptx
Cyanide resistant respiration pathway.pptx
 
Atp synthase , Atp synthase complex 1 to 4.
Atp synthase , Atp synthase complex 1 to 4.Atp synthase , Atp synthase complex 1 to 4.
Atp synthase , Atp synthase complex 1 to 4.
 
The Mariana Trench remarkable geological features on Earth.pptx
The Mariana Trench remarkable geological features on Earth.pptxThe Mariana Trench remarkable geological features on Earth.pptx
The Mariana Trench remarkable geological features on Earth.pptx
 
Role of AI in seed science Predictive modelling and Beyond.pptx
Role of AI in seed science  Predictive modelling and  Beyond.pptxRole of AI in seed science  Predictive modelling and  Beyond.pptx
Role of AI in seed science Predictive modelling and Beyond.pptx
 
THE ROLE OF BIOTECHNOLOGY IN THE ECONOMIC UPLIFT.pptx
THE ROLE OF BIOTECHNOLOGY IN THE ECONOMIC UPLIFT.pptxTHE ROLE OF BIOTECHNOLOGY IN THE ECONOMIC UPLIFT.pptx
THE ROLE OF BIOTECHNOLOGY IN THE ECONOMIC UPLIFT.pptx
 
300003-World Science Day For Peace And Development.pptx
300003-World Science Day For Peace And Development.pptx300003-World Science Day For Peace And Development.pptx
300003-World Science Day For Peace And Development.pptx
 
Dr. E. Muralinath_ Blood indices_clinical aspects
Dr. E. Muralinath_ Blood indices_clinical  aspectsDr. E. Muralinath_ Blood indices_clinical  aspects
Dr. E. Muralinath_ Blood indices_clinical aspects
 
development of diagnostic enzyme assay to detect leuser virus
development of diagnostic enzyme assay to detect leuser virusdevelopment of diagnostic enzyme assay to detect leuser virus
development of diagnostic enzyme assay to detect leuser virus
 
Genome sequencing,shotgun sequencing.pptx
Genome sequencing,shotgun sequencing.pptxGenome sequencing,shotgun sequencing.pptx
Genome sequencing,shotgun sequencing.pptx
 
Use of mutants in understanding seedling development.pptx
Use of mutants in understanding seedling development.pptxUse of mutants in understanding seedling development.pptx
Use of mutants in understanding seedling development.pptx
 
Bhiwandi Bhiwandi ❤CALL GIRL 7870993772 ❤CALL GIRLS ESCORT SERVICE In Bhiwan...
Bhiwandi Bhiwandi ❤CALL GIRL 7870993772 ❤CALL GIRLS  ESCORT SERVICE In Bhiwan...Bhiwandi Bhiwandi ❤CALL GIRL 7870993772 ❤CALL GIRLS  ESCORT SERVICE In Bhiwan...
Bhiwandi Bhiwandi ❤CALL GIRL 7870993772 ❤CALL GIRLS ESCORT SERVICE In Bhiwan...
 
Proteomics: types, protein profiling steps etc.
Proteomics: types, protein profiling steps etc.Proteomics: types, protein profiling steps etc.
Proteomics: types, protein profiling steps etc.
 
CURRENT SCENARIO OF POULTRY PRODUCTION IN INDIA
CURRENT SCENARIO OF POULTRY PRODUCTION IN INDIACURRENT SCENARIO OF POULTRY PRODUCTION IN INDIA
CURRENT SCENARIO OF POULTRY PRODUCTION IN INDIA
 
Human genetics..........................pptx
Human genetics..........................pptxHuman genetics..........................pptx
Human genetics..........................pptx
 
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune WaterworldsBiogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
 
FAIRSpectra - Enabling the FAIRification of Analytical Science
FAIRSpectra - Enabling the FAIRification of Analytical ScienceFAIRSpectra - Enabling the FAIRification of Analytical Science
FAIRSpectra - Enabling the FAIRification of Analytical Science
 
(May 9, 2024) Enhanced Ultrafast Vector Flow Imaging (VFI) Using Multi-Angle ...
(May 9, 2024) Enhanced Ultrafast Vector Flow Imaging (VFI) Using Multi-Angle ...(May 9, 2024) Enhanced Ultrafast Vector Flow Imaging (VFI) Using Multi-Angle ...
(May 9, 2024) Enhanced Ultrafast Vector Flow Imaging (VFI) Using Multi-Angle ...
 
GBSN - Microbiology (Unit 3)Defense Mechanism of the body
GBSN - Microbiology (Unit 3)Defense Mechanism of the body GBSN - Microbiology (Unit 3)Defense Mechanism of the body
GBSN - Microbiology (Unit 3)Defense Mechanism of the body
 

01-06 OCRE Test Suite - Fernandes.pdf

  • 1. Services for the EOSC: Test Suite João Fernandes (CERN)
  • 2. Outline • Cloud Validation and Testing: Why? • Timeline & Context • TestSuite & Scientific workload deployments • EOSC Roadmap & Benefits for the Community 2
  • 3. Cloud Validation & Testing: Why? • Cloud Commodity Services? Perhaps, but they still need validation for multiple domain research use cases • Which type of GPUs you provide? Do you provide Quantum HW? • Which Libraries do you offer for ML? Do you offer fast interconnects? • At which scale you offer resources? Which regions are they available? At what speed and path do you interconnect to the GÉANT network? • Several research domains, several cloud platforms available • Not feasible for manually run tests • Validation needs to scale in type, cloud stack and number of services 3
  • 5. CERN contribution - OCRE Test Suite 5 • Automated • Deployment with open technologies: Ansible, Terraform, Docker, K8s • Heterogeneous • Containerised tests to be deployed with all dependencies, providing working examples to researchers in how to deploy applications • Central Repository • Collects tests and validation results in a structured manner (CERN S3 service) • Results Dashboard • Allows organisations to get a results overview, dashboard consumes JSONs files from the CERN S3 bucket
  • 6. Process to include tests • Discussion with developers to include the use case • Assessment of the work to be done and include requirements • Collection of information • Documentation, Contact Person and Applicable License •Licensing established • Test Suite: developed by CERN, FOSS under AGPL • Test License: Responsibility of Test Owners 6
  • 7. OCRE - Deployments 7 Benchmark / Test Description and details Run on Domain Covered Contributor Data Repatriation Simple Data Repatriation: exporting data from the public cloud to Zenodo. Single-node, 8 cores and ~30GB memory. Accessibility and Network Connectivity CERN DODAS Validation of generation of public cloud clusters on-demand for batch workload execution. Single-node, 8 cores and ~30GB memory. Compute (CPU) INFN HEP CPU Benchmarks CPU benchmarking based on reduced versions of several real-world physics workloads. Single-node, 8 cores and ~30GB memory. Default configurations. Compute (CPU) CERN Networking performance tests with perfSONAR End to end network measurements using perfSONAR. • Buyer-side endpoint was pse01-gva.cern.ch (used IPv4) • Latency measured with ping • Trace measured with traceroute • Throughput measured with iperf3, single stream Provider-side endpoint ran on a 8-core and ~30GB VM. Networking and Connectivity ESnet, GÉANT, Indiana U., Internet2, U. of Michigan RNP, CERN Single node GAN training (ProGAN) Satellite image analysis and generation using Progressive Growing GANs. Configurations: • images_amount: 100 • kimg: 300 Single-node cluster with 1 NVIDIA V100 card. Compute (GPU) and machine learning services UNOSAT & CERN Openlab Distributed GAN training (NNLO) Distributed training of Generative Adversarial Networks. Configurations: • epochs: 10 • benchmark: nnlo • datasetSize: 30 Six-node cluster with 1 NVIDIA V100 per node, hence totalling to 6 GPUs. Compute (GPU), machine learning services and distributed computing CERN Openlab COSBench and S3 validation Cloud Object Storage Benchmarking and testing. Default configurations. No VM utilised in this case, COSBench’s source server ran on the CERN Openstack cloud. Connectivity, storage, and APIs Intel Additional Information: https://eosc-testsuite.readthedocs.io/en/latest/testsCatalog.html
  • 8. OCRE - Providers validated by CERN 8
  • 9. Validation categories defined in OCRE 9 Category 1 The cloud provider offers an extended range of mature services, integration of software, hardware and service options at scale that can cover a wide number of research applications at scale. This includes, for example, Machine Learning, HPC, and even Quantum Computing. User interfaces are offered in different modes (Console, API, CLI), and are straightforward and intuitive for users ranging from beginner to advanced. Category 2 The cloud provider offers a range of mature services, with a good level of integration of software and hardware; in some cases, supporting service options that can cover several research applications including, for example, Machine Learning. User interfaces are offered in different modes (Console, API, CLI), and their configuration is straightforward and intuitive for users ranging from beginner to advanced. Category 3 The cloud provider offers a limited number of services, with very little integration of software and hardware. User interfaces in different modes (Console, API, CLI) are not always present and require users to be familiar with aspects of cloud architectures and resource-provisioning methods. Category 4 The cloud provider offers a very limited number of services where only a small number of applications can be deployed with virtually no integration across the hardware and software stacks. User Interfaces in different modes (Console, API, CLI) are not always present and require users to be familiar with aspects of cloud architectures and resource provisioning methods. Category 5 The cloud provider offers a customised limited type of service where only specific applications can be deployed at a limited scale. Criteria: ease of access, service maturity, scale, integration of h/w & s/w stacks
  • 10. Test Suite: Benefits for the community 10 Working examples of technical deployments ○ Raising awareness of cloud technology/interfaces/costing/optimization ○ Build skills to make informed choices about the best cloud solution to solve a given research problem Accumulated practical technical experience with 20+ cloud providers ○ Ability to quickly run small samples of real representative workloads ○ Documented recommendations and guidance, based on the experience ○ Lessons learned from the testing and validation activity ○ types of resources supported, software stacks, network connectivity, etc. Test Suite framework ready to use for cloud procurements ○ Adapted to be used during a market survey of a procurement exercise ○ Cloud offers are commodity, but vary in capabilities and technological implementation ○ Test Suite exposes those differences to research organisations for their benefit
  • 11. References 11 ● Repository ○ https://github.com/cern-it-efp/EOSC-Testsuite ● Documentation ○ https://eosc-testsuite.readthedocs.io/en/latest/ ○ OCRE Deliverable D4.2 - Lessons learned, recommendations and guidance for research organisations ● Recorded Demos of the Test Suite ○ EGI conference 2021 o https://www.youtube.com/watch?v=KENk4KnFmhs ○ EGI conference 2020 o https://www.youtube.com/watch?v=ZznFp9IlGR0
  • 12. EOSC Service Operation Model Proposal 12 EOSC Compliance Testing Neutral validation and testing of cloud provider capabilities; Providing working examples of deployments; To be used either before a procurement action, feeding realistic information to the tender exercise, or during contract execution for effective contract monitoring; Monitoring of cloud providers regions, verify the use of EC member states for data sovereignty reasons. EOSC Test Suite interface shall follow the EOSC branding. Support Roles Breakdown of responsibilities between Test Suite team and Cloud vendors: Test runs are executed by the Test Suite team, end researchers, or cloud vendor architects. Validation results are to be accessed by researchers on all platforms. Each service provider can have access only to its results. Cloud service providers: direct support lines covering multiple time zones and are contractually responsible for ensuring that services remain available to be validated, manageable and secure. Documentation & Code Repositories Documentation is reviewed to allow the tool to be deployed independently. Extensive documentation is maintained using the RST language, whose source code is available in the Test Suite’s GitHub repository. Code compiles into a set of HTML files, which are available online. The repositories must remain accessible and ensuring the following, README files updated, Correct management of branches and monitor issues and pull requests, as an open-source project. Development & Maintenance Effort 1.5 FTE Software Development or similar background EOSC Procurement recently announced (35M): https://ted.europa.eu/udl?uri=TED:NOTICE:234660-2022:TEXT:EN:HTML