SlideShare a Scribd company logo
vCHEP2021- ciangottini@pg.infn.it
First experiences with a portable
analysis infrastructure for LHC at INFN
Diego Ciangottini1
, Tommaso Boccali2
, Andrea Ceccanti3
, Daniele Spiga1
, Davide Salomoni3
, Tommaso
Tedeschi1
, and Mirco Tracolli1
1
INFN Sezione di Perugia
2
INFN Sezione di Pisa
3
INFN-CNAF
1
vCHEP2021- ciangottini@pg.infn.it
Outline
● Our vision
● The approach
● The architectural pillars
● Data access for CMS experiment
● First experience with user workflows
● Results and challenges
● Conclusions and plans
2
vCHEP2021- ciangottini@pg.infn.it
Our vision
Simplify the setup for a new generation of multi-purpose facility for:
● Making a typical LHC analysis workflow quicker w.r.t. a GRID based workflow
○ supporting the majority of the analysis use cases based on flat rootple/numpy-ish array
○ compatible with day to day analysis development: Interactive / quasi interactive
● Transparent / “easy” access to specialized HW
○ looking forward to more ML-based analysis / workflow
■ E.g. starting with a typical signal-vs-background discriminator
● Reproducible and scalable environment capable to offload toward external
resources (e.g. HPC, cloud)
○ Possibly abstracting away from the lower level infrastructure implementations
○ Integration within the portfolio of INFN-Cloud infrastructure
○ Offloading intensive workflow to HPC ( i.e. at CINECA )
3
vCHEP2021- ciangottini@pg.infn.it
The approach
● Given the current variety of tools to manage and deploy container based
infrastructure the aim has moved to simplify the setup of such a facility on top
of Kubernetes (whether being it provided by commercial clouds or by
on-demand and self-hosted solutions)
○ A single machine equivalent deployment on Docker is also available for situation in which a
multi-node setup is not required
● Highly based on service composition model
○ Customize and re-use templates
■ Also for different experiment needs
○ Containers
○ Avoid technology lock-in
○ Declarative/template based approach
4
vCHEP2021- ciangottini@pg.infn.it
The architectural pillars
● JupyterHub as the single entrypoint
○ Helm Charts + Helmfile adopted as
templating
■ Full integrability within the services
portfolio of the INFN-Cloud
○ Docker-compose for single machine env
● Token-based authentication via
Indigo-IAM
○ The access to compute and cache resources
is managed via OIDC claims
● Interactive and auto-scalable batch
analysis as an all-in-one solution
N.B. No CMS-specific parts here!
But rather a customizable base setup!
5
Interactive UI Batch payloads
vCHEP2021- ciangottini@pg.infn.it
A key point: the data access
Based on experiment needs, the setup
can be customized via Helm values
thanks to the modularity of the component
integration. E.g. data access for the CMS
experiment deployment:
● The experiment software is shared
through a repository hosted on
CVMFS
● An XCache server configured to
interact with the CMS remote
storage federation
6
● HTCondor on K8s automatic scaling
● Experimenting Auth/N translation layer via XCache
Work done in synergy with the ESCAPE EU project
vCHEP2021- ciangottini@pg.infn.it
User-driven development
● Development driven by the users’ feedback
○ Started the commissioning of the prototype with a real ongoing analysis at CMS
○ Integration with the workflow of “ssWW VBS with hadronic tau, mu/electron and two jets in final
state”
■ PyROOT and NanoAOD-tools libraries for cut-based analysis steps
■ most used data-science and ML libraries for studies performed via Jupyter notebook
■ 7 TB NanoAOD dataset analyzed via integrated HTCondor batch
■ 3 TB skimmed data (flat rootples) inspected via interactive python-based analysis
● The first set of tests have proven to provide users with access to an all-in-one
solution:
○ From the submission to HTCondor to the interactive python-based programming
■ reducing the time to re-run a single step of the analysis
● A reduced overhead from the user perspective comes also from the adoption
of the AuthN/Z model based on OIDC w.r.t. the one currently based on X509
7
vCHEP2021- ciangottini@pg.infn.it
Current results and lessons
8
We learnt how to satisfy a set of minimal
requirements:
● Analysis validation
○ Including a first ML discriminator
○ No problem reported about caching layer
● Automatic scalability and resource
optimization
Next challenges:
● Integration of a node-level caching
● Dynamic offloading of payloads to
specialized (e.g. GPU) or
opportunistic resources
○ based on system load
vCHEP2021- ciangottini@pg.infn.it
Conclusion and plans
A first experience about providing an analysis infrastructure for the physicist at
INFN has been made, now it’s crucial to move toward an evolution in terms of
scale and integrations:
● Push further the integration, starting a comprehensive test campaign at
national level
○ i.e. via INFN-Cloud resources, via HPC at CINECA resources
○ Opening to other testers/experiments by the end of the year
■ Helpful in tuning further the requirements
■ Comparison planned with same deployment on single machine setup
● Transparent exploitation of heterogeneous hardware and hybrid providers
○ e.g. scaling out toward Clouds and HPC
● Measurement on the impact the cache layer on high I/O workflows
9
vCHEP2021- ciangottini@pg.infn.it
BACKUP
10
vCHEP2021- ciangottini@pg.infn.it
Key features
● HTCondor on K8s automatic scaling
○ Autoscaling based on custom metrics of
HTCondor Worker Nodes
■ Any metric coming from HTCondor queue
can be configured as a trigger
● Experimenting Auth/N translation layer
via XCache
○ Local cache auth/N via OIDC on client side,
while x509 service proxy is used to fetch data
from AAA federation
○ This makes the whole facility able to be almost
X509-free
11

More Related Content

What's hot

Modeling and Provisioning IoT Cloud Systems for Testing Uncertainties
Modeling and Provisioning IoT Cloud Systems for Testing UncertaintiesModeling and Provisioning IoT Cloud Systems for Testing Uncertainties
Modeling and Provisioning IoT Cloud Systems for Testing Uncertainties
Hong-Linh Truong
 
On-node resource manager for containerized HPC workloads
On-node resource manager for containerized HPC workloadsOn-node resource manager for containerized HPC workloads
On-node resource manager for containerized HPC workloads
GeoffroyVallee
 
Microservices architecture presentation
Microservices architecture presentationMicroservices architecture presentation
Microservices architecture presentation
Joseph SHYIRAMBERE
 
Defining the stack for service delivery models and interoperability in the in...
Defining the stack for service delivery models and interoperability in the in...Defining the stack for service delivery models and interoperability in the in...
Defining the stack for service delivery models and interoperability in the in...
ieeepondy
 
THE STATE OF OPENTELEMETRY, DOTAN HOROVITS, Logz.io
THE STATE OF OPENTELEMETRY, DOTAN HOROVITS, Logz.ioTHE STATE OF OPENTELEMETRY, DOTAN HOROVITS, Logz.io
THE STATE OF OPENTELEMETRY, DOTAN HOROVITS, Logz.io
DevOpsDays Tel Aviv
 
FIWARE Wednesday Webinars - Integrating FIWARE with Blockchain/DLTs
FIWARE Wednesday Webinars - Integrating FIWARE with Blockchain/DLTsFIWARE Wednesday Webinars - Integrating FIWARE with Blockchain/DLTs
FIWARE Wednesday Webinars - Integrating FIWARE with Blockchain/DLTs
FIWARE
 
Openstack workshop @ Kalasalingam
Openstack workshop @ KalasalingamOpenstack workshop @ Kalasalingam
Openstack workshop @ Kalasalingam
Beny Raja
 
Hyperledger community update 20180528
Hyperledger community update 20180528Hyperledger community update 20180528
Hyperledger community update 20180528
Arnaud Le Hors
 
Serverless apps with OpenWhisk
Serverless apps with OpenWhiskServerless apps with OpenWhisk
Serverless apps with OpenWhisk
Daniel Krook
 
FIWARE Wednesday Webinars - FIWARE Overview
FIWARE Wednesday Webinars - FIWARE OverviewFIWARE Wednesday Webinars - FIWARE Overview
FIWARE Wednesday Webinars - FIWARE Overview
FIWARE
 
Running Hyperledger Sawtooth in Production - Hyperledger Global Forum 2018
Running Hyperledger Sawtooth in Production - Hyperledger Global Forum 2018Running Hyperledger Sawtooth in Production - Hyperledger Global Forum 2018
Running Hyperledger Sawtooth in Production - Hyperledger Global Forum 2018
Duncan Johnston-Watt
 
Maria Guseva - The solution of merge hell in monorepo
Maria Guseva - The solution of merge hell in monorepoMaria Guseva - The solution of merge hell in monorepo
Maria Guseva - The solution of merge hell in monorepo
matteo mazzeri
 
Hyperledger community update 201805
Hyperledger community update 201805Hyperledger community update 201805
Hyperledger community update 201805
Arnaud Le Hors
 
Hyperledger Lightning Talk
Hyperledger Lightning TalkHyperledger Lightning Talk
Hyperledger Lightning Talk
Andrew Kennedy
 
Hyperledger community update Feb 20, 2018
Hyperledger community update Feb 20, 2018Hyperledger community update Feb 20, 2018
Hyperledger community update Feb 20, 2018
Arnaud Le Hors
 
WRNP18 - Software Defined Infrastructures: Multi-Domain Orchestration
WRNP18 - Software Defined Infrastructures: Multi-Domain OrchestrationWRNP18 - Software Defined Infrastructures: Multi-Domain Orchestration
WRNP18 - Software Defined Infrastructures: Multi-Domain Orchestration
Christian Esteve Rothenberg
 
Blockchain explained FIATA Congress 20180910
Blockchain explained FIATA Congress 20180910Blockchain explained FIATA Congress 20180910
Blockchain explained FIATA Congress 20180910
Arnaud Le Hors
 
Delivering a packaged Blockchain platform for developers using Hyperledger Sa...
Delivering a packaged Blockchain platform for developers using Hyperledger Sa...Delivering a packaged Blockchain platform for developers using Hyperledger Sa...
Delivering a packaged Blockchain platform for developers using Hyperledger Sa...
Duncan Johnston-Watt
 
Hyperledger Fabric Technical Deep Dive 20190618
Hyperledger Fabric Technical Deep Dive 20190618Hyperledger Fabric Technical Deep Dive 20190618
Hyperledger Fabric Technical Deep Dive 20190618
Arnaud Le Hors
 
The Role of Blockchain in Enterprise Commerce and Product Content Management
The Role of Blockchain in Enterprise Commerce and Product Content ManagementThe Role of Blockchain in Enterprise Commerce and Product Content Management
The Role of Blockchain in Enterprise Commerce and Product Content Management
SAP Customer Experience
 

What's hot (20)

Modeling and Provisioning IoT Cloud Systems for Testing Uncertainties
Modeling and Provisioning IoT Cloud Systems for Testing UncertaintiesModeling and Provisioning IoT Cloud Systems for Testing Uncertainties
Modeling and Provisioning IoT Cloud Systems for Testing Uncertainties
 
On-node resource manager for containerized HPC workloads
On-node resource manager for containerized HPC workloadsOn-node resource manager for containerized HPC workloads
On-node resource manager for containerized HPC workloads
 
Microservices architecture presentation
Microservices architecture presentationMicroservices architecture presentation
Microservices architecture presentation
 
Defining the stack for service delivery models and interoperability in the in...
Defining the stack for service delivery models and interoperability in the in...Defining the stack for service delivery models and interoperability in the in...
Defining the stack for service delivery models and interoperability in the in...
 
THE STATE OF OPENTELEMETRY, DOTAN HOROVITS, Logz.io
THE STATE OF OPENTELEMETRY, DOTAN HOROVITS, Logz.ioTHE STATE OF OPENTELEMETRY, DOTAN HOROVITS, Logz.io
THE STATE OF OPENTELEMETRY, DOTAN HOROVITS, Logz.io
 
FIWARE Wednesday Webinars - Integrating FIWARE with Blockchain/DLTs
FIWARE Wednesday Webinars - Integrating FIWARE with Blockchain/DLTsFIWARE Wednesday Webinars - Integrating FIWARE with Blockchain/DLTs
FIWARE Wednesday Webinars - Integrating FIWARE with Blockchain/DLTs
 
Openstack workshop @ Kalasalingam
Openstack workshop @ KalasalingamOpenstack workshop @ Kalasalingam
Openstack workshop @ Kalasalingam
 
Hyperledger community update 20180528
Hyperledger community update 20180528Hyperledger community update 20180528
Hyperledger community update 20180528
 
Serverless apps with OpenWhisk
Serverless apps with OpenWhiskServerless apps with OpenWhisk
Serverless apps with OpenWhisk
 
FIWARE Wednesday Webinars - FIWARE Overview
FIWARE Wednesday Webinars - FIWARE OverviewFIWARE Wednesday Webinars - FIWARE Overview
FIWARE Wednesday Webinars - FIWARE Overview
 
Running Hyperledger Sawtooth in Production - Hyperledger Global Forum 2018
Running Hyperledger Sawtooth in Production - Hyperledger Global Forum 2018Running Hyperledger Sawtooth in Production - Hyperledger Global Forum 2018
Running Hyperledger Sawtooth in Production - Hyperledger Global Forum 2018
 
Maria Guseva - The solution of merge hell in monorepo
Maria Guseva - The solution of merge hell in monorepoMaria Guseva - The solution of merge hell in monorepo
Maria Guseva - The solution of merge hell in monorepo
 
Hyperledger community update 201805
Hyperledger community update 201805Hyperledger community update 201805
Hyperledger community update 201805
 
Hyperledger Lightning Talk
Hyperledger Lightning TalkHyperledger Lightning Talk
Hyperledger Lightning Talk
 
Hyperledger community update Feb 20, 2018
Hyperledger community update Feb 20, 2018Hyperledger community update Feb 20, 2018
Hyperledger community update Feb 20, 2018
 
WRNP18 - Software Defined Infrastructures: Multi-Domain Orchestration
WRNP18 - Software Defined Infrastructures: Multi-Domain OrchestrationWRNP18 - Software Defined Infrastructures: Multi-Domain Orchestration
WRNP18 - Software Defined Infrastructures: Multi-Domain Orchestration
 
Blockchain explained FIATA Congress 20180910
Blockchain explained FIATA Congress 20180910Blockchain explained FIATA Congress 20180910
Blockchain explained FIATA Congress 20180910
 
Delivering a packaged Blockchain platform for developers using Hyperledger Sa...
Delivering a packaged Blockchain platform for developers using Hyperledger Sa...Delivering a packaged Blockchain platform for developers using Hyperledger Sa...
Delivering a packaged Blockchain platform for developers using Hyperledger Sa...
 
Hyperledger Fabric Technical Deep Dive 20190618
Hyperledger Fabric Technical Deep Dive 20190618Hyperledger Fabric Technical Deep Dive 20190618
Hyperledger Fabric Technical Deep Dive 20190618
 
The Role of Blockchain in Enterprise Commerce and Product Content Management
The Role of Blockchain in Enterprise Commerce and Product Content ManagementThe Role of Blockchain in Enterprise Commerce and Product Content Management
The Role of Blockchain in Enterprise Commerce and Product Content Management
 

Similar to Portable data analysis infrastracture for LHC at INFN -vCHEP2021

ICN in the IRTF and IETF
ICN in the IRTF and IETFICN in the IRTF and IETF
ICN in the IRTF and IETF
Dirk Kutscher
 
DEEP general presentation
DEEP general presentationDEEP general presentation
DEEP general presentation
EUDAT
 
Integrating Puppet and Gitolite for sysadmins cooperations
Integrating Puppet and Gitolite for sysadmins cooperationsIntegrating Puppet and Gitolite for sysadmins cooperations
Integrating Puppet and Gitolite for sysadmins cooperations
Luca Mazzaferro
 
The Open Science Grid
The Open Science GridThe Open Science Grid
The Open Science Grid
Rob Gardner
 
Rethinking the Mobile Code Offloading Paradigm: From Concept to Practice
Rethinking the Mobile Code Offloading Paradigm: From Concept to PracticeRethinking the Mobile Code Offloading Paradigm: From Concept to Practice
Rethinking the Mobile Code Offloading Paradigm: From Concept to Practice
MobileSoft
 
CloudLightning and the OPM-based Use Case
CloudLightning and the OPM-based Use CaseCloudLightning and the OPM-based Use Case
CloudLightning and the OPM-based Use Case
CloudLightning
 
HNSciCloud update @ the World LHC Computing Grid deployment board
HNSciCloud update @ the World LHC Computing Grid deployment board  HNSciCloud update @ the World LHC Computing Grid deployment board
HNSciCloud update @ the World LHC Computing Grid deployment board
Helix Nebula The Science Cloud
 
OpenStack Journey in Tieto Elastic Cloud
OpenStack Journey in Tieto Elastic CloudOpenStack Journey in Tieto Elastic Cloud
OpenStack Journey in Tieto Elastic Cloud
Jakub Pavlik
 
tip oopt pse-summit2017
tip oopt pse-summit2017tip oopt pse-summit2017
tip oopt pse-summit2017
domenico di mola
 
Berlin AWS meetup: here.com on AWS
Berlin AWS meetup: here.com on AWSBerlin AWS meetup: here.com on AWS
Berlin AWS meetup: here.com on AWS
Cristian Măgherușan-Stanciu
 
5212303961620480 1585670953 joanna_stachera_proposal_g_soc2020
5212303961620480 1585670953 joanna_stachera_proposal_g_soc20205212303961620480 1585670953 joanna_stachera_proposal_g_soc2020
5212303961620480 1585670953 joanna_stachera_proposal_g_soc2020
JoannaStachera1
 
DAMG7245-Fall23-FinalProjectProposal.pdf
DAMG7245-Fall23-FinalProjectProposal.pdfDAMG7245-Fall23-FinalProjectProposal.pdf
DAMG7245-Fall23-FinalProjectProposal.pdf
ssuserf0a206
 
Cloud Native Applications on Kubernetes: a DevOps Approach
Cloud Native Applications on Kubernetes: a DevOps ApproachCloud Native Applications on Kubernetes: a DevOps Approach
Cloud Native Applications on Kubernetes: a DevOps Approach
Nicola Ferraro
 
Data Science in Production: Technologies That Drive Adoption of Data Science ...
Data Science in Production: Technologies That Drive Adoption of Data Science ...Data Science in Production: Technologies That Drive Adoption of Data Science ...
Data Science in Production: Technologies That Drive Adoption of Data Science ...
Nir Yungster
 
Telemetry Updates - Juno Edition
Telemetry Updates - Juno Edition Telemetry Updates - Juno Edition
Telemetry Updates - Juno Edition
OpenStack Foundation
 
Service Engineering, ZHAW for CeBIT
Service Engineering, ZHAW for CeBITService Engineering, ZHAW for CeBIT
Service Engineering, ZHAW for CeBIT
Amrita Prasad
 
The FN Project by Maximilian Jerg
The FN Project by Maximilian JergThe FN Project by Maximilian Jerg
The FN Project by Maximilian Jerg
Harald Schmaldienst
 
Session3 01.clemens neudecker
Session3 01.clemens neudeckerSession3 01.clemens neudecker
Session3 01.clemens neudecker
IMPACT Centre of Competence
 
OCR-D: An end-to-end open source OCR framework for historical printed documents
OCR-D: An end-to-end open source OCR framework for historical printed documentsOCR-D: An end-to-end open source OCR framework for historical printed documents
OCR-D: An end-to-end open source OCR framework for historical printed documents
cneudecker
 
SDN in the Management Plane: OpenConfig and Streaming Telemetry
SDN in the Management Plane: OpenConfig and Streaming TelemetrySDN in the Management Plane: OpenConfig and Streaming Telemetry
SDN in the Management Plane: OpenConfig and Streaming Telemetry
Anees Shaikh
 

Similar to Portable data analysis infrastracture for LHC at INFN -vCHEP2021 (20)

ICN in the IRTF and IETF
ICN in the IRTF and IETFICN in the IRTF and IETF
ICN in the IRTF and IETF
 
DEEP general presentation
DEEP general presentationDEEP general presentation
DEEP general presentation
 
Integrating Puppet and Gitolite for sysadmins cooperations
Integrating Puppet and Gitolite for sysadmins cooperationsIntegrating Puppet and Gitolite for sysadmins cooperations
Integrating Puppet and Gitolite for sysadmins cooperations
 
The Open Science Grid
The Open Science GridThe Open Science Grid
The Open Science Grid
 
Rethinking the Mobile Code Offloading Paradigm: From Concept to Practice
Rethinking the Mobile Code Offloading Paradigm: From Concept to PracticeRethinking the Mobile Code Offloading Paradigm: From Concept to Practice
Rethinking the Mobile Code Offloading Paradigm: From Concept to Practice
 
CloudLightning and the OPM-based Use Case
CloudLightning and the OPM-based Use CaseCloudLightning and the OPM-based Use Case
CloudLightning and the OPM-based Use Case
 
HNSciCloud update @ the World LHC Computing Grid deployment board
HNSciCloud update @ the World LHC Computing Grid deployment board  HNSciCloud update @ the World LHC Computing Grid deployment board
HNSciCloud update @ the World LHC Computing Grid deployment board
 
OpenStack Journey in Tieto Elastic Cloud
OpenStack Journey in Tieto Elastic CloudOpenStack Journey in Tieto Elastic Cloud
OpenStack Journey in Tieto Elastic Cloud
 
tip oopt pse-summit2017
tip oopt pse-summit2017tip oopt pse-summit2017
tip oopt pse-summit2017
 
Berlin AWS meetup: here.com on AWS
Berlin AWS meetup: here.com on AWSBerlin AWS meetup: here.com on AWS
Berlin AWS meetup: here.com on AWS
 
5212303961620480 1585670953 joanna_stachera_proposal_g_soc2020
5212303961620480 1585670953 joanna_stachera_proposal_g_soc20205212303961620480 1585670953 joanna_stachera_proposal_g_soc2020
5212303961620480 1585670953 joanna_stachera_proposal_g_soc2020
 
DAMG7245-Fall23-FinalProjectProposal.pdf
DAMG7245-Fall23-FinalProjectProposal.pdfDAMG7245-Fall23-FinalProjectProposal.pdf
DAMG7245-Fall23-FinalProjectProposal.pdf
 
Cloud Native Applications on Kubernetes: a DevOps Approach
Cloud Native Applications on Kubernetes: a DevOps ApproachCloud Native Applications on Kubernetes: a DevOps Approach
Cloud Native Applications on Kubernetes: a DevOps Approach
 
Data Science in Production: Technologies That Drive Adoption of Data Science ...
Data Science in Production: Technologies That Drive Adoption of Data Science ...Data Science in Production: Technologies That Drive Adoption of Data Science ...
Data Science in Production: Technologies That Drive Adoption of Data Science ...
 
Telemetry Updates - Juno Edition
Telemetry Updates - Juno Edition Telemetry Updates - Juno Edition
Telemetry Updates - Juno Edition
 
Service Engineering, ZHAW for CeBIT
Service Engineering, ZHAW for CeBITService Engineering, ZHAW for CeBIT
Service Engineering, ZHAW for CeBIT
 
The FN Project by Maximilian Jerg
The FN Project by Maximilian JergThe FN Project by Maximilian Jerg
The FN Project by Maximilian Jerg
 
Session3 01.clemens neudecker
Session3 01.clemens neudeckerSession3 01.clemens neudecker
Session3 01.clemens neudecker
 
OCR-D: An end-to-end open source OCR framework for historical printed documents
OCR-D: An end-to-end open source OCR framework for historical printed documentsOCR-D: An end-to-end open source OCR framework for historical printed documents
OCR-D: An end-to-end open source OCR framework for historical printed documents
 
SDN in the Management Plane: OpenConfig and Streaming Telemetry
SDN in the Management Plane: OpenConfig and Streaming TelemetrySDN in the Management Plane: OpenConfig and Streaming Telemetry
SDN in the Management Plane: OpenConfig and Streaming Telemetry
 

Recently uploaded

Travis Hills of MN is Making Clean Water Accessible to All Through High Flux ...
Travis Hills of MN is Making Clean Water Accessible to All Through High Flux ...Travis Hills of MN is Making Clean Water Accessible to All Through High Flux ...
Travis Hills of MN is Making Clean Water Accessible to All Through High Flux ...
Travis Hills MN
 
CLASS 12th CHEMISTRY SOLID STATE ppt (Animated)
CLASS 12th CHEMISTRY SOLID STATE ppt (Animated)CLASS 12th CHEMISTRY SOLID STATE ppt (Animated)
CLASS 12th CHEMISTRY SOLID STATE ppt (Animated)
eitps1506
 
ESA/ACT Science Coffee: Diego Blas - Gravitational wave detection with orbita...
ESA/ACT Science Coffee: Diego Blas - Gravitational wave detection with orbita...ESA/ACT Science Coffee: Diego Blas - Gravitational wave detection with orbita...
ESA/ACT Science Coffee: Diego Blas - Gravitational wave detection with orbita...
Advanced-Concepts-Team
 
Candidate young stellar objects in the S-cluster: Kinematic analysis of a sub...
Candidate young stellar objects in the S-cluster: Kinematic analysis of a sub...Candidate young stellar objects in the S-cluster: Kinematic analysis of a sub...
Candidate young stellar objects in the S-cluster: Kinematic analysis of a sub...
Sérgio Sacani
 
Sexuality - Issues, Attitude and Behaviour - Applied Social Psychology - Psyc...
Sexuality - Issues, Attitude and Behaviour - Applied Social Psychology - Psyc...Sexuality - Issues, Attitude and Behaviour - Applied Social Psychology - Psyc...
Sexuality - Issues, Attitude and Behaviour - Applied Social Psychology - Psyc...
PsychoTech Services
 
Immersive Learning That Works: Research Grounding and Paths Forward
Immersive Learning That Works: Research Grounding and Paths ForwardImmersive Learning That Works: Research Grounding and Paths Forward
Immersive Learning That Works: Research Grounding and Paths Forward
Leonel Morgado
 
Compexometric titration/Chelatorphy titration/chelating titration
Compexometric titration/Chelatorphy titration/chelating titrationCompexometric titration/Chelatorphy titration/chelating titration
Compexometric titration/Chelatorphy titration/chelating titration
Vandana Devesh Sharma
 
Introduction_Ch_01_Biotech Biotechnology course .pptx
Introduction_Ch_01_Biotech Biotechnology course .pptxIntroduction_Ch_01_Biotech Biotechnology course .pptx
Introduction_Ch_01_Biotech Biotechnology course .pptx
QusayMaghayerh
 
快速办理(UAM毕业证书)马德里自治大学毕业证学位证一模一样
快速办理(UAM毕业证书)马德里自治大学毕业证学位证一模一样快速办理(UAM毕业证书)马德里自治大学毕业证学位证一模一样
快速办理(UAM毕业证书)马德里自治大学毕业证学位证一模一样
hozt8xgk
 
MICROBIAL INTERACTION PPT/ MICROBIAL INTERACTION AND THEIR TYPES // PLANT MIC...
MICROBIAL INTERACTION PPT/ MICROBIAL INTERACTION AND THEIR TYPES // PLANT MIC...MICROBIAL INTERACTION PPT/ MICROBIAL INTERACTION AND THEIR TYPES // PLANT MIC...
MICROBIAL INTERACTION PPT/ MICROBIAL INTERACTION AND THEIR TYPES // PLANT MIC...
ABHISHEK SONI NIMT INSTITUTE OF MEDICAL AND PARAMEDCIAL SCIENCES , GOVT PG COLLEGE NOIDA
 
Mechanisms and Applications of Antiviral Neutralizing Antibodies - Creative B...
Mechanisms and Applications of Antiviral Neutralizing Antibodies - Creative B...Mechanisms and Applications of Antiviral Neutralizing Antibodies - Creative B...
Mechanisms and Applications of Antiviral Neutralizing Antibodies - Creative B...
Creative-Biolabs
 
Alternate Wetting and Drying - Climate Smart Agriculture
Alternate Wetting and Drying - Climate Smart AgricultureAlternate Wetting and Drying - Climate Smart Agriculture
Alternate Wetting and Drying - Climate Smart Agriculture
International Food Policy Research Institute- South Asia Office
 
Gadgets for management of stored product pests_Dr.UPR.pdf
Gadgets for management of stored product pests_Dr.UPR.pdfGadgets for management of stored product pests_Dr.UPR.pdf
Gadgets for management of stored product pests_Dr.UPR.pdf
PirithiRaju
 
IMPORTANCE OF ALGAE AND ITS BENIFITS.pptx
IMPORTANCE OF ALGAE  AND ITS BENIFITS.pptxIMPORTANCE OF ALGAE  AND ITS BENIFITS.pptx
IMPORTANCE OF ALGAE AND ITS BENIFITS.pptx
OmAle5
 
Evidence of Jet Activity from the Secondary Black Hole in the OJ 287 Binary S...
Evidence of Jet Activity from the Secondary Black Hole in the OJ 287 Binary S...Evidence of Jet Activity from the Secondary Black Hole in the OJ 287 Binary S...
Evidence of Jet Activity from the Secondary Black Hole in the OJ 287 Binary S...
Sérgio Sacani
 
Discovery of An Apparent Red, High-Velocity Type Ia Supernova at 𝐳 = 2.9 wi...
Discovery of An Apparent Red, High-Velocity Type Ia Supernova at  𝐳 = 2.9  wi...Discovery of An Apparent Red, High-Velocity Type Ia Supernova at  𝐳 = 2.9  wi...
Discovery of An Apparent Red, High-Velocity Type Ia Supernova at 𝐳 = 2.9 wi...
Sérgio Sacani
 
Randomised Optimisation Algorithms in DAPHNE
Randomised Optimisation Algorithms in DAPHNERandomised Optimisation Algorithms in DAPHNE
Randomised Optimisation Algorithms in DAPHNE
University of Maribor
 
Clinical periodontology and implant dentistry 2003.pdf
Clinical periodontology and implant dentistry 2003.pdfClinical periodontology and implant dentistry 2003.pdf
Clinical periodontology and implant dentistry 2003.pdf
RAYMUNDONAVARROCORON
 
Sciences of Europe journal No 142 (2024)
Sciences of Europe journal No 142 (2024)Sciences of Europe journal No 142 (2024)
Sciences of Europe journal No 142 (2024)
Sciences of Europe
 
8.Isolation of pure cultures and preservation of cultures.pdf
8.Isolation of pure cultures and preservation of cultures.pdf8.Isolation of pure cultures and preservation of cultures.pdf
8.Isolation of pure cultures and preservation of cultures.pdf
by6843629
 

Recently uploaded (20)

Travis Hills of MN is Making Clean Water Accessible to All Through High Flux ...
Travis Hills of MN is Making Clean Water Accessible to All Through High Flux ...Travis Hills of MN is Making Clean Water Accessible to All Through High Flux ...
Travis Hills of MN is Making Clean Water Accessible to All Through High Flux ...
 
CLASS 12th CHEMISTRY SOLID STATE ppt (Animated)
CLASS 12th CHEMISTRY SOLID STATE ppt (Animated)CLASS 12th CHEMISTRY SOLID STATE ppt (Animated)
CLASS 12th CHEMISTRY SOLID STATE ppt (Animated)
 
ESA/ACT Science Coffee: Diego Blas - Gravitational wave detection with orbita...
ESA/ACT Science Coffee: Diego Blas - Gravitational wave detection with orbita...ESA/ACT Science Coffee: Diego Blas - Gravitational wave detection with orbita...
ESA/ACT Science Coffee: Diego Blas - Gravitational wave detection with orbita...
 
Candidate young stellar objects in the S-cluster: Kinematic analysis of a sub...
Candidate young stellar objects in the S-cluster: Kinematic analysis of a sub...Candidate young stellar objects in the S-cluster: Kinematic analysis of a sub...
Candidate young stellar objects in the S-cluster: Kinematic analysis of a sub...
 
Sexuality - Issues, Attitude and Behaviour - Applied Social Psychology - Psyc...
Sexuality - Issues, Attitude and Behaviour - Applied Social Psychology - Psyc...Sexuality - Issues, Attitude and Behaviour - Applied Social Psychology - Psyc...
Sexuality - Issues, Attitude and Behaviour - Applied Social Psychology - Psyc...
 
Immersive Learning That Works: Research Grounding and Paths Forward
Immersive Learning That Works: Research Grounding and Paths ForwardImmersive Learning That Works: Research Grounding and Paths Forward
Immersive Learning That Works: Research Grounding and Paths Forward
 
Compexometric titration/Chelatorphy titration/chelating titration
Compexometric titration/Chelatorphy titration/chelating titrationCompexometric titration/Chelatorphy titration/chelating titration
Compexometric titration/Chelatorphy titration/chelating titration
 
Introduction_Ch_01_Biotech Biotechnology course .pptx
Introduction_Ch_01_Biotech Biotechnology course .pptxIntroduction_Ch_01_Biotech Biotechnology course .pptx
Introduction_Ch_01_Biotech Biotechnology course .pptx
 
快速办理(UAM毕业证书)马德里自治大学毕业证学位证一模一样
快速办理(UAM毕业证书)马德里自治大学毕业证学位证一模一样快速办理(UAM毕业证书)马德里自治大学毕业证学位证一模一样
快速办理(UAM毕业证书)马德里自治大学毕业证学位证一模一样
 
MICROBIAL INTERACTION PPT/ MICROBIAL INTERACTION AND THEIR TYPES // PLANT MIC...
MICROBIAL INTERACTION PPT/ MICROBIAL INTERACTION AND THEIR TYPES // PLANT MIC...MICROBIAL INTERACTION PPT/ MICROBIAL INTERACTION AND THEIR TYPES // PLANT MIC...
MICROBIAL INTERACTION PPT/ MICROBIAL INTERACTION AND THEIR TYPES // PLANT MIC...
 
Mechanisms and Applications of Antiviral Neutralizing Antibodies - Creative B...
Mechanisms and Applications of Antiviral Neutralizing Antibodies - Creative B...Mechanisms and Applications of Antiviral Neutralizing Antibodies - Creative B...
Mechanisms and Applications of Antiviral Neutralizing Antibodies - Creative B...
 
Alternate Wetting and Drying - Climate Smart Agriculture
Alternate Wetting and Drying - Climate Smart AgricultureAlternate Wetting and Drying - Climate Smart Agriculture
Alternate Wetting and Drying - Climate Smart Agriculture
 
Gadgets for management of stored product pests_Dr.UPR.pdf
Gadgets for management of stored product pests_Dr.UPR.pdfGadgets for management of stored product pests_Dr.UPR.pdf
Gadgets for management of stored product pests_Dr.UPR.pdf
 
IMPORTANCE OF ALGAE AND ITS BENIFITS.pptx
IMPORTANCE OF ALGAE  AND ITS BENIFITS.pptxIMPORTANCE OF ALGAE  AND ITS BENIFITS.pptx
IMPORTANCE OF ALGAE AND ITS BENIFITS.pptx
 
Evidence of Jet Activity from the Secondary Black Hole in the OJ 287 Binary S...
Evidence of Jet Activity from the Secondary Black Hole in the OJ 287 Binary S...Evidence of Jet Activity from the Secondary Black Hole in the OJ 287 Binary S...
Evidence of Jet Activity from the Secondary Black Hole in the OJ 287 Binary S...
 
Discovery of An Apparent Red, High-Velocity Type Ia Supernova at 𝐳 = 2.9 wi...
Discovery of An Apparent Red, High-Velocity Type Ia Supernova at  𝐳 = 2.9  wi...Discovery of An Apparent Red, High-Velocity Type Ia Supernova at  𝐳 = 2.9  wi...
Discovery of An Apparent Red, High-Velocity Type Ia Supernova at 𝐳 = 2.9 wi...
 
Randomised Optimisation Algorithms in DAPHNE
Randomised Optimisation Algorithms in DAPHNERandomised Optimisation Algorithms in DAPHNE
Randomised Optimisation Algorithms in DAPHNE
 
Clinical periodontology and implant dentistry 2003.pdf
Clinical periodontology and implant dentistry 2003.pdfClinical periodontology and implant dentistry 2003.pdf
Clinical periodontology and implant dentistry 2003.pdf
 
Sciences of Europe journal No 142 (2024)
Sciences of Europe journal No 142 (2024)Sciences of Europe journal No 142 (2024)
Sciences of Europe journal No 142 (2024)
 
8.Isolation of pure cultures and preservation of cultures.pdf
8.Isolation of pure cultures and preservation of cultures.pdf8.Isolation of pure cultures and preservation of cultures.pdf
8.Isolation of pure cultures and preservation of cultures.pdf
 

Portable data analysis infrastracture for LHC at INFN -vCHEP2021

  • 1. vCHEP2021- ciangottini@pg.infn.it First experiences with a portable analysis infrastructure for LHC at INFN Diego Ciangottini1 , Tommaso Boccali2 , Andrea Ceccanti3 , Daniele Spiga1 , Davide Salomoni3 , Tommaso Tedeschi1 , and Mirco Tracolli1 1 INFN Sezione di Perugia 2 INFN Sezione di Pisa 3 INFN-CNAF 1
  • 2. vCHEP2021- ciangottini@pg.infn.it Outline ● Our vision ● The approach ● The architectural pillars ● Data access for CMS experiment ● First experience with user workflows ● Results and challenges ● Conclusions and plans 2
  • 3. vCHEP2021- ciangottini@pg.infn.it Our vision Simplify the setup for a new generation of multi-purpose facility for: ● Making a typical LHC analysis workflow quicker w.r.t. a GRID based workflow ○ supporting the majority of the analysis use cases based on flat rootple/numpy-ish array ○ compatible with day to day analysis development: Interactive / quasi interactive ● Transparent / “easy” access to specialized HW ○ looking forward to more ML-based analysis / workflow ■ E.g. starting with a typical signal-vs-background discriminator ● Reproducible and scalable environment capable to offload toward external resources (e.g. HPC, cloud) ○ Possibly abstracting away from the lower level infrastructure implementations ○ Integration within the portfolio of INFN-Cloud infrastructure ○ Offloading intensive workflow to HPC ( i.e. at CINECA ) 3
  • 4. vCHEP2021- ciangottini@pg.infn.it The approach ● Given the current variety of tools to manage and deploy container based infrastructure the aim has moved to simplify the setup of such a facility on top of Kubernetes (whether being it provided by commercial clouds or by on-demand and self-hosted solutions) ○ A single machine equivalent deployment on Docker is also available for situation in which a multi-node setup is not required ● Highly based on service composition model ○ Customize and re-use templates ■ Also for different experiment needs ○ Containers ○ Avoid technology lock-in ○ Declarative/template based approach 4
  • 5. vCHEP2021- ciangottini@pg.infn.it The architectural pillars ● JupyterHub as the single entrypoint ○ Helm Charts + Helmfile adopted as templating ■ Full integrability within the services portfolio of the INFN-Cloud ○ Docker-compose for single machine env ● Token-based authentication via Indigo-IAM ○ The access to compute and cache resources is managed via OIDC claims ● Interactive and auto-scalable batch analysis as an all-in-one solution N.B. No CMS-specific parts here! But rather a customizable base setup! 5 Interactive UI Batch payloads
  • 6. vCHEP2021- ciangottini@pg.infn.it A key point: the data access Based on experiment needs, the setup can be customized via Helm values thanks to the modularity of the component integration. E.g. data access for the CMS experiment deployment: ● The experiment software is shared through a repository hosted on CVMFS ● An XCache server configured to interact with the CMS remote storage federation 6 ● HTCondor on K8s automatic scaling ● Experimenting Auth/N translation layer via XCache Work done in synergy with the ESCAPE EU project
  • 7. vCHEP2021- ciangottini@pg.infn.it User-driven development ● Development driven by the users’ feedback ○ Started the commissioning of the prototype with a real ongoing analysis at CMS ○ Integration with the workflow of “ssWW VBS with hadronic tau, mu/electron and two jets in final state” ■ PyROOT and NanoAOD-tools libraries for cut-based analysis steps ■ most used data-science and ML libraries for studies performed via Jupyter notebook ■ 7 TB NanoAOD dataset analyzed via integrated HTCondor batch ■ 3 TB skimmed data (flat rootples) inspected via interactive python-based analysis ● The first set of tests have proven to provide users with access to an all-in-one solution: ○ From the submission to HTCondor to the interactive python-based programming ■ reducing the time to re-run a single step of the analysis ● A reduced overhead from the user perspective comes also from the adoption of the AuthN/Z model based on OIDC w.r.t. the one currently based on X509 7
  • 8. vCHEP2021- ciangottini@pg.infn.it Current results and lessons 8 We learnt how to satisfy a set of minimal requirements: ● Analysis validation ○ Including a first ML discriminator ○ No problem reported about caching layer ● Automatic scalability and resource optimization Next challenges: ● Integration of a node-level caching ● Dynamic offloading of payloads to specialized (e.g. GPU) or opportunistic resources ○ based on system load
  • 9. vCHEP2021- ciangottini@pg.infn.it Conclusion and plans A first experience about providing an analysis infrastructure for the physicist at INFN has been made, now it’s crucial to move toward an evolution in terms of scale and integrations: ● Push further the integration, starting a comprehensive test campaign at national level ○ i.e. via INFN-Cloud resources, via HPC at CINECA resources ○ Opening to other testers/experiments by the end of the year ■ Helpful in tuning further the requirements ■ Comparison planned with same deployment on single machine setup ● Transparent exploitation of heterogeneous hardware and hybrid providers ○ e.g. scaling out toward Clouds and HPC ● Measurement on the impact the cache layer on high I/O workflows 9
  • 11. vCHEP2021- ciangottini@pg.infn.it Key features ● HTCondor on K8s automatic scaling ○ Autoscaling based on custom metrics of HTCondor Worker Nodes ■ Any metric coming from HTCondor queue can be configured as a trigger ● Experimenting Auth/N translation layer via XCache ○ Local cache auth/N via OIDC on client side, while x509 service proxy is used to fetch data from AAA federation ○ This makes the whole facility able to be almost X509-free 11