Collaboration, Big Data and the
 search for the Higgs Boson
   Intel European Research and Innovation
                 Conference
              October 23rd 2012


       Andrzej Nowak, CERN openlab
               Andrzej.Nowak@cern.ch
The European Particle Physics Laboratory based in
              Geneva, Switzerland

 Founded in 1954 by 12 countries for fundamental
     physics research in a post-war Europe

In 2012, it is a global effort of 20 member countries
and scientists of 110 nationalities, working on the
    world’s most ambitious physics experiments

         ~2’500 personnel, > 15’000 users
             ~1 billion CHF yearly budget

          Andrzej Nowak - Collaboration, Big Data and the search for the Higgs Boson   2
•   How do we explain that particles have mass?
•   What is most of the universe made of?
•   Why is there so little antimatter?
•   What happened in the Big Bang?




[Figure labels: Mont Blanc (4,808 m); Geneva (pop. 190’000); Lake Geneva (310 m deep)]
The Large Hadron Collider

                  27 km underground
          superconducting ring – possibly the
           largest machine ever built by man




     40 million collisions per second




          150-200 MW power consumption



Data flow from the LHC detectors

[Diagram: data flow between processing stages.
 Stages: Online triggering and filtering in detectors; Selection and reconstruction; Reconstruction; Event reprocessing; Event simulation; Batch physics analysis; Analysis.
 Data products: Raw Data (100%); Event summary data (10%); Analysis objects (1%); Processed data.]
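The percentages on the data-flow slide (raw data 100%, event summary data 10%, analysis objects 1%) can be made concrete with a little arithmetic. The sketch below reads them as relative data volumes, which is an assumption, since the slide does not say what the percentages measure. The 1000 TB raw volume and the names `DATA_TIERS` and `tier_volumes` are hypothetical, chosen only for illustration.

```python
# Fractions are the percentages printed on the slide; reading them as
# relative data volume is an assumption made for this sketch.
DATA_TIERS = [
    ("Raw data", 1.00),
    ("Event summary data", 0.10),
    ("Analysis objects", 0.01),
]


def tier_volumes(raw_volume_tb):
    """Return (tier name, volume in TB) for each tier, given a raw volume in TB."""
    return [(name, raw_volume_tb * fraction) for name, fraction in DATA_TIERS]


if __name__ == "__main__":
    # 1000 TB is a made-up round number, not a figure from the talk.
    for name, volume_tb in tier_volumes(1000.0):
        print(f"{name:<20} {volume_tb:>8.1f} TB")
```

Under this reading, each step down the chain shrinks the data set by a factor of ten, so the analysis objects are a hundred times smaller than the raw data they derive from.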
[Figure: Big Data. Tape usage (left axis, logarithmic, 1 TB to 100 PB) and number of files (right axis, logarithmic, 100 k to 1 G), 2003 to 2012. Approximate, smoothed values.]
The LHC Computing Grid

INSERT WORKLOAD HERE




Collaboration on big data and computing
         The Worldwide LHC Computing Grid

• Tier-0 (CERN): data recording, reconstruction and distribution
• Tier-1: permanent storage, reprocessing, analysis
• Tier-2: simulation, end-user analysis

• nearly 160 sites
• ~250’000 cores
• 173 PB of storage
• > 2 million jobs/day
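The headline figures for the Worldwide LHC Computing Grid can be turned into rough per-site and per-core averages. This is a back-of-the-envelope sketch only: the slide gives approximate totals ("nearly 160", "~250’000", "> 2 million"), real sites differ widely in size, and the function name `grid_averages` is hypothetical.

```python
# Totals as quoted on the slide (2012). They are approximations, so the
# results below are rough averages, not measured values.
SITES = 160
CORES = 250_000
STORAGE_PB = 173
JOBS_PER_DAY = 2_000_000


def grid_averages(sites=SITES, cores=CORES, storage_pb=STORAGE_PB,
                  jobs_per_day=JOBS_PER_DAY):
    """Return simple averages derived from the quoted grid-wide totals."""
    return {
        "cores_per_site": cores / sites,
        "storage_pb_per_site": storage_pb / sites,
        "jobs_per_core_per_day": jobs_per_day / cores,
    }


if __name__ == "__main__":
    for key, value in grid_averages().items():
        print(f"{key}: {value:.2f}")
```

With these totals, the average works out to about 1,560 cores and roughly 1 PB of storage per site, and about 8 jobs per core per day.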
Cutting edge science
• Accelerating Science and Innovation




It would have been impossible to release physics results so quickly without
the outstanding performance of the Grid (including the CERN Tier-0)

[Chart: Number of concurrent ATLAS jobs, Jan-July 2012; y-axis mark at 100 k]

• Includes MC production, user and group analysis at CERN, 10 Tier-1s, ~70 Tier-2 federations, > 80 sites
• > 1500 distinct ATLAS users do analysis on the Grid

• Available resources fully used/stressed (beyond pledges in some cases)
• Massive production of 8 TeV Monte Carlo samples
• Very effective and flexible Computing Model and Operation team → accommodate high
  trigger rates and pile-up, intense MC simulation, analysis demands from worldwide
  users (through e.g. dynamic data placement)
A wealth of knowledge

• Academic Training program
• Summer Student program
• Physics and computing schools
• Technical Training program
• CERN Teacher schools
• Outreach programs
• EU FP7 programs
Innovation in science
     Medical Applications as an Example of Particle Physics Spin-off

Accelerating particle beams
• ~30’000 accelerators worldwide
• ~17’000 used for medicine

Hadron Therapy (protons, light ions)
• >70’000 patients treated worldwide (30 facilities)
• >21’000 patients treated in Europe (9 facilities)
• Leadership in Ion Beam Therapy now in Europe and Japan
[Figure labels: Tumour Target; X-ray; protons]

Detecting particles
• Imaging: PET Scanner
• Clinical trial in Portugal for new breast imaging system (ClearPEM)

From F.Hemmer
Innovation in computing

• 1989: First high bandwidth transatlantic links
• 1991: The World Wide Web is born at CERN
• 1999: The Grid vision materializes
• 2001: CERN wins Computerworld’s 21st Century Achievement Award for SHIFT
• 2003: Several Internet2 land speed records
• 2008: The WLCG is the world’s largest grid
• 2012: LHC delivering intense data challenges
The CERN openlab
  A unique research partnership between CERN and industry
  Objective: The advancement of cutting-edge computing
   solutions to be used by the worldwide LHC community

• Partners support manpower and equipment in dedicated
  competence centers
• openlab delivers published research and evaluations based
  on partners’ solutions – in a very challenging setting
• Created a robust hands-on training program in various
  computing topics, including international computing
  schools; Summer Student program
• Past involvement: Enterasys Networks, IBM, Voltaire,
  F-Secure, Stonesoft, EDS; Future involvement: Huawei
• Now in phase IV: 2012-2014

                   http://cern.ch/openlab


A European Cloud Computing Partnership:
         big science teams up with big business

Strategic Plan
• Establish multi-tenant, multi-provider cloud infrastructure
• Identify and adopt policies for trust, security and privacy
• Create governance structure
• Define funding schemes

• To support the computing capacity needs for the ATLAS experiment
• Setting up a new service to simplify analysis of large genomes, for a deeper insight into evolution and biodiversity
• To create an Earth Observation platform, focusing on earthquake and volcano research

From B.Jones
Big(ger) data

Data rates at the LHC to increase by ~100x


• Raw data: an exabyte per second?
• Exabytes stored yearly?
• Millions of computing cores?

          “Sustainable computing”
Future directions in computing
• Software replacing hardware
  – Programmability replaces rigid
    structures
• Intensive compute
  – Local farms must have much higher
    processing capacity
• Accelerators
  – Experiments with Intel MIC and GPUs
• Silicon photonics
Accelerating Science and
     Innovation

 Continued support of the worldwide
physics community and the European
             population

Great science and engineering + great
     partners = great innovation


The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptx
 
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersEnhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
 
Unblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesUnblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen Frames
 
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsKotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed texts
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
 

Accelerating Science through Big Data Collaboration

  • 1. Collaboration, Big Data and the search for the Higgs Boson Intel European Research and Innovation Conference October 23rd 2012 Andrzej Nowak, CERN openlab Andrzej.Nowak@cern.ch
  • 2. The European Particle Physics Laboratory based in Geneva, Switzerland Founded in 1954 by 12 countries for fundamental physics research in a post-war Europe In 2012, it is a global effort of 20 member countries and scientists from 110 nationalities, working on the world’s most ambitious physics experiments ~2’500 personnel, > 15’000 users ~1 bln CHF yearly budget Andrzej Nowak - Collaboration, Big Data and the search for the Higgs Boson 2
  • 3. How to explain that particles have mass? • What is most of the universe made of? • Why is there little anti-matter? • What happened in the Big Bang?
  • 4. Mont Blanc (4,808 m); Geneva (pop. 190’000); Lake Geneva (310 m deep)
  • 5. The Large Hadron Collider: 27 km underground superconducting ring, possibly the largest machine ever built by man; 40 million collisions per second; 150-200 MW power consumption
  • 8. Data flow from the LHC detectors (diagram labels): online triggering and filtering in detectors; selection and reconstruction; raw data (100%); reconstruction; event reprocessing; event summary data (10%); event simulation; batch physics analysis; analysis objects (1%); processed data
  • 9. Big Data (chart, 2003 to 2012): tape usage on a scale from 1 TB to 100 PB and number of files on a scale from 100 k to 1 G; approximate, smoothed values
  • 10. The LHC Computing Grid INSERT WORKLOAD HERE
  • 11. Collaboration on big data and computing: The Worldwide LHC Computing Grid • Tier-0 (CERN): data recording, reconstruction and distribution • Tier-1: permanent storage, re-processing, analysis • Tier-2: simulation, end-user analysis • Nearly 160 sites • ~250’000 cores • 173 PB of storage • > 2 million jobs/day
  • 12. Cutting edge science • Accelerating Science and Innovation
  • 13. It would have been impossible to release physics results so quickly without the outstanding performance of the Grid (including the CERN Tier-0). Number of concurrent ATLAS jobs, Jan-July 2012 (chart, 100 k axis mark): includes MC production, user and group analysis at CERN, 10 Tier-1s, ~70 Tier-2 federations; > 80 sites • > 1500 distinct ATLAS users do analysis on the GRID • Available resources fully used/stressed (beyond pledges in some cases) • Massive production of 8 TeV Monte Carlo samples • Very effective and flexible Computing Model and Operation team: accommodate high trigger rates and pile-up, intense MC simulation, analysis demands from worldwide users (through e.g. dynamic data placement)
  • 14. A wealth of knowledge Physics Academic Summer Technical CERN and Outreach EU FP7 Training Student Training Teacher computing programs programs program program program schools schools
  • 15. Innovation in science: Medical Applications as an Example of Particle Physics Spin-off • Accelerating particle beams: ~30’000 accelerators worldwide, ~17’000 used for medicine • Hadron Therapy: leadership in Ion Beam Therapy now in Europe and Japan; >70’000 patients treated worldwide (30 facilities); >21’000 patients treated in Europe (9 facilities) (figure labels: tumour target; protons, light ions; X-ray, protons) • Detecting particles: Imaging, PET Scanner; clinical trial in Portugal for new breast imaging system (ClearPEM) • From F.Hemmer
  • 16. Innovation in computing • 1989: First high bandwidth transatlantic links • 1991: The World Wide Web is born at CERN • 1999: The Grid vision materializes • 2001: CERN wins Computerworld’s 21st Century Achievement Award for SHIFT • 2003: Several Internet2 land speed records • 2008: The WLCG is the world’s largest grid • 2012: LHC delivering intense data challenges
  • 17. The CERN openlab: A unique research partnership of CERN and the industry. Objective: The advancement of cutting-edge computing solutions to be used by the worldwide LHC community • Partners support manpower and equipment in dedicated competence centers • openlab delivers published research and evaluations based on partners’ solutions, in a very challenging setting • Created robust hands-on training program in various computing topics, including international computing schools; Summer Student program • Past involvement: Enterasys Networks, IBM, Voltaire, F-Secure, Stonesoft, EDS; Future involvement: Huawei • Now in phase IV: 2012-2014 http://cern.ch/openlab
  • 18. A European Cloud Computing Partnership: big science teams up with big business • To support the computing capacity needs for the ATLAS experiment • Setting up a new service to simplify analysis of large genomes, for a deeper insight into evolution and biodiversity • To create an Earth Observation platform, focusing on earthquake and volcano research • Strategic Plan: establish multi-tenant, multi-provider cloud infrastructure; identify and adopt policies for trust, security and privacy; create governance structure; define funding schemes • From B.Jones
  • 19. Big(ger) data: Data rates at the LHC to increase by ~100x • Raw data: an exabyte per second? • Exabytes stored yearly? • Millions of computing cores? • “Sustainable computing”
  • 20. Future directions in computing • Software replacing hardware – Programmability replaces rigid structures • Intensive compute – Local farms must have much higher processing capacity • Accelerators – Experiments with Intel MIC and GPUs • Silicon photonics
  • 21. Accelerating Science and Innovation Continued support of the worldwide physics community and the European population Great science and engineering + great partners = great innovation
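
The figures quoted on slides 8 and 11 can be combined into a few back-of-the-envelope averages. The sketch below is only an illustration: the function names are hypothetical, the per-site and per-core values are plain averages that ignore the real differences between Tier-0, Tier-1 and Tier-2 sites, and reading the slide 8 percentages as fractions of the raw data volume is an assumption.

```python
# Figures quoted on slide 11 (Worldwide LHC Computing Grid, 2012).
SITES = 160               # "nearly 160 sites"
CORES = 250_000           # "~250'000 cores"
STORAGE_PB = 173          # "173 PB of storage"
JOBS_PER_DAY = 2_000_000  # "> 2 million jobs/day" (lower bound)

# Percentages shown on slide 8, read here (an assumption) as the
# size of each data tier relative to the raw data volume.
TIER_FRACTIONS = {
    "raw data": 1.00,
    "event summary data": 0.10,
    "analysis objects": 0.01,
}


def grid_averages():
    """Return simple grid-wide averages derived from the slide 11 figures."""
    return {
        "jobs_per_core_per_day": JOBS_PER_DAY / CORES,
        "cores_per_site": CORES / SITES,
        "storage_pb_per_site": STORAGE_PB / SITES,
    }


def tier_sizes(raw_pb):
    """Scale a raw data volume (in PB) by the slide 8 tier fractions."""
    return {name: raw_pb * fraction for name, fraction in TIER_FRACTIONS.items()}


if __name__ == "__main__":
    for name, value in grid_averages().items():
        print(f"{name}: {value:.2f}")
    # 10 PB is a hypothetical input, not a figure from the slides.
    for name, size in tier_sizes(10.0).items():
        print(f"{name}: {size:.2f} PB")
```

With these inputs each core handles at least 8 jobs per day on average, and an average site holds roughly 1 PB of storage.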