e-Infrastructure @ Science
Upcoming SlideShare
Loading in...5
×
 

e-Infrastructure @ Science

on

  • 322 views

Presentation held for at the MIK seminars 2012, Amsterdam

Presentation held for at the MIK seminars 2012, Amsterdam

Statistics

Views

Total Views
322
Views on SlideShare
322
Embed Views
0

Actions

Likes
0
Downloads
5
Comments
0

0 Embeds 0

No embeds

Accessibility

Categories

Upload Details

Uploaded via as OpenOffice

Usage Rights

© All Rights Reserved

Report content

Flagged as inappropriate Flag as inappropriate
Flag as inappropriate

Select your reason for flagging this presentation as inappropriate.

Cancel
  • Full Name Full Name Comment goes here.
    Are you sure you want to
    Your message goes here
    Processing…
Post Comment
Edit your comment
  • BiG Grid project proposal Both a problem and opportunity; combine data-set, colloborate on analysis, share the maintenance and curation best practices We need; reliable archiving, secure and easy access, retrieval facilities (discovery / search), communication about the data (now you access it)
  • BiG Grid project proposal Both a problem and opportunity; combine data-set, colloborate on analysis, share the maintenance and curation best practices We need; reliable archiving, secure and easy access, retrieval facilities (discovery / search), communication about the data (now you access it)

e-Infrastructure @ Science e-Infrastructure @ Science Presentation Transcript

  • E-Infrastructures @ science Tom Visser e-Science consultant SARA [email_address]
  • Today
      ##Context
    • Research
    • Who is SARA
    • Who am I
    • What is BiG Grid
    • Why is BiG Grid
    • ##Funding structures
    • National
    • International
    • Partnerships
      ##Us @ work ## Problems and challenges
  • SARA
    • Since 1971
    • Supporting research
    • Providing services
      • Network,
      • HTC computing
      • HPC computing
      • Data-services & mass storage
      • Support and development; optimization, projects-support.
    • Being a partner
  • Huygens National Super IBM Power 6, 3328 cores, 15.25 TB of memory, 700 TB of disk space, 60 TFlop/s LISA National Compute Cluster Dell cluster 4480 cores, 12 TB of memory, 20 TFlop/s Grid Rerouces 2376 Cores, 3408 TB of disk, 2000 TB tape 12 BioInfo Sites Life Science Grid High Energy Physics, Astronomy, Bio Info Visualization Tiled Panel Display Remote Visualization Network SURFnet 6 AMSix Netherlight Innovative Infrastructures Cloud GPU Hadoop Beehub
    • About ME
    • MA Social Informatics @ UvA
    • Online scientific Collaboration in european project
    • 2 years IBED
    • 3 years @ SARA working for BiG Grid project
      • E-science and cloud services
      • Guardian angel
      • Community communicator
      • Account management
      • (Inter)National scientific communities
  • BiG Grid project
    • NIKHEF, NCF, NBIC
    • Providing a world class e-science infrastructure
    • Part of the larger european grid
    • > 6000 compute cores
    • > 10 PB disk
    • Tape storage
    • Support and development
  • E-Infrastructure NL WUR Life Science Grid 16 Grid cores 18 TB disk RUN Life Science Grid 32 Grid cores 18 TB disk UMCG Life Science Grid 32 Grid cores 18 TB disk Keygene Life Science Grid 32 Grid cores 18 TB disk Erasmus MC Life Science Grid 32 Grid cores 18 TB disk LUMC Life Science Grid 32 Grid cores 18 TB disk UU Life Science Grid 32 Grid cores 18 TB disk SARA Central Facillity 2400 cores 3450 TB disk 4000 TB tape 128 Cloud cores Nikhef Central Facillity 2500 Grid cores 1350 TB disk Philips Research Central Facillity 1648 Grid cores 20 TB disk RUG Central Facillity 294 Grid cores 34 TB disk AMC Life Science Grid 32 Grid cores 18 TB disk NKI Life Science Grid 32 Grid cores 18 TB disk SARA Huygens Super Lisa Cluster Visualization Hadoop TUD Life Science Grid 32 Grid cores 18 TB disk
    • Data explosion
  • e-science
    • 1999 term
    • computation
    • collaboration
    • lots of data
    • Shift of paradigm
    • Google paper:
    • “The unreasonable effectiveness of data”
  • Funding
    • It's all about the money?
    • National
      • FES
      • OCW (NWO)
      • ELI
    • European, big ESFRI programmes
    • Companies
  • Us @ work
  •  
  • Data ingest service (sneakernet)
    • Harddrives coming from Hong Kong (BGI)
    • Are you serious?
    • Fast network; end-to-end
    • Backpack with drives
    • Couriers with drives
    • Set up experimental ingest
    • March 2012 in production
  • From problem to result
    • Definition of own role & contribution
    • Realistic objectives
    • Trustworthy knowledgeable partner
      • Adapting to pace and needs of scientific project
    • Education
    • Support
    • Development
    • Funding dedicated programmers
    • Experimental technologies
    • Keeping it all operational
  • Types of problems
    • Data intensive – information intensive
    • Memory; IO; data-locality
    • Easy scalable – complex integrated pipelines
    • Legacy; you never start from scratch
    • Licensing / privacy
    • Local policies
    • Who decides
  • EbioGrid platform
    • Create a national support basis for e-BioScience to both expert bioinformaticians and expert life scientists.
    • Exploit BIG Grid infrastructure in the life science R&D
    • Create functional Problem Solving Environments (PSEs) for the selected technology areas that deal with high demand in computing resources
    • Connect with the NBIC-BioAssist and BiG Grid programs.
    • e-BioScience
      Life Science Research
      Research Support
      BioAssist Engineering Team
      Genomics
      Bio-interpret.
      Biobanking
      Proteomics
      Short cooperative projects
      Task force
      Tools
      TA project
      PSEs
      NGS
      MAS
      MAT
      NCS
      BBC
      BiG Grid
      BioAssist
      Support & Development Team
      Operations Team
      Cooperative projects
      Analysis, design & implementation of software environment
      Infrastructure
      Installation and running of the compute and storage systems
      e-Core
      e-BioGrid
      Bioinformatics tools oriented
      ICT Infrastructure oriented
  • Problems and challenges
    • Inspire and motivate
    • Keeping all stakeholders happy
      • Infrastructure needs money
      • Funding implies overhead
      • Can't do everything, making choices
    • Sain development
    • Invisibility of infrastructure
    • Collaboration can bite individual excellence
    • Loose coupling or tight integration
    • Keeping up with fast changes
  • Conclusion
    • We have a collaborative challenge
    • NL/EU is very well positioned
      • Great potential of the network
      • Governments recognize importance
    • Keeping up with fast technology changes
    • Trans-disciplinarity and integration are key
    • Strong community is key
      • Knowledge, skills and technology
    • Will e-science become science again?
  • References / credits
    • http://www.biggrid.nl
    • http://www.e-biogrid.nl
    • http://www.sara.nl
    • http://www.nwo.nl
    • BBMRI image NY times: http://goo.gl/I130Q
    • http://www.bbmri.nl
    • http://www.egi.eu
    • http://www.necen.nl concept drawing by J.J.Bot