e-Infrastructure @ Science

Presentation held at the MIK seminars 2012, Amsterdam

Published in: Technology

  • BiG Grid project proposal. Both a problem and an opportunity: combine data sets, collaborate on analysis, share maintenance and curation best practices. We need: reliable archiving, secure and easy access, retrieval facilities (discovery / search), communication about the data (how you access it)

    1. E-Infrastructures @ science. Tom Visser, e-Science consultant, SARA
    2. Today
        • Context
            • Research
            • Who is SARA
            • Who am I
            • What is BiG Grid
            • Why is BiG Grid
        • Funding structures
            • National
            • International
            • Partnerships
        • Us @ work
        • Problems and challenges
    11. SARA
        • Since 1971
        • Supporting research
        • Providing services
            • Network
            • HTC computing
            • HPC computing
            • Data services & mass storage
            • Support and development: optimization, project support
        • Being a partner
    18. Huygens, National Supercomputer: IBM Power 6, 3328 cores, 15.25 TB of memory, 700 TB of disk space, 60 TFlop/s
        LISA, National Compute Cluster: Dell cluster, 4480 cores, 12 TB of memory, 20 TFlop/s
        Grid Resources: 2376 cores, 3408 TB of disk, 2000 TB tape; 12 BioInfo sites; Life Science Grid; High Energy Physics, Astronomy, Bio Info
        Visualization: Tiled Panel Display, Remote Visualization
        Network: SURFnet 6, AMS-IX, NetherLight
        Innovative Infrastructures: Cloud, GPU, Hadoop, Beehub
    19. About me
        • MA Social Informatics @ UvA
        • Online scientific collaboration in a European project
        • 2 years IBED
        • 3 years @ SARA working for the BiG Grid project
            • E-science and cloud services
            • Guardian angel
            • Community communicator
            • Account management
            • (Inter)national scientific communities
    27. BiG Grid project
        • NIKHEF, NCF, NBIC
        • Providing a world-class e-science infrastructure
        • Part of the larger European grid
        • > 6000 compute cores
        • > 10 PB disk
        • Tape storage
        • Support and development
    34. E-Infrastructure NL
        • Central Facilities
            • SARA: 2400 cores, 3450 TB disk, 4000 TB tape, 128 Cloud cores
            • Nikhef: 2500 Grid cores, 1350 TB disk
            • Philips Research: 1648 Grid cores, 20 TB disk
            • RUG: 294 Grid cores, 34 TB disk
        • Life Science Grid
            • WUR: 16 Grid cores, 18 TB disk
            • RUN: 32 Grid cores, 18 TB disk
            • UMCG: 32 Grid cores, 18 TB disk
            • Keygene: 32 Grid cores, 18 TB disk
            • Erasmus MC: 32 Grid cores, 18 TB disk
            • LUMC: 32 Grid cores, 18 TB disk
            • UU: 32 Grid cores, 18 TB disk
            • AMC: 32 Grid cores, 18 TB disk
            • NKI: 32 Grid cores, 18 TB disk
            • TUD: 32 Grid cores, 18 TB disk
        • SARA: Huygens Super, Lisa Cluster, Visualization, Hadoop
    35. Data explosion
    36. e-science
        • Term coined in 1999
        • Computation
        • Collaboration
        • Lots of data
        • Shift of paradigm
        • Google paper: “The Unreasonable Effectiveness of Data”
    43. Funding
        • It's all about the money?
        • National
            • FES
            • OCW (NWO)
            • ELI
        • European, big ESFRI programmes
        • Companies
    48. Us @ work
    49. Data ingest service (sneakernet)
        • Hard drives coming from Hong Kong (BGI)
        • Are you serious?
        • Fast network; end-to-end
        • Backpack with drives
        • Couriers with drives
        • Set up experimental ingest
        • March 2012: in production
    56. From problem to result
        • Definition of own role & contribution
        • Realistic objectives
        • Trustworthy, knowledgeable partner
            • Adapting to pace and needs of scientific project
        • Education
        • Support
        • Development
        • Funding dedicated programmers
        • Experimental technologies
        • Keeping it all operational
    64. Types of problems
        • Data intensive – information intensive
        • Memory; IO; data-locality
        • Easily scalable – complex integrated pipelines
        • Legacy; you never start from scratch
        • Licensing / privacy
        • Local policies
        • Who decides
    71. e-BioGrid platform
        • Create a national support basis for e-BioScience, serving both expert bioinformaticians and expert life scientists.
        • Exploit the BiG Grid infrastructure in life science R&D
        • Create functional Problem Solving Environments (PSEs) for the selected technology areas that deal with high demand in computing resources
        • Connect with the NBIC-BioAssist and BiG Grid programs.
    75. [Diagram labels: e-BioScience; Life Science Research; Research Support; BioAssist Engineering Team; Genomics; Bio-interpret.; Biobanking; Proteomics; …; Short cooperative projects; Task force; Tools; TA project; PSEs; NGS; MAS; MAT; NCS; BBC; BiG Grid; BioAssist; Support & Development Team; Operations Team; Cooperative projects; Analysis, design & implementation of software environment; Infrastructure; Installation and running of the compute and storage systems; e-Core; e-BioGrid; Bioinformatics tools oriented; ICT Infrastructure oriented]
    76. Problems and challenges
        • Inspire and motivate
        • Keeping all stakeholders happy
            • Infrastructure needs money
            • Funding implies overhead
            • Can't do everything, making choices
        • Sane development
        • Invisibility of infrastructure
        • Collaboration can bite individual excellence
        • Loose coupling or tight integration
        • Keeping up with fast changes
    84. Conclusion
        • We have a collaborative challenge
        • NL/EU is very well positioned
            • Great potential of the network
            • Governments recognize importance
        • Keeping up with fast technology changes
        • Trans-disciplinarity and integration are key
        • Strong community is key
            • Knowledge, skills and technology
        • Will e-science become science again?
    89. References / credits
        • http://www.biggrid.nl
        • http://www.e-biogrid.nl
        • http://www.sara.nl
        • http://www.nwo.nl
        • BBMRI image NY Times: http://goo.gl/I130Q
        • http://www.bbmri.nl
        • http://www.egi.eu
        • http://www.necen.nl concept drawing by J.J. Bot
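
As a rough cross-check of the site inventory on the "E-Infrastructure NL" slide, the per-site figures can be tallied per group. This is a minimal sketch, assuming the numbers transcribed from that slide are accurate. The names `SITES` and `totals_by_role` are illustrative and do not come from the presentation, and SARA's "2400 cores" are assumed to be grid cores (its 128 cloud cores and tape are left out).

```python
from collections import defaultdict

# Figures transcribed from the "E-Infrastructure NL" slide:
# (site, role, cores, disk in TB).
# Assumption: SARA's "2400 cores" are grid cores; cloud cores and tape are excluded.
SITES = [
    ("SARA", "Central Facility", 2400, 3450),
    ("Nikhef", "Central Facility", 2500, 1350),
    ("Philips Research", "Central Facility", 1648, 20),
    ("RUG", "Central Facility", 294, 34),
    ("WUR", "Life Science Grid", 16, 18),
    ("RUN", "Life Science Grid", 32, 18),
    ("UMCG", "Life Science Grid", 32, 18),
    ("Keygene", "Life Science Grid", 32, 18),
    ("Erasmus MC", "Life Science Grid", 32, 18),
    ("LUMC", "Life Science Grid", 32, 18),
    ("UU", "Life Science Grid", 32, 18),
    ("AMC", "Life Science Grid", 32, 18),
    ("NKI", "Life Science Grid", 32, 18),
    ("TUD", "Life Science Grid", 32, 18),
]


def totals_by_role(sites):
    """Sum cores and disk (TB) per role; returns {role: (cores, disk_tb)}."""
    totals = defaultdict(lambda: [0, 0])
    for _site, role, cores, disk_tb in sites:
        totals[role][0] += cores
        totals[role][1] += disk_tb
    return {role: tuple(values) for role, values in totals.items()}


if __name__ == "__main__":
    for role, (cores, disk_tb) in totals_by_role(SITES).items():
        print(f"{role}: {cores} cores, {disk_tb} TB disk")
```

With the figures as transcribed, the four central facilities add up to 6842 cores and 4854 TB of disk, and the ten Life Science Grid sites to 304 cores and 180 TB.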
