Cloud Computing y Big Data, próxima frontera de la innovación

Jordi Torres, UPC Barcelona Tech. Barcelona Supercomputing Center.

Workshop 'El impacto de la Nube y el Big Data en la Ciencia' (The Impact of the Cloud and Big Data on Science)

Transcript of "Cloud Computing y Big Data, próxima frontera de la innovación"

  1. Cloud Computing y Big Data, próxima frontera de la innovación. Cloud Computing and Big Data, the next frontier of innovation. Jordi Torres, UPC-BSC. Madrid, 21 March 2013
  2. HOW DID SCIENCE START?
  3. Source: Prof. Mateo Valero, BSC-CNS 2010
  4. Source: Prof. Mateo Valero, BSC-CNS 2010
  5. HOW IS SCIENCE ADVANCING TODAY?
  6. Source: Prof. Mateo Valero, BSC-CNS 2010
  7. Source: Prof. Mateo Valero, BSC-CNS 2010
  8. MATHEMATICAL CALCULATIONS? WHERE?
  9. MN3
     Compute: Cores/chip 8; Chips/node 2; Cores/node 16; Nodes 3028; Total cores 48448
     Performance: Frequency (GHz) 2.6; Gflops/core 20.8; Gflops/node 332.8; Total Tflops 1000.0
     Memory: GB/core 2; GB/node 32; Total (TB) 96.89
     Network: Latency (μs) 0.7; Bandwidth (Gb/s) 40
     Storage (TB): 2000
     Consumption (kW): 1080
  10. FOR SOME SPANISH RESEARCH GROUPS!
  11. AND… FOR THE REST OF THE WORLD?
  12. GOOD NEWS! Source: http://news.cnet.com/8301-13846_3-57349321-62/amazon-takes-supercomputing-to-the-cloud
  13. CLOUD COMPUTING?
  14. Source: http://www.wired.com/wiredenterprise/2011/12/nonexistent-supercomputer/all/1
  15. 40 MW, 28,000 m². Source: http://www.facebook.com/media/set/?set=a.190842620965185.47008.140375289345252
  16. Photo: Google
  17. HUGE DATA CENTERS: larger than 4 football pitches. Photo: Google
  18. Source: http://www.google.com/about/datacenters/gallery/images
  19. Source: http://www.google.com/about/datacenters/gallery/images
  20. Source: http://www.google.com/about/datacenters/gallery/images
  21. Different IT production. Photo: J.T.
  22. CLOUD COMPUTING: IT as a service. On-demand self-service; pay per use; rapid elasticity; ubiquitous access; ... Source: http://www.telegraph.co.uk/technology/reviews/9241719/Power-Ethernet-Sockets-review.html
  23. Example of benefits (IaaS): 1 computer in a rack for 120 hours, or 120 computers in three racks for 1 hour (the same 120 computer-hours). Idea: Tutorial SC2011 - Robert Grossman
  24. AND DATA?
  25. Source: http://www.docuciencia.es/2009/05/lhc-el-acelerador-de-particulas/ “… the LHC produces 1 PetaByte of data every second, big data and lack of computing resources were becoming the European Organization for Nuclear Research’s biggest IT challenges…” Source: computerweekly.com/news/2240173897/CERN-adopts-OpenStack-private-cloud-to-solve-big-data-challenges
  26. 1 Gigabyte (GB) = 1,000,000,000 bytes
      1 Terabyte (TB) = 1,000 Gigabytes (GB)
      1 Petabyte (PB) = 1,000,000 Gigabytes (GB)
      1 Exabyte (EB) = 1,000,000,000 Gigabytes (GB)
      1 Zettabyte (ZB) = 1,000,000,000,000 Gigabytes (GB)
  27. Deluge of data created daily. Source: The Economist, Feb 25th, 2010, http://www.economist.com/node/15579717
  28. Big Data? Definition?
  29. BIG DATA? Big Data is data that exceeds the storing, processing and managing capacity of conventional systems.
  30. BIG DATA? The reason is that the data is too big, moves too fast, or doesn’t fit the structures of our current systems’ architectures.
  31. BIG DATA? Moreover, to gain value from this data, we must change the way we analyze it.
  32. BIG DATA? Big Data is data that exceeds the storing, processing and managing capacity of conventional systems. The reason is that the data is too big, moves too fast, or doesn’t fit the structures of our current systems’ architectures. Moreover, to gain value from this data, we must change the way we analyze it.
  33. NEW CHALLENGES that must be addressed urgently, in order to respond to the needs of the advancement of science: 1. Storing; 2. Managing; 3. Processing; 4. Analyzing
  34. Affordable Storage
  35. But scanning disks… assume 100 MB/sec
  36. But scanning disks… assume 100 MB/sec: more than 5 hours
  37. Approach: massive parallelism. Assume 20,000 disks: scanning 2 TB takes 1 second. Source: http://www.google.com/about/datacenters/gallery/images/_2000/IDI_018.jpg
  38. 1. Data processing challenges. Rethinking data processing is required: MapReduce, Storm, S4, … Source: http://www.google.com/about/datacenters/gallery/images/_2000/IDI_018.jpg
  39. 2. Data storage challenges. New storage technologies are required. RAM vs HDD: HDD is 100 times cheaper than RAM, but 1000 times slower. Present solutions: solid-state drive (SSD), non-volatile. Research: Storage Class Memory (SCM)
  40. 3. Data management challenges. Relational DBs can’t support everything. Example: eventual consistency. Solution: “NoSQL systems”. Research: new management systems. Source: gigaom.com/cloud/big-data-and-nosql-march-to-the-enterprise/
  41. 4. Obtaining value from data. Information is not actionable knowledge. Prediction using data mining and machine learning techniques. [Figure: Data, Information and Knowledge arranged against Value and Volume axes, each marked +/-.] Research: the majority of algorithms work well on thousands of records, but at the moment they are impractical for billions.
  42. Cloud Computing and Big Data: the next frontier of science and innovation
  43. Thank you for your attention. www.JordiTorres.org - @JordiTorresBCN; www.smartcityexpo.com; www.bsc.es/eBusiness. Autonomic Systems and e-Business Platforms research line at BSC/UPC
  44. Thank you for your attention. www.JordiTorres.org - @JordiTorresBCN; www.smartcityexpo.com; www.bsc.es/eBusiness. Autonomic Systems and e-Business Platforms research line at BSC/UPC
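
The disk-scanning slides (35 to 37) rest on a simple throughput calculation. The following is a minimal Python sketch of that arithmetic, not something taken from the talk. It assumes that the 2 TB dataset named on slide 37 is also the one behind the "more than 5 hours" figure on slide 36, that units are decimal (1 TB = 10^12 bytes, as on slide 26), and that data is striped perfectly evenly across disks with no coordination overhead. The function name scan_seconds is illustrative.

```python
MB = 10**6   # decimal megabyte, in bytes
TB = 10**12  # decimal terabyte, in bytes


def scan_seconds(data_bytes, bytes_per_second_per_disk, disks=1):
    """Idealized time to read data_bytes split evenly across `disks` disks.

    Assumes every disk sustains the same sequential read rate and that
    all disks read in parallel with zero overhead.
    """
    if disks < 1:
        raise ValueError("disks must be at least 1")
    return data_bytes / (bytes_per_second_per_disk * disks)


# One disk at 100 MB/s (slides 35 and 36).
single = scan_seconds(2 * TB, 100 * MB)

# 20,000 disks at 100 MB/s each (slide 37).
parallel = scan_seconds(2 * TB, 100 * MB, disks=20_000)

print(f"1 disk:       {single:.0f} s ({single / 3600:.2f} h)")
print(f"20,000 disks: {parallel:.0f} s")
```

Under these assumptions a single disk needs 20,000 seconds (about 5.56 hours), which matches "more than 5 hours", and 20,000 disks bring the scan down to 1 second. Real systems will fall short of this ideal because of stragglers, network transfer and scheduling costs.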
