Submit Search
Upload
Into The Wonderful
•
14 likes
•
4,956 views
Matt Wood
Follow
An introduction to cloud computing from a scientific research perspective.
Read less
Read more
Technology
Business
Slideshow view
Report
Share
Slideshow view
Report
Share
1 of 84
Download now
Download to read offline
Recommended
Hadoop at Yahoo! -- Hadoop World NY 2009
Hadoop at Yahoo! -- Hadoop World NY 2009
yhadoop
Hadoop Hive Tutorial | Hive Fundamentals | Hive Architecture
Hadoop Hive Tutorial | Hive Fundamentals | Hive Architecture
Skillspeed
Hive paris
Hive paris
Szehon Ho
RESTo - restful semantic search tool for geospatial
RESTo - restful semantic search tool for geospatial
Gasperi Jerome
Big data and tools
Big data and tools
Shivam Shukla
Hadoop-BigData
Hadoop-BigData
Gigin Krishnan
Big data ecosystem
Big data ecosystem
SlideCentral
Apache Pig for Data Scientists
Apache Pig for Data Scientists
DataWorks Summit
Recommended
Hadoop at Yahoo! -- Hadoop World NY 2009
Hadoop at Yahoo! -- Hadoop World NY 2009
yhadoop
Hadoop Hive Tutorial | Hive Fundamentals | Hive Architecture
Hadoop Hive Tutorial | Hive Fundamentals | Hive Architecture
Skillspeed
Hive paris
Hive paris
Szehon Ho
RESTo - restful semantic search tool for geospatial
RESTo - restful semantic search tool for geospatial
Gasperi Jerome
Big data and tools
Big data and tools
Shivam Shukla
Hadoop-BigData
Hadoop-BigData
Gigin Krishnan
Big data ecosystem
Big data ecosystem
SlideCentral
Apache Pig for Data Scientists
Apache Pig for Data Scientists
DataWorks Summit
An introduction to Apache Hadoop Hive
An introduction to Apache Hadoop Hive
Mike Frampton
Hive vs Pig for HadoopSourceCodeReading
Hive vs Pig for HadoopSourceCodeReading
Mitsuharu Hamba
An intriduction to hive
An intriduction to hive
Reza Ameri
Hadoop basics
Hadoop basics
Antonio Silveira
Webinar: Inside Cloudera's Distribution including Apache Hadoop v3
Webinar: Inside Cloudera's Distribution including Apache Hadoop v3
Cloudera, Inc.
Pig - Analyzing data sets
Pig - Analyzing data sets
Creditas
Pig, Making Hadoop Easy
Pig, Making Hadoop Easy
Nick Dimiduk
Big data Hadoop presentation
Big data Hadoop presentation
Shivanee garg
Introduction to Hive
Introduction to Hive
Uday Vakalapudi
A glimpse of test automation in hadoop ecosystem by Deepika Achary
A glimpse of test automation in hadoop ecosystem by Deepika Achary
QA or the Highway
Hadoop training institutes in bangalore
Hadoop training institutes in bangalore
Kelly Technologies
Introduction to pig & pig latin
Introduction to pig & pig latin
knowbigdata
Hadoop
Hadoop
Bhushan Kulkarni
Introduction to Hive and HCatalog
Introduction to Hive and HCatalog
markgrover
Python in big data world
Python in big data world
Rohit
Hadoop workshop
Hadoop workshop
Purna Chander
Introduction to the Hadoop Ecosystem (codemotion Edition)
Introduction to the Hadoop Ecosystem (codemotion Edition)
Uwe Printz
HIVE: Data Warehousing & Analytics on Hadoop
HIVE: Data Warehousing & Analytics on Hadoop
Zheng Shao
Hadoopsummit16 myui
Hadoopsummit16 myui
Makoto Yui
Hadoop, Pig, and Twitter (NoSQL East 2009)
Hadoop, Pig, and Twitter (NoSQL East 2009)
Kevin Weil
Data Science
Data Science
Ahmet Bulut
Finding the needles in the haystack. An Overview of Analyzing Big Data with H...
Finding the needles in the haystack. An Overview of Analyzing Big Data with H...
Chris Baglieri
More Related Content
What's hot
An introduction to Apache Hadoop Hive
An introduction to Apache Hadoop Hive
Mike Frampton
Hive vs Pig for HadoopSourceCodeReading
Hive vs Pig for HadoopSourceCodeReading
Mitsuharu Hamba
An intriduction to hive
An intriduction to hive
Reza Ameri
Hadoop basics
Hadoop basics
Antonio Silveira
Webinar: Inside Cloudera's Distribution including Apache Hadoop v3
Webinar: Inside Cloudera's Distribution including Apache Hadoop v3
Cloudera, Inc.
Pig - Analyzing data sets
Pig - Analyzing data sets
Creditas
Pig, Making Hadoop Easy
Pig, Making Hadoop Easy
Nick Dimiduk
Big data Hadoop presentation
Big data Hadoop presentation
Shivanee garg
Introduction to Hive
Introduction to Hive
Uday Vakalapudi
A glimpse of test automation in hadoop ecosystem by Deepika Achary
A glimpse of test automation in hadoop ecosystem by Deepika Achary
QA or the Highway
Hadoop training institutes in bangalore
Hadoop training institutes in bangalore
Kelly Technologies
Introduction to pig & pig latin
Introduction to pig & pig latin
knowbigdata
Hadoop
Hadoop
Bhushan Kulkarni
Introduction to Hive and HCatalog
Introduction to Hive and HCatalog
markgrover
Python in big data world
Python in big data world
Rohit
Hadoop workshop
Hadoop workshop
Purna Chander
Introduction to the Hadoop Ecosystem (codemotion Edition)
Introduction to the Hadoop Ecosystem (codemotion Edition)
Uwe Printz
HIVE: Data Warehousing & Analytics on Hadoop
HIVE: Data Warehousing & Analytics on Hadoop
Zheng Shao
Hadoopsummit16 myui
Hadoopsummit16 myui
Makoto Yui
Hadoop, Pig, and Twitter (NoSQL East 2009)
Hadoop, Pig, and Twitter (NoSQL East 2009)
Kevin Weil
What's hot
(20)
An introduction to Apache Hadoop Hive
An introduction to Apache Hadoop Hive
Hive vs Pig for HadoopSourceCodeReading
Hive vs Pig for HadoopSourceCodeReading
An intriduction to hive
An intriduction to hive
Hadoop basics
Hadoop basics
Webinar: Inside Cloudera's Distribution including Apache Hadoop v3
Webinar: Inside Cloudera's Distribution including Apache Hadoop v3
Pig - Analyzing data sets
Pig - Analyzing data sets
Pig, Making Hadoop Easy
Pig, Making Hadoop Easy
Big data Hadoop presentation
Big data Hadoop presentation
Introduction to Hive
Introduction to Hive
A glimpse of test automation in hadoop ecosystem by Deepika Achary
A glimpse of test automation in hadoop ecosystem by Deepika Achary
Hadoop training institutes in bangalore
Hadoop training institutes in bangalore
Introduction to pig & pig latin
Introduction to pig & pig latin
Hadoop
Hadoop
Introduction to Hive and HCatalog
Introduction to Hive and HCatalog
Python in big data world
Python in big data world
Hadoop workshop
Hadoop workshop
Introduction to the Hadoop Ecosystem (codemotion Edition)
Introduction to the Hadoop Ecosystem (codemotion Edition)
HIVE: Data Warehousing & Analytics on Hadoop
HIVE: Data Warehousing & Analytics on Hadoop
Hadoopsummit16 myui
Hadoopsummit16 myui
Hadoop, Pig, and Twitter (NoSQL East 2009)
Hadoop, Pig, and Twitter (NoSQL East 2009)
Similar to Into The Wonderful
Data Science
Data Science
Ahmet Bulut
Finding the needles in the haystack. An Overview of Analyzing Big Data with H...
Finding the needles in the haystack. An Overview of Analyzing Big Data with H...
Chris Baglieri
Off-Label Data Mesh: A Prescription for Healthier Data
Off-Label Data Mesh: A Prescription for Healthier Data
HostedbyConfluent
Puppet for Sys Admins
Puppet for Sys Admins
Puppet
The Sandbox Container Directory
The Sandbox Container Directory
Bongwon Lee
(BAC307) The Cold Data Playbook: Building the Ultimate Archive Solution in Am...
(BAC307) The Cold Data Playbook: Building the Ultimate Archive Solution in Am...
Amazon Web Services
Filesystems Lisbon 2018
Filesystems Lisbon 2018
Frank de Jonge
data-mesh-101.pptx
data-mesh-101.pptx
TarekHamdi8
Cassandra & puppet, scaling data at $15 per month
Cassandra & puppet, scaling data at $15 per month
daveconnors
UnConference for Georgia Southern Computer Science March 31, 2015
UnConference for Georgia Southern Computer Science March 31, 2015
Christopher Curtin
DataHub
DataHub
Aditya Parameswaran
Computer basics--basic comp-oper
Computer basics--basic comp-oper
Sabbir Alam
Hadoop and Pig at Twitter__HadoopSummit2010
Hadoop and Pig at Twitter__HadoopSummit2010
Yahoo Developer Network
Hadoop World 2011: Building Web Analytics Processing on Hadoop at CBS Interac...
Hadoop World 2011: Building Web Analytics Processing on Hadoop at CBS Interac...
Cloudera, Inc.
PCM Vision 2019 Breakout: Quest Software
PCM Vision 2019 Breakout: Quest Software
PCM
Distributed and Fault Tolerant Realtime Computation with Apache Storm, Apache...
Distributed and Fault Tolerant Realtime Computation with Apache Storm, Apache...
Folio3 Software
Towards a rebirth of data science (by Data Fellas)
Towards a rebirth of data science (by Data Fellas)
Andy Petrella
Michael Hall [InfluxData] | Become an InfluxDB Pro in 20 Minutes | InfluxDays...
Michael Hall [InfluxData] | Become an InfluxDB Pro in 20 Minutes | InfluxDays...
InfluxData
Free Tech Tools - Promotions East 2011
Free Tech Tools - Promotions East 2011
Advertising Specialties Alliance
Elastic Data Analytics Platform @Datadog
Elastic Data Analytics Platform @Datadog
C4Media
Similar to Into The Wonderful
(20)
Data Science
Data Science
Finding the needles in the haystack. An Overview of Analyzing Big Data with H...
Finding the needles in the haystack. An Overview of Analyzing Big Data with H...
Off-Label Data Mesh: A Prescription for Healthier Data
Off-Label Data Mesh: A Prescription for Healthier Data
Puppet for Sys Admins
Puppet for Sys Admins
The Sandbox Container Directory
The Sandbox Container Directory
(BAC307) The Cold Data Playbook: Building the Ultimate Archive Solution in Am...
(BAC307) The Cold Data Playbook: Building the Ultimate Archive Solution in Am...
Filesystems Lisbon 2018
Filesystems Lisbon 2018
data-mesh-101.pptx
data-mesh-101.pptx
Cassandra & puppet, scaling data at $15 per month
Cassandra & puppet, scaling data at $15 per month
UnConference for Georgia Southern Computer Science March 31, 2015
UnConference for Georgia Southern Computer Science March 31, 2015
DataHub
DataHub
Computer basics--basic comp-oper
Computer basics--basic comp-oper
Hadoop and Pig at Twitter__HadoopSummit2010
Hadoop and Pig at Twitter__HadoopSummit2010
Hadoop World 2011: Building Web Analytics Processing on Hadoop at CBS Interac...
Hadoop World 2011: Building Web Analytics Processing on Hadoop at CBS Interac...
PCM Vision 2019 Breakout: Quest Software
PCM Vision 2019 Breakout: Quest Software
Distributed and Fault Tolerant Realtime Computation with Apache Storm, Apache...
Distributed and Fault Tolerant Realtime Computation with Apache Storm, Apache...
Towards a rebirth of data science (by Data Fellas)
Towards a rebirth of data science (by Data Fellas)
Michael Hall [InfluxData] | Become an InfluxDB Pro in 20 Minutes | InfluxDays...
Michael Hall [InfluxData] | Become an InfluxDB Pro in 20 Minutes | InfluxDays...
Free Tech Tools - Promotions East 2011
Free Tech Tools - Promotions East 2011
Elastic Data Analytics Platform @Datadog
Elastic Data Analytics Platform @Datadog
More from Matt Wood
Genomics in the Cloud
Genomics in the Cloud
Matt Wood
How to make Friendfeeds and influence people
How to make Friendfeeds and influence people
Matt Wood
Genomes On Rails
Genomes On Rails
Matt Wood
Genomes On Rails
Genomes On Rails
Matt Wood
Extreme Informatics
Extreme Informatics
Matt Wood
What can Bioinformaticians learn from YouTube?
What can Bioinformaticians learn from YouTube?
Matt Wood
The A to Z of developing for the web
The A to Z of developing for the web
Matt Wood
Introduction to Scrum
Introduction to Scrum
Matt Wood
30 Minutes With Rails
30 Minutes With Rails
Matt Wood
Subversion Best Practices
Subversion Best Practices
Matt Wood
Lucene
Lucene
Matt Wood
Introduction to the Semantic Web
Introduction to the Semantic Web
Matt Wood
More from Matt Wood
(12)
Genomics in the Cloud
Genomics in the Cloud
How to make Friendfeeds and influence people
How to make Friendfeeds and influence people
Genomes On Rails
Genomes On Rails
Genomes On Rails
Genomes On Rails
Extreme Informatics
Extreme Informatics
What can Bioinformaticians learn from YouTube?
What can Bioinformaticians learn from YouTube?
The A to Z of developing for the web
The A to Z of developing for the web
Introduction to Scrum
Introduction to Scrum
30 Minutes With Rails
30 Minutes With Rails
Subversion Best Practices
Subversion Best Practices
Lucene
Lucene
Introduction to the Semantic Web
Introduction to the Semantic Web
Recently uploaded
Architecting Cloud Native Applications
Architecting Cloud Native Applications
WSO2
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
apidays
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
The Digital Insurer
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
Product Anonymous
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Angeliki Cooney
[BuildWithAI] Introduction to Gemini.pdf
[BuildWithAI] Introduction to Gemini.pdf
Sandro Moreira
presentation ICT roal in 21st century education
presentation ICT roal in 21st century education
jfdjdjcjdnsjd
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
apidays
AXA XL - Insurer Innovation Award Americas 2024
AXA XL - Insurer Innovation Award Americas 2024
The Digital Insurer
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
MadyBayot
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of Terraform
Andrey Devyatkin
DBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor Presentation
Dropbox
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
DianaGray10
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
debabhi2
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
apidays
Spring Boot vs Quarkus the ultimate battle - DevoxxUK
Spring Boot vs Quarkus the ultimate battle - DevoxxUK
Jago de Vreede
Ransomware_Q4_2023. The report. [EN].pdf
Ransomware_Q4_2023. The report. [EN].pdf
Overkill Security
Cyberprint. Dark Pink Apt Group [EN].pdf
Cyberprint. Dark Pink Apt Group [EN].pdf
Overkill Security
MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024
MIND CTI
CNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In Pakistan
danishmna97
Recently uploaded
(20)
Architecting Cloud Native Applications
Architecting Cloud Native Applications
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
[BuildWithAI] Introduction to Gemini.pdf
[BuildWithAI] Introduction to Gemini.pdf
presentation ICT roal in 21st century education
presentation ICT roal in 21st century education
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
AXA XL - Insurer Innovation Award Americas 2024
AXA XL - Insurer Innovation Award Americas 2024
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of Terraform
DBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor Presentation
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Spring Boot vs Quarkus the ultimate battle - DevoxxUK
Spring Boot vs Quarkus the ultimate battle - DevoxxUK
Ransomware_Q4_2023. The report. [EN].pdf
Ransomware_Q4_2023. The report. [EN].pdf
Cyberprint. Dark Pink Apt Group [EN].pdf
Cyberprint. Dark Pink Apt Group [EN].pdf
MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024
CNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In Pakistan
Into The Wonderful
1.
Into the Wonderful Towards
a Virtual Institute
2.
you are here
3.
Data
4.
Lots of data
5.
Lots of data,
lots of people
6.
Lots of data,
lots of people, lots of compute
7.
Lots of data,
lots of people, lots of compute, lots of uses
8.
Lots of data,
lots of people, lots of compute, lots of uses, lots and lots and lots and lots...
9.
Trillionics
10.
A platform for
science
11.
12.
1
Get 2 Select 3 Work 4 Save
13.
1
Get 2 Select 3 Work 4 Save
14.
1
Get 2 Select 3 Work 4 Save
15.
1
Get 2 Select 3 Work 4 Save
16.
Work is the
killer app get here quickly
17.
Work = publications
18.
Problematic for complex
data
19.
1
Get 2 Select 3 Work 4 Save
20.
1
Get: flat files / databases 2 Select 3 Work 4 Save
21.
1
Get: flat files / databases 2 Select: scripts / directories 3 Work 4 Save
22.
1
Get: flat files / databases 2 Select: scripts / directories 3 Work: interesting 4 Save
23.
1
Get: flat files / databases 2 Select: scripts / directories 3 Work: interesting 4 Save: flat files / databases
24.
Get
Filter Work Save
25.
Get
Filter Work Save
26.
Get
Filter Work Save
27.
Get
Filter Work Save
28.
Get
Filter Work Save
29.
Get
Filter Work Save
30.
Get
Filter Work Save
31.
Get
Filter Work Save
32.
Filter
Save Get Work Get Filter Work Save
33.
Filter
Save Work Get Get Work Get Filter Work Save
34.
Filter
Save Work Get Get Work Get Filter Work Save
35.
36.
Virtualise
37.
Get
Save
38.
Data platform Get
Save
39.
Data platform Get
Save Work
40.
Data platform Get
Save Work App platform
41.
Data accessible via
services
42.
Applications accessible
via services
43.
Data platform
Get / Save Work Projects / SNP calling App platform
44.
Distribute
45.
Data platform Hintxon
Get / Save San Diego Work App platform
46.
Distributed storage
Virtualised services Application programming interfaces Getters Filters Savers Work
47.
Distributed storage
Virtualised services Application programming interfaces Getters Filters Savers Work Work Work Work Work Work Work Work Work Work Work Work Work Work Work Work Work
48.
A distributed mindset
49.
map/reduce
50.
1. map
51.
@a = [
1, 2, 3 ] @result = [] for each $value in @a push @result, map($value) end sub map($incoming) return ($incoming * 10) end
52.
2. reduce
53.
reduce(@result) sub reduce($r)
<transform $r> end
54.
independent
55.
of array size!
independent
56.
of array size!
independent of ea ch other!
57.
independent distribute across virtual
machines!
58.
Prerequisites
59.
Open data easy to
get a t data
60.
soft ware as
a service Open APIs
61.
Beyond SQL
62.
Accessibility
63.
East coast 24/7
Accessibility Dow n the co rridor West coast
64.
Reliability
65.
Build for flux
66.
Authentication
67.
Privacy
68.
Less software
69.
Distribute everything
70.
Replicate everything
Speed. Redundancy.
71.
Will it scale?
72.
Oh yes
73.
New York Times
74.
11 million TIFs
75.
24 hours
$500
76.
Google, Yahoo!
Amazon
77.
We are here
78.
We need to
start now
79.
2 X
80.
150 Tb/week
81.
We need to
start now as in, like, ye sterday
82.
Petabyte journal club
foomongers.org.uk
83.
Thank you
84.
GREENISGOOD.CO.UK
Download now