Submit Search
Upload
Into The Wonderful
•
14 likes
•
4,956 views
Matt Wood
Follow
An introduction to cloud computing from a scientific research perspective.
Read less
Read more
Technology
Business
Report
Share
Report
Share
1 of 84
Download now
Download to read offline
Recommended
Hadoop at Yahoo! -- Hadoop World NY 2009
Hadoop at Yahoo! -- Hadoop World NY 2009
yhadoop
Hadoop Hive Tutorial | Hive Fundamentals | Hive Architecture
Hadoop Hive Tutorial | Hive Fundamentals | Hive Architecture
Skillspeed
Hive paris
Hive paris
Szehon Ho
RESTo - restful semantic search tool for geospatial
RESTo - restful semantic search tool for geospatial
Gasperi Jerome
Big data and tools
Big data and tools
Shivam Shukla
Hadoop-BigData
Hadoop-BigData
Gigin Krishnan
Big data ecosystem
Big data ecosystem
SlideCentral
Apache Pig for Data Scientists
Apache Pig for Data Scientists
DataWorks Summit
Recommended
Hadoop at Yahoo! -- Hadoop World NY 2009
Hadoop at Yahoo! -- Hadoop World NY 2009
yhadoop
Hadoop Hive Tutorial | Hive Fundamentals | Hive Architecture
Hadoop Hive Tutorial | Hive Fundamentals | Hive Architecture
Skillspeed
Hive paris
Hive paris
Szehon Ho
RESTo - restful semantic search tool for geospatial
RESTo - restful semantic search tool for geospatial
Gasperi Jerome
Big data and tools
Big data and tools
Shivam Shukla
Hadoop-BigData
Hadoop-BigData
Gigin Krishnan
Big data ecosystem
Big data ecosystem
SlideCentral
Apache Pig for Data Scientists
Apache Pig for Data Scientists
DataWorks Summit
An introduction to Apache Hadoop Hive
An introduction to Apache Hadoop Hive
Mike Frampton
Hive vs Pig for HadoopSourceCodeReading
Hive vs Pig for HadoopSourceCodeReading
Mitsuharu Hamba
An intriduction to hive
An intriduction to hive
Reza Ameri
Hadoop basics
Hadoop basics
Antonio Silveira
Webinar: Inside Cloudera's Distribution including Apache Hadoop v3
Webinar: Inside Cloudera's Distribution including Apache Hadoop v3
Cloudera, Inc.
Pig - Analyzing data sets
Pig - Analyzing data sets
Creditas
Pig, Making Hadoop Easy
Pig, Making Hadoop Easy
Nick Dimiduk
Big data Hadoop presentation
Big data Hadoop presentation
Shivanee garg
Introduction to Hive
Introduction to Hive
Uday Vakalapudi
A glimpse of test automation in hadoop ecosystem by Deepika Achary
A glimpse of test automation in hadoop ecosystem by Deepika Achary
QA or the Highway
Hadoop training institutes in bangalore
Hadoop training institutes in bangalore
Kelly Technologies
Introduction to pig & pig latin
Introduction to pig & pig latin
knowbigdata
Hadoop
Hadoop
Bhushan Kulkarni
Introduction to Hive and HCatalog
Introduction to Hive and HCatalog
markgrover
Python in big data world
Python in big data world
Rohit
Hadoop workshop
Hadoop workshop
Purna Chander
Introduction to the Hadoop Ecosystem (codemotion Edition)
Introduction to the Hadoop Ecosystem (codemotion Edition)
Uwe Printz
HIVE: Data Warehousing & Analytics on Hadoop
HIVE: Data Warehousing & Analytics on Hadoop
Zheng Shao
Hadoopsummit16 myui
Hadoopsummit16 myui
Makoto Yui
Hadoop, Pig, and Twitter (NoSQL East 2009)
Hadoop, Pig, and Twitter (NoSQL East 2009)
Kevin Weil
Data Science
Data Science
Ahmet Bulut
Finding the needles in the haystack. An Overview of Analyzing Big Data with H...
Finding the needles in the haystack. An Overview of Analyzing Big Data with H...
Chris Baglieri
More Related Content
What's hot
An introduction to Apache Hadoop Hive
An introduction to Apache Hadoop Hive
Mike Frampton
Hive vs Pig for HadoopSourceCodeReading
Hive vs Pig for HadoopSourceCodeReading
Mitsuharu Hamba
An intriduction to hive
An intriduction to hive
Reza Ameri
Hadoop basics
Hadoop basics
Antonio Silveira
Webinar: Inside Cloudera's Distribution including Apache Hadoop v3
Webinar: Inside Cloudera's Distribution including Apache Hadoop v3
Cloudera, Inc.
Pig - Analyzing data sets
Pig - Analyzing data sets
Creditas
Pig, Making Hadoop Easy
Pig, Making Hadoop Easy
Nick Dimiduk
Big data Hadoop presentation
Big data Hadoop presentation
Shivanee garg
Introduction to Hive
Introduction to Hive
Uday Vakalapudi
A glimpse of test automation in hadoop ecosystem by Deepika Achary
A glimpse of test automation in hadoop ecosystem by Deepika Achary
QA or the Highway
Hadoop training institutes in bangalore
Hadoop training institutes in bangalore
Kelly Technologies
Introduction to pig & pig latin
Introduction to pig & pig latin
knowbigdata
Hadoop
Hadoop
Bhushan Kulkarni
Introduction to Hive and HCatalog
Introduction to Hive and HCatalog
markgrover
Python in big data world
Python in big data world
Rohit
Hadoop workshop
Hadoop workshop
Purna Chander
Introduction to the Hadoop Ecosystem (codemotion Edition)
Introduction to the Hadoop Ecosystem (codemotion Edition)
Uwe Printz
HIVE: Data Warehousing & Analytics on Hadoop
HIVE: Data Warehousing & Analytics on Hadoop
Zheng Shao
Hadoopsummit16 myui
Hadoopsummit16 myui
Makoto Yui
Hadoop, Pig, and Twitter (NoSQL East 2009)
Hadoop, Pig, and Twitter (NoSQL East 2009)
Kevin Weil
What's hot
(20)
An introduction to Apache Hadoop Hive
An introduction to Apache Hadoop Hive
Hive vs Pig for HadoopSourceCodeReading
Hive vs Pig for HadoopSourceCodeReading
An intriduction to hive
An intriduction to hive
Hadoop basics
Hadoop basics
Webinar: Inside Cloudera's Distribution including Apache Hadoop v3
Webinar: Inside Cloudera's Distribution including Apache Hadoop v3
Pig - Analyzing data sets
Pig - Analyzing data sets
Pig, Making Hadoop Easy
Pig, Making Hadoop Easy
Big data Hadoop presentation
Big data Hadoop presentation
Introduction to Hive
Introduction to Hive
A glimpse of test automation in hadoop ecosystem by Deepika Achary
A glimpse of test automation in hadoop ecosystem by Deepika Achary
Hadoop training institutes in bangalore
Hadoop training institutes in bangalore
Introduction to pig & pig latin
Introduction to pig & pig latin
Hadoop
Hadoop
Introduction to Hive and HCatalog
Introduction to Hive and HCatalog
Python in big data world
Python in big data world
Hadoop workshop
Hadoop workshop
Introduction to the Hadoop Ecosystem (codemotion Edition)
Introduction to the Hadoop Ecosystem (codemotion Edition)
HIVE: Data Warehousing & Analytics on Hadoop
HIVE: Data Warehousing & Analytics on Hadoop
Hadoopsummit16 myui
Hadoopsummit16 myui
Hadoop, Pig, and Twitter (NoSQL East 2009)
Hadoop, Pig, and Twitter (NoSQL East 2009)
Similar to Into The Wonderful
Data Science
Data Science
Ahmet Bulut
Finding the needles in the haystack. An Overview of Analyzing Big Data with H...
Finding the needles in the haystack. An Overview of Analyzing Big Data with H...
Chris Baglieri
Off-Label Data Mesh: A Prescription for Healthier Data
Off-Label Data Mesh: A Prescription for Healthier Data
HostedbyConfluent
Puppet for Sys Admins
Puppet for Sys Admins
Puppet
The Sandbox Container Directory
The Sandbox Container Directory
Bongwon Lee
(BAC307) The Cold Data Playbook: Building the Ultimate Archive Solution in Am...
(BAC307) The Cold Data Playbook: Building the Ultimate Archive Solution in Am...
Amazon Web Services
Filesystems Lisbon 2018
Filesystems Lisbon 2018
Frank de Jonge
data-mesh-101.pptx
data-mesh-101.pptx
TarekHamdi8
Cassandra & puppet, scaling data at $15 per month
Cassandra & puppet, scaling data at $15 per month
daveconnors
UnConference for Georgia Southern Computer Science March 31, 2015
UnConference for Georgia Southern Computer Science March 31, 2015
Christopher Curtin
DataHub
DataHub
Aditya Parameswaran
Computer basics--basic comp-oper
Computer basics--basic comp-oper
Sabbir Alam
Hadoop and Pig at Twitter__HadoopSummit2010
Hadoop and Pig at Twitter__HadoopSummit2010
Yahoo Developer Network
Hadoop World 2011: Building Web Analytics Processing on Hadoop at CBS Interac...
Hadoop World 2011: Building Web Analytics Processing on Hadoop at CBS Interac...
Cloudera, Inc.
PCM Vision 2019 Breakout: Quest Software
PCM Vision 2019 Breakout: Quest Software
PCM
Distributed and Fault Tolerant Realtime Computation with Apache Storm, Apache...
Distributed and Fault Tolerant Realtime Computation with Apache Storm, Apache...
Folio3 Software
Towards a rebirth of data science (by Data Fellas)
Towards a rebirth of data science (by Data Fellas)
Andy Petrella
Michael Hall [InfluxData] | Become an InfluxDB Pro in 20 Minutes | InfluxDays...
Michael Hall [InfluxData] | Become an InfluxDB Pro in 20 Minutes | InfluxDays...
InfluxData
Free Tech Tools - Promotions East 2011
Free Tech Tools - Promotions East 2011
Advertising Specialties Alliance
Elastic Data Analytics Platform @Datadog
Elastic Data Analytics Platform @Datadog
C4Media
Similar to Into The Wonderful
(20)
Data Science
Data Science
Finding the needles in the haystack. An Overview of Analyzing Big Data with H...
Finding the needles in the haystack. An Overview of Analyzing Big Data with H...
Off-Label Data Mesh: A Prescription for Healthier Data
Off-Label Data Mesh: A Prescription for Healthier Data
Puppet for Sys Admins
Puppet for Sys Admins
The Sandbox Container Directory
The Sandbox Container Directory
(BAC307) The Cold Data Playbook: Building the Ultimate Archive Solution in Am...
(BAC307) The Cold Data Playbook: Building the Ultimate Archive Solution in Am...
Filesystems Lisbon 2018
Filesystems Lisbon 2018
data-mesh-101.pptx
data-mesh-101.pptx
Cassandra & puppet, scaling data at $15 per month
Cassandra & puppet, scaling data at $15 per month
UnConference for Georgia Southern Computer Science March 31, 2015
UnConference for Georgia Southern Computer Science March 31, 2015
DataHub
DataHub
Computer basics--basic comp-oper
Computer basics--basic comp-oper
Hadoop and Pig at Twitter__HadoopSummit2010
Hadoop and Pig at Twitter__HadoopSummit2010
Hadoop World 2011: Building Web Analytics Processing on Hadoop at CBS Interac...
Hadoop World 2011: Building Web Analytics Processing on Hadoop at CBS Interac...
PCM Vision 2019 Breakout: Quest Software
PCM Vision 2019 Breakout: Quest Software
Distributed and Fault Tolerant Realtime Computation with Apache Storm, Apache...
Distributed and Fault Tolerant Realtime Computation with Apache Storm, Apache...
Towards a rebirth of data science (by Data Fellas)
Towards a rebirth of data science (by Data Fellas)
Michael Hall [InfluxData] | Become an InfluxDB Pro in 20 Minutes | InfluxDays...
Michael Hall [InfluxData] | Become an InfluxDB Pro in 20 Minutes | InfluxDays...
Free Tech Tools - Promotions East 2011
Free Tech Tools - Promotions East 2011
Elastic Data Analytics Platform @Datadog
Elastic Data Analytics Platform @Datadog
More from Matt Wood
Genomics in the Cloud
Genomics in the Cloud
Matt Wood
How to make Friendfeeds and influence people
How to make Friendfeeds and influence people
Matt Wood
Genomes On Rails
Genomes On Rails
Matt Wood
Genomes On Rails
Genomes On Rails
Matt Wood
Extreme Informatics
Extreme Informatics
Matt Wood
What can Bioinformaticians learn from YouTube?
What can Bioinformaticians learn from YouTube?
Matt Wood
The A to Z of developing for the web
The A to Z of developing for the web
Matt Wood
Introduction to Scrum
Introduction to Scrum
Matt Wood
30 Minutes With Rails
30 Minutes With Rails
Matt Wood
Subversion Best Practices
Subversion Best Practices
Matt Wood
Lucene
Lucene
Matt Wood
Introduction to the Semantic Web
Introduction to the Semantic Web
Matt Wood
More from Matt Wood
(12)
Genomics in the Cloud
Genomics in the Cloud
How to make Friendfeeds and influence people
How to make Friendfeeds and influence people
Genomes On Rails
Genomes On Rails
Genomes On Rails
Genomes On Rails
Extreme Informatics
Extreme Informatics
What can Bioinformaticians learn from YouTube?
What can Bioinformaticians learn from YouTube?
The A to Z of developing for the web
The A to Z of developing for the web
Introduction to Scrum
Introduction to Scrum
30 Minutes With Rails
30 Minutes With Rails
Subversion Best Practices
Subversion Best Practices
Lucene
Lucene
Introduction to the Semantic Web
Introduction to the Semantic Web
Recently uploaded
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
hans926745
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 Presentation
Ridwan Fadjar
Maximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptx
OnBoard
Benefits Of Flutter Compared To Other Frameworks
Benefits Of Flutter Compared To Other Frameworks
Softradix Technologies
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreter
naman860154
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men
Delhi Call girls
Azure Monitor & Application Insight to monitor Infrastructure & Application
Azure Monitor & Application Insight to monitor Infrastructure & Application
AndikSusilo4
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
Radu Cotescu
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
soniya singh
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
BookNet Canada
Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101
Paola De la Torre
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
ThousandEyes
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
Delhi Call girls
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
carlostorres15106
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Patryk Bandurski
Slack Application Development 101 Slides
Slack Application Development 101 Slides
praypatel2
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
Michael W. Hawkins
Understanding the Laravel MVC Architecture
Understanding the Laravel MVC Architecture
Pixlogix Infotech
Install Stable Diffusion in windows machine
Install Stable Diffusion in windows machine
Padma Pradeep
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
ThousandEyes
Recently uploaded
(20)
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 Presentation
Maximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptx
Benefits Of Flutter Compared To Other Frameworks
Benefits Of Flutter Compared To Other Frameworks
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreter
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men
Azure Monitor & Application Insight to monitor Infrastructure & Application
Azure Monitor & Application Insight to monitor Infrastructure & Application
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Slack Application Development 101 Slides
Slack Application Development 101 Slides
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
Understanding the Laravel MVC Architecture
Understanding the Laravel MVC Architecture
Install Stable Diffusion in windows machine
Install Stable Diffusion in windows machine
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
Into The Wonderful
1.
Into the Wonderful Towards
a Virtual Institute
2.
you are here
3.
Data
4.
Lots of data
5.
Lots of data,
lots of people
6.
Lots of data,
lots of people, lots of compute
7.
Lots of data,
lots of people, lots of compute, lots of uses
8.
Lots of data,
lots of people, lots of compute, lots of uses, lots and lots and lots and lots...
9.
Trillionics
10.
A platform for
science
11.
12.
1
Get 2 Select 3 Work 4 Save
13.
1
Get 2 Select 3 Work 4 Save
14.
1
Get 2 Select 3 Work 4 Save
15.
1
Get 2 Select 3 Work 4 Save
16.
Work is the
killer app get here quickly
17.
Work = publications
18.
Problematic for complex
data
19.
1
Get 2 Select 3 Work 4 Save
20.
1
Get: flat files / databases 2 Select 3 Work 4 Save
21.
1
Get: flat files / databases 2 Select: scripts / directories 3 Work 4 Save
22.
1
Get: flat files / databases 2 Select: scripts / directories 3 Work: interesting 4 Save
23.
1
Get: flat files / databases 2 Select: scripts / directories 3 Work: interesting 4 Save: flat files / databases
24.
Get
Filter Work Save
25.
Get
Filter Work Save
26.
Get
Filter Work Save
27.
Get
Filter Work Save
28.
Get
Filter Work Save
29.
Get
Filter Work Save
30.
Get
Filter Work Save
31.
Get
Filter Work Save
32.
Filter
Save Get Work Get Filter Work Save
33.
Filter
Save Work Get Get Work Get Filter Work Save
34.
Filter
Save Work Get Get Work Get Filter Work Save
35.
36.
Virtualise
37.
Get
Save
38.
Data platform Get
Save
39.
Data platform Get
Save Work
40.
Data platform Get
Save Work App platform
41.
Data accessible via
services
42.
Applications accessible
via services
43.
Data platform
Get / Save Work Projects / SNP calling App platform
44.
Distribute
45.
Data platform Hintxon
Get / Save San Diego Work App platform
46.
Distributed storage
Virtualised services Application programming interfaces Getters Filters Savers Work
47.
Distributed storage
Virtualised services Application programming interfaces Getters Filters Savers Work Work Work Work Work Work Work Work Work Work Work Work Work Work Work Work Work
48.
A distributed mindset
49.
map/reduce
50.
1. map
51.
@a = [
1, 2, 3 ] @result = [] for each $value in @a push @result, map($value) end sub map($incoming) return ($incoming * 10) end
52.
2. reduce
53.
reduce(@result) sub reduce($r)
<transform $r> end
54.
independent
55.
of array size!
independent
56.
of array size!
independent of ea ch other!
57.
independent distribute across virtual
machines!
58.
Prerequisites
59.
Open data easy to
get a t data
60.
soft ware as
a service Open APIs
61.
Beyond SQL
62.
Accessibility
63.
East coast 24/7
Accessibility Dow n the co rridor West coast
64.
Reliability
65.
Build for flux
66.
Authentication
67.
Privacy
68.
Less software
69.
Distribute everything
70.
Replicate everything
Speed. Redundancy.
71.
Will it scale?
72.
Oh yes
73.
New York Times
74.
11 million TIFs
75.
24 hours
$500
76.
Google, Yahoo!
Amazon
77.
We are here
78.
We need to
start now
79.
2 X
80.
150 Tb/week
81.
We need to
start now as in, like, ye sterday
82.
Petabyte journal club
foomongers.org.uk
83.
Thank you
84.
GREENISGOOD.CO.UK
Download now