Submit Search
Upload
Into The Wonderful
•
14 likes
•
4,956 views
Matt Wood
Follow
An introduction to cloud computing from a scientific research perspective.
Read less
Read more
Technology
Business
Report
Share
Report
Share
1 of 84
Download now
Download to read offline
Recommended
Hadoop at Yahoo! -- Hadoop World NY 2009
Hadoop at Yahoo! -- Hadoop World NY 2009
yhadoop
Hadoop Hive Tutorial | Hive Fundamentals | Hive Architecture
Hadoop Hive Tutorial | Hive Fundamentals | Hive Architecture
Skillspeed
Hive paris
Hive paris
Szehon Ho
RESTo - restful semantic search tool for geospatial
RESTo - restful semantic search tool for geospatial
Gasperi Jerome
Big data and tools
Big data and tools
Shivam Shukla
Hadoop-BigData
Hadoop-BigData
Gigin Krishnan
Big data ecosystem
Big data ecosystem
SlideCentral
Apache Pig for Data Scientists
Apache Pig for Data Scientists
DataWorks Summit
Recommended
Hadoop at Yahoo! -- Hadoop World NY 2009
Hadoop at Yahoo! -- Hadoop World NY 2009
yhadoop
Hadoop Hive Tutorial | Hive Fundamentals | Hive Architecture
Hadoop Hive Tutorial | Hive Fundamentals | Hive Architecture
Skillspeed
Hive paris
Hive paris
Szehon Ho
RESTo - restful semantic search tool for geospatial
RESTo - restful semantic search tool for geospatial
Gasperi Jerome
Big data and tools
Big data and tools
Shivam Shukla
Hadoop-BigData
Hadoop-BigData
Gigin Krishnan
Big data ecosystem
Big data ecosystem
SlideCentral
Apache Pig for Data Scientists
Apache Pig for Data Scientists
DataWorks Summit
An introduction to Apache Hadoop Hive
An introduction to Apache Hadoop Hive
Mike Frampton
Hive vs Pig for HadoopSourceCodeReading
Hive vs Pig for HadoopSourceCodeReading
Mitsuharu Hamba
An intriduction to hive
An intriduction to hive
Reza Ameri
Hadoop basics
Hadoop basics
Antonio Silveira
Webinar: Inside Cloudera's Distribution including Apache Hadoop v3
Webinar: Inside Cloudera's Distribution including Apache Hadoop v3
Cloudera, Inc.
Pig - Analyzing data sets
Pig - Analyzing data sets
Creditas
Pig, Making Hadoop Easy
Pig, Making Hadoop Easy
Nick Dimiduk
Big data Hadoop presentation
Big data Hadoop presentation
Shivanee garg
Introduction to Hive
Introduction to Hive
Uday Vakalapudi
A glimpse of test automation in hadoop ecosystem by Deepika Achary
A glimpse of test automation in hadoop ecosystem by Deepika Achary
QA or the Highway
Hadoop training institutes in bangalore
Hadoop training institutes in bangalore
Kelly Technologies
Introduction to pig & pig latin
Introduction to pig & pig latin
knowbigdata
Hadoop
Hadoop
Bhushan Kulkarni
Introduction to Hive and HCatalog
Introduction to Hive and HCatalog
markgrover
Python in big data world
Python in big data world
Rohit
Hadoop workshop
Hadoop workshop
Purna Chander
Introduction to the Hadoop Ecosystem (codemotion Edition)
Introduction to the Hadoop Ecosystem (codemotion Edition)
Uwe Printz
HIVE: Data Warehousing & Analytics on Hadoop
HIVE: Data Warehousing & Analytics on Hadoop
Zheng Shao
Hadoopsummit16 myui
Hadoopsummit16 myui
Makoto Yui
Hadoop, Pig, and Twitter (NoSQL East 2009)
Hadoop, Pig, and Twitter (NoSQL East 2009)
Kevin Weil
Data Science
Data Science
Ahmet Bulut
Finding the needles in the haystack. An Overview of Analyzing Big Data with H...
Finding the needles in the haystack. An Overview of Analyzing Big Data with H...
Chris Baglieri
More Related Content
What's hot
An introduction to Apache Hadoop Hive
An introduction to Apache Hadoop Hive
Mike Frampton
Hive vs Pig for HadoopSourceCodeReading
Hive vs Pig for HadoopSourceCodeReading
Mitsuharu Hamba
An intriduction to hive
An intriduction to hive
Reza Ameri
Hadoop basics
Hadoop basics
Antonio Silveira
Webinar: Inside Cloudera's Distribution including Apache Hadoop v3
Webinar: Inside Cloudera's Distribution including Apache Hadoop v3
Cloudera, Inc.
Pig - Analyzing data sets
Pig - Analyzing data sets
Creditas
Pig, Making Hadoop Easy
Pig, Making Hadoop Easy
Nick Dimiduk
Big data Hadoop presentation
Big data Hadoop presentation
Shivanee garg
Introduction to Hive
Introduction to Hive
Uday Vakalapudi
A glimpse of test automation in hadoop ecosystem by Deepika Achary
A glimpse of test automation in hadoop ecosystem by Deepika Achary
QA or the Highway
Hadoop training institutes in bangalore
Hadoop training institutes in bangalore
Kelly Technologies
Introduction to pig & pig latin
Introduction to pig & pig latin
knowbigdata
Hadoop
Hadoop
Bhushan Kulkarni
Introduction to Hive and HCatalog
Introduction to Hive and HCatalog
markgrover
Python in big data world
Python in big data world
Rohit
Hadoop workshop
Hadoop workshop
Purna Chander
Introduction to the Hadoop Ecosystem (codemotion Edition)
Introduction to the Hadoop Ecosystem (codemotion Edition)
Uwe Printz
HIVE: Data Warehousing & Analytics on Hadoop
HIVE: Data Warehousing & Analytics on Hadoop
Zheng Shao
Hadoopsummit16 myui
Hadoopsummit16 myui
Makoto Yui
Hadoop, Pig, and Twitter (NoSQL East 2009)
Hadoop, Pig, and Twitter (NoSQL East 2009)
Kevin Weil
What's hot
(20)
An introduction to Apache Hadoop Hive
An introduction to Apache Hadoop Hive
Hive vs Pig for HadoopSourceCodeReading
Hive vs Pig for HadoopSourceCodeReading
An intriduction to hive
An intriduction to hive
Hadoop basics
Hadoop basics
Webinar: Inside Cloudera's Distribution including Apache Hadoop v3
Webinar: Inside Cloudera's Distribution including Apache Hadoop v3
Pig - Analyzing data sets
Pig - Analyzing data sets
Pig, Making Hadoop Easy
Pig, Making Hadoop Easy
Big data Hadoop presentation
Big data Hadoop presentation
Introduction to Hive
Introduction to Hive
A glimpse of test automation in hadoop ecosystem by Deepika Achary
A glimpse of test automation in hadoop ecosystem by Deepika Achary
Hadoop training institutes in bangalore
Hadoop training institutes in bangalore
Introduction to pig & pig latin
Introduction to pig & pig latin
Hadoop
Hadoop
Introduction to Hive and HCatalog
Introduction to Hive and HCatalog
Python in big data world
Python in big data world
Hadoop workshop
Hadoop workshop
Introduction to the Hadoop Ecosystem (codemotion Edition)
Introduction to the Hadoop Ecosystem (codemotion Edition)
HIVE: Data Warehousing & Analytics on Hadoop
HIVE: Data Warehousing & Analytics on Hadoop
Hadoopsummit16 myui
Hadoopsummit16 myui
Hadoop, Pig, and Twitter (NoSQL East 2009)
Hadoop, Pig, and Twitter (NoSQL East 2009)
Similar to Into The Wonderful
Data Science
Data Science
Ahmet Bulut
Finding the needles in the haystack. An Overview of Analyzing Big Data with H...
Finding the needles in the haystack. An Overview of Analyzing Big Data with H...
Chris Baglieri
Off-Label Data Mesh: A Prescription for Healthier Data
Off-Label Data Mesh: A Prescription for Healthier Data
HostedbyConfluent
Puppet for Sys Admins
Puppet for Sys Admins
Puppet
The Sandbox Container Directory
The Sandbox Container Directory
Bongwon Lee
(BAC307) The Cold Data Playbook: Building the Ultimate Archive Solution in Am...
(BAC307) The Cold Data Playbook: Building the Ultimate Archive Solution in Am...
Amazon Web Services
Filesystems Lisbon 2018
Filesystems Lisbon 2018
Frank de Jonge
data-mesh-101.pptx
data-mesh-101.pptx
TarekHamdi8
Cassandra & puppet, scaling data at $15 per month
Cassandra & puppet, scaling data at $15 per month
daveconnors
UnConference for Georgia Southern Computer Science March 31, 2015
UnConference for Georgia Southern Computer Science March 31, 2015
Christopher Curtin
DataHub
DataHub
Aditya Parameswaran
Computer basics--basic comp-oper
Computer basics--basic comp-oper
Sabbir Alam
Hadoop and Pig at Twitter__HadoopSummit2010
Hadoop and Pig at Twitter__HadoopSummit2010
Yahoo Developer Network
Hadoop World 2011: Building Web Analytics Processing on Hadoop at CBS Interac...
Hadoop World 2011: Building Web Analytics Processing on Hadoop at CBS Interac...
Cloudera, Inc.
PCM Vision 2019 Breakout: Quest Software
PCM Vision 2019 Breakout: Quest Software
PCM
Distributed and Fault Tolerant Realtime Computation with Apache Storm, Apache...
Distributed and Fault Tolerant Realtime Computation with Apache Storm, Apache...
Folio3 Software
Towards a rebirth of data science (by Data Fellas)
Towards a rebirth of data science (by Data Fellas)
Andy Petrella
Michael Hall [InfluxData] | Become an InfluxDB Pro in 20 Minutes | InfluxDays...
Michael Hall [InfluxData] | Become an InfluxDB Pro in 20 Minutes | InfluxDays...
InfluxData
Free Tech Tools - Promotions East 2011
Free Tech Tools - Promotions East 2011
Advertising Specialties Alliance
Elastic Data Analytics Platform @Datadog
Elastic Data Analytics Platform @Datadog
C4Media
Similar to Into The Wonderful
(20)
Data Science
Data Science
Finding the needles in the haystack. An Overview of Analyzing Big Data with H...
Finding the needles in the haystack. An Overview of Analyzing Big Data with H...
Off-Label Data Mesh: A Prescription for Healthier Data
Off-Label Data Mesh: A Prescription for Healthier Data
Puppet for Sys Admins
Puppet for Sys Admins
The Sandbox Container Directory
The Sandbox Container Directory
(BAC307) The Cold Data Playbook: Building the Ultimate Archive Solution in Am...
(BAC307) The Cold Data Playbook: Building the Ultimate Archive Solution in Am...
Filesystems Lisbon 2018
Filesystems Lisbon 2018
data-mesh-101.pptx
data-mesh-101.pptx
Cassandra & puppet, scaling data at $15 per month
Cassandra & puppet, scaling data at $15 per month
UnConference for Georgia Southern Computer Science March 31, 2015
UnConference for Georgia Southern Computer Science March 31, 2015
DataHub
DataHub
Computer basics--basic comp-oper
Computer basics--basic comp-oper
Hadoop and Pig at Twitter__HadoopSummit2010
Hadoop and Pig at Twitter__HadoopSummit2010
Hadoop World 2011: Building Web Analytics Processing on Hadoop at CBS Interac...
Hadoop World 2011: Building Web Analytics Processing on Hadoop at CBS Interac...
PCM Vision 2019 Breakout: Quest Software
PCM Vision 2019 Breakout: Quest Software
Distributed and Fault Tolerant Realtime Computation with Apache Storm, Apache...
Distributed and Fault Tolerant Realtime Computation with Apache Storm, Apache...
Towards a rebirth of data science (by Data Fellas)
Towards a rebirth of data science (by Data Fellas)
Michael Hall [InfluxData] | Become an InfluxDB Pro in 20 Minutes | InfluxDays...
Michael Hall [InfluxData] | Become an InfluxDB Pro in 20 Minutes | InfluxDays...
Free Tech Tools - Promotions East 2011
Free Tech Tools - Promotions East 2011
Elastic Data Analytics Platform @Datadog
Elastic Data Analytics Platform @Datadog
More from Matt Wood
Genomics in the Cloud
Genomics in the Cloud
Matt Wood
How to make Friendfeeds and influence people
How to make Friendfeeds and influence people
Matt Wood
Genomes On Rails
Genomes On Rails
Matt Wood
Genomes On Rails
Genomes On Rails
Matt Wood
Extreme Informatics
Extreme Informatics
Matt Wood
What can Bioinformaticians learn from YouTube?
What can Bioinformaticians learn from YouTube?
Matt Wood
The A to Z of developing for the web
The A to Z of developing for the web
Matt Wood
Introduction to Scrum
Introduction to Scrum
Matt Wood
30 Minutes With Rails
30 Minutes With Rails
Matt Wood
Subversion Best Practices
Subversion Best Practices
Matt Wood
Lucene
Lucene
Matt Wood
Introduction to the Semantic Web
Introduction to the Semantic Web
Matt Wood
More from Matt Wood
(12)
Genomics in the Cloud
Genomics in the Cloud
How to make Friendfeeds and influence people
How to make Friendfeeds and influence people
Genomes On Rails
Genomes On Rails
Genomes On Rails
Genomes On Rails
Extreme Informatics
Extreme Informatics
What can Bioinformaticians learn from YouTube?
What can Bioinformaticians learn from YouTube?
The A to Z of developing for the web
The A to Z of developing for the web
Introduction to Scrum
Introduction to Scrum
30 Minutes With Rails
30 Minutes With Rails
Subversion Best Practices
Subversion Best Practices
Lucene
Lucene
Introduction to the Semantic Web
Introduction to the Semantic Web
Recently uploaded
Advanced Computer Architecture – An Introduction
Advanced Computer Architecture – An Introduction
Dilum Bandara
What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024
Stephanie Beckett
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Precisely
Search Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdf
RankYa
SAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptx
NavinnSomaal
Story boards and shot lists for my a level piece
Story boards and shot lists for my a level piece
charlottematthew16
SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024
Lorenzo Miniero
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko
Fwdays
Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024
Scott Keck-Warren
How to write a Business Continuity Plan
How to write a Business Continuity Plan
Databarracks
DevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache Maven
Hervé Boutemy
Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024
Enterprise Knowledge
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platforms
Sergiu Bodiu
Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365
2toLead Limited
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
Fwdays
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck Presentation
Slibray Presentation
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
BookNet Canada
Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL Certs
Rizwan Syed
Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio Web
UiPathCommunity
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Zilliz
Recently uploaded
(20)
Advanced Computer Architecture – An Introduction
Advanced Computer Architecture – An Introduction
What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Search Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdf
SAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptx
Story boards and shot lists for my a level piece
Story boards and shot lists for my a level piece
SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko
Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024
How to write a Business Continuity Plan
How to write a Business Continuity Plan
DevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache Maven
Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platforms
Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck Presentation
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL Certs
Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio Web
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Into The Wonderful
1.
Into the Wonderful Towards
a Virtual Institute
2.
you are here
3.
Data
4.
Lots of data
5.
Lots of data,
lots of people
6.
Lots of data,
lots of people, lots of compute
7.
Lots of data,
lots of people, lots of compute, lots of uses
8.
Lots of data,
lots of people, lots of compute, lots of uses, lots and lots and lots and lots...
9.
Trillionics
10.
A platform for
science
11.
12.
1
Get 2 Select 3 Work 4 Save
13.
1
Get 2 Select 3 Work 4 Save
14.
1
Get 2 Select 3 Work 4 Save
15.
1
Get 2 Select 3 Work 4 Save
16.
Work is the
killer app get here quickly
17.
Work = publications
18.
Problematic for complex
data
19.
1
Get 2 Select 3 Work 4 Save
20.
1
Get: flat files / databases 2 Select 3 Work 4 Save
21.
1
Get: flat files / databases 2 Select: scripts / directories 3 Work 4 Save
22.
1
Get: flat files / databases 2 Select: scripts / directories 3 Work: interesting 4 Save
23.
1
Get: flat files / databases 2 Select: scripts / directories 3 Work: interesting 4 Save: flat files / databases
24.
Get
Filter Work Save
25.
Get
Filter Work Save
26.
Get
Filter Work Save
27.
Get
Filter Work Save
28.
Get
Filter Work Save
29.
Get
Filter Work Save
30.
Get
Filter Work Save
31.
Get
Filter Work Save
32.
Filter
Save Get Work Get Filter Work Save
33.
Filter
Save Work Get Get Work Get Filter Work Save
34.
Filter
Save Work Get Get Work Get Filter Work Save
35.
36.
Virtualise
37.
Get
Save
38.
Data platform Get
Save
39.
Data platform Get
Save Work
40.
Data platform Get
Save Work App platform
41.
Data accessible via
services
42.
Applications accessible
via services
43.
Data platform
Get / Save Work Projects / SNP calling App platform
44.
Distribute
45.
Data platform Hintxon
Get / Save San Diego Work App platform
46.
Distributed storage
Virtualised services Application programming interfaces Getters Filters Savers Work
47.
Distributed storage
Virtualised services Application programming interfaces Getters Filters Savers Work Work Work Work Work Work Work Work Work Work Work Work Work Work Work Work Work
48.
A distributed mindset
49.
map/reduce
50.
1. map
51.
@a = [
1, 2, 3 ] @result = [] for each $value in @a push @result, map($value) end sub map($incoming) return ($incoming * 10) end
52.
2. reduce
53.
reduce(@result) sub reduce($r)
<transform $r> end
54.
independent
55.
of array size!
independent
56.
of array size!
independent of ea ch other!
57.
independent distribute across virtual
machines!
58.
Prerequisites
59.
Open data easy to
get a t data
60.
soft ware as
a service Open APIs
61.
Beyond SQL
62.
Accessibility
63.
East coast 24/7
Accessibility Dow n the co rridor West coast
64.
Reliability
65.
Build for flux
66.
Authentication
67.
Privacy
68.
Less software
69.
Distribute everything
70.
Replicate everything
Speed. Redundancy.
71.
Will it scale?
72.
Oh yes
73.
New York Times
74.
11 million TIFs
75.
24 hours
$500
76.
Google, Yahoo!
Amazon
77.
We are here
78.
We need to
start now
79.
2 X
80.
150 Tb/week
81.
We need to
start now as in, like, ye sterday
82.
Petabyte journal club
foomongers.org.uk
83.
Thank you
84.
GREENISGOOD.CO.UK
Download now