Thinking in MapReduce
Ryan Brush
@ryanbrush
We programmers have had it pretty good
Hardware has scaled up faster than our problem sets
[Image: Software Engineers, Moore’s Law]
But the party is ending (or at least changing)
Data is growing faster than we can scale individual machines
So we have to spread our work across many machines
This is a big deal in health care
Fragmented information
Spread across many systems
No one has the complete picture
We need to put the picture back together again
Better-informed decisions
Reduce systematic friction
Understand and improve the health of populations
Chart Search
- Information extraction
- Semantic markup of documents
- Related concepts in search results
Medical Alerts
- Detect health risks in incoming data
- Notify clinicians to address those risks
- Quickly include new knowledge
Population Health
- Securely bring together health data
- Identify opportunities to improve care
- Support application of improvements
- Close the loop
The Unreasonable Effectiveness of Data
Simple models with lots of data almost always outperform complex models with less data
Peter Norvig, http://www.youtube.com/watch?v=yvDCzhbjYWs
So how can we tackle such large data sets?
Can we adapt what has worked historically?
After all, Relational Databases are Awesome
Atomic, transactional updates
Declarative queries
Guaranteed consistency
Easy to reason about
Long track record of success
…so use them!
But…
Those advantages have a cost
Global, atomic, consistent state means global coordination
Coordination does not scale linearly
The costs of coordination
Remember the network effect?
2 nodes = 1 channel
5 nodes = 10 channels
12 nodes = 66 channels
25 nodes = 300 channels
The result is we don’t scale linearly as we add nodes
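The channel counts on this slide match the number of distinct pairs among n nodes, n(n-1)/2, assuming every node may need to coordinate with every other node. A minimal sketch of that arithmetic (the function name `channels` is illustrative, not from the talk):

```python
def channels(nodes: int) -> int:
    """Pairwise channels among `nodes` fully connected nodes: n(n-1)/2."""
    return nodes * (nodes - 1) // 2


# Reproduce the node counts shown on the slide.
for n in (2, 5, 12, 25):
    print(f"{n} nodes = {channels(n)} channels")
```

Under this assumption the number of channels grows roughly with the square of the node count, which is why coordination cost outpaces the capacity added by each new node.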
Independence → Parallelizable
Parallelizable → Scalable
“Shared Nothing” architectures are the most scalable…
…but most real-world problems require us to share something…
…so our designs usually have a parallel part and a serial part
The key is to make sure the vast majority of our work in the cloud is independent and parallelizable.
Amdahl’s Law
S: speed improvement
P: ratio of the problem that can be parallelized
N: number of processors
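The formula itself did not survive in this transcript. In the slide's symbols, the standard statement of Amdahl's Law is S = 1 / ((1 - P) + P / N). The sketch below evaluates it for a few values; the function name and the sample values of P and N are illustrative choices, not figures from the talk.

```python
def amdahl_speedup(p: float, n: int) -> float:
    """Amdahl's Law: S = 1 / ((1 - P) + P / N).

    p: ratio of the problem that can be parallelized (0.0 to 1.0)
    n: number of processors (at least 1)
    """
    if not 0.0 <= p <= 1.0:
        raise ValueError("p must be between 0 and 1")
    if n < 1:
        raise ValueError("n must be at least 1")
    return 1.0 / ((1.0 - p) + p / n)


# Illustrative values only: how speedup changes as processors are added.
for p in (0.50, 0.95, 0.99):
    speedups = [round(amdahl_speedup(p, n), 1) for n in (10, 100, 1000)]
    print(f"P={p}: {speedups}")
```

As N grows, the speedup approaches 1 / (1 - P), so with P = 0.95 no number of processors can deliver more than a 20x improvement. The serial part sets the ceiling.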
MapReduce Primer
[Diagram: Input Data is divided into Split 1 … Split N; Mapper 1 … Mapper N each process one split (Map Phase); a Shuffle routes mapper output to Reducer 1 … Reducer N (Reduce Phase)]
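The three stages in the diagram can be sketched as a tiny single-process simulation. This is not Hadoop's API: `run_mapreduce`, its parameters, and the even/odd example are all hypothetical names chosen for illustration, and the hash-based routing is just one simple way to assign keys to reducers.

```python
from collections import defaultdict


def run_mapreduce(splits, mapper, reducer, num_reducers=2):
    """Simulate map, shuffle, and reduce in one process."""
    # Map phase: every split is processed independently of the others.
    map_outputs = [list(mapper(split)) for split in splits]

    # Shuffle: route each (key, value) pair to one reducer partition,
    # grouping all values that share a key.
    partitions = [defaultdict(list) for _ in range(num_reducers)]
    for pairs in map_outputs:
        for key, value in pairs:
            partitions[hash(key) % num_reducers][key].append(value)

    # Reduce phase: each reducer sees every value for the keys it owns.
    result = {}
    for partition in partitions:
        for key, values in partition.items():
            result[key] = reducer(key, values)
    return result


def parity_mapper(split):
    """Emit (key, value) pairs: the number keyed by its parity."""
    for number in split:
        yield ("even" if number % 2 == 0 else "odd"), number


def sum_reducer(key, values):
    """Combine all values for one key."""
    return sum(values)


result = run_mapreduce([[1, 2, 3], [4, 5], [6]], parity_mapper, sum_reducer)
print(result)
```

The mappers never communicate with each other, and neither do the reducers. The shuffle is the only step where data crosses between them.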
MapReduce Example: Word Count
[Diagram: Books feed several “Count words per book” mappers (Map Phase); a Shuffle routes their output to the reducers “Sum words A-C”, “Sum words D-E”, …, “Sum words W-Z” (Reduce Phase)]
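A minimal single-process sketch of the word count in the diagram: each mapper counts words within one book, the shuffle sends each word to a reducer by its first letter, and each reducer sums the per-book counts. The `RANGES` table is an assumption. Only A-C, D-E, and W-Z appear on the slide, so the single middle range stands in for the reducers the diagram elides. The sample "books" are made up.

```python
import re
from collections import Counter, defaultdict

# Hypothetical letter ranges, one per reducer. The middle entry is a
# placeholder for the ranges the slide leaves out.
RANGES = ["abc", "de", "fghijklmnopqrstuv", "wxyz"]


def count_words_per_book(text):
    """Map: count words within a single book, emitting (word, count) pairs."""
    words = re.findall(r"[a-z]+", text.lower())
    return Counter(words).items()


def partition(word):
    """Shuffle: pick the reducer whose letter range covers the first letter."""
    for index, letters in enumerate(RANGES):
        if word[0] in letters:
            return index
    raise ValueError(f"no reducer for word {word!r}")


def word_count(books):
    # One input bucket per reducer, grouping counts by word.
    reducer_inputs = [defaultdict(list) for _ in RANGES]
    for text in books:
        for word, count in count_words_per_book(text):
            reducer_inputs[partition(word)][word].append(count)

    # Reduce: each reducer sums the per-book counts for its words.
    totals = {}
    for grouped in reducer_inputs:
        for word, counts in grouped.items():
            totals[word] = sum(counts)
    return totals


books = ["the cat and the dog", "a dog and a whale"]
totals = word_count(books)
print(totals)
```

Counting within each book before the shuffle means a mapper sends one pair per distinct word rather than one pair per occurrence, which reduces the data moved between phases.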
The network is a shared resource
Too much data to move to computation
So move computation to data
MapReduce Data Locality
[Diagram: Input Data is divided into Split 1 … Split N, processed by Mapper 1 … Mapper N (Map Phase), then a Shuffle feeds Reducer 1 … Reducer N (Reduce Phase); a legend marks the symbol that stands for a physical machine]
Data locality only guaranteed in the Map phase
So do as much work as possible there
Some jobs have no reducer at all!
MapReduce is a building block
So let’s build higher-level functions
Grouping and Aggregating
[Diagram: Books feed several “Count words per book” mappers (Map Phase); a Shuffle routes their output to the reducers “Sum words A-C”, “Sum words D-E”, …, “Sum words W-Z” (Reduce Phase)]
Joins
[Diagram: Data Set 1 and Data Set 2 are each divided into Split 1, Split 2, Split 3; every split passes through a “Group by key” mapper (Map Phase); a Shuffle sends the grouped records to Reducer 1, Reducer 2, …, Reducer N (Reduce Phase)]
Joins
[Diagram: Persons and Visits are each divided into Split 1, Split 2, Split 3; every split passes through a “Group by person id” mapper (Map Phase); a Shuffle sends the grouped records to Reducer 1, Reducer 2, …, Reducer N (Reduce Phase)]
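The Persons and Visits diagram describes what is usually called a reduce-side join: both data sets are keyed by person id in the map phase, the shuffle brings matching keys together, and the reducer pairs them up. A minimal single-process sketch follows. The record layout, the sample records, the tag names, and the choice of an inner join are all assumptions made for illustration.

```python
from collections import defaultdict
from itertools import chain

# Made-up sample records.
persons = [
    {"person_id": 1, "name": "Ann"},
    {"person_id": 2, "name": "Bob"},
]
visits = [
    {"person_id": 1, "reason": "checkup"},
    {"person_id": 1, "reason": "flu"},
    {"person_id": 3, "reason": "x-ray"},
]


def map_tagged(records, tag):
    """Map: key every record by person id and tag it with its source."""
    for record in records:
        yield record["person_id"], (tag, record)


def shuffle(pairs):
    """Shuffle: group all tagged records that share a person id."""
    grouped = defaultdict(list)
    for key, value in pairs:
        grouped[key].append(value)
    return grouped


def reduce_join(person_id, tagged):
    """Reduce: pair each person with each of that person's visits."""
    people = [record for tag, record in tagged if tag == "person"]
    person_visits = [record for tag, record in tagged if tag == "visit"]
    # Inner join: ids present on only one side produce no output.
    return [(p["name"], v["reason"]) for p in people for v in person_visits]


pairs = chain(map_tagged(persons, "person"), map_tagged(visits, "visit"))
joined = []
for person_id, tagged in shuffle(pairs).items():
    joined.extend(reduce_join(person_id, tagged))
print(joined)
```

Bob has no visits and person 3 has no person record, so neither appears in the output. Changing `reduce_join` to emit unmatched records would turn this into an outer join.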
Map-Side Joins
[Diagram: Split 1, Split 2, and Split 3 of Data Set 1 go to Mapper 1, Mapper 2, and Mapper 3, and each mapper also reads Data Set 2 (Map Phase); a Shuffle then feeds Reducer 1, Reducer 2, … (Reduce Phase)]
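One reading of this diagram is that every mapper receives a full copy of the second data set, so the join can happen inside the map phase with no shuffle needed for matching. The sketch below assumes the second data set is small enough to hold in memory as a lookup table. The function name, record layout, and sample records are hypothetical.

```python
def map_side_join(visit_split, persons_by_id):
    """Map: join each visit against an in-memory copy of the persons table."""
    for visit in visit_split:
        person = persons_by_id.get(visit["person_id"])
        if person is not None:  # inner join: skip visits with no person
            yield person["name"], visit["reason"]


# Made-up sample data. The lookup table plays the role of Data Set 2.
persons_by_id = {1: {"name": "Ann"}, 2: {"name": "Bob"}}
visit_splits = [
    [{"person_id": 1, "reason": "checkup"}],
    [{"person_id": 2, "reason": "flu"}, {"person_id": 3, "reason": "x-ray"}],
]

# Each split is joined independently; no data moves between mappers.
joined = [
    row
    for split in visit_splits
    for row in map_side_join(split, persons_by_id)
]
print(joined)
```

The trade-off is memory for network: the small table is copied to every mapper so that the large data set never has to be shuffled for the join.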
Filtering
Map or reduce functions can simply discard data we’re not interested in
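Filtering in the map phase can be sketched as a map-only job, with no shuffle and no reducer. The record layout, the `keep` predicate, and the sample records are made up for illustration.

```python
def filter_mapper(split, keep):
    """Map: emit only the records the predicate accepts; drop the rest."""
    for record in split:
        if keep(record):
            yield record


# Made-up sample records spread over two splits.
splits = [
    [{"type": "lab", "value": 7.1}, {"type": "note", "value": None}],
    [{"type": "lab", "value": 5.4}],
]


def is_lab(record):
    return record["type"] == "lab"


# Map-only job: each mapper's output is kept as-is, one list per split.
outputs = [list(filter_mapper(split, is_lab)) for split in splits]
print(outputs)
```

Discarding records this early keeps them out of any later shuffle.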
And Others
More sophisticated, composable patterns:
Distinct
Sort
Binning
Top N
...
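As one example from this list, Top N can be sketched in a common way: each mapper keeps only its local top N, and a single reducer merges those short candidate lists. This is an illustration under that assumption, not necessarily how the talk or any particular library implements it. The function names and sample numbers are made up.

```python
import heapq


def top_n_mapper(split, n):
    """Map: keep only the n largest values seen in this split."""
    return heapq.nlargest(n, split)


def top_n_reducer(candidates_per_mapper, n):
    """Reduce: merge every mapper's candidates and keep the overall top n."""
    merged = [value for candidates in candidates_per_mapper for value in candidates]
    return heapq.nlargest(n, merged)


splits = [[5, 1, 9], [7, 3], [8, 2, 6]]
n = 2

candidates = [top_n_mapper(split, n) for split in splits]
top = top_n_reducer(candidates, n)
print(top)
```

Any value in the global top N must also be in the top N of its own split, so the result is exact, and each mapper sends at most N values across the network.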
  
Chain Jobs Together
Large-scale joins must have a reduce phase
Multiple joins or group-by operations mean multiple jobs
[Diagram: Normalize Data → Join Related Items → Compute Summary → Output]
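The pipeline in the diagram can be sketched as a chain of functions, each standing in for one job whose output becomes the next job's input. In a real cluster each stage would be a separate job, typically writing its results to storage. Here each stage is a plain Python function, and all names and sample records are hypothetical.

```python
from collections import Counter
from functools import reduce


def normalize(visits):
    """Job 1: clean up person ids so later jobs can match on them."""
    return [
        {"person_id": v["person_id"].strip().upper(), "reason": v["reason"]}
        for v in visits
    ]


def make_join(persons_by_id):
    """Job 2 factory: attach a person's name to each of their visits."""
    def join(visits):
        return [
            {"name": persons_by_id[v["person_id"]], "reason": v["reason"]}
            for v in visits
            if v["person_id"] in persons_by_id
        ]
    return join


def summarize(rows):
    """Job 3: count visits per person."""
    return dict(Counter(row["name"] for row in rows))


def run_chain(jobs, data):
    """Feed each job's output into the next job."""
    return reduce(lambda dataset, job: job(dataset), jobs, data)


# Made-up sample data with inconsistent ids to normalize.
visits = [
    {"person_id": " p1 ", "reason": "checkup"},
    {"person_id": "P1", "reason": "flu"},
    {"person_id": "p2", "reason": "x-ray"},
]
persons = {"P1": "Ann", "P2": "Bob"}

summary = run_chain([normalize, make_join(persons), summarize], visits)
print(summary)
```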
Codified in High-Level Libraries
Hive, Pig, Cascading, and Crunch provide simple means to use these patterns
Apache Crunch
The era of writing MapReduce by hand is over
How do we use these tools?
Start with the question you want to ask, then transform the data to answer it.
output = transform(input)
Functional over Place-Oriented Programming
Work with data holistically
Re-running functions simpler to reason about than updating state
Hadoop makes this possible at scale
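One way to see why re-running a function is easier to reason about than updating state: a pure transform gives the same answer every time it is run over the same input, while an in-place update gives a wrong answer if the same events are accidentally applied twice. The sketch below is an illustration of that contrast, with hypothetical function names and made-up events.

```python
from collections import Counter


def visits_per_person(events):
    """Pure transform: the output depends only on the input events."""
    return dict(Counter(event["person_id"] for event in events))


def apply_update(state, events):
    """Place-oriented alternative: mutate running totals in place."""
    for event in events:
        state[event["person_id"]] = state.get(event["person_id"], 0) + 1


events = [{"person_id": 1}, {"person_id": 1}, {"person_id": 2}]

# Re-running the transform over the same input is always safe.
first = visits_per_person(events)
second = visits_per_person(events)

# Replaying the same events against mutable state double-counts them.
state = {}
apply_update(state, events)
apply_update(state, events)

print(first, second, state)
```

With the functional version, recovering from a bug or a bad batch means fixing the function or the input and running it again, with no need to work out what the stored state should have been.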
Don’t be afraid to re-process the world
Something’s wrong, we’re above 95% usage!
- Traditional System Administrator
Something’s wrong, we’re below 95% usage!
- Hadoop System Administrator
Maximize Resource Usage
From Databases to Dataspaces
(Also referred to as Data Lakes)
Franklin, Halevy, Maier, http://homes.cs.washington.edu/~alon/files/dataspacesDec05.pdf
Bring all of your data together...
...structured or unstructured...
...transform it with unlimited computation...
...at any time for any new need.
Franklin, Halevy, Maier, http://homes.cs.washington.edu/~alon/files/dataspacesDec05.pdf
And offer a variety of interactive access patterns.
SQL, Search, Domain-Specific Apps
Hadoop is becoming an adaptive, multi-purpose platform.
The gap between asking novel questions and our ability to answer them is closing.
Questions?
@ryanbrush
https://engineering.cerner.com
We’re hiring!

Thinking in MapReduce - StampedeCon 2013

How to Feed a Data Hungry Organization – by Traveloka Data Team
How to Feed a Data Hungry Organization – by Traveloka Data TeamHow to Feed a Data Hungry Organization – by Traveloka Data Team
How to Feed a Data Hungry Organization – by Traveloka Data TeamTraveloka
 
Exact Inference in Bayesian Networks using MapReduce__HadoopSummit2010
Exact Inference in Bayesian Networks using MapReduce__HadoopSummit2010Exact Inference in Bayesian Networks using MapReduce__HadoopSummit2010
Exact Inference in Bayesian Networks using MapReduce__HadoopSummit2010Yahoo Developer Network
 
WorDS of Data Science in the Presence of Heterogenous Computing Architectures
WorDS of Data Science in the Presence of Heterogenous Computing ArchitecturesWorDS of Data Science in the Presence of Heterogenous Computing Architectures
WorDS of Data Science in the Presence of Heterogenous Computing ArchitecturesIlkay Altintas, Ph.D.
 
Introduction to Big Data
Introduction to Big DataIntroduction to Big Data
Introduction to Big DataAlbert Bifet
 
Using Topological Data Analysis on your BigData
Using Topological Data Analysis on your BigDataUsing Topological Data Analysis on your BigData
Using Topological Data Analysis on your BigDataAnalyticsWeek
 
Computing Just What You Need: Online Data Analysis and Reduction at Extreme ...
Computing Just What You Need: Online Data Analysis and Reduction  at Extreme ...Computing Just What You Need: Online Data Analysis and Reduction  at Extreme ...
Computing Just What You Need: Online Data Analysis and Reduction at Extreme ...Ian Foster
 
Computing Just What You Need: Online Data Analysis and Reduction at Extreme ...
Computing Just What You Need: Online Data Analysis and Reduction  at Extreme ...Computing Just What You Need: Online Data Analysis and Reduction  at Extreme ...
Computing Just What You Need: Online Data Analysis and Reduction at Extreme ...Ian Foster
 
Whitepaper: Mining the AWR repository for Capacity Planning and Visualization
Whitepaper: Mining the AWR repository for Capacity Planning and VisualizationWhitepaper: Mining the AWR repository for Capacity Planning and Visualization
Whitepaper: Mining the AWR repository for Capacity Planning and VisualizationKristofferson A
 
2951085 dzone-2016guidetobigdata
2951085 dzone-2016guidetobigdata2951085 dzone-2016guidetobigdata
2951085 dzone-2016guidetobigdatabalu kvm
 
Follow the money with graphs
Follow the money with graphsFollow the money with graphs
Follow the money with graphsStanka Dalekova
 
Goto amsterdam-2013-skinned
Goto amsterdam-2013-skinnedGoto amsterdam-2013-skinned
Goto amsterdam-2013-skinnedTed Dunning
 
Why Your Database Queries Stink -SeaGl.org November 11th, 2016
Why Your Database Queries Stink -SeaGl.org November 11th, 2016Why Your Database Queries Stink -SeaGl.org November 11th, 2016
Why Your Database Queries Stink -SeaGl.org November 11th, 2016Dave Stokes
 
Reactive programming at scale
Reactive programming at scale Reactive programming at scale
Reactive programming at scale John McClean
 
An introduction to R is a document useful
An introduction to R is a document usefulAn introduction to R is a document useful
An introduction to R is a document usefulssuser3c3f88
 

Similar to Thinking in MapReduce - StampedeCon 2013 (20)

How to Feed a Data Hungry Organization – by Traveloka Data Team
How to Feed a Data Hungry Organization – by Traveloka Data TeamHow to Feed a Data Hungry Organization – by Traveloka Data Team
How to Feed a Data Hungry Organization – by Traveloka Data Team
 
Exact Inference in Bayesian Networks using MapReduce__HadoopSummit2010
Exact Inference in Bayesian Networks using MapReduce__HadoopSummit2010Exact Inference in Bayesian Networks using MapReduce__HadoopSummit2010
Exact Inference in Bayesian Networks using MapReduce__HadoopSummit2010
 
WorDS of Data Science in the Presence of Heterogenous Computing Architectures
WorDS of Data Science in the Presence of Heterogenous Computing ArchitecturesWorDS of Data Science in the Presence of Heterogenous Computing Architectures
WorDS of Data Science in the Presence of Heterogenous Computing Architectures
 
Introduction to Big Data
Introduction to Big DataIntroduction to Big Data
Introduction to Big Data
 
Using Topological Data Analysis on your BigData
Using Topological Data Analysis on your BigDataUsing Topological Data Analysis on your BigData
Using Topological Data Analysis on your BigData
 
DIET_BLAST
DIET_BLASTDIET_BLAST
DIET_BLAST
 
Computing Just What You Need: Online Data Analysis and Reduction at Extreme ...
Computing Just What You Need: Online Data Analysis and Reduction  at Extreme ...Computing Just What You Need: Online Data Analysis and Reduction  at Extreme ...
Computing Just What You Need: Online Data Analysis and Reduction at Extreme ...
 
Big Data
Big DataBig Data
Big Data
 
Using MapReduce for Large–scale Medical Image Analysis
Using MapReduce for Large–scale Medical Image AnalysisUsing MapReduce for Large–scale Medical Image Analysis
Using MapReduce for Large–scale Medical Image Analysis
 
Computing Just What You Need: Online Data Analysis and Reduction at Extreme ...
Computing Just What You Need: Online Data Analysis and Reduction  at Extreme ...Computing Just What You Need: Online Data Analysis and Reduction  at Extreme ...
Computing Just What You Need: Online Data Analysis and Reduction at Extreme ...
 
Whitepaper: Mining the AWR repository for Capacity Planning and Visualization
Whitepaper: Mining the AWR repository for Capacity Planning and VisualizationWhitepaper: Mining the AWR repository for Capacity Planning and Visualization
Whitepaper: Mining the AWR repository for Capacity Planning and Visualization
 
2951085 dzone-2016guidetobigdata
2951085 dzone-2016guidetobigdata2951085 dzone-2016guidetobigdata
2951085 dzone-2016guidetobigdata
 
Follow the money with graphs
Follow the money with graphsFollow the money with graphs
Follow the money with graphs
 
Goto amsterdam-2013-skinned
Goto amsterdam-2013-skinnedGoto amsterdam-2013-skinned
Goto amsterdam-2013-skinned
 
Why Your Database Queries Stink -SeaGl.org November 11th, 2016
Why Your Database Queries Stink -SeaGl.org November 11th, 2016Why Your Database Queries Stink -SeaGl.org November 11th, 2016
Why Your Database Queries Stink -SeaGl.org November 11th, 2016
 
Distributed deep learning_over_spark_20_nov_2014_ver_2.8
Distributed deep learning_over_spark_20_nov_2014_ver_2.8Distributed deep learning_over_spark_20_nov_2014_ver_2.8
Distributed deep learning_over_spark_20_nov_2014_ver_2.8
 
Reactive programming at scale
Reactive programming at scale Reactive programming at scale
Reactive programming at scale
 
GoTo Amsterdam 2013 Skinned
GoTo Amsterdam 2013 SkinnedGoTo Amsterdam 2013 Skinned
GoTo Amsterdam 2013 Skinned
 
VOLT - ESWC 2016
VOLT - ESWC 2016VOLT - ESWC 2016
VOLT - ESWC 2016
 
An introduction to R is a document useful
An introduction to R is a document usefulAn introduction to R is a document useful
An introduction to R is a document useful
 

More from StampedeCon

Why Should We Trust You-Interpretability of Deep Neural Networks - StampedeCo...
Why Should We Trust You-Interpretability of Deep Neural Networks - StampedeCo...Why Should We Trust You-Interpretability of Deep Neural Networks - StampedeCo...
Why Should We Trust You-Interpretability of Deep Neural Networks - StampedeCo...StampedeCon
 
The Search for a New Visual Search Beyond Language - StampedeCon AI Summit 2017
The Search for a New Visual Search Beyond Language - StampedeCon AI Summit 2017The Search for a New Visual Search Beyond Language - StampedeCon AI Summit 2017
The Search for a New Visual Search Beyond Language - StampedeCon AI Summit 2017StampedeCon
 
Predicting Outcomes When Your Outcomes are Graphs - StampedeCon AI Summit 2017
Predicting Outcomes When Your Outcomes are Graphs - StampedeCon AI Summit 2017Predicting Outcomes When Your Outcomes are Graphs - StampedeCon AI Summit 2017
Predicting Outcomes When Your Outcomes are Graphs - StampedeCon AI Summit 2017StampedeCon
 
Novel Semi-supervised Probabilistic ML Approach to SNP Variant Calling - Stam...
Novel Semi-supervised Probabilistic ML Approach to SNP Variant Calling - Stam...Novel Semi-supervised Probabilistic ML Approach to SNP Variant Calling - Stam...
Novel Semi-supervised Probabilistic ML Approach to SNP Variant Calling - Stam...StampedeCon
 
How to Talk about AI to Non-analaysts - Stampedecon AI Summit 2017
How to Talk about AI to Non-analaysts - Stampedecon AI Summit 2017How to Talk about AI to Non-analaysts - Stampedecon AI Summit 2017
How to Talk about AI to Non-analaysts - Stampedecon AI Summit 2017StampedeCon
 
Getting Started with Keras and TensorFlow - StampedeCon AI Summit 2017
Getting Started with Keras and TensorFlow - StampedeCon AI Summit 2017Getting Started with Keras and TensorFlow - StampedeCon AI Summit 2017
Getting Started with Keras and TensorFlow - StampedeCon AI Summit 2017StampedeCon
 
Foundations of Machine Learning - StampedeCon AI Summit 2017
Foundations of Machine Learning - StampedeCon AI Summit 2017Foundations of Machine Learning - StampedeCon AI Summit 2017
Foundations of Machine Learning - StampedeCon AI Summit 2017StampedeCon
 
Don't Start from Scratch: Transfer Learning for Novel Computer Vision Problem...
Don't Start from Scratch: Transfer Learning for Novel Computer Vision Problem...Don't Start from Scratch: Transfer Learning for Novel Computer Vision Problem...
Don't Start from Scratch: Transfer Learning for Novel Computer Vision Problem...StampedeCon
 
Bringing the Whole Elephant Into View Can Cognitive Systems Bring Real Soluti...
Bringing the Whole Elephant Into View Can Cognitive Systems Bring Real Soluti...Bringing the Whole Elephant Into View Can Cognitive Systems Bring Real Soluti...
Bringing the Whole Elephant Into View Can Cognitive Systems Bring Real Soluti...StampedeCon
 
Automated AI The Next Frontier in Analytics - StampedeCon AI Summit 2017
Automated AI The Next Frontier in Analytics - StampedeCon AI Summit 2017Automated AI The Next Frontier in Analytics - StampedeCon AI Summit 2017
Automated AI The Next Frontier in Analytics - StampedeCon AI Summit 2017StampedeCon
 
AI in the Enterprise: Past, Present & Future - StampedeCon AI Summit 2017
AI in the Enterprise: Past,  Present &  Future - StampedeCon AI Summit 2017AI in the Enterprise: Past,  Present &  Future - StampedeCon AI Summit 2017
AI in the Enterprise: Past, Present & Future - StampedeCon AI Summit 2017StampedeCon
 
A Different Data Science Approach - StampedeCon AI Summit 2017
A Different Data Science Approach - StampedeCon AI Summit 2017A Different Data Science Approach - StampedeCon AI Summit 2017
A Different Data Science Approach - StampedeCon AI Summit 2017StampedeCon
 
Graph in Customer 360 - StampedeCon Big Data Conference 2017
Graph in Customer 360 - StampedeCon Big Data Conference 2017Graph in Customer 360 - StampedeCon Big Data Conference 2017
Graph in Customer 360 - StampedeCon Big Data Conference 2017StampedeCon
 
End-to-end Big Data Projects with Python - StampedeCon Big Data Conference 2017
End-to-end Big Data Projects with Python - StampedeCon Big Data Conference 2017End-to-end Big Data Projects with Python - StampedeCon Big Data Conference 2017
End-to-end Big Data Projects with Python - StampedeCon Big Data Conference 2017StampedeCon
 
Doing Big Data Using Amazon's Analogs - StampedeCon Big Data Conference 2017
Doing Big Data Using Amazon's Analogs - StampedeCon Big Data Conference 2017Doing Big Data Using Amazon's Analogs - StampedeCon Big Data Conference 2017
Doing Big Data Using Amazon's Analogs - StampedeCon Big Data Conference 2017StampedeCon
 
Enabling New Business Capabilities with Cloud-based Streaming Data Architectu...
Enabling New Business Capabilities with Cloud-based Streaming Data Architectu...Enabling New Business Capabilities with Cloud-based Streaming Data Architectu...
Enabling New Business Capabilities with Cloud-based Streaming Data Architectu...StampedeCon
 
Big Data Meets IoT: Lessons From the Cloud on Polling, Collecting, and Analyz...
Big Data Meets IoT: Lessons From the Cloud on Polling, Collecting, and Analyz...Big Data Meets IoT: Lessons From the Cloud on Polling, Collecting, and Analyz...
Big Data Meets IoT: Lessons From the Cloud on Polling, Collecting, and Analyz...StampedeCon
 
Innovation in the Data Warehouse - StampedeCon 2016
Innovation in the Data Warehouse - StampedeCon 2016Innovation in the Data Warehouse - StampedeCon 2016
Innovation in the Data Warehouse - StampedeCon 2016StampedeCon
 
Creating a Data Driven Organization - StampedeCon 2016
Creating a Data Driven Organization - StampedeCon 2016Creating a Data Driven Organization - StampedeCon 2016
Creating a Data Driven Organization - StampedeCon 2016StampedeCon
 
Using The Internet of Things for Population Health Management - StampedeCon 2016
Using The Internet of Things for Population Health Management - StampedeCon 2016Using The Internet of Things for Population Health Management - StampedeCon 2016
Using The Internet of Things for Population Health Management - StampedeCon 2016StampedeCon
 

More from StampedeCon (20)

Why Should We Trust You-Interpretability of Deep Neural Networks - StampedeCo...
Why Should We Trust You-Interpretability of Deep Neural Networks - StampedeCo...Why Should We Trust You-Interpretability of Deep Neural Networks - StampedeCo...
Why Should We Trust You-Interpretability of Deep Neural Networks - StampedeCo...
 
The Search for a New Visual Search Beyond Language - StampedeCon AI Summit 2017
The Search for a New Visual Search Beyond Language - StampedeCon AI Summit 2017The Search for a New Visual Search Beyond Language - StampedeCon AI Summit 2017
The Search for a New Visual Search Beyond Language - StampedeCon AI Summit 2017
 
Predicting Outcomes When Your Outcomes are Graphs - StampedeCon AI Summit 2017
Predicting Outcomes When Your Outcomes are Graphs - StampedeCon AI Summit 2017Predicting Outcomes When Your Outcomes are Graphs - StampedeCon AI Summit 2017
Predicting Outcomes When Your Outcomes are Graphs - StampedeCon AI Summit 2017
 
Novel Semi-supervised Probabilistic ML Approach to SNP Variant Calling - Stam...
Novel Semi-supervised Probabilistic ML Approach to SNP Variant Calling - Stam...Novel Semi-supervised Probabilistic ML Approach to SNP Variant Calling - Stam...
Novel Semi-supervised Probabilistic ML Approach to SNP Variant Calling - Stam...
 
How to Talk about AI to Non-analaysts - Stampedecon AI Summit 2017
How to Talk about AI to Non-analaysts - Stampedecon AI Summit 2017How to Talk about AI to Non-analaysts - Stampedecon AI Summit 2017
How to Talk about AI to Non-analaysts - Stampedecon AI Summit 2017
 
Getting Started with Keras and TensorFlow - StampedeCon AI Summit 2017
Getting Started with Keras and TensorFlow - StampedeCon AI Summit 2017Getting Started with Keras and TensorFlow - StampedeCon AI Summit 2017
Getting Started with Keras and TensorFlow - StampedeCon AI Summit 2017
 
Foundations of Machine Learning - StampedeCon AI Summit 2017
Foundations of Machine Learning - StampedeCon AI Summit 2017Foundations of Machine Learning - StampedeCon AI Summit 2017
Foundations of Machine Learning - StampedeCon AI Summit 2017
 
Don't Start from Scratch: Transfer Learning for Novel Computer Vision Problem...
Don't Start from Scratch: Transfer Learning for Novel Computer Vision Problem...Don't Start from Scratch: Transfer Learning for Novel Computer Vision Problem...
Don't Start from Scratch: Transfer Learning for Novel Computer Vision Problem...
 
Bringing the Whole Elephant Into View Can Cognitive Systems Bring Real Soluti...
Bringing the Whole Elephant Into View Can Cognitive Systems Bring Real Soluti...Bringing the Whole Elephant Into View Can Cognitive Systems Bring Real Soluti...
Bringing the Whole Elephant Into View Can Cognitive Systems Bring Real Soluti...
 
Automated AI The Next Frontier in Analytics - StampedeCon AI Summit 2017
Automated AI The Next Frontier in Analytics - StampedeCon AI Summit 2017Automated AI The Next Frontier in Analytics - StampedeCon AI Summit 2017
Automated AI The Next Frontier in Analytics - StampedeCon AI Summit 2017
 
AI in the Enterprise: Past, Present & Future - StampedeCon AI Summit 2017
AI in the Enterprise: Past,  Present &  Future - StampedeCon AI Summit 2017AI in the Enterprise: Past,  Present &  Future - StampedeCon AI Summit 2017
AI in the Enterprise: Past, Present & Future - StampedeCon AI Summit 2017
 
A Different Data Science Approach - StampedeCon AI Summit 2017
A Different Data Science Approach - StampedeCon AI Summit 2017A Different Data Science Approach - StampedeCon AI Summit 2017
A Different Data Science Approach - StampedeCon AI Summit 2017
 
Graph in Customer 360 - StampedeCon Big Data Conference 2017
Graph in Customer 360 - StampedeCon Big Data Conference 2017Graph in Customer 360 - StampedeCon Big Data Conference 2017
Graph in Customer 360 - StampedeCon Big Data Conference 2017
 
End-to-end Big Data Projects with Python - StampedeCon Big Data Conference 2017
End-to-end Big Data Projects with Python - StampedeCon Big Data Conference 2017End-to-end Big Data Projects with Python - StampedeCon Big Data Conference 2017
End-to-end Big Data Projects with Python - StampedeCon Big Data Conference 2017
 
Doing Big Data Using Amazon's Analogs - StampedeCon Big Data Conference 2017
Doing Big Data Using Amazon's Analogs - StampedeCon Big Data Conference 2017Doing Big Data Using Amazon's Analogs - StampedeCon Big Data Conference 2017
Doing Big Data Using Amazon's Analogs - StampedeCon Big Data Conference 2017
 
Enabling New Business Capabilities with Cloud-based Streaming Data Architectu...
Enabling New Business Capabilities with Cloud-based Streaming Data Architectu...Enabling New Business Capabilities with Cloud-based Streaming Data Architectu...
Enabling New Business Capabilities with Cloud-based Streaming Data Architectu...
 
Big Data Meets IoT: Lessons From the Cloud on Polling, Collecting, and Analyz...
Big Data Meets IoT: Lessons From the Cloud on Polling, Collecting, and Analyz...Big Data Meets IoT: Lessons From the Cloud on Polling, Collecting, and Analyz...
Big Data Meets IoT: Lessons From the Cloud on Polling, Collecting, and Analyz...
 
Innovation in the Data Warehouse - StampedeCon 2016
Innovation in the Data Warehouse - StampedeCon 2016Innovation in the Data Warehouse - StampedeCon 2016
Innovation in the Data Warehouse - StampedeCon 2016
 
Creating a Data Driven Organization - StampedeCon 2016
Creating a Data Driven Organization - StampedeCon 2016Creating a Data Driven Organization - StampedeCon 2016
Creating a Data Driven Organization - StampedeCon 2016
 
Using The Internet of Things for Population Health Management - StampedeCon 2016
Using The Internet of Things for Population Health Management - StampedeCon 2016Using The Internet of Things for Population Health Management - StampedeCon 2016
Using The Internet of Things for Population Health Management - StampedeCon 2016
 

Thinking in MapReduce - StampedeCon 2013