Things we will cover
26-Jul-16, presented by Ryan Kirk at StampedeCon 2016
GOAL
Explain Cloud IoT, its challenges, and a
principled, agile approach to prediction amidst
uncertainty in such a way that people from a
broad audience can (hopefully) relate.
WILL
►  IoT, Cloud landscape, and CTL
►  Prediction Lifecycle
►  Challenges by business domain
►  Data Science Lessons Learned
WILL NOT
►  Big Data
►  Architecture
►  Algorithms
►  Technology
WHO WE ARE
Who I am
I am interested in creating intelligent systems
through incorporating humans and machines in
an active learning loop.
►  Decision Scientist with PhD in HCI from Iowa
State
►  Principal Data Scientist for CenturyLink Cloud
►  Curricular Design, Educational Technology,
Online Advertising, Online Retail, Big Data
UX, Cloud, IoT, Physics
►  Hiking, Data journalism, Stocks, Horse Racing
ryankirk.info
Who we are: CenturyLink Cloud
CLOUD + COLOCATION + NETWORK + MANAGED SERVICES
What is IoT
Human desire to connect ourselves to
each other via technology
►  Modern plumbing…
►  Telegraph → Telephone
►  Telephone → Dial-up
►  Dial-up → HSN
►  HSN → WAN
►  WAN → IoT
Human desire to connect ourselves to
each other via technology to empower
each other
Internet growth > Hardware growth
motherboard.vice.com
newscientist.com
CenturyLink Cloud IoT Advantage
►  37 states
►  550,000 miles of network
►  Innovative Gigabit
fiber network
►  25MM+ consumer
endpoints
►  60+ data centers
PROBLEM STATEMENT
Problem statement:
►  Prevent incidents through early detection
►  Reduce MTTR by facilitating root-cause analytics
►  Facilitate domain experts and harvest their knowledge
GOAL
Build a real-time artificial intelligence
capable of analyzing all incoming
streams of data in order to know
which actions our machines need to
automatically take.
It’s simple, really… build Skynet
PREDICTION LANDSCAPE
Prediction Adoption Model
[Figure: SOPHISTICATION plotted against TIME across four stages: INTRO, GROWTH, MATURITY, DECLINE]
Stage I: INTRODUCTION
1. Design
2. Measure
Stage II: GROWTH
3. Describe
4. Detect
Stage III: MATURITY
5. Predict
6. Act
Stage IV: DECLINE
7. Feedback
8. Obsolescence
Prediction Adoption Model (actual)
[Figure: SOPHISTICATION plotted against TIME, with the four phases labeled CHECK THIS OUT; OH NO, OH NO, OH NO!; HAHA, IT WORKED!; I NEVER SAID IT WOULD…]
Stage I: CHECK THIS OUT
1. It runs
2. Results are promising
Stage II: OH NO, OH NO, OH NO!
3. It works but it’s terrible
4. It will never scale
Stage III: HAHA, IT WORKED!
5. I surprise myself sometimes
6. I found a shortcut to scale it
Stage IV: I NEVER SAID IT WOULD…
7. How do I prove it is still working?
8. There is no way to apply it to this scenario
Stage I: INTRODUCTION
1. Design
►  What should we measure?
►  What are the core business
processes?
►  What is the unit of analysis?
►  What are our research questions/
hypotheses?
2. Measure
►  Do we push or pull?
►  How often should we measure?
►  How long do we need the data?
►  How do we represent the data
schema?
Stage II: GROWTH
3. Describe
►  Which metrics relate to our
outcomes of interest?
►  What is the typical value of each
metric?
►  How do you visualize each
metric?
4. Detect
►  What do we expect to happen?
►  Which values/events are
unexpected?
►  When should we alert?
►  How will we scale our analysis?
Stage III: MATURITY
5. Predict
►  Are there patterns?
►  Are there more complex
relationships?
►  What is going to happen?
►  How do we get training data?
6. Act
►  What actions should we take?
►  How can we incorporate new
outcomes into the current
model?
Stage IV: DECLINE
7. Feedback
►  Is my model primarily basing its
decisions upon its previous
decisions?
►  Can I separate the model from its
parameters?
►  Can I still evaluate accuracy?
8. Obsolescence
►  Are my business scenarios still
grounded?
►  Do my model assumptions still hold?
►  Does it still scale?
►  Is the intervention still needed?
Domain process involvement
BUSINESS
►  Is involved early
in defining
requirements
ENGINEERING
►  Builds MVP
►  Solidifies solution
RESEARCH
►  Builds prototype
and suggests
solution
SOLUTION
Working backwards
ITEM
1 Skynet
2 Action mapping
3 Action landscape
4 Prediction
5 Categorical learning
6 Training Data
7 Feedback loop
8 High SNR
9 Unsupervised learning
10 Anomaly Detection
11 Normalization
12 Retention
13 Sampling
14 Collection
15 Approach
16 Domain model
“In life, unless you’re more gifted than
Einstein, inversion [i.e. working
backwards] will help you solve
problems.”
Charlie Munger
Working backwards (cont.)
ITEM                       STAGE
1   Skynet                 ACT
2   Action mapping         ACT
3   Action landscape       ACT
4   Prediction             PREDICT
5   Categorical learning   PREDICT
6   Training Data          PREDICT
7   Feedback loop          PREDICT
8   High SNR               DETECT
9   Unsupervised learning  DETECT
10  Anomaly Detection      DETECT
11  Normalization          DESCRIBE
12  Retention              DESCRIBE
13  Sampling               MEASURE
14  Collection             MEASURE
15  Approach               DESIGN
16  Domain model           DESIGN
[Figure: SOPHISTICATION plotted against TIME across four stages: INTRO, GROWTH, MATURITY, DECLINE]
Working backwards (cont.)
ITEM                       STAGE     PRIMARY DOMAIN
1   Skynet                 ACT       ENGINEERING
2   Action mapping         ACT       BUSINESS
3   Action landscape       ACT       RESEARCH
4   Prediction             PREDICT   RESEARCH
5   Categorical learning   PREDICT   RESEARCH
6   Training Data          PREDICT   ENGINEERING
7   Feedback loop          PREDICT   BUSINESS
8   High SNR               DETECT    RESEARCH
9   Unsupervised learning  DETECT    RESEARCH
10  Anomaly Detection      DETECT    RESEARCH
11  Normalization          DESCRIBE  RESEARCH
12  Retention              DESCRIBE  ENGINEERING
13  Sampling               MEASURE   RESEARCH
14  Collection             MEASURE   ENGINEERING
15  Approach               DESIGN    RESEARCH
16  Domain model           DESIGN    BUSINESS
This is a WIP
ITEM                       STAGE     PRIMARY DOMAIN
1   Skynet                 ACT       ENGINEERING
2   Action mapping         ACT       BUSINESS
3   Action landscape       ACT       RESEARCH
4   Prediction             PREDICT   RESEARCH
5   Categorical learning   PREDICT   RESEARCH
6   Training Data          PREDICT   ENGINEERING
7   Feedback loop          PREDICT   BUSINESS
8   High SNR               DETECT    RESEARCH
9   Unsupervised learning  DETECT    RESEARCH
10  Anomaly Detection      DETECT    RESEARCH
11  Normalization          DESCRIBE  RESEARCH
12  Sampling               MEASURE   RESEARCH
13  Collection             MEASURE   ENGINEERING
14  Domain model           DESIGN    BUSINESS
QUEUED
(StampedeCon 2017?)
WORKING
PRODUCTION
LESSONS LEARNED
16. DOMAIN MODEL
►  938,076 metrics
►  Verify the unique stream of
data across systems
►  Key-based
DESIGN
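One possible reading of "key-based" is that every metric stream gets a deterministic key built from its identifying fields, so the same stream can be recognized and checked for uniqueness across systems. The sketch below is only an illustration under that assumption; the field names (datacenter, host, metric) and both helper functions are hypothetical and do not come from the talk.

```python
from typing import Iterable


def metric_key(datacenter: str, host: str, metric: str) -> str:
    """Build a deterministic key for one metric stream (hypothetical scheme)."""
    parts = [datacenter, host, metric]
    # Normalize each part so the same stream maps to the same key everywhere.
    return ".".join(p.strip().lower().replace(" ", "_") for p in parts)


def find_duplicate_keys(keys: Iterable[str]) -> set[str]:
    """Return the keys seen more than once, i.e. streams that are not unique."""
    seen: set[str] = set()
    dupes: set[str] = set()
    for key in keys:
        if key in seen:
            dupes.add(key)
        seen.add(key)
    return dupes
```

With close to a million metrics, a check like find_duplicate_keys would flag streams that two systems report under the same identity.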
15. APPROACH
VARIABILITY
►  Changes in observed state
►  Plan for variability
UNCERTAINTY
►  Unobserved state(s)
►  Design for uncertainty
DESIGN (cont.)
14. COLLECTION
►  Agreement of signals
►  Cacophony of
signals
►  How often should we
measure?
►  We have no labeled
training data
►  An approach we
can build upon in the
future
MEASURE
13. SAMPLING
Shannon-Nyquist Paradox
►  The more you measure
something the more it varies
►  Bias related to time and
variability
►  E.g., the temperature yesterday was 68 degrees
MEASURE (cont.)
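The point that measured variability depends on how often you sample can be illustrated with a synthetic signal. This is a minimal sketch, assuming a made-up temperature curve that cycles daily around 68 degrees; the curve, its amplitude, and the function names are illustrative assumptions, not data from the talk.

```python
import math
import statistics


def temperature(hour: float) -> float:
    """Synthetic temperature: a daily cycle of +/- 10 degrees around 68."""
    return 68.0 + 10.0 * math.sin(2 * math.pi * hour / 24.0)


def observed_spread(interval_hours: float, total_hours: int = 72) -> float:
    """Population standard deviation of samples taken every interval_hours."""
    n = int(total_hours / interval_hours)
    samples = [temperature(i * interval_hours) for i in range(n)]
    return statistics.pstdev(samples)


# One reading per day, always at the same hour: the signal looks constant.
daily = observed_spread(24)
# Twenty-four readings per day: the daily cycle becomes visible.
hourly = observed_spread(1)
```

Here the daily readings show essentially no spread while the hourly readings show a standard deviation of about 7 degrees, even though the underlying signal is identical. "The temperature yesterday was 68 degrees" is true of the daily sample and hides everything the hourly sample reveals.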
12. RETENTION
►  Recall that precision relates to
sampling consistency
►  Not all metrics are created
equal
►  Coverage remains
problematic
DESCRIBE
11. NORMALIZATION
Kievit, R.A., Frankenhuis, W.E., et al. (2013). Simpson’s paradox in psychological science: a practical guide. Frontiers in Psychology, 4, 513.
Simpson’s Paradox
►  aggregate trend != sum of
individual trends
►  Applies to all aggregates:
sums, averages, correlations,
etc.
►  What is the unit of analysis?
DESCRIBE (cont.)
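A small worked example can make Simpson's paradox concrete. The numbers below are invented for illustration: within each group the trend is negative, yet pooling the two groups produces a positive trend, which is why the unit of analysis matters.

```python
def slope(xs, ys):
    """Ordinary least-squares slope of ys against xs."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var = sum((x - mean_x) ** 2 for x in xs)
    return cov / var


# Two made-up groups, each with a clearly negative trend.
group_a_x, group_a_y = [1, 2, 3], [5, 4, 3]
group_b_x, group_b_y = [6, 7, 8], [10, 9, 8]

slope_a = slope(group_a_x, group_a_y)
slope_b = slope(group_b_x, group_b_y)
# Pooling the groups hides the group structure and flips the sign.
slope_all = slope(group_a_x + group_b_x, group_a_y + group_b_y)
```

Both per-group slopes are -1, while the pooled slope is positive (about 0.8). Normalizing or aggregating metrics across hosts without respecting the grouping can reverse the conclusion in the same way.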
10. ANOMALY DETECTION
[Figure: time series chart; legend: Predicted, Actual, Boundary]
►  Capture the time series data
for each piece of connected
platform technology
►  Find implicit anomalies within a
time series vector
►  Values that are surprising
►  Highly scalable
DETECT
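The slide does not say which detection method is used. As a stand-in, the sketch below uses a simple and common technique: predict each value as the mean of a trailing window, set the boundary at k standard deviations of that window, and flag values that fall outside it. The window size, the value of k, and the function name are assumptions made for illustration.

```python
import statistics
from collections import deque


def detect_anomalies(values, window=5, k=3.0):
    """Return indices whose value lies outside predicted +/- k * stdev."""
    history = deque(maxlen=window)
    anomalies = []
    for i, actual in enumerate(values):
        if len(history) == window:
            # Prediction: mean of the trailing window.
            predicted = statistics.fmean(history)
            # Boundary: k standard deviations of the same window.
            boundary = k * statistics.pstdev(history)
            if abs(actual - predicted) > boundary:
                anomalies.append(i)
        history.append(actual)
    return anomalies


series = [10, 11, 10, 12, 11, 10, 11, 30, 11, 10]
flagged = detect_anomalies(series)
```

Only the spike to 30 at index 7 is flagged. Because each stream needs only a small fixed-size window of state, this style of detector runs independently per metric, which is one way to keep the approach scalable.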
9. UNSUPERVISED LEARNING
►  Time series data shows the context behind anomalies that co-occur
►  Group anomalous vectors based upon structural properties and co-occurrence
►  Up-level anomalies into higher-order alerts using contextual information
DETECT (cont.)
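Grouping by co-occurrence can be sketched as bucketing anomalies by time and raising a higher-order alert when several distinct metrics are anomalous in the same bucket. This is a minimal illustration, not the talk's implementation: the bucket size, the threshold, the metric names, and the function name are all assumed, and grouping by structural properties is left out.

```python
from collections import defaultdict


def higher_order_alerts(anomalies, bucket_seconds=60, min_metrics=2):
    """Group (timestamp_seconds, metric_key) anomalies that co-occur in time.

    Returns a dict mapping each bucket start time to the sorted metric keys
    that were anomalous in that bucket, keeping only buckets with at least
    min_metrics distinct metrics.
    """
    buckets = defaultdict(set)
    for timestamp, key in anomalies:
        buckets[timestamp // bucket_seconds].add(key)
    return {
        bucket * bucket_seconds: sorted(keys)
        for bucket, keys in sorted(buckets.items())
        if len(keys) >= min_metrics
    }


events = [(5, "db.cpu"), (20, "db.latency"), (50, "web.errors"), (130, "db.cpu")]
alerts = higher_order_alerts(events)
```

The three anomalies in the first minute collapse into one higher-order alert, while the isolated anomaly at 130 seconds raises none.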
8. HIGH SNR
►  We have also built a search engine for time series data that lets us build cool-looking graphs in real time
►  We basically do all of this to power Slack alerts
►  Allows tags to propagate forwards
7. FEEDBACK LOOP
PREDICT
6. TRAINING DATA
►  Evaluate ALL assumptions
in regards to training data
►  Ideally use active learning
approach or risk
becoming tautological
PREDICT (cont.)
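A model trained only on labels derived from its own earlier outputs ends up confirming itself, which is the tautology risk named above. One common active learning strategy, uncertainty sampling, sends the cases the model is least sure about to a human for labeling. The sketch below assumes the model outputs a probability per item; the item ids, the labeling budget, and the function name are hypothetical.

```python
def select_for_labeling(scores, budget=2):
    """Pick the items whose predicted probability is closest to 0.5.

    scores maps an item id to the model's probability (0 to 1) that the
    item is an incident. Items near 0.5 are the most uncertain, so a human
    label for them adds the most new information.
    """
    ranked = sorted(scores, key=lambda item: abs(scores[item] - 0.5))
    return ranked[:budget]


scores = {"a": 0.95, "b": 0.52, "c": 0.10, "d": 0.40}
to_label = select_for_labeling(scores)
```

Items "b" and "d" are selected, since the model is nearly undecided about them, while the confident predictions "a" and "c" are left alone.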
RESULTS
Prediction Results
►  38,392,438 predictions every 24hr.
►  Anomaly rate < 0.01% (0.0001)
~3K anomalies/day
►  Accuracy is ~90%
►  Prediction latency ~3.0 seconds
►  ~30 Higher order alerts/day
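The figures above can be cross-checked with simple arithmetic. The sketch treats "~3K" and "~30" as the round numbers 3,000 and 30, which is an assumption; the derived rates are therefore approximate.

```python
predictions_per_day = 38_392_438
anomalies_per_day = 3_000            # "~3K" taken as a round number
higher_order_alerts_per_day = 30     # "~30" taken as a round number

# Fraction of predictions flagged as anomalous.
anomaly_rate = anomalies_per_day / predictions_per_day
# Average throughput over a 24-hour day (86,400 seconds).
predictions_per_second = predictions_per_day / 86_400
# How many raw anomalies roll up into each higher-order alert, on average.
anomalies_per_alert = anomalies_per_day / higher_order_alerts_per_day
```

Under these assumptions the anomaly rate is roughly 0.008%, consistent with the stated bound of under 0.01%. Throughput averages about 444 predictions per second, and about 100 anomalies roll up into each higher-order alert.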
Want to join me?
Let’s connect:
►  @ryan_kirk
Try CenturyLink Cloud free:
►  ctl.io
We are hiring
►  ctl.io/careers/jobs
Thanks to:
►  StampedeCon2016
►  pixabay.com

Turn Data Into Actionable Insights - StampedeCon 2016
 
The Big Data Journey – How Companies Adopt Hadoop - StampedeCon 2016
The Big Data Journey – How Companies Adopt Hadoop - StampedeCon 2016The Big Data Journey – How Companies Adopt Hadoop - StampedeCon 2016
The Big Data Journey – How Companies Adopt Hadoop - StampedeCon 2016
 
Visualizing Big Data – The Fundamentals
Visualizing Big Data – The FundamentalsVisualizing Big Data – The Fundamentals
Visualizing Big Data – The Fundamentals
 

Recently uploaded

Artificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyArtificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyKhushali Kathiriya
 
MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MIND CTI
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...DianaGray10
 
TEST BANK For Principles of Anatomy and Physiology, 16th Edition by Gerard J....
TEST BANK For Principles of Anatomy and Physiology, 16th Edition by Gerard J....TEST BANK For Principles of Anatomy and Physiology, 16th Edition by Gerard J....
TEST BANK For Principles of Anatomy and Physiology, 16th Edition by Gerard J....rightmanforbloodline
 
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...Jeffrey Haguewood
 
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024Victor Rentea
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native ApplicationsWSO2
 
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodPolkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodJuan lago vázquez
 
Corporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptxCorporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptxRustici Software
 
Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot ModelMcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot ModelDeepika Singh
 
AI in Action: Real World Use Cases by Anitaraj
AI in Action: Real World Use Cases by AnitarajAI in Action: Real World Use Cases by Anitaraj
AI in Action: Real World Use Cases by AnitarajAnitaRaj43
 
Navigating Identity and Access Management in the Modern Enterprise
Navigating Identity and Access Management in the Modern EnterpriseNavigating Identity and Access Management in the Modern Enterprise
Navigating Identity and Access Management in the Modern EnterpriseWSO2
 
Introduction to Multilingual Retrieval Augmented Generation (RAG)
Introduction to Multilingual Retrieval Augmented Generation (RAG)Introduction to Multilingual Retrieval Augmented Generation (RAG)
Introduction to Multilingual Retrieval Augmented Generation (RAG)Zilliz
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businesspanagenda
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAndrey Devyatkin
 
JohnPollard-hybrid-app-RailsConf2024.pptx
JohnPollard-hybrid-app-RailsConf2024.pptxJohnPollard-hybrid-app-RailsConf2024.pptx
JohnPollard-hybrid-app-RailsConf2024.pptxJohnPollard37
 
API Governance and Monetization - The evolution of API governance
API Governance and Monetization -  The evolution of API governanceAPI Governance and Monetization -  The evolution of API governance
API Governance and Monetization - The evolution of API governanceWSO2
 
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ..."I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...Zilliz
 
CNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In PakistanCNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In Pakistandanishmna97
 
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdfRising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdfOrbitshub
 

Recently uploaded (20)

Artificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyArtificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : Uncertainty
 
MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
 
TEST BANK For Principles of Anatomy and Physiology, 16th Edition by Gerard J....
TEST BANK For Principles of Anatomy and Physiology, 16th Edition by Gerard J....TEST BANK For Principles of Anatomy and Physiology, 16th Edition by Gerard J....
TEST BANK For Principles of Anatomy and Physiology, 16th Edition by Gerard J....
 
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
 
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
 
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodPolkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
 
Corporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptxCorporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptx
 
Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot ModelMcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot Model
 
AI in Action: Real World Use Cases by Anitaraj
AI in Action: Real World Use Cases by AnitarajAI in Action: Real World Use Cases by Anitaraj
AI in Action: Real World Use Cases by Anitaraj
 
Navigating Identity and Access Management in the Modern Enterprise
Navigating Identity and Access Management in the Modern EnterpriseNavigating Identity and Access Management in the Modern Enterprise
Navigating Identity and Access Management in the Modern Enterprise
 
Introduction to Multilingual Retrieval Augmented Generation (RAG)
Introduction to Multilingual Retrieval Augmented Generation (RAG)Introduction to Multilingual Retrieval Augmented Generation (RAG)
Introduction to Multilingual Retrieval Augmented Generation (RAG)
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of Terraform
 
JohnPollard-hybrid-app-RailsConf2024.pptx
JohnPollard-hybrid-app-RailsConf2024.pptxJohnPollard-hybrid-app-RailsConf2024.pptx
JohnPollard-hybrid-app-RailsConf2024.pptx
 
API Governance and Monetization - The evolution of API governance
API Governance and Monetization -  The evolution of API governanceAPI Governance and Monetization -  The evolution of API governance
API Governance and Monetization - The evolution of API governance
 
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ..."I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
 
CNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In PakistanCNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In Pakistan
 
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdfRising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
 

Big Data Meets IoT: Lessons From the Cloud on Polling, Collecting, and Analyzing - StampedeCon 2016

  • 1.
  • 2. Things we will cover (26-Jul-16, presented by Ryan Kirk at StampedeCon 2016) GOAL: Explain Cloud IoT, its challenges, and a principled, agile approach to prediction amidst uncertainty in such a way that people from a broad audience can (hopefully) relate. WILL: ► IoT, Cloud landscape, and CTL ► Prediction Lifecycle ► Challenges by business domain ► Data Science Lessons Learned WILL NOT: ► Big Data ► Architecture ► Algorithms ► Technology
  • 3. WHO WE ARE
  • 4. Who I am: I am interested in creating intelligent systems through incorporating humans and machines in an active learning loop. ► Decision Scientist with PhD in HCI from Iowa State ► Principal Data Scientist for CenturyLink Cloud ► Curricular Design, Educational Technology, Online Advertising, Online Retail, Big Data UX, Cloud, IoT, Physics ► Hiking, Data journalism, Stocks, Horse Racing ryankirk.info
  • 5. Who we are: CenturyLink Cloud: CLOUD + COLOCATION + NETWORK + MANAGED SERVICES
  • 6. What is IoT: Human desire to connect ourselves to each other via technology ► Modern plumbing… ► Telegraph → Telephone ► Telephone → Dial-up ► Dial-up → HSN ► HSN → WAN ► WAN → IoT Human desire to connect ourselves to each other via technology to empower each other
  • 7. Internet growth > Hardware growth (motherboard.vice.com, newscientist.com)
  • 8. CenturyLink Cloud IoT Advantage ► 37 states ► 550,000 miles of network ► Innovative Gigabit fiber network ► 25MM+ consumer endpoints ► 60+ DCS
  • 9. PROBLEM STATEMENT
  • 10. Problem statement: ► Prevent incidents through early detection ► Reduce MTTR by facilitating root-cause analytics ► Facilitate domain experts and harvest their knowledge
  • 11. GOAL: Build a real-time artificial intelligence capable of analyzing all incoming streams of data in order to know which actions our machines need to automatically take. It’s simple, really… build Skynet
  • 12. PREDICTION LANDSCAPE
  • 13. Prediction Adoption Model (chart axes: TIME vs. SOPHISTICATION; phases: INTRO, GROWTH, MATURITY, DECLINE) Stage I: INTRODUCTION 1. Design 2. Measure Stage II: GROWTH 3. Describe 4. Detect Stage III: MATURITY 5. Predict 6. Act Stage IV: DECLINE 7. Feedback 8. Obsolescence
  • 14. Prediction Adoption Model (actual) (chart axes: TIME vs. SOPHISTICATION; phases: "CHECK THIS OUT"; "OH NO, OH NO, OH NO!"; "HAHA, IT WORKED!"; "I NEVER SAID IT WOULD…") Stage I: CHECK THIS OUT 1. It runs 2. Results are promising Stage II: OH NO, OH NO, OH NO! 3. It works but it’s terrible 4. It will never scale Stage III: HAHA, IT WORKED! 5. I surprise myself sometimes 6. I found a shortcut to scale it Stage IV: I NEVER SAID IT WOULD… 7. How do I prove it is still working? 8. There is no way to apply it to this scenario
  • 15. Stage I: INTRODUCTION 1. Design ► What should we measure? ► What are the core business processes? ► What is the unit of analysis? ► What are our research questions/hypotheses? 2. Measure ► Do we push or pull? ► How often should we measure? ► How long do we need the data? ► How do we represent the data schema?
  • 16. Stage II: GROWTH 3. Describe ► Which metrics relate to our outcomes of interest? ► What is the typical value of each metric? ► How do you visualize each metric? 4. Detect ► What do we expect to happen? ► Which values/events are unexpected? ► When should we alert? ► How will we scale our analysis?
  • 17. Stage III: MATURITY 5. Predict ► Are there patterns? ► Are there more complex relationships? ► What is going to happen? ► How do we get training data? 6. Act ► What actions should we take? ► How can we incorporate new outcomes into the current model?
  • 18. Stage IV: DECLINE 7. Feedback ► Is my model primarily basing its decisions upon its previous decisions? ► Can I separate the model from its parameters? ► Can I still evaluate accuracy? 8. Obsolescence ► Are my business scenarios still grounded? ► Do my model assumptions still hold? ► Does it still scale? ► Is the intervention still needed?
  • 19. Domain process involvement: BUSINESS ► Is involved early in defining requirements ENGINEERING ► Builds MVP ► Solidifies solution RESEARCH ► Builds prototype and suggests solution
  • 20. SOLUTION
  • 21. Working backwards. ITEM: 1 Skynet; 2 Action mapping; 3 Action landscape; 4 Prediction; 5 Categorical learning; 6 Training Data; 7 Feedback loop; 8 High SNR; 9 Unsupervised learning; 10 Anomaly Detection; 11 Normalization; 12 Retention; 13 Sampling; 14 Collection; 15 Approach; 16 Domain model. “In life, unless you’re more gifted than Einstein, inversion [i.e. working backwards] will help you solve problems.” (Charlie Munger)
  • 22. Working backwards (cont.) (chart axes: TIME vs. SOPHISTICATION; phases: INTRO, GROWTH, MATURITY, DECLINE)
       ITEM                       STAGE
       1  Skynet                  ACT
       2  Action mapping          ACT
       3  Action landscape        ACT
       4  Prediction              PREDICT
       5  Categorical learning    PREDICT
       6  Training Data           PREDICT
       7  Feedback loop           PREDICT
       8  High SNR                DETECT
       9  Unsupervised learning   DETECT
       10 Anomaly Detection       DETECT
       11 Normalization           DESCRIBE
       12 Retention               DESCRIBE
       13 Sampling                MEASURE
       14 Collection              MEASURE
       15 Approach                DESIGN
       16 Domain model            DESIGN
  • 23. Working backwards (cont.)
       ITEM                       STAGE      PRIMARY DOMAIN
       1  Skynet                  ACT        ENGINEERING
       2  Action mapping          ACT        BUSINESS
       3  Action landscape        ACT        RESEARCH
       4  Prediction              PREDICT    RESEARCH
       5  Categorical learning    PREDICT    RESEARCH
       6  Training Data           PREDICT    ENGINEERING
       7  Feedback loop           PREDICT    BUSINESS
       8  High SNR                DETECT     RESEARCH
       9  Unsupervised learning   DETECT     RESEARCH
       10 Anomaly Detection       DETECT     RESEARCH
       11 Normalization           DESCRIBE   RESEARCH
       12 Retention               DESCRIBE   ENGINEERING
       13 Sampling                MEASURE    RESEARCH
       14 Collection              MEASURE    ENGINEERING
       15 Approach                DESIGN     RESEARCH
       16 Domain model            DESIGN     BUSINESS
  • 24. This is a WIP (status legend: QUEUED (StampedeCon 2017?), WORKING, PRODUCTION)
       ITEM                       STAGE      PRIMARY DOMAIN
       1  Skynet                  ACT        ENGINEERING
       2  Action mapping          ACT        BUSINESS
       3  Action landscape        ACT        RESEARCH
       4  Prediction              PREDICT    RESEARCH
       5  Categorical learning    PREDICT    RESEARCH
       6  Training Data           PREDICT    ENGINEERING
       7  Feedback loop           PREDICT    BUSINESS
       8  High SNR                DETECT     RESEARCH
       9  Unsupervised learning   DETECT     RESEARCH
       10 Anomaly Detection       DETECT     RESEARCH
       11 Normalization           DESCRIBE   RESEARCH
       12 Sampling                MEASURE    RESEARCH
       13 Collection              MEASURE    ENGINEERING
       14 Domain model            DESIGN     BUSINESS
  • 25. LESSONS LEARNED
  • 26. 16. DOMAIN MODEL (DESIGN) ► 938,076 metrics ► Verify the unique stream of data across systems ► Key-based
  • 27. 15. APPROACH (DESIGN, cont.) VARIABILITY ► Changes in observed state ► Plan for variability UNCERTAINTY ► Unobserved state(s) ► Design for uncertainty
  • 28. 14. COLLECTION (MEASURE) ► Agreement of signals ► Cacophony of signals ► How often should we measure? ► We have no labeled training data ► An approach we can build upon in the future
  • 29. 13. SAMPLING (MEASURE, cont.) Shannon-Nyquist Paradox ► The more you measure something, the more it varies ► Bias related to time and variability ► E.g., temperature yesterday was 68 degrees
  • 30. 12. RETENTION (DESCRIBE) ► Recall that precision relates to sampling consistency ► Not all metrics are created equal ► Coverage remains problematic
  • 31. 11. NORMALIZATION (DESCRIBE, cont.) Simpson’s Paradox ► aggregate trend != sum of individual trends ► Applies to all aggregates: sums, averages, correlations, etc. ► What is the unit of analysis? Source: Kievit, R.A., Frankenhuis, et al. (2013). Simpson’s paradox in psychological science. Frontiers in Psychology.
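
As a rough illustration of the Simpson's paradox point on the slide above, the sketch below fits a least-squares slope to two groups separately and then to the pooled data. The two "hosts" and all of their values are made up for this example and do not come from the talk; the `slope` helper is likewise hypothetical.

```python
def slope(xs, ys):
    """Ordinary least-squares slope of ys against xs."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var = sum((x - mean_x) ** 2 for x in xs)
    return cov / var

# Hypothetical data: (load, latency) pairs for two hosts.
host_a_x, host_a_y = [1, 2, 3], [10, 11, 12]
host_b_x, host_b_y = [6, 7, 8], [2, 3, 4]

slope_a = slope(host_a_x, host_a_y)   # trend within host A
slope_b = slope(host_b_x, host_b_y)   # trend within host B
slope_pooled = slope(host_a_x + host_b_x, host_a_y + host_b_y)  # trend after aggregating

print(f"host A: {slope_a:+.2f}, host B: {slope_b:+.2f}, pooled: {slope_pooled:+.2f}")
```

Each host trends upward on its own, yet the pooled slope is negative, which is why the choice of unit of analysis matters before normalizing or aggregating.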
  • 32. 10. ANOMALY DETECTION (DETECT) ► Capture the time series data for each piece of connected platform technology ► Find implicit anomalies within a time series vector ► Values that are surprising ► Highly scalable [Figure: time series chart with Predicted, Actual, and Boundary series; marked CenturyLink Confidential]
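
The slide does not say which detector was used, so the following is only a minimal sketch of the general idea of flagging "surprising" values against a predicted value and a boundary. It assumes a trailing-window mean as the prediction and k standard deviations as the boundary; the function name, the window size, k, and the readings are all hypothetical.

```python
from statistics import mean, stdev

def detect_anomalies(series, window=5, k=3.0):
    """Return indices whose value falls outside predicted +/- boundary.

    predicted = mean of the previous `window` points (an assumption)
    boundary  = k * sample standard deviation of those points (an assumption)
    """
    anomalies = []
    for i in range(window, len(series)):
        history = series[i - window:i]
        predicted = mean(history)
        boundary = k * stdev(history)
        if abs(series[i] - predicted) > boundary:
            anomalies.append(i)
    return anomalies

# Hypothetical metric readings with one obvious spike at index 6.
readings = [10.0, 10.2, 9.9, 10.1, 10.0, 10.1, 25.0, 10.0, 9.8, 10.2]
flagged = detect_anomalies(readings)
print(flagged)
```

Because each metric is scored only against its own recent history, a detector of this shape can run independently per stream, which is one plausible reading of "highly scalable". Note that a perfectly flat window gives a zero boundary, so any change would be flagged; a production detector would need a floor on the boundary.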
  • 33. 9. UNSUPERVISED LEARNING / 8. HIGH SNR (DETECT, cont.) ► Time series data shows the context behind anomalies that co-occur ► Group anomalous vectors based upon structural properties and co-occurrence ► Up-level anomalies into higher-order alerts using contextual information
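
One simple way to picture "up-leveling" co-occurring anomalies is sketched below. It covers only temporal co-occurrence (not the structural properties the slide also mentions), and the grouping rule, the 60-second gap, the metric names, and the timestamps are assumptions made for illustration rather than the method used in the talk.

```python
def group_cooccurring(events, max_gap=60):
    """Group (timestamp_seconds, metric_name) anomalies that occur close together.

    A new group starts whenever the gap to the previous anomaly exceeds max_gap.
    """
    groups = []
    for ts, metric in sorted(events):
        if groups and ts - groups[-1][-1][0] <= max_gap:
            groups[-1].append((ts, metric))
        else:
            groups.append([(ts, metric)])
    return groups

# Hypothetical anomaly events from four different metrics.
events = [(100, "cpu"), (130, "disk_io"), (150, "latency"), (900, "memory")]

# Treat any group of two or more anomalies as one higher-order alert.
alerts = [group for group in group_cooccurring(events) if len(group) >= 2]
for alert in alerts:
    print([metric for _, metric in alert])
```

Collapsing many low-level anomalies into a few grouped alerts is what raises the signal-to-noise ratio for whoever receives them.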
  • 34. 7. FEEDBACK LOOP (PREDICT) ► We have also built a search engine for time series data that allows us to build cool-looking graphs in real time ► We basically do all of this to empower Slack alerts ► Allows tags to propagate forwards
  • 35. 6. TRAINING DATA (PREDICT, cont.) ► Evaluate ALL assumptions with regard to training data ► Ideally use an active learning approach or risk becoming tautological
  • 36. RESULTS
  • 37. Prediction Results ► 38,392,438 predictions every 24 hr ► Anomaly rate < 0.01% (0.0001), ~3K anomalies/day ► Accuracy is ~90% ► Prediction latency ~3.0 seconds ► ~30 higher-order alerts/day
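
The figures on this slide can be cross-checked with a few lines of arithmetic. The sketch below uses only the numbers quoted on the slide, treating "~3K" and "~30" as exactly 3,000 and 30, which is an approximation.

```python
predictions_per_day = 38_392_438   # from the slide
anomalies_per_day = 3_000          # "~3K" on the slide, treated as exact here
higher_order_alerts_per_day = 30   # "~30" on the slide, treated as exact here

anomaly_rate = anomalies_per_day / predictions_per_day
predictions_per_second = predictions_per_day / 86_400  # seconds in 24 hours
anomalies_per_alert = anomalies_per_day / higher_order_alerts_per_day

print(f"anomaly rate: {anomaly_rate:.6%}")
print(f"predictions per second: {predictions_per_second:.0f}")
print(f"anomalies per higher-order alert: {anomalies_per_alert:.0f}")
```

Under these assumptions the anomaly rate comes out just under the quoted 0.01% ceiling, and roughly a hundred raw anomalies roll up into each higher-order alert.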
  • 38. Want to join me? Let’s connect: ► @ryan_kirk Try CenturyLink Cloud free: ► ctl.io We are hiring: ► ctl.io/careers/jobs Thanks to: ► StampedeCon2016 ► pixabay.com