Machine Learning
April 19, 2016
Roadmap to Constructing a Top-Down Machine Learning Paradigm
1
Introduction to Southwestern Energy
Southwestern Energy Company (NYSE: SWN) is a
leading natural gas and oil company with operations
predominantly in the United States, engaged in
exploration, development and production activities,
including related natural gas gathering and marketing.
Source: http://www.swn.com/
2
Machine Learning, Deep Learning, AI
Roadmap to Constructing a Top-Down Machine Learning Paradigm
E&P organizations are turning more attention to accumulated data to
enhance operating efficiency, safety, and recovery. The computing
paradigm is shifting, the O&G paradigm is shifting, and the rise of the
machine learning paradigm requires careful attention to top-down
integrated systems engineering. A systems approach is presented to
stimulate out-of-the-box thinking about the machine learning
paradigm.
3
Past Paradigm Shifts
• Seismic
• Horizontal Drilling
• Offshore
• Factory Drilling
Paradigm Shifts in Process
• Big Crew Change
• Mobility (anytime, anywhere)
• Big Data
• Machine Learning
The Shifting O&G Paradigm
Source: Mark Reynolds, compilation
4
Changing Paradigms
• Computing Paradigm
(4th Paradigm / eScience)
• O&G Paradigm
(Shale 2.0)
New Paradigms
• Machine Learning Paradigm
Paradigms We Are Discussing Today
5
The Structure of Scientific Revolutions
• Normal Science
– Equilibrium, harmony
• Model Drift
– Outliers cease to be outliers
– Ripples turn to discontinuity
• Model Crisis
– Alternate methods permitted
– Out-of-the-box reconsidered
• Model Revolution
– New model becomes the new-normal
• Paradigm Change
– (Textbooks play catch-up)
Source: Thomas Kuhn, (1962) The Structure of Scientific Revolutions. University of Chicago Press
Mark Reynolds, compilation
[Figure: Kuhn Cycle: Normal Science → Model Drift (Anomaly) → Model Crisis → Model Revolution → Paradigm Change]
6
The Shifting Computing Paradigm
Descriptive and Formulaic
Hypothetical and Investigative
Expertise Driven
Models and Cases
Multivariant Differential Modelling
Source: Mark Reynolds, compilation
eScience
Traditional Science
7
The Shifting Computing Paradigms
• Empirical: O&G is where we found it
• Theoretical: O&G is where we expect it
• Computational: O&G is where we estimate it
• Data Exploration: O&G is where we infer it
Source: Mark Reynolds, compilation
8
The Machine Learning Paradigm
“A computer program is said to learn from experience
(E) with respect to some class of tasks (T) and
performance measure (P), if its performance at tasks in
T, as measured by P, improves with experience E.”
~Tom Mitchell
Source: Mitchell, T. (1997). Machine Learning. McGraw Hill.
Mark Reynolds, compilation
Machine Learning is the “Extraction of Wisdom
by Understanding the underlying Data”
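Mitchell's definition can be made concrete with a toy program. The sketch below is illustrative only and is not from the original slides: the task T (classifying a reading as above or below a cutoff), the experience E (a short list of labeled readings), and the performance measure P (accuracy on a fixed test grid) are all made-up assumptions, as is every name in the code.

```python
# Illustrative only: T, E, and P below are hypothetical stand-ins.

TRUE_THRESHOLD = 60  # assumed "ground truth" cutoff used to label readings


def label(x):
    """Task T: decide whether a reading is at or above the cutoff (1) or below it (0)."""
    return 1 if x >= TRUE_THRESHOLD else 0


# Experience E: labeled readings, in the order the program sees them.
EXPERIENCE = [(10, 0), (90, 1), (30, 0), (80, 1), (50, 0), (66, 1)]


def learn_threshold(examples):
    """Estimate the cutoff as the midpoint between the largest reading
    labeled 0 and the smallest reading labeled 1."""
    negatives = [x for x, y in examples if y == 0]
    positives = [x for x, y in examples if y == 1]
    return (max(negatives) + min(positives)) / 2


def performance(threshold):
    """Performance P: accuracy on a fixed grid of test readings 0..99."""
    test_points = range(100)
    correct = sum(1 for x in test_points
                  if (1 if x >= threshold else 0) == label(x))
    return correct / len(test_points)


def learning_curve():
    """Return (examples seen, accuracy) pairs as experience grows."""
    return [(n, performance(learn_threshold(EXPERIENCE[:n]))) for n in (2, 4, 6)]


for n, acc in learning_curve():
    print(f"E = {n} examples -> P = {acc:.2f}")
```

With this particular data the accuracy rises as more labeled readings are seen, which is the sense of "learning" in the definition. Real data would not necessarily improve at every step.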
9
The Catalyst
• Data captured by
instruments
• Data generated by
simulations
• Data acquired by
sensor networks
The Destination
• Solutions from data analysis
• Solutions from data mining
• Solutions from visualization
• Solutions from drill down
• Solutions for bottom line
• Solutions using eScience
Machine Learning in the 4th Paradigm
Source: Mark Reynolds, compilation
eScience and the Fourth Paradigm: Data-Intensive Scientific Discovery and Digital Preservation, Tony Hey, Microsoft Research
http://www.alliancepermanentaccess.org/wp-content/uploads/2011/12/apa2011/15_%28Nov11%29TonyHey-APA%20Meeting.pdf
“eScience is the set of tools and technologies
to support data federation and collaboration”
~ Jim Gray
10
Predictive Analytics
• Focuses on Prediction
– Based on Known Properties
– Learned from Training Data
Data Mining
• Focuses on Discovery
– Unknown Properties in Data
– The Analysis Phase of
Knowledge Discovery
Precursors to Machine Learning
Machine Learning is the “Extraction of Wisdom
by Understanding the underlying Data”
~Mark Reynolds
Source: Mark Reynolds, compilation
11
The Machine Learning Paradigm
Unsupervised Learning
Supervised Learning
Semi-Supervised Learning
Reinforcement Learning
24/7
Predictive
Analytics
Data
Mining
Machine
Learning
AI
Source: Mark Reynolds, compilation
12
Principal Concepts in Machine Learning
• Unsupervised Learning
– Data is unlabeled
• Supervised Learning
– Teach and train with data that is well labeled with a
defined output
• Reinforcement Learning
– The validity of the data alignment serves as feedback
• Semi-Supervised Learning
– Some of the data is labeled, some is unlabeled
Source: Mark Reynolds, compilation
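The difference between the first two concepts can be sketched in a few lines. This is a minimal illustration under stated assumptions, not the presenter's method: the readings and labels are invented, the supervised learner is a simple nearest-centroid rule, and the unsupervised learner is a one-dimensional two-group k-means. Reinforcement and semi-supervised learning are not covered here.

```python
from statistics import mean

# Made-up sensor readings; the labels are available only in the supervised case.
readings = [10, 12, 8, 50, 52, 48]
labels = ["low", "low", "low", "high", "high", "high"]


def fit_supervised(xs, ys):
    """Supervised: labels are given, so learn one centroid per label."""
    return {lab: mean(x for x, y in zip(xs, ys) if y == lab) for lab in set(ys)}


def predict(centroids, x):
    """Assign x the label of the nearest learned centroid."""
    return min(centroids, key=lambda lab: abs(x - centroids[lab]))


def fit_unsupervised(xs, iterations=10):
    """Unsupervised: no labels, so discover two groups with 1-D k-means."""
    centers = [min(xs), max(xs)]  # simple deterministic initialization
    for _ in range(iterations):
        groups = [[], []]
        for x in xs:
            nearest = 0 if abs(x - centers[0]) <= abs(x - centers[1]) else 1
            groups[nearest].append(x)
        # Keep the old center if a group ends up empty.
        centers = [mean(g) if g else c for g, c in zip(groups, centers)]
    return centers


centroids = fit_supervised(readings, labels)
print(predict(centroids, 20))       # label chosen from the taught examples
print(fit_unsupervised(readings))   # group centers found without any labels
```

The supervised model can name its answer ("low" or "high") because it was taught with labels; the unsupervised model only reports that two groups exist and where their centers lie.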
13
Domestic
• Nest® Thermostats
• Pandora / Amazon
• Spam Detection
• Fraud Detection
• Traffic Light Duty Cycle
• Google
Upstream O&G
• Pump-Jack Duty Cycle
(circa 1986)
• Closed Loop Directional
Drilling (circa 2009)
Examples of Machine Learning
14
The Bridge Into Machine Learning
Today Tomorrow
Integrated Systems Engineering
15
Integrated Systems Engineering
Systems & Knowledge Engineer
O&G Systems
Control Systems
Remote Systems
Information Systems
Embedded Systems
Robotic Systems
Data Fusion
Real-Time Systems
Look-Back Analysis
Look-Ahead Systems
Land and Regulatory
Geology Geophysics
Drilling Engineering
Completion Engineering
Production Engineering
Reservoir Engineering
Systems Engineering
Source: Mark Reynolds, compilation
16
Integrated Engineering – Top-Down
• Engineering the Source
– Signals, content, and
characterizations
• Engineering the Data
– Address errant data
– Address valid spurious data
– Address data quality
• Engineering the Store
– Repository
– Recall and Reporting
– Representations
Data Acquisition
Data Transmission
Data Retention
Data Analysis
Data Reduction
Source: Mark Reynolds, compilation
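One way to read "Engineering the Data" is as a triage step that separates errant readings from valid but spurious ones before anything is stored. The sketch below is an assumption-laden illustration: the physical range, the median-absolute-deviation rule, the factor `k`, and the function name `triage` are all hypothetical choices, not taken from the slides.

```python
from statistics import median


def triage(readings, low, high, k=5):
    """Split readings into errant, valid-but-spurious, and clean.

    errant:   outside the assumed physical range [low, high] (e.g. a -999 sentinel)
    spurious: physically possible, but far from the median of the in-range data
    clean:    everything else
    """
    errant = [x for x in readings if not (low <= x <= high)]
    in_range = [x for x in readings if low <= x <= high]
    if not in_range:
        return {"errant": errant, "spurious": [], "clean": []}

    center = median(in_range)
    # Median absolute deviation: a spread estimate that outliers barely move.
    mad = median(abs(x - center) for x in in_range)

    def is_spurious(x):
        return mad > 0 and abs(x - center) > k * mad

    return {
        "errant": errant,
        "spurious": [x for x in in_range if is_spurious(x)],
        "clean": [x for x in in_range if not is_spurious(x)],
    }


# Made-up pressure readings: -999 is a sensor sentinel, 150 is real but unusual.
result = triage([100, 102, 98, 101, -999, 99, 150, 100], low=0, high=500)
print(result)
```

Keeping spurious readings in their own bucket, rather than deleting them, leaves the decision about whether they carry signal to a later analysis step.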
17
Integrated Engineering – Top-Down
• Engineering the Store
– Data distribution
– Data staging
• Engineering the Recall
– Simple query
– Cube vs. Matrix
• Engineering the Use Case
– Destination: human
– Destination: machine
Classification
Regression
Clustering
Density Estimation
Dimensionality Reduction
18
Integrated Engineering – System Flow
Acquire Analyze Annunciate Archive Analyze Anticipate Apply
Data
Information
Visualization
Knowledge
Forensics
Understanding
Analysis & Mining
Wisdom
Anticipating
Application
• Creating Informational Accessibility and Transparency
• Discovering Experiential Performance Improvements
• Segmenting Processes and Process Results
• Replacing Human Decision w/ Automated Algorithms
• Innovating New Models, Products, Services
Source: Mark Reynolds, compilation
19
Integrated Engineering – Top-Down
Data
Modeling
Proactive & Closed-Loop Systems
Mining and Analytics
Forensics
Control
Visualization and Observation
Source
Capture and Utilization
• Intelligence during operations (Observation and Anticipation)
• Intelligence reviewing operations (Forensic)
• Intelligence planning operations (Historical and Analytical)
Source: Mark Reynolds, compilation
Well Plan
RT Prod
RT Drill
Geo-steer
RT Frac
Daily Rpts
AFE
20
Applied Machine Learning 101
Learning (Phase 1): Training Data → Pre-Processing → Learning → Error Analysis → Model
Prediction (Phase 2): New Data → Model → Predictable Result
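The two phases can be sketched end to end with a very small model. This is a minimal illustration, assuming a one-variable least-squares line as the "model" and made-up training records; the function names are hypothetical and chosen to mirror the boxes on the slide.

```python
def preprocess(records):
    """Pre-processing: drop records with a missing input or output."""
    return [(x, y) for x, y in records if x is not None and y is not None]


def learn(records):
    """Learning: ordinary least-squares fit of y = slope * x + intercept."""
    n = len(records)
    mean_x = sum(x for x, _ in records) / n
    mean_y = sum(y for _, y in records) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in records)
    sxx = sum((x - mean_x) ** 2 for x, _ in records)
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x


def error_analysis(model, records):
    """Error analysis: mean absolute error of the model on the given records."""
    slope, intercept = model
    return sum(abs(slope * x + intercept - y) for x, y in records) / len(records)


def predict(model, x):
    """Prediction: apply the learned model to new data."""
    slope, intercept = model
    return slope * x + intercept


# Phase 1 (learning): made-up training data, including one incomplete record.
training_data = [(0, 1), (1, 3), (2, 5), (3, 7), (4, 9), (5, None)]
clean = preprocess(training_data)
model = learn(clean)
training_error = error_analysis(model, clean)

# Phase 2 (prediction): the stored model is applied to a new input.
prediction = predict(model, 10)
print(model, training_error, prediction)
```

In practice the error analysis would be run on data held back from training, and a poor score would send the work back to the pre-processing or learning step.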
21
Representative Algorithms
• Decision Tree Learning
– Maps observations to conclusions
• Association Rule Learning
– Discovering interesting relations
• Artificial Neural Networks
– Incremental function modules
• Inductive Logic Programming
– Rule based representations for
input --> output
• Support Vector Machines
– Classification and regression
• Clustering
– Assignment of observations to
clusters
• Bayesian Networks
– Probabilistic models correlating
variables
• Reinforcement Learning
– Finds policy to map states to
desired outcome
• Representation Learning
– Principal component analysis
• Similarity & Metric Learning
– Pairs of examples train others
• Sparse Dictionary Learning
– Datum as linear combinations
• Genetic Algorithms
– Mimics natural heuristics
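As one concrete instance from the list, decision tree learning can be reduced to its smallest case: a one-level tree (often called a decision stump) that learns a single threshold test. The sketch below is illustrative only; the data are invented and the function names are hypothetical. A full decision tree learner would apply the same split search recursively over several features.

```python
def classify(threshold, x):
    """One-level decision tree: a single test maps an observation to a conclusion."""
    return 1 if x > threshold else 0


def learn_stump(xs, labels):
    """Pick the threshold (midpoint between neighboring values)
    that makes the fewest errors on the training data."""
    values = sorted(set(xs))
    candidates = [(a + b) / 2 for a, b in zip(values, values[1:])]

    def errors(t):
        return sum(1 for x, y in zip(xs, labels) if classify(t, x) != y)

    return min(candidates, key=errors)


# Made-up observations and their known conclusions.
xs = [1, 2, 3, 7, 8, 9]
labels = [0, 0, 0, 1, 1, 1]

threshold = learn_stump(xs, labels)
print(threshold, classify(threshold, 6), classify(threshold, 4))
```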
22
Machine Learning: Data Diversity
• Macro (or field-level)
– Spatial
– Temporal
• Pad (or offset)
– Spatial
– Temporal
• Well (or wellbore)
– Spatial
– Temporal
• External
– Uploads
– Political, Climate, etc.
• The 3 Cs of Data Quality
– Consistency
– Correctness
– Completeness
– [#4] Currency
– [#5] Conformity
Source: Mark Reynolds, compilation
Data Diversity - Spatial, Temporal, Referential
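The first three Cs of data quality can each be expressed as a simple per-record check. The sketch below is an assumption, not a definition from the slides: the field names, the pressure limits, and the reading of "consistency" as timestamps that always increase within a well are all hypothetical choices made for illustration.

```python
REQUIRED_FIELDS = ("well_id", "time_s", "pressure_psi")  # hypothetical schema


def is_complete(record):
    """Completeness: every required field is present and not None."""
    return all(record.get(field) is not None for field in REQUIRED_FIELDS)


def is_correct(record):
    """Correctness: the pressure lies inside assumed physical limits."""
    pressure = record.get("pressure_psi")
    return pressure is not None and 0 <= pressure <= 20000


def quality_report(records):
    """Return (complete, correct, consistent) for each record, in order.

    Consistency is taken here to mean that timestamps strictly increase
    within each well; this is one possible interpretation only.
    """
    last_time = {}
    report = []
    for record in records:
        well, t = record.get("well_id"), record.get("time_s")
        consistent = t is not None and (well not in last_time or t > last_time[well])
        if consistent:
            last_time[well] = t
        report.append((is_complete(record), is_correct(record), consistent))
    return report


# Made-up records: one good, one incomplete, one out of order, one out of range.
records = [
    {"well_id": "A", "time_s": 0, "pressure_psi": 1500},
    {"well_id": "A", "time_s": 10, "pressure_psi": None},
    {"well_id": "A", "time_s": 5, "pressure_psi": 1520},
    {"well_id": "A", "time_s": 20, "pressure_psi": -40},
]
report = quality_report(records)
for row in report:
    print(row)
```

Currency and conformity could be added in the same style, for example as a maximum record age and a unit or format check.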
23
The Fast Data ecosystem in O&G
Land
Drilling
Reservoir Completion
Water
Production
Steering Regulatory
Midstream
Source: Assorted web images
24
Algorithmic Approaches (revisited)
• Decision Tree Learning
– Maps observations to conclusions
• Association Rule Learning
– Discovering interesting relations
• Artificial Neural Networks
– Incremental function modules
• Inductive Logic Programming
– Rule based representations for
input --> output
• Support Vector Machines
– Classification and regression
• Clustering
– Assignment of observations to
clusters
• Bayesian Networks
– Probabilistic models correlating
variables
• Reinforcement Learning
– Finds policy to map states to
desired outcome
• Representation Learning
– Principal component analysis
• Similarity & Metric Learning
– Pairs of examples train others
• Sparse Dictionary Learning
– Datum as linear combinations
• Genetic Algorithms
– Mimics natural heuristics
25
Keep Your Eye on the Prize
Data
Information
Knowledge
Understanding
Wisdom
Application
The question is NOT
“How can we … ?”
But instead
“What is the objective?”
( or “Why?” )
26
Mark Reynolds
Mark Reynolds Vitae
• Southwestern Energy
• Lone Star College
• Intent Driven Designs
• Scan Systems
• Sikorsky Aircraft
• General Dynamics
• Southwestern Energy Email
– Mark_Reynolds@swn.com

  • 2. 1 Introduction to Southwestern Energy Southwestern Energy Company (NYSE: SWN) is a leading natural gas and oil company with operations predominantly in the United States, engaged in exploration, development and production activities, including related natural gas gathering and marketing. Source: http://www.swn.com/
  • 3. 2 Machine Learning, Deep Learning, AI Roadmap to Constructing a Top Down Machine Learning Paradigm E&P organizations are turning more attention to accumulated data to enhance operating efficiency, safety, and recovery. The computing paradigm is shifting, the O&G paradigm is shifting, and the rise of the machine learning paradigm requires careful attention to top-down integrated systems engineering. A system approach will be presented to stimulate out-of-the-box thinking to address the machine learning paradigm.
  • 4. 3 Past Paradigm Shifts • Seismic • Horizontal Drilling • Off Shore • Factory Drilling Paradigm Shifts in Process • Big Crew Change • Mobility (anytime, anywhere) • Big Data • Machine Learning The Shifting O&G Paradigm Source: Mark Reynolds, compilation
  • 5. 4 Changing Paradigms • Computing Paradigm (4th Paradigm / eScience) • O&G Paradigm (Shale 2.0) New Paradigms • Machine Learning Paradigm Paradigms We Are Discussing Today
  • 6. 5 The Structure of Scientific Revolutions • Normal Science – Equilibrium, harmony • Model Drift – Outliers cease to be outliers – Ripples turn to discontinuity • Model Crisis – Alternate methods permitted – Out-of-the-box reconsidered • Model Revolution – New model becomes the new-normal • Paradigm Change – (Textbooks play catch-up) Source: Thomas Kuhn, (1962) The Structure of Scientific Revolutions. University of Chicago Press; Mark Reynolds, compilation Kuhn Cycle: Normal Science → Model Drift (Anomaly) → Model Crisis → Model Revolution → Paradigm Change
  • 7. 6 The Shifting Computing Paradigm Descriptive and Formulaic Hypothetical and Investigative Expertise Driven Models and Cases Multivariant Differential Modelling Source: Mark Reynolds, compilation eScience Traditional Science
  • 8. 7 The Shifting Computing Paradigms • Empirical: O&G is where we found it • Theoretical: O&G is where we expect it • Computational: O&G is where we estimate it • Data Exploration: O&G is where we infer it Source: Mark Reynolds, compilation
  • 9. 8 The Machine Learning Paradigm “A computer program is said to learn from experience (E) with respect to some class of tasks (T) and performance measure (P), if its performance at tasks in T, as measured by P, improves with experience E.” ~Tom Mitchell Source: Mitchell, T. (1997). Machine Learning, McGraw Hill; Mark Reynolds, compilation Machine Learning is the “Extraction of Wisdom by Understanding the underlying Data”
  • 10. 9 The Catalyst • Data captured by instruments • Data generated by simulations • Data acquired by sensor networks The Destination • Solutions from data analysis • Solutions from data mining • Solutions from visualization • Solutions from drill down • Solutions for bottom line • Solutions using eScience Machine Learning in the 4th Paradigm Source: Mark Reynolds, compilation; eScience and the Fourth Paradigm: Data-Intensive Scientific Discovery and Digital Preservation, Tony Hey, Microsoft Research http://www.alliancepermanentaccess.org/wp-content/uploads/2011/12/apa2011/15_%28Nov11%29TonyHey-APA%20Meeting.pdf “eScience is the set of tools and technologies to support data federation and collaboration” ~ Jim Gray
  • 11. 10 Predictive Analytics • Focuses on Prediction – Based on Known Properties – Learned from Training Data Data Mining • Focuses on Discovery – Unknown Properties in Data – The Analysis Phase of Knowledge Discovery Precursors to Machine Learning Machine Learning is the “Extraction of Wisdom by Understanding the underlying Data” ~Mark Reynolds Source: Mark Reynolds, compilation
  • 12. 11 The Machine Learning Paradigm Unsupervised Learning Supervised Learning Semi-Supervised Learning Reinforcement Learning 24/7 Predictive Analytics Data Mining Machine Learning AI Source: Mark Reynolds, compilation
  • 13. 12 Principal Concepts in Machine Learning • Unsupervised Learning – Data is unlabeled • Supervised Learning – Teach and train with data that is well labeled with a defined output • Reinforcement Learning – The validity of the data alignment serves as feedback • Semi-Supervised Learning – Some of the data is labeled, some is unlabeled Source: Mark Reynolds, compilation
  • 14. 13 Domestic • Nest® Thermostats • Pandora / Amazon • Spam Detection • Fraud Detection • Traffic Light Duty Cycle • Google Upstream O&G • Pump-Jack Duty Cycle (circa 1986) • Closed Loop Directional Drilling (circa 2009) Examples of Machine Learning
  • 15. 14 The Bridge Into Machine Learning Today Tomorrow Integrated Systems Engineering
  • 16. 15 Integrated Systems Engineering Systems & Knowledge Engineer O&G Systems Control Systems Remote Systems Information Systems Embedded Systems Robotic Systems Data Fusion Real-Time Systems Look-Back Analysis Look- Ahead Systems Land and Regulatory Geology Geophysics Drilling Engineering Completion Engineering Production Engineering Reservoir Engineering Systems Engineering Source: Mark Reynolds, compilation
  • 17. 16 Integrated Engineering – Top-Down • Engineering the Source – Signals, content, and characterizations • Engineering the Data – Address errant data – Address valid spurious data – Address data quality • Engineering the Store – Repository – Recall and Reporting – Representations Data Acquisition Data Transmission Data Retention Data Analysis Data Reduction Source: Mark Reynolds, compilation
  • 18. 17 Integrated Engineering – Top-Down • Engineering the Store – Data distribution – Data staging • Engineering the Recall – Simple query – Cube v Matrix • Engineering the Use Case – Destination: human – Destination: machine Classification Regression Clustering Density Estimation Dimensional Reduction
  • 19. 18 Integrated Engineering – System Flow Acquire Analyze Annunciate Archive Analyze Anticipate Apply Data Information Visualization Knowledge Forensics Understanding Analysis & Mining Wisdom Anticipating Application • Creating Informational Accessibility and Transparency • Discovering Experiential Performance Improvements • Segmenting Processes and Process Results • Replacing Human Decision w/ Automated Algorithms • Innovating New Models, Products, Services Source: Mark Reynolds, compilation
  • 20. 19 Integrated Engineering – Top-Down Data Modeling Proactive & Closed-Loop Systems Mining and Analytics Forensics Control Visualization and Observation Source Capture and Utilization • Intelligence during operations (Observation and Anticipation) • Intelligence reviewing operations (Forensic) • Intelligence planning operations (Historical and Analytical) Source: Mark Reynolds, compilation Well Plan RT Prod RT Drill Geo-steer RT Frac Daily Rpts AFE
  • 21. 20 Applied Machine Learning 101 Training Data Pre- Processing Learning Error Analysis Model Learning (Phase 1) Prediction (Phase 2) New Data Model Predictable Result
  • 22. 21 Representative Algorithms • Decision Tree Learning – Maps observation to conclusions • Association Rule Learning – Discovering interesting relations • Artificial Neural Networks – Incremental function modules • Inductive Logic Programming – Rule based representations for input --> output • Support Vector Machines – Classification and regression • Clustering – Assignment of observations to clusters • Bayesian Networks – Probabilistic models correlating variables • Reinforcement Learning – Finds policy to map states to desired outcome • Representation Learning – Principal component analysis • Similarity & Metric Learning – Pairs of examples train others • Sparse Dictionary Learning – Datum as linear combinations • Genetic Algorithms – Mimics natural heuristics
  • 23. 22 Machine Learning: Data Diversity • Macro (or field-level) – Spatial – Temporal • Pad (or offset) – Spatial – Temporal • Well (or wellbore) – Spatial – Temporal • External – Uploads – Political, Climate, etc • The 3 Cs of Data Quality – Consistency – Correctness – Completeness – [#4] Currency – [#5] Conformity Source: Mark Reynolds, compilation Data Diversity - Spatial, Temporal, Referential
  • 24. 23 The Fast Data ecosystem in O&G Land Drilling Reservoir Completion Water Production Steering Regulatory Midstream Source: Assorted web images
  • 25. 24 Algorithmic Approaches (revisited) • Decision Tree Learning – Maps observation to conclusions • Association Rule Learning – Discovering interesting relations • Artificial Neural Networks – Incremental function modules • Inductive Logic Programming – Rule based representations for input --> output • Support Vector Machines – Classification and regression • Clustering – Assignment of observations to clusters • Bayesian Networks – Probabilistic models correlating variables • Reinforcement Learning – Finds policy to map states to desired outcome • Representation Learning – Principal component analysis • Similarity & Metric Learning – Pairs of examples train others • Sparse Dictionary Learning – Datum as linear combinations • Genetic Algorithms – Mimics natural heuristics
  • 26. 25 Keep Your Eye on the Prize Data Information Knowledge Understanding Wisdom Application The question is NOT “How can we … ?” But instead “What is the objective?” ( or “Why?” )
  • 27. 26 Mark Reynolds Mark Reynolds Vitae • Southwestern Energy • Lone Star College • Intent Driven Designs • Scan Systems • Sikorsky Aircraft • General Dynamics • Southwestern Energy Email – Mark_Reynolds@swn.com
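
The distinction drawn on the "Principal Concepts in Machine Learning" slide (slide 13) between labeled and unlabeled data can be illustrated with a small sketch. This is not from the deck: the functions `nearest_neighbor_label` and `two_means`, and the sample readings, are hypothetical and chosen only to contrast the two modes. The supervised routine needs a label with each training value; the unsupervised routine sees only the values and groups them itself.

```python
def nearest_neighbor_label(labeled, x):
    """Supervised: return the label of the closest labeled example."""
    return min(labeled, key=lambda pair: abs(pair[0] - x))[1]


def two_means(values, iterations=10):
    """Unsupervised: split unlabeled 1-D values into two clusters (k-means, k=2).

    Assumes the input holds at least two distinct values.
    """
    c0, c1 = min(values), max(values)  # start the centroids at the extremes
    for _ in range(iterations):
        # Assignment step: each value joins its nearer centroid.
        g0 = [v for v in values if abs(v - c0) <= abs(v - c1)]
        g1 = [v for v in values if abs(v - c0) > abs(v - c1)]
        # Update step: each centroid moves to the mean of its group.
        c0, c1 = sum(g0) / len(g0), sum(g1) / len(g1)
    return sorted(g0), sorted(g1)


# Supervised: every training value carries a label (illustrative data).
labeled = [(1.0, "low"), (1.2, "low"), (8.9, "high"), (9.3, "high")]
supervised_result = nearest_neighbor_label(labeled, 2.0)

# Unsupervised: the same kind of values, but with no labels at all.
unlabeled = [1.0, 1.2, 0.8, 9.3, 8.9, 9.1]
clusters = two_means(unlabeled)
```

Semi-supervised learning, as the slide describes it, would sit between the two: a few labeled pairs plus a larger pool of unlabeled values.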
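
The two-phase flow on the "Applied Machine Learning 101" slide (slide 21) can be sketched end to end in a few lines. The sketch below is an illustration under stated assumptions, not the author's implementation: the synthetic data, the choice of a least-squares line as the model, and all function names are hypothetical. It follows the slide's stages in order: training data, pre-processing, learning, error analysis, then prediction on new data.

```python
import random
from statistics import mean


def preprocess(rows):
    """Pre-processing: drop records with missing values."""
    return [(x, y) for x, y in rows if x is not None and y is not None]


def learn(rows):
    """Learning: ordinary least-squares fit of y = slope * x + intercept."""
    xs = [x for x, _ in rows]
    ys = [y for _, y in rows]
    x_bar, y_bar = mean(xs), mean(ys)
    sxx = sum((x - x_bar) ** 2 for x in xs)
    sxy = sum((x - x_bar) * (y - y_bar) for x, y in rows)
    slope = sxy / sxx
    return slope, y_bar - slope * x_bar


def error_analysis(model, rows):
    """Error analysis: mean absolute error on held-out records."""
    slope, intercept = model
    return mean(abs(y - (slope * x + intercept)) for x, y in rows)


def predict(model, x):
    """Prediction: apply the learned model to a new input."""
    slope, intercept = model
    return slope * x + intercept


# Phase 1 (Learning). Synthetic training data: y = 2x + 1 plus small noise.
rng = random.Random(0)
raw = [(float(x), 2.0 * x + 1.0 + rng.uniform(-0.1, 0.1)) for x in range(20)]
raw.append((None, 5.0))  # one errant record for pre-processing to remove
clean = preprocess(raw)
train, holdout = clean[::2], clean[1::2]  # simple alternating split
model = learn(train)
mae = error_analysis(model, holdout)

# Phase 2 (Prediction). New data goes through the stored model.
result = predict(model, 25.0)
```

Holding out half of the cleaned records keeps the error analysis honest: the error is measured on data the model did not see during learning.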
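
The "3 Cs of Data Quality" on the "Machine Learning: Data Diversity" slide (slide 23) lend themselves to simple automated checks. The following is a minimal sketch, assuming a hypothetical record layout (`well_id`, `timestamp`, `pressure`, `unit`) and an assumed plausible pressure range; none of these names or limits come from the deck, and the interpretation of each "C" is one possible reading.

```python
# Hypothetical record schema and limits, for illustration only.
REQUIRED = ("well_id", "timestamp", "pressure", "unit")
VALID_RANGE = {"psi": (0.0, 20000.0)}  # assumed plausible range per unit


def is_complete(rec):
    """Completeness: every required field is present and not None."""
    return all(rec.get(k) is not None for k in REQUIRED)


def is_correct(rec):
    """Correctness: the value is numeric and inside the range for its unit.

    Units without a configured range are not range-checked.
    """
    lo, hi = VALID_RANGE.get(rec.get("unit"), (float("-inf"), float("inf")))
    p = rec.get("pressure")
    return isinstance(p, (int, float)) and lo <= p <= hi


def inconsistent_wells(records):
    """Consistency: flag wells whose records mix more than one unit."""
    units = {}
    for rec in records:
        if rec.get("well_id") is not None and rec.get("unit") is not None:
            units.setdefault(rec["well_id"], set()).add(rec["unit"])
    return sorted(w for w, u in units.items() if len(u) > 1)


records = [
    {"well_id": "W-1", "timestamp": 1, "pressure": 3100.0, "unit": "psi"},
    {"well_id": "W-1", "timestamp": 2, "pressure": None, "unit": "psi"},
    {"well_id": "W-2", "timestamp": 1, "pressure": -50.0, "unit": "psi"},
    {"well_id": "W-2", "timestamp": 2, "pressure": 21.4, "unit": "MPa"},
]

report = {
    "incomplete": [i for i, r in enumerate(records) if not is_complete(r)],
    "incorrect": [
        i for i, r in enumerate(records) if is_complete(r) and not is_correct(r)
    ],
    "inconsistent_wells": inconsistent_wells(records),
}
```

Here the report flags record 1 as incomplete, record 2 as incorrect (negative pressure), and well W-2 as inconsistent because its records mix units. The slide's additional items, Currency and Conformity, could be added as further checks on timestamps and field formats.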