Mark Reynolds
Azure Machine Learning
Introduction using the AML Learning Studio
Mark Reynolds
October 14, 2017
The Machine Learning Paradigm
Source: Tom Mitchell, Mitchell, T. (1997). Machine Learning, McGraw Hill.
Mark Reynolds, compilation
“ A computer program is said to learn from experience (E)
with respect to some class of tasks (T) and performance
measure (P), if its performance at tasks in T, as measured by
P, improves with experience E. ”
~Tom Mitchell
Machine Learning is the “Extraction of Wisdom
by Understanding the underlying Data”
Copyright © 2018 by Mark Reynolds 2
Buzzwords for 2017
• Big Data
• Machine Learning
• Blockchain
• Tensor Flow
Source: Mark Reynolds, compilation
Copyright © 2018 by Mark Reynolds 3
The Shifting Computing Paradigm
Descriptive
and
Formulaic
Hypothetical
and
Investigative
Expertise
Driven
Models and
Cases
Multivariant
Differential
Modelling
Source: Mark Reynolds, compilation
eScience
Traditional Science
Copyright © 2018 by Mark Reynolds 4
Becoming Knowledge Learning Organizations
Seismic
Drilling
Completions
Production
Data
Information
Visualization
Knowledge
Forensics
Understanding
Analysis & Mining
Wisdom
Anticipating
Application
  
  
RT
Frac
Daily
Rpts
Well
Plan RT
Drill
Geo-
steer
AFE
RT
Prod
… Applying the DIKW Pedagogy
Source: Mark Reynolds, compilation
Copyright © 2018 by Mark Reynolds 5
Integrated Engineering – System Flow
Acquire Analyze Annunciate Archive Analyze Anticipate Apply
Data
Information
Visualization
Knowledge
Forensics
Understanding
Analysis & Mining
Wisdom
Anticipating
Application
 Creating Informational Accessibility and Transparency
 Discovering Experiential Performance Improvements
 Segmenting Processes and Process Results
 Replacing Human Decision w/ Automated Algorithms
 Innovating New Models, Products, Services
Source: Mark Reynolds, compilation
Real Time Analytics (including ML)
Hindsight Insight Foresight
Copyright © 2018 by Mark Reynolds 6
Analytics, Applied
Data
Information
Visualization
Knowledge
Forensics
Understanding
Analysis & Mining
Wisdom
Anticipating
Application
Source: Mark Reynolds, compilation
Hindsight Insight Foresight
Descriptive
Analytics
• What
happened?
Diagnostic
Analytics
• Why did it
happen?
Predictive
Analytics
• What will
happen next?
Prescriptive
Analytics
• What can we
do about it?
Document
and
File
Tabulate
and
Summarize
Chart
and
Plot
Process
and
Analyze
Copyright © 2018 by Mark Reynolds 7
The Machine Learning Paradigm
Unsupervised Learning
Supervised Learning
Semi-Supervised Learning
Reinforcement Learning
24/7
Predictive
Analytics
Data
Mining
Machine
Learning
AI
Beware of the 90-foot
discovery!
Beware of extreme data
vetting!
Beware of rabbit holes!
Beware of over
tweaking!
Source: Mark Reynolds, compilation
Copyright © 2018 by Mark Reynolds 8
The Machine Learning Process
Source: Introduction to Azure, David Chappel
Copyright © 2018 by Mark Reynolds 9
Validating and Applying Successes
Source: Machine Learning has transformed many aspects of our everyday life, can it do the same for public services?, Natalia Angarita , May 23, 2016, Capgemini
https://www.capgemini.com/blog/insights-data-blog/2016/05/machine-learning-has-transformed-many-aspects-of-our-everyday-life
Copyright © 2018 by Mark Reynolds 10
Azure Machine Learning Provides
Source: Introduction to Azure, David Chappel
Copyright © 2018 by Mark Reynolds 11
Common Algorithmic Approaches
• Decision Tree Learning
• Maps observation to conclusions
• Association Rule Learning
• Discovering interesting relations
• Artificial Neural Networks
• Incremental function modules
• Inductive Logic Programming
• Rule based representations for input --> output
• Support Vector Machines
• Classification and regression
• Clustering
• Assignment of observations to clusters
• Bayesian Networks
• Probabilistic models correlating variables
• Reinforcement Learning
• Finds policy to map states to desired outcome
• Representation Learning
• Principal component analysis
• Similarity & Metric Learning
• Pairs of examples train others
• Sparse Dictionary Learning
• Datum as linear combinations
• Genetic Algorithms
• Mimics natural heuristics
Source: Mark Reynolds, compilation
Copyright © 2018 by Mark Reynolds 12
Copyright © 2018 by Mark Reynolds 13
https://studio.azureml.net/
Copyright © 2018 by Mark Reynolds 14
Azure ML Development Workspace
Copyright © 2018 by Mark Reynolds 15
Azure ML Development Workspace
Copyright © 2018 by Mark Reynolds 16
Azure Machine Learning
Copyright © 2018 by Mark Reynolds 17
Regression Algorithms
• Ordinary Least Squares Regression (OLSR)
• Linear Regression
• Logistic Regression
• Stepwise Regression
• Multivariate Adaptive Regression Splines
(MARS)
• Locally Estimated Scatterplot Smoothing
(LOESS)
Source: https://machinelearningmastery.com/a-tour-of-machine-learning-algorithms/
Copyright © 2018 by Mark Reynolds 18
Instance-based Algorithms
• k-Nearest Neighbor (kNN)
• Learning Vector Quantization (LVQ)
• Self-Organizing Map (SOM)
• Locally Weighted Learning (LWL)
Source: https://machinelearningmastery.com/a-tour-of-machine-learning-algorithms/
Copyright © 2018 by Mark Reynolds 19
Regularization Algorithms
• Ridge Regression
• Least Absolute Shrinkage and Selection
Operator (LASSO)
• Elastic Net
• Least-Angle Regression (LARS)
Source: https://machinelearningmastery.com/a-tour-of-machine-learning-algorithms/
Copyright © 2018 by Mark Reynolds 20
Decision Tree Algorithms
• Classification and Regression Tree (CART)
• Iterative Dichotomiser 3 (ID3)
• C4.5 and C5.0 (different versions of a
powerful approach)
• Chi-squared Automatic Interaction
Detection (CHAID)
• Decision Stump
• M5
• Conditional Decision Trees
Source: https://machinelearningmastery.com/a-tour-of-machine-learning-algorithms/
Copyright © 2018 by Mark Reynolds 21
Bayesian Algorithms
• Naive Bayes
• Gaussian Naive Bayes
• Multinomial Naive Bayes
• Averaged One-Dependence Estimators
(AODE)
• Bayesian Belief Network (BBN)
• Bayesian Network (BN)
Source: https://machinelearningmastery.com/a-tour-of-machine-learning-algorithms/
Copyright © 2018 by Mark Reynolds 22
Clustering Algorithms
• k-Means
• k-Medians
• Expectation Maximisation (EM)
• Hierarchical Clustering
Source: https://machinelearningmastery.com/a-tour-of-machine-learning-algorithms/
Copyright © 2018 by Mark Reynolds 23
Association Rule Learning Algorithms
• Apriori algorithm
• Eclat algorithm
Source: https://machinelearningmastery.com/a-tour-of-machine-learning-algorithms/
Copyright © 2018 by Mark Reynolds 24
Artificial Neural Network Algorithms
• Perceptron
• Back-Propagation
• Hopfield Network
• Radial Basis Function Network (RBFN)
Source: https://machinelearningmastery.com/a-tour-of-machine-learning-algorithms/
Copyright © 2018 by Mark Reynolds 25
Deep Learning Algorithms
• Deep Boltzmann Machine (DBM)
• Deep Belief Networks (DBN)
• Convolutional Neural Network (CNN)
• Stacked Auto-Encoders
Source: https://machinelearningmastery.com/a-tour-of-machine-learning-algorithms/
Copyright © 2018 by Mark Reynolds 26
Dimensionality Reduction Algorithms
• Principal Component Analysis (PCA)
• Principal Component Regression (PCR)
• Partial Least Squares Regression (PLSR)
• Sammon Mapping
• Multidimensional Scaling (MDS)
• Projection Pursuit
• Linear Discriminant Analysis (LDA)
• Mixture Discriminant Analysis (MDA)
• Quadratic Discriminant Analysis (QDA)
• Flexible Discriminant Analysis (FDA)
Source: https://machinelearningmastery.com/a-tour-of-machine-learning-algorithms/
Copyright © 2018 by Mark Reynolds 27
Ensemble Algorithms
• Boosting
• Bootstrapped Aggregation (Bagging)
• AdaBoost
• Stacked Generalization (blending)
• Gradient Boosting Machines (GBM)
• Gradient Boosted Regression Trees
(GBRT)
• Random Forest
Source: https://machinelearningmastery.com/a-tour-of-machine-learning-algorithms/
Copyright © 2018 by Mark Reynolds 28
Algorithms from Specialty Subfields in ML
• Computational intelligence (evolutionary
algorithms, etc.)
• Computer Vision (CV)
• Natural Language Processing (NLP)
• Recommender Systems
• Reinforcement Learning
• Graphical Models
• …
Source: https://machinelearningmastery.com/a-tour-of-machine-learning-algorithms/
Copyright © 2018 by Mark Reynolds 29
Keep Your Eye on the Prize
Data
Information
Knowledge
Understanding
Wisdom
Application
The question is NOT
“How can we … ?”
But instead
“What is the objective?”
( or “Why?” )
Copyright © 2018 by Mark Reynolds 30
And – Keep Your Eye on the Machine
Source: https://blogs.wsj.com/tech-europe/2011/04/26/for-sale-the-making-of-a-fly-only-23698655-93-plus-3-99-shipping/
Copyright © 2018 by Mark Reynolds 31
Mark Reynolds
Mark Reynolds Vitae
• Southwestern Energy
• Lone Star College
• Intent Driven Designs
• Scan Systems
• Sikorsky Aircraft
• General Dynamics
Personal Email: mark@DataDriven.Energy
Linkedin: www.linkedin.com/in/MarkDataDriven
Twitter: @MarkDataDriven
http://DigitalTransformation.Engineer
Copyright © 2018 by Mark Reynolds 32

Azure machine learning

  • 1.
    Mark Reynolds Azure MachineLearning Introduction using the AML Learning Studio Mark Reynolds October 14, 2017
  • 2.
    The Machine LearningParadigm Source: Tom Mitchell, Mitchell, T. (1997). Machine Learning, McGraw Hill. Mark Reynolds, compilation “ A computer program is said to learn from experience (E) with respect to some class of tasks (T) and performance measure (P), if its performance at tasks in T, as measured by P, improves with experience E. ” ~Tom Mitchell Machine Learning is the “Extraction of Wisdom by Understanding the underlying Data” Copyright © 2018 by Mark Reynolds 2
  • 3.
    Buzzwords for 2017 •Big Data • Machine Learning • Blockchain • Tensor Flow Source: Mark Reynolds, compilation Copyright © 2018 by Mark Reynolds 3
  • 4.
    The Shifting ComputingParadigm Descriptive and Formulaic Hypothetical and Investigative Expertise Driven Models and Cases Multivariant Differential Modelling Source: Mark Reynolds, compilation eScience Traditional Science Copyright © 2018 by Mark Reynolds 4
  • 5.
    Becoming Knowledge LearningOrganizations Seismic Drilling Completions Production Data Information Visualization Knowledge Forensics Understanding Analysis & Mining Wisdom Anticipating Application       RT Frac Daily Rpts Well Plan RT Drill Geo- steer AFE RT Prod … Applying the DIKW Pedagogy Source: Mark Reynolds, compilation Copyright © 2018 by Mark Reynolds 5
  • 6.
    Integrated Engineering –System Flow Acquire Analyze Annunciate Archive Analyze Anticipate Apply Data Information Visualization Knowledge Forensics Understanding Analysis & Mining Wisdom Anticipating Application  Creating Informational Accessibility and Transparency  Discovering Experiential Performance Improvements  Segmenting Processes and Process Results  Replacing Human Decision w/ Automated Algorithms  Innovating New Models, Products, Services Source: Mark Reynolds, compilation Real Time Analytics (including ML) Hindsight Insight Foresight Copyright © 2018 by Mark Reynolds 6
  • 7.
    Analytics, Applied Data Information Visualization Knowledge Forensics Understanding Analysis &Mining Wisdom Anticipating Application Source: Mark Reynolds, compilation Hindsight Insight Foresight Descriptive Analytics • What happened? Diagnostic Analytics • Why did it happen? Predictive Analytics • What will happen next? Prescriptive Analytics • What can we do about it? Document and File Tabulate and Summarize Chart and Plot Process and Analyze Copyright © 2018 by Mark Reynolds 7
  • 8.
    The Machine LearningParadigm Unsupervised Learning Supervised Learning Semi-Supervised Learning Reinforcement Learning 24/7 Predictive Analytics Data Mining Machine Learning AI Beware of the 90-foot discovery! Beware of extreme data vetting! Beware of rabbit holes! Beware of over tweaking! Source: Mark Reynolds, compilation Copyright © 2018 by Mark Reynolds 8
  • 9.
    The Machine LearningProcess Source: Introduction to Azure, David Chappel Copyright © 2018 by Mark Reynolds 9
  • 10.
    Validating and ApplyingSuccesses Source: Machine Learning has transformed many aspects of our everyday life, can it do the same for public services?, Natalia Angarita , May 23, 2016, Capgemini https://www.capgemini.com/blog/insights-data-blog/2016/05/machine-learning-has-transformed-many-aspects-of-our-everyday-life Copyright © 2018 by Mark Reynolds 10
  • 11.
    Azure Machine LearningProvides Source: Introduction to Azure, David Chappel Copyright © 2018 by Mark Reynolds 11
  • 12.
    Common Algorithmic Approaches •Decision Tree Learning • Maps observation to conclusions • Association Rule Learning • Discovering interesting relations • Artificial Neural Networks • Incremental function modules • Inductive Logic Programming • Rule based representations for input --> output • Support Vector Machines • Classification and regression • Clustering • Assignment of observations to clusters • Bayesian Networks • Probabilistic models correlating variables • Reinforcement Learning • Finds policy to map states to desired outcome • Representation Learning • Principal component analysis • Similarity & Metric Learning • Pairs of examples train others • Sparse Dictionary Learning • Datum as linear combinations • Genetic Algorithms • Mimics natural heuristics Source: Mark Reynolds, compilation Copyright © 2018 by Mark Reynolds 12
  • 13.
    Copyright © 2018by Mark Reynolds 13
  • 14.
  • 15.
    Azure ML DevelopmentWorkspace Copyright © 2018 by Mark Reynolds 15
  • 16.
    Azure ML DevelopmentWorkspace Copyright © 2018 by Mark Reynolds 16
  • 17.
    Azure Machine Learning Copyright© 2018 by Mark Reynolds 17
  • 18.
    Regression Algorithms • OrdinaryLeast Squares Regression (OLSR) • Linear Regression • Logistic Regression • Stepwise Regression • Multivariate Adaptive Regression Splines (MARS) • Locally Estimated Scatterplot Smoothing (LOESS) Source: https://machinelearningmastery.com/a-tour-of-machine-learning-algorithms/ Copyright © 2018 by Mark Reynolds 18
  • 19.
    Instance-based Algorithms • k-NearestNeighbor (kNN) • Learning Vector Quantization (LVQ) • Self-Organizing Map (SOM) • Locally Weighted Learning (LWL) Source: https://machinelearningmastery.com/a-tour-of-machine-learning-algorithms/ Copyright © 2018 by Mark Reynolds 19
  • 20.
    Regularization Algorithms • RidgeRegression • Least Absolute Shrinkage and Selection Operator (LASSO) • Elastic Net • Least-Angle Regression (LARS) Source: https://machinelearningmastery.com/a-tour-of-machine-learning-algorithms/ Copyright © 2018 by Mark Reynolds 20
  • 21.
    Decision Tree Algorithms •Classification and Regression Tree (CART) • Iterative Dichotomiser 3 (ID3) • C4.5 and C5.0 (different versions of a powerful approach) • Chi-squared Automatic Interaction Detection (CHAID) • Decision Stump • M5 • Conditional Decision Trees Source: https://machinelearningmastery.com/a-tour-of-machine-learning-algorithms/ Copyright © 2018 by Mark Reynolds 21
  • 22.
    Bayesian Algorithms • NaiveBayes • Gaussian Naive Bayes • Multinomial Naive Bayes • Averaged One-Dependence Estimators (AODE) • Bayesian Belief Network (BBN) • Bayesian Network (BN) Source: https://machinelearningmastery.com/a-tour-of-machine-learning-algorithms/ Copyright © 2018 by Mark Reynolds 22
  • 23.
    Clustering Algorithms • k-Means •k-Medians • Expectation Maximisation (EM) • Hierarchical Clustering Source: https://machinelearningmastery.com/a-tour-of-machine-learning-algorithms/ Copyright © 2018 by Mark Reynolds 23
  • 24.
    Association Rule LearningAlgorithms • Apriori algorithm • Eclat algorithm Source: https://machinelearningmastery.com/a-tour-of-machine-learning-algorithms/ Copyright © 2018 by Mark Reynolds 24
  • 25.
    Artificial Neural NetworkAlgorithms • Perceptron • Back-Propagation • Hopfield Network • Radial Basis Function Network (RBFN) Source: https://machinelearningmastery.com/a-tour-of-machine-learning-algorithms/ Copyright © 2018 by Mark Reynolds 25
  • 26.
    Deep Learning Algorithms •Deep Boltzmann Machine (DBM) • Deep Belief Networks (DBN) • Convolutional Neural Network (CNN) • Stacked Auto-Encoders Source: https://machinelearningmastery.com/a-tour-of-machine-learning-algorithms/ Copyright © 2018 by Mark Reynolds 26
  • 27.
    Dimensionality Reduction Algorithms •Principal Component Analysis (PCA) • Principal Component Regression (PCR) • Partial Least Squares Regression (PLSR) • Sammon Mapping • Multidimensional Scaling (MDS) • Projection Pursuit • Linear Discriminant Analysis (LDA) • Mixture Discriminant Analysis (MDA) • Quadratic Discriminant Analysis (QDA) • Flexible Discriminant Analysis (FDA) Source: https://machinelearningmastery.com/a-tour-of-machine-learning-algorithms/ Copyright © 2018 by Mark Reynolds 27
  • 28.
    Ensemble Algorithms • Boosting •Bootstrapped Aggregation (Bagging) • AdaBoost • Stacked Generalization (blending) • Gradient Boosting Machines (GBM) • Gradient Boosted Regression Trees (GBRT) • Random Forest Source: https://machinelearningmastery.com/a-tour-of-machine-learning-algorithms/ Copyright © 2018 by Mark Reynolds 28
  • 29.
    Algorithms from SpecialtySubfields in ML • Computational intelligence (evolutionary algorithms, etc.) • Computer Vision (CV) • Natural Language Processing (NLP) • Recommender Systems • Reinforcement Learning • Graphical Models • … Source: https://machinelearningmastery.com/a-tour-of-machine-learning-algorithms/ Copyright © 2018 by Mark Reynolds 29
  • 30.
    Keep Your Eyeon the Prize Data Information Knowledge Understanding Wisdom Application The question is NOT “How can we … ?” But instead “What is the objective?” ( or “Why?” ) Copyright © 2018 by Mark Reynolds 30
  • 31.
    And – KeepYour Eye on the Machine Source: https://blogs.wsj.com/tech-europe/2011/04/26/for-sale-the-making-of-a-fly-only-23698655-93-plus-3-99-shipping/ Copyright © 2018 by Mark Reynolds 31
  • 32.
    Mark Reynolds Mark ReynoldsVitae • Southwestern Energy • Lone Star College • Intent Driven Designs • Scan Systems • Sikorsky Aircraft • General Dynamics Personal Email: mark@DataDriven.Energy Linkedin: www.linkedin.com/in/MarkDataDriven Twitter: @MarkDataDriven http://DigitalTransformation.Engineer Copyright © 2018 by Mark Reynolds 32