Predicting the Future with Machine Learning & Big Data

Data Driven Innovation
Codemotion
Antimo Musone
IT Manager
20 May 2016

About Me
►Antimo Musone
 IT Manager / Architect at EY
 Co-Founder, Fifth Ingenum Srls.
 Computer Engineering, II Università degli Studi di Napoli
 email: antimo.musone@it.ey.com

Contents
►What is Machine Learning?
►Predictive Analytics
►Machine Learning Overview
►Defining Predictive Analytics
►Supervised Learning
►Unsupervised Learning
►Watson Service
►Cortana Analytics Suite
►Demo
►Demo

What is Machine Learning?

Machine Learning / Predictive Analytics
Machine learning & predictive analytics are core capabilities that are needed throughout your business:
► Vision analytics
► Recommendation engines
► Advertising analysis
► Weather forecasting for business planning
► Social network analysis
► Legal discovery and document archiving
► Pricing analysis
► Fraud detection
► Churn analysis
► Equipment monitoring
► Location-based tracking and services
► Personalized insurance

Machine Learning Overview
► Formal definition: “The field of machine learning is concerned with the
question of how to construct computer programs that automatically improve
with experience” - Tom M. Mitchell
► Another definition: “The goal of machine learning is to program computers to
use example data or past experience to solve a given problem.” – Introduction to
Machine Learning, 2nd Edition, MIT Press
► ML often involves two primary techniques:
  ► Supervised Learning: Finding the mapping between inputs and outputs using correct values to “train” a model
  ► Unsupervised Learning: Finding patterns in the input data (similar to Density Estimates in Statistics)

Machine Learning
Data:
A B C D E F G H I J K L M N O P Q R S T U V W X Y Z
Rules, or Algorithms:
► Spelling and sounding build words: "about", "Learning", "language"
► Words build sentences: "Learning about language."
Learning, or Abstraction:
► Any new understanding proceeds from previous knowledge.
Data + Rules/Algorithms = Machine Learning

Traditional Programming vs Machine Learning
► Traditional Programming: Data + Program → Computer → Output
► Machine Learning: Data + Output → Computer → Program/Algorithms
The program can predict the output!
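The contrast on this slide can be sketched in a few lines of Python. This is only an illustration under assumed data: the temperature conversion task, the sample values, and the helper names (`celsius_to_fahrenheit`, `fit_line`, `learned_program`) are hypothetical and not part of the original talk. The first function is a hand-written program; the second part recovers an equivalent program from data and outputs alone, using ordinary least squares.

```python
# Traditional programming: a human writes the rule (the "program").
def celsius_to_fahrenheit(c):
    return 1.8 * c + 32.0


# Machine learning: only data (inputs) and outputs are given, and the
# rule is estimated from them with ordinary least squares.
def fit_line(xs, ys):
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov_xy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    slope = cov_xy / var_x
    intercept = mean_y - slope * mean_x
    return slope, intercept


# Hypothetical observations: inputs and the outputs that go with them.
data = [-10.0, 0.0, 15.0, 30.0, 100.0]
output = [celsius_to_fahrenheit(c) for c in data]

slope, intercept = fit_line(data, output)


def learned_program(c):
    """The 'program' produced by learning, usable on new inputs."""
    return slope * c + intercept
```

Because the sample outputs here are noise-free, the learned slope and intercept match the hand-written rule; with real, noisy data they would only approximate it.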

ML: No, more like gardening
Gardener = You
Seeds = Algorithms
Nutrients = Data
Plants = Programs

ML Sample Applications
► Web search
► Computational biology
► Finance
► E-commerce
► Space exploration
► Robotics
► Information extraction
► Social networks
► Debugging
► [Your favorite area]

What is Predictive Analytics?
Wikipedia Definition: (http://en.wikipedia.org/wiki/Predictive_analytics)
“Predictive analytics encompasses a variety of techniques from
statistics, modeling, machine learning, and data mining that analyze
current and historical facts to make predictions about future, or
otherwise unknown, events.”
Facts → Predictive Analytics Techniques → Predictions

Breaking it Down
“Predictive analytics encompasses a variety of techniques from
statistics, modeling, machine learning, and data mining that analyze
current and historical facts to make predictions about future, or otherwise
unknown, events.”
Machine Learning: Use of computer algorithms to derive complex formulations based on objectives and constraints
► Tools and Techniques: Data visualization, segmentation, correlations
► Use in Predictive Analytics: Predictive analytics is often applied in the context of datasets that are too large for manual analysis, so data mining techniques are required
Statistics: Focus on learning population characteristics based on samples of data
► Tools and Techniques: p-values, confidence intervals, sampling, ANOVA
► Underlying theory behind many parametric models: observed facts are a sample from a population including both known/historic and unknown/future events
Modeling: Representations of systems used to understand the underlying dynamics of the system
► Tools and Techniques: Symbolic logic, proxies
► Complex relationships can be simplified through modeling; these models can then be used to analyze relationships between factors

What is a Model?
A model is a simplified representation of observed
effects
Key terms:
 Dependent or target variable – the variable of interest
 Independent or predictor variable(s) – variable(s) used
for explanation/prediction
 Effect – the (quantitative) impact of an independent
variable or combination of independent variables on the
dependent variable
 Main Effect – The direct effect of a single independent variable
on the dependent variable
 Interaction Effect – The effect of a combination of multiple
independent variables on the dependent variable
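To make the difference between a main effect and an interaction effect concrete, here is a minimal sketch. The model form and all coefficient values (`b0`, `b1`, `b2`, `b12`) are assumptions chosen for illustration, not values from the talk. With a non-zero interaction term, the effect of changing `x1` depends on the value of `x2`.

```python
def predict(x1, x2, b0=1.0, b1=2.0, b2=3.0, b12=0.5):
    """Dependent variable as a function of two independent variables.

    b1 and b2 are main effects; b12 is the interaction effect.
    All coefficients are hypothetical.
    """
    return b0 + b1 * x1 + b2 * x2 + b12 * x1 * x2


# Effect of raising x1 by one unit, measured at two different levels of x2.
effect_x1_when_x2_is_0 = predict(1, 0) - predict(0, 0)  # main effect only
effect_x1_when_x2_is_2 = predict(1, 2) - predict(0, 2)  # main + interaction
```

At `x2 = 0` the change equals the main effect `b1`; at `x2 = 2` it is `b1 + 2 * b12`, which is what an interaction effect means in practice.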

Two types of model
A model is a simplified representation of observed
effects
Statistical: Parametric Models
► Effects are well-quantified and can be examined
► An equation can be used to represent the model
► Emphasis on explanation: “What causes the dependent variable to change?”
► Test hypotheses: p-values, confidence intervals
Machine Learning: Non-parametric Models
► Effects may be unquantified (“black box”)
► No representative equation
► Model may be stochastic, so results may vary
► Emphasis on prediction: “What will the value of the next observation be?”
► Generate hypotheses

Types of Learning
► Supervised (inductive) learning
  ► Training data includes desired outputs
  ► Dependent variable is known
  ► May be statistical or non-statistical
► Unsupervised learning
  ► Training data does not include desired outputs
  ► No dependent variable
  ► Non-statistical
► Semi-supervised learning
  ► Training data includes a few desired outputs

Machine Learning Problem
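► Supervised Learning, discrete output: Classification or Categorization
► Supervised Learning, continuous output: Regression
► Unsupervised Learning, discrete output: Clustering
► Unsupervised Learning, continuous output: Dimensionality reduction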

What is Logistic Regression?
Regression models are a form of supervised learning that attempts to fit
“linear” functions to training data. The most common type of regression,
linear regression, should be familiar to most of you as a “best fit line”.
Logistic regression is closely related to linear regression, but fits an
S-shaped (logistic) curve by applying a logit link function to a binomially
distributed dependent variable, so its output can be read as a probability.
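A minimal sketch of logistic regression with one feature, fitted by plain gradient descent on the log-loss. The data set (hours studied versus pass/fail), the learning rate, and the number of epochs are all hypothetical choices for illustration; a production system would normally rely on a tested library rather than this hand-rolled loop.

```python
import math


def sigmoid(z):
    """Logistic function: maps any real number into (0, 1)."""
    return 1.0 / (1.0 + math.exp(-z))


def fit_logistic(xs, ys, lr=0.1, epochs=5000):
    """Fit P(y = 1 | x) = sigmoid(w * x + b) by gradient descent."""
    w, b = 0.0, 0.0
    n = len(xs)
    for _ in range(epochs):
        # Gradient of the average log-loss with respect to w and b.
        grad_w = sum((sigmoid(w * x + b) - y) * x for x, y in zip(xs, ys)) / n
        grad_b = sum((sigmoid(w * x + b) - y) for x, y in zip(xs, ys)) / n
        w -= lr * grad_w
        b -= lr * grad_b
    return w, b


# Hypothetical data: hours studied -> passed (1) or failed (0).
hours = [0.5, 1.0, 1.5, 2.0, 2.5, 3.0, 3.5, 4.0, 4.5, 5.0]
passed = [0, 0, 0, 0, 1, 0, 1, 1, 1, 1]

w, b = fit_logistic(hours, passed)
p_low = sigmoid(w * 1.0 + b)   # estimated pass probability after 1 hour
p_high = sigmoid(w * 4.5 + b)  # estimated pass probability after 4.5 hours
```

Unlike a best-fit line, the fitted curve never leaves the interval (0, 1), which is why it suits a yes/no dependent variable.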

Machine Learning Example
Given examples of a function (X, F(X)), predict function F(X) for new examples X
► Discrete F(X): Classification
► Continuous F(X): Regression
► F(X) = Probability(X): Probability estimation
The probability of an event X, denoted P(X), represents the proportion of all
events that have X as their outcome, and is typically represented as a
decimal 0 ≤ P(X) ≤ 1

Machine Learning Example
Apply a prediction function to a feature representation of the image to get the
desired output:
• Training: given a training set of labeled examples {(x1,y1), …, (xN,yN)}, estimate the prediction
function f by minimizing the prediction error on the training set
• Testing: apply f to a never before seen test example x and output the predicted value y = f(x)
y = f(x), where y is the output, f is the prediction function, and x is the image feature
f([image]) = «apple»
f([image]) = «tomato»
f([image]) = «dog»
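The training and testing steps can be sketched as follows. This is an assumption-laden illustration: the two-number image features (redness, roundness) are invented, and a nearest-centroid classifier stands in for the prediction function f. It is a simple stand-in, not a general error-minimizing procedure.

```python
def train(examples):
    """Estimate f from labeled examples [(x, y), ...] as one centroid per label."""
    sums, counts = {}, {}
    for x, y in examples:
        acc = sums.setdefault(y, [0.0] * len(x))
        for i, value in enumerate(x):
            acc[i] += value
        counts[y] = counts.get(y, 0) + 1
    return {y: [s / counts[y] for s in acc] for y, acc in sums.items()}


def predict(centroids, x):
    """Apply f to a feature vector x: return the label of the closest centroid."""
    def sq_dist(c):
        return sum((a - b) ** 2 for a, b in zip(x, c))
    return min(centroids, key=lambda y: sq_dist(centroids[y]))


# Hypothetical image features: (redness, roundness).
training_set = [
    ((0.90, 0.90), "tomato"), ((0.80, 0.95), "tomato"),
    ((0.60, 0.80), "apple"), ((0.50, 0.75), "apple"),
    ((0.20, 0.10), "dog"), ((0.30, 0.20), "dog"),
]

f = train(training_set)                  # training
prediction = predict(f, (0.25, 0.15))    # testing on an unseen example
```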

Supervised Learning
 Used when you want to predict unknown answers from answers
you already have
 Data is divided into two parts: the data you will use to “teach” the
system (training set), and the data to test the algorithm (test set)
 After you select and clean the data, you select data points that show
the right relationships in the data. The answers are “labels”, the
categories/columns/attributes are “features” and the values
are…values.
 Then you select an algorithm to compute the outcome. (Often you
choose more than one)
 You run the program on the training set, and check to see if you got the
right answer on the test set.
 Once you perform the experiment, you select the best model. This is
the final output – the model is then used against more data to get the
answers you need
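The workflow above (split the data, try more than one algorithm, check against the test set, keep the best model) might look like this in miniature. The data, the split point, and the two candidate algorithms are hypothetical and deliberately tiny.

```python
def majority_class(train):
    """Baseline algorithm: always predict the most common training label."""
    labels = [y for _, y in train]
    winner = max(set(labels), key=labels.count)
    return lambda x: winner


def nearest_neighbor(train):
    """Predict the label of the closest training example."""
    def model(x):
        closest = min(train, key=lambda ex: abs(ex[0] - x))
        return closest[1]
    return model


def accuracy(model, test):
    """Fraction of test examples whose label is predicted correctly."""
    return sum(model(x) == y for x, y in test) / len(test)


# Hypothetical labeled data: (feature value, label).
data = [(1.0, "no"), (1.5, "no"), (2.0, "no"), (6.0, "yes"), (6.5, "yes"),
        (2.5, "no"), (7.0, "yes"), (1.2, "no"), (6.8, "yes"), (2.2, "no")]
train_set, test_set = data[:6], data[6:]

candidates = {"majority class": majority_class,
              "1-nearest neighbor": nearest_neighbor}
scores = {name: accuracy(algorithm(train_set), test_set)
          for name, algorithm in candidates.items()}
best = max(scores, key=scores.get)  # the model selected as final output
```

In practice the split is usually randomized and the candidates are real learning algorithms; the fixed split here just keeps the example reproducible.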

Supervised Learning
 Car
 Not Car

Unsupervised Learning
 Used when you want to find unknown answers –
mostly groupings - directly from data
 No simple way to evaluate accuracy of what you learn
 Evaluates more vectors, groups into sets or classifications
 Start with the data
 Apply algorithm
 Evaluate groups
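The three steps (start with the data, apply an algorithm, evaluate the groups) can be illustrated with k-means clustering. This is one possible algorithm chosen for the sketch, not one named in the talk, and the points and starting centroids are hypothetical.

```python
def kmeans(points, centroids, iterations=10):
    """Plain k-means: alternate between assigning points and moving centroids."""
    clusters = []
    for _ in range(iterations):
        # Assignment step: each point joins the cluster of its nearest centroid.
        clusters = [[] for _ in centroids]
        for p in points:
            nearest = min(
                range(len(centroids)),
                key=lambda i: sum((a - b) ** 2 for a, b in zip(p, centroids[i])),
            )
            clusters[nearest].append(p)
        # Update step: move each centroid to the mean of its cluster
        # (an empty cluster keeps its previous centroid).
        centroids = [
            tuple(sum(dim) / len(cluster) for dim in zip(*cluster)) if cluster else c
            for cluster, c in zip(clusters, centroids)
        ]
    return centroids, clusters


# Start with the data (no labels).
points = [(1.0, 1.0), (1.5, 2.0), (1.0, 1.5), (8.0, 8.0), (9.0, 8.5), (8.5, 9.0)]

# Apply the algorithm, seeding it with two of the points.
centroids, clusters = kmeans(points, centroids=[points[0], points[3]])

# Evaluate the groups, here simply by inspecting their sizes.
sizes = [len(c) for c in clusters]
```

Since there are no labels, judging whether these groups are "right" remains a matter of inspection and domain knowledge, as the slide notes.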

Unsupervised Learning
[Figure: clustering panels labeled Example 1, Example 2, Example 3 and example A, example B, example C]
Clustering strategies tend to group points transitively, even if the points
are not nearby in feature space

Cross-Validation and Model Evaluation
Cross-validation is a method of ensuring that models generalize to data
they have not been trained to fit
 Given any collection of data points, a model can be developed that fits
the data exactly; however, such an overfitted model will typically have
little or no predictive power on new data
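A small sketch of the point above, using k-fold cross-validation. The data and the `memorizer` model are hypothetical: the memorizer reproduces its training data exactly, so its training error is zero, yet its error on held-out points is clearly not.

```python
def k_fold_splits(data, k):
    """Yield (train, held_out) pairs; each point is held out exactly once."""
    for i in range(k):
        held_out = data[i::k]
        train = [d for j, d in enumerate(data) if j % k != i]
        yield train, held_out


def memorizer(train):
    """Fits the training data exactly: returns the y of the closest known x."""
    def model(x):
        return min(train, key=lambda d: abs(d[0] - x))[1]
    return model


def mean_abs_error(model, rows):
    return sum(abs(model(x) - y) for x, y in rows) / len(rows)


# Hypothetical noisy observations of roughly y = 2x.
data = [(0, 0.3), (1, 1.8), (2, 4.4), (3, 5.7), (4, 8.5), (5, 9.6),
        (6, 12.4), (7, 13.9), (8, 16.2), (9, 17.8)]

# Error measured on the same data the model was fitted to.
training_error = mean_abs_error(memorizer(data), data)

# Error measured only on points the model has not seen (5 folds).
cv_error = sum(mean_abs_error(memorizer(train), held_out)
               for train, held_out in k_fold_splits(data, k=5)) / 5
```

The gap between `training_error` and `cv_error` is the signal cross-validation is designed to expose.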

Evaluating Predictive Models
Model evaluation involves a combination of objective criteria and
subjective judgment
Objective Measures
► Gain or Lift
► Sensitivity
► Accuracy
► Others
Subjective Considerations
► Business intuition
► Explainability
► Simplicity
► Usefulness

Gain or Lift
Lift is a measure of the effectiveness of a predictive model calculated as
the ratio between the results obtained with and without the predictive
model.
 Cumulative gains and lift charts are visual aids for measuring model
performance
 Both charts consist of a lift curve and a baseline
 The greater the area between the lift curve and the baseline, the
better
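As a worked example of the ratio described above, the sketch below computes lift for the top-scored fraction of a population. The scores and outcomes are invented, and `lift_at` is a hypothetical helper name.

```python
def lift_at(scores, outcomes, fraction):
    """Response rate in the top-scored fraction divided by the overall rate."""
    ranked = sorted(zip(scores, outcomes), key=lambda pair: pair[0], reverse=True)
    top_n = max(1, int(len(ranked) * fraction))
    top_rate = sum(outcome for _, outcome in ranked[:top_n]) / top_n
    baseline_rate = sum(outcomes) / len(outcomes)
    return top_rate / baseline_rate


# Hypothetical model scores and actual responses (1 = responded).
scores = [0.95, 0.90, 0.80, 0.70, 0.60, 0.50, 0.40, 0.30, 0.20, 0.10]
outcomes = [1, 1, 0, 1, 0, 0, 0, 1, 0, 0]

lift_top_20 = lift_at(scores, outcomes, 0.2)   # with the model: top 20% by score
lift_everyone = lift_at(scores, outcomes, 1.0)  # whole population: the baseline
```

Here the top 20% responds at a rate of 1.0 against an overall rate of 0.4, so targeting by model score is 2.5 times as effective as targeting at random; over the whole population the lift is 1 by construction.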

Sensitivity
A Receiver Operating Characteristic (ROC) curve is a plot of test
sensitivity as a function of (1 - specificity) for several possible (arbitrary)
cut-off values. The curve illustrates the trade-off between type I and type
II errors in a given test.
 The closer the curve follows the left-hand border and then the top border
of the ROC space, the more accurate the test. The area under the curve is
a measure of accuracy.
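The construction of a ROC curve and its area can be sketched directly from the definition. The scores and labels are hypothetical; each distinct score is used as a cut-off, and the area is computed with the trapezoidal rule.

```python
def roc_points(scores, labels):
    """Return (1 - specificity, sensitivity) pairs, one per cut-off value."""
    positives = sum(labels)
    negatives = len(labels) - positives
    points = [(0.0, 0.0)]
    for cutoff in sorted(set(scores), reverse=True):
        # Everything scored at or above the cut-off is predicted positive.
        tp = sum(1 for s, y in zip(scores, labels) if s >= cutoff and y == 1)
        fp = sum(1 for s, y in zip(scores, labels) if s >= cutoff and y == 0)
        points.append((fp / negatives, tp / positives))
    return points


def auc(points):
    """Area under the curve by the trapezoidal rule."""
    return sum((x2 - x1) * (y1 + y2) / 2
               for (x1, y1), (x2, y2) in zip(points, points[1:]))


# Hypothetical classifier scores and true labels (1 = positive).
scores = [0.9, 0.8, 0.7, 0.6, 0.55, 0.4, 0.3, 0.2]
labels = [1, 1, 0, 1, 0, 1, 0, 0]

curve = roc_points(scores, labels)
area = auc(curve)
```

An area of 0.5 corresponds to the diagonal (random guessing) and 1.0 to a curve that hugs the left and top borders.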

Cortana Analytics Suite

Data Flow and Architecture
► Event & data producers: Web logs; IoT, mobile devices etc.; Social data
► Ingest / Transform: Event Hubs, Stream Analytics, HDInsight, Azure Data Factory
► DW / Long-term storage: Azure SQL DB, Azure Data Lake, Azure SQL DW
► Predictive analytics: Azure Machine Learning (fraud detection etc.)
► Present & decide: Power BI, web dashboards, mobile devices

Process real-time data in Azure using a simple SQL language
Consumes millions of real-time events from Event Hub collected
from devices, sensors, infrastructure, and applications
Performs time-sensitive analysis using SQL-like language against
multiple real-time streams and reference data
Outputs to persistent stores, dashboards or back to devices
Example devices: Point of Service Devices, Self Checkout Stations, Kiosks, Smart Phones, Slates/Tablets, PCs/Laptops, Servers, Digital Signs, Diagnostic Equipment, Remote Medical Monitors, Logic Controllers, Specialized Devices, Thin Clients, Handhelds, Security, POS Terminals, Automation Devices, Vending Machines, Kinect, ATM
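To give a feel for the time-sensitive analysis mentioned above, here is a plain Python stand-in for the kind of windowed aggregation a SQL-like streaming query expresses. It is not the Stream Analytics query language and does not call any Azure API; the event tuples, device names, and the `tumbling_window_average` helper are all hypothetical.

```python
from collections import defaultdict


def tumbling_window_average(events, window_seconds):
    """Average value per (device, window start) over fixed, non-overlapping windows.

    Each event is a (timestamp_seconds, device, value) tuple.
    """
    totals = defaultdict(lambda: [0.0, 0])
    for timestamp, device, value in events:
        window_start = (timestamp // window_seconds) * window_seconds
        bucket = totals[(device, window_start)]
        bucket[0] += value
        bucket[1] += 1
    return {key: total / count for key, (total, count) in totals.items()}


# Hypothetical readings from two devices.
events = [(0, "kiosk-1", 20.0), (4, "kiosk-1", 22.0), (7, "atm-7", 30.0),
          (11, "kiosk-1", 26.0), (12, "atm-7", 34.0), (19, "atm-7", 36.0)]

averages = tumbling_window_average(events, window_seconds=10)
```

A real streaming engine computes such windows incrementally as events arrive; this batch version only shows what the result of the aggregation looks like.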

Fully managed service to support orchestration of data
movement and processing
Connect to relational or non-relational data that is on-premises or in
the cloud
Single pane of glass to monitor and manage data
processing pipelines.
Publish to Power BI
Compose and orchestrate data services at scale
Diagram labels: C#, MapReduce, Hive, Pig, Stored Procedures, Machine Learning; Trusted data; BI & analytics

Azure Machine Learning
Powerful predictive analytics in Azure
ML algorithms are best of breed and embrace OSS
• MS + R + Python + BYOA (bring your own algorithm)
ML Studio for productive development
• Faster experiments result in faster improvements
• Visual workflows & ML experiments
ML Operationalization to remove deployment friction
• Build entire ML apps & deploy as cloud APIs
ML Gallery
• Provide ML applications like apps in an ‘app store’
• Publish/consume APIs in a two-sided market
Help organizations eliminate undifferentiated heavy lifting

Power BI investments
New data visualizations and
touch-optimized exploration in
HTML5
Power BI mobile apps across
devices including iPad and
iPhone
Support for new data sources
including SalesForce.com,
Dynamics CRM online and SQL
Server Analysis Services
Dashboard
Tree Map
Power BI dashboards and KPIs for monitoring the health of your business

Vehicle Telemetry Architecture
 Event Hubs for ingesting millions of vehicle
telemetry events into Azure.
 Stream Analytics for gaining real-time
insights on vehicle health and persisting that
data into long-term storage for richer batch
analytics.
 Machine Learning for anomaly detection in
real-time and batch processing to gain
predictive insights.
 HDInsight is leveraged to transform data
at scale
 Data Factory handles orchestration,
scheduling, resource management and
monitoring of the batch processing
pipeline.
 Power BI gives this solution a rich
dashboard for real-time data and predictive
analytics visualizations.

Microsoft Azure Machine Learning
 Data: It’s all about the data. Here’s where you will acquire, compile, and
analyze testing and training data sets for use in creating Azure Machine
Learning predictive models.
 Create the model: Use various machine learning algorithms to create new
models that are capable of making predictions based on inferences about the
data sets.
 Evaluate the model: Examine the accuracy of new predictive models based
on their ability to predict the correct outcome when both the input and output
values are known in advance. Accuracy is expressed as a confidence factor;
the closer it is to 1, the more accurate the model.
 Refine and evaluate the model: Compare, contrast, and combine alternate
predictive models to find the right combination(s) that can consistently
produce the most accurate results.
 Deploy the model: Expose the new predictive model as a scalable cloud web
service, one that is easily accessible over the Internet by any web browser or
mobile client.
 Test and use the model: Implement the new predictive model web service in
a test or production application scenario.

Azure Machine Learning algorithms
 Classification algorithms: These are used to classify data into
different categories that can then be used to predict one or more
discrete variables, based on the other attributes in the dataset.
 Regression algorithms: These are used to predict one or more
continuous variables, such as profit or loss, based on other attributes
in the dataset.
 Clustering algorithms: These determine natural groupings and
patterns in datasets and are used to predict grouping classifications
for a given variable.
