Linear, Machine Learning and Probabilistic Approaches for Predictive Analytics
B.M.Pavlyshenko (Ph.D.)
SoftServe, Ivan Franko National University of Lviv
e-mail: b.pavlyshenko@gmail.com
LINEAR MODELS AND MACHINE LEARNING APPROACHES
Typical time series of store sales
Time series forecasting by different methods
BAYESIAN INFERENCE
Density of distributions of regression coefficients.
Box plots for regression coefficients.
Winning Solution for the Grupo Bimbo Inventory Demand Kaggle Competition
Most important features
Multilevel predictive model
Analysis of Perishable Products Sales Using Bayesian Inference
Profit = SoldAmount * (Price - MarginalPrice) - UnsoldAmount * MarginalPrice
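The profit formula above can be sketched in code as follows. This is only an illustration: the function names, the idea of averaging profit over a list of demand samples (for example, draws from a posterior predictive distribution), and all numbers are assumptions that do not come from the slides.

```python
def profit(sold_amount, unsold_amount, price, marginal_price):
    """Profit = SoldAmount * (Price - MarginalPrice) - UnsoldAmount * MarginalPrice."""
    return sold_amount * (price - marginal_price) - unsold_amount * marginal_price


def expected_profit(stock, demand_samples, price, marginal_price):
    """Average profit over demand samples for a given stock level (hypothetical helper)."""
    total = 0.0
    for demand in demand_samples:
        sold = min(stock, demand)  # cannot sell more than is stocked
        total += profit(sold, stock - sold, price, marginal_price)
    return total / len(demand_samples)


def best_stock(candidates, demand_samples, price, marginal_price):
    """Pick the candidate stock level with the highest expected profit."""
    return max(
        candidates,
        key=lambda s: expected_profit(s, demand_samples, price, marginal_price),
    )


if __name__ == "__main__":
    # Illustrative numbers only.
    print(profit(80, 20, 5.0, 3.0))                    # 100.0
    print(expected_profit(100, [80, 120], 5.0, 3.0))   # 150.0
    print(best_stock([80, 100, 120], [80, 120], 5.0, 3.0))
```

For a perishable product every unsold unit is lost at its marginal price, so the stock level that maximizes expected profit depends on the whole demand distribution rather than on a point forecast alone.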
Machine Learning, Linear and Bayesian Models for Logistic Regression in Failure Detection Problems
MACHINE LEARNING MODEL
The most important features and their gain values:
Matthews correlation coefficient (MCC):
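In terms of confusion-matrix counts, the standard definition is MCC = (TP*TN - FP*FN) / sqrt((TP+FP)(TP+FN)(TN+FP)(TN+FN)). A minimal sketch of that definition is given below; the function name and the convention of returning 0 when the denominator is zero are assumptions, not taken from the slides.

```python
import math


def mcc(tp, tn, fp, fn):
    """Matthews correlation coefficient from confusion-matrix counts.

    Returns 0.0 when any marginal total is zero (a common convention,
    assumed here), since the ratio is undefined in that case.
    """
    denom = math.sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    if denom == 0:
        return 0.0
    return (tp * tn - fp * fn) / denom


if __name__ == "__main__":
    print(mcc(5, 5, 0, 0))   # perfect prediction: 1.0
    print(mcc(0, 0, 5, 5))   # fully inverted prediction: -1.0
    print(mcc(6, 3, 1, 2))   # partial agreement
```

Unlike accuracy, MCC uses all four cells of the confusion matrix, which makes it a reasonable score for imbalanced problems such as failure detection.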
ROC curve for classification results (AUC = 0.753).
Matthews correlation coefficient for logistic regression for different values of the probability threshold.
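The two diagnostics named above, AUC and MCC as a function of the probability threshold, can be computed along these lines. The sketch is self-contained and uses made-up labels and scores; the function names and the rule "predict 1 when the probability is at least the threshold" are assumptions.

```python
import math


def mcc(tp, tn, fp, fn):
    """Matthews correlation coefficient; 0.0 if the denominator is zero."""
    denom = math.sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    return (tp * tn - fp * fn) / denom if denom else 0.0


def confusion_counts(y_true, probs, threshold):
    """Count TP, TN, FP, FN when predicting 1 for probs >= threshold."""
    tp = tn = fp = fn = 0
    for y, p in zip(y_true, probs):
        pred = 1 if p >= threshold else 0
        if pred == 1 and y == 1:
            tp += 1
        elif pred == 0 and y == 0:
            tn += 1
        elif pred == 1 and y == 0:
            fp += 1
        else:
            fn += 1
    return tp, tn, fp, fn


def mcc_by_threshold(y_true, probs, thresholds):
    """Return (threshold, MCC) pairs for each candidate threshold."""
    return [(t, mcc(*confusion_counts(y_true, probs, t))) for t in thresholds]


def auc(y_true, probs):
    """AUC as the share of (positive, negative) pairs ranked correctly; ties count 0.5."""
    pos = [p for y, p in zip(y_true, probs) if y == 1]
    neg = [p for y, p in zip(y_true, probs) if y == 0]
    wins = sum(
        1.0 if pp > pn else 0.5 if pp == pn else 0.0
        for pp in pos
        for pn in neg
    )
    return wins / (len(pos) * len(neg))


if __name__ == "__main__":
    y = [0, 0, 1, 1]                 # illustrative labels
    p = [0.1, 0.4, 0.35, 0.8]        # illustrative predicted probabilities
    print(auc(y, p))                 # 0.75
    for t, score in mcc_by_threshold(y, p, [0.3, 0.5, 0.7]):
        print(t, round(score, 3))
```

AUC summarizes ranking quality over all thresholds, whereas the MCC curve helps choose the single threshold at which the classifier is actually operated.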
ROC curve and Matthews correlation coefficient for different sets of features
Feature set 1: AUC = 0.75
Feature set 2: AUC = 0.91
GENERALIZED LINEAR MODEL
Dependence of the AUC value on Lambda.
Coefficients of the generalized linear model for logistic regression (Lambda = 0.03).
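BAYESIAN MODEL
model {
  # Likelihood: Bernoulli outcome with a logit link on the linear predictor
  for (i in 1:n) {
    y[i] ~ dbern(p[i])
    logit(p[i]) <- b0 + inprod(b[], x[i,])
  }
  # Vague priors; dnorm takes (mean, precision), so 0.0001 means variance 10^4
  b0 ~ dnorm(0, 0.0001)
  for (j in 1:nfeat) {
    b[j] ~ dnorm(0, 0.0001)
  }
}
Probabilistic model for logistic regression using BUGS syntax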
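To make the model above concrete, the sketch below evaluates its unnormalized log posterior and draws samples with a plain random-walk Metropolis sampler. This is a swapped-in illustration, not the sampler BUGS itself uses; the function names, step size, iteration count, and toy data are all assumptions.

```python
import math
import random


def log_posterior(b0, b, xs, ys, prior_precision=0.0001):
    """Unnormalized log posterior: Bernoulli-logit likelihood, Normal(0, 1/precision) priors."""
    lp = -0.5 * prior_precision * (b0 * b0 + sum(bj * bj for bj in b))
    for x, y in zip(xs, ys):
        z = b0 + sum(bj * xj for bj, xj in zip(b, x))
        # y*z - log(1 + exp(z)), written in a numerically stable form
        lp += y * z - (max(z, 0.0) + math.log1p(math.exp(-abs(z))))
    return lp


def metropolis(xs, ys, n_iter=4000, step=0.3, seed=0):
    """Random-walk Metropolis over (b0, b[0], ..., b[nfeat-1]); returns all samples."""
    rng = random.Random(seed)
    theta = [0.0] * (len(xs[0]) + 1)
    current = log_posterior(theta[0], theta[1:], xs, ys)
    samples = []
    for _ in range(n_iter):
        proposal = [t + rng.gauss(0.0, step) for t in theta]
        cand = log_posterior(proposal[0], proposal[1:], xs, ys)
        # Accept with probability min(1, exp(cand - current))
        if rng.random() < math.exp(min(0.0, cand - current)):
            theta, current = proposal, cand
        samples.append(list(theta))
    return samples


if __name__ == "__main__":
    # Toy data (illustrative): larger x tends to go with y = 1.
    xs = [[-2.0], [-1.5], [-1.0], [-0.5], [0.5], [1.0], [1.5], [2.0]]
    ys = [0, 0, 0, 1, 0, 1, 1, 1]
    kept = metropolis(xs, ys)[1000:]  # drop burn-in
    print("posterior mean intercept:", sum(s[0] for s in kept) / len(kept))
    print("posterior mean slope:", sum(s[1] for s in kept) / len(kept))
```

The retained samples play the role of the MCMC output from which trace plots, density estimates, and box plots of the coefficients are drawn.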
Trace plot for the Intercept parameter.
Probability density function for the Intercept parameter.
Box plots for logistic regression coefficients.
Combining Machine Learning with Linear and Bayesian Models
Forecasting of Social, Market and Financial Trends
The analysis of financial tweets
Formation of frequent keyword sets with the highest support values.
Analysis of the causal relationship between the frequent sets in tweets and Apple stock prices.
The results obtained show that it is possible to predict stock prices on the basis of data mining of information streams in social networks.
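The notion of frequent keyword sets and their support, used in the tweet analysis above, can be illustrated with a brute-force count. The sketch assumes that support means the share of messages containing every keyword of a set; the function name, the size limit, and the toy messages are hypothetical.

```python
from collections import Counter
from itertools import combinations


def frequent_sets(messages, max_size=2, min_support=0.5):
    """Return (keyword_set, support) pairs, highest support first.

    messages: list of keyword lists, one per tweet.
    Support is the fraction of messages that contain all keywords of the set.
    """
    counts = Counter()
    for words in messages:
        unique = sorted(set(words))  # count each keyword once per message
        for size in range(1, max_size + 1):
            counts.update(combinations(unique, size))
    n = len(messages)
    kept = {s: c / n for s, c in counts.items() if c / n >= min_support}
    return sorted(kept.items(), key=lambda kv: (-kv[1], kv[0]))


if __name__ == "__main__":
    # Toy messages, already tokenized (illustrative only).
    tweets = [
        ["apple", "stock", "up"],
        ["apple", "stock"],
        ["apple", "iphone"],
        ["stock", "down"],
    ]
    for keywords, support in frequent_sets(tweets):
        print(keywords, support)
```

Enumerating all combinations is only practical for short messages and small set sizes; larger corpora usually call for an Apriori-style pruning step.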
Examples of test studies of semantic concepts in Twitter messages
Royal baby's name forecasting
The name George dominated the spectrum of names before the official announcement.
The first 10 frequent sets were formed from five names, three of which are the components of the Prince's full name, George Alexander Louis.
User communities that formed the discussion trends.
Thank you!
