SlideShare a Scribd company logo
1 of 32
1
INFORMS Philadelphia
November 2015
Bin Weng ( Email: bzw0018@auburn.edu)
Ph.D. Candidate of Industrial and System Engineering
Mohamed A. Ahmed (Email: mza0068@auburn.edu)
M.S. Candidate of Industrial and System Engineering
Fadel M. Megahed (Email: fmegahed@auburn.edu)
Assistant Professor of Industrial and System Engineering
Stock Market Prediction Using
Disparate Data Sources
2Stock Market Prediction
 Why?
• The stock market is one
of the most important
way for companies to
raise money.
• About 48% Americans
invested in the stock
market as 2015 (CNBC).
• The successful prediction
of a stock’s future price
could yield significant
PROFIT.
3Stock Market Prediction
 How?
Guess? Fundamental Analysis
Technical Analysis (Charting) Technological Methods
4Stock Market Prediction
 Ray Dalio’s $165B Bridgewater Associates will start
a new artificial-intelligence unit to use predictive
analysis for trades. (Bloomberg, 2015)
5Related Works
Paper Index Selected Papers
[1]
Predicting Financial Markets: Comparing Survey,
News, Twitter and Search Engine Data
[2] A fusion model of HMM, ANN and GA for stock market forecasting
[3] Twitter mood predicts the stock market
[4] Stock Market Prediction System with Modular Neural Networks
[5]
Empirical evaluation of an automated intraday stock recommendation
system incorporating both market data and textual news
[6] A Hybrid Machine Learning System for Stock Market Forecasting
[7]
Market Index and Stock Price Direction Prediction using Machine
Learning Techniques: An empirical study on the KOSPI and HSI
[8] Stock Market Prediction Using Disparate Data Sources (Proposed)
6Related Works
Paper
Data Model
Target
Type of Stock
Market
Data
Technical
Indicator
Social
Media
News
Secondary
Variable
Time
Series
Logistic
Regression
Decision
Trees
Neural
Networks
Support
Vector
Machines
IT Index
Mix of
companies
[1]    
Price
Volume

[2] 

Price 
[3]    Movement 
[4]  

Buy and sell
signal

[5]       
Price
Volume

[6]   Movement 
[7]    Movement  
[8]          Movement  
7Research Motivation
 Which sources of data have the most correlation
with the stock market time series?
 Which logical target has the best prediction
capability with regards to the stock movement?
 Which technological model is best at predicting
the stock movement?
 Can we construct a better model using disparate
data sources?
8Data Sources
9Process Overview
10Data Sources
 Social Media and Internet Data
• “Financial news articles play a large role in influencing the
movement of a stock as humans react to the information.” (M.
Nardo etc. 2015)
• “Data on changes in how often financially
related Wikipedia pages were viewed have contained early signs
of stock market moves.” (H. Moat etc. 2013)
• Blog communication exhibits remarkable
predictive power. (M. Choudhury etc. 2008)
11Data Sources
 Secondary Variables
• The data from Social Media and Internet always have high variability
(e.g. Moving Average, Momentum, Relative Strength Index).
• If the upward or downward movement in predicting variables had an
effect on the target movement?
• What range of the primary variables have predicting power over the
targets?
0
500
1000
1500
2000
2500
3000
3500
1/2/2014 2/2/2014 3/2/2014 4/2/2014 5/2/2014 6/2/2014
Google News & Blogs
12Target Matrix
Target
Type
Method
1 Open (i+1) – Close (i)
2 Open (i+1) – Open (i)
3 Close (i+1) – Close (i)
4 Close (i+1) – Open (i)
5 Volume of trades moves as previous day
13Data Fusion
14Feature Selection
• Simplification of model
• Shorter training times
• Improve accuracy
• Enhanced generalization by reducing overfitting
15Feature Selection
 Chord Diagram
16Feature Selection
 Method :
Recursive feature
elimination (RFE)
 Coding : Python
with multiple
feature selection
package
Pseudo Code of RFE
* Code is available on https://github.com/binweng/SFS
17Feature Selection
Target Variables
Target 1
Close Open High Low P/E Ratio
Wiki_3_day_disparity Wiki_5_day_disparity Wiki_10_day_disparity Wiki_Momentum_1 Wiki_ROC
Google_MA_5 Google_EMA_3 Google_3_Day_disparity Google_5_day_disparity RSI
Stochastic Ocillater Wiki_RSI Google_MA_4 William %R Google_MA_3
Target 2
Close Open High Low P/E Ratio
Wiki_5_day_disparity Wiki_Move Wiki_MA3_Move Wiki_EMA5_Move
Wiki_5day_disparity_M
ove
Google_EMA5_Move
Google_3day_disparity_
Move
Google_ROC_Move Google_RSI_Move Wiki_3_day_disparity
Stochastic Ocillater RSI_Move Wiki_RSI_Move Google_MA_6 Google_Move
Target 3
Close Open High P/E Ratio Stochastic_Move
Wiki_Monentum_1 Wiki_Move Wiki_MA3_Move Wiki_EMA5_Move Wiki_ROC_Move
Google_EMA5_Move
Google_3day_disparity_
Move
Google_ROC_Move Google_RSI_Move Wiki_10_day_disparity
RSI_Move Wiki_RSI_Move Wiki_3_day_disparity Google_Move Google_MA5_Move
Target 4
Close Open High Low P/E Ratio
RSI_Move Wiki_10_day_Disparity Wiki_Move Wiki_MA3_Move Wiki_EMA5_Move
Google_Move
Google_3day_disparity_
Move
Google_ROC_Move Google_RSI_Move William %R
Stochastic Ocillater Stochastic_Move
Wiki_3day_disparity_M
ove
Wiki_ROC_Move Wiki_RSI_Move
Target 5
Close Open High Low William %R
Wiki_Monentum_1 Wiki_RSI Google_MA_2 Google_MA_3 Google_MA_4
Google_MA_9 Google_3_day_disparity Google_5_day_disparity
Google_10_day_disparit
y
Wiki_10_day_disparity
Wiki_3_day_disparity Wiki_5_day_disparity Google_MA_6 Google_MA_7 Google_MA_8
18Model Comparison
19Model Comparison
Source: http://scikit-learn.org/stable/tutorial/machine_learning_map/
20Model Comparison
21Experimental Result
Paper 1 – B. Nair etc., 2010 Paper 2 – A. Chen, 2003
22Experimental Result
• Comparison of Model Accuracy by information input
23Experimental Result
• Evaluate the model using AUC
24Experimental Result
Target Coincidence Matrix for SVM
Target1
Training 0 1 Testing 0 1
0 55 113 0 60 95
1 27 229 1 34 183
Target2
Training 0 1 Testing 0 1
0 160 28 0 156 39
1 37 180 1 32 164
Target3
Training 0 1 Testing 0 1
0 147 46 0 164 32
1 30 172 1 31 174
Target4
Training 0 1 Testing 0 1
0 150 31 0 165 34
1 34 172 1 31 179
Target5
Training 0 1 Testing 0 1
0 177 29 0 183 37
1 130 61 1 125 54
25Target Matrix
Target
Type
Method
1 Open (i+1) – Close (i)
2 Open (i+1) – Open (i)
3 Close (i+1) – Close (i)
4 Close (i+1) – Open (i)
5 Volume of trades as previous day
26Evaluation
 10 – fold cross validation
27Evaluation
 Cross validation result
28Evaluation
Accuracy: 82% - 89%
29Moving Prediction
30Conclusion
• Disparate sources of data help predict the stock market.
• Multiple targets’ prediction results can be used in
conjunction to successfully track stock market
movements.
• Decision tree model and support vector machine model
perform the best interchangeably with different
combinations of input data.
• With all the types of input data, SVMs performed best.
31Future Work
• Identifying and adding into a more inclusive form of
this model, new sources of data that have a predictive
effect on the movement of the stock market, like twitter
sentiment and market news textual analysis.
• Include linguistic modeling, clustering, and controlling
methods like fuzzy theory in obtaining the predictions
of price range.
Fuzzy Membership FunctionFuzzy System
32
INFORMS Philadelphia
November 2015
Bin Weng ( Email: bzw0018@auburn.edu)
Ph.D. Candidate of Industrial and System Engineering
Mohamed A. Ahmed (Email: mza0068@auburn.edu)
M.S. Candidate of Industrial and System Engineering
Fadel M. Megahed (Email: fmegahed@auburn.edu)
Assistant Professor of Industrial and System Engineering
Stock Market Prediction Using
Disparate Data Sources

More Related Content

What's hot

A LINEAR REGRESSION APPROACH TO PREDICTION OF STOCK MARKET TRADING VOLUME: A ...
A LINEAR REGRESSION APPROACH TO PREDICTION OF STOCK MARKET TRADING VOLUME: A ...A LINEAR REGRESSION APPROACH TO PREDICTION OF STOCK MARKET TRADING VOLUME: A ...
A LINEAR REGRESSION APPROACH TO PREDICTION OF STOCK MARKET TRADING VOLUME: A ...ijmvsc
 
Predicting Stock Market Price Using Support Vector Regression
Predicting Stock Market Price Using Support Vector RegressionPredicting Stock Market Price Using Support Vector Regression
Predicting Stock Market Price Using Support Vector RegressionChittagong Independent University
 
stock market prediction
stock market predictionstock market prediction
stock market predictionSRIGINES
 
Stock market analysis
Stock market analysisStock market analysis
Stock market analysisSruti Jain
 
Financial forecastings using neural networks ppt
Financial forecastings using neural networks pptFinancial forecastings using neural networks ppt
Financial forecastings using neural networks pptPuneet Gupta
 
IRJET- Stock Market Prediction using Machine Learning
IRJET- Stock Market Prediction using Machine LearningIRJET- Stock Market Prediction using Machine Learning
IRJET- Stock Market Prediction using Machine LearningIRJET Journal
 
IRJET- Data Visualization and Stock Market and Prediction
IRJET- Data Visualization and Stock Market and PredictionIRJET- Data Visualization and Stock Market and Prediction
IRJET- Data Visualization and Stock Market and PredictionIRJET Journal
 
Stock market analysis using ga and neural network
Stock market analysis using ga and neural networkStock market analysis using ga and neural network
Stock market analysis using ga and neural networkAmr Abd El Latief
 
IRJET- Stock Price Prediction using Long Short Term Memory
IRJET-  	  Stock Price Prediction using Long Short Term MemoryIRJET-  	  Stock Price Prediction using Long Short Term Memory
IRJET- Stock Price Prediction using Long Short Term MemoryIRJET Journal
 
Stock Market Prediction using Machine Learning
Stock Market Prediction using Machine LearningStock Market Prediction using Machine Learning
Stock Market Prediction using Machine LearningAravind Balaji
 
Google Stock Price Forecasting
Google Stock Price ForecastingGoogle Stock Price Forecasting
Google Stock Price ForecastingArkaprava Kundu
 
Stock Price Trend Forecasting using Supervised Learning
Stock Price Trend Forecasting using Supervised LearningStock Price Trend Forecasting using Supervised Learning
Stock Price Trend Forecasting using Supervised LearningSharvil Katariya
 
Stock Market Price Prediction Using Technical Analysis
Stock Market Price Prediction Using Technical AnalysisStock Market Price Prediction Using Technical Analysis
Stock Market Price Prediction Using Technical AnalysisASHEESHVERMA6
 
Stock Price Prediction
Stock Price PredictionStock Price Prediction
Stock Price PredictionManisha Mishra
 
Software for Stock Market Prediction
Software for Stock Market PredictionSoftware for Stock Market Prediction
Software for Stock Market PredictionSSA KPI
 
Finance prediction
Finance predictionFinance prediction
Finance predictionBlaze Rogers
 
Machine learning in finance using python
Machine learning in finance using pythonMachine learning in finance using python
Machine learning in finance using pythonEric Tham
 
MAG Meeting - October 26, 2014
MAG Meeting - October 26, 2014MAG Meeting - October 26, 2014
MAG Meeting - October 26, 2014ggiaroli
 

What's hot (20)

A LINEAR REGRESSION APPROACH TO PREDICTION OF STOCK MARKET TRADING VOLUME: A ...
A LINEAR REGRESSION APPROACH TO PREDICTION OF STOCK MARKET TRADING VOLUME: A ...A LINEAR REGRESSION APPROACH TO PREDICTION OF STOCK MARKET TRADING VOLUME: A ...
A LINEAR REGRESSION APPROACH TO PREDICTION OF STOCK MARKET TRADING VOLUME: A ...
 
Predicting Stock Market Price Using Support Vector Regression
Predicting Stock Market Price Using Support Vector RegressionPredicting Stock Market Price Using Support Vector Regression
Predicting Stock Market Price Using Support Vector Regression
 
stock market prediction
stock market predictionstock market prediction
stock market prediction
 
Stock market analysis
Stock market analysisStock market analysis
Stock market analysis
 
Stock Market Prediction Using ANN
Stock Market Prediction Using ANNStock Market Prediction Using ANN
Stock Market Prediction Using ANN
 
Financial forecastings using neural networks ppt
Financial forecastings using neural networks pptFinancial forecastings using neural networks ppt
Financial forecastings using neural networks ppt
 
IRJET- Stock Market Prediction using Machine Learning
IRJET- Stock Market Prediction using Machine LearningIRJET- Stock Market Prediction using Machine Learning
IRJET- Stock Market Prediction using Machine Learning
 
IRJET- Data Visualization and Stock Market and Prediction
IRJET- Data Visualization and Stock Market and PredictionIRJET- Data Visualization and Stock Market and Prediction
IRJET- Data Visualization and Stock Market and Prediction
 
Stock market analysis using ga and neural network
Stock market analysis using ga and neural networkStock market analysis using ga and neural network
Stock market analysis using ga and neural network
 
IRJET- Stock Price Prediction using Long Short Term Memory
IRJET-  	  Stock Price Prediction using Long Short Term MemoryIRJET-  	  Stock Price Prediction using Long Short Term Memory
IRJET- Stock Price Prediction using Long Short Term Memory
 
Stock Market Prediction using Machine Learning
Stock Market Prediction using Machine LearningStock Market Prediction using Machine Learning
Stock Market Prediction using Machine Learning
 
Google Stock Price Forecasting
Google Stock Price ForecastingGoogle Stock Price Forecasting
Google Stock Price Forecasting
 
Stock market with nn
Stock market with nnStock market with nn
Stock market with nn
 
Stock Price Trend Forecasting using Supervised Learning
Stock Price Trend Forecasting using Supervised LearningStock Price Trend Forecasting using Supervised Learning
Stock Price Trend Forecasting using Supervised Learning
 
Stock Market Price Prediction Using Technical Analysis
Stock Market Price Prediction Using Technical AnalysisStock Market Price Prediction Using Technical Analysis
Stock Market Price Prediction Using Technical Analysis
 
Stock Price Prediction
Stock Price PredictionStock Price Prediction
Stock Price Prediction
 
Software for Stock Market Prediction
Software for Stock Market PredictionSoftware for Stock Market Prediction
Software for Stock Market Prediction
 
Finance prediction
Finance predictionFinance prediction
Finance prediction
 
Machine learning in finance using python
Machine learning in finance using pythonMachine learning in finance using python
Machine learning in finance using python
 
MAG Meeting - October 26, 2014
MAG Meeting - October 26, 2014MAG Meeting - October 26, 2014
MAG Meeting - October 26, 2014
 

Viewers also liked

Ra Thermoseal - Toothpaste for Hypersensitivity
Ra Thermoseal - Toothpaste for HypersensitivityRa Thermoseal - Toothpaste for Hypersensitivity
Ra Thermoseal - Toothpaste for HypersensitivityICPA Care
 
неделя нач.кл.2015 2016
неделя нач.кл.2015 2016неделя нач.кл.2015 2016
неделя нач.кл.2015 2016evg_22
 
Wonderful Wacky Wide World of Data Analysis Applications
Wonderful Wacky Wide World of Data Analysis ApplicationsWonderful Wacky Wide World of Data Analysis Applications
Wonderful Wacky Wide World of Data Analysis ApplicationsJesus Ramos
 
Diferència entre Espanya i els EUA
Diferència entre Espanya i els EUADiferència entre Espanya i els EUA
Diferència entre Espanya i els EUAMrNouPre
 
Machine Learning For Organizations
Machine Learning For OrganizationsMachine Learning For Organizations
Machine Learning For OrganizationsJesus Ramos
 
Dentin hypersesitivity
Dentin hypersesitivity Dentin hypersesitivity
Dentin hypersesitivity Parth Thakkar
 

Viewers also liked (15)

Futuristic
FuturisticFuturistic
Futuristic
 
Ra Thermoseal - Toothpaste for Hypersensitivity
Ra Thermoseal - Toothpaste for HypersensitivityRa Thermoseal - Toothpaste for Hypersensitivity
Ra Thermoseal - Toothpaste for Hypersensitivity
 
Consultant Practicum for LinkedIn
Consultant Practicum for LinkedInConsultant Practicum for LinkedIn
Consultant Practicum for LinkedIn
 
Resume template 3
Resume template 3Resume template 3
Resume template 3
 
Woo
WooWoo
Woo
 
неделя нач.кл.2015 2016
неделя нач.кл.2015 2016неделя нач.кл.2015 2016
неделя нач.кл.2015 2016
 
TBBJ Article
TBBJ ArticleTBBJ Article
TBBJ Article
 
Arranger
ArrangerArranger
Arranger
 
Mujeres célebres
Mujeres célebresMujeres célebres
Mujeres célebres
 
Wonderful Wacky Wide World of Data Analysis Applications
Wonderful Wacky Wide World of Data Analysis ApplicationsWonderful Wacky Wide World of Data Analysis Applications
Wonderful Wacky Wide World of Data Analysis Applications
 
Diferència entre Espanya i els EUA
Diferència entre Espanya i els EUADiferència entre Espanya i els EUA
Diferència entre Espanya i els EUA
 
Machine Learning For Organizations
Machine Learning For OrganizationsMachine Learning For Organizations
Machine Learning For Organizations
 
Hiram na titik ññ
Hiram na titik ññHiram na titik ññ
Hiram na titik ññ
 
Calidad de vida
Calidad de vidaCalidad de vida
Calidad de vida
 
Dentin hypersesitivity
Dentin hypersesitivity Dentin hypersesitivity
Dentin hypersesitivity
 

Similar to INFORMS 2015

IRJET- Prediction in Stock Marketing
IRJET- Prediction in Stock MarketingIRJET- Prediction in Stock Marketing
IRJET- Prediction in Stock MarketingIRJET Journal
 
OPENING RANGE BREAKOUT STOCK TRADING ALGORITHMIC MODEL
OPENING RANGE BREAKOUT STOCK TRADING ALGORITHMIC MODELOPENING RANGE BREAKOUT STOCK TRADING ALGORITHMIC MODEL
OPENING RANGE BREAKOUT STOCK TRADING ALGORITHMIC MODELIJCI JOURNAL
 
Stock Market Prediction Using Artificial Neural Network
Stock Market Prediction Using Artificial Neural NetworkStock Market Prediction Using Artificial Neural Network
Stock Market Prediction Using Artificial Neural NetworkINFOGAIN PUBLICATION
 
Text Analytics- An application in Indian Stock Markets
Text Analytics- An application in Indian Stock MarketsText Analytics- An application in Indian Stock Markets
Text Analytics- An application in Indian Stock MarketsSinjana Ghosh
 
IRJET- Stock Market Prediction using Machine Learning Techniques
IRJET- Stock Market Prediction using Machine Learning TechniquesIRJET- Stock Market Prediction using Machine Learning Techniques
IRJET- Stock Market Prediction using Machine Learning TechniquesIRJET Journal
 
Turnover Prediction of Shares Using Data Mining Techniques : A Case Study
Turnover Prediction of Shares Using Data Mining Techniques : A Case Study Turnover Prediction of Shares Using Data Mining Techniques : A Case Study
Turnover Prediction of Shares Using Data Mining Techniques : A Case Study csandit
 
Stock Market Prediction Analysis
Stock Market Prediction AnalysisStock Market Prediction Analysis
Stock Market Prediction AnalysisIRJET Journal
 
IMPROVED TURNOVER PREDICTION OF SHARES USING HYBRID FEATURE SELECTION
IMPROVED TURNOVER PREDICTION OF SHARES USING HYBRID FEATURE SELECTIONIMPROVED TURNOVER PREDICTION OF SHARES USING HYBRID FEATURE SELECTION
IMPROVED TURNOVER PREDICTION OF SHARES USING HYBRID FEATURE SELECTIONIJDKP
 
STOCK PRICE PREDICTION AND RECOMMENDATION USINGMACHINE LEARNING TECHNIQUES AN...
STOCK PRICE PREDICTION AND RECOMMENDATION USINGMACHINE LEARNING TECHNIQUES AN...STOCK PRICE PREDICTION AND RECOMMENDATION USINGMACHINE LEARNING TECHNIQUES AN...
STOCK PRICE PREDICTION AND RECOMMENDATION USINGMACHINE LEARNING TECHNIQUES AN...IRJET Journal
 
Project report on Share Market application
Project report on Share Market applicationProject report on Share Market application
Project report on Share Market applicationKRISHNA PANDEY
 
Aditya report finaL
Aditya report finaLAditya report finaL
Aditya report finaL2767882
 
Online Survey Tool.pdf
Online Survey Tool.pdfOnline Survey Tool.pdf
Online Survey Tool.pdfssuser353e3f
 
The Analysis of Share Market using Random Forest & SVM
The Analysis of Share Market using Random Forest & SVMThe Analysis of Share Market using Random Forest & SVM
The Analysis of Share Market using Random Forest & SVMIRJET Journal
 
ELASTIC PROPERTY EVALUATION OF FIBRE REINFORCED GEOPOLYMER COMPOSITE USING SU...
ELASTIC PROPERTY EVALUATION OF FIBRE REINFORCED GEOPOLYMER COMPOSITE USING SU...ELASTIC PROPERTY EVALUATION OF FIBRE REINFORCED GEOPOLYMER COMPOSITE USING SU...
ELASTIC PROPERTY EVALUATION OF FIBRE REINFORCED GEOPOLYMER COMPOSITE USING SU...IRJET Journal
 
IRJET - Stock Market Analysis and Prediction
IRJET - Stock Market Analysis and PredictionIRJET - Stock Market Analysis and Prediction
IRJET - Stock Market Analysis and PredictionIRJET Journal
 
An Overview Of Predictive Analysis Techniques And Applications
An Overview Of Predictive Analysis  Techniques And ApplicationsAn Overview Of Predictive Analysis  Techniques And Applications
An Overview Of Predictive Analysis Techniques And ApplicationsScott Bou
 
The disruptometer: an artificial intelligence algorithm for market insights
The disruptometer: an artificial intelligence algorithm for market insightsThe disruptometer: an artificial intelligence algorithm for market insights
The disruptometer: an artificial intelligence algorithm for market insightsjournalBEEI
 
Applications of Artificial Neural Network in Forecasting of Stock Market Index
Applications of Artificial Neural Network in Forecasting of Stock Market IndexApplications of Artificial Neural Network in Forecasting of Stock Market Index
Applications of Artificial Neural Network in Forecasting of Stock Market Indexpaperpublications3
 
The future Proof Financial: Fintech
The future Proof Financial: FintechThe future Proof Financial: Fintech
The future Proof Financial: FintechMartijn Zoet
 

Similar to INFORMS 2015 (20)

IRJET- Prediction in Stock Marketing
IRJET- Prediction in Stock MarketingIRJET- Prediction in Stock Marketing
IRJET- Prediction in Stock Marketing
 
OPENING RANGE BREAKOUT STOCK TRADING ALGORITHMIC MODEL
OPENING RANGE BREAKOUT STOCK TRADING ALGORITHMIC MODELOPENING RANGE BREAKOUT STOCK TRADING ALGORITHMIC MODEL
OPENING RANGE BREAKOUT STOCK TRADING ALGORITHMIC MODEL
 
Stock Market Prediction Using Artificial Neural Network
Stock Market Prediction Using Artificial Neural NetworkStock Market Prediction Using Artificial Neural Network
Stock Market Prediction Using Artificial Neural Network
 
Text Analytics- An application in Indian Stock Markets
Text Analytics- An application in Indian Stock MarketsText Analytics- An application in Indian Stock Markets
Text Analytics- An application in Indian Stock Markets
 
IRJET- Stock Market Prediction using Machine Learning Techniques
IRJET- Stock Market Prediction using Machine Learning TechniquesIRJET- Stock Market Prediction using Machine Learning Techniques
IRJET- Stock Market Prediction using Machine Learning Techniques
 
Turnover Prediction of Shares Using Data Mining Techniques : A Case Study
Turnover Prediction of Shares Using Data Mining Techniques : A Case Study Turnover Prediction of Shares Using Data Mining Techniques : A Case Study
Turnover Prediction of Shares Using Data Mining Techniques : A Case Study
 
Using a Big Data approach to adapt Market Strategies
Using a Big Data approach to adapt Market StrategiesUsing a Big Data approach to adapt Market Strategies
Using a Big Data approach to adapt Market Strategies
 
Stock Market Prediction Analysis
Stock Market Prediction AnalysisStock Market Prediction Analysis
Stock Market Prediction Analysis
 
IMPROVED TURNOVER PREDICTION OF SHARES USING HYBRID FEATURE SELECTION
IMPROVED TURNOVER PREDICTION OF SHARES USING HYBRID FEATURE SELECTIONIMPROVED TURNOVER PREDICTION OF SHARES USING HYBRID FEATURE SELECTION
IMPROVED TURNOVER PREDICTION OF SHARES USING HYBRID FEATURE SELECTION
 
STOCK PRICE PREDICTION AND RECOMMENDATION USINGMACHINE LEARNING TECHNIQUES AN...
STOCK PRICE PREDICTION AND RECOMMENDATION USINGMACHINE LEARNING TECHNIQUES AN...STOCK PRICE PREDICTION AND RECOMMENDATION USINGMACHINE LEARNING TECHNIQUES AN...
STOCK PRICE PREDICTION AND RECOMMENDATION USINGMACHINE LEARNING TECHNIQUES AN...
 
Project report on Share Market application
Project report on Share Market applicationProject report on Share Market application
Project report on Share Market application
 
Aditya report finaL
Aditya report finaLAditya report finaL
Aditya report finaL
 
Online Survey Tool.pdf
Online Survey Tool.pdfOnline Survey Tool.pdf
Online Survey Tool.pdf
 
The Analysis of Share Market using Random Forest & SVM
The Analysis of Share Market using Random Forest & SVMThe Analysis of Share Market using Random Forest & SVM
The Analysis of Share Market using Random Forest & SVM
 
ELASTIC PROPERTY EVALUATION OF FIBRE REINFORCED GEOPOLYMER COMPOSITE USING SU...
ELASTIC PROPERTY EVALUATION OF FIBRE REINFORCED GEOPOLYMER COMPOSITE USING SU...ELASTIC PROPERTY EVALUATION OF FIBRE REINFORCED GEOPOLYMER COMPOSITE USING SU...
ELASTIC PROPERTY EVALUATION OF FIBRE REINFORCED GEOPOLYMER COMPOSITE USING SU...
 
IRJET - Stock Market Analysis and Prediction
IRJET - Stock Market Analysis and PredictionIRJET - Stock Market Analysis and Prediction
IRJET - Stock Market Analysis and Prediction
 
An Overview Of Predictive Analysis Techniques And Applications
An Overview Of Predictive Analysis  Techniques And ApplicationsAn Overview Of Predictive Analysis  Techniques And Applications
An Overview Of Predictive Analysis Techniques And Applications
 
The disruptometer: an artificial intelligence algorithm for market insights
The disruptometer: an artificial intelligence algorithm for market insightsThe disruptometer: an artificial intelligence algorithm for market insights
The disruptometer: an artificial intelligence algorithm for market insights
 
Applications of Artificial Neural Network in Forecasting of Stock Market Index
Applications of Artificial Neural Network in Forecasting of Stock Market IndexApplications of Artificial Neural Network in Forecasting of Stock Market Index
Applications of Artificial Neural Network in Forecasting of Stock Market Index
 
The future Proof Financial: Fintech
The future Proof Financial: FintechThe future Proof Financial: Fintech
The future Proof Financial: Fintech
 

INFORMS 2015

  • 1. 1 INFORMS Philadelphia November 2015 Bin Weng ( Email: bzw0018@auburn.edu) Ph.D. Candidate of Industrial and System Engineering Mohamed A. Ahmed (Email: mza0068@auburn.edu) M.S. Candidate of Industrial and System Engineering Fadel M. Megahed (Email: fmegahed@auburn.edu) Assistant Professor of Industrial and System Engineering Stock Market Prediction Using Disparate Data Sources
  • 2. 2Stock Market Prediction  Why? • The stock market is one of the most important way for companies to raise money. • About 48% Americans invested in the stock market as 2015 (CNBC). • The successful prediction of a stock’s future price could yield significant PROFIT.
  • 3. 3Stock Market Prediction  How? Guess? Fundamental Analysis Technical Analysis (Charting) Technological Methods
  • 4. 4Stock Market Prediction  Ray Dalio’s $165B Bridgewater Associates will start a new artificial-intelligence unit to use predictive analysis for trades. (Bloomberg, 2015)
  • 5. 5Related Works Paper Index Selected Papers [1] Predicting Financial Markets: Comparing Survey, News, Twitter and Search Engine Data [2] A fusion model of HMM, ANN and GA for stock market forecasting [3] Twitter mood predicts the stock market [4] Stock Market Prediction System with Modular Neural Networks [5] Empirical evaluation of an automated intraday stock recommendation system incorporating both market data and textual news [6] A Hybrid Machine Learning System for Stock Market Forecasting [7] Market Index and Stock Price Direction Prediction using Machine Learning Techniques: An empirical study on the KOSPI and HSI [8] Stock Market Prediction Using Disparate Data Sources (Proposed)
  • 6. 6Related Works Paper Data Model Target Type of Stock Market Data Technical Indicator Social Media News Secondary Variable Time Series Logistic Regression Decision Trees Neural Networks Support Vector Machines IT Index Mix of companies [1]     Price Volume  [2]   Price  [3]    Movement  [4]    Buy and sell signal  [5]        Price Volume  [6]   Movement  [7]    Movement   [8]          Movement  
  • 7. 7Research Motivation  Which sources of data have the most correlation with the stock market time series?  Which logical target has the best prediction capability with regards to the stock movement?  Which technological model is best at predicting the stock movement?  Can we construct a better model using disparate data sources?
  • 10. 10Data Sources  Social Media and Internet Data • “Financial news articles play a large role in influencing the movement of a stock as humans react to the information.” (M. Nardo etc. 2015) • “Data on changes in how often financially related Wikipedia pages were viewed have contained early signs of stock market moves.” (H. Moat etc. 2013) • Blog communication exhibits remarkable predictive power. (M. Choudhury etc. 2008)
  • 11. 11Data Sources  Secondary Variables • The data from Social Media and Internet always have high variability (e.g. Moving Average, Momentum, Relative Strength Index). • If the upward or downward movement in predicting variables had an effect on the target movement? • What range of the primary variables have predicting power over the targets? 0 500 1000 1500 2000 2500 3000 3500 1/2/2014 2/2/2014 3/2/2014 4/2/2014 5/2/2014 6/2/2014 Google News & Blogs
  • 12. 12Target Matrix Target Type Method 1 Open (i+1) – Close (i) 2 Open (i+1) – Open (i) 3 Close (i+1) – Close (i) 4 Close (i+1) – Open (i) 5 Volume of trades moves as previous day
  • 14. 14Feature Selection • Simplification of model • Shorter training times • Improve accuracy • Enhanced generalization by reducing overfitting
  • 16. 16Feature Selection  Method : Recursive feature elimination (RFE)  Coding : Python with multiple feature selection package Pseudo Code of RFE * Code is available on https://github.com/binweng/SFS
  • 17. 17Feature Selection Target Variables Target 1 Close Open High Low P/E Ratio Wiki_3_day_disparity Wiki_5_day_disparity Wiki_10_day_disparity Wiki_Momentum_1 Wiki_ROC Google_MA_5 Google_EMA_3 Google_3_Day_disparity Google_5_day_disparity RSI Stochastic Ocillater Wiki_RSI Google_MA_4 William %R Google_MA_3 Target 2 Close Open High Low P/E Ratio Wiki_5_day_disparity Wiki_Move Wiki_MA3_Move Wiki_EMA5_Move Wiki_5day_disparity_M ove Google_EMA5_Move Google_3day_disparity_ Move Google_ROC_Move Google_RSI_Move Wiki_3_day_disparity Stochastic Ocillater RSI_Move Wiki_RSI_Move Google_MA_6 Google_Move Target 3 Close Open High P/E Ratio Stochastic_Move Wiki_Monentum_1 Wiki_Move Wiki_MA3_Move Wiki_EMA5_Move Wiki_ROC_Move Google_EMA5_Move Google_3day_disparity_ Move Google_ROC_Move Google_RSI_Move Wiki_10_day_disparity RSI_Move Wiki_RSI_Move Wiki_3_day_disparity Google_Move Google_MA5_Move Target 4 Close Open High Low P/E Ratio RSI_Move Wiki_10_day_Disparity Wiki_Move Wiki_MA3_Move Wiki_EMA5_Move Google_Move Google_3day_disparity_ Move Google_ROC_Move Google_RSI_Move William %R Stochastic Ocillater Stochastic_Move Wiki_3day_disparity_M ove Wiki_ROC_Move Wiki_RSI_Move Target 5 Close Open High Low William %R Wiki_Monentum_1 Wiki_RSI Google_MA_2 Google_MA_3 Google_MA_4 Google_MA_9 Google_3_day_disparity Google_5_day_disparity Google_10_day_disparit y Wiki_10_day_disparity Wiki_3_day_disparity Wiki_5_day_disparity Google_MA_6 Google_MA_7 Google_MA_8
  • 21. 21Experimental Result Paper 1 – B. Nair etc., 2010 Paper 2 – A. Chen, 2003
  • 22. 22Experimental Result • Comparison of Model Accuracy by information input
  • 23. 23Experimental Result • Evaluate the model using AUC
  • 24. 24Experimental Result Target Coincidence Matrix for SVM Target1 Training 0 1 Testing 0 1 0 55 113 0 60 95 1 27 229 1 34 183 Target2 Training 0 1 Testing 0 1 0 160 28 0 156 39 1 37 180 1 32 164 Target3 Training 0 1 Testing 0 1 0 147 46 0 164 32 1 30 172 1 31 174 Target4 Training 0 1 Testing 0 1 0 150 31 0 165 34 1 34 172 1 31 179 Target5 Training 0 1 Testing 0 1 0 177 29 0 183 37 1 130 61 1 125 54
  • 25. 25Target Matrix Target Type Method 1 Open (i+1) – Close (i) 2 Open (i+1) – Open (i) 3 Close (i+1) – Close (i) 4 Close (i+1) – Open (i) 5 Volume of trades as previous day
  • 26. 26Evaluation  10 – fold cross validation
  • 30. 30Conclusion • Disparate sources of data help predict the stock market. • Multiple targets’ prediction results can be used in conjunction to successfully track stock market movements. • Decision tree model and support vector machine model perform the best interchangeably with different combinations of input data. • With all the types of input data, SVMs performed best.
  • 31. 31Future Work • Identifying and adding into a more inclusive form of this model, new sources of data that have a predictive effect on the movement of the stock market, like twitter sentiment and market news textual analysis. • Include linguistic modeling, clustering, and controlling methods like fuzzy theory in obtaining the predictions of price range. Fuzzy Membership FunctionFuzzy System
  • 32. 32 INFORMS Philadelphia November 2015 Bin Weng ( Email: bzw0018@auburn.edu) Ph.D. Candidate of Industrial and System Engineering Mohamed A. Ahmed (Email: mza0068@auburn.edu) M.S. Candidate of Industrial and System Engineering Fadel M. Megahed (Email: fmegahed@auburn.edu) Assistant Professor of Industrial and System Engineering Stock Market Prediction Using Disparate Data Sources

Editor's Notes

  1. Ray Dalio (born August 8, 1949) is an American businessman and founder of the investment firm Bridgewater Associates.
  2. journal
  3. initial
  4. Does the target satisfy investors who wish to know movement in different time periods? What kind of prediction is being done, short or long term?
  5. Google News could search and explore information from historical achieves dating back over 200 years. Blog provides insights into understanding communication patterns of people. The communication dynamics yield correlations with certain external events, justifying their predictive power.
  6. Feature selection is an indispensable process in the Machine Learning As more and more data is collected and analyzed, decision makers at all levels welcome data visualization software that enables them to see analytical results presented visually, find relevance among the millions of variables, communicate concepts and hypotheses to others, and even predict the future.
  7. It works by recursively removing attributes and building a model on those attributes that remain. Code available
  8. Gap as increase
  9. Stronger lines
  10. This research work points to the fact that historical market data alone is not sufficient to predict the movements of stocks in the market. It validates the proposition that internet search data has predictive power, too.