SlideShare a Scribd company logo
Tunisia Polytechnic School
Data analysis project
Presented by
Mohamed DHAOUI
(3rd year engineering student)
Contact@Mohamed-dhaoui.com
Mohamed.dhaoui.ept@gmail.com
Academic Year : 2015-2016 1
Forecasting stock market movement direction with
support vector machine
2
How SVM works?
Problematic and motivations
Experiment design and results
Conclusion
PLAN
3
Problematic and motivations
• The financial market is a complex, evolutionary, and non-linear dynamical system.
• The financial forecasting is characterized by data intensity, noise, non-stationary,
unstructured nature, high degree of uncertainty, and hidden relationships.
• Movements in market prices are not random. Rather, they behave in a highly non-
linear, dynamic manner.
In this paper, we investigate the predictability of financial movement direction with SVM
by forecasting the weekly movement direction of NIKKEI 225 index.
Financial market
4
Problematic and motivations
• Support vector machine (SVM) is a very specific type of learning algorithms
characterized by the capacity control of the decision function, the use of the
kernel functions and the sparsity of the solution.
• SVM is shown to be very resistant to the over-fitting problem.
• Training SVM is equivalent to solving a linearly constrained quadratic
programming problem so that the solution of SVM is always unique and globally
optimal.
Support Vector Machine
5
Problematic and motivations
• The NIKKEI 225 Index measures the composite price performance of 225
highly capitalized stocks trading on the Tokyo Stock Exchange (TSE),
representing a broad cross-section of Japanese industries.
• There are two basic reasons for the success of these index trading vehicles:
- They provide an effective means for investors to hedge against
potential market risks.
- They create new profit making opportunities for market speculators
and arbitrageur.
NIKKEI 225 index
6
How SVM works?
Linearly separable data
For a two-class linearly separable learning task, the aim of SVC
is to find a hyperplane that can separate two classes of given
samples with a maximal margin.
-> good classification performance
-> guarantees high predictive accuracy for the future data
 the margin corresponds to the shortest distance between
the closest data points to any point on the hyperplane
-> The smallest distance is called the margin of separation
-> The hyperplane is called optimal separating hyperplane if
the margin is maximized
7
How SVM works?
Linearly separable data
8
How SVM works?
Linearly separable data
9
How SVM works?
primal problem
Linearly separable data
10
How SVM works?
Linearly separable data
11
How SVM works?
Linearly separable data
12
How SVM works?
Linearly inseparable data
Introducing a new function:
-> a feature map mapping the input space to a usually high dimensional
feature space where the data points become linearly separable.
is an upper bound on the number of training errors
Controls the trade-off between complexity of the machine
and the number of inseparable points.
Introduced to account for the amount of a
violation of classification by the classifier
13
How SVM works?
Linearly inseparable data
14
How SVM works?
Linearly inseparable data
15
How SVM works?
Linearly inseparable data
-> Introducing Kerner
16
Experiment design and results
• term structure of interest rates (TS)
• short-term interest rate (ST)
• long-term interest rate (LT)
• consumer price index (CPI)
• industrial production (IP)
The economy growth has a close relationship with Japanese export. The largest export
target for Japan is the United States of America (USA), which is the leading economy in
the world. Therefore, the economic condition of USA inRuences Japan economy
• S& P 500 Index is a well-known indicator of the economic condition in USA
• The exchange rate of US Dollars against Japanese Yen (JPY)
Input variables
17
Experiment design and results
-> The behaviors of the NIKKEI 225 Index, the S& P 500 Index and Japanese Yen are very complex. It is
impossible to give an explicit formula to describe the underlying relationship between them
18
Experiment design and results
Data collection
• Source: from the finance section of Yahoo and the Pacific Exchange Rate Service
provided by Professor Werner Antweiler, University of British Columbia, Vancouver,
Canada, respectively.
• Periode: from January 1, 1990 to December 31, 2002
• Number of observations: total of 676 pairs of observations:
- (640 pairs of observations) is used to determine the speci1cations of the
models and parameters. The second part
- (36 pairs of observations) is reserved for out-of-sample evaluation and
comparison of performances among various forecasting models.
19
Experiment design and results
Comparaison with other forecasting methods
• To evaluate the forecasting ability of SVM, we use the random walk model (RW) as a
benchmark for comparison
• RW is a one-step-ahead forecasting method, since it uses the current actual value
to predict the future value as follows:
• We also compare the SVM’s forecasting performance with that of linear discriminant analysis
(LDA) a quadratic discriminant analysis (QDA)
20
Experiment design and results
• LDA: This method maximizes the ratio of between-class variance to the within-class variance in any
particular data set, thereby guaranteeing maximal separability.
• QDA: It is similar to LDA, only dropping the assumption of equal covariance matrices. Therefore,
the boundary between two discrimination regions is allowed to be a quadratic surface
Comparaison with other forecasting methods
21
Experiment design and results
Combining model
A combining model by integrating SVM with other classi1cation methods as follows
Where wi is the weight assigned to classification method I ->
A well-performed forecasting method should be given a larger weight than the others during
the score combination
22
Experiment design and results
• The relative performance of the models is measured by hit ratio
Table: Forecasting performance of different classi1cation methods
23
Experiment design and results
RW performs worst
Why?
• All historic information is summarized in the current value
• increments–positive or negative are uncorrelated (random)
-> in the long run there are as many positive as negative Ructuations making long term predictions
other than the trend impossible
SVM performs best
Why?
• SVM is designed to minimize the structural risk, whereas the previous techniques are usually
based on minimization of empirical risk
• SVM is usually less vulnerable to the over-fitting problem
QDA out-performs LDA in term of hit ratio, because LDA assumes that all the classes have
equal covariance matrices, which is not consistent with the properties of input variable belonging
to different classes
24
Conclusion
• The use of support vector machines to predict financial movement direction.
SVM is a promising type of tool for financial forecasting
• SVM is superior to the other individual classi1cation methods in forecasting weekly
movement direction of NIKKEI 225 Index
• Each method has its own strengths and weaknesses
• The weakness of one method can be balanced by the strengths of another by achieving
a systematic effect
The combining model performs best among all the forecasting methods.
25
Thank you for your
attention

More Related Content

What's hot

Cross-validation aggregation for forecasting
Cross-validation aggregation for forecastingCross-validation aggregation for forecasting
Cross-validation aggregation for forecasting
Devon Barrow
 
Learning On The Border:Active Learning in Imbalanced classification Data
Learning On The Border:Active Learning in Imbalanced classification DataLearning On The Border:Active Learning in Imbalanced classification Data
Learning On The Border:Active Learning in Imbalanced classification Data
萍華 楊
 
Template attack versus Bayes classifier
Template attack  versus Bayes classifierTemplate attack  versus Bayes classifier
Template attack versus Bayes classifier
Shahid Beheshti University
 
[update] Introductory Parts of the Book "Dive into Deep Learning"
[update] Introductory Parts of the Book "Dive into Deep Learning"[update] Introductory Parts of the Book "Dive into Deep Learning"
[update] Introductory Parts of the Book "Dive into Deep Learning"
Young-Min kang
 
Policy Based reinforcement Learning for time series Anomaly detection
Policy Based reinforcement Learning for time series Anomaly detectionPolicy Based reinforcement Learning for time series Anomaly detection
Policy Based reinforcement Learning for time series Anomaly detection
Kishor Datta Gupta
 
Applications of numerical methods
Applications of numerical methodsApplications of numerical methods
Applications of numerical methods
Daffodil International University
 
Ai lecture 11(unit02)
Ai lecture  11(unit02)Ai lecture  11(unit02)
Ai lecture 11(unit02)
vikas dhakane
 
An Introduction to Reinforcement Learning - The Doors to AGI
An Introduction to Reinforcement Learning - The Doors to AGIAn Introduction to Reinforcement Learning - The Doors to AGI
An Introduction to Reinforcement Learning - The Doors to AGI
Anirban Santara
 
Aaa ped-14-Ensemble Learning: About Ensemble Learning
Aaa ped-14-Ensemble Learning: About Ensemble LearningAaa ped-14-Ensemble Learning: About Ensemble Learning
Aaa ped-14-Ensemble Learning: About Ensemble Learning
AminaRepo
 
Presentation on supervised learning
Presentation on supervised learningPresentation on supervised learning
Presentation on supervised learning
Tonmoy Bhagawati
 
Supervised Machine Learning in R
Supervised  Machine Learning  in RSupervised  Machine Learning  in R
Supervised Machine Learning in R
Babu Priyavrat
 
Churn Modeling For Mobile Telecommunications
Churn Modeling For Mobile TelecommunicationsChurn Modeling For Mobile Telecommunications
Churn Modeling For Mobile Telecommunications
Salford Systems
 

What's hot (12)

Cross-validation aggregation for forecasting
Cross-validation aggregation for forecastingCross-validation aggregation for forecasting
Cross-validation aggregation for forecasting
 
Learning On The Border:Active Learning in Imbalanced classification Data
Learning On The Border:Active Learning in Imbalanced classification DataLearning On The Border:Active Learning in Imbalanced classification Data
Learning On The Border:Active Learning in Imbalanced classification Data
 
Template attack versus Bayes classifier
Template attack  versus Bayes classifierTemplate attack  versus Bayes classifier
Template attack versus Bayes classifier
 
[update] Introductory Parts of the Book "Dive into Deep Learning"
[update] Introductory Parts of the Book "Dive into Deep Learning"[update] Introductory Parts of the Book "Dive into Deep Learning"
[update] Introductory Parts of the Book "Dive into Deep Learning"
 
Policy Based reinforcement Learning for time series Anomaly detection
Policy Based reinforcement Learning for time series Anomaly detectionPolicy Based reinforcement Learning for time series Anomaly detection
Policy Based reinforcement Learning for time series Anomaly detection
 
Applications of numerical methods
Applications of numerical methodsApplications of numerical methods
Applications of numerical methods
 
Ai lecture 11(unit02)
Ai lecture  11(unit02)Ai lecture  11(unit02)
Ai lecture 11(unit02)
 
An Introduction to Reinforcement Learning - The Doors to AGI
An Introduction to Reinforcement Learning - The Doors to AGIAn Introduction to Reinforcement Learning - The Doors to AGI
An Introduction to Reinforcement Learning - The Doors to AGI
 
Aaa ped-14-Ensemble Learning: About Ensemble Learning
Aaa ped-14-Ensemble Learning: About Ensemble LearningAaa ped-14-Ensemble Learning: About Ensemble Learning
Aaa ped-14-Ensemble Learning: About Ensemble Learning
 
Presentation on supervised learning
Presentation on supervised learningPresentation on supervised learning
Presentation on supervised learning
 
Supervised Machine Learning in R
Supervised  Machine Learning  in RSupervised  Machine Learning  in R
Supervised Machine Learning in R
 
Churn Modeling For Mobile Telecommunications
Churn Modeling For Mobile TelecommunicationsChurn Modeling For Mobile Telecommunications
Churn Modeling For Mobile Telecommunications
 

Viewers also liked

Jesse Livermore Trading Rules - His Top Stock Market Tips
Jesse Livermore Trading Rules - His Top Stock Market TipsJesse Livermore Trading Rules - His Top Stock Market Tips
Jesse Livermore Trading Rules - His Top Stock Market Tips
Penny Stock Whizzkid
 
Influence financial ratio to stock price
Influence financial ratio to stock priceInfluence financial ratio to stock price
Influence financial ratio to stock price
Intan Ayuna
 
Trading decision trees ( Elaborated by Mohamed DHAOUI )
Trading decision trees ( Elaborated by Mohamed DHAOUI )Trading decision trees ( Elaborated by Mohamed DHAOUI )
Trading decision trees ( Elaborated by Mohamed DHAOUI )
Mohamed DHAOUI
 
Using gamification for human resource - Manu Melwin Joy
Using gamification for human resource - Manu Melwin JoyUsing gamification for human resource - Manu Melwin Joy
Using gamification for human resource - Manu Melwin Joy
manumelwin
 
3 stock valuation_fundamental_analysis
3 stock valuation_fundamental_analysis3 stock valuation_fundamental_analysis
3 stock valuation_fundamental_analysis
Jasleen ChaWla
 
Unit 6: Analyzing Common Stock Notes
Unit 6: Analyzing Common Stock NotesUnit 6: Analyzing Common Stock Notes
Unit 6: Analyzing Common Stock Notes
celsesser
 
Ch 6 Analyzing Common Stock
Ch 6   Analyzing Common StockCh 6   Analyzing Common Stock
Ch 6 Analyzing Common Stockmrhennings
 
Sucess Stories of Gamification in HR - Manu Melwin Joy
Sucess Stories of Gamification in HR - Manu Melwin JoySucess Stories of Gamification in HR - Manu Melwin Joy
Sucess Stories of Gamification in HR - Manu Melwin Joy
manumelwin
 
2014-06-20 Multinomial Logistic Regression with Apache Spark
2014-06-20 Multinomial Logistic Regression with Apache Spark2014-06-20 Multinomial Logistic Regression with Apache Spark
2014-06-20 Multinomial Logistic Regression with Apache Spark
DB Tsai
 
Jinxing_LIN_S224266_Poster
Jinxing_LIN_S224266_PosterJinxing_LIN_S224266_Poster
Jinxing_LIN_S224266_Poster
jinxing lin
 
Gamification: driving employee & customer loyalty, a telco scenario
Gamification: driving employee & customer loyalty, a telco scenarioGamification: driving employee & customer loyalty, a telco scenario
Gamification: driving employee & customer loyalty, a telco scenario
Anietie Akpan
 
Analyzing Common Stock
Analyzing Common StockAnalyzing Common Stock
Analyzing Common Stock
Villa Santa Maria
 
Gamification - Let's Talk About Data
Gamification - Let's Talk About DataGamification - Let's Talk About Data
Gamification - Let's Talk About Data
Raf Keustermans
 
Rules of Fundamental Stock Analysis
Rules of Fundamental Stock AnalysisRules of Fundamental Stock Analysis
Rules of Fundamental Stock Analysis
Dhanashri Academy
 
Improving hr management through gamification - Manu Melwin Joy
Improving hr management through gamification - Manu Melwin JoyImproving hr management through gamification - Manu Melwin Joy
Improving hr management through gamification - Manu Melwin Joy
manumelwin
 
Limitations of gamification in recruitment - Gamification in Recruitment
Limitations of gamification in recruitment - Gamification in RecruitmentLimitations of gamification in recruitment - Gamification in Recruitment
Limitations of gamification in recruitment - Gamification in Recruitment
manumelwin
 
solomonaddai
solomonaddaisolomonaddai
solomonaddai
Solomon Addai
 
Presentation Machine Learning
Presentation Machine LearningPresentation Machine Learning
Presentation Machine Learning
Periklis Gogas
 
Barriers to gamification adoption - Gamification in HR - Manu Melwin Joy
Barriers to gamification adoption  - Gamification in HR - Manu Melwin JoyBarriers to gamification adoption  - Gamification in HR - Manu Melwin Joy
Barriers to gamification adoption - Gamification in HR - Manu Melwin Joy
manumelwin
 
Fundamental and technical analysis
Fundamental and technical analysisFundamental and technical analysis
Fundamental and technical analysis
Gerry Gatawa
 

Viewers also liked (20)

Jesse Livermore Trading Rules - His Top Stock Market Tips
Jesse Livermore Trading Rules - His Top Stock Market TipsJesse Livermore Trading Rules - His Top Stock Market Tips
Jesse Livermore Trading Rules - His Top Stock Market Tips
 
Influence financial ratio to stock price
Influence financial ratio to stock priceInfluence financial ratio to stock price
Influence financial ratio to stock price
 
Trading decision trees ( Elaborated by Mohamed DHAOUI )
Trading decision trees ( Elaborated by Mohamed DHAOUI )Trading decision trees ( Elaborated by Mohamed DHAOUI )
Trading decision trees ( Elaborated by Mohamed DHAOUI )
 
Using gamification for human resource - Manu Melwin Joy
Using gamification for human resource - Manu Melwin JoyUsing gamification for human resource - Manu Melwin Joy
Using gamification for human resource - Manu Melwin Joy
 
3 stock valuation_fundamental_analysis
3 stock valuation_fundamental_analysis3 stock valuation_fundamental_analysis
3 stock valuation_fundamental_analysis
 
Unit 6: Analyzing Common Stock Notes
Unit 6: Analyzing Common Stock NotesUnit 6: Analyzing Common Stock Notes
Unit 6: Analyzing Common Stock Notes
 
Ch 6 Analyzing Common Stock
Ch 6   Analyzing Common StockCh 6   Analyzing Common Stock
Ch 6 Analyzing Common Stock
 
Sucess Stories of Gamification in HR - Manu Melwin Joy
Sucess Stories of Gamification in HR - Manu Melwin JoySucess Stories of Gamification in HR - Manu Melwin Joy
Sucess Stories of Gamification in HR - Manu Melwin Joy
 
2014-06-20 Multinomial Logistic Regression with Apache Spark
2014-06-20 Multinomial Logistic Regression with Apache Spark2014-06-20 Multinomial Logistic Regression with Apache Spark
2014-06-20 Multinomial Logistic Regression with Apache Spark
 
Jinxing_LIN_S224266_Poster
Jinxing_LIN_S224266_PosterJinxing_LIN_S224266_Poster
Jinxing_LIN_S224266_Poster
 
Gamification: driving employee & customer loyalty, a telco scenario
Gamification: driving employee & customer loyalty, a telco scenarioGamification: driving employee & customer loyalty, a telco scenario
Gamification: driving employee & customer loyalty, a telco scenario
 
Analyzing Common Stock
Analyzing Common StockAnalyzing Common Stock
Analyzing Common Stock
 
Gamification - Let's Talk About Data
Gamification - Let's Talk About DataGamification - Let's Talk About Data
Gamification - Let's Talk About Data
 
Rules of Fundamental Stock Analysis
Rules of Fundamental Stock AnalysisRules of Fundamental Stock Analysis
Rules of Fundamental Stock Analysis
 
Improving hr management through gamification - Manu Melwin Joy
Improving hr management through gamification - Manu Melwin JoyImproving hr management through gamification - Manu Melwin Joy
Improving hr management through gamification - Manu Melwin Joy
 
Limitations of gamification in recruitment - Gamification in Recruitment
Limitations of gamification in recruitment - Gamification in RecruitmentLimitations of gamification in recruitment - Gamification in Recruitment
Limitations of gamification in recruitment - Gamification in Recruitment
 
solomonaddai
solomonaddaisolomonaddai
solomonaddai
 
Presentation Machine Learning
Presentation Machine LearningPresentation Machine Learning
Presentation Machine Learning
 
Barriers to gamification adoption - Gamification in HR - Manu Melwin Joy
Barriers to gamification adoption  - Gamification in HR - Manu Melwin JoyBarriers to gamification adoption  - Gamification in HR - Manu Melwin Joy
Barriers to gamification adoption - Gamification in HR - Manu Melwin Joy
 
Fundamental and technical analysis
Fundamental and technical analysisFundamental and technical analysis
Fundamental and technical analysis
 

Similar to Forecasting stock market movement direction with support vector machine

Student Performance Predictor
Student Performance PredictorStudent Performance Predictor
Student Performance Predictor
IRJET Journal
 
Big Data Project - Final version
Big Data Project - Final versionBig Data Project - Final version
Big Data Project - Final version
Mihir Sanghavi
 
Presentation1.pptx
Presentation1.pptxPresentation1.pptx
Presentation1.pptx
narmeen11
 
MACHINE LEARNING.pptx
MACHINE LEARNING.pptxMACHINE LEARNING.pptx
MACHINE LEARNING.pptx
SOURAVGHOSH623569
 
Introduction to machine learning
Introduction to machine learningIntroduction to machine learning
Introduction to machine learning
Sanghamitra Deb
 
Models of Operational research, Advantages & disadvantages of Operational res...
Models of Operational research, Advantages & disadvantages of Operational res...Models of Operational research, Advantages & disadvantages of Operational res...
Models of Operational research, Advantages & disadvantages of Operational res...
Sunny Mervyne Baa
 
Lecture 1.pptx
Lecture 1.pptxLecture 1.pptx
Ali asgher slides copy
Ali asgher slides   copyAli asgher slides   copy
Ali asgher slides copy
Ali Asgher
 
Applications of Machine Learning in High Frequency Trading
Applications of Machine Learning in High Frequency TradingApplications of Machine Learning in High Frequency Trading
Applications of Machine Learning in High Frequency Trading
Ayan Sengupta
 
Unit 3 – AIML.pptx
Unit 3 – AIML.pptxUnit 3 – AIML.pptx
Unit 3 – AIML.pptx
hiblooms
 
Support Vector Machine ppt presentation
Support Vector Machine ppt presentationSupport Vector Machine ppt presentation
Support Vector Machine ppt presentation
AyanaRukasar
 
Important Quantitative Methods by MBA Classes in Mumbai
Important Quantitative Methods by MBA Classes in MumbaiImportant Quantitative Methods by MBA Classes in Mumbai
Important Quantitative Methods by MBA Classes in Mumbai
seomiamia
 
Computational Finance Introductory Lecture
Computational Finance Introductory LectureComputational Finance Introductory Lecture
Computational Finance Introductory Lecture
Stuart Gordon Reid
 
Recuriter Recommendation System
Recuriter Recommendation SystemRecuriter Recommendation System
Recuriter Recommendation System
IRJET Journal
 
Effect of Temporal Collaboration Network, Maintenance Activity, and Experienc...
Effect of Temporal Collaboration Network, Maintenance Activity, and Experienc...Effect of Temporal Collaboration Network, Maintenance Activity, and Experienc...
Effect of Temporal Collaboration Network, Maintenance Activity, and Experienc...
ESEM 2014
 
CASE STUDY: ADMISSION PREDICTION IN ENGINEERING AND TECHNOLOGY COLLEGES
CASE STUDY: ADMISSION PREDICTION IN ENGINEERING AND TECHNOLOGY COLLEGESCASE STUDY: ADMISSION PREDICTION IN ENGINEERING AND TECHNOLOGY COLLEGES
CASE STUDY: ADMISSION PREDICTION IN ENGINEERING AND TECHNOLOGY COLLEGES
IRJET Journal
 
Post Graduate Admission Prediction System
Post Graduate Admission Prediction SystemPost Graduate Admission Prediction System
Post Graduate Admission Prediction System
IRJET Journal
 
software engineering powerpoint presentation foe everyone
software engineering powerpoint presentation foe everyonesoftware engineering powerpoint presentation foe everyone
software engineering powerpoint presentation foe everyone
rebantaofficial
 
Macroeconomic modelling using Eviews
Macroeconomic modelling using EviewsMacroeconomic modelling using Eviews
Macroeconomic modelling using Eviews
Muhammad Anees
 
Testing of Object-Oriented Software
Testing of Object-Oriented SoftwareTesting of Object-Oriented Software
Testing of Object-Oriented Software
Praveen Penumathsa
 

Similar to Forecasting stock market movement direction with support vector machine (20)

Student Performance Predictor
Student Performance PredictorStudent Performance Predictor
Student Performance Predictor
 
Big Data Project - Final version
Big Data Project - Final versionBig Data Project - Final version
Big Data Project - Final version
 
Presentation1.pptx
Presentation1.pptxPresentation1.pptx
Presentation1.pptx
 
MACHINE LEARNING.pptx
MACHINE LEARNING.pptxMACHINE LEARNING.pptx
MACHINE LEARNING.pptx
 
Introduction to machine learning
Introduction to machine learningIntroduction to machine learning
Introduction to machine learning
 
Models of Operational research, Advantages & disadvantages of Operational res...
Models of Operational research, Advantages & disadvantages of Operational res...Models of Operational research, Advantages & disadvantages of Operational res...
Models of Operational research, Advantages & disadvantages of Operational res...
 
Lecture 1.pptx
Lecture 1.pptxLecture 1.pptx
Lecture 1.pptx
 
Ali asgher slides copy
Ali asgher slides   copyAli asgher slides   copy
Ali asgher slides copy
 
Applications of Machine Learning in High Frequency Trading
Applications of Machine Learning in High Frequency TradingApplications of Machine Learning in High Frequency Trading
Applications of Machine Learning in High Frequency Trading
 
Unit 3 – AIML.pptx
Unit 3 – AIML.pptxUnit 3 – AIML.pptx
Unit 3 – AIML.pptx
 
Support Vector Machine ppt presentation
Support Vector Machine ppt presentationSupport Vector Machine ppt presentation
Support Vector Machine ppt presentation
 
Important Quantitative Methods by MBA Classes in Mumbai
Important Quantitative Methods by MBA Classes in MumbaiImportant Quantitative Methods by MBA Classes in Mumbai
Important Quantitative Methods by MBA Classes in Mumbai
 
Computational Finance Introductory Lecture
Computational Finance Introductory LectureComputational Finance Introductory Lecture
Computational Finance Introductory Lecture
 
Recuriter Recommendation System
Recuriter Recommendation SystemRecuriter Recommendation System
Recuriter Recommendation System
 
Effect of Temporal Collaboration Network, Maintenance Activity, and Experienc...
Effect of Temporal Collaboration Network, Maintenance Activity, and Experienc...Effect of Temporal Collaboration Network, Maintenance Activity, and Experienc...
Effect of Temporal Collaboration Network, Maintenance Activity, and Experienc...
 
CASE STUDY: ADMISSION PREDICTION IN ENGINEERING AND TECHNOLOGY COLLEGES
CASE STUDY: ADMISSION PREDICTION IN ENGINEERING AND TECHNOLOGY COLLEGESCASE STUDY: ADMISSION PREDICTION IN ENGINEERING AND TECHNOLOGY COLLEGES
CASE STUDY: ADMISSION PREDICTION IN ENGINEERING AND TECHNOLOGY COLLEGES
 
Post Graduate Admission Prediction System
Post Graduate Admission Prediction SystemPost Graduate Admission Prediction System
Post Graduate Admission Prediction System
 
software engineering powerpoint presentation foe everyone
software engineering powerpoint presentation foe everyonesoftware engineering powerpoint presentation foe everyone
software engineering powerpoint presentation foe everyone
 
Macroeconomic modelling using Eviews
Macroeconomic modelling using EviewsMacroeconomic modelling using Eviews
Macroeconomic modelling using Eviews
 
Testing of Object-Oriented Software
Testing of Object-Oriented SoftwareTesting of Object-Oriented Software
Testing of Object-Oriented Software
 

Recently uploaded

My burning issue is homelessness K.C.M.O.
My burning issue is homelessness K.C.M.O.My burning issue is homelessness K.C.M.O.
My burning issue is homelessness K.C.M.O.
rwarrenll
 
Everything you wanted to know about LIHTC
Everything you wanted to know about LIHTCEverything you wanted to know about LIHTC
Everything you wanted to know about LIHTC
Roger Valdez
 
在线办理(英国UCA毕业证书)创意艺术大学毕业证在读证明一模一样
在线办理(英国UCA毕业证书)创意艺术大学毕业证在读证明一模一样在线办理(英国UCA毕业证书)创意艺术大学毕业证在读证明一模一样
在线办理(英国UCA毕业证书)创意艺术大学毕业证在读证明一模一样
v7oacc3l
 
Population Growth in Bataan: The effects of population growth around rural pl...
Population Growth in Bataan: The effects of population growth around rural pl...Population Growth in Bataan: The effects of population growth around rural pl...
Population Growth in Bataan: The effects of population growth around rural pl...
Bill641377
 
一比一原版(UO毕业证)渥太华大学毕业证如何办理
一比一原版(UO毕业证)渥太华大学毕业证如何办理一比一原版(UO毕业证)渥太华大学毕业证如何办理
一比一原版(UO毕业证)渥太华大学毕业证如何办理
aqzctr7x
 
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
Timothy Spann
 
Challenges of Nation Building-1.pptx with more important
Challenges of Nation Building-1.pptx with more importantChallenges of Nation Building-1.pptx with more important
Challenges of Nation Building-1.pptx with more important
Sm321
 
DSSML24_tspann_CodelessGenerativeAIPipelines
DSSML24_tspann_CodelessGenerativeAIPipelinesDSSML24_tspann_CodelessGenerativeAIPipelines
DSSML24_tspann_CodelessGenerativeAIPipelines
Timothy Spann
 
A presentation that explain the Power BI Licensing
A presentation that explain the Power BI LicensingA presentation that explain the Power BI Licensing
A presentation that explain the Power BI Licensing
AlessioFois2
 
Learn SQL from basic queries to Advance queries
Learn SQL from basic queries to Advance queriesLearn SQL from basic queries to Advance queries
Learn SQL from basic queries to Advance queries
manishkhaire30
 
一比一原版(UMN文凭证书)明尼苏达大学毕业证如何办理
一比一原版(UMN文凭证书)明尼苏达大学毕业证如何办理一比一原版(UMN文凭证书)明尼苏达大学毕业证如何办理
一比一原版(UMN文凭证书)明尼苏达大学毕业证如何办理
nyfuhyz
 
Experts live - Improving user adoption with AI
Experts live - Improving user adoption with AIExperts live - Improving user adoption with AI
Experts live - Improving user adoption with AI
jitskeb
 
Analysis insight about a Flyball dog competition team's performance
Analysis insight about a Flyball dog competition team's performanceAnalysis insight about a Flyball dog competition team's performance
Analysis insight about a Flyball dog competition team's performance
roli9797
 
Beyond the Basics of A/B Tests: Highly Innovative Experimentation Tactics You...
Beyond the Basics of A/B Tests: Highly Innovative Experimentation Tactics You...Beyond the Basics of A/B Tests: Highly Innovative Experimentation Tactics You...
Beyond the Basics of A/B Tests: Highly Innovative Experimentation Tactics You...
Aggregage
 
Predictably Improve Your B2B Tech Company's Performance by Leveraging Data
Predictably Improve Your B2B Tech Company's Performance by Leveraging DataPredictably Improve Your B2B Tech Company's Performance by Leveraging Data
Predictably Improve Your B2B Tech Company's Performance by Leveraging Data
Kiwi Creative
 
The Building Blocks of QuestDB, a Time Series Database
The Building Blocks of QuestDB, a Time Series DatabaseThe Building Blocks of QuestDB, a Time Series Database
The Building Blocks of QuestDB, a Time Series Database
javier ramirez
 
Udemy_2024_Global_Learning_Skills_Trends_Report (1).pdf
Udemy_2024_Global_Learning_Skills_Trends_Report (1).pdfUdemy_2024_Global_Learning_Skills_Trends_Report (1).pdf
Udemy_2024_Global_Learning_Skills_Trends_Report (1).pdf
Fernanda Palhano
 
一比一原版(牛布毕业证书)牛津布鲁克斯大学毕业证如何办理
一比一原版(牛布毕业证书)牛津布鲁克斯大学毕业证如何办理一比一原版(牛布毕业证书)牛津布鲁克斯大学毕业证如何办理
一比一原版(牛布毕业证书)牛津布鲁克斯大学毕业证如何办理
74nqk8xf
 
一比一原版(UCSF文凭证书)旧金山分校毕业证如何办理
一比一原版(UCSF文凭证书)旧金山分校毕业证如何办理一比一原版(UCSF文凭证书)旧金山分校毕业证如何办理
一比一原版(UCSF文凭证书)旧金山分校毕业证如何办理
nuttdpt
 
4th Modern Marketing Reckoner by MMA Global India & Group M: 60+ experts on W...
4th Modern Marketing Reckoner by MMA Global India & Group M: 60+ experts on W...4th Modern Marketing Reckoner by MMA Global India & Group M: 60+ experts on W...
4th Modern Marketing Reckoner by MMA Global India & Group M: 60+ experts on W...
Social Samosa
 

Recently uploaded (20)

My burning issue is homelessness K.C.M.O.
My burning issue is homelessness K.C.M.O.My burning issue is homelessness K.C.M.O.
My burning issue is homelessness K.C.M.O.
 
Everything you wanted to know about LIHTC
Everything you wanted to know about LIHTCEverything you wanted to know about LIHTC
Everything you wanted to know about LIHTC
 
在线办理(英国UCA毕业证书)创意艺术大学毕业证在读证明一模一样
在线办理(英国UCA毕业证书)创意艺术大学毕业证在读证明一模一样在线办理(英国UCA毕业证书)创意艺术大学毕业证在读证明一模一样
在线办理(英国UCA毕业证书)创意艺术大学毕业证在读证明一模一样
 
Population Growth in Bataan: The effects of population growth around rural pl...
Population Growth in Bataan: The effects of population growth around rural pl...Population Growth in Bataan: The effects of population growth around rural pl...
Population Growth in Bataan: The effects of population growth around rural pl...
 
一比一原版(UO毕业证)渥太华大学毕业证如何办理
一比一原版(UO毕业证)渥太华大学毕业证如何办理一比一原版(UO毕业证)渥太华大学毕业证如何办理
一比一原版(UO毕业证)渥太华大学毕业证如何办理
 
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
 
Challenges of Nation Building-1.pptx with more important
Challenges of Nation Building-1.pptx with more importantChallenges of Nation Building-1.pptx with more important
Challenges of Nation Building-1.pptx with more important
 
DSSML24_tspann_CodelessGenerativeAIPipelines
DSSML24_tspann_CodelessGenerativeAIPipelinesDSSML24_tspann_CodelessGenerativeAIPipelines
DSSML24_tspann_CodelessGenerativeAIPipelines
 
A presentation that explain the Power BI Licensing
A presentation that explain the Power BI LicensingA presentation that explain the Power BI Licensing
A presentation that explain the Power BI Licensing
 
Learn SQL from basic queries to Advance queries
Learn SQL from basic queries to Advance queriesLearn SQL from basic queries to Advance queries
Learn SQL from basic queries to Advance queries
 
一比一原版(UMN文凭证书)明尼苏达大学毕业证如何办理
一比一原版(UMN文凭证书)明尼苏达大学毕业证如何办理一比一原版(UMN文凭证书)明尼苏达大学毕业证如何办理
一比一原版(UMN文凭证书)明尼苏达大学毕业证如何办理
 
Experts live - Improving user adoption with AI
Experts live - Improving user adoption with AIExperts live - Improving user adoption with AI
Experts live - Improving user adoption with AI
 
Analysis insight about a Flyball dog competition team's performance
Analysis insight about a Flyball dog competition team's performanceAnalysis insight about a Flyball dog competition team's performance
Analysis insight about a Flyball dog competition team's performance
 
Beyond the Basics of A/B Tests: Highly Innovative Experimentation Tactics You...
Beyond the Basics of A/B Tests: Highly Innovative Experimentation Tactics You...Beyond the Basics of A/B Tests: Highly Innovative Experimentation Tactics You...
Beyond the Basics of A/B Tests: Highly Innovative Experimentation Tactics You...
 
Predictably Improve Your B2B Tech Company's Performance by Leveraging Data
Predictably Improve Your B2B Tech Company's Performance by Leveraging DataPredictably Improve Your B2B Tech Company's Performance by Leveraging Data
Predictably Improve Your B2B Tech Company's Performance by Leveraging Data
 
The Building Blocks of QuestDB, a Time Series Database
The Building Blocks of QuestDB, a Time Series DatabaseThe Building Blocks of QuestDB, a Time Series Database
The Building Blocks of QuestDB, a Time Series Database
 
Udemy_2024_Global_Learning_Skills_Trends_Report (1).pdf
Udemy_2024_Global_Learning_Skills_Trends_Report (1).pdfUdemy_2024_Global_Learning_Skills_Trends_Report (1).pdf
Udemy_2024_Global_Learning_Skills_Trends_Report (1).pdf
 
一比一原版(牛布毕业证书)牛津布鲁克斯大学毕业证如何办理
一比一原版(牛布毕业证书)牛津布鲁克斯大学毕业证如何办理一比一原版(牛布毕业证书)牛津布鲁克斯大学毕业证如何办理
一比一原版(牛布毕业证书)牛津布鲁克斯大学毕业证如何办理
 
一比一原版(UCSF文凭证书)旧金山分校毕业证如何办理
一比一原版(UCSF文凭证书)旧金山分校毕业证如何办理一比一原版(UCSF文凭证书)旧金山分校毕业证如何办理
一比一原版(UCSF文凭证书)旧金山分校毕业证如何办理
 
4th Modern Marketing Reckoner by MMA Global India & Group M: 60+ experts on W...
4th Modern Marketing Reckoner by MMA Global India & Group M: 60+ experts on W...4th Modern Marketing Reckoner by MMA Global India & Group M: 60+ experts on W...
4th Modern Marketing Reckoner by MMA Global India & Group M: 60+ experts on W...
 

Forecasting stock market movement direction with support vector machine

  • 1. Tunisia Polytechnic School Data analysis project Presented by Mohamed DHAOUI (3rd year engineering student) Contact@Mohamed-dhaoui.com Mohamed.dhaoui.ept@gmail.com Academic Year : 2015-2016 1 Forecasting stock market movement direction with support vector machine
  • 2. 2 How SVM works? Problematic and motivations Experiment design and results Conclusion PLAN
  • 3. 3 Problematic and motivations • The financial market is a complex, evolutionary, and non-linear dynamical system. • The financial forecasting is characterized by data intensity, noise, non-stationary, unstructured nature, high degree of uncertainty, and hidden relationships. • Movements in market prices are not random. Rather, they behave in a highly non- linear, dynamic manner. In this paper, we investigate the predictability of financial movement direction with SVM by forecasting the weekly movement direction of NIKKEI 225 index. Financial market
  • 4. 4 Problematic and motivations • Support vector machine (SVM) is a very specific type of learning algorithms characterized by the capacity control of the decision function, the use of the kernel functions and the sparsity of the solution. • SVM is shown to be very resistant to the over-fitting problem. • Training SVM is equivalent to solving a linearly constrained quadratic programming problem so that the solution of SVM is always unique and globally optimal. Support Vector Machine
  • 5. 5 Problematic and motivations • The NIKKEI 225 Index measures the composite price performance of 225 highly capitalized stocks trading on the Tokyo Stock Exchange (TSE), representing a broad cross-section of Japanese industries. • There are two basic reasons for the success of these index trading vehicles: - They provide an effective means for investors to hedge against potential market risks. - They create new profit making opportunities for market speculators and arbitrageur. NIKKEI 225 index
  • 6. 6 How SVM works? Linearly separable data For a two-class linearly separable learning task, the aim of SVC is to find a hyperplane that can separate two classes of given samples with a maximal margin. -> good classification performance -> guarantees high predictive accuracy for the future data  the margin corresponds to the shortest distance between the closest data points to any point on the hyperplane -> The smallest distance is called the margin of separation -> The hyperplane is called optimal separating hyperplane if the margin is maximized
  • 7. 7 How SVM works? Linearly separable data
  • 8. 8 How SVM works? Linearly separable data
  • 9. 9 How SVM works? primal problem Linearly separable data
  • 10. 10 How SVM works? Linearly separable data
  • 11. 11 How SVM works? Linearly separable data
  • 12. 12 How SVM works? Linearly inseparable data Introducing a new function: -> a feature map mapping the input space to a usually high dimensional feature space where the data points become linearly separable. is an upper bound on the number of training errors Controls the trade-off between complexity of the machine and the number of inseparable points. Introduced to account for the amount of a violation of classification by the classifier
  • 13. 13 How SVM works? Linearly inseparable data
  • 14. 14 How SVM works? Linearly inseparable data
  • 15. 15 How SVM works? Linearly inseparable data -> Introducing Kerner
  • 16. 16 Experiment design and results • term structure of interest rates (TS) • short-term interest rate (ST) • long-term interest rate (LT) • consumer price index (CPI) • industrial production (IP) The economy growth has a close relationship with Japanese export. The largest export target for Japan is the United States of America (USA), which is the leading economy in the world. Therefore, the economic condition of USA inRuences Japan economy • S& P 500 Index is a well-known indicator of the economic condition in USA • The exchange rate of US Dollars against Japanese Yen (JPY) Input variables
  • 17. 17 Experiment design and results -> The behaviors of the NIKKEI 225 Index, the S& P 500 Index and Japanese Yen are very complex. It is impossible to give an explicit formula to describe the underlying relationship between them
  • 18. 18 Experiment design and results Data collection • Source: from the finance section of Yahoo and the Pacific Exchange Rate Service provided by Professor Werner Antweiler, University of British Columbia, Vancouver, Canada, respectively. • Periode: from January 1, 1990 to December 31, 2002 • Number of observations: total of 676 pairs of observations: - (640 pairs of observations) is used to determine the speci1cations of the models and parameters. The second part - (36 pairs of observations) is reserved for out-of-sample evaluation and comparison of performances among various forecasting models.
  • 19. 19 Experiment design and results Comparaison with other forecasting methods • To evaluate the forecasting ability of SVM, we use the random walk model (RW) as a benchmark for comparison • RW is a one-step-ahead forecasting method, since it uses the current actual value to predict the future value as follows: • We also compare the SVM’s forecasting performance with that of linear discriminant analysis (LDA) a quadratic discriminant analysis (QDA)
  • 20. 20 Experiment design and results • LDA: This method maximizes the ratio of between-class variance to the within-class variance in any particular data set, thereby guaranteeing maximal separability. • QDA: It is similar to LDA, only dropping the assumption of equal covariance matrices. Therefore, the boundary between two discrimination regions is allowed to be a quadratic surface Comparaison with other forecasting methods
  • 21. 21 Experiment design and results Combining model A combining model by integrating SVM with other classi1cation methods as follows Where wi is the weight assigned to classification method I -> A well-performed forecasting method should be given a larger weight than the others during the score combination
  • 22. 22 Experiment design and results • The relative performance of the models is measured by hit ratio Table: Forecasting performance of different classi1cation methods
  • 23. 23 Experiment design and results RW performs worst Why? • All historic information is summarized in the current value • increments–positive or negative are uncorrelated (random) -> in the long run there are as many positive as negative Ructuations making long term predictions other than the trend impossible SVM performs best Why? • SVM is designed to minimize the structural risk, whereas the previous techniques are usually based on minimization of empirical risk • SVM is usually less vulnerable to the over-fitting problem QDA out-performs LDA in term of hit ratio, because LDA assumes that all the classes have equal covariance matrices, which is not consistent with the properties of input variable belonging to different classes
  • 24. 24 Conclusion • The use of support vector machines to predict financial movement direction. SVM is a promising type of tool for financial forecasting • SVM is superior to the other individual classi1cation methods in forecasting weekly movement direction of NIKKEI 225 Index • Each method has its own strengths and weaknesses • The weakness of one method can be balanced by the strengths of another by achieving a systematic effect The combining model performs best among all the forecasting methods.
  • 25. 25 Thank you for your attention