SlideShare a Scribd company logo
1 of 29
Data Preprocessing section header
• Some stock image here
Correlation Analysis
• Vraiable that were co related to each other and was redundant was
removed.
• E.g
• Payout ratio related with revenue per share
• Gross profit related with profit margin
Missing Values imputation
• Missing value of a given stock is replaced by the Average figure of that
variable for that stock. Step 1
• Step 2. remaining Missing values of a stock were replaced by the
Average figure of that variable for given industry.
• Step 3. still remaining Missing values of a stock were replaced by the
Average figure of that variable for given Sector
• Stock level avgs , industry level avg, sector level avg.
Missing Values imputation
• Missing value of a given stock is replaced by the Average figure of that
variable for that stock. Step 1
• Step 2. remaining Missing values of a stock were replaced by the
Average figure of that variable for given industry.
• Step 3. still remaining Missing values of a stock were replaced by the
Average figure of that variable for given Sector
• Stock level avgs , industry level avg, sector level avg.
Missing Values imputation
• Missing value of a given stock is replaced by the Average figure of that
variable for that stock. Step 1
• Step 2. remaining Missing values of a stock were replaced by the
Average figure of that variable for given industry.
• Step 3. still remaining Missing values of a stock were replaced by the
Average figure of that variable for given Sector
• Stock level avgs , industry level avg, sector level avg.
Missing Values imputation
• Missing value of a given stock is replaced by the Average figure of that
variable for that stock. Step 1
• Step 2. remaining Missing values of a stock were replaced by the
Average figure of that variable for given industry.
• Step 3. still remaining Missing values of a stock were replaced by the
Average figure of that variable for given Sector
• Stock level avgs , industry level avg, sector level avg.
Missing Values imputation
• Missing value of a given stock is replaced by the Average figure of that
variable for that stock. Step 1
• Step 2. remaining Missing values of a stock were replaced by the
Average figure of that variable for given industry.
• Step 3. still remaining Missing values of a stock were replaced by the
Average figure of that variable for given Sector
• Stock level avgs , industry level avg, sector level avg.
Data Transformation
• Encoded categorical features to make them fit for ML Algo
• Converted stock price into three levels
increase
decrease
remains same
•
Data Transformation
• Encoded categorical features to make them fit for ML Algo
• Converted stock price into three levels
increase
decrease
remains same
•
Analysis section header
• Stock image
Building Decision tree
• Simple decision tree
accuracy on test data = 44.5
accuracy 10 fold CV = 44.8
Building Decision tree
• Boosting approach (EdaBoost)
base estimator is DT.
Number of estimators are 101
accuracy on test data = 50.9
accuracy 10 fold CV = 49.9
Is accuracy good in short or long term ?
• Data is divided into four parts
• Checked after quarter . Data points 4011
• Checked after six months. DP 1349
• Checked after year. DP 1156
• Checked after more than a year DP. 1155
Analysis – within a quarter
• Simple decision tree
• accuracy on test data = 37.2
• Running bagging decision tree
• accuracy on test data = 41.2
accuracy 10 fold CV = 44.6
Analysis – after six months
• Simple decision tree
• accuracy on test data = 52.9
• Running bagging decision tree
• accuracy on test data = 62.9
accuracy 10 fold CV = 60.1
Analysis – within one year
• Simple decision tree
• accuracy on test data = 51.3
• Running bagging decision tree
• accuracy on test data = 61.2
accuracy 10 fold CV = 59.9
Analysis – after one year
• Simple decision tree
• accuracy on test data = 64.9
• Running bagging decision tree
• accuracy on test data = 72.7
accuracy 10 fold CV = 70.8
Analysis – after one year
• Simple decision tree
• accuracy on test data = 64.9
• Running bagging decision tree
• accuracy on test data = 72.7
accuracy 10 fold CV = 70.8

More Related Content

Similar to Predicting stock price movement Using Machine Learning

Similar to Predicting stock price movement Using Machine Learning (20)

Six Sigma Overview
Six Sigma OverviewSix Sigma Overview
Six Sigma Overview
 
crossvalidation.pptx
crossvalidation.pptxcrossvalidation.pptx
crossvalidation.pptx
 
MACHINE LEARNING YEAR DL SECOND PART.pptx
MACHINE LEARNING YEAR DL SECOND PART.pptxMACHINE LEARNING YEAR DL SECOND PART.pptx
MACHINE LEARNING YEAR DL SECOND PART.pptx
 
Application for Lecturer Presentation
Application for Lecturer PresentationApplication for Lecturer Presentation
Application for Lecturer Presentation
 
03 machine learning workflow v2
03 machine learning workflow v203 machine learning workflow v2
03 machine learning workflow v2
 
Exploratory factor analysis
Exploratory factor analysisExploratory factor analysis
Exploratory factor analysis
 
Tpm 02
Tpm 02Tpm 02
Tpm 02
 
forecast.ppt
forecast.pptforecast.ppt
forecast.ppt
 
16 descriptive statistics
16 descriptive statistics16 descriptive statistics
16 descriptive statistics
 
addressing tim/quality trade-off in view maintenance
addressing tim/quality trade-off in view maintenanceaddressing tim/quality trade-off in view maintenance
addressing tim/quality trade-off in view maintenance
 
Intro to ml_2021
Intro to ml_2021Intro to ml_2021
Intro to ml_2021
 
Practical Tools for Measurement Systems Analysis
Practical Tools for Measurement Systems AnalysisPractical Tools for Measurement Systems Analysis
Practical Tools for Measurement Systems Analysis
 
PCA.pptx
PCA.pptxPCA.pptx
PCA.pptx
 
BI PPT Finale
BI PPT FinaleBI PPT Finale
BI PPT Finale
 
Dimensionality Reduction.pptx
Dimensionality Reduction.pptxDimensionality Reduction.pptx
Dimensionality Reduction.pptx
 
Normal Distribution
Normal DistributionNormal Distribution
Normal Distribution
 
Software Testing
Software Testing Software Testing
Software Testing
 
Data Preparation and Processing
Data Preparation and ProcessingData Preparation and Processing
Data Preparation and Processing
 
Attribute MSA
Attribute MSAAttribute MSA
Attribute MSA
 
Attribute MSA
Attribute MSA Attribute MSA
Attribute MSA
 

Recently uploaded

VIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service Amravati
VIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service AmravatiVIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service Amravati
VIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service AmravatiSuhani Kapoor
 
B2 Creative Industry Response Evaluation.docx
B2 Creative Industry Response Evaluation.docxB2 Creative Industry Response Evaluation.docx
B2 Creative Industry Response Evaluation.docxStephen266013
 
04242024_CCC TUG_Joins and Relationships
04242024_CCC TUG_Joins and Relationships04242024_CCC TUG_Joins and Relationships
04242024_CCC TUG_Joins and Relationshipsccctableauusergroup
 
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdfMarket Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdfRachmat Ramadhan H
 
RA-11058_IRR-COMPRESS Do 198 series of 1998
RA-11058_IRR-COMPRESS Do 198 series of 1998RA-11058_IRR-COMPRESS Do 198 series of 1998
RA-11058_IRR-COMPRESS Do 198 series of 1998YohFuh
 
Beautiful Sapna Vip Call Girls Hauz Khas 9711199012 Call /Whatsapps
Beautiful Sapna Vip  Call Girls Hauz Khas 9711199012 Call /WhatsappsBeautiful Sapna Vip  Call Girls Hauz Khas 9711199012 Call /Whatsapps
Beautiful Sapna Vip Call Girls Hauz Khas 9711199012 Call /Whatsappssapnasaifi408
 
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip CallDelhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Callshivangimorya083
 
Ravak dropshipping via API with DroFx.pptx
Ravak dropshipping via API with DroFx.pptxRavak dropshipping via API with DroFx.pptx
Ravak dropshipping via API with DroFx.pptxolyaivanovalion
 
Ukraine War presentation: KNOW THE BASICS
Ukraine War presentation: KNOW THE BASICSUkraine War presentation: KNOW THE BASICS
Ukraine War presentation: KNOW THE BASICSAishani27
 
Unveiling Insights: The Role of a Data Analyst
Unveiling Insights: The Role of a Data AnalystUnveiling Insights: The Role of a Data Analyst
Unveiling Insights: The Role of a Data AnalystSamantha Rae Coolbeth
 
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130Suhani Kapoor
 
定制英国白金汉大学毕业证(UCB毕业证书) 成绩单原版一比一
定制英国白金汉大学毕业证(UCB毕业证书)																			成绩单原版一比一定制英国白金汉大学毕业证(UCB毕业证书)																			成绩单原版一比一
定制英国白金汉大学毕业证(UCB毕业证书) 成绩单原版一比一ffjhghh
 
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.ppt
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.pptdokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.ppt
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.pptSonatrach
 
Carero dropshipping via API with DroFx.pptx
Carero dropshipping via API with DroFx.pptxCarero dropshipping via API with DroFx.pptx
Carero dropshipping via API with DroFx.pptxolyaivanovalion
 
Midocean dropshipping via API with DroFx
Midocean dropshipping via API with DroFxMidocean dropshipping via API with DroFx
Midocean dropshipping via API with DroFxolyaivanovalion
 
Dubai Call Girls Wifey O52&786472 Call Girls Dubai
Dubai Call Girls Wifey O52&786472 Call Girls DubaiDubai Call Girls Wifey O52&786472 Call Girls Dubai
Dubai Call Girls Wifey O52&786472 Call Girls Dubaihf8803863
 
VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...
VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...
VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...Suhani Kapoor
 
100-Concepts-of-AI by Anupama Kate .pptx
100-Concepts-of-AI by Anupama Kate .pptx100-Concepts-of-AI by Anupama Kate .pptx
100-Concepts-of-AI by Anupama Kate .pptxAnupama Kate
 
Week-01-2.ppt BBB human Computer interaction
Week-01-2.ppt BBB human Computer interactionWeek-01-2.ppt BBB human Computer interaction
Week-01-2.ppt BBB human Computer interactionfulawalesam
 
Brighton SEO | April 2024 | Data Storytelling
Brighton SEO | April 2024 | Data StorytellingBrighton SEO | April 2024 | Data Storytelling
Brighton SEO | April 2024 | Data StorytellingNeil Barnes
 

Recently uploaded (20)

VIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service Amravati
VIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service AmravatiVIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service Amravati
VIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service Amravati
 
B2 Creative Industry Response Evaluation.docx
B2 Creative Industry Response Evaluation.docxB2 Creative Industry Response Evaluation.docx
B2 Creative Industry Response Evaluation.docx
 
04242024_CCC TUG_Joins and Relationships
04242024_CCC TUG_Joins and Relationships04242024_CCC TUG_Joins and Relationships
04242024_CCC TUG_Joins and Relationships
 
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdfMarket Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
 
RA-11058_IRR-COMPRESS Do 198 series of 1998
RA-11058_IRR-COMPRESS Do 198 series of 1998RA-11058_IRR-COMPRESS Do 198 series of 1998
RA-11058_IRR-COMPRESS Do 198 series of 1998
 
Beautiful Sapna Vip Call Girls Hauz Khas 9711199012 Call /Whatsapps
Beautiful Sapna Vip  Call Girls Hauz Khas 9711199012 Call /WhatsappsBeautiful Sapna Vip  Call Girls Hauz Khas 9711199012 Call /Whatsapps
Beautiful Sapna Vip Call Girls Hauz Khas 9711199012 Call /Whatsapps
 
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip CallDelhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
 
Ravak dropshipping via API with DroFx.pptx
Ravak dropshipping via API with DroFx.pptxRavak dropshipping via API with DroFx.pptx
Ravak dropshipping via API with DroFx.pptx
 
Ukraine War presentation: KNOW THE BASICS
Ukraine War presentation: KNOW THE BASICSUkraine War presentation: KNOW THE BASICS
Ukraine War presentation: KNOW THE BASICS
 
Unveiling Insights: The Role of a Data Analyst
Unveiling Insights: The Role of a Data AnalystUnveiling Insights: The Role of a Data Analyst
Unveiling Insights: The Role of a Data Analyst
 
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130
 
定制英国白金汉大学毕业证(UCB毕业证书) 成绩单原版一比一
定制英国白金汉大学毕业证(UCB毕业证书)																			成绩单原版一比一定制英国白金汉大学毕业证(UCB毕业证书)																			成绩单原版一比一
定制英国白金汉大学毕业证(UCB毕业证书) 成绩单原版一比一
 
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.ppt
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.pptdokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.ppt
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.ppt
 
Carero dropshipping via API with DroFx.pptx
Carero dropshipping via API with DroFx.pptxCarero dropshipping via API with DroFx.pptx
Carero dropshipping via API with DroFx.pptx
 
Midocean dropshipping via API with DroFx
Midocean dropshipping via API with DroFxMidocean dropshipping via API with DroFx
Midocean dropshipping via API with DroFx
 
Dubai Call Girls Wifey O52&786472 Call Girls Dubai
Dubai Call Girls Wifey O52&786472 Call Girls DubaiDubai Call Girls Wifey O52&786472 Call Girls Dubai
Dubai Call Girls Wifey O52&786472 Call Girls Dubai
 
VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...
VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...
VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...
 
100-Concepts-of-AI by Anupama Kate .pptx
100-Concepts-of-AI by Anupama Kate .pptx100-Concepts-of-AI by Anupama Kate .pptx
100-Concepts-of-AI by Anupama Kate .pptx
 
Week-01-2.ppt BBB human Computer interaction
Week-01-2.ppt BBB human Computer interactionWeek-01-2.ppt BBB human Computer interaction
Week-01-2.ppt BBB human Computer interaction
 
Brighton SEO | April 2024 | Data Storytelling
Brighton SEO | April 2024 | Data StorytellingBrighton SEO | April 2024 | Data Storytelling
Brighton SEO | April 2024 | Data Storytelling
 

Predicting stock price movement Using Machine Learning

  • 1.
  • 2.
  • 3.
  • 4.
  • 5.
  • 6.
  • 7.
  • 8.
  • 9.
  • 10.
  • 11.
  • 12. Data Preprocessing section header • Some stock image here
  • 13. Correlation Analysis • Vraiable that were co related to each other and was redundant was removed. • E.g • Payout ratio related with revenue per share • Gross profit related with profit margin
  • 14. Missing Values imputation • Missing value of a given stock is replaced by the Average figure of that variable for that stock. Step 1 • Step 2. remaining Missing values of a stock were replaced by the Average figure of that variable for given industry. • Step 3. still remaining Missing values of a stock were replaced by the Average figure of that variable for given Sector • Stock level avgs , industry level avg, sector level avg.
  • 15. Missing Values imputation • Missing value of a given stock is replaced by the Average figure of that variable for that stock. Step 1 • Step 2. remaining Missing values of a stock were replaced by the Average figure of that variable for given industry. • Step 3. still remaining Missing values of a stock were replaced by the Average figure of that variable for given Sector • Stock level avgs , industry level avg, sector level avg.
  • 16. Missing Values imputation • Missing value of a given stock is replaced by the Average figure of that variable for that stock. Step 1 • Step 2. remaining Missing values of a stock were replaced by the Average figure of that variable for given industry. • Step 3. still remaining Missing values of a stock were replaced by the Average figure of that variable for given Sector • Stock level avgs , industry level avg, sector level avg.
  • 17. Missing Values imputation • Missing value of a given stock is replaced by the Average figure of that variable for that stock. Step 1 • Step 2. remaining Missing values of a stock were replaced by the Average figure of that variable for given industry. • Step 3. still remaining Missing values of a stock were replaced by the Average figure of that variable for given Sector • Stock level avgs , industry level avg, sector level avg.
  • 18. Missing Values imputation • Missing value of a given stock is replaced by the Average figure of that variable for that stock. Step 1 • Step 2. remaining Missing values of a stock were replaced by the Average figure of that variable for given industry. • Step 3. still remaining Missing values of a stock were replaced by the Average figure of that variable for given Sector • Stock level avgs , industry level avg, sector level avg.
  • 19. Data Transformation • Encoded categorical features to make them fit for ML Algo • Converted stock price into three levels increase decrease remains same •
  • 20. Data Transformation • Encoded categorical features to make them fit for ML Algo • Converted stock price into three levels increase decrease remains same •
  • 22. Building Decision tree • Simple decision tree accuracy on test data = 44.5 accuracy 10 fold CV = 44.8
  • 23. Building Decision tree • Boosting approach (EdaBoost) base estimator is DT. Number of estimators are 101 accuracy on test data = 50.9 accuracy 10 fold CV = 49.9
  • 24. Is accuracy good in short or long term ? • Data is divided into four parts • Checked after quarter . Data points 4011 • Checked after six months. DP 1349 • Checked after year. DP 1156 • Checked after more than a year DP. 1155
  • 25. Analysis – within a quarter • Simple decision tree • accuracy on test data = 37.2 • Running bagging decision tree • accuracy on test data = 41.2 accuracy 10 fold CV = 44.6
  • 26. Analysis – after six months • Simple decision tree • accuracy on test data = 52.9 • Running bagging decision tree • accuracy on test data = 62.9 accuracy 10 fold CV = 60.1
  • 27. Analysis – within one year • Simple decision tree • accuracy on test data = 51.3 • Running bagging decision tree • accuracy on test data = 61.2 accuracy 10 fold CV = 59.9
  • 28. Analysis – after one year • Simple decision tree • accuracy on test data = 64.9 • Running bagging decision tree • accuracy on test data = 72.7 accuracy 10 fold CV = 70.8
  • 29. Analysis – after one year • Simple decision tree • accuracy on test data = 64.9 • Running bagging decision tree • accuracy on test data = 72.7 accuracy 10 fold CV = 70.8