SlideShare a Scribd company logo
1 of 29
Download to read offline
Predicting stock price movements based on newspaper articles
using a novel deep learning approach
Today Tomorrow
100110101
010101010
100110100
© 2018 Deloitte The Netherlands
• Intuitive explanation of
important concepts
• Accessible for non-techies
• Discussion of results
• No code
• No mathematics
• Not comprehensive
You will get to know what deep learning algorithms are and understand why they work
Today
‘Language is probably the hardest problem in science;
nobody really knows how it works,
nobody really knows where it came from
and yet we can all do it.
Michael Corballis
Emeritus Professor at the University of Auckland
TedX talk – The Origins and Evolution of Language: https://www.youtube.com/watch?v=nd5cklw6d6Q&t=95s
© 2018 Deloitte The Netherlands
Machine Learning
Field of study that gives the ability to the computer to self-learn without being explicitly programmed
(Input to machine learning models needs to be numeric!)
Deep learning
Computational models that are composed of multiple processing layers that allow to learn representations with
multiple levels of abstraction.
Natural Language Processing (NLP)
A branch of artificial intelligence that helps computers understand, interpret and manipulate human language.
Natural Language for Financial Forecasting (NLFF)
The domain that uses NLP approaches for financial forecasting.
Definitions
First steps in the world of artificial intelligence are taken by knowing its key concepts
Background domain
The ‘Natural Language for Financial Forecasting’ domain has been growing rapidly in
the last decade
Number of ‘NLFF’ publications in scientific journals
Problem
While stock price movements are known to be mostly influenced by news updates,
most financial companies only include stock price information in their predictive models.
By not including news updates, a huge opportunity is missed
© 2018 Deloitte The Netherlands
▪ Data preprocessing
▪ Explanation approach
➢ Word2Vec word embedding
➢ Convolutional neural network
➢ Recurrent neural network
➢ Sigmoid classification
• Results
• Future research
Agenda
Type of problem
Predicting stock price movements based on newspaper articles amounts to
classifying each article as being a positive/negative article
Approach
The approach, that combines convolutional- and recurrent layers, originates from the
NLP domain and has not been applied to the NLFF domain yet
Start: T articles
Transforming articles into
numerical representations
Deep learning approaches
to learn the algorithm
Final: Classification
Data preprocessing
Data from two separate sources were used to create a dataframe that contains both
textual- as well as stock information
Stock prices
Source: Yahoo Finance
News articles
Source: Lexis Nexis Academic
Embedding layer
Through a word embedding, textual data is represented numerically
Word2vec embedding
A statistical method for efficiently learning a standalone word embedding from a
collection of written texts
Intuitively: ‘King’ – ‘Man’ + ‘Woman’, yields a vector similar to ‘Queen’
Large -0.5117 0.9561 0.3594 0.5076 -0.7155 -0.6644 -0.6312 0.4146 0.1242 …
US -0.4742 -0.2551 0.6133 0.7608 -0.9923 -0.5035 0.894 0.815 -0.4761 …
Technology 0.6217 -0.9585 0.735 0.3087 0.0678 0.6126 0.9418 0.2878 0.4421 …
Companies 0.2931 0.6469 -0.0702 -0.7557 -0.3274 -0.7929 0.4023 0.9049 -0.6831 …
are -0.1175 -0.7925 0.1921 0.9312 -0.5084 0.4869 0.5165 0.1922 0.0133 …
investing -0.8552 -0.2882 0.3138 -0.8883 0.9069 -0.7656 0.1857 -0.6377 -0.7069 …
Example
300 columns
Concept: “show me your friends, and I’ll tell who you are.”
Convolutional layer
Convolutional neural networks perform well at recognizing objects/patterns
Intuition: Convolutional Neural Network (CNN)
CNNs learn to recognize features at different abstraction levels
Face
Hand
Eyes
Nose
Mouth
Ear
Human Face
1 layer 1 layer
Convolutional Neural Network applied on word embeddings
A filter (A) recognizes a pattern in the text (B). The convolutional output (C) represent
the text in terms of a feature
0.46 0.51 0.18 0.43 0.53
Recurrent layer
RNNs perform well at capturing long distant information (memory)
Intuition: Recurrent layer – Long Short Term Memory (LSTM)
To make decisions, a LSTM uses information from both the near as well as distant past
I start off with a broad
knowledge base
Not of all of my knowledge
is relevant so I forget some
Knowledge
Me preparing for the Big Data Expo
Knowledge
New, relevant information is
added to my knowledge
My presentation combines
both prior- as well as new
knowledgeExpo
Knowledge
p=(y|X)
p=(y|X)
Long Short Term Memory – Recurrent Neural Network
The cell state contains the current knowledge,
to and from which information is added and removed
Article input
(from convolutional layer)
Cell state
Forget gate
Input gate
Candidate values
Output gate Current knowledge (Cell state)
1. What can be forgotten from cell state? (Forget gate)
New information
2. What new information can be added to the cell state?
(Candidate values)
3. How much of the new information should be updated?
(Input gate)
Output article
4. What information needs to be outputted per article?
(Output gate)
Sigmoid classification
The output of a Sigmoid classifier is always a value between 0 and 1
Results
Combining a CNN and RNN, yields the best performance
Full model CNN RNN
Sentiment
analysis
Accuracy 56.25% 55.54% 54.82% 52.54%
Accuracy is calculated by dividing the total number of true predictions,
by the total number of predictions
Example dataset
The current dataset contains a lot of noise
Company: Alphabet
Article: ‘Eric Schmidt is executive chairman of Alphabet. not chief executive as incorrectly stated in a column
on November 23.’
Company: Tesla
Article: ‘Mars doesn't have an extradition treaty with the US. - Jim Chanos discussing Tesla with CNBC’
Company: Verizon
Article: ‘China surpassed the US to become the top recipient of foreign direct investment in 2014. The inflow to
the US fell by 60 per cent. primarily because of the Verizon pullout by Vodafone. Five of the top 10 FDI recipients
are developing markets.’
Company: Intel & Tesla
Article: ‘Microsoft and Intel's evolution from the PC to the data centre is proving painful. Uber has settled a key
class-action lawsuit. Tesla's chief has an idea for public transport. #techFT is a daily newsletter on technology.
media and telecoms. You can sign up here.’
Company: Netflix
Article: ‘Whether reaching millennial consumers who want to escape marketing messages. or 'cordcutting’
television viewers. who ditch cable and satellite subscriptions in favour of ad-free Netflix. advertisers are having
to work harder than ever to find their audience. Read the report’
Results literature
The results are similar to the results obtained in the literature
Accuracy Dataset Approach
Pang et al, 2018 53.2% Stock data LSTM
Matsubara et al, 2018 59.0% News articles Deep neural generative model
Huynh at al, 2017 59.98% News articles Combination of LSTM and GRU
Rumelhart et al, 2017 64.74% News articles RNN and self-trained word embedding
Selvin et al, 2017 55.9% Stock data LSTM and CNN
Future research
The current approach can be enhanced in multiple ways
Questions
Contact information
Please contact me for further inquiries
Emil Rijcken
Email: emil@cwi.nl
Linkedin: https://www.linkedin.com/in/emilrijcken/
Mobile: 06 53137886
© 2018 Deloitte The Netherlands
Appendix
Appendix
Problem complexity
Checking all solutions is unfeasible
Assuming:
- 2000 dimensions (e.g. there are 16 convolutional layers with 128 filters each)
- 10 options per dimension
- 31 860 000 000 000 000 calculations per second (fastest computer in the world)
Then: it takes approximately 1.99 x 101979
years (!) to calculate all possible solutions
Finding solution
Through trial and error, different solutions are proposed. The slope of the solution
determines how parameters are set
Example of solution space in 3D-space
1
0

More Related Content

What's hot

Continual Learning with Deep Architectures - Tutorial ICML 2021
Continual Learning with Deep Architectures - Tutorial ICML 2021Continual Learning with Deep Architectures - Tutorial ICML 2021
Continual Learning with Deep Architectures - Tutorial ICML 2021
Vincenzo Lomonaco
 
[PhDThesis2021] - Augmenting the knowledge pyramid with unconventional data a...
[PhDThesis2021] - Augmenting the knowledge pyramid with unconventional data a...[PhDThesis2021] - Augmenting the knowledge pyramid with unconventional data a...
[PhDThesis2021] - Augmenting the knowledge pyramid with unconventional data a...
University of Bologna
 
Lecture #1: Introduction to machine learning (ML)
Lecture #1: Introduction to machine learning (ML)Lecture #1: Introduction to machine learning (ML)
Lecture #1: Introduction to machine learning (ML)
butest
 

What's hot (20)

Continual Learning with Deep Architectures - Tutorial ICML 2021
Continual Learning with Deep Architectures - Tutorial ICML 2021Continual Learning with Deep Architectures - Tutorial ICML 2021
Continual Learning with Deep Architectures - Tutorial ICML 2021
 
Introduction to Machine Learning
Introduction to Machine LearningIntroduction to Machine Learning
Introduction to Machine Learning
 
Model evaluation in the land of deep learning
Model evaluation in the land of deep learningModel evaluation in the land of deep learning
Model evaluation in the land of deep learning
 
[PhDThesis2021] - Augmenting the knowledge pyramid with unconventional data a...
[PhDThesis2021] - Augmenting the knowledge pyramid with unconventional data a...[PhDThesis2021] - Augmenting the knowledge pyramid with unconventional data a...
[PhDThesis2021] - Augmenting the knowledge pyramid with unconventional data a...
 
Zero shot-learning: paper presentation
Zero shot-learning: paper presentationZero shot-learning: paper presentation
Zero shot-learning: paper presentation
 
Identification of Relevant Sections in Web Pages Using a Machine Learning App...
Identification of Relevant Sections in Web Pages Using a Machine Learning App...Identification of Relevant Sections in Web Pages Using a Machine Learning App...
Identification of Relevant Sections in Web Pages Using a Machine Learning App...
 
Topic modeling of marketing scientific papers: An experimental survey
Topic modeling of marketing scientific papers: An experimental surveyTopic modeling of marketing scientific papers: An experimental survey
Topic modeling of marketing scientific papers: An experimental survey
 
Machine learning (domingo's paper)
Machine learning (domingo's paper)Machine learning (domingo's paper)
Machine learning (domingo's paper)
 
A Few Useful Things to Know about Machine Learning
A Few Useful Things to Know about Machine LearningA Few Useful Things to Know about Machine Learning
A Few Useful Things to Know about Machine Learning
 
PWL Seattle #23 - A Few Useful Things to Know About Machine Learning
PWL Seattle #23 - A Few Useful Things to Know About Machine LearningPWL Seattle #23 - A Few Useful Things to Know About Machine Learning
PWL Seattle #23 - A Few Useful Things to Know About Machine Learning
 
Programming for data science in python
Programming for data science in pythonProgramming for data science in python
Programming for data science in python
 
"An Introduction to Machine Learning and How to Teach Machines to See," a Pre...
"An Introduction to Machine Learning and How to Teach Machines to See," a Pre..."An Introduction to Machine Learning and How to Teach Machines to See," a Pre...
"An Introduction to Machine Learning and How to Teach Machines to See," a Pre...
 
Design neural networks with meta learning
Design neural networks with meta learningDesign neural networks with meta learning
Design neural networks with meta learning
 
Model-Agnostic Meta-Learning for Fast Adaptation of Deep Networks
Model-Agnostic Meta-Learning for Fast Adaptation of Deep NetworksModel-Agnostic Meta-Learning for Fast Adaptation of Deep Networks
Model-Agnostic Meta-Learning for Fast Adaptation of Deep Networks
 
An Abstract Framework for Agent-Based Explanations in AI
An Abstract Framework for Agent-Based Explanations in AIAn Abstract Framework for Agent-Based Explanations in AI
An Abstract Framework for Agent-Based Explanations in AI
 
[DOLAP2019] Augmented Business Intelligence
[DOLAP2019] Augmented Business Intelligence[DOLAP2019] Augmented Business Intelligence
[DOLAP2019] Augmented Business Intelligence
 
Tensor Networks and Their Applications on Machine Learning
Tensor Networks and Their Applications on Machine LearningTensor Networks and Their Applications on Machine Learning
Tensor Networks and Their Applications on Machine Learning
 
From Story-Telling to Production
From Story-Telling to ProductionFrom Story-Telling to Production
From Story-Telling to Production
 
[ADBIS 2021] - Optimizing Execution Plans in a Multistore
[ADBIS 2021] - Optimizing Execution Plans in a Multistore[ADBIS 2021] - Optimizing Execution Plans in a Multistore
[ADBIS 2021] - Optimizing Execution Plans in a Multistore
 
Lecture #1: Introduction to machine learning (ML)
Lecture #1: Introduction to machine learning (ML)Lecture #1: Introduction to machine learning (ML)
Lecture #1: Introduction to machine learning (ML)
 

Similar to PREDICTING STOCK PRICE MOVEMENTS BASED ON NEWSPAPER ARTICLES USING A NOVEL DEEP LEARNING APPROACH - Big Data Expo 2019

Concept extraction from the web of things (3)
Concept extraction from the web of things (3)Concept extraction from the web of things (3)
Concept extraction from the web of things (3)
Amélie Gyrard
 
What is and how does Cloud computing workA.Statement Clou.docx
 What is and how does Cloud computing workA.Statement Clou.docx What is and how does Cloud computing workA.Statement Clou.docx
What is and how does Cloud computing workA.Statement Clou.docx
MARRY7
 
Download presentation source
Download presentation sourceDownload presentation source
Download presentation source
butest
 
Best Data Science Online Training in Hyderabad
  Best Data Science Online Training in Hyderabad  Best Data Science Online Training in Hyderabad
Best Data Science Online Training in Hyderabad
bharathtsofttech
 

Similar to PREDICTING STOCK PRICE MOVEMENTS BASED ON NEWSPAPER ARTICLES USING A NOVEL DEEP LEARNING APPROACH - Big Data Expo 2019 (20)

Concept extraction from the web of things (3)
Concept extraction from the web of things (3)Concept extraction from the web of things (3)
Concept extraction from the web of things (3)
 
Novi sad ai event 1-2018
Novi sad ai event 1-2018Novi sad ai event 1-2018
Novi sad ai event 1-2018
 
Discover How Scientific Data is Used for the Public Good with Natural Languag...
Discover How Scientific Data is Used for the Public Good with Natural Languag...Discover How Scientific Data is Used for the Public Good with Natural Languag...
Discover How Scientific Data is Used for the Public Good with Natural Languag...
 
How Can AI and IoT Power the Chemical Industry?
How Can AI and IoT Power the Chemical Industry?How Can AI and IoT Power the Chemical Industry?
How Can AI and IoT Power the Chemical Industry?
 
Machine Learning
Machine LearningMachine Learning
Machine Learning
 
On Big Data
On Big DataOn Big Data
On Big Data
 
What is and how does Cloud computing workA.Statement Clou.docx
 What is and how does Cloud computing workA.Statement Clou.docx What is and how does Cloud computing workA.Statement Clou.docx
What is and how does Cloud computing workA.Statement Clou.docx
 
Introduction to deep learning
Introduction to deep learningIntroduction to deep learning
Introduction to deep learning
 
Deep learning: Cutting through the Myths and Hype
Deep learning: Cutting through the Myths and HypeDeep learning: Cutting through the Myths and Hype
Deep learning: Cutting through the Myths and Hype
 
Deep Learning Demystified
Deep Learning DemystifiedDeep Learning Demystified
Deep Learning Demystified
 
Download presentation source
Download presentation sourceDownload presentation source
Download presentation source
 
SENTIMENT ANALYSIS FOR MOVIES REVIEWS DATASET USING DEEP LEARNING MODELS
SENTIMENT ANALYSIS FOR MOVIES REVIEWS DATASET USING DEEP LEARNING MODELSSENTIMENT ANALYSIS FOR MOVIES REVIEWS DATASET USING DEEP LEARNING MODELS
SENTIMENT ANALYSIS FOR MOVIES REVIEWS DATASET USING DEEP LEARNING MODELS
 
ExperTwin: An Alter Ego in Cyberspace for Knowledge Workers
ExperTwin: An Alter Ego in Cyberspace for Knowledge WorkersExperTwin: An Alter Ego in Cyberspace for Knowledge Workers
ExperTwin: An Alter Ego in Cyberspace for Knowledge Workers
 
M.Sc. Thesis Topics and Proposals @ Polimi Data Science Lab - 2024 - prof. Br...
M.Sc. Thesis Topics and Proposals @ Polimi Data Science Lab - 2024 - prof. Br...M.Sc. Thesis Topics and Proposals @ Polimi Data Science Lab - 2024 - prof. Br...
M.Sc. Thesis Topics and Proposals @ Polimi Data Science Lab - 2024 - prof. Br...
 
Best Data Science Online Training in Hyderabad
  Best Data Science Online Training in Hyderabad  Best Data Science Online Training in Hyderabad
Best Data Science Online Training in Hyderabad
 
Transfer Leaning Using Pytorch synopsis Minor project pptx
Transfer Leaning Using Pytorch  synopsis Minor project pptxTransfer Leaning Using Pytorch  synopsis Minor project pptx
Transfer Leaning Using Pytorch synopsis Minor project pptx
 
The Need for Explainable AI - Dorothea Wisemann
The Need for Explainable AI - Dorothea WisemannThe Need for Explainable AI - Dorothea Wisemann
The Need for Explainable AI - Dorothea Wisemann
 
Computation graphs - Tensorflow & CNTK
Computation graphs - Tensorflow & CNTKComputation graphs - Tensorflow & CNTK
Computation graphs - Tensorflow & CNTK
 
Microsoft Dryad
Microsoft DryadMicrosoft Dryad
Microsoft Dryad
 
“Real Time Machine Learning Architecture and Sentiment Analysis Applied to Fi...
“Real Time Machine Learning Architecture and Sentiment Analysis Applied to Fi...“Real Time Machine Learning Architecture and Sentiment Analysis Applied to Fi...
“Real Time Machine Learning Architecture and Sentiment Analysis Applied to Fi...
 

More from webwinkelvakdag

ING: Dirk Mulder
ING: Dirk MulderING: Dirk Mulder
ING: Dirk Mulder
webwinkelvakdag
 

More from webwinkelvakdag (20)

ISM eCompany: Sander Berlinski
ISM eCompany: Sander BerlinskiISM eCompany: Sander Berlinski
ISM eCompany: Sander Berlinski
 
Social Nomads - Lynn
Social Nomads - LynnSocial Nomads - Lynn
Social Nomads - Lynn
 
Thuiswinkel.org & Omoda: Alicja Van Ewijk
Thuiswinkel.org & Omoda: Alicja Van EwijkThuiswinkel.org & Omoda: Alicja Van Ewijk
Thuiswinkel.org & Omoda: Alicja Van Ewijk
 
Worldpay: Maria Prados
Worldpay: Maria PradosWorldpay: Maria Prados
Worldpay: Maria Prados
 
Van Moof: Simon Vreeman
Van Moof: Simon VreemanVan Moof: Simon Vreeman
Van Moof: Simon Vreeman
 
ANWB: Carolina van den Hoven & Margot van Leeuwen
ANWB: Carolina van den Hoven & Margot van LeeuwenANWB: Carolina van den Hoven & Margot van Leeuwen
ANWB: Carolina van den Hoven & Margot van Leeuwen
 
HEMA: Ilse Lankhorst, Bas Karsemeijer
HEMA: Ilse Lankhorst, Bas KarsemeijerHEMA: Ilse Lankhorst, Bas Karsemeijer
HEMA: Ilse Lankhorst, Bas Karsemeijer
 
ISM eCompany: Kees Beckeringh
ISM eCompany: Kees BeckeringhISM eCompany: Kees Beckeringh
ISM eCompany: Kees Beckeringh
 
ING: Dirk Mulder
ING: Dirk MulderING: Dirk Mulder
ING: Dirk Mulder
 
Martijn Kozijn: Jessica van Haaster & Martijn Leclaire
Martijn Kozijn: Jessica van Haaster & Martijn LeclaireMartijn Kozijn: Jessica van Haaster & Martijn Leclaire
Martijn Kozijn: Jessica van Haaster & Martijn Leclaire
 
ING: Dirk Mulder
ING: Dirk MulderING: Dirk Mulder
ING: Dirk Mulder
 
Cemex trescon: Marloe de Ruiter
Cemex trescon: Marloe de RuiterCemex trescon: Marloe de Ruiter
Cemex trescon: Marloe de Ruiter
 
LINDA.Foundation: Jocelyn Nassenstein-Brouwer
LINDA.Foundation: Jocelyn Nassenstein-BrouwerLINDA.Foundation: Jocelyn Nassenstein-Brouwer
LINDA.Foundation: Jocelyn Nassenstein-Brouwer
 
Maersk: Niek Minderhoud
Maersk: Niek MinderhoudMaersk: Niek Minderhoud
Maersk: Niek Minderhoud
 
Q&A: Brenda Hoekstra
Q&A: Brenda HoekstraQ&A: Brenda Hoekstra
Q&A: Brenda Hoekstra
 
Aanhangwagendirect & PI Marketing: Merin Eggink & Mascha Soors
Aanhangwagendirect & PI Marketing: Merin Eggink & Mascha SoorsAanhangwagendirect & PI Marketing: Merin Eggink & Mascha Soors
Aanhangwagendirect & PI Marketing: Merin Eggink & Mascha Soors
 
ISM eCompany: Ralph van Woensel
ISM eCompany: Ralph van WoenselISM eCompany: Ralph van Woensel
ISM eCompany: Ralph van Woensel
 
Lecot: Raf Maesen
Lecot: Raf MaesenLecot: Raf Maesen
Lecot: Raf Maesen
 
Lobbes: Berry de Snoo
Lobbes: Berry de SnooLobbes: Berry de Snoo
Lobbes: Berry de Snoo
 
ISM eCompany: Sander Lems
ISM eCompany: Sander LemsISM eCompany: Sander Lems
ISM eCompany: Sander Lems
 

Recently uploaded

Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
amitlee9823
 
Call Girls In Bellandur ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Bellandur ☎ 7737669865 🥵 Book Your One night StandCall Girls In Bellandur ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Bellandur ☎ 7737669865 🥵 Book Your One night Stand
amitlee9823
 
Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...
Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...
Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...
amitlee9823
 
Mg Road Call Girls Service: 🍓 7737669865 🍓 High Profile Model Escorts | Banga...
Mg Road Call Girls Service: 🍓 7737669865 🍓 High Profile Model Escorts | Banga...Mg Road Call Girls Service: 🍓 7737669865 🍓 High Profile Model Escorts | Banga...
Mg Road Call Girls Service: 🍓 7737669865 🍓 High Profile Model Escorts | Banga...
amitlee9823
 
Probability Grade 10 Third Quarter Lessons
Probability Grade 10 Third Quarter LessonsProbability Grade 10 Third Quarter Lessons
Probability Grade 10 Third Quarter Lessons
JoseMangaJr1
 
Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...
amitlee9823
 
Call Girls Jalahalli Just Call 👗 7737669865 👗 Top Class Call Girl Service Ban...
Call Girls Jalahalli Just Call 👗 7737669865 👗 Top Class Call Girl Service Ban...Call Girls Jalahalli Just Call 👗 7737669865 👗 Top Class Call Girl Service Ban...
Call Girls Jalahalli Just Call 👗 7737669865 👗 Top Class Call Girl Service Ban...
amitlee9823
 
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al BarshaAl Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
AroojKhan71
 
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
amitlee9823
 

Recently uploaded (20)

Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
 
ALSO dropshipping via API with DroFx.pptx
ALSO dropshipping via API with DroFx.pptxALSO dropshipping via API with DroFx.pptx
ALSO dropshipping via API with DroFx.pptx
 
Midocean dropshipping via API with DroFx
Midocean dropshipping via API with DroFxMidocean dropshipping via API with DroFx
Midocean dropshipping via API with DroFx
 
Generative AI on Enterprise Cloud with NiFi and Milvus
Generative AI on Enterprise Cloud with NiFi and MilvusGenerative AI on Enterprise Cloud with NiFi and Milvus
Generative AI on Enterprise Cloud with NiFi and Milvus
 
Edukaciniai dropshipping via API with DroFx
Edukaciniai dropshipping via API with DroFxEdukaciniai dropshipping via API with DroFx
Edukaciniai dropshipping via API with DroFx
 
Call Girls In Bellandur ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Bellandur ☎ 7737669865 🥵 Book Your One night StandCall Girls In Bellandur ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Bellandur ☎ 7737669865 🥵 Book Your One night Stand
 
Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...
Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...
Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...
 
VidaXL dropshipping via API with DroFx.pptx
VidaXL dropshipping via API with DroFx.pptxVidaXL dropshipping via API with DroFx.pptx
VidaXL dropshipping via API with DroFx.pptx
 
Mg Road Call Girls Service: 🍓 7737669865 🍓 High Profile Model Escorts | Banga...
Mg Road Call Girls Service: 🍓 7737669865 🍓 High Profile Model Escorts | Banga...Mg Road Call Girls Service: 🍓 7737669865 🍓 High Profile Model Escorts | Banga...
Mg Road Call Girls Service: 🍓 7737669865 🍓 High Profile Model Escorts | Banga...
 
Probability Grade 10 Third Quarter Lessons
Probability Grade 10 Third Quarter LessonsProbability Grade 10 Third Quarter Lessons
Probability Grade 10 Third Quarter Lessons
 
(NEHA) Call Girls Katra Call Now 8617697112 Katra Escorts 24x7
(NEHA) Call Girls Katra Call Now 8617697112 Katra Escorts 24x7(NEHA) Call Girls Katra Call Now 8617697112 Katra Escorts 24x7
(NEHA) Call Girls Katra Call Now 8617697112 Katra Escorts 24x7
 
Invezz.com - Grow your wealth with trading signals
Invezz.com - Grow your wealth with trading signalsInvezz.com - Grow your wealth with trading signals
Invezz.com - Grow your wealth with trading signals
 
Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...
 
Week-01-2.ppt BBB human Computer interaction
Week-01-2.ppt BBB human Computer interactionWeek-01-2.ppt BBB human Computer interaction
Week-01-2.ppt BBB human Computer interaction
 
April 2024 - Crypto Market Report's Analysis
April 2024 - Crypto Market Report's AnalysisApril 2024 - Crypto Market Report's Analysis
April 2024 - Crypto Market Report's Analysis
 
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 nightCheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
 
Call Girls Jalahalli Just Call 👗 7737669865 👗 Top Class Call Girl Service Ban...
Call Girls Jalahalli Just Call 👗 7737669865 👗 Top Class Call Girl Service Ban...Call Girls Jalahalli Just Call 👗 7737669865 👗 Top Class Call Girl Service Ban...
Call Girls Jalahalli Just Call 👗 7737669865 👗 Top Class Call Girl Service Ban...
 
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al BarshaAl Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
 
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
 
Capstone Project on IBM Data Analytics Program
Capstone Project on IBM Data Analytics ProgramCapstone Project on IBM Data Analytics Program
Capstone Project on IBM Data Analytics Program
 

PREDICTING STOCK PRICE MOVEMENTS BASED ON NEWSPAPER ARTICLES USING A NOVEL DEEP LEARNING APPROACH - Big Data Expo 2019

  • 1. Predicting stock price movements based on newspaper articles using a novel deep learning approach Today Tomorrow 100110101 010101010 100110100
  • 2. © 2018 Deloitte The Netherlands • Intuitive explanation of important concepts • Accessible for non-techies • Discussion of results • No code • No mathematics • Not comprehensive You will get to know what deep learning algorithms are and understand why they work Today
  • 3. ‘Language is probably the hardest problem in science; nobody really knows how it works, nobody really knows where it came from and yet we can all do it. Michael Corballis Emeritus Professor at the University of Auckland TedX talk – The Origins and Evolution of Language: https://www.youtube.com/watch?v=nd5cklw6d6Q&t=95s
  • 4.
  • 5. © 2018 Deloitte The Netherlands Machine Learning Field of study that gives the ability to the computer to self-learn without being explicitly programmed (Input to machine learning models needs to be numeric!) Deep learning Computational models that are composed of multiple processing layers that allow to learn representations with multiple levels of abstraction. Natural Language Processing (NLP) A branch of artificial intelligence that helps computers understand, interpret and manipulate human language. Natural Language for Financial Forecasting (NLFF) The domain that uses NLP approaches for financial forecasting. Definitions First steps in the world of artificial intelligence are taken by knowing its key concepts
  • 6. Background domain The ‘Natural Language for Financial Forecasting’ domain has been growing rapidly in the last decade Number of ‘NLFF’ publications in scientific journals
  • 7. Problem While stock price movements are known to be mostly influenced by news updates, most financial companies only include stock price information in their predictive models. By not including news updates, a huge opportunity is missed
  • 8. © 2018 Deloitte The Netherlands ▪ Data preprocessing ▪ Explanation approach ➢ Word2Vec word embedding ➢ Convolutional neural network ➢ Recurrent neural network ➢ Sigmoid classification • Results • Future research Agenda
  • 9. Type of problem Predicting stock price movements based on newspaper articles amounts to classifying each article as being a positive/negative article
  • 10. Approach The approach, that combines convolutional- and recurrent layers, originates from the NLP domain and has not been applied to the NLFF domain yet Start: T articles Transforming articles into numerical representations Deep learning approaches to learn the algorithm Final: Classification
  • 11. Data preprocessing Data from two separate sources were used to create a dataframe that contains both textual- as well as stock information Stock prices Source: Yahoo Finance News articles Source: Lexis Nexis Academic
  • 12. Embedding layer Through a word embedding, textual data is represented numerically
  • 13. Word2vec embedding A statistical method for efficiently learning a standalone word embedding from a collection of written texts Intuitively: ‘King’ – ‘Man’ + ‘Woman’, yields a vector similar to ‘Queen’ Large -0.5117 0.9561 0.3594 0.5076 -0.7155 -0.6644 -0.6312 0.4146 0.1242 … US -0.4742 -0.2551 0.6133 0.7608 -0.9923 -0.5035 0.894 0.815 -0.4761 … Technology 0.6217 -0.9585 0.735 0.3087 0.0678 0.6126 0.9418 0.2878 0.4421 … Companies 0.2931 0.6469 -0.0702 -0.7557 -0.3274 -0.7929 0.4023 0.9049 -0.6831 … are -0.1175 -0.7925 0.1921 0.9312 -0.5084 0.4869 0.5165 0.1922 0.0133 … investing -0.8552 -0.2882 0.3138 -0.8883 0.9069 -0.7656 0.1857 -0.6377 -0.7069 … Example 300 columns Concept: “show me your friends, and I’ll tell who you are.”
  • 14. Convolutional layer Convolutional neural networks perform well at recognizing objects/patterns
  • 15. Intuition: Convolutional Neural Network (CNN) CNNs learn to recognize features at different abstraction levels Face Hand Eyes Nose Mouth Ear Human Face 1 layer 1 layer
  • 16. Convolutional Neural Network applied on word embeddings A filter (A) recognizes a pattern in the text (B). The convolutional output (C) represent the text in terms of a feature 0.46 0.51 0.18 0.43 0.53
  • 17. Recurrent layer RNNs perform well at capturing long distant information (memory)
  • 18. Intuition: Recurrent layer – Long Short Term Memory (LSTM) To make decisions, a LSTM uses information from both the near as well as distant past I start off with a broad knowledge base Not of all of my knowledge is relevant so I forget some Knowledge Me preparing for the Big Data Expo Knowledge New, relevant information is added to my knowledge My presentation combines both prior- as well as new knowledgeExpo Knowledge p=(y|X) p=(y|X)
  • 19. Long Short Term Memory – Recurrent Neural Network The cell state contains the current knowledge, to and from which information is added and removed Article input (from convolutional layer) Cell state Forget gate Input gate Candidate values Output gate Current knowledge (Cell state) 1. What can be forgotten from cell state? (Forget gate) New information 2. What new information can be added to the cell state? (Candidate values) 3. How much of the new information should be updated? (Input gate) Output article 4. What information needs to be outputted per article? (Output gate)
  • 20. Sigmoid classification The output of a Sigmoid classifier is always a value between 0 and 1
  • 21. Results Combining a CNN and RNN, yields the best performance Full model CNN RNN Sentiment analysis Accuracy 56.25% 55.54% 54.82% 52.54% Accuracy is calculated by dividing the total number of true predictions, by the total number of predictions
  • 22. Example dataset The current dataset contains a lot of noise Company: Alphabet Article: ‘Eric Schmidt is executive chairman of Alphabet. not chief executive as incorrectly stated in a column on November 23.’ Company: Tesla Article: ‘Mars doesn't have an extradition treaty with the US. - Jim Chanos discussing Tesla with CNBC’ Company: Verizon Article: ‘China surpassed the US to become the top recipient of foreign direct investment in 2014. The inflow to the US fell by 60 per cent. primarily because of the Verizon pullout by Vodafone. Five of the top 10 FDI recipients are developing markets.’ Company: Intel & Tesla Article: ‘Microsoft and Intel's evolution from the PC to the data centre is proving painful. Uber has settled a key class-action lawsuit. Tesla's chief has an idea for public transport. #techFT is a daily newsletter on technology. media and telecoms. You can sign up here.’ Company: Netflix Article: ‘Whether reaching millennial consumers who want to escape marketing messages. or 'cordcutting’ television viewers. who ditch cable and satellite subscriptions in favour of ad-free Netflix. advertisers are having to work harder than ever to find their audience. Read the report’
  • 23. Results literature The results are similar to the results obtained in the literature Accuracy Dataset Approach Pang et al, 2018 53.2% Stock data LSTM Matsubara et al, 2018 59.0% News articles Deep neural generative model Huynh at al, 2017 59.98% News articles Combination of LSTM and GRU Rumelhart et al, 2017 64.74% News articles RNN and self-trained word embedding Selvin et al, 2017 55.9% Stock data LSTM and CNN
  • 24. Future research The current approach can be enhanced in multiple ways
  • 26. Contact information Please contact me for further inquiries Emil Rijcken Email: emil@cwi.nl Linkedin: https://www.linkedin.com/in/emilrijcken/ Mobile: 06 53137886
  • 27. © 2018 Deloitte The Netherlands Appendix Appendix
  • 28. Problem complexity Checking all solutions is unfeasible Assuming: - 2000 dimensions (e.g. there are 16 convolutional layers with 128 filters each) - 10 options per dimension - 31 860 000 000 000 000 calculations per second (fastest computer in the world) Then: it takes approximately 1.99 x 101979 years (!) to calculate all possible solutions
  • 29. Finding solution Through trial and error, different solutions are proposed. The slope of the solution determines how parameters are set Example of solution space in 3D-space 1 0