Risk-Aversion, Risk-Premium and Utility TheoryAshwin Rao
This lecture helps understand the concepts of Risk-Aversion and Risk-Premium viewed from the lens of Utility Theory. These are foundational economic concepts used widely in Financial applications - Portfolio problems and Pricing problems, to name a couple.
This presentations includes the basic fundamentals of time series data forecasting. It starts with basic naive, regression models and then explains advanced ARIMA models.
Risk-Aversion, Risk-Premium and Utility TheoryAshwin Rao
This lecture helps understand the concepts of Risk-Aversion and Risk-Premium viewed from the lens of Utility Theory. These are foundational economic concepts used widely in Financial applications - Portfolio problems and Pricing problems, to name a couple.
This presentations includes the basic fundamentals of time series data forecasting. It starts with basic naive, regression models and then explains advanced ARIMA models.
Applying Deep Learning to Enhance Momentum Trading Strategies in StocksLawrence Takeuchi
Contact author: larrytakeuchi@gmail.com
Abstract
We use an autoencoder composed of stacked restricted Boltzmann machines to extract
features from the history of individual stock prices. Our model is able to discover an enhanced version of the momentum effect in stocks without extensive hand-engineering of input features and deliver an annualized return of 45.93% over the 1990-2009 test period
versus 10.53% for basic momentum.
2. Elemental Economics - Mineral demand.pdfNeal Brewster
After this second you should be able to: Explain the main determinants of demand for any mineral product, and their relative importance; recognise and explain how demand for any product is likely to change with economic activity; recognise and explain the roles of technology and relative prices in influencing demand; be able to explain the differences between the rates of growth of demand for different products.
The secret way to sell pi coins effortlessly.DOT TECH
Well as we all know pi isn't launched yet. But you can still sell your pi coins effortlessly because some whales in China are interested in holding massive pi coins. And they are willing to pay good money for it. If you are interested in selling I will leave a contact for you. Just telegram this number below. I sold about 3000 pi coins to him and he paid me immediately.
Telegram: @Pi_vendor_247
5 Tips for Creating Standard Financial ReportsEasyReports
Well-crafted financial reports serve as vital tools for decision-making and transparency within an organization. By following the undermentioned tips, you can create standardized financial reports that effectively communicate your company's financial health and performance to stakeholders.
This presentation poster infographic delves into the multifaceted impacts of globalization through the lens of Nike, a prominent global brand. It explores how globalization has reshaped Nike's supply chain, marketing strategies, and cultural influence worldwide, examining both the benefits and challenges associated with its global expansion.
PPrreesseenntteedd bbyy:: GGrroouupp 66
GGlloobbaalliizzaattiioonn
o f
PP
oo
ll
yy
ee
ss
tt
ee
rr
RR
uu
bb
bb
ee
rr
EE
tt
hh
yy
ll
ee
nn
ee
VV
ii
nn
yy
ll
AA
cc
ee
tt
aa
tt
ee
GG
ee
nn
uu
ii
nn
ee
LL
ee
aa
tt
hh
ee
rr
SS
yy
nn
tt
hh
ee
tt
ii
cc
LL
ee
aa
tt
hh
ee
rr
CC
oo
tt
tt
oo
nn
C
o
u
n
t
r
i
e
s
I
n
v
o
l
v
e
d
Ni
k
e
h
a
s
m
o
r
e
t
h
a
n
7
0
0
s
h
o
p
s
i
n
c
o
n
t
r
a
c
t
w
i
t
h
w
o
r
l
d
w
i
d
e,
w
h
e
r
e
i
n
t
h
e
i
r
offi
c
e
s
a
n
d
i
n
d
e
p
e
n
d
e
n
t
fa
c
t
o
r
y
o
u
t
l
e
t
s
a
r
e
fo
u
n
d
w
i
t
h
i
n
t
h
e
p
r
e
m
i
s
e
s
of
ap
p
r
o
x
i
m
a
t
e
l
y
4
5
c
o
u
n
t
r
i
e
s.
AAuussttrraalliiaa
China
India
IInnddoonneessiiaa
TThhaaiillaanndd
TTuurrkkeeyy
USA
VViieettnnaamm
NNiikkee SSuuppppllyy CChhaaiinn
RRuubbbbeerr,, FFaabbrriicc
aanndd ootthheerr rraaww
mmaatteerriiaallss
Shoe
MMaannuuffaaccttuurriinngg
aanndd AAsssseemmbbllyy
MMaarrkkeettiinngg
SSppoorrttiinngg ggooooddss,,
ddeevveellooppmmeenntt
aanndd SShhooee ssttoorreess
OOnnlliinnee,, CCaattaalloogg
aanndd ootthheerr rreettaaiill
NNiikkee bbrraannddeedd
shoes
PPrroodduucctt
ddeevveellooppmmeenntt
CCuussttoommeerr nneeeeddss//wwaannttss ffeeeeddbbaacckk
NNiikk
Nike Supply Chain
Globalization of Nike
Nike Manufacturing Process
Rubber Materials Nike
Ethylene Vinyl Acetate Nike
Genuine Leather Nike
Synthetic Leather Nike
Cotton in Nike Apparel
Nike Shops Worldwide
Nike Manufacturing Countries
Cold Cement Assembly Nike
3D Printing Nike Shoes
Nike Product Development
Nike Marketing Strategies
Nike Customer Feedback
Nike Distribution Centers
Automation in Nike Manufacturing
Nike Consumer Direct Acceleration
Nike Logistics and Transport
Turin Startup Ecosystem 2024 - Ricerca sulle Startup e il Sistema dell'Innov...Quotidiano Piemontese
Turin Startup Ecosystem 2024
Una ricerca de il Club degli Investitori, in collaborazione con ToTeM Torino Tech Map e con il supporto della ESCP Business School e di Growth Capital
when will pi network coin be available on crypto exchange.DOT TECH
There is no set date for when Pi coins will enter the market.
However, the developers are working hard to get them released as soon as possible.
Once they are available, users will be able to exchange other cryptocurrencies for Pi coins on designated exchanges.
But for now the only way to sell your pi coins is through verified pi vendor.
Here is the telegram contact of my personal pi vendor
@Pi_vendor_247
Yes of course, you can easily start mining pi network coin today and sell to legit pi vendors in the United States.
Here the telegram contact of my personal vendor.
@Pi_vendor_247
#pi network #pi coins #legit #passive income
#US
What price will pi network be listed on exchangesDOT TECH
The rate at which pi will be listed is practically unknown. But due to speculations surrounding it the predicted rate is tends to be from 30$ — 50$.
So if you are interested in selling your pi network coins at a high rate tho. Or you can't wait till the mainnet launch in 2026. You can easily trade your pi coins with a merchant.
A merchant is someone who buys pi coins from miners and resell them to Investors looking forward to hold massive quantities till mainnet launch.
I will leave the telegram contact of my personal pi vendor to trade with.
@Pi_vendor_247
4. What is AI/Machine Learning?
・Cerebral Cortex:Unsupervised Learning
・Basal Ganglia:Reinforcement Learning
・Cerebellum:Supervised Learning
Machine Learning is one of the ways we expect to achieve AI.
- With the recent development of ML, ML is used in the same way as AI.
ML is an algorithm that imitates the function of the brain.
Learning means finding a model that represents the data well.
5. What is AI/Machine Learning?
𝑋:Feature
???Model:CNN
𝑌:Label
In case of image recognition, the goal is to identify the image
… Problem
Formulation
… Data
𝑓
In supervised learning, output is called label and input is called feature.
Supervised learning learn the relationship between inputs and outputs.
Cat
Deciding what model, features, and labels to use is called
problem formulation.
6. What is AI/Machine Learning?
Model:RNN
In case of translation, the goal is to translate English into Japanese
… Problem
Formulation
… Data
This is a pen.
???
This is a pen in Japanese.
𝑓
In supervised learning, output is called label and input is called feature.
Supervised learning learn the relationship between inputs and outputs.
𝑋:Feature 𝑌:Label
これはペンです
Deciding what model, features, and labels to use is called
problem formulation.
7. What is AI/Machine Learning?
Model:ML
In case of finance, the goal is to predict future stock prices.
… Problem
Formulation
… Data
???
+10%𝑓
In supervised learning, output is called label and input is called feature.
Supervised learning learn the relationship between inputs and outputs.
𝑋:Feature 𝑌:Label
Deciding what model, features, and labels to use is called
problem formulation.
8. A. Firstly, we need to decide what data we use and set up an
appropriate problem formulation.
Problem Formulation
• Time Series
• Cross Section
Data
• Technical
• Fundamental
• Alternative
AI/Machine Learning
How to predict stock prices w/ AI and ML?
9. Problem Formulation
クロスセクションで予測
Stock A Stock B ・・・
Today
Yesterday
・
・
・
・
Buystockswithabsolutely
highreturns. Buy stocks with relatively high returns.
Investment Universe
TimeSeries
Cross SectionOrthogonal
10. Problem Formulation:Time Series
Past (known) data at time t
Predict a stock future price/return as time series data.
・ 𝑟𝑡+1: (Absolute) return at time t+1 (Label)
・ 𝑟𝑡+1 = 𝑓(𝑟𝑡, … , 𝑥 𝑡, … )
・ 𝑟𝑡, 𝑥 𝑡: Past returns and external variables (Feature)
Short term forecast (shorter than monthly) → trading strategy
Problem Formulation: Learn the following model 𝑓
11. Problem Formulation:Cross Section
Predict multiple stocks that are relatively better in a universe.
・ 𝑠𝑐𝑜𝑟𝑒𝑖: (Relative) return of each stock in the universe (Label)
・ 𝑠𝑐𝑜𝑟𝑒𝑖 = 𝑓(𝑿𝒊)
・ 𝑿𝒊: Criteria for describing the score: Factor (Feature)
Long term forecast (longer than monthly) → portfolio construction
Time information is omitted
Problem Formulation: Learn the following model 𝑓
12. Investment
Horizon
Minutes Hour Day Week Month Year
Problem
Formulation
Time Series
Cross Section
Data Technical
Fundamental
Alternative Micro(Corporate)→Macro
News、PoS、etc,…
AI/Machine Learning
Problem Formulation:Summary
14. Time Series × Technical × Supervised
“Stock Price Prediction with Fluctuation Patterns Using Indexing
Dynamic Time Warping and k∗-Nearest Neighbors”.
Nakagawa et.al. (2018)
New Frontiers in Artificial Intelligence
https://doi.org/10.1007/978-3-319-93794-6_7
15. Motivation
Following nearest trend Following similar trend
Momentum Patterns
Predict the future price using the past price fluctuation patterns.
Refine the “formation analysis” by machine learning.
Nakagawa et.al. (2018) “Stock Price Prediction with Fluctuation Patterns Using
Indexing Dynamic Time Warping and k∗-Nearest Neighbors”.
New Frontiers in Artificial Intelligence
https://doi.org/10.1007/978-3-319-93794-6_7
16. Method - Outline
Current pattern
Umm...
Similar!!
1. Measure 2. Predict
・How to measure the similarity?
Indexing Dynamic Time Warping(IDTW)
・How to predict with extracted fluctuations?
k*-Nearest Neighbors (k*-NN)
We propose a method to predict future stock prices based on
the shape of the past price fluctuations similar to the current.
17. frequencymeasuring period
1. Decide the measuring period and frequency.
2. Consider the difference of price levels.
Near?
Distance
3. Define the distance between time-series.
✔ Daily closing prices within a month.
✔ Dynamic Time Warping (DTW) method
How to measure the similarity between stock price movements?
✔ We index prices and adjust price levels.
Method - IDTW
18. ●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
Correspondence
With Euclidean Distance
1
n
n
Sequences are aligned “one to one”.
●
●
● ● ●
●
●
●
●
●
● ●
●
●
●
●
●
● ●
Correspondence
With DTW
1
m
n
Nonlinear alignments are possible.
Fixed Time Axis “Warped” Time Axis
DTW is a method that warps the time axis to find the optimum
correspondence and calculates the similarity between two-time series.
What is DTW??
19. Why do we use DTW??
・ DTW can be applied to a pair of time-series of different lengths.
・ Similarity of DTW fits human intuition.
・ DTW is a standard method to measure the similarity among
various time series data (UCR Datasets)
What is DTW??
https://cran.r-project.org/web/packages/dtw/index.html
R language package dtw
20. Method - Outline
Current pattern
Umm...
Similar!!
1. Measure 2. Predict
・How to measure the similarity?
Indexing Dynamic Time Warping(IDTW)
・How to predict with extracted fluctuations?
k*-Nearest Neighbors (k*-NN)
We propose a method to predict future stock prices based on
past price fluctuations similar to the current.
21. Method - k-NN
✔ 𝑘-NN algorithm is one of the most fundamental non-parametric methods.
Similarity
𝑘 = 1 𝑘 = 3
Prediction:
+2.5%
Prediction:
(𝛼12.5 + 𝛼21.1 − 𝛼35.1)%
Return on
Next month
(Labels)
…
+2.5%
𝑦1
+1.1%
𝑦2
−5.1%
𝑦3
−3.2%
𝑦4
0.15𝛽1 0.24𝛽2 0.41𝛽3 0.63𝛽4 𝛽𝑖 = 𝐼𝐷𝑇𝑊 𝑥0, 𝑥𝑖
Fluctuation
pattern
(Features)
𝑥0
𝑥1 𝑥2 𝑥3 𝑥4
?%
✔ Prediction is the average of the values of 𝑘 nearest neighbors.
22. Method - k*-NN
✔ 𝑘-NN algorithm is one of the most fundamental non-parametric methods.
𝑘 = 1 𝑘 = 3
Prediction:
+2.5%
Prediction:
(𝛼12.5 + 𝛼21.1 − 𝛼35.1)%
…
+2.5%
𝑦1
+1.1%
𝑦2
−5.1%
𝑦3
−3.2%
𝑦4
?%
✔ Prediction is the average of the values of 𝑘 nearest neighbors.
1. How many k should we use to predict?
2. How to decide the optimal weight alpha?
𝑘∗
-NN is a simple approach for calculating optimal 𝒌 and weight 𝜶.
https://cran.r-project.org/web/packages/ksNN/index.html
R language package ksNN
23. Empirical Study
✔ To demonstrate the advantages of IDTW+k*NN,
we analyze its performance using following major world stock indices.
✔ In-sample period : January 1989 ~ December 2005.
Out-of-sample period : January 2006 ~ August 2017.
✔ Compared six methods combining DTW, DDTW, or IDTW with k-NN or k*-NN.
Ticker Name Country Constituents
TPX TOPIX Japan Companies listed on the First Section of the Tokyo Stock Exchange
SPX S&P500 U.S. 500 large companies listed on the NYSE or NASDAQ
CAC CAC40 France 40 companies listed on the Euronext Paris
DAX DAX German 30 companies trading on the Frankfurt Stock Exchange
UKX FTSE100 U.K. 100 companies listed on the London Stock Exchange
24. Empirical Study
Step 1. Calculate the similarity between month 𝑡 and all months from 1 to 𝑡 − 1
with DTW, DDTW, or IDTW.
𝑡
1 𝑡 − 1
Calculate each similarity
t-3 t-2 t-1 t21
DTW
DDTW
or
IDTW
or
25. Empirical Study
Step 2. Predict the next month's return using k-NN or k*-NN.
6
120
t-2
27
11
Sorting
by distance
+1.5%
0.02
0.03
0.04
7.99
8.82
+1.9%
-0.5%
+3.0%
-2.7%
𝛽
𝑦Next month’s
returnsdistances
k-NN
k*-NN
Predict
or
+𝟐. 𝟔%
26. Empirical Study
Step 3. If the predicted next month's return is positive, buy one unit of Index at
the end of the month. If negative, sell one unit and calculate the return.
Step 4. Proceed to the next month: 𝑡 = 𝑡 + 1.
𝑡
t t+1
𝑡 + 1
Index Price
(Month End)
100 105
Predict
+𝟐. 𝟔%
+𝟓. 𝟎%Actual
27. Results
✔ Cumulative returns for the six methods in TOPIX.
The out-of-sample period is from January 2006 to August 2017.
Red Line:Our method
(IDTW+k*NN)
Blue Line:
Buy & Hold Index
29. Results
Table 1. The average accuracy of all years for each method.
Table 2. The total return of each method.
30. Results
Table 3. Comparison with Momentum Strategy(Return)
Table 4. Correlation with Momentum Strategy
Here, Prediction in Step 2 is replaced by simple 1Mom: 1 month return and
12 Mom: 12 month return excluding the last month (12 Mom).
31. Summary
✔ We proposed k*-NN with IDTW, a highly accurate stock prediction
method. This method is inspired by formation analysis.
✔ An empirical analysis was conducted with major world indices and
confirmed the following results:
・ IDTW is superior to DTW and DDTW
・ k*-NN is superior to k-NN.
・ IDTW-k*NN is the best prediction method.
32. Time Series × Technical × Unsupervised
“Stock price prediction using k‐medoids clustering with
indexing dynamic time warping”.
Nakagawa et.al. (2019)
Electronics and Communications in Japan, 102(2), 3-8.
https://doi.org/10.1002/ecj.12140
33. Motivation
Price movements
that are likely to rise?
✔ Can we visualize good patterns for forecasting by clustering price
fluctuation patterns?
Price movements
that are likely to fall?
✔ So how should we cluster the price fluctuation patterns?
Nakagawa et.al. (2019). “Stock price prediction using k‐medoids clustering with
indexing dynamic time warping”.
Electronics and Communications in Japan, 102(2), 3-8.
34. ・ Always looks for medoids in the data (Needs only similarity).
✔ Since the similarity was given by IDTW, we should decide the clustering
method.
k-means method
k-medoids method
How to Cluster Price Fluctuation?
・ Most well-used clustering method
・ Clustering data based on centroid
・ Easy to extent (x-means, k-means++)
・ Clustering data based on medoids
(Data point closest to any data point in the cluster)
37. 2007/1
2006/12
Step3: Update medoids
✔ recalculate and update
medoids in each cluster
2004/5
Step2: ClusteringStep1: Initialize medoids
(𝑘: number of medoids) Repeat until the clusters
do not change
Clustering with k-Medoids
38. Empirical Study
✔ Using the TOPIX, we cluster similar price fluctuation patterns and visualize
good patterns for forecasting.
✔ In-sample period : January 1989 ~ December 2005.
Out-of-sample period : January 2007 ~ March 2017.
✔ IDTW with k-medoids is compared to DTW with k-medoids
✔ The number of k are from 2 to 12.
To decide k, we evaluate profitability of the following virtual trading.
39. k-medoids is performed based on
monthly daily price fluctuations up to
the previous month.
Identify the cluster for
the current month.
If there is more positive
(negative) in the cluster,
buy (sell).
Step 2:
Identify the cluster
Step 3:
Decide buy or sell
Step1:
Clustering
Empirical Study
40. ✔ From the viewpoint of profitability, the number of k is 𝟓
Table 5. Profitability and accuracy
by number of clusters
Results
41. ✔ Visualization of 5 clusters (black line: Medoids, x axis: days, y axis: return)
Reversal Momentum Momentum
Results
*The percentage in parentheses indicates the probability that stock price rises in the next month.
*
43. ✔ Using k-Medoids + IDTW, we found that the number of clusters in
the price fluctuation pattern was about 5 in TOPIX.
✔ Consider price fluctuation pattern clustering method (k-Medoids +
IDTW). This is unsupervised approach.
Summary
✔ Large fluctuations are tend to continue.
44. Time Series × Technical × Supervised
“Time-Series Gradient Boosting Tree for Stock Price Prediction”.
Nakagawa and Yoshida (2020)
International Journal of Data Mining, Modelling and Management
(IJDMMM)
To be appear
Fundamental
45. Motivation
✔ We improve a time series decision tree (TSDT) that uses only time
series data.
Nakagawa and Yoshida (2020)
“Time Series Gradient Booting Tree for Stock Price Prediction”.
International Journal of Data Mining, Modelling and Management (To be appear)
(Time series) decision trees
classify the (time series) data
by sorting them down the tree
and leaves provides the labels.
Decision tree is
easy to understand!
Current
46. Motivation
We propose a time-series gradient boosting tree that uses a TSDT
that adds features other than time-series.
(1) Add usual features other
than time series to TSDT
(2) “Boost” this TSDT with
gradient boosting technique
47. Method – Time Series Gradient Boosting Tree
✔ Boosting is one of the methods called ensemble learning.
✔ Gradient boosting tree is most used in various data analysis contests (Kaggle)
and has the best results.
strong model
weak model
Ensemble learning is a method of building a strong (accurate) model by
using multiple models called weak learners.
48. Method – Time Series Gradient Boosting Tree
Fit a new tree to the error
Prettenhofer and Louppe(2014)
Illustration of gradient boosting tree
49. Empirical Study
✔ To demonstrate the advantages of time series gradient boosting tree,
we analyze its performance using following major world stock indices.
✔ In-sample period : 10 years.
Out-of-sample period : 10 years later until 2018/6
Ticker Name Country Data period
TPX TOPIX Japan 1993/8~2018/6
SPX S&P500 U.S. 1990/1~2018/6
CAC CAC40 France 2001/6~2018/6
DAX DAX German 2001/7~2018/6
UKX FTSE100 U.K. 2000/1~2018/6
50. Empirical Study
✔ Feature:
Other Data :As of the end of month in each index, we use
✔ Model: Compared four methods combining Time Series Decision Tree
(TSDT) and Time Series Gradient Boosting Tree (TSGBT) with and
without index data.
Data Description
PER Net Profit/Market Value
PBR Net Asset/Market Value
MOM Stock returns over the past 12 months except for past month
DIV Dividends paid in a year / Market Value
ROE Net income/Net Assets
Time Series Data:Daily price data within a month
✔ Teacher:1 month ahead return
51. Results
Table 6. The average accuracy of all years for each method.
Table 7. The total return for each method.
Improvement
by adding features
Improvement
by boosting
52. Summary
✔ We build a time series decision tree (TSDT) by introducing features
other than time series data.
・Adding features other than time series data improves accuracy and profitability.
・Gradient boosting also improves accuracy and profitability
✔ Furthermore, we propose a time series gradient boosting tree
(TSGBT) that “boosts” time-series decision tree.
✔ An empirical analysis was conducted with major
world indices and confirmed the following results:
53. 1. Introduction
2. Research Examples
- Time series prediction
- Cross sectional prediction
3. Conclusion
Contents
54. Cross Section × Technical × Supervised
“Deep Factor Model”.
Nakagawa et.al. (2018) 3rd Workshop on MIDAS@ECML-PKDD 2018
([preprint] arXiv:1810.01278)
“Cross-sectional Stock Price Prediction using Deep Learning for
Actual Investment Management”
Abe and Nakagawa (2020) AIBC2020
([preprint] :arXiv:2002.06975)
“Deep Learning for Multi-factor Models in Regional and Global Stock
Markets”
Abe and Nakagawa (2020) New Frontier in AI
https://link.springer.com/chapter/10.1007/978-3-030-58790-1_6
Fundamental
55. Cross Sectional Prediction
✔ Over 300 factors until 2012 [Harvey et al. 2017]
✔ In practice, we predict future relative stock returns (scores) by
combining various factors
✔ Criteria for describing the score is called Factor in Finance
For example, Market Value is one of the most famous factors, and this factor combined with
simple sorting portfolio yields a positive return.
Cross-sectional Investment Strategy
✔ finds stocks with relatively high scores based on certain criteria.
56. ROE
1 Month Return
●
●
●
Score
Value
Growth
Quality
Momentum
・Linear Relationship
Factor
Candidates
Factor Classification
by human
Calculate
Relative Score
Cross-sectional Investment Strategy
Both buying and selling are
evaluated equally.
Factors are evaluated monotonically.
57. ROE
1 Month Return
●
●
●
Score
Value
Growth
Quality
Momentum
・Linear Regression
Factor
Candidates
Factor Classification
by human
Calculate
Relative Score
Cross-sectional Investment Strategy
Both buying and selling are
evaluated equally.
Factors are evaluated monotonically.
I would like to capture the non-
linear relationship of factors from a
large number of candidates by
deep learning.
59. 「Deep Factor Model」
Kei Nakagawa, Takumi Uchida and Tomohisa Aoshima,
3rd Workshop on MIDAS@ECML-PKDD 2018 ([preprint] arXiv:1810.01278)
✔ We check the profitability of cross-sectional investment strategy
with deep learning
✔ We predict relative return using 80 factors and invest monthly.
✔ Investment Universe:
TOPIX index constituents (from Apr 2006 to Mar 2016 monthly)
Empirical Study No.1
60. ✔ We use the 16 factors that are used relatively often in practice.
Empirical Study No.1
Factor Candidates Formula
Risk
60VOL Standard deviation of stock returns in the past 60 months
BETA Regression coefficient of stock returns and market risk premium
SKEW Skewness of stock returns in the past 60 months
Quality
ROE Net income/Net Assets
ROA Operating Profit/Total Assets
ACCRUALS Operating Cashflow ‐ Operating Profit
LEVERAGE Total Liabilities / Total Assets
Momentum
12-1MOM Stock returns in the past 12 months except for past month
1MOM Stock returns in the past month
60MOM Stock returns in the past 60 months
Value
PSR Sales/Market Value
PER Net Income / Market Value
PBR Net Assets / Market Value
PCFR Operating Cashflow / Market Value
Size
CAP log(Market Value)
ILLIQ average(Stock Returns/Trading Volume)
61. ✔ Use 16 x 5 = 80 features for the last 5 points each quarter of 16 factors.
Empirical Study No.1
✔ We Learn the model using the training data for 120 sets.
✔ Training data (1 set)
・Feature: 80 features of 16 factors
・Teacher: One month ahead relative returns
62. Model Description
Deep Learning(Deep 1) We use full-connected network. The hidden layers are (80-50-
10),(80-80-50-50-10-10). We use the ReLU as the activation
function and Adam for the optimization algorithm.Deep Learning(Deep 2)
Linear Model
(Linear Regression;LR)
“sklearn.linear_model.LinearRegression”
Support Vector
Regression(SVR)
“sklearn.svm.SVR”
Random Forest(RF) “sklearn.ensemble.RandomForestRegressor”
✔ We use linear and nonlinear models as benchmarks.
✔ We make a long/short portfolio strategy for a net-zero investment to buy
top stocks and to sell bottom stocks with equal weighting in quintile portfolios.
Empirical Study No.1
✔ For the quintile portfolio performance, we calculate the
annualized average return, and Sharpe ratio.
64. Results No.1
✔ This graph shows the Sharpe Ratio of each methods
0.00
0.20
0.40
0.60
0.80
1.00
1.20
1.40
1.60
Deep 1 Deep 2 LR SVR RF
ShapeRatio
The results of deep learning are better
than other nonlinear models.
The linear model is not bad either.
Linear model is good if a poor
nonlinear model is used.
65. Empirical Study No.2
✔ We check the profitability of cross-sectional investment
strategy with deep learning
✔ We predict relative return using 33 factors and invest daily basis.
✔ Investment Universe:
Cross-sectional Stock Price Prediction using Deep Learning for Actual Investment Management
Masaya Abe and Kei Nakagawa,
AIBC2020 ([preprint] arXiv: 2002.06975)
TOPIX500 index constituents (from Apr 2013 to Mar 2017 daily)
66. Empirical Study No.2
✔ We use the 33 factors that are used relatively often in practice.
No. Factor
1 Return from previous day
2 Return from 2 days ago
3 Return from 3 days ago
4 Return from 5 days ago
5 Return from 10 days ago
6 Return from 20 days ago
7 Return from 40 days ago
8 Return from 60 days ago
9 Average trading volume over the past 60 days
10 Average trading volume over the past 5 days/60 days
11 Average trading volume over the past 10 days/60 days
12 Average trading volume over the past 20 days/60 days
13 Change in operating income forecast from 5 days ago
14 Change in operating income forecast from 10 days ago
15 Change in operating income forecast from 20 days ago
16 Change in target stock price forecast from 5 days ago
17 Change in target stock price forecast from 10 days ago
No. Factor
18 Change in target stock price forecast from 20 days ago
19 Book-value to Price Ratio
20 Earnings to Price Ratio
21 Dividend Yield
22 Sales to Price Ratio
23 Cashflow to Price Ratio
24 Return on Equity
25 Return on Asset
26 Return on Invested Capital
27 Accruals
28 Total Asset Turnover Rate
29 Current Ratio
30 Equity Ratio
31 Total Asset Growth Rate
32 Capital Expenditure Growth Rate
33 Investment to Asset
- -
67. ✔ We use linear and nonlinear models as benchmarks.
Empirical Study No.2
Model Hidden Layer (Dropout) Epoch
DNN_1
500 – 200 – 100 – 50 – 10
(50% – 40% – 30% – 20% – 10%)
20
DNN_2
500 – 200 – 100 – 50 – 10
(50% – 40% – 30% – 20% – 10%)
30
DNN_3
200 – 200 – 100 – 100 – 50
(50% – 50% – 30% –30% – 10%)
20
DNN_4
200 – 200 – 100 – 100 – 50
(50% – 50% – 30% – 30% – 10%)
30
DNN_5
300 – 300 – 150 – 150 – 50
(50% – 50% – 30% – 30% – 10%)
20
DNN_6
300 – 300 – 150 – 150 – 50
(50% – 50% – 30% – 30% – 10%)
30
・ Deep Learning ・ Random Forest (RF)
Model # of feature # of tree depth
RF_1 11 1000 3
RF_2 11 1000 5
RF_3 11 1000 7
model regularization
RR_1 0.1
RR_2 1.0
RR_3 10.0
・ Linear Model (Ridge Regression;RR)
✔ We make a quintile long/short portfolio and
calculate the annualized average return, and Sharpe ratio.
68. For the epoch number, 20 is good.
In this study, 30 is overfitting.
Empirical Study No.2
✔ This graph shows the (annualized) returns of each methods
0.00%
2.00%
4.00%
6.00%
8.00%
10.00%
12.00%
Return
For the level of return,
the linear models(RR) are very good.
69. Empirical Study No.2
✔ This graph shows the Sharpe Ratio of each methods
0
0.2
0.4
0.6
0.8
1
1.2
1.4
1.6
1.8
2
ShapeRatio
For the epoch number, 20 is good.
In this study, 30 is overfitting.
For the Shape ratio,
DNNs are the very good.
70. Empirical Study No.3
✔ We check the profitability of cross-sectional investment
strategy with deep learning
✔ We predict relative return using 20 factors and invest monthly.
✔ Investment Universe:
Deep Learning for Multi-factor Models in Regional and Global Stock Markets
Masaya Abe and Kei Nakagawa,
New Frontiers in Artificial Intelligence (To be appear)
World : MSCI WORLD
North America: MSCI North Americas
Europe: MSCI Europe & Middle East
Asia Pacific: MSCI Pacific
(from 2005 to 2017 monthly)
71. Empirical Study No.3
✔ We use the 20 factors that are used relatively often in practice.
No. Factor No. Factor No. Factor
1 Book-to-market ratio 8 Return on invested capital 15 EPS Revision(1 month)
2 Earnings-to-price ratio 9 Accruals 16 EPS Revision(3 months)
3 Dividend yield 10 Total Asset Growth Rate 17 Past stock return(1 month)
4 Sales-to-price ratio 11 Current ratio 18 Past stock return(12 months)
5 Cash flow-to-price ratio 12 Equity ratio 19 Volatility
6 Return on equity 13 Total Asset Turnover Rate 20 Skewness
7 Return on asset 14 CAPEX Growth Rate
72. ✔ We use linear and nonlinear models as benchmarks.
Empirical Study No.3
Model Description
Deep Learning
We use full-connected network. The hidden layers are (150-
150-100-100-50-50). Dropout rate: (50% – 50% – 30% –
30% – 10% – 10%).We use the ReLU as the activation function
and Adam for the optimization algorithm.
Linear Model
(Ridge Regression)
“sklearn.linear_model.Ridge”
Support Vector Regression “sklearn.svm.SVR”
Random Forest “sklearn.ensemble.RandomForestRegressor”
Gradient Boosting Tree “XGBRegressor”
✔ We make a quintile long/short portfolio and
calculate the annualized average return, and Sharpe ratio.
73. Results No.3
✔ This graph shows the (annualized) returns of each methods
0.00
1.00
2.00
3.00
4.00
5.00
6.00
7.00
8.00
9.00
DNN
RR
XGB
RF
DNN
RR
XGB
RF
DNN
RR
XGB
RF
DNN
RR
XGB
RF
NA EU PF WLD
Return
✓ DNN generally outperforms other
methods in all regions.
✓ The performance decreases in the
order of PF, EU, NA regardless of
the prediction model.
74. Results No.3
✔ This graph shows the Sharpe Ratio of each methods
0.00
0.20
0.40
0.60
0.80
1.00
1.20
1.40 DNN
RR
XGB
RF
DNN
RR
XGB
RF
DNN
RR
XGB
RF
DNN
RR
XGB
RF
NA EU PF WLD
ShapeRatio
✓ The highest Shape ratios are given
by DNN in all regions.
✓ The performance decreases in the
order of PF, EU, NA regardless of
the prediction model.
75. Summary
DNN is generally superior to GB, RF, and RR in Japan and global markets
→ DNN shows promise as a skillful machine learning method in the cross section.
The performance deteriorates in the order of Pacific,
Europe, North America regardless of the prediction model.
→ The higher the market efficiency, the more difficult to earn returns
77. Problem Formulation
• Time Series
• Cross Section
Data
• Technical
• Fundamental
• Alternative
AI/Machine Learning
Conclusion
If you have any question or comment, please feel free to Email me
kei.nak.0315@gmail.com
78. Disclaimer
The Information in this presentation is provided for education and informational purposes only, without any
express or implied warranty of any kind, including warranties of accuracy, completeness, or fitness for any
particular purpose. The Information contained in or provided from or through this material is not intended to be
and does not constitute financial advice, investment advice, trading advice or any other advice. The Information
in this material is general in nature and is not specific to you the viewer or anyone else.
If this document contains any information relating to the past performance of any securities or other investment
strategies and products, you should note that past performance is not indicative of future results. Any market or
investment view mentioned in this document is not intended to be investment research. Information contained in
this document is subject to change without any obligation on our part to notify you of any change. If this
document forms part of a presentation or is presented to you together with other documents and materials, this
document should not be read in isolation and may not provide a full explanation of all of the topics presented and
discussed. You should note that investment involves risk. The price of any securities or other investment product
may go down as well as up. The information does not accept any liability or legal responsibility whatsoever for
any decisions taken based upon the material.