SlideShare a Scribd company logo
Selec%on of Related Stocks
using Financial Text Mining
Masanori HIRANO*1, Hiroki SAKAJI*2, Shoko KIMURA*3,
Kiyoshi IZUMI*2, Hiroyasu MATSUSHIMA*2,
Shintaro NAGAO*3, and Atsuo KATO*4
*1 Faculty of Engineering, The University of Tokyo
*2 School of Engineering, The University of Tokyo
*3 QuanKtaKve Investment Department,
Daiwa Asset Management Co. Ltd.
*4 FronKer Technologies Research & ConsulKng Dept.,
Daiwa InsKtute of Research Ltd.
hirano@g.ecc.u-tokyo.ac.jp
hWps://mhirano.jp/
Background
• Pick up some stocks &
build up “themed mutual fund”
• Private investors invest in it
• Lots of stocks
• Tokyo: about 3,600!
• Other stock exchanges
all over the world
• Quite difficult to cover all theme
• Stock abroad
• Not familiar theme
• => Support human!
Robot & Technology related fund
--Robotech--
11/17/18
ICDM2018ws: 1st CDEC
© Izumi Lab. & M.HIRANO
2
Demo: Fund Building Support System
11/17/18
ICDM2018ws: 1st CDEC
© Izumi Lab. & M.HIRANO
3
Robot
Similarity/Ticker/Name Similarity
Reasoning Sentences in Japanese
Similarity/Ticker/Name Word/Similarity/File
Robo (abbr.)
Mini Robot
Usual Case
11/17/18
ICDM2018ws: 1st CDEC
© Izumi Lab. & M.HIRANO
4
Input:
Theme
“AI”
Companies’ Informa4on
IRs
Companies’ HPs
Web Search
Pick up Stocks
Outputs:
Related stocks
Our Method Outline
11/17/18
ICDM2018ws: 1st CDEC
© Izumi Lab. & M.HIRANO
5
Input:
Theme
“AI”
Related Words
“expert system”
“machine learning”
“deep learning”
“natural language processing”
Companies’ Informa>on
IRs
Companies’ Official Websites
Word2vec
Word
matching
Extracted & Reasoning
Sentences
“We automa>cally respond to a
two-way interac>ve AI FAQ
system using natural language
processing.”*
Sum up similarity
Outputs:
Related stocks
+ Similarity to the theme + Reasoning Sentences
*hPps://www.soQbank.jp/corp/group/sbm/news/press/2018/20180301_01/
Crawling &
Preprocessing
Feedback
Our Method Outline
11/17/18
ICDM2018ws: 1st CDEC
© Izumi Lab. & M.HIRANO
6
Input:
Theme
“AI”
Related Words
“expert system”
“machine learning”
“deep learning”
“natural language processing”
Companies’ Informa>on
IRs
Companies’ Official Websites
Word2vec
Word
matching
Extracted & Reasoning
Sentences
“We automa>cally respond to a
two-way interac>ve AI FAQ
system using natural language
processing.”*
Sum up similarity
Outputs:
Related stocks
+ Similarity to the theme + Reasoning Sentences
*hPps://www.soQbank.jp/corp/group/sbm/news/press/2018/20180301_01/
Crawling &
Preprocessing
Feedback
Data
• Data for similarity (word2vec)
1,809,736,365 words w/ 1,147,973 vocabularies
• livedoor news corpus
• Wikipedia Japanese ar3cles (version 21-Jun-2018 22:09)
• Nikkei newspaper ar3cles(1990–2015 and 2017; data
from 2016 were omiHed for technical reasons)
• Companies’ informaMon
• Investors Rela3ons(2012/10/9-2018/5/11; 90,813 files)
• Official Websites(only 2,293,460 pages in all companies)
=> total: 2TB
• We use 5 kinds of data!
11/17/18
ICDM2018ws: 1st CDEC
© Izumi Lab. & M.HIRANO
7
Our Method Outline
11/17/18
ICDM2018ws: 1st CDEC
© Izumi Lab. & M.HIRANO
8
Input:
Theme
“AI”
Related Words
“expert system”
“machine learning”
“deep learning”
“natural language processing”
Companies’ Informa>on
IRs
Companies’ Official Websites
Word2vec
Word
matching
Extracted & Reasoning
Sentences
“We automa>cally respond to a
two-way interac>ve AI FAQ
system using natural language
processing.”*
Sum up similarity
Outputs:
Related stocks
+ Similarity to the theme + Reasoning Sentences
*hOps://www.soPbank.jp/corp/group/sbm/news/press/2018/20180301_01/
Crawling &
Preprocessing
Feedback
Feedback using Cooccurrence
11/17/18
ICDM2018ws: 1st CDEC
© Izumi Lab. & M.HIRANO
9
The master set for the word “AI”
Test set for the word1 “NLP”
Test set for the word2 “machine learning”
…
TYO: 8207 TYO: 1945
TYO: 2154
TYO: 3861
TYO: 2397
TYO: 3045 TYO: 7816
TYO: 1931 TYO: 9984
TYO: 1925
TYO: 4845
TYO: 1400 TYO: 4288
Feedback using Cooccurrence
11/17/18
ICDM2018ws: 1st CDEC
© Izumi Lab. & M.HIRANO
10
The master set for the word “AI”
Test set for the word1 “NLP”
Test set for the word2 “machine learning”
…
TYO: 8207 TYO: 1945
TYO: 2154
TYO: 3861
TYO: 2397
TYO: 3045 TYO: 7816
TYO: 1931 TYO: 9984
TYO: 1925
TYO: 4845
TYO: 1400 TYO: 4288
5 companies
Feedback using Cooccurrence
11/17/18
ICDM2018ws: 1st CDEC
© Izumi Lab. & M.HIRANO
11
The master set for the word “AI”
Test set for the word1 “NLP”
Test set for the word2 “machine learning”
…
TYO: 8207 TYO: 1945
TYO: 2154
TYO: 3861
TYO: 2397
TYO: 3045 TYO: 7816
TYO: 1931 TYO: 9984
TYO: 1925
TYO: 4845
TYO: 1400 TYO: 4288
5 companies
Feedback using Cooccurrence
11/17/18
ICDM2018ws: 1st CDEC
© Izumi Lab. & M.HIRANO
12
The master set for the word “AI”
Test set for the word1 “NLP”
Test set for the word2 “machine learning”
…
TYO: 8207 TYO: 1945
TYO: 2154
TYO: 3861
TYO: 2397
TYO: 3045 TYO: 7816
TYO: 1931 TYO: 9984
TYO: 1925
TYO: 4845
TYO: 1400 TYO: 4288
5 companies
4 companies
=0.8
Similarity btw “AI” & “NLP”
Feedback using Cooccurrence
11/17/18
ICDM2018ws: 1st CDEC
© Izumi Lab. & M.HIRANO
13
The master set for the word “AI”
Test set for the word1 “NLP”
Test set for the word2 “machine learning”
…
TYO: 8207 TYO: 1945
TYO: 2154
TYO: 3861
TYO: 2397
TYO: 3045 TYO: 7816
TYO: 1931 TYO: 9984
TYO: 1925
TYO: 4845
TYO: 1400 TYO: 4288
6 companies
4 companies
=0.66
How to calculate similarity
• A: Similari*es using Word2vec:
• Using 9 models with each sets of hyper parameters
• Only using top-100 similar words in each model
=> harmonic average
• B: Similari*es from feedback using cooccurrence
• Take harmonic average
2"#
" + #
• This is final similarity of each words
11/17/18
ICDM2018ws: 1st CDEC
© Izumi Lab. & M.HIRANO
14
Calculate COMPANY similarity
• Count the number of appearance of each top-10
similar word & original word
• Sum the count weighed by similari9es
=> company similarity
• Original theme word’s weight: 1
11/17/18
ICDM2018ws: 1st CDEC
© Izumi Lab. & M.HIRANO
15
Results
11/17/18
ICDM2018ws: 1st CDEC
© Izumi Lab. & M.HIRANO
16
Similar words to “ ”(health)
11/17/18
ICDM2018ws: 1st CDEC
© Izumi Lab. & M.HIRANO
17
Rank Word word2vec feedback final
1 (getting healthy) 0.6696 0.9921 0.7996
2 (lifestyle diseases) 0.5968 0.9879 0.7441
3 (diet) 0.5724 0.9770 0.7219
4 (nutrition) 0.5692 0.9739 0.7185
5 (antiaging) 0.5433 1.0000 0.7040
6 (prevention medicine) 0.5421 1.0000 0.7031
7 (healthy) 0.5402 0.9774 0.6959
8 (condition of health) 0.5347 0.9803 0.6919
9 (diet) 0.5333 0.9745 0.6893
10 (lifestyle habit) 0.5233 0.9835 0.6831
Top 11-20 (Not used) similar words
11/17/18
ICDM2018ws: 1st CDEC
© Izumi Lab. & M.HIRANO
18
Rank Word word2vec feedback final
11 (beauty) 0.5259 0.9507 0.6772
12 (mental health) 0.5154 0.9809 0.6758
13 (antifat) 0.5009 1.0000 0.6674
14 (prevention) 0.5243 0.8888 0.6595
15 (dietary education) 0.4914 0.9742 0.6532
16 (care prevention) 0.4929 0.9532 0.6498
17 (health management method) 0.4796 1.0000 0.6483
18 (metabolic syndrome (abbr.)) 0.4810 0.9897 0.6474
19 (diet therapy) 0.4774 1.0000 0.6462
20 (welfare) 0.4973 0.9203 0.6457
Results (e-Sports)
11/17/18
ICDM2018ws: 1st CDEC
© Izumi Lab. & M.HIRANO
19
Rank Similarity Ticker Name
1 604.5824 3922 PR TIMES Inc
2 387.4172 6552 GameWith Inc
3 82.69871 9697 CAPCOM CO., LTD.
4 38.20549 9468 Kadokawa Dwango Corp
5 20.53668 9766 Konami Holdings Corp
6 19.26718 3326 Runsystem Co Ltd
7 15.15767 3904 Kayac Inc
8 14.44024 2378 Renaissance Inc
9 13.2471 7860 Avex Inc
10 9.734876 3135 MarketEnterprise Co Ltd
Evaluation (new data to this presentation, paper doesn’t contain)
• 4 experienced fund managers provided answer data
based on their knowledge
• With in only 100 stocks from TOPIX 500 (At random)
• 1 or more fund managers mark as related stocks
=> tagged as “Related”
• No fund manager mark as related
=> tagged as “Not related”
11/17/18
ICDM2018ws: 1st CDEC
© Izumi Lab. & M.HIRANO
20
FM1 FM2 FM3 FM4 Answer data
AAA Inc. ✔ ✔ ✔ ✔ Related
BBB Corp ---
CCC LTD ✔ Related
Evaluation (new data to this presentation, paper doesn’t contain)
11/17/18
ICDM2018ws: 1st CDEC
© Izumi Lab. & M.HIRANO
21
0.6602
0.8743
0.5565
0.6609
0
0.1
0.2
0.3
0.4
0.5
0.6
0.7
0.8
0.9
1
(beauty) (child-care) (robot) (amusement)
precision
recall
f1
Conclusion & Contribution
• We built the system suppor0ng fund managers to
build up themed mutual fund
• We use some types & kinds of data
11/17/18
ICDM2018ws: 1st CDEC
© Izumi Lab. & M.HIRANO
22
Future work
• More RECALL!!!!
• Fix Crawling Problem <= Critical problem for recall
• Tune some hyper-parameters
• Check casus of the difference btw good & bad cases

More Related Content

Similar to 2018/11/17 ICDMW 2018: Selection of Related Stocks using Financial Text Mining

English Essay Steel. Online assignment writing service.
English Essay Steel. Online assignment writing service.English Essay Steel. Online assignment writing service.
English Essay Steel. Online assignment writing service.
Christine Jones
 
The Magic Formula Approach - A Case Study
The Magic Formula Approach - A Case StudyThe Magic Formula Approach - A Case Study
The Magic Formula Approach - A Case Study
BeyondProxy LLC
 
Workshop - Architecting Innovative Graph Applications- GraphSummit Milan
Workshop -  Architecting Innovative Graph Applications- GraphSummit MilanWorkshop -  Architecting Innovative Graph Applications- GraphSummit Milan
Workshop - Architecting Innovative Graph Applications- GraphSummit Milan
Neo4j
 
Start with why?: how innovators combine ideas to explore new business models
Start with why?:  how innovators combine ideas to explore new business modelsStart with why?:  how innovators combine ideas to explore new business models
Start with why?: how innovators combine ideas to explore new business models
Osaka University
 
20210325 jim spohrer future ai v11
20210325 jim spohrer future ai v1120210325 jim spohrer future ai v11
20210325 jim spohrer future ai v11
ISSIP
 
Social Convergence of Machine Learning in IIoT - Jeffrey Jensen
Social Convergence of Machine Learning in IIoT - Jeffrey JensenSocial Convergence of Machine Learning in IIoT - Jeffrey Jensen
Social Convergence of Machine Learning in IIoT - Jeffrey Jensen
WithTheBest
 
The toyota way Book Review
The toyota way Book ReviewThe toyota way Book Review
The toyota way Book ReviewRoshani Mehta
 
Dr Shigeo Kagami
Dr Shigeo KagamiDr Shigeo Kagami
Dr Shigeo Kagami
British Council
 
The Ultimate Things About IoT
The Ultimate Things About IoTThe Ultimate Things About IoT
The Ultimate Things About IoT
Dr. Mazlan Abbas
 
Interesting Thing about Internet of Things
Interesting Thing about Internet of ThingsInteresting Thing about Internet of Things
Interesting Thing about Internet of Things
Dr. Mazlan Abbas
 
ICIC 2013 Conference Proceedings Ricardo Eito Brun Uni Madrid
ICIC 2013 Conference Proceedings Ricardo Eito Brun Uni MadridICIC 2013 Conference Proceedings Ricardo Eito Brun Uni Madrid
ICIC 2013 Conference Proceedings Ricardo Eito Brun Uni Madrid
Dr. Haxel Consult
 
When AOI meets AI
When AOI meets AIWhen AOI meets AI
When AOI meets AI
CHENHuiMei
 
Trl jaist 20180304 v6
Trl jaist 20180304 v6Trl jaist 20180304 v6
Trl jaist 20180304 v6
ISSIP
 
Move out from your comfort zone!
Move out from your comfort zone!Move out from your comfort zone!
Move out from your comfort zone!
Osaka University
 
Ai4 good 20190419 v3
Ai4 good 20190419 v3Ai4 good 20190419 v3
Ai4 good 20190419 v3
ISSIP
 
[MindsLab] Company introduction_2019_summary
[MindsLab] Company introduction_2019_summary[MindsLab] Company introduction_2019_summary
[MindsLab] Company introduction_2019_summary
Taejoon Yoo
 
Future of ai 20190612 v1
Future of ai 20190612 v1Future of ai 20190612 v1
Future of ai 20190612 v1
ISSIP
 
Toyota
ToyotaToyota
Toyota
Jill Baldwin
 
Tracxn Research - Consumer Robotics Report, june 2017
Tracxn Research - Consumer Robotics Report, june 2017Tracxn Research - Consumer Robotics Report, june 2017
Tracxn Research - Consumer Robotics Report, june 2017
Tracxn
 

Similar to 2018/11/17 ICDMW 2018: Selection of Related Stocks using Financial Text Mining (20)

English Essay Steel. Online assignment writing service.
English Essay Steel. Online assignment writing service.English Essay Steel. Online assignment writing service.
English Essay Steel. Online assignment writing service.
 
The Magic Formula Approach - A Case Study
The Magic Formula Approach - A Case StudyThe Magic Formula Approach - A Case Study
The Magic Formula Approach - A Case Study
 
Data analysis final
Data analysis finalData analysis final
Data analysis final
 
Workshop - Architecting Innovative Graph Applications- GraphSummit Milan
Workshop -  Architecting Innovative Graph Applications- GraphSummit MilanWorkshop -  Architecting Innovative Graph Applications- GraphSummit Milan
Workshop - Architecting Innovative Graph Applications- GraphSummit Milan
 
Start with why?: how innovators combine ideas to explore new business models
Start with why?:  how innovators combine ideas to explore new business modelsStart with why?:  how innovators combine ideas to explore new business models
Start with why?: how innovators combine ideas to explore new business models
 
20210325 jim spohrer future ai v11
20210325 jim spohrer future ai v1120210325 jim spohrer future ai v11
20210325 jim spohrer future ai v11
 
Social Convergence of Machine Learning in IIoT - Jeffrey Jensen
Social Convergence of Machine Learning in IIoT - Jeffrey JensenSocial Convergence of Machine Learning in IIoT - Jeffrey Jensen
Social Convergence of Machine Learning in IIoT - Jeffrey Jensen
 
The toyota way Book Review
The toyota way Book ReviewThe toyota way Book Review
The toyota way Book Review
 
Dr Shigeo Kagami
Dr Shigeo KagamiDr Shigeo Kagami
Dr Shigeo Kagami
 
The Ultimate Things About IoT
The Ultimate Things About IoTThe Ultimate Things About IoT
The Ultimate Things About IoT
 
Interesting Thing about Internet of Things
Interesting Thing about Internet of ThingsInteresting Thing about Internet of Things
Interesting Thing about Internet of Things
 
ICIC 2013 Conference Proceedings Ricardo Eito Brun Uni Madrid
ICIC 2013 Conference Proceedings Ricardo Eito Brun Uni MadridICIC 2013 Conference Proceedings Ricardo Eito Brun Uni Madrid
ICIC 2013 Conference Proceedings Ricardo Eito Brun Uni Madrid
 
When AOI meets AI
When AOI meets AIWhen AOI meets AI
When AOI meets AI
 
Trl jaist 20180304 v6
Trl jaist 20180304 v6Trl jaist 20180304 v6
Trl jaist 20180304 v6
 
Move out from your comfort zone!
Move out from your comfort zone!Move out from your comfort zone!
Move out from your comfort zone!
 
Ai4 good 20190419 v3
Ai4 good 20190419 v3Ai4 good 20190419 v3
Ai4 good 20190419 v3
 
[MindsLab] Company introduction_2019_summary
[MindsLab] Company introduction_2019_summary[MindsLab] Company introduction_2019_summary
[MindsLab] Company introduction_2019_summary
 
Future of ai 20190612 v1
Future of ai 20190612 v1Future of ai 20190612 v1
Future of ai 20190612 v1
 
Toyota
ToyotaToyota
Toyota
 
Tracxn Research - Consumer Robotics Report, june 2017
Tracxn Research - Consumer Robotics Report, june 2017Tracxn Research - Consumer Robotics Report, june 2017
Tracxn Research - Consumer Robotics Report, june 2017
 

More from Masanori HIRANO

2023/03/04 sigfin30: 原資産価格過程不要な敵対的Deep Hedging
2023/03/04 sigfin30: 原資産価格過程不要な敵対的Deep Hedging2023/03/04 sigfin30: 原資産価格過程不要な敵対的Deep Hedging
2023/03/04 sigfin30: 原資産価格過程不要な敵対的Deep Hedging
Masanori HIRANO
 
2023/03/04 sigfin30 PR: Special Session on Applied Informatics in Finance and...
2023/03/04 sigfin30 PR: Special Session on Applied Informatics in Finance and...2023/03/04 sigfin30 PR: Special Session on Applied Informatics in Finance and...
2023/03/04 sigfin30 PR: Special Session on Applied Informatics in Finance and...
Masanori HIRANO
 
2022/11/17 PRIMA2022: Analysis of Carbon Neutrality Scenarios of Industrial C...
2022/11/17 PRIMA2022: Analysis of Carbon Neutrality Scenarios of Industrial C...2022/11/17 PRIMA2022: Analysis of Carbon Neutrality Scenarios of Industrial C...
2022/11/17 PRIMA2022: Analysis of Carbon Neutrality Scenarios of Industrial C...
Masanori HIRANO
 
2022/11/17 PRIMA2022: Does Order Simultaneity Affect the Data Mining Task in ...
2022/11/17 PRIMA2022: Does Order Simultaneity Affect the Data Mining Task in ...2022/11/17 PRIMA2022: Does Order Simultaneity Affect the Data Mining Task in ...
2022/11/17 PRIMA2022: Does Order Simultaneity Affect the Data Mining Task in ...
Masanori HIRANO
 
2022/10/30 BESC2022: Parameter Tuning Method for Multi-agent Simulation using...
2022/10/30 BESC2022: Parameter Tuning Method for Multi-agent Simulation using...2022/10/30 BESC2022: Parameter Tuning Method for Multi-agent Simulation using...
2022/10/30 BESC2022: Parameter Tuning Method for Multi-agent Simulation using...
Masanori HIRANO
 
2022/06/15 JSAI2022: Data-driven Agent Design for Artificial Market Simulation
2022/06/15 JSAI2022: Data-driven Agent Design for Artificial Market Simulation2022/06/15 JSAI2022: Data-driven Agent Design for Artificial Market Simulation
2022/06/15 JSAI2022: Data-driven Agent Design for Artificial Market Simulation
Masanori HIRANO
 
2022/05/05 CIFEr2022: Concept and Practice of Artificial Market Data Mining P...
2022/05/05 CIFEr2022: Concept and Practice of Artificial Market Data Mining P...2022/05/05 CIFEr2022: Concept and Practice of Artificial Market Data Mining P...
2022/05/05 CIFEr2022: Concept and Practice of Artificial Market Data Mining P...
Masanori HIRANO
 
2022/03/12 sigfin28: オプションによるオプションのヘッジを可能にする二重 Deep Hedging 機構
2022/03/12 sigfin28: オプションによるオプションのヘッジを可能にする二重 Deep Hedging 機構2022/03/12 sigfin28: オプションによるオプションのヘッジを可能にする二重 Deep Hedging 機構
2022/03/12 sigfin28: オプションによるオプションのヘッジを可能にする二重 Deep Hedging 機構
Masanori HIRANO
 
2020/11/19 PRIMA2020: Simulation of Unintentional Collusion Caused by Auto Pr...
2020/11/19 PRIMA2020: Simulation of Unintentional Collusion Caused by Auto Pr...2020/11/19 PRIMA2020: Simulation of Unintentional Collusion Caused by Auto Pr...
2020/11/19 PRIMA2020: Simulation of Unintentional Collusion Caused by Auto Pr...
Masanori HIRANO
 
2020/11/19 PRIMA2020: Implementation of Real Data for Financial Market Simula...
2020/11/19 PRIMA2020: Implementation of Real Data for Financial Market Simula...2020/11/19 PRIMA2020: Implementation of Real Data for Financial Market Simula...
2020/11/19 PRIMA2020: Implementation of Real Data for Financial Market Simula...
Masanori HIRANO
 
2020/06/08 JSAI2020: STBM: Stochastic Trading Behavior Model for Financial Ma...
2020/06/08 JSAI2020: STBM: Stochastic Trading Behavior Model for Financial Ma...2020/06/08 JSAI2020: STBM: Stochastic Trading Behavior Model for Financial Ma...
2020/06/08 JSAI2020: STBM: Stochastic Trading Behavior Model for Financial Ma...
Masanori HIRANO
 
2020/03/18 NLP2020: 金融文書のための別タスク学習による教師なし重要文判定
2020/03/18 NLP2020: 金融文書のための別タスク学習による教師なし重要文判定2020/03/18 NLP2020: 金融文書のための別タスク学習による教師なし重要文判定
2020/03/18 NLP2020: 金融文書のための別タスク学習による教師なし重要文判定
Masanori HIRANO
 
2019/10/31 PRIMA2019: Comparison of Behaviors of Actual and Simulated HFT Tra...
2019/10/31 PRIMA2019: Comparison of Behaviors of Actual and Simulated HFT Tra...2019/10/31 PRIMA2019: Comparison of Behaviors of Actual and Simulated HFT Tra...
2019/10/31 PRIMA2019: Comparison of Behaviors of Actual and Simulated HFT Tra...
Masanori HIRANO
 
2018/10/30 PRIMA Workshop 2018: Impact Assessments of the CAR Regulation usin...
2018/10/30 PRIMA Workshop 2018: Impact Assessments of the CAR Regulation usin...2018/10/30 PRIMA Workshop 2018: Impact Assessments of the CAR Regulation usin...
2018/10/30 PRIMA Workshop 2018: Impact Assessments of the CAR Regulation usin...
Masanori HIRANO
 
2018/06/06 JSAI2018 Effects Analysis of CAR Regulations on Financial Markets ...
2018/06/06 JSAI2018 Effects Analysis of CAR Regulations on Financial Markets ...2018/06/06 JSAI2018 Effects Analysis of CAR Regulations on Financial Markets ...
2018/06/06 JSAI2018 Effects Analysis of CAR Regulations on Financial Markets ...
Masanori HIRANO
 

More from Masanori HIRANO (15)

2023/03/04 sigfin30: 原資産価格過程不要な敵対的Deep Hedging
2023/03/04 sigfin30: 原資産価格過程不要な敵対的Deep Hedging2023/03/04 sigfin30: 原資産価格過程不要な敵対的Deep Hedging
2023/03/04 sigfin30: 原資産価格過程不要な敵対的Deep Hedging
 
2023/03/04 sigfin30 PR: Special Session on Applied Informatics in Finance and...
2023/03/04 sigfin30 PR: Special Session on Applied Informatics in Finance and...2023/03/04 sigfin30 PR: Special Session on Applied Informatics in Finance and...
2023/03/04 sigfin30 PR: Special Session on Applied Informatics in Finance and...
 
2022/11/17 PRIMA2022: Analysis of Carbon Neutrality Scenarios of Industrial C...
2022/11/17 PRIMA2022: Analysis of Carbon Neutrality Scenarios of Industrial C...2022/11/17 PRIMA2022: Analysis of Carbon Neutrality Scenarios of Industrial C...
2022/11/17 PRIMA2022: Analysis of Carbon Neutrality Scenarios of Industrial C...
 
2022/11/17 PRIMA2022: Does Order Simultaneity Affect the Data Mining Task in ...
2022/11/17 PRIMA2022: Does Order Simultaneity Affect the Data Mining Task in ...2022/11/17 PRIMA2022: Does Order Simultaneity Affect the Data Mining Task in ...
2022/11/17 PRIMA2022: Does Order Simultaneity Affect the Data Mining Task in ...
 
2022/10/30 BESC2022: Parameter Tuning Method for Multi-agent Simulation using...
2022/10/30 BESC2022: Parameter Tuning Method for Multi-agent Simulation using...2022/10/30 BESC2022: Parameter Tuning Method for Multi-agent Simulation using...
2022/10/30 BESC2022: Parameter Tuning Method for Multi-agent Simulation using...
 
2022/06/15 JSAI2022: Data-driven Agent Design for Artificial Market Simulation
2022/06/15 JSAI2022: Data-driven Agent Design for Artificial Market Simulation2022/06/15 JSAI2022: Data-driven Agent Design for Artificial Market Simulation
2022/06/15 JSAI2022: Data-driven Agent Design for Artificial Market Simulation
 
2022/05/05 CIFEr2022: Concept and Practice of Artificial Market Data Mining P...
2022/05/05 CIFEr2022: Concept and Practice of Artificial Market Data Mining P...2022/05/05 CIFEr2022: Concept and Practice of Artificial Market Data Mining P...
2022/05/05 CIFEr2022: Concept and Practice of Artificial Market Data Mining P...
 
2022/03/12 sigfin28: オプションによるオプションのヘッジを可能にする二重 Deep Hedging 機構
2022/03/12 sigfin28: オプションによるオプションのヘッジを可能にする二重 Deep Hedging 機構2022/03/12 sigfin28: オプションによるオプションのヘッジを可能にする二重 Deep Hedging 機構
2022/03/12 sigfin28: オプションによるオプションのヘッジを可能にする二重 Deep Hedging 機構
 
2020/11/19 PRIMA2020: Simulation of Unintentional Collusion Caused by Auto Pr...
2020/11/19 PRIMA2020: Simulation of Unintentional Collusion Caused by Auto Pr...2020/11/19 PRIMA2020: Simulation of Unintentional Collusion Caused by Auto Pr...
2020/11/19 PRIMA2020: Simulation of Unintentional Collusion Caused by Auto Pr...
 
2020/11/19 PRIMA2020: Implementation of Real Data for Financial Market Simula...
2020/11/19 PRIMA2020: Implementation of Real Data for Financial Market Simula...2020/11/19 PRIMA2020: Implementation of Real Data for Financial Market Simula...
2020/11/19 PRIMA2020: Implementation of Real Data for Financial Market Simula...
 
2020/06/08 JSAI2020: STBM: Stochastic Trading Behavior Model for Financial Ma...
2020/06/08 JSAI2020: STBM: Stochastic Trading Behavior Model for Financial Ma...2020/06/08 JSAI2020: STBM: Stochastic Trading Behavior Model for Financial Ma...
2020/06/08 JSAI2020: STBM: Stochastic Trading Behavior Model for Financial Ma...
 
2020/03/18 NLP2020: 金融文書のための別タスク学習による教師なし重要文判定
2020/03/18 NLP2020: 金融文書のための別タスク学習による教師なし重要文判定2020/03/18 NLP2020: 金融文書のための別タスク学習による教師なし重要文判定
2020/03/18 NLP2020: 金融文書のための別タスク学習による教師なし重要文判定
 
2019/10/31 PRIMA2019: Comparison of Behaviors of Actual and Simulated HFT Tra...
2019/10/31 PRIMA2019: Comparison of Behaviors of Actual and Simulated HFT Tra...2019/10/31 PRIMA2019: Comparison of Behaviors of Actual and Simulated HFT Tra...
2019/10/31 PRIMA2019: Comparison of Behaviors of Actual and Simulated HFT Tra...
 
2018/10/30 PRIMA Workshop 2018: Impact Assessments of the CAR Regulation usin...
2018/10/30 PRIMA Workshop 2018: Impact Assessments of the CAR Regulation usin...2018/10/30 PRIMA Workshop 2018: Impact Assessments of the CAR Regulation usin...
2018/10/30 PRIMA Workshop 2018: Impact Assessments of the CAR Regulation usin...
 
2018/06/06 JSAI2018 Effects Analysis of CAR Regulations on Financial Markets ...
2018/06/06 JSAI2018 Effects Analysis of CAR Regulations on Financial Markets ...2018/06/06 JSAI2018 Effects Analysis of CAR Regulations on Financial Markets ...
2018/06/06 JSAI2018 Effects Analysis of CAR Regulations on Financial Markets ...
 

Recently uploaded

Nuclear Power Economics and Structuring 2024
Nuclear Power Economics and Structuring 2024Nuclear Power Economics and Structuring 2024
Nuclear Power Economics and Structuring 2024
Massimo Talia
 
Hierarchical Digital Twin of a Naval Power System
Hierarchical Digital Twin of a Naval Power SystemHierarchical Digital Twin of a Naval Power System
Hierarchical Digital Twin of a Naval Power System
Kerry Sado
 
space technology lecture notes on satellite
space technology lecture notes on satellitespace technology lecture notes on satellite
space technology lecture notes on satellite
ongomchris
 
The role of big data in decision making.
The role of big data in decision making.The role of big data in decision making.
The role of big data in decision making.
ankuprajapati0525
 
Governing Equations for Fundamental Aerodynamics_Anderson2010.pdf
Governing Equations for Fundamental Aerodynamics_Anderson2010.pdfGoverning Equations for Fundamental Aerodynamics_Anderson2010.pdf
Governing Equations for Fundamental Aerodynamics_Anderson2010.pdf
WENKENLI1
 
DESIGN A COTTON SEED SEPARATION MACHINE.docx
DESIGN A COTTON SEED SEPARATION MACHINE.docxDESIGN A COTTON SEED SEPARATION MACHINE.docx
DESIGN A COTTON SEED SEPARATION MACHINE.docx
FluxPrime1
 
weather web application report.pdf
weather web application report.pdfweather web application report.pdf
weather web application report.pdf
Pratik Pawar
 
Final project report on grocery store management system..pdf
Final project report on grocery store management system..pdfFinal project report on grocery store management system..pdf
Final project report on grocery store management system..pdf
Kamal Acharya
 
Fundamentals of Electric Drives and its applications.pptx
Fundamentals of Electric Drives and its applications.pptxFundamentals of Electric Drives and its applications.pptx
Fundamentals of Electric Drives and its applications.pptx
manasideore6
 
Planning Of Procurement o different goods and services
Planning Of Procurement o different goods and servicesPlanning Of Procurement o different goods and services
Planning Of Procurement o different goods and services
JoytuBarua2
 
Industrial Training at Shahjalal Fertilizer Company Limited (SFCL)
Industrial Training at Shahjalal Fertilizer Company Limited (SFCL)Industrial Training at Shahjalal Fertilizer Company Limited (SFCL)
Industrial Training at Shahjalal Fertilizer Company Limited (SFCL)
MdTanvirMahtab2
 
power quality voltage fluctuation UNIT - I.pptx
power quality voltage fluctuation UNIT - I.pptxpower quality voltage fluctuation UNIT - I.pptx
power quality voltage fluctuation UNIT - I.pptx
ViniHema
 
Immunizing Image Classifiers Against Localized Adversary Attacks
Immunizing Image Classifiers Against Localized Adversary AttacksImmunizing Image Classifiers Against Localized Adversary Attacks
Immunizing Image Classifiers Against Localized Adversary Attacks
gerogepatton
 
RAT: Retrieval Augmented Thoughts Elicit Context-Aware Reasoning in Long-Hori...
RAT: Retrieval Augmented Thoughts Elicit Context-Aware Reasoning in Long-Hori...RAT: Retrieval Augmented Thoughts Elicit Context-Aware Reasoning in Long-Hori...
RAT: Retrieval Augmented Thoughts Elicit Context-Aware Reasoning in Long-Hori...
thanhdowork
 
Railway Signalling Principles Edition 3.pdf
Railway Signalling Principles Edition 3.pdfRailway Signalling Principles Edition 3.pdf
Railway Signalling Principles Edition 3.pdf
TeeVichai
 
MCQ Soil mechanics questions (Soil shear strength).pdf
MCQ Soil mechanics questions (Soil shear strength).pdfMCQ Soil mechanics questions (Soil shear strength).pdf
MCQ Soil mechanics questions (Soil shear strength).pdf
Osamah Alsalih
 
HYDROPOWER - Hydroelectric power generation
HYDROPOWER - Hydroelectric power generationHYDROPOWER - Hydroelectric power generation
HYDROPOWER - Hydroelectric power generation
Robbie Edward Sayers
 
road safety engineering r s e unit 3.pdf
road safety engineering  r s e unit 3.pdfroad safety engineering  r s e unit 3.pdf
road safety engineering r s e unit 3.pdf
VENKATESHvenky89705
 
J.Yang, ICLR 2024, MLILAB, KAIST AI.pdf
J.Yang,  ICLR 2024, MLILAB, KAIST AI.pdfJ.Yang,  ICLR 2024, MLILAB, KAIST AI.pdf
J.Yang, ICLR 2024, MLILAB, KAIST AI.pdf
MLILAB
 
一比一原版(SFU毕业证)西蒙菲莎大学毕业证成绩单如何办理
一比一原版(SFU毕业证)西蒙菲莎大学毕业证成绩单如何办理一比一原版(SFU毕业证)西蒙菲莎大学毕业证成绩单如何办理
一比一原版(SFU毕业证)西蒙菲莎大学毕业证成绩单如何办理
bakpo1
 

Recently uploaded (20)

Nuclear Power Economics and Structuring 2024
Nuclear Power Economics and Structuring 2024Nuclear Power Economics and Structuring 2024
Nuclear Power Economics and Structuring 2024
 
Hierarchical Digital Twin of a Naval Power System
Hierarchical Digital Twin of a Naval Power SystemHierarchical Digital Twin of a Naval Power System
Hierarchical Digital Twin of a Naval Power System
 
space technology lecture notes on satellite
space technology lecture notes on satellitespace technology lecture notes on satellite
space technology lecture notes on satellite
 
The role of big data in decision making.
The role of big data in decision making.The role of big data in decision making.
The role of big data in decision making.
 
Governing Equations for Fundamental Aerodynamics_Anderson2010.pdf
Governing Equations for Fundamental Aerodynamics_Anderson2010.pdfGoverning Equations for Fundamental Aerodynamics_Anderson2010.pdf
Governing Equations for Fundamental Aerodynamics_Anderson2010.pdf
 
DESIGN A COTTON SEED SEPARATION MACHINE.docx
DESIGN A COTTON SEED SEPARATION MACHINE.docxDESIGN A COTTON SEED SEPARATION MACHINE.docx
DESIGN A COTTON SEED SEPARATION MACHINE.docx
 
weather web application report.pdf
weather web application report.pdfweather web application report.pdf
weather web application report.pdf
 
Final project report on grocery store management system..pdf
Final project report on grocery store management system..pdfFinal project report on grocery store management system..pdf
Final project report on grocery store management system..pdf
 
Fundamentals of Electric Drives and its applications.pptx
Fundamentals of Electric Drives and its applications.pptxFundamentals of Electric Drives and its applications.pptx
Fundamentals of Electric Drives and its applications.pptx
 
Planning Of Procurement o different goods and services
Planning Of Procurement o different goods and servicesPlanning Of Procurement o different goods and services
Planning Of Procurement o different goods and services
 
Industrial Training at Shahjalal Fertilizer Company Limited (SFCL)
Industrial Training at Shahjalal Fertilizer Company Limited (SFCL)Industrial Training at Shahjalal Fertilizer Company Limited (SFCL)
Industrial Training at Shahjalal Fertilizer Company Limited (SFCL)
 
power quality voltage fluctuation UNIT - I.pptx
power quality voltage fluctuation UNIT - I.pptxpower quality voltage fluctuation UNIT - I.pptx
power quality voltage fluctuation UNIT - I.pptx
 
Immunizing Image Classifiers Against Localized Adversary Attacks
Immunizing Image Classifiers Against Localized Adversary AttacksImmunizing Image Classifiers Against Localized Adversary Attacks
Immunizing Image Classifiers Against Localized Adversary Attacks
 
RAT: Retrieval Augmented Thoughts Elicit Context-Aware Reasoning in Long-Hori...
RAT: Retrieval Augmented Thoughts Elicit Context-Aware Reasoning in Long-Hori...RAT: Retrieval Augmented Thoughts Elicit Context-Aware Reasoning in Long-Hori...
RAT: Retrieval Augmented Thoughts Elicit Context-Aware Reasoning in Long-Hori...
 
Railway Signalling Principles Edition 3.pdf
Railway Signalling Principles Edition 3.pdfRailway Signalling Principles Edition 3.pdf
Railway Signalling Principles Edition 3.pdf
 
MCQ Soil mechanics questions (Soil shear strength).pdf
MCQ Soil mechanics questions (Soil shear strength).pdfMCQ Soil mechanics questions (Soil shear strength).pdf
MCQ Soil mechanics questions (Soil shear strength).pdf
 
HYDROPOWER - Hydroelectric power generation
HYDROPOWER - Hydroelectric power generationHYDROPOWER - Hydroelectric power generation
HYDROPOWER - Hydroelectric power generation
 
road safety engineering r s e unit 3.pdf
road safety engineering  r s e unit 3.pdfroad safety engineering  r s e unit 3.pdf
road safety engineering r s e unit 3.pdf
 
J.Yang, ICLR 2024, MLILAB, KAIST AI.pdf
J.Yang,  ICLR 2024, MLILAB, KAIST AI.pdfJ.Yang,  ICLR 2024, MLILAB, KAIST AI.pdf
J.Yang, ICLR 2024, MLILAB, KAIST AI.pdf
 
一比一原版(SFU毕业证)西蒙菲莎大学毕业证成绩单如何办理
一比一原版(SFU毕业证)西蒙菲莎大学毕业证成绩单如何办理一比一原版(SFU毕业证)西蒙菲莎大学毕业证成绩单如何办理
一比一原版(SFU毕业证)西蒙菲莎大学毕业证成绩单如何办理
 

2018/11/17 ICDMW 2018: Selection of Related Stocks using Financial Text Mining

  • 1. Selec%on of Related Stocks using Financial Text Mining Masanori HIRANO*1, Hiroki SAKAJI*2, Shoko KIMURA*3, Kiyoshi IZUMI*2, Hiroyasu MATSUSHIMA*2, Shintaro NAGAO*3, and Atsuo KATO*4 *1 Faculty of Engineering, The University of Tokyo *2 School of Engineering, The University of Tokyo *3 QuanKtaKve Investment Department, Daiwa Asset Management Co. Ltd. *4 FronKer Technologies Research & ConsulKng Dept., Daiwa InsKtute of Research Ltd. hirano@g.ecc.u-tokyo.ac.jp hWps://mhirano.jp/
  • 2. Background • Pick up some stocks & build up “themed mutual fund” • Private investors invest in it • Lots of stocks • Tokyo: about 3,600! • Other stock exchanges all over the world • Quite difficult to cover all theme • Stock abroad • Not familiar theme • => Support human! Robot & Technology related fund --Robotech-- 11/17/18 ICDM2018ws: 1st CDEC © Izumi Lab. & M.HIRANO 2
  • 3. Demo: Fund Building Support System 11/17/18 ICDM2018ws: 1st CDEC © Izumi Lab. & M.HIRANO 3 Robot Similarity/Ticker/Name Similarity Reasoning Sentences in Japanese Similarity/Ticker/Name Word/Similarity/File Robo (abbr.) Mini Robot
  • 4. Usual Case 11/17/18 ICDM2018ws: 1st CDEC © Izumi Lab. & M.HIRANO 4 Input: Theme “AI” Companies’ Informa4on IRs Companies’ HPs Web Search Pick up Stocks Outputs: Related stocks
  • 5. Our Method Outline 11/17/18 ICDM2018ws: 1st CDEC © Izumi Lab. & M.HIRANO 5 Input: Theme “AI” Related Words “expert system” “machine learning” “deep learning” “natural language processing” Companies’ Informa>on IRs Companies’ Official Websites Word2vec Word matching Extracted & Reasoning Sentences “We automa>cally respond to a two-way interac>ve AI FAQ system using natural language processing.”* Sum up similarity Outputs: Related stocks + Similarity to the theme + Reasoning Sentences *hPps://www.soQbank.jp/corp/group/sbm/news/press/2018/20180301_01/ Crawling & Preprocessing Feedback
  • 6. Our Method Outline 11/17/18 ICDM2018ws: 1st CDEC © Izumi Lab. & M.HIRANO 6 Input: Theme “AI” Related Words “expert system” “machine learning” “deep learning” “natural language processing” Companies’ Informa>on IRs Companies’ Official Websites Word2vec Word matching Extracted & Reasoning Sentences “We automa>cally respond to a two-way interac>ve AI FAQ system using natural language processing.”* Sum up similarity Outputs: Related stocks + Similarity to the theme + Reasoning Sentences *hPps://www.soQbank.jp/corp/group/sbm/news/press/2018/20180301_01/ Crawling & Preprocessing Feedback
  • 7. Data • Data for similarity (word2vec) 1,809,736,365 words w/ 1,147,973 vocabularies • livedoor news corpus • Wikipedia Japanese ar3cles (version 21-Jun-2018 22:09) • Nikkei newspaper ar3cles(1990–2015 and 2017; data from 2016 were omiHed for technical reasons) • Companies’ informaMon • Investors Rela3ons(2012/10/9-2018/5/11; 90,813 files) • Official Websites(only 2,293,460 pages in all companies) => total: 2TB • We use 5 kinds of data! 11/17/18 ICDM2018ws: 1st CDEC © Izumi Lab. & M.HIRANO 7
  • 8. Our Method Outline 11/17/18 ICDM2018ws: 1st CDEC © Izumi Lab. & M.HIRANO 8 Input: Theme “AI” Related Words “expert system” “machine learning” “deep learning” “natural language processing” Companies’ Informa>on IRs Companies’ Official Websites Word2vec Word matching Extracted & Reasoning Sentences “We automa>cally respond to a two-way interac>ve AI FAQ system using natural language processing.”* Sum up similarity Outputs: Related stocks + Similarity to the theme + Reasoning Sentences *hOps://www.soPbank.jp/corp/group/sbm/news/press/2018/20180301_01/ Crawling & Preprocessing Feedback
  • 9. Feedback using Cooccurrence 11/17/18 ICDM2018ws: 1st CDEC © Izumi Lab. & M.HIRANO 9 The master set for the word “AI” Test set for the word1 “NLP” Test set for the word2 “machine learning” … TYO: 8207 TYO: 1945 TYO: 2154 TYO: 3861 TYO: 2397 TYO: 3045 TYO: 7816 TYO: 1931 TYO: 9984 TYO: 1925 TYO: 4845 TYO: 1400 TYO: 4288
  • 10. Feedback using Cooccurrence 11/17/18 ICDM2018ws: 1st CDEC © Izumi Lab. & M.HIRANO 10 The master set for the word “AI” Test set for the word1 “NLP” Test set for the word2 “machine learning” … TYO: 8207 TYO: 1945 TYO: 2154 TYO: 3861 TYO: 2397 TYO: 3045 TYO: 7816 TYO: 1931 TYO: 9984 TYO: 1925 TYO: 4845 TYO: 1400 TYO: 4288 5 companies
  • 11. Feedback using Cooccurrence 11/17/18 ICDM2018ws: 1st CDEC © Izumi Lab. & M.HIRANO 11 The master set for the word “AI” Test set for the word1 “NLP” Test set for the word2 “machine learning” … TYO: 8207 TYO: 1945 TYO: 2154 TYO: 3861 TYO: 2397 TYO: 3045 TYO: 7816 TYO: 1931 TYO: 9984 TYO: 1925 TYO: 4845 TYO: 1400 TYO: 4288 5 companies
  • 12. Feedback using Cooccurrence 11/17/18 ICDM2018ws: 1st CDEC © Izumi Lab. & M.HIRANO 12 The master set for the word “AI” Test set for the word1 “NLP” Test set for the word2 “machine learning” … TYO: 8207 TYO: 1945 TYO: 2154 TYO: 3861 TYO: 2397 TYO: 3045 TYO: 7816 TYO: 1931 TYO: 9984 TYO: 1925 TYO: 4845 TYO: 1400 TYO: 4288 5 companies 4 companies =0.8 Similarity btw “AI” & “NLP”
  • 13. Feedback using Cooccurrence 11/17/18 ICDM2018ws: 1st CDEC © Izumi Lab. & M.HIRANO 13 The master set for the word “AI” Test set for the word1 “NLP” Test set for the word2 “machine learning” … TYO: 8207 TYO: 1945 TYO: 2154 TYO: 3861 TYO: 2397 TYO: 3045 TYO: 7816 TYO: 1931 TYO: 9984 TYO: 1925 TYO: 4845 TYO: 1400 TYO: 4288 6 companies 4 companies =0.66
  • 14. How to calculate similarity • A: Similari*es using Word2vec: • Using 9 models with each sets of hyper parameters • Only using top-100 similar words in each model => harmonic average • B: Similari*es from feedback using cooccurrence • Take harmonic average 2"# " + # • This is final similarity of each words 11/17/18 ICDM2018ws: 1st CDEC © Izumi Lab. & M.HIRANO 14
  • 15. Calculate COMPANY similarity • Count the number of appearance of each top-10 similar word & original word • Sum the count weighed by similari9es => company similarity • Original theme word’s weight: 1 11/17/18 ICDM2018ws: 1st CDEC © Izumi Lab. & M.HIRANO 15
  • 16. Results 11/17/18 ICDM2018ws: 1st CDEC © Izumi Lab. & M.HIRANO 16
  • 17. Similar words to “ ”(health) 11/17/18 ICDM2018ws: 1st CDEC © Izumi Lab. & M.HIRANO 17 Rank Word word2vec feedback final 1 (getting healthy) 0.6696 0.9921 0.7996 2 (lifestyle diseases) 0.5968 0.9879 0.7441 3 (diet) 0.5724 0.9770 0.7219 4 (nutrition) 0.5692 0.9739 0.7185 5 (antiaging) 0.5433 1.0000 0.7040 6 (prevention medicine) 0.5421 1.0000 0.7031 7 (healthy) 0.5402 0.9774 0.6959 8 (condition of health) 0.5347 0.9803 0.6919 9 (diet) 0.5333 0.9745 0.6893 10 (lifestyle habit) 0.5233 0.9835 0.6831
  • 18. Top 11-20 (Not used) similar words 11/17/18 ICDM2018ws: 1st CDEC © Izumi Lab. & M.HIRANO 18 Rank Word word2vec feedback final 11 (beauty) 0.5259 0.9507 0.6772 12 (mental health) 0.5154 0.9809 0.6758 13 (antifat) 0.5009 1.0000 0.6674 14 (prevention) 0.5243 0.8888 0.6595 15 (dietary education) 0.4914 0.9742 0.6532 16 (care prevention) 0.4929 0.9532 0.6498 17 (health management method) 0.4796 1.0000 0.6483 18 (metabolic syndrome (abbr.)) 0.4810 0.9897 0.6474 19 (diet therapy) 0.4774 1.0000 0.6462 20 (welfare) 0.4973 0.9203 0.6457
  • 19. Results (e-Sports) 11/17/18 ICDM2018ws: 1st CDEC © Izumi Lab. & M.HIRANO 19 Rank Similarity Ticker Name 1 604.5824 3922 PR TIMES Inc 2 387.4172 6552 GameWith Inc 3 82.69871 9697 CAPCOM CO., LTD. 4 38.20549 9468 Kadokawa Dwango Corp 5 20.53668 9766 Konami Holdings Corp 6 19.26718 3326 Runsystem Co Ltd 7 15.15767 3904 Kayac Inc 8 14.44024 2378 Renaissance Inc 9 13.2471 7860 Avex Inc 10 9.734876 3135 MarketEnterprise Co Ltd
  • 20. Evaluation (new data to this presentation, paper doesn’t contain) • 4 experienced fund managers provided answer data based on their knowledge • With in only 100 stocks from TOPIX 500 (At random) • 1 or more fund managers mark as related stocks => tagged as “Related” • No fund manager mark as related => tagged as “Not related” 11/17/18 ICDM2018ws: 1st CDEC © Izumi Lab. & M.HIRANO 20 FM1 FM2 FM3 FM4 Answer data AAA Inc. ✔ ✔ ✔ ✔ Related BBB Corp --- CCC LTD ✔ Related
  • 21. Evaluation (new data to this presentation, paper doesn’t contain) 11/17/18 ICDM2018ws: 1st CDEC © Izumi Lab. & M.HIRANO 21 0.6602 0.8743 0.5565 0.6609 0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1 (beauty) (child-care) (robot) (amusement) precision recall f1
  • 22. Conclusion & Contribution • We built the system suppor0ng fund managers to build up themed mutual fund • We use some types & kinds of data 11/17/18 ICDM2018ws: 1st CDEC © Izumi Lab. & M.HIRANO 22 Future work • More RECALL!!!! • Fix Crawling Problem <= Critical problem for recall • Tune some hyper-parameters • Check casus of the difference btw good & bad cases