I was shocked to realize that any good backtest result I had previously obtained could simply be a local minimum that happens to perform well on the test set, or in other words, a lottery won through random initialization. To prevent this kind of test-set overfitting, I have developed a system that simultaneously trains 10K differently initialized trials on a GPU and then validates with the mean portfolio, rather than selecting the best trial. More details of the method and the blind-set results can be found in this presentation.
To my knowledge, this is also the first method that optimizes a constant rebalanced portfolio for a defined risk-adjusted reward under an assumed transaction cost, using Reinforcement Learning (Policy Gradient).
2. UCRP Optimization
- Current portfolio selection methods only address selecting an optimal buy & hold portfolio; they do not help with selecting a constant rebalanced portfolio.
- Due to mean reversion, a minimum-volatility constant rebalanced portfolio that has non-positive returns when simply held can still generate profits.
- In this work, I instead try to select an optimal portfolio for a given trading policy (such as UCRP) and risk-adjusted reward, under transaction costs (1%).
- Besides the portfolio weights, a divergence threshold, which decides when to rebalance back to the selected constant portfolio, is also optimized (see the sketch below).
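A minimal, illustrative sketch of one way this objective could look in PyTorch, assuming a Sharpe-like reward and a 1% cost on turnover. It simplifies by rebalancing daily rather than using the divergence threshold, and the data below is a random placeholder, so it is not the exact formulation used in the project:

```python
import torch

# Illustrative only: Sharpe-like reward for a constant rebalanced portfolio
# with a 1% cost on turnover. Assumes daily rebalancing for simplicity
# (the actual method rebalances on a divergence threshold instead).
def crp_reward(logits, returns, tc=0.01, eps=1e-8):
    w = torch.softmax(logits, dim=-1)              # keep weights on the simplex
    port_ret = (returns * w).sum(dim=-1)           # (n_days,) portfolio returns
    growth = w * (1.0 + returns)                   # weights drift with prices
    w_drift = growth / (growth.sum(dim=-1, keepdim=True) + eps)
    turnover = (w_drift - w).abs().sum(dim=-1)     # volume traded to rebalance
    net_ret = port_ret - tc * turnover
    return net_ret.mean() / (net_ret.std() + eps)  # risk-adjusted reward

logits = torch.zeros(33, requires_grad=True)       # 33 ETFs, uniform start
opt = torch.optim.Adam([logits], lr=1e-2)
returns = torch.randn(500, 33) * 0.01              # placeholder daily returns
for _ in range(100):
    loss = -crp_reward(logits, returns)            # gradient ascent on reward
    opt.zero_grad(); loss.backward(); opt.step()
```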
3. UCRP Optimization
- The asset weights of a buy & hold portfolio change constantly with prices. UCRP rebalances the portfolio back to the target weights when they diverge too much (simulated in the sketch below).
- Red-coloured assets shown on the slide are inverse ETF products that help hedge the risk.
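A toy simulation of this threshold rule, under simplified assumptions of my own (L1 divergence, a flat 1% cost, and the hypothetical parameter name `theta`):

```python
import torch

# Toy UCRP simulation: let weights drift with prices and rebalance back to
# the target only when the L1 divergence exceeds a threshold `theta`.
def simulate_ucrp(target, returns, theta=0.05, tc=0.01):
    w, wealth = target.clone(), 1.0
    for r in returns:                        # r: (n_assets,) daily returns
        growth = w * (1.0 + r)
        wealth *= growth.sum().item()
        w = growth / growth.sum()            # drifted weights
        div = (w - target).abs().sum()
        if div > theta:                      # diverged too much?
            wealth *= 1.0 - tc * div.item()  # pay cost on traded volume
            w = target.clone()               # rebalance back
    return wealth

target = torch.full((33,), 1.0 / 33)         # e.g. a uniform CRP
print(simulate_ucrp(target, torch.randn(500, 33) * 0.01))
```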
4. Test-Set Overfitting!
- The standard approach in machine learning for preventing overfitting to the training set is to use a validation set to decide when to terminate training.
- While optimizing a portfolio-weight vector, one risks finding a policy that performs well on the validation set but does not generalize to a blind test set.
- In fact, due to random initialization alone, one can start with a portfolio weight that already performs best on the very validation set that will be used for early stopping (see the toy demo below).
- For that reason, it is difficult to prevent test-set overfitting and to decide on an early-stopping epoch that can be reused when retraining the weights for live trading.
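A toy demo of this lottery effect on purely synthetic data: among thousands of random candidates, the one that scores best on the validation slice tells you nothing about the test slice, because there is no real signal at all:

```python
import torch

torch.manual_seed(0)
n_cand, n_assets = 10_000, 33
val = torch.randn(250, n_assets) * 0.01       # pure-noise "validation" returns
test = torch.randn(250, n_assets) * 0.01      # pure-noise "test" returns
w = torch.softmax(torch.randn(n_cand, n_assets), dim=-1)  # random candidates

def sharpe(w, rets, eps=1e-8):
    pr = rets @ w.T                           # (n_days, n_cand)
    return pr.mean(dim=0) / (pr.std(dim=0) + eps)

best = sharpe(w, val).argmax()                # the validation "lottery winner"
# High Sharpe on validation, near zero on test, despite zero real signal:
print(sharpe(w, val)[best].item(), sharpe(w, test)[best].item())
```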
5. Training a Population
- Instead of optimizing a single portfolio-weight vector, train a population of them in parallel. At initialization, some will already be overfitting the validation set.
- After each epoch of training, use the mean weight of the top 50% of candidates to calculate the validation loss that drives early stopping.
- Combine the training and validation sets and train the population until the early-stopping epoch. Use the mean weight of the top 50% of candidates for evaluation on the test set.
- In this project, 8192 portfolio-weight candidates over 33 ETFs are optimized in parallel via PyTorch's autograd package on an RTX 2060 GPU (sketched below).
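A condensed sketch of the population scheme, with simplifications of my own: a plain Sharpe reward without transaction costs, the top 50% ranked by training reward, placeholder data, and an assumed patience of 10 epochs. Only the structure mirrors the method:

```python
import torch

def sharpe(w, returns, eps=1e-8):
    pr = returns @ w.T                           # (n_days, n_candidates)
    return pr.mean(dim=0) / (pr.std(dim=0) + eps)

P, A = 8192, 33                                  # population size, n assets
logits = torch.randn(P, A, requires_grad=True)   # differently initialized
opt = torch.optim.Adam([logits], lr=1e-2)
train = torch.randn(750, A) * 0.01               # placeholder returns
valid = torch.randn(250, A) * 0.01

best_val, patience = float("-inf"), 0
for epoch in range(200):
    reward = sharpe(torch.softmax(logits, -1), train)  # (P,) all at once
    loss = -reward.mean()                        # one backward updates all
    opt.zero_grad(); loss.backward(); opt.step()
    with torch.no_grad():                        # early-stopping signal:
        w = torch.softmax(logits, -1)
        top = sharpe(w, train).topk(P // 2).indices    # top 50% candidates
        mean_w = w[top].mean(dim=0, keepdim=True)      # their mean portfolio
        val = sharpe(mean_w, valid).item()       # score the mean, not the best
    if val > best_val:
        best_val, patience = val, 0
    else:
        patience += 1
        if patience >= 10:
            break                                # early-stopping epoch found
```

Once the early-stopping epoch is found, the same loop is re-run on the combined train and validation data up to that epoch, and the mean weight of the top 50% is evaluated once on the test set.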
10. Conclusion & Future Work
- Paper trading has been running for the past 3 months and has performed in line with the backtest results. The next goal is to start live trading.
- Instead of optimizing the divergence threshold, train a basic model that decides when to rebalance the portfolio back to the selected constant weights, or to liquidate it, using the diverged weights and the selected portfolio's returns as inputs.
- Explore further evolutionary approaches for accelerating training, such as resampling the bottom X% of candidates from the distribution of the remaining candidates (sketched below).
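One possible reading of that resampling step, as a sketch; the diagonal-Gaussian refit and the function name `resample_bottom` are assumptions of mine, not a decided design:

```python
import torch

# Hypothetical resampling step: redraw the bottom `frac` of candidates from
# a diagonal Gaussian fitted to the surviving (better) candidates.
def resample_bottom(logits, reward, frac=0.2):
    P, A = logits.shape
    k = int(P * frac)
    order = reward.argsort()                 # ascending: worst first
    survivors = logits[order[k:]]
    mu, sigma = survivors.mean(dim=0), survivors.std(dim=0)
    new_logits = logits.clone()
    new_logits[order[:k]] = mu + sigma * torch.randn(k, A)
    return new_logits                        # call between training epochs
```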
- Real-time 3D visualization of the population during training using TensorBoard.
11. Who Am I?
I am the Chief Data Scientist (CDS) of Hawk:AI, an Anti-Money Laundering startup. I was previously CDS at ConnectedLife GmbH, a global all-in-one Smart Living & Healthcare technology provider. I founded AI startups (LivingRooms GmbH & OTA Expert Inc) and worked at internationally reputable research institutes, including the Socio-Digital Systems (Human Experience & Design) group in the Computer Mediated Living laboratory of Microsoft Research Cambridge (MSRC) and the Quality & Usability group of Deutsche Telekom Innovation Laboratories (T-Labs), besides the Computer Vision & Pattern Analysis (VPALAB), Computer Graphics (CGLAB) and Distributed Artificial Intelligence (DAI-Labor) laboratories of Sabanci University & TU-Berlin, where I co-authored 35+ publications on AI.