Singular value decomposition - application in data analytics


Singular Value Decomposition (SVD) is a useful technique for matrix factorisation and as such is well suited to handling large sets of data. The powerful, optimised SVD functionality built into Mathematica 10 enables quick and efficient processing of big data where the number of records reaches millions.


Singular Value Decomposition with Mathematica 10
Singular value decomposition (SVD) is a useful matrix factorisation technique well suited to large datasets. Like the PCA discussed previously, SVD can be applied in multivariate analysis to obtain a reduced representation of the data, with just a few variables that capture the fundamental patterns in noisy data. Symbolically, the SVD breaks an m x n matrix $A$ into three components, $A = U \Sigma V^{T}$, where $U$ is the m x n matrix of left singular vectors, $\Sigma$ is a diagonal n x n matrix of singular values and $V$ is the n x n matrix of right singular vectors. The SVD also sorts the columns in high-to-low singular value order, with the largest singular value in the upper left corner and the smallest in the bottom right corner of $\Sigma$. Mathematica 10 provides rich functionality for SVD processing with well-defined functions.
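The factorisation described above can be checked on a small matrix. The original Mathematica code is not reproduced in this transcript, so the following is only a minimal NumPy sketch standing in for it; the 6 x 4 random matrix and all variable names are assumptions made for illustration.

```python
import numpy as np

rng = np.random.default_rng(0)
A = rng.normal(size=(6, 4))  # small m x n example with m = 6, n = 4

# full_matrices=False gives the "thin" SVD: U is m x n, s holds n values, Vt is n x n
U, s, Vt = np.linalg.svd(A, full_matrices=False)
Sigma = np.diag(s)  # place the singular values on the diagonal

print(U.shape, Sigma.shape, Vt.shape)  # (6, 4) (4, 4) (4, 4)
print(np.all(np.diff(s) <= 0))         # singular values come sorted high to low
print(np.allclose(A, U @ Sigma @ Vt))  # the three factors rebuild A
```

Note that NumPy returns the transpose of V directly, so the rows of `Vt` are the right singular vectors.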
Let’s assume a fund invests in a large set of financial instruments (25,000) for which daily prices have been collected over one year. This is quite a large dataset, with 6.25 million records (250 daily observations x 25,000 instruments). The objective is to analyse the behaviour of this large universe. A sample of the first 25 securities is shown below:
We apply the SVD method to detect major trends in this portfolio. Since the dataset is large, our objective is to identify the driving elements in the group. We want to reduce the 250 x 25,000 matrix to a smaller, manageable set.
We use two Mathematica functions, (i) SingularValueDecomposition and (ii) SingularValueList, which provide the necessary tools for our analysis. To optimise the work, we reduce the number of factors to 10, i.e. we focus only on the 10 largest singular values as representative drivers of the entire universe:
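The truncation to the 10 largest singular values might look as follows. This is a hedged NumPy sketch rather than the author's Mathematica code: the data is synthetic (a hypothetical 250 x 500 matrix built from three common drivers plus noise, scaled down from 250 x 25,000), and `np.linalg.svd` plays the role of both SingularValueList and SingularValueDecomposition.

```python
import numpy as np

rng = np.random.default_rng(42)
n_days, n_assets, n_drivers = 250, 500, 3  # scaled-down stand-in for 250 x 25,000

# Hypothetical data: a few common drivers plus instrument-specific noise
drivers = rng.normal(size=(n_days, n_drivers))
loadings = rng.normal(size=(n_drivers, n_assets))
X = drivers @ loadings + 0.5 * rng.normal(size=(n_days, n_assets))

k = 10

# Singular values only, already sorted high to low; keep the k largest
top_sv = np.linalg.svd(X, compute_uv=False)[:k]

# Full thin decomposition, then keep the parts belonging to the k largest values
U, s, Vt = np.linalg.svd(X, full_matrices=False)
U_k, S_k, Vt_k = U[:, :k], np.diag(s[:k]), Vt[:k, :]

print(top_sv.round(2))
print(U_k.shape, S_k.shape, Vt_k.shape)  # (250, 10) (10, 10) (10, 500)
```

For a matrix of the size discussed in the text, a routine that computes only the leading singular triplets would normally be preferred over computing the full decomposition and slicing it.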
This is our result: the 10 largest singular values (SVs). In terms of their ‘weight’ in the overall portfolio, we can see that the 1st SV alone provides roughly 50% of the overall explanation of the noisy data.
The power of SVD lies in detecting driving factors quickly and efficiently. This is the ratio of successive singular values:
With the chart:
And their cumulative weights:
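The transcript does not show how the ratios and weights were computed, so the sketch below rests on two stated assumptions: the ratio is taken as each singular value divided by the previous one, and the weight of a singular value is its share of the sum of all singular values. A common alternative is to use squared singular values, which gives the share of variance. The function name `sv_summary` and the input values are hypothetical.

```python
import numpy as np

def sv_summary(s):
    """Return (ratios of successive singular values, cumulative weights).

    Assumes s is sorted high to low and that the weight of each value
    is its share of the plain sum of singular values.
    """
    s = np.asarray(s, dtype=float)
    ratios = s[1:] / s[:-1]       # s[i+1] / s[i]; values near 1 mean a flat spectrum
    weights = s / s.sum()         # share of each singular value in the total
    return ratios, np.cumsum(weights)

# Made-up singular values, chosen so the arithmetic is easy to follow
s = np.array([8.0, 4.0, 2.0, 1.0, 1.0])
ratios, cumulative = sv_summary(s)

print(ratios)      # ratios are 0.5, 0.5, 0.5, 1.0
print(cumulative)  # cumulative weights are 0.5, 0.75, 0.875, 0.9375, 1.0
```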

So what does SVD analysis tell us? Instead of looking at the whole universe of data, we can construct a reduced set of representative ‘drivers’ that explain what causes the values to change over time. In our case, the 10 largest SVs explain approx. 70% of the changes in 25,000 data series. This represents a massive reduction in dimensionality and a corresponding gain in the efficiency of our effort. If a higher explanatory level is required, we can increase the number of factors, say to 15, to reach a higher confidence level.
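Choosing the number of factors for a required explanatory level can be sketched as below. This is an illustrative NumPy helper, not part of the original analysis: the name `factors_needed` is hypothetical, and it assumes the explanatory level is measured as the cumulative share of the sum of singular values.

```python
import numpy as np

def factors_needed(s, target):
    """Smallest number of leading singular values whose cumulative
    share of the total reaches `target` (a fraction between 0 and 1)."""
    s = np.asarray(s, dtype=float)
    cumulative = np.cumsum(s) / s.sum()
    # Index of the first cumulative share that is >= target, converted to a count
    k = int(np.searchsorted(cumulative, target)) + 1
    return min(k, len(s))  # guard against rounding at target = 1.0

# Made-up singular values; cumulative shares are 0.5, 0.75, 0.875, 0.9375, 1.0
s = np.array([8.0, 4.0, 2.0, 1.0, 1.0])
print(factors_needed(s, 0.70))  # 2
print(factors_needed(s, 0.90))  # 4
```

Raising the target from 0.70 to 0.90 here doubles the number of factors, which mirrors the trade-off described above between explanatory level and the size of the factor set.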
The meaning of the factors is similar to that in PCA: they refer to various deformation modes of the time series. In statistical terms they can be linked to the moments of the multivariate distribution.
It may also be of interest to look at the U and V matrices, which provide additional useful information. In this respect the rows of matrix V are particularly meaningful, as they ‘decompose’ and enrich the information on how each singular value affects the column components.
For example, row[1] in the V matrix explains how the mean value of the up-down movement is propagated across each factor:
In the same way, we can examine other important SVs in decreasing order of importance:
Second factor:
Third factor:
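Inspecting the leading factors could be sketched as follows. The charts and code behind the three factors above are not in this transcript, so this NumPy example is an assumption-laden illustration on a hypothetical 250 x 40 random matrix. In NumPy the right singular vectors are the rows of `Vt`, so each row holds one weight per column (instrument) of the data.

```python
import numpy as np

rng = np.random.default_rng(7)
X = rng.normal(size=(250, 40))  # hypothetical: 250 days x 40 instruments

U, s, Vt = np.linalg.svd(X, full_matrices=False)

for i in range(3):  # first, second and third factor
    loadings = Vt[i]           # one weight per instrument for factor i
    profile = U[:, i] * s[i]   # matching time profile, one value per day
    # Columns with the largest absolute weight are the most exposed to this factor
    most_exposed = np.argsort(-np.abs(loadings))[:5]
    print(f"factor {i + 1}: most exposed columns {most_exposed.tolist()}")
```

Each time profile equals the data projected onto the corresponding right singular vector, which is why the loadings can be read as the exposure of each column to that factor.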
