SlideShare a Scribd company logo
1 of 11
Download to read offline
TPM Challenge
Tackling Performance Bottlenecks in
Market Data Service
Problems
• App becomes slow or unresponsive during market open and close
• Users report frustration and difficulty executing trades
• Potential impact on user satisfaction, retention & market competitiveness
Goal
Primary objective is to ensure that market data is effectively ingested and delivered to the FE
application, providing users with real-time and accurate information on stock prices, market trends,
and related financial data
Assumptions
• Stock trading app with 10 Million Active Users.
• Peak usage occurs during market open (9:30 AM) and close (4:00 PM), with user activity increasing
by ~100% during these times
• Each user makes an average of 5 API calls per session
• 3rd Party Data Provider Services are working fine. Issue is not being faced by any other competitor(s)
• Market data service is hosted on a single server with limited resources
• App utilizes a centralized database for Market data and User information
• Issue caused solely due to market data service and there is no impact from other services
• Rate limiting for API access is already implemented for external users
Data Collection and Analysis
Quantitative Qualitative
APM tools
(bottleneck identification)
App response times,
error rates
Resource utilization
(CPU, memory, network)
User interviews and
surveys (pain points)
Engineer interviews
(potential causes)
App logs and DB queries
(technical issues)
Metric Baseline Value Peak Hour Value Peak Hour Degradation
App Startup Time 3.5 seconds 7.5 seconds 114.30%
Screen Load Time 3.0 seconds 5.0 seconds 66.70%
User Error Rate 1% 2.50% 150%
API Response Time 100 milliseconds 223 milliseconds 123%
DBMs Av. Load % 47% 89% 89.36%
IOPS 5000 15000 200%
Session Length 12 minutes 15 minutes
Key Metrics
Current Architecture
Market Data Backend Service
Client Applications
Real-time data
streaming update
- Webhook
Communication Layer
Data Ingestion
Service
Data Processing
Service
Data Storage
Service
Normalization, Cleansing, QC
Data Delivery Service
Increased data
requests
Increased
DB I/O
Increased
Network
Congestion
Fetch Data -
Exchanges, Instruments
& Quotes
Primary Issues
• Scalability:
Servers or databases reaching capacity
• Inefficient data processing:
Slow data parsing & aggregation
• Network bottlenecks:
Limited bandwidth/latency issues/API timeouts
Request for
new data
NYSE
NASDAQ
TSX
3rd Party
Data
Broker
Centralized
Database
Proposed Architecture
Market Data Backend Service
Client Applications
Data Ingestion
Service
Data Processing
Service
Data Storage
Service
ALB
Horizontal Scaling
Data Delivery Service
Publish data to Data Exchange
Solution Candidates
• Scheduled Horizontal Scaling
Async - Update
Cache Service
Fetch Non-cached Data from DB
& Update Cache
Decoupling Services
DB
Sharding
Subscribe to relevant
Kafka topics
Real-time data
streaming update
- Webhook
NYSE
NASDAQ
TSX
3rd Party
Data
Broker
Worker Instances
Worker Nodes
Worker Nodes
Data Request
from FE
Worker Instances
Worker Nodes
Worker Nodes
Data Ingestion
Service
Data not present in DB OR
Cache – Request Data
Ingestion Service
• Reduce # requests to BE
• Fetch data from Cache
• Decoupling Data generation
from Data Delivery
• DB Optimizations
Proposed Solutions
Recommendation Description Effort Impact Priority
Horizontal Scaling:
Increase server capacity by adding more
servers during PEAK hours
Low cost, quick implementation -
Improves peak hour performance
Low –Medium High High
Reduce # queries to backend: Implement
web-accelerator like Varnish
Low cost, moderate dev. Effort Medium High High
Caching Market Data:
Store frequently accessed market data to
reduce backend load - Redis
Moderate cost, requires dev. effort -
Reduces DB load, improves retrieval
speed
Medium High High
Optimizing Data Pipelines:
Decoupling data ingestion & delivery
pipelines for efficiency – Kafka
Moderate cost, requires technical
expertise
High High Medium
Database Indexing:
Create indexes on frequently used database
columns like Stock symbols, LTP
Low cost, requires DB expertise Low Medium Medium
Database Sharding:
Divide DB into smaller, logical, horizontal
partitions – moving out historical data
High cost, complex implementation High High Low
Long
Term
Short
Term
Trade-offs & Implementation Details
Category Trade-off Potential Impact Mitigation Strategies
Data Ingestion
Volume vs.
Processing Speed
Worker nodes might become overloaded
during peak hours, leading to data
processing delays & potential latency
Reduced responsiveness ,
inaccurate or delayed
data delivery
Implement scalable infrastructure solutions
like horizontal scaling of worker nodes, utilize
load balancing strategies
Real-time vs.
Historical Data
Granularity
Keeping large amounts of historical data in
the cache increases storage costs & data
retrieval speed. Fetching data on demand
from DB might introduce additional latency
Increased resource
consumption, slower data
delivery for historical
data.
Implement data retention policies to store
only relevant historical data, utilize caching
for frequently accessed data, optimize DB
queries for improved retrieval speed
Messaging Queue
Technology
Choice
Choosing the wrong MQ tech can lead to
performance limitations, scalability issues,
& integration complexity with other system
components.
System bottlenecks, data
delivery failures,
increased dev &
maintenance costs
Research and evaluate different messaging
queue options like RabbitMQ based on
compatibility with existing tech
Estimated Timeline Short Term – 2-6 months | Long term – 6-12 months
Team Involvement Dev team will lead the technical implementation, but collaboration with Devops, & DBAs is crucial
Existing systems might need API integrations or data format transformations to
connect seamlessly with new components.
Thorough testing and validation required.
Integration Challenges
The Way Forward
• Prioritize quick wins and low-risk improvements
• (Scaling out at pre-defined intervals, Web Accelerator, Data Caching)
• Comprehensive monitoring of performance metrics & user feedback
• Data ingestion rate, worker node CPU utilization, Redis cache hit rate, avg. response time etc.
• Load testing using Artificial Load and size the demand
• Gather ongoing user and engineer feedback
• Recommendation will improve Peak & Non-Peak Performance
• Machine Learning-Based Load Prediction factoring in external variables
Success KPIs
• 50% reduction in API response time, DBM Av. Load % & IOPS
• 60% reduction in user error rate
• 50% reduction in app startup time and screen load time
• Improved user experience, churn & increased platform adoption
Thank You
For Viewing

More Related Content

Similar to Technical Product Manager Case Challenge

Data Warehouse Optimization
Data Warehouse OptimizationData Warehouse Optimization
Data Warehouse Optimization
Cloudera, Inc.
 
Teleran Briefing July 2014
Teleran Briefing July 2014Teleran Briefing July 2014
Teleran Briefing July 2014
Howard Meadow
 
Itfma 2009 charleston_cost_cutting_tool
Itfma 2009 charleston_cost_cutting_toolItfma 2009 charleston_cost_cutting_tool
Itfma 2009 charleston_cost_cutting_tool
Karthik Arumugham
 
Data-As-A-Service to enable compliance reporting
Data-As-A-Service to enable compliance reportingData-As-A-Service to enable compliance reporting
Data-As-A-Service to enable compliance reporting
AnalyticsWeek
 

Similar to Technical Product Manager Case Challenge (20)

Igniting Audience Measurement at Time Warner Cable
Igniting Audience Measurement at Time Warner CableIgniting Audience Measurement at Time Warner Cable
Igniting Audience Measurement at Time Warner Cable
 
Webinar: Achieving Customer Centricity and High Margins in Financial Services...
Webinar: Achieving Customer Centricity and High Margins in Financial Services...Webinar: Achieving Customer Centricity and High Margins in Financial Services...
Webinar: Achieving Customer Centricity and High Margins in Financial Services...
 
Performance Testing
Performance TestingPerformance Testing
Performance Testing
 
The Future of Data Warehousing: ETL Will Never be the Same
The Future of Data Warehousing: ETL Will Never be the SameThe Future of Data Warehousing: ETL Will Never be the Same
The Future of Data Warehousing: ETL Will Never be the Same
 
ATAGTR2017 Batch Workload Modelling and Performance Optimization
ATAGTR2017 Batch Workload Modelling and Performance Optimization ATAGTR2017 Batch Workload Modelling and Performance Optimization
ATAGTR2017 Batch Workload Modelling and Performance Optimization
 
Can data virtualization uphold performance with complex queries?
Can data virtualization uphold performance with complex queries?Can data virtualization uphold performance with complex queries?
Can data virtualization uphold performance with complex queries?
 
Pulse2012 Trm Battelle Final
Pulse2012 Trm Battelle FinalPulse2012 Trm Battelle Final
Pulse2012 Trm Battelle Final
 
Production Monitoring Platform
Production Monitoring PlatformProduction Monitoring Platform
Production Monitoring Platform
 
The Shifting Landscape of Data Integration
The Shifting Landscape of Data IntegrationThe Shifting Landscape of Data Integration
The Shifting Landscape of Data Integration
 
Data Services - Business Intelligence Service, Big Data Service
Data Services - Business Intelligence Service, Big Data ServiceData Services - Business Intelligence Service, Big Data Service
Data Services - Business Intelligence Service, Big Data Service
 
Data warehouse
Data warehouseData warehouse
Data warehouse
 
Mind Map Test Data Management Overview
Mind Map Test Data Management OverviewMind Map Test Data Management Overview
Mind Map Test Data Management Overview
 
Performance Acceleration: Summaries, Recommendation, MPP and more
Performance Acceleration: Summaries, Recommendation, MPP and morePerformance Acceleration: Summaries, Recommendation, MPP and more
Performance Acceleration: Summaries, Recommendation, MPP and more
 
Data Warehouse Optimization
Data Warehouse OptimizationData Warehouse Optimization
Data Warehouse Optimization
 
Real Time Business Platform by Ivan Novick from Pivotal
Real Time Business Platform by Ivan Novick from PivotalReal Time Business Platform by Ivan Novick from Pivotal
Real Time Business Platform by Ivan Novick from Pivotal
 
Chapter 11 Enterprise Resource Planning System
Chapter 11 Enterprise Resource Planning SystemChapter 11 Enterprise Resource Planning System
Chapter 11 Enterprise Resource Planning System
 
Teleran Briefing July 2014
Teleran Briefing July 2014Teleran Briefing July 2014
Teleran Briefing July 2014
 
Assessing New Databases– Translytical Use Cases
Assessing New Databases– Translytical Use CasesAssessing New Databases– Translytical Use Cases
Assessing New Databases– Translytical Use Cases
 
Itfma 2009 charleston_cost_cutting_tool
Itfma 2009 charleston_cost_cutting_toolItfma 2009 charleston_cost_cutting_tool
Itfma 2009 charleston_cost_cutting_tool
 
Data-As-A-Service to enable compliance reporting
Data-As-A-Service to enable compliance reportingData-As-A-Service to enable compliance reporting
Data-As-A-Service to enable compliance reporting
 

More from Arush Sharma (7)

Swiggy Surge Case Competition
Swiggy Surge Case CompetitionSwiggy Surge Case Competition
Swiggy Surge Case Competition
 
Refurbished smartphones - Building trust and consumers purchase intention
Refurbished smartphones - Building trust and consumers purchase intentionRefurbished smartphones - Building trust and consumers purchase intention
Refurbished smartphones - Building trust and consumers purchase intention
 
Optum Stratethon - United Healh Group - UHG
Optum Stratethon - United Healh Group - UHGOptum Stratethon - United Healh Group - UHG
Optum Stratethon - United Healh Group - UHG
 
Renew power - ReLead Case Competition
Renew power - ReLead Case CompetitionRenew power - ReLead Case Competition
Renew power - ReLead Case Competition
 
Xiaomi - MI Summit 2019
Xiaomi - MI Summit 2019 Xiaomi - MI Summit 2019
Xiaomi - MI Summit 2019
 
P2P lending idea
P2P lending idea P2P lending idea
P2P lending idea
 
Ab InBev - BUD Challenge 2019
Ab InBev - BUD Challenge 2019Ab InBev - BUD Challenge 2019
Ab InBev - BUD Challenge 2019
 

Recently uploaded

+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
?#DUbAI#??##{{(☎️+971_581248768%)**%*]'#abortion pills for sale in dubai@
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
WSO2
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
panagenda
 

Recently uploaded (20)

Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodPolkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
 
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CV
 
Navi Mumbai Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Navi Mumbai Call Girls 🥰 8617370543 Service Offer VIP Hot ModelNavi Mumbai Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Navi Mumbai Call Girls 🥰 8617370543 Service Offer VIP Hot Model
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
 
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWEREMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a Fresher
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
 
A Beginners Guide to Building a RAG App Using Open Source Milvus
A Beginners Guide to Building a RAG App Using Open Source MilvusA Beginners Guide to Building a RAG App Using Open Source Milvus
A Beginners Guide to Building a RAG App Using Open Source Milvus
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
 
FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
ICT role in 21st century education and its challenges
ICT role in 21st century education and its challengesICT role in 21st century education and its challenges
ICT role in 21st century education and its challenges
 
Artificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyArtificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : Uncertainty
 
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
 

Technical Product Manager Case Challenge

  • 2.
  • 3. Tackling Performance Bottlenecks in Market Data Service Problems • App becomes slow or unresponsive during market open and close • Users report frustration and difficulty executing trades • Potential impact on user satisfaction, retention & market competitiveness Goal Primary objective is to ensure that market data is effectively ingested and delivered to the FE application, providing users with real-time and accurate information on stock prices, market trends, and related financial data
  • 4. Assumptions • Stock trading app with 10 Million Active Users. • Peak usage occurs during market open (9:30 AM) and close (4:00 PM), with user activity increasing by ~100% during these times • Each user makes an average of 5 API calls per session • 3rd Party Data Provider Services are working fine. Issue is not being faced by any other competitor(s) • Market data service is hosted on a single server with limited resources • App utilizes a centralized database for Market data and User information • Issue caused solely due to market data service and there is no impact from other services • Rate limiting for API access is already implemented for external users
  • 5. Data Collection and Analysis Quantitative Qualitative APM tools (bottleneck identification) App response times, error rates Resource utilization (CPU, memory, network) User interviews and surveys (pain points) Engineer interviews (potential causes) App logs and DB queries (technical issues) Metric Baseline Value Peak Hour Value Peak Hour Degradation App Startup Time 3.5 seconds 7.5 seconds 114.30% Screen Load Time 3.0 seconds 5.0 seconds 66.70% User Error Rate 1% 2.50% 150% API Response Time 100 milliseconds 223 milliseconds 123% DBMs Av. Load % 47% 89% 89.36% IOPS 5000 15000 200% Session Length 12 minutes 15 minutes Key Metrics
  • 6. Current Architecture Market Data Backend Service Client Applications Real-time data streaming update - Webhook Communication Layer Data Ingestion Service Data Processing Service Data Storage Service Normalization, Cleansing, QC Data Delivery Service Increased data requests Increased DB I/O Increased Network Congestion Fetch Data - Exchanges, Instruments & Quotes Primary Issues • Scalability: Servers or databases reaching capacity • Inefficient data processing: Slow data parsing & aggregation • Network bottlenecks: Limited bandwidth/latency issues/API timeouts Request for new data NYSE NASDAQ TSX 3rd Party Data Broker Centralized Database
  • 7. Proposed Architecture Market Data Backend Service Client Applications Data Ingestion Service Data Processing Service Data Storage Service ALB Horizontal Scaling Data Delivery Service Publish data to Data Exchange Solution Candidates • Scheduled Horizontal Scaling Async - Update Cache Service Fetch Non-cached Data from DB & Update Cache Decoupling Services DB Sharding Subscribe to relevant Kafka topics Real-time data streaming update - Webhook NYSE NASDAQ TSX 3rd Party Data Broker Worker Instances Worker Nodes Worker Nodes Data Request from FE Worker Instances Worker Nodes Worker Nodes Data Ingestion Service Data not present in DB OR Cache – Request Data Ingestion Service • Reduce # requests to BE • Fetch data from Cache • Decoupling Data generation from Data Delivery • DB Optimizations
  • 8. Proposed Solutions Recommendation Description Effort Impact Priority Horizontal Scaling: Increase server capacity by adding more servers during PEAK hours Low cost, quick implementation - Improves peak hour performance Low –Medium High High Reduce # queries to backend: Implement web-accelerator like Varnish Low cost, moderate dev. Effort Medium High High Caching Market Data: Store frequently accessed market data to reduce backend load - Redis Moderate cost, requires dev. effort - Reduces DB load, improves retrieval speed Medium High High Optimizing Data Pipelines: Decoupling data ingestion & delivery pipelines for efficiency – Kafka Moderate cost, requires technical expertise High High Medium Database Indexing: Create indexes on frequently used database columns like Stock symbols, LTP Low cost, requires DB expertise Low Medium Medium Database Sharding: Divide DB into smaller, logical, horizontal partitions – moving out historical data High cost, complex implementation High High Low Long Term Short Term
  • 9. Trade-offs & Implementation Details Category Trade-off Potential Impact Mitigation Strategies Data Ingestion Volume vs. Processing Speed Worker nodes might become overloaded during peak hours, leading to data processing delays & potential latency Reduced responsiveness , inaccurate or delayed data delivery Implement scalable infrastructure solutions like horizontal scaling of worker nodes, utilize load balancing strategies Real-time vs. Historical Data Granularity Keeping large amounts of historical data in the cache increases storage costs & data retrieval speed. Fetching data on demand from DB might introduce additional latency Increased resource consumption, slower data delivery for historical data. Implement data retention policies to store only relevant historical data, utilize caching for frequently accessed data, optimize DB queries for improved retrieval speed Messaging Queue Technology Choice Choosing the wrong MQ tech can lead to performance limitations, scalability issues, & integration complexity with other system components. System bottlenecks, data delivery failures, increased dev & maintenance costs Research and evaluate different messaging queue options like RabbitMQ based on compatibility with existing tech Estimated Timeline Short Term – 2-6 months | Long term – 6-12 months Team Involvement Dev team will lead the technical implementation, but collaboration with Devops, & DBAs is crucial Existing systems might need API integrations or data format transformations to connect seamlessly with new components. Thorough testing and validation required. Integration Challenges
  • 10. The Way Forward • Prioritize quick wins and low-risk improvements • (Scaling out at pre-defined intervals, Web Accelerator, Data Caching) • Comprehensive monitoring of performance metrics & user feedback • Data ingestion rate, worker node CPU utilization, Redis cache hit rate, avg. response time etc. • Load testing using Artificial Load and size the demand • Gather ongoing user and engineer feedback • Recommendation will improve Peak & Non-Peak Performance • Machine Learning-Based Load Prediction factoring in external variables Success KPIs • 50% reduction in API response time, DBM Av. Load % & IOPS • 60% reduction in user error rate • 50% reduction in app startup time and screen load time • Improved user experience, churn & increased platform adoption