SlideShare a Scribd company logo
1 of 32
Download to read offline
Kafka for Financial Data
- The Flipkart way
Shankar Manian & Rajat Gangwar
2
Agenda
• Flipkart - A Background
• Financial Data - Requirements and Challenges
• Case Study
• Future ahead
3
Company Intro
• Founded in 2007, a multi billion dollar company
• Pioneered Indian eCommerce with Innovations like COD
• 200+M Registered users, 50+M Active users
• Market leader in Fashion, Electronics and many more
• 100k+ active sellers
4
What we do - Record Transactions
● Diverse set of sources
● Many parties involved
● Spread over time
5
What we do - Provide Insights
● Power Balance sheet, P & L statement
● Real time trend analytics and forecasts
● Revenue assurance and fraud detection
6
What we do - Be Compliant
● Financial data is highly regulated
● eCommerce is further regulated
● We also provide tax compliance
7
Challenges - Data Quality
● Financial data is highly sensitive
● Legal Implications and Social media backlash
● Need to have real time visibility
8
Challenges - Complete Audit Trail
● Not enough to be 100% correct, be able to prove to be 100%
correct
● Need to be able to “Follow the Money”
● Need to have verifiable, repeatable processing.
9
Challenges - Risk Averse Customer
● Primary customers are accountants, auditors and government
● Hard to convince, naturally paranoid
● Credibility is paramount
Case Study
11
Payout flow - A case study
● Overview
● First solution
● Limitations
● Streaming
● Challenges
● Next steps
12
Payout flow
User
Module
Seller
Module
Marketplace
Accounting
Flipkart
Money Order placed
Item dispatched
Party_from : Flipkart
Party_to : Seller
Amount : 300
13
Payout flow : Seller Dashboard
14
Evolution 0
Invoice Disbursement
15
Evolution 1
Invoice Payment Disbursement
16
Shortcomings
- Not real-time
17
Shortcomings
- Not real-time
- Vicious cycle
18
Shortcomings
- Not real-time
- Vicious cycle
Increase batch
size of each job
Puts extra
load on DB
Builds up lag
19
Shortcomings
- Not real-time
- Vicious cycle
- Throughput saturation
Invoice
Orches
trator
Payment
00:00
00:45
00:59
: 1st Job
01:00
: 2nd Job
20
Shortcomings
- Not real-time
- Vicious cycle
- Throughput saturation
Throughput = Batch size X Number of jobsX
a ~ b
b
(per day)
Saturation Point :
a
21
Problem ?
Triggers an
Event
Event
Consumption
22
Enter Stream...
Order Stream Invoice Invoice Stream
Consumer
30 50
Seller 1 Seller 2
Payout
Dashboard
23
Architecture
Order Stream Invoice
Payout
Dashboard
Invoice Stream
Consumer
30 50
Seller 1 Seller 2
24
Challenges
1. Avoid Locks
10
500
Bucket
Stream Processor
510
20
530
15 201535 153540
Payout
Module
530
0
25
Challenges
1. Avoid Locks
10
500
Bucket 1
Stream Processor
510
20
530
15 201535 153540 Payout
Module
Day 1Day 2
530
Bucket 1
0
Bucket 2
15
26
Challenges
1. Avoid Locks
2. Exactly-once
27
Challenges
1. Avoid Locks
2. Exactly-once
B1B2 B1 : T1
Kafka
Storm
Bucket 1
T1 T1
Bucket 1
T1
Bucket 2
28
Challenges
1. Avoid Locks
2. Exactly-once
Operation Idempotent
X = X + 5 No
X = X + 5,
T = T1
May be
X = 50 Yes
Meaning : The operation will produce the same result,
no matter how many times it is repeated.
Solution : IDEMPOTENCY
29
Challenges
1. Avoid Locks
2. Exactly-once
3. Testing / Validation
Invoice
Payments
Buckets
Disbursement
Old flow
New flow
30
Summary
- Real-time
- No extra load on DB
- Throughput is not bounded
31
Next Steps
Stream
Invoice
Kafka
Aggregation
D M T
Debtor
Month-end
Tax-filing
Stream
Stream
Stream
Stream
Questions?

More Related Content

Similar to Kafka For Financial Data Processing - The Flipkart Way (Shankar Manian and Rajat Gangwar, Flipkart) Kafka Summit London 2019

Direct Insite Investor Presentation - September 2013
Direct Insite Investor Presentation - September 2013Direct Insite Investor Presentation - September 2013
Direct Insite Investor Presentation - September 2013
Company Spotlight
 
Direct Insite Investor Presentation 2013
Direct Insite Investor Presentation 2013 Direct Insite Investor Presentation 2013
Direct Insite Investor Presentation 2013
Company Spotlight
 
Direct Insite Presentation - October 2013
Direct Insite Presentation - October 2013Direct Insite Presentation - October 2013
Direct Insite Presentation - October 2013
Company Spotlight
 
ChakraView – A 360° Approach to Data Quality
ChakraView – A 360° Approach to Data QualityChakraView – A 360° Approach to Data Quality
ChakraView – A 360° Approach to Data Quality
Databricks
 
UiPath Community_Automation Business Analysts.pptx
UiPath Community_Automation Business Analysts.pptxUiPath Community_Automation Business Analysts.pptx
UiPath Community_Automation Business Analysts.pptx
RohitRadhakrishnan8
 

Similar to Kafka For Financial Data Processing - The Flipkart Way (Shankar Manian and Rajat Gangwar, Flipkart) Kafka Summit London 2019 (20)

Alternative payment methods 03 2015 LERNER Consulting
Alternative payment methods 03 2015 LERNER ConsultingAlternative payment methods 03 2015 LERNER Consulting
Alternative payment methods 03 2015 LERNER Consulting
 
DC PPT_GROUP 10.pptx
DC PPT_GROUP 10.pptxDC PPT_GROUP 10.pptx
DC PPT_GROUP 10.pptx
 
How Trek Bicycles Enhanced Its Operations Globally While Mitigating Payment F...
How Trek Bicycles Enhanced Its Operations Globally While Mitigating Payment F...How Trek Bicycles Enhanced Its Operations Globally While Mitigating Payment F...
How Trek Bicycles Enhanced Its Operations Globally While Mitigating Payment F...
 
"Making Accounts Receivable Automation Work for You" Webinar Slide Deck
"Making Accounts Receivable Automation Work for You" Webinar Slide Deck"Making Accounts Receivable Automation Work for You" Webinar Slide Deck
"Making Accounts Receivable Automation Work for You" Webinar Slide Deck
 
Integrating Procurement & AP - Trends, Drivers, & The Role Of Technology
Integrating Procurement & AP - Trends, Drivers, & The Role Of TechnologyIntegrating Procurement & AP - Trends, Drivers, & The Role Of Technology
Integrating Procurement & AP - Trends, Drivers, & The Role Of Technology
 
e-comm new2.ppt
e-comm new2.ppte-comm new2.ppt
e-comm new2.ppt
 
Seattle Digital Identity and Commerce Slides
Seattle Digital Identity and Commerce SlidesSeattle Digital Identity and Commerce Slides
Seattle Digital Identity and Commerce Slides
 
Seattle Digital Identity and Commerce Slides
Seattle Digital Identity and Commerce SlidesSeattle Digital Identity and Commerce Slides
Seattle Digital Identity and Commerce Slides
 
Direct Insite Investor Presentation - September 2013
Direct Insite Investor Presentation - September 2013Direct Insite Investor Presentation - September 2013
Direct Insite Investor Presentation - September 2013
 
Direct Insite Investor Presentation 2013
Direct Insite Investor Presentation 2013 Direct Insite Investor Presentation 2013
Direct Insite Investor Presentation 2013
 
Direct Insite Presentation - October 2013
Direct Insite Presentation - October 2013Direct Insite Presentation - October 2013
Direct Insite Presentation - October 2013
 
ChakraView – A 360° Approach to Data Quality
ChakraView – A 360° Approach to Data QualityChakraView – A 360° Approach to Data Quality
ChakraView – A 360° Approach to Data Quality
 
Financial Accounting
Financial AccountingFinancial Accounting
Financial Accounting
 
Financial accounting-1213527362725423-9
Financial accounting-1213527362725423-9Financial accounting-1213527362725423-9
Financial accounting-1213527362725423-9
 
Clothes R Us Situational analysis
Clothes R Us Situational analysisClothes R Us Situational analysis
Clothes R Us Situational analysis
 
Finance at TransferWise
Finance at TransferWiseFinance at TransferWise
Finance at TransferWise
 
Measuring the Subscription Economy
Measuring the Subscription EconomyMeasuring the Subscription Economy
Measuring the Subscription Economy
 
UiPath Community_Automation Business Analysts.pptx
UiPath Community_Automation Business Analysts.pptxUiPath Community_Automation Business Analysts.pptx
UiPath Community_Automation Business Analysts.pptx
 
Exception analytics - Balancing Risk & Control
Exception analytics - Balancing Risk & ControlException analytics - Balancing Risk & Control
Exception analytics - Balancing Risk & Control
 
Online Payments and You
Online Payments and YouOnline Payments and You
Online Payments and You
 

More from confluent

More from confluent (20)

Evolving Data Governance for the Real-time Streaming and AI Era
Evolving Data Governance for the Real-time Streaming and AI EraEvolving Data Governance for the Real-time Streaming and AI Era
Evolving Data Governance for the Real-time Streaming and AI Era
 
Catch the Wave: SAP Event-Driven and Data Streaming for the Intelligence Ente...
Catch the Wave: SAP Event-Driven and Data Streaming for the Intelligence Ente...Catch the Wave: SAP Event-Driven and Data Streaming for the Intelligence Ente...
Catch the Wave: SAP Event-Driven and Data Streaming for the Intelligence Ente...
 
Santander Stream Processing with Apache Flink
Santander Stream Processing with Apache FlinkSantander Stream Processing with Apache Flink
Santander Stream Processing with Apache Flink
 
Unlocking the Power of IoT: A comprehensive approach to real-time insights
Unlocking the Power of IoT: A comprehensive approach to real-time insightsUnlocking the Power of IoT: A comprehensive approach to real-time insights
Unlocking the Power of IoT: A comprehensive approach to real-time insights
 
Workshop híbrido: Stream Processing con Flink
Workshop híbrido: Stream Processing con FlinkWorkshop híbrido: Stream Processing con Flink
Workshop híbrido: Stream Processing con Flink
 
Industry 4.0: Building the Unified Namespace with Confluent, HiveMQ and Spark...
Industry 4.0: Building the Unified Namespace with Confluent, HiveMQ and Spark...Industry 4.0: Building the Unified Namespace with Confluent, HiveMQ and Spark...
Industry 4.0: Building the Unified Namespace with Confluent, HiveMQ and Spark...
 
AWS Immersion Day Mapfre - Confluent
AWS Immersion Day Mapfre   -   ConfluentAWS Immersion Day Mapfre   -   Confluent
AWS Immersion Day Mapfre - Confluent
 
Eventos y Microservicios - Santander TechTalk
Eventos y Microservicios - Santander TechTalkEventos y Microservicios - Santander TechTalk
Eventos y Microservicios - Santander TechTalk
 
Q&A with Confluent Experts: Navigating Networking in Confluent Cloud
Q&A with Confluent Experts: Navigating Networking in Confluent CloudQ&A with Confluent Experts: Navigating Networking in Confluent Cloud
Q&A with Confluent Experts: Navigating Networking in Confluent Cloud
 
Citi TechTalk Session 2: Kafka Deep Dive
Citi TechTalk Session 2: Kafka Deep DiveCiti TechTalk Session 2: Kafka Deep Dive
Citi TechTalk Session 2: Kafka Deep Dive
 
Build real-time streaming data pipelines to AWS with Confluent
Build real-time streaming data pipelines to AWS with ConfluentBuild real-time streaming data pipelines to AWS with Confluent
Build real-time streaming data pipelines to AWS with Confluent
 
Q&A with Confluent Professional Services: Confluent Service Mesh
Q&A with Confluent Professional Services: Confluent Service MeshQ&A with Confluent Professional Services: Confluent Service Mesh
Q&A with Confluent Professional Services: Confluent Service Mesh
 
Citi Tech Talk: Event Driven Kafka Microservices
Citi Tech Talk: Event Driven Kafka MicroservicesCiti Tech Talk: Event Driven Kafka Microservices
Citi Tech Talk: Event Driven Kafka Microservices
 
Confluent & GSI Webinars series - Session 3
Confluent & GSI Webinars series - Session 3Confluent & GSI Webinars series - Session 3
Confluent & GSI Webinars series - Session 3
 
Citi Tech Talk: Messaging Modernization
Citi Tech Talk: Messaging ModernizationCiti Tech Talk: Messaging Modernization
Citi Tech Talk: Messaging Modernization
 
Citi Tech Talk: Data Governance for streaming and real time data
Citi Tech Talk: Data Governance for streaming and real time dataCiti Tech Talk: Data Governance for streaming and real time data
Citi Tech Talk: Data Governance for streaming and real time data
 
Confluent & GSI Webinars series: Session 2
Confluent & GSI Webinars series: Session 2Confluent & GSI Webinars series: Session 2
Confluent & GSI Webinars series: Session 2
 
Data In Motion Paris 2023
Data In Motion Paris 2023Data In Motion Paris 2023
Data In Motion Paris 2023
 
Confluent Partner Tech Talk with Synthesis
Confluent Partner Tech Talk with SynthesisConfluent Partner Tech Talk with Synthesis
Confluent Partner Tech Talk with Synthesis
 
The Future of Application Development - API Days - Melbourne 2023
The Future of Application Development - API Days - Melbourne 2023The Future of Application Development - API Days - Melbourne 2023
The Future of Application Development - API Days - Melbourne 2023
 

Recently uploaded

TrustArc Webinar - Unified Trust Center for Privacy, Security, Compliance, an...
TrustArc Webinar - Unified Trust Center for Privacy, Security, Compliance, an...TrustArc Webinar - Unified Trust Center for Privacy, Security, Compliance, an...
TrustArc Webinar - Unified Trust Center for Privacy, Security, Compliance, an...
TrustArc
 

Recently uploaded (20)

UiPath manufacturing technology benefits and AI overview
UiPath manufacturing technology benefits and AI overviewUiPath manufacturing technology benefits and AI overview
UiPath manufacturing technology benefits and AI overview
 
Design and Development of a Provenance Capture Platform for Data Science
Design and Development of a Provenance Capture Platform for Data ScienceDesign and Development of a Provenance Capture Platform for Data Science
Design and Development of a Provenance Capture Platform for Data Science
 
JohnPollard-hybrid-app-RailsConf2024.pptx
JohnPollard-hybrid-app-RailsConf2024.pptxJohnPollard-hybrid-app-RailsConf2024.pptx
JohnPollard-hybrid-app-RailsConf2024.pptx
 
Event-Driven Architecture Masterclass: Engineering a Robust, High-performance...
Event-Driven Architecture Masterclass: Engineering a Robust, High-performance...Event-Driven Architecture Masterclass: Engineering a Robust, High-performance...
Event-Driven Architecture Masterclass: Engineering a Robust, High-performance...
 
Design Guidelines for Passkeys 2024.pptx
Design Guidelines for Passkeys 2024.pptxDesign Guidelines for Passkeys 2024.pptx
Design Guidelines for Passkeys 2024.pptx
 
2024 May Patch Tuesday
2024 May Patch Tuesday2024 May Patch Tuesday
2024 May Patch Tuesday
 
How we scaled to 80K users by doing nothing!.pdf
How we scaled to 80K users by doing nothing!.pdfHow we scaled to 80K users by doing nothing!.pdf
How we scaled to 80K users by doing nothing!.pdf
 
Continuing Bonds Through AI: A Hermeneutic Reflection on Thanabots
Continuing Bonds Through AI: A Hermeneutic Reflection on ThanabotsContinuing Bonds Through AI: A Hermeneutic Reflection on Thanabots
Continuing Bonds Through AI: A Hermeneutic Reflection on Thanabots
 
WebAssembly is Key to Better LLM Performance
WebAssembly is Key to Better LLM PerformanceWebAssembly is Key to Better LLM Performance
WebAssembly is Key to Better LLM Performance
 
WebRTC and SIP not just audio and video @ OpenSIPS 2024
WebRTC and SIP not just audio and video @ OpenSIPS 2024WebRTC and SIP not just audio and video @ OpenSIPS 2024
WebRTC and SIP not just audio and video @ OpenSIPS 2024
 
How to Check CNIC Information Online with Pakdata cf
How to Check CNIC Information Online with Pakdata cfHow to Check CNIC Information Online with Pakdata cf
How to Check CNIC Information Online with Pakdata cf
 
ADP Passwordless Journey Case Study.pptx
ADP Passwordless Journey Case Study.pptxADP Passwordless Journey Case Study.pptx
ADP Passwordless Journey Case Study.pptx
 
Portal Kombat : extension du réseau de propagande russe
Portal Kombat : extension du réseau de propagande russePortal Kombat : extension du réseau de propagande russe
Portal Kombat : extension du réseau de propagande russe
 
Human Expert Website Manual WCAG 2.0 2.1 2.2 Audit - Digital Accessibility Au...
Human Expert Website Manual WCAG 2.0 2.1 2.2 Audit - Digital Accessibility Au...Human Expert Website Manual WCAG 2.0 2.1 2.2 Audit - Digital Accessibility Au...
Human Expert Website Manual WCAG 2.0 2.1 2.2 Audit - Digital Accessibility Au...
 
AI in Action: Real World Use Cases by Anitaraj
AI in Action: Real World Use Cases by AnitarajAI in Action: Real World Use Cases by Anitaraj
AI in Action: Real World Use Cases by Anitaraj
 
TEST BANK For, Information Technology Project Management 9th Edition Kathy Sc...
TEST BANK For, Information Technology Project Management 9th Edition Kathy Sc...TEST BANK For, Information Technology Project Management 9th Edition Kathy Sc...
TEST BANK For, Information Technology Project Management 9th Edition Kathy Sc...
 
Cyber Insurance - RalphGilot - Embry-Riddle Aeronautical University.pptx
Cyber Insurance - RalphGilot - Embry-Riddle Aeronautical University.pptxCyber Insurance - RalphGilot - Embry-Riddle Aeronautical University.pptx
Cyber Insurance - RalphGilot - Embry-Riddle Aeronautical University.pptx
 
TrustArc Webinar - Unified Trust Center for Privacy, Security, Compliance, an...
TrustArc Webinar - Unified Trust Center for Privacy, Security, Compliance, an...TrustArc Webinar - Unified Trust Center for Privacy, Security, Compliance, an...
TrustArc Webinar - Unified Trust Center for Privacy, Security, Compliance, an...
 
State of the Smart Building Startup Landscape 2024!
State of the Smart Building Startup Landscape 2024!State of the Smart Building Startup Landscape 2024!
State of the Smart Building Startup Landscape 2024!
 
الأمن السيبراني - ما لا يسع للمستخدم جهله
الأمن السيبراني - ما لا يسع للمستخدم جهلهالأمن السيبراني - ما لا يسع للمستخدم جهله
الأمن السيبراني - ما لا يسع للمستخدم جهله
 

Kafka For Financial Data Processing - The Flipkart Way (Shankar Manian and Rajat Gangwar, Flipkart) Kafka Summit London 2019