SlideShare a Scribd company logo
1 of 18
Download to read offline
Scalable	Pipelines
Vivek Nagarajan
Insight	Data	Engineering	Consulting	Project
My	Role
• Reduce	latency	of	running	a	pipeline
• Setup	infrastructure	for	scaling	pipelines
Pre-Pipeline	Stage
<filename,	yaml>
Pre-Pipeline	Stage
My	ETL	Pipeline
My	ETL	Pipeline
My	ETL	Pipeline
Scaling	Pipeline
Scaling	Pipeline
Scaling	Pipeline
Scaling	Pipeline
Demo
Youtube demo:
https://youtu.be/iCMrUqW2n2g
Challenges
• Understand	existing	framework	and	
infrastructure
• Evolving	set	of		requirements
• Scaling	pipelines	in	flink standalone	
cluster	mode
Performance	Stats
• Reduced	time	taken	to	process	pipeline	
by	over	50	percent
• Running	30	pipelines	concurrently	takes	
an	average	of	2	minutes	per	pipeline
Possible	extensions
• Running	Flink on	yarn
• Benchmarking	with	Spark	transformations
• More	support	for	dynamic	transformations
About	Me Vivekanand Ganapathy Nagarajan
Sr.	Software	Engineer,	Rambus	Inc.
MS	in	Computer	Science	with	focus	
on	Algorithms	and	Database
University	of	Southern	California
Worked	at	
Vmware and	
Amazon	as	
Software	
Engineer	in	Test
Passionate	about	
learning	and	staying	
abreast	with	technology
I	am	an	avid	Coursera
learner	having	
completed	over	20	
courses	with	special	
focus	on	Algorithms
Thank	You

More Related Content

Viewers also liked

Viewers also liked (17)

Planificacao referencial be_ev_et
Planificacao referencial be_ev_etPlanificacao referencial be_ev_et
Planificacao referencial be_ev_et
 
음향 기초 안내서
음향 기초 안내서음향 기초 안내서
음향 기초 안내서
 
DissertationFormatted
DissertationFormattedDissertationFormatted
DissertationFormatted
 
Psicobioetica UDF
Psicobioetica UDFPsicobioetica UDF
Psicobioetica UDF
 
2.9 Cartesian products
2.9 Cartesian products2.9 Cartesian products
2.9 Cartesian products
 
2.7 Ordered pairs
2.7 Ordered pairs2.7 Ordered pairs
2.7 Ordered pairs
 
The Light Field Stereoscope | SIGGRAPH 2015
The Light Field Stereoscope | SIGGRAPH 2015The Light Field Stereoscope | SIGGRAPH 2015
The Light Field Stereoscope | SIGGRAPH 2015
 
Отзыв на иск ркн
Отзыв на иск ркнОтзыв на иск ркн
Отзыв на иск ркн
 
Trabajoventiladores
Trabajoventiladores Trabajoventiladores
Trabajoventiladores
 
Literatura del siglo de oro
Literatura del siglo de oroLiteratura del siglo de oro
Literatura del siglo de oro
 
Artificial drone intelligence technology
Artificial drone intelligence technologyArtificial drone intelligence technology
Artificial drone intelligence technology
 
2.8 Ordered tuples
2.8 Ordered tuples2.8 Ordered tuples
2.8 Ordered tuples
 
1EM #01 Natureza da Ciência (2017)
1EM #01 Natureza da Ciência (2017)1EM #01 Natureza da Ciência (2017)
1EM #01 Natureza da Ciência (2017)
 
3EM #02 Alelos e Probabilidade (2017)
3EM #02 Alelos e Probabilidade (2017)3EM #02 Alelos e Probabilidade (2017)
3EM #02 Alelos e Probabilidade (2017)
 
Multi-camera Time-of-Flight Systems | SIGGRAPH 2016
Multi-camera Time-of-Flight Systems | SIGGRAPH 2016Multi-camera Time-of-Flight Systems | SIGGRAPH 2016
Multi-camera Time-of-Flight Systems | SIGGRAPH 2016
 
Making the Most of VR: 10 Tips for Sports Marketers
Making the Most of VR: 10 Tips for Sports MarketersMaking the Most of VR: 10 Tips for Sports Marketers
Making the Most of VR: 10 Tips for Sports Marketers
 
TwitterStock
TwitterStockTwitterStock
TwitterStock
 

Similar to Scalable Pipelines

CV_Tejas_Pandya_3 Yrs Instrument Engineer
CV_Tejas_Pandya_3 Yrs Instrument EngineerCV_Tejas_Pandya_3 Yrs Instrument Engineer
CV_Tejas_Pandya_3 Yrs Instrument Engineer
tejas pandya
 
SCQAA-SF Meeting on May 21 2014
SCQAA-SF Meeting on May 21 2014 SCQAA-SF Meeting on May 21 2014
SCQAA-SF Meeting on May 21 2014
Sujit Ghosh
 
Subramanian Resume
Subramanian ResumeSubramanian Resume
Subramanian Resume
tilak777
 
The art of system and solution testing
The art of system and solution testingThe art of system and solution testing
The art of system and solution testing
gaoliang641
 
Ronan_Rice_Resume_2016_web
Ronan_Rice_Resume_2016_webRonan_Rice_Resume_2016_web
Ronan_Rice_Resume_2016_web
Ronan Rice
 
Resume - Jason Lewis
Resume - Jason LewisResume - Jason Lewis
Resume - Jason Lewis
Jason Lewis
 
Is accurate system-level power measurement challenging? Check this out!
Is accurate system-level power measurement challenging? Check this out!Is accurate system-level power measurement challenging? Check this out!
Is accurate system-level power measurement challenging? Check this out!
Deepak Shankar
 

Similar to Scalable Pipelines (20)

CV_Tejas_Pandya_3 Yrs Instrument Engineer
CV_Tejas_Pandya_3 Yrs Instrument EngineerCV_Tejas_Pandya_3 Yrs Instrument Engineer
CV_Tejas_Pandya_3 Yrs Instrument Engineer
 
Improving Batch-Process Testing Techniques with a Domain-Specific Language
Improving Batch-Process Testing Techniques with a Domain-Specific LanguageImproving Batch-Process Testing Techniques with a Domain-Specific Language
Improving Batch-Process Testing Techniques with a Domain-Specific Language
 
SCQAA-SF Meeting on May 21 2014
SCQAA-SF Meeting on May 21 2014 SCQAA-SF Meeting on May 21 2014
SCQAA-SF Meeting on May 21 2014
 
GSE Gaining Value From Virtual Control Panels CONTE 2019
GSE Gaining Value From Virtual Control Panels CONTE 2019GSE Gaining Value From Virtual Control Panels CONTE 2019
GSE Gaining Value From Virtual Control Panels CONTE 2019
 
Subramanian Resume
Subramanian ResumeSubramanian Resume
Subramanian Resume
 
Resilience Planning & How the Empire Strikes Back
Resilience Planning & How the Empire Strikes BackResilience Planning & How the Empire Strikes Back
Resilience Planning & How the Empire Strikes Back
 
The art of system and solution testing
The art of system and solution testingThe art of system and solution testing
The art of system and solution testing
 
CCIE Service Provider Techtorial
CCIE Service Provider Techtorial CCIE Service Provider Techtorial
CCIE Service Provider Techtorial
 
Continuous Validation at Scale
Continuous Validation at ScaleContinuous Validation at Scale
Continuous Validation at Scale
 
ASIC Design Fundamentals.pptx
ASIC Design Fundamentals.pptxASIC Design Fundamentals.pptx
ASIC Design Fundamentals.pptx
 
my_updated_CV_14_3_16
my_updated_CV_14_3_16my_updated_CV_14_3_16
my_updated_CV_14_3_16
 
Ronan_Rice_Resume_2016_web
Ronan_Rice_Resume_2016_webRonan_Rice_Resume_2016_web
Ronan_Rice_Resume_2016_web
 
Resume - Jason Lewis
Resume - Jason LewisResume - Jason Lewis
Resume - Jason Lewis
 
Resume - Nanda New
Resume - Nanda NewResume - Nanda New
Resume - Nanda New
 
Introduction to performance testing
Introduction to performance testingIntroduction to performance testing
Introduction to performance testing
 
WebLogic Stability; Detect and Analyse Stuck Threads
WebLogic Stability; Detect and Analyse Stuck ThreadsWebLogic Stability; Detect and Analyse Stuck Threads
WebLogic Stability; Detect and Analyse Stuck Threads
 
Sharam salamian
Sharam salamianSharam salamian
Sharam salamian
 
Embracing Failure - Fault Injection and Service Resilience at Netflix
Embracing Failure - Fault Injection and Service Resilience at NetflixEmbracing Failure - Fault Injection and Service Resilience at Netflix
Embracing Failure - Fault Injection and Service Resilience at Netflix
 
Is accurate system-level power measurement challenging? Check this out!
Is accurate system-level power measurement challenging? Check this out!Is accurate system-level power measurement challenging? Check this out!
Is accurate system-level power measurement challenging? Check this out!
 
NCPRE at IIT Bombay : Tech Commercialization
NCPRE at IIT Bombay : Tech CommercializationNCPRE at IIT Bombay : Tech Commercialization
NCPRE at IIT Bombay : Tech Commercialization
 

Recently uploaded

Recently uploaded (20)

DEMONSTRATION LESSON IN ENGLISH 4 MATATAG CURRICULUM
DEMONSTRATION LESSON IN ENGLISH 4 MATATAG CURRICULUMDEMONSTRATION LESSON IN ENGLISH 4 MATATAG CURRICULUM
DEMONSTRATION LESSON IN ENGLISH 4 MATATAG CURRICULUM
 
ANTI PARKISON DRUGS.pptx
ANTI         PARKISON          DRUGS.pptxANTI         PARKISON          DRUGS.pptx
ANTI PARKISON DRUGS.pptx
 
Spring gala 2024 photo slideshow - Celebrating School-Community Partnerships
Spring gala 2024 photo slideshow - Celebrating School-Community PartnershipsSpring gala 2024 photo slideshow - Celebrating School-Community Partnerships
Spring gala 2024 photo slideshow - Celebrating School-Community Partnerships
 
VAMOS CUIDAR DO NOSSO PLANETA! .
VAMOS CUIDAR DO NOSSO PLANETA!                    .VAMOS CUIDAR DO NOSSO PLANETA!                    .
VAMOS CUIDAR DO NOSSO PLANETA! .
 
Book Review of Run For Your Life Powerpoint
Book Review of Run For Your Life PowerpointBook Review of Run For Your Life Powerpoint
Book Review of Run For Your Life Powerpoint
 
Major project report on Tata Motors and its marketing strategies
Major project report on Tata Motors and its marketing strategiesMajor project report on Tata Motors and its marketing strategies
Major project report on Tata Motors and its marketing strategies
 
TỔNG HỢP HƠN 100 ĐỀ THI THỬ TỐT NGHIỆP THPT TOÁN 2024 - TỪ CÁC TRƯỜNG, TRƯỜNG...
TỔNG HỢP HƠN 100 ĐỀ THI THỬ TỐT NGHIỆP THPT TOÁN 2024 - TỪ CÁC TRƯỜNG, TRƯỜNG...TỔNG HỢP HƠN 100 ĐỀ THI THỬ TỐT NGHIỆP THPT TOÁN 2024 - TỪ CÁC TRƯỜNG, TRƯỜNG...
TỔNG HỢP HƠN 100 ĐỀ THI THỬ TỐT NGHIỆP THPT TOÁN 2024 - TỪ CÁC TRƯỜNG, TRƯỜNG...
 
How To Create Editable Tree View in Odoo 17
How To Create Editable Tree View in Odoo 17How To Create Editable Tree View in Odoo 17
How To Create Editable Tree View in Odoo 17
 
When Quality Assurance Meets Innovation in Higher Education - Report launch w...
When Quality Assurance Meets Innovation in Higher Education - Report launch w...When Quality Assurance Meets Innovation in Higher Education - Report launch w...
When Quality Assurance Meets Innovation in Higher Education - Report launch w...
 
Basic Civil Engineering notes on Transportation Engineering & Modes of Transport
Basic Civil Engineering notes on Transportation Engineering & Modes of TransportBasic Civil Engineering notes on Transportation Engineering & Modes of Transport
Basic Civil Engineering notes on Transportation Engineering & Modes of Transport
 
Observing-Correct-Grammar-in-Making-Definitions.pptx
Observing-Correct-Grammar-in-Making-Definitions.pptxObserving-Correct-Grammar-in-Making-Definitions.pptx
Observing-Correct-Grammar-in-Making-Definitions.pptx
 
Đề tieng anh thpt 2024 danh cho cac ban hoc sinh
Đề tieng anh thpt 2024 danh cho cac ban hoc sinhĐề tieng anh thpt 2024 danh cho cac ban hoc sinh
Đề tieng anh thpt 2024 danh cho cac ban hoc sinh
 
ESSENTIAL of (CS/IT/IS) class 07 (Networks)
ESSENTIAL of (CS/IT/IS) class 07 (Networks)ESSENTIAL of (CS/IT/IS) class 07 (Networks)
ESSENTIAL of (CS/IT/IS) class 07 (Networks)
 
PSYPACT- Practicing Over State Lines May 2024.pptx
PSYPACT- Practicing Over State Lines May 2024.pptxPSYPACT- Practicing Over State Lines May 2024.pptx
PSYPACT- Practicing Over State Lines May 2024.pptx
 
Including Mental Health Support in Project Delivery, 14 May.pdf
Including Mental Health Support in Project Delivery, 14 May.pdfIncluding Mental Health Support in Project Delivery, 14 May.pdf
Including Mental Health Support in Project Delivery, 14 May.pdf
 
Mattingly "AI and Prompt Design: LLMs with NER"
Mattingly "AI and Prompt Design: LLMs with NER"Mattingly "AI and Prompt Design: LLMs with NER"
Mattingly "AI and Prompt Design: LLMs with NER"
 
OS-operating systems- ch05 (CPU Scheduling) ...
OS-operating systems- ch05 (CPU Scheduling) ...OS-operating systems- ch05 (CPU Scheduling) ...
OS-operating systems- ch05 (CPU Scheduling) ...
 
Stl Algorithms in C++ jjjjjjjjjjjjjjjjjj
Stl Algorithms in C++ jjjjjjjjjjjjjjjjjjStl Algorithms in C++ jjjjjjjjjjjjjjjjjj
Stl Algorithms in C++ jjjjjjjjjjjjjjjjjj
 
An Overview of the Odoo 17 Knowledge App
An Overview of the Odoo 17 Knowledge AppAn Overview of the Odoo 17 Knowledge App
An Overview of the Odoo 17 Knowledge App
 
FICTIONAL SALESMAN/SALESMAN SNSW 2024.pdf
FICTIONAL SALESMAN/SALESMAN SNSW 2024.pdfFICTIONAL SALESMAN/SALESMAN SNSW 2024.pdf
FICTIONAL SALESMAN/SALESMAN SNSW 2024.pdf
 

Scalable Pipelines