Driving 10X Efficiency In Testing Hadoop Applications
Presenters: Swamynathan S & Karan Kumar
Company: InMobi Technology Services
Key Takeaways on Strider
• Test Automation Framework
• Built on Falcon APIs, Oozie, Hadoop
• Sanity, Functional, Regression Testing
• Bootstrapping
• Data type handling using JSONs
• Easy test creation
• Extensible
InMobi and Data
• InMobi - Ad Network
• Handles 1B users
• Crunches 5 TB of data per day, and it's growing!
• Data pipeline orchestration - Apache Falcon
Falcon Flow
An Example of pipelines
(Diagram nodes: Feed 1, Feed 2, Process 1, Feed 3, Feed 4, Feed 5, Process 2, Feed 6, Process 3, Feed 7)
Process Addition
(Diagram nodes: Feed 1, Feed 2, Process 1, Feed 3, Feed 4, Feed 5, Process 2, Feed 6, Process 3, Feed 7, Feed x, Feed y, Process P, Feed z)
Process Delete
(Diagram nodes: Feed 1, Feed 2, Process 1, Feed 3, Process 3, Feed 7, Feed x, Feed y, Process P, Feed z)
Feed Add/Delete
(Diagram nodes: Feed 1, Feed 2, Process 1, Feed 3, Process 3, Feed 7, Feed x, Process P, Feed m, Feed z)
Feeds Update
(Diagram nodes: Feed 1, Feed 2, Process 1, Feed 3’, Process 3, Feed 7, Feed x, Process P, Feed m, Feed z)
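The pipeline slides above show processes and feeds being added, deleted and updated, and each such change raises the question of which downstream processes need retesting. The following is a minimal sketch of that idea, not Strider or Falcon code. The wiring in `PIPELINE` is assumed for illustration (the arrows did not survive in the slides), and `affected_processes` is a hypothetical helper.

```python
from collections import deque

# Hypothetical wiring: process name -> (input feeds, output feeds).
# The real edges are not recoverable from the slides.
PIPELINE = {
    "Process 1": ({"Feed 1", "Feed 2"}, {"Feed 3"}),
    "Process 2": ({"Feed 4", "Feed 5"}, {"Feed 6"}),
    "Process 3": ({"Feed 3", "Feed 6"}, {"Feed 7"}),
}


def affected_processes(pipeline, changed_feed):
    """Return processes that consume the changed feed, directly or transitively."""
    affected = []
    seen_feeds = {changed_feed}
    queue = deque([changed_feed])
    while queue:
        feed = queue.popleft()
        for name, (inputs, outputs) in sorted(pipeline.items()):
            if feed in inputs and name not in affected:
                affected.append(name)
                # Outputs of an affected process may feed further processes.
                for out in sorted(outputs):
                    if out not in seen_feeds:
                        seen_feeds.add(out)
                        queue.append(out)
    return affected
```

Under this assumed wiring, an update to Feed 3 touches only Process 3, while an update to Feed 1 touches Process 1 and then Process 3.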
Challenges solved resulting in 10X efficiency
Challenge | Pre-Strider Way | Strider Way
Validate Distribution | Individual Script | Automated and Seamless
Input Creation / Versioning | Non-scalable and redundant | Standardised & Versionable
Validation | Non-scalable Pig scripts | Generic JSON Validation
Host setup | Manual | Automated
Regression | Expensive | Inexpensive
Random and Prod Data | Manual | Automated and Evolving
Parallelization | Not straightforward | Yes
End to End test | No | Yes
CI | Not efficient | Yes
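The "Generic JSON Validation" entry above can be illustrated with a short sketch. It assumes that job output and the golden set are both available as lists of JSON records and that record order does not matter. The function names are hypothetical and this is not Strider's actual validator.

```python
import json
from collections import Counter


def canonical(record):
    """Serialize a record with sorted keys so equal records compare equal."""
    return json.dumps(record, sort_keys=True)


def validate(actual_records, golden_records):
    """Compare records as multisets; return (missing, unexpected) lists."""
    actual = Counter(canonical(r) for r in actual_records)
    golden = Counter(canonical(r) for r in golden_records)
    # Counter subtraction keeps only positive counts, so duplicates are handled.
    missing = [json.loads(s) for s in (golden - actual).elements()]
    unexpected = [json.loads(s) for s in (actual - golden).elements()]
    return missing, unexpected
```

A test passes when both returned lists are empty. Because the comparison works on generic records, the same validator can be reused across jobs, in contrast to a per-job script.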
Test Flow
(Diagram components: Environment Provisioner, Artifact Deployer, Input Uploading, Scheduler, Validator, EC2, Test Machine, Sampled Prod Data, JSON Adapter)
Environment Provisioning? Functional/Regression Test? Input Data Handling? Validation?
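The test flow above can be read as a chain of stages. The sketch below assumes the stages run in the order they appear on the slide and that a failing stage stops the run. The stage functions are placeholder stubs and `run_flow` is a hypothetical helper, not Strider's API.

```python
def run_flow(stages, context):
    """Run (name, function) stages in order; stop at the first failure."""
    completed = []
    for name, stage in stages:
        try:
            stage(context)
        except Exception as exc:
            return completed, f"{name} failed: {exc}"
        completed.append(name)
    return completed, None


# Placeholder stages; real ones would talk to EC2, HDFS, Falcon and Oozie.
def provision(ctx):
    ctx["host"] = "test-machine"


def deploy(ctx):
    ctx["artifact"] = "deployed"


def upload(ctx):
    ctx["input"] = ["record"]


def schedule(ctx):
    ctx["output"] = list(ctx["input"])


def check(ctx):
    if ctx["output"] != ctx["input"]:
        raise AssertionError("output differs from expected")


STAGES = [
    ("environment provisioner", provision),
    ("artifact deployer", deploy),
    ("input uploading", upload),
    ("scheduler", schedule),
    ("validator", check),
]
```

Calling `run_flow(STAGES, {})` returns the list of completed stage names and `None` when every stage succeeds, or the names completed so far and an error message otherwise.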
Functional/Regression Test Creation
(Diagram components: Suite (testng.xml), Test Setup, Test Executor, Validator, HDFS, Falcon, Oozie)
(Diagram labels: Tester’s Role, Strider’s Role, JSON to Input, Output to JSON, Input, GoldSet, Test 1, Test 2)
Example - TestNG.xml
Sample JSONs
Input JSON Output JSON
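The sample JSON images are not preserved here. As a stand-in, the following sketch illustrates the "JSON to Input" and "Output to JSON" conversions under stated assumptions: the on-disk format is taken to be tab-separated text, and the `SCHEMA` columns and both function names are hypothetical rather than taken from Strider.

```python
# Hypothetical schema: (column name, Python type) in on-disk column order.
SCHEMA = [("user_id", str), ("impressions", int), ("revenue", float)]


def json_to_input(records, schema=SCHEMA):
    """Render JSON records as tab-separated lines in schema column order."""
    return ["\t".join(str(rec[name]) for name, _ in schema) for rec in records]


def output_to_json(lines, schema=SCHEMA):
    """Parse tab-separated lines back into typed, JSON-ready records."""
    records = []
    for line in lines:
        fields = line.rstrip("\n").split("\t")
        records.append(
            {name: cast(value) for (name, cast), value in zip(schema, fields)}
        )
    return records
```

Keeping test inputs and golden sets in JSON means the tester writes readable records, while a converter of this kind handles the job-specific storage format.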
10X Efficiency - For 100 Test Samples
(Chart: hours per primary area of optimization, Pre-Strider vs. Strider)
Extensibility
• Strider can be extended to:
• Create End-to-End tests
• Create input and output converters
• Add custom executors
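One common way to support custom executors is a name-based registry. The sketch below shows that pattern only. The slides do not describe Strider's actual extension mechanism, so `EXECUTORS`, `executor` and `run` are hypothetical names.

```python
EXECUTORS = {}


def executor(name):
    """Decorator that registers a callable under a name (hypothetical API)."""
    def register(func):
        EXECUTORS[name] = func
        return func
    return register


@executor("echo")
def echo_executor(test_input):
    """Trivial custom executor that returns its input unchanged."""
    return test_input


def run(name, test_input):
    """Look up a registered executor by name and run it on the input."""
    if name not in EXECUTORS:
        raise KeyError(f"no executor registered for {name!r}")
    return EXECUTORS[name](test_input)
```

With a registry like this, a test definition can refer to an executor by name, and new executors can be added without changing the code that runs the tests.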
Road Map
• Enhanced Data Generation
• Performance testing and metrics
• Automated way to generate golden set
Summary
• Test Automation Framework
• Built on Falcon APIs, Oozie, Hadoop
• Sanity, Functional and Regression Testing
• Bootstrapping
• Data type handling using JSONs
• Easy test creation
• Extensible

Strider_InMobi_HadoopSummit_2015_Brussels
