SlideShare a Scribd company logo
1 of 16
Download to read offline
How	
  MediaMath	
  Solved	
  a	
  Cri1cal	
  Repor1ng	
  Problem	
  with	
  Impala	
  
©2014	
  MEDIAMATH	
  INC.	
  	
  1	
  
The	
  Cloudera	
  Sessions	
  
June	
  18,	
  2014	
  
Ram	
  Narayanan,	
  Senior	
  Director	
  of	
  Database	
  Architecture	
  &	
  Opera1ons	
  
Digital	
  Marke1ng	
  Pioneer	
  
•  Founded	
  in	
  2007	
  
•  Global	
  technology	
  company	
  
•  Invented	
  first	
  Demand	
  Side	
  PlaJorm	
  (DSP)	
  for	
  online	
  ads	
  	
  
•  Conducts	
  online	
  adverNsing	
  through	
  real-­‐Nme	
  bidding	
  &	
  
programmaNc	
  buying	
  
	
  
About	
  MediaMath	
  
©2014	
  MEDIAMATH	
  INC.	
  	
  2	
  
About	
  MediaMath	
  
Overview	
  of	
  Real-­‐Time	
  Bidding	
  
Real-­‐1me	
  
Auc1on	
  
<30	
  ms	
  
Adver1ser	
  
(Client)	
  
User	
  
	
  	
  ad	
  
	
  	
  
www.cnn.com	
  
ad	
  
About	
  MediaMath	
  
Overview	
  or	
  Real-­‐Time	
  Bidding	
  
User	
  
www.cnn.com	
  
	
  	
  
Purchased!	
  
	
  	
  ad	
  
www.shoes.com	
  
	
  	
   $$	
   Event	
  Logs	
  
•  Ad	
  OpportuniNes:	
  80-­‐100	
  billion	
  per	
  day	
  	
  
"  1.2	
  million	
  opportuniNes	
  per	
  second	
  at	
  peak	
  
•  We	
  bid	
  on	
  30-­‐40	
  billion	
  ads	
  per	
  day	
  
•  We	
  serve	
  1-­‐2	
  billion	
  ads	
  per	
  day	
  
•  15-­‐20	
  million	
  events	
  (click,	
  sale,	
  online	
  sign-­‐up)	
  per	
  hour	
  
•  2	
  TB	
  of	
  data	
  daily	
  (compressed)	
  
"  Note:	
  This	
  only	
  counts	
  our	
  wins.	
  If	
  we	
  count	
  losses,	
  we	
  easily	
  reach	
  PBs.	
  
About	
  MediaMath	
  
Which	
  ad	
  (impression)	
  led	
  to	
  which	
  ac1on,	
  like	
  a	
  sale	
  or	
  online	
  signup	
  
•  35-­‐40	
  billion	
  recorded	
  impressions	
  served	
  every	
  30	
  days	
  
•  15-­‐20	
  million	
  events	
  per	
  hour	
  
•  Need	
  to	
  join	
  events	
  with	
  impressions	
  2x	
  per	
  hour	
  
à	
  Find	
  matching	
  records	
  
à	
  Perform	
  complex	
  sequencing	
  &	
  allocaNon	
  logic	
  
à	
  Run	
  aggregaNons	
  on	
  results	
  
à	
  Send	
  data	
  to	
  data	
  marts	
  
	
  
à	
  Provide	
  hourly	
  reporNng	
  to	
  clients	
  
	
  
The	
  Repor1ng	
  AZribu1on	
  Problem	
  
©2014	
  MEDIAMATH	
  INC.	
  	
  6	
  
Incumbent	
  Architecture:	
  	
  
Appliance-­‐based	
  (Netezza)	
  	
  
Cost:	
  Expensive	
  -­‐	
  
Scale:	
  Non-­‐incremental	
  scalability	
  -­‐	
  
Performance:	
  ReporNng	
  lag	
  -­‐	
  
ReporNng	
  inflexibility	
  
Product	
  feature	
  constrained	
  -­‐	
  
-­‐	
  
To	
  build	
  a	
  data	
  warehouse	
  architecture	
  that	
  could	
  
perform	
  hourly	
  repor1ng	
  of	
  aZribu1on	
  data	
  at	
  scale	
  that	
  
is	
  affordable	
  and	
  easy	
  to	
  manage.	
  	
  
Our	
  goal	
  
" Scalability	
  
Handle	
  10-­‐50x	
  scale	
  
" Capability	
  	
  
Ability	
  to	
  perform	
  big	
  data	
  joins	
  at	
  scale	
  
" Performance	
  
Complete	
  aggregaNon	
  in	
  <60	
  minutes	
  
" Cost	
  effec1ve	
  
Cheaper	
  than	
  appliance-­‐based	
  soluNons	
  
	
  
	
  
©2014	
  MEDIAMATH	
  INC.	
  	
  9	
  
EvaluaNon	
  Criteria:	
  
" Hive	
  
Run	
  Nme:	
  Took	
  5-­‐6	
  hours	
  to	
  complete	
  
Stability:	
  High	
  	
  
" Pig	
  
Run	
  Nme:	
  Took	
  4-­‐5	
  hours	
  to	
  complete	
  
Stability:	
  High	
  
" Impala	
  Beta	
  (0.6)	
  
Run	
  Nme:	
  Took	
  2-­‐3	
  hours	
  to	
  complete	
  
Stability:	
  Low	
  
	
  
	
  
	
  
Evaluated	
  OpNons:	
  Round	
  1	
  
" Hive:	
  Post-­‐Tuning	
  (map	
  joins,	
  bucke1ng,	
  split	
  size,	
  etc.)	
  
Run	
  Nme:	
  Took	
  2-­‐3	
  hours	
  to	
  complete	
  
Stability:	
  High	
  	
  
" Impala	
  GPA	
  (1.0)	
  (L0	
  compression,	
  slicing,	
  tuning,	
  hw	
  
upgrade)	
  
Run	
  Nme:	
  Took	
  30	
  minutes	
  to	
  complete	
  
Stability:	
  High	
  
	
  
	
  
	
  
Evaluated	
  OpNons:	
  Round	
  2	
  
Data	
  Warehouse	
  Architecture	
  2011	
  
Bid	
  Logs	
  
Pixel	
  Logs	
  
Metadata	
  
Repor1ng	
  	
  
Data	
  
Marts	
  	
  
Repor1ng	
  	
  
Data	
  
Marts	
  	
  
Repor1ng	
  	
  
Data	
  
Marts	
  	
  
Repor1ng	
  	
  
Data	
  
Marts	
  	
  
	
  
	
  	
  	
  
	
  
	
  
	
  
	
  
	
  
	
  
	
  
	
  
	
  
	
  
	
  
	
  
	
  
ELT	
  
A	
  
T	
  
T	
  
R	
  
I	
  
B	
  
U	
  
T	
  
I	
  
O	
  
N	
  
Repor
ts	
  
Aggr
ega1
on	
  
Netezza	
  
2011	
  
Data	
  Warehouse	
  Architecture	
  2011	
  
Bid	
  Logs	
  
Pixel	
  Logs	
  
Metadata	
  
Repor1ng	
  	
  
Data	
  
Marts	
  	
  
Repor1ng	
  	
  
Data	
  
Marts	
  	
  
Repor1ng	
  	
  
Data	
  
Marts	
  	
  
Repor1ng	
  	
  
Data	
  
Marts	
  	
  
	
  
	
  	
  	
  
	
  
	
  
	
  
	
  
	
  
	
  
	
  
	
  
	
  
	
  
	
  
	
  
	
  
ELT	
  
A	
  
T	
  
T	
  
R	
  
I	
  
B	
  
U	
  
T	
  
I	
  
O	
  
N	
  
Repor
ts	
  
Aggr
ega1
on	
  
	
  	
  	
  
	
  
	
  
	
  
	
  
	
  
	
  
	
  
	
  
	
  
	
  
	
  
	
  
	
  
Reports	
  Aggrega1on	
  
Netezza	
  Hadoop	
  
2013	
  
•  December	
  2013:	
  Peak	
  season	
  
"  New	
  architecture	
  accommodated	
  2x	
  data	
  volume	
  with	
  unprecedented	
  
scalability	
  &	
  stability	
  
•  Present:	
  We	
  are	
  planning	
  to	
  add	
  more	
  features	
  	
  
"  Considering	
  moving	
  some	
  part	
  of	
  aggregaNon	
  into	
  Hadoop	
  
Proof:	
  	
  
©2014	
  MEDIAMATH	
  INC.	
  	
  14	
  
•  Process	
  ONLY	
  the	
  required	
  data	
  
•  Compress	
  your	
  data	
  
•  “Divide	
  &	
  Conquer”	
  your	
  data	
  (i.e.	
  slice	
  and	
  dice)	
  
Lessons	
  Learned	
  &	
  Best	
  Prac1ces	
  
©2014	
  MEDIAMATH	
  INC.	
  	
  15	
  
THANK	
  YOU	
  

More Related Content

What's hot

Altis Webinar: Use Cases For The Modern Data Platform
Altis Webinar: Use Cases For The Modern Data PlatformAltis Webinar: Use Cases For The Modern Data Platform
Altis Webinar: Use Cases For The Modern Data PlatformAltis Consulting
 
Embracing Cloud Agility to Maximize Flexibility & Performance
Embracing Cloud Agility to Maximize Flexibility & Performance Embracing Cloud Agility to Maximize Flexibility & Performance
Embracing Cloud Agility to Maximize Flexibility & Performance Talend
 
Cloud-Con: Integration & Web APIs
Cloud-Con: Integration & Web APIsCloud-Con: Integration & Web APIs
Cloud-Con: Integration & Web APIsSnapLogic
 
In-Memory Computing Webcast. Market Predictions 2017
In-Memory Computing Webcast. Market Predictions 2017In-Memory Computing Webcast. Market Predictions 2017
In-Memory Computing Webcast. Market Predictions 2017SingleStore
 
Netflix Teradata partner's presentation
Netflix Teradata partner's presentationNetflix Teradata partner's presentation
Netflix Teradata partner's presentationVishal Jain
 
The Real-Time CDO and the Cloud-Forward Path to Predictive Analytics
The Real-Time CDO and the Cloud-Forward Path to Predictive AnalyticsThe Real-Time CDO and the Cloud-Forward Path to Predictive Analytics
The Real-Time CDO and the Cloud-Forward Path to Predictive AnalyticsSingleStore
 
Lightning-Fast, Interactive Business Intelligence Performance with MicroStrat...
Lightning-Fast, Interactive Business Intelligence Performance with MicroStrat...Lightning-Fast, Interactive Business Intelligence Performance with MicroStrat...
Lightning-Fast, Interactive Business Intelligence Performance with MicroStrat...Tyler Wishnoff
 
Tapjoy: Building a Real-Time Data Science Service for Mobile Advertising
Tapjoy: Building a Real-Time Data Science Service for Mobile AdvertisingTapjoy: Building a Real-Time Data Science Service for Mobile Advertising
Tapjoy: Building a Real-Time Data Science Service for Mobile AdvertisingSingleStore
 
#GeodeSummit - Using Geode as Operational Data Services for Real Time Mobile ...
#GeodeSummit - Using Geode as Operational Data Services for Real Time Mobile ...#GeodeSummit - Using Geode as Operational Data Services for Real Time Mobile ...
#GeodeSummit - Using Geode as Operational Data Services for Real Time Mobile ...PivotalOpenSourceHub
 
WJAX 2013 Slides online: Big Data beyond Apache Hadoop - How to integrate ALL...
WJAX 2013 Slides online: Big Data beyond Apache Hadoop - How to integrate ALL...WJAX 2013 Slides online: Big Data beyond Apache Hadoop - How to integrate ALL...
WJAX 2013 Slides online: Big Data beyond Apache Hadoop - How to integrate ALL...Kai Wähner
 
Data Mesh in Practice: How Europe’s Leading Online Platform for Fashion Goes ...
Data Mesh in Practice: How Europe’s Leading Online Platform for Fashion Goes ...Data Mesh in Practice: How Europe’s Leading Online Platform for Fashion Goes ...
Data Mesh in Practice: How Europe’s Leading Online Platform for Fashion Goes ...Databricks
 
情報処理学会 Exciting Coding! Treasure Data
情報処理学会 Exciting Coding! Treasure Data情報処理学会 Exciting Coding! Treasure Data
情報処理学会 Exciting Coding! Treasure DataTreasure Data, Inc.
 
Delivering real time analytics in 1 click
Delivering real time analytics in 1 clickDelivering real time analytics in 1 click
Delivering real time analytics in 1 clickJean-Michel Franco
 
Database Camp 2016 @ United Nations, NYC - Bob Wiederhold, CEO, Couchbase
Database Camp 2016 @ United Nations, NYC - Bob Wiederhold, CEO, CouchbaseDatabase Camp 2016 @ United Nations, NYC - Bob Wiederhold, CEO, Couchbase
Database Camp 2016 @ United Nations, NYC - Bob Wiederhold, CEO, Couchbase✔ Eric David Benari, PMP
 
Big Data Expo 2015 - Talend Delivering Real Time
Big Data Expo 2015 - Talend Delivering Real TimeBig Data Expo 2015 - Talend Delivering Real Time
Big Data Expo 2015 - Talend Delivering Real TimeBigDataExpo
 
Pipelining the Heroes with Kafka and Graph
Pipelining the Heroes with Kafka and GraphPipelining the Heroes with Kafka and Graph
Pipelining the Heroes with Kafka and Graphconfluent
 
Build a Big Data Warehouse on the Cloud in 30 Minutes
Build a Big Data Warehouse on the Cloud in 30 MinutesBuild a Big Data Warehouse on the Cloud in 30 Minutes
Build a Big Data Warehouse on the Cloud in 30 MinutesCaserta
 
Tableau Conference 2018: Binging on Data - Enabling Analytics at Netflix
Tableau Conference 2018: Binging on Data - Enabling Analytics at NetflixTableau Conference 2018: Binging on Data - Enabling Analytics at Netflix
Tableau Conference 2018: Binging on Data - Enabling Analytics at NetflixBlake Irvine
 
Webinar: 2 Billion Data Points Each Day
Webinar: 2 Billion Data Points Each DayWebinar: 2 Billion Data Points Each Day
Webinar: 2 Billion Data Points Each DayDataStax
 

What's hot (20)

Altis Webinar: Use Cases For The Modern Data Platform
Altis Webinar: Use Cases For The Modern Data PlatformAltis Webinar: Use Cases For The Modern Data Platform
Altis Webinar: Use Cases For The Modern Data Platform
 
Embracing Cloud Agility to Maximize Flexibility & Performance
Embracing Cloud Agility to Maximize Flexibility & Performance Embracing Cloud Agility to Maximize Flexibility & Performance
Embracing Cloud Agility to Maximize Flexibility & Performance
 
Cloud-Con: Integration & Web APIs
Cloud-Con: Integration & Web APIsCloud-Con: Integration & Web APIs
Cloud-Con: Integration & Web APIs
 
In-Memory Computing Webcast. Market Predictions 2017
In-Memory Computing Webcast. Market Predictions 2017In-Memory Computing Webcast. Market Predictions 2017
In-Memory Computing Webcast. Market Predictions 2017
 
Netflix Teradata partner's presentation
Netflix Teradata partner's presentationNetflix Teradata partner's presentation
Netflix Teradata partner's presentation
 
The Real-Time CDO and the Cloud-Forward Path to Predictive Analytics
The Real-Time CDO and the Cloud-Forward Path to Predictive AnalyticsThe Real-Time CDO and the Cloud-Forward Path to Predictive Analytics
The Real-Time CDO and the Cloud-Forward Path to Predictive Analytics
 
Lightning-Fast, Interactive Business Intelligence Performance with MicroStrat...
Lightning-Fast, Interactive Business Intelligence Performance with MicroStrat...Lightning-Fast, Interactive Business Intelligence Performance with MicroStrat...
Lightning-Fast, Interactive Business Intelligence Performance with MicroStrat...
 
The API Lie
The API LieThe API Lie
The API Lie
 
Tapjoy: Building a Real-Time Data Science Service for Mobile Advertising
Tapjoy: Building a Real-Time Data Science Service for Mobile AdvertisingTapjoy: Building a Real-Time Data Science Service for Mobile Advertising
Tapjoy: Building a Real-Time Data Science Service for Mobile Advertising
 
#GeodeSummit - Using Geode as Operational Data Services for Real Time Mobile ...
#GeodeSummit - Using Geode as Operational Data Services for Real Time Mobile ...#GeodeSummit - Using Geode as Operational Data Services for Real Time Mobile ...
#GeodeSummit - Using Geode as Operational Data Services for Real Time Mobile ...
 
WJAX 2013 Slides online: Big Data beyond Apache Hadoop - How to integrate ALL...
WJAX 2013 Slides online: Big Data beyond Apache Hadoop - How to integrate ALL...WJAX 2013 Slides online: Big Data beyond Apache Hadoop - How to integrate ALL...
WJAX 2013 Slides online: Big Data beyond Apache Hadoop - How to integrate ALL...
 
Data Mesh in Practice: How Europe’s Leading Online Platform for Fashion Goes ...
Data Mesh in Practice: How Europe’s Leading Online Platform for Fashion Goes ...Data Mesh in Practice: How Europe’s Leading Online Platform for Fashion Goes ...
Data Mesh in Practice: How Europe’s Leading Online Platform for Fashion Goes ...
 
情報処理学会 Exciting Coding! Treasure Data
情報処理学会 Exciting Coding! Treasure Data情報処理学会 Exciting Coding! Treasure Data
情報処理学会 Exciting Coding! Treasure Data
 
Delivering real time analytics in 1 click
Delivering real time analytics in 1 clickDelivering real time analytics in 1 click
Delivering real time analytics in 1 click
 
Database Camp 2016 @ United Nations, NYC - Bob Wiederhold, CEO, Couchbase
Database Camp 2016 @ United Nations, NYC - Bob Wiederhold, CEO, CouchbaseDatabase Camp 2016 @ United Nations, NYC - Bob Wiederhold, CEO, Couchbase
Database Camp 2016 @ United Nations, NYC - Bob Wiederhold, CEO, Couchbase
 
Big Data Expo 2015 - Talend Delivering Real Time
Big Data Expo 2015 - Talend Delivering Real TimeBig Data Expo 2015 - Talend Delivering Real Time
Big Data Expo 2015 - Talend Delivering Real Time
 
Pipelining the Heroes with Kafka and Graph
Pipelining the Heroes with Kafka and GraphPipelining the Heroes with Kafka and Graph
Pipelining the Heroes with Kafka and Graph
 
Build a Big Data Warehouse on the Cloud in 30 Minutes
Build a Big Data Warehouse on the Cloud in 30 MinutesBuild a Big Data Warehouse on the Cloud in 30 Minutes
Build a Big Data Warehouse on the Cloud in 30 Minutes
 
Tableau Conference 2018: Binging on Data - Enabling Analytics at Netflix
Tableau Conference 2018: Binging on Data - Enabling Analytics at NetflixTableau Conference 2018: Binging on Data - Enabling Analytics at Netflix
Tableau Conference 2018: Binging on Data - Enabling Analytics at Netflix
 
Webinar: 2 Billion Data Points Each Day
Webinar: 2 Billion Data Points Each DayWebinar: 2 Billion Data Points Each Day
Webinar: 2 Billion Data Points Each Day
 

Viewers also liked

Realizing Customer-Centric Marketing with Programmatic Technology
Realizing Customer-Centric Marketing with Programmatic TechnologyRealizing Customer-Centric Marketing with Programmatic Technology
Realizing Customer-Centric Marketing with Programmatic TechnologyMediaMath
 
Programmatic Media Scenario
Programmatic Media ScenarioProgrammatic Media Scenario
Programmatic Media ScenarioMediaMath
 
Realizing Customer-Centric Marketing with Programmatic Technology, Argyle CMO...
Realizing Customer-Centric Marketing with Programmatic Technology, Argyle CMO...Realizing Customer-Centric Marketing with Programmatic Technology, Argyle CMO...
Realizing Customer-Centric Marketing with Programmatic Technology, Argyle CMO...MediaMath
 
'Tis the Season: Leveraging Social Media Ad Campaigns for the Holidays
'Tis the Season: Leveraging Social Media Ad Campaigns for the Holidays'Tis the Season: Leveraging Social Media Ad Campaigns for the Holidays
'Tis the Season: Leveraging Social Media Ad Campaigns for the HolidaysMediaMath
 
Realizing Customer-Centric Marketing with Programmatic Technology
Realizing Customer-Centric Marketing with Programmatic TechnologyRealizing Customer-Centric Marketing with Programmatic Technology
Realizing Customer-Centric Marketing with Programmatic TechnologyMediaMath
 
Organizing for Programmatic Success
Organizing for Programmatic SuccessOrganizing for Programmatic Success
Organizing for Programmatic SuccessMediaMath
 
Argyle CMO ThinkTank Boston
Argyle CMO ThinkTank BostonArgyle CMO ThinkTank Boston
Argyle CMO ThinkTank BostonMediaMath
 
Programmatic Everything: Beyond the Open Auction
Programmatic Everything: Beyond the Open AuctionProgrammatic Everything: Beyond the Open Auction
Programmatic Everything: Beyond the Open AuctionMediaMath
 
An Empowered Entrepreneurial Culture: Multiple CEOs
An Empowered Entrepreneurial Culture: Multiple CEOsAn Empowered Entrepreneurial Culture: Multiple CEOs
An Empowered Entrepreneurial Culture: Multiple CEOsMediaMath
 
O'Reilly Fluent, Web Components Enterprise
O'Reilly Fluent, Web Components EnterpriseO'Reilly Fluent, Web Components Enterprise
O'Reilly Fluent, Web Components EnterpriseMediaMath
 
Realizing Customer Centric Marketing in the Digital Age
Realizing Customer Centric Marketing in the Digital AgeRealizing Customer Centric Marketing in the Digital Age
Realizing Customer Centric Marketing in the Digital AgeMediaMath
 
Drive Business Results with MediaMath Retail
Drive Business Results with MediaMath RetailDrive Business Results with MediaMath Retail
Drive Business Results with MediaMath RetailMediaMath
 
Kirsty Hulse: Audience Segmentation (To Stop Our Content Marketing Being Crap...
Kirsty Hulse: Audience Segmentation (To Stop Our Content Marketing Being Crap...Kirsty Hulse: Audience Segmentation (To Stop Our Content Marketing Being Crap...
Kirsty Hulse: Audience Segmentation (To Stop Our Content Marketing Being Crap...Stipso
 
(ADV403) Dynamic Ad Perf. Reporting w/ Redshift: Data Science, Queries at Sca...
(ADV403) Dynamic Ad Perf. Reporting w/ Redshift: Data Science, Queries at Sca...(ADV403) Dynamic Ad Perf. Reporting w/ Redshift: Data Science, Queries at Sca...
(ADV403) Dynamic Ad Perf. Reporting w/ Redshift: Data Science, Queries at Sca...Amazon Web Services
 
Billions and Billions: Machines, Algorithms, and Growing Business in Programa...
Billions and Billions: Machines, Algorithms, and Growing Business in Programa...Billions and Billions: Machines, Algorithms, and Growing Business in Programa...
Billions and Billions: Machines, Algorithms, and Growing Business in Programa...MediaMath
 
(PFC303) Milliseconds Matter: Design, Deploy, and Operate Your Application fo...
(PFC303) Milliseconds Matter: Design, Deploy, and Operate Your Application fo...(PFC303) Milliseconds Matter: Design, Deploy, and Operate Your Application fo...
(PFC303) Milliseconds Matter: Design, Deploy, and Operate Your Application fo...Amazon Web Services
 
GraphLab: Large-Scale Machine Learning on Graphs (BDT204) | AWS re:Invent 2013
GraphLab: Large-Scale Machine Learning on Graphs (BDT204) | AWS re:Invent 2013GraphLab: Large-Scale Machine Learning on Graphs (BDT204) | AWS re:Invent 2013
GraphLab: Large-Scale Machine Learning on Graphs (BDT204) | AWS re:Invent 2013Amazon Web Services
 
Machine learning with Spark
Machine learning with SparkMachine learning with Spark
Machine learning with SparkKhalid Salama
 

Viewers also liked (20)

Realizing Customer-Centric Marketing with Programmatic Technology
Realizing Customer-Centric Marketing with Programmatic TechnologyRealizing Customer-Centric Marketing with Programmatic Technology
Realizing Customer-Centric Marketing with Programmatic Technology
 
Programmatic Media Scenario
Programmatic Media ScenarioProgrammatic Media Scenario
Programmatic Media Scenario
 
Realizing Customer-Centric Marketing with Programmatic Technology, Argyle CMO...
Realizing Customer-Centric Marketing with Programmatic Technology, Argyle CMO...Realizing Customer-Centric Marketing with Programmatic Technology, Argyle CMO...
Realizing Customer-Centric Marketing with Programmatic Technology, Argyle CMO...
 
'Tis the Season: Leveraging Social Media Ad Campaigns for the Holidays
'Tis the Season: Leveraging Social Media Ad Campaigns for the Holidays'Tis the Season: Leveraging Social Media Ad Campaigns for the Holidays
'Tis the Season: Leveraging Social Media Ad Campaigns for the Holidays
 
Realizing Customer-Centric Marketing with Programmatic Technology
Realizing Customer-Centric Marketing with Programmatic TechnologyRealizing Customer-Centric Marketing with Programmatic Technology
Realizing Customer-Centric Marketing with Programmatic Technology
 
Organizing for Programmatic Success
Organizing for Programmatic SuccessOrganizing for Programmatic Success
Organizing for Programmatic Success
 
Argyle CMO ThinkTank Boston
Argyle CMO ThinkTank BostonArgyle CMO ThinkTank Boston
Argyle CMO ThinkTank Boston
 
Programmatic Everything: Beyond the Open Auction
Programmatic Everything: Beyond the Open AuctionProgrammatic Everything: Beyond the Open Auction
Programmatic Everything: Beyond the Open Auction
 
An Empowered Entrepreneurial Culture: Multiple CEOs
An Empowered Entrepreneurial Culture: Multiple CEOsAn Empowered Entrepreneurial Culture: Multiple CEOs
An Empowered Entrepreneurial Culture: Multiple CEOs
 
O'Reilly Fluent, Web Components Enterprise
O'Reilly Fluent, Web Components EnterpriseO'Reilly Fluent, Web Components Enterprise
O'Reilly Fluent, Web Components Enterprise
 
Realizing Customer Centric Marketing in the Digital Age
Realizing Customer Centric Marketing in the Digital AgeRealizing Customer Centric Marketing in the Digital Age
Realizing Customer Centric Marketing in the Digital Age
 
Drive Business Results with MediaMath Retail
Drive Business Results with MediaMath RetailDrive Business Results with MediaMath Retail
Drive Business Results with MediaMath Retail
 
Kirsty Hulse: Audience Segmentation (To Stop Our Content Marketing Being Crap...
Kirsty Hulse: Audience Segmentation (To Stop Our Content Marketing Being Crap...Kirsty Hulse: Audience Segmentation (To Stop Our Content Marketing Being Crap...
Kirsty Hulse: Audience Segmentation (To Stop Our Content Marketing Being Crap...
 
(ADV403) Dynamic Ad Perf. Reporting w/ Redshift: Data Science, Queries at Sca...
(ADV403) Dynamic Ad Perf. Reporting w/ Redshift: Data Science, Queries at Sca...(ADV403) Dynamic Ad Perf. Reporting w/ Redshift: Data Science, Queries at Sca...
(ADV403) Dynamic Ad Perf. Reporting w/ Redshift: Data Science, Queries at Sca...
 
Billions and Billions: Machines, Algorithms, and Growing Business in Programa...
Billions and Billions: Machines, Algorithms, and Growing Business in Programa...Billions and Billions: Machines, Algorithms, and Growing Business in Programa...
Billions and Billions: Machines, Algorithms, and Growing Business in Programa...
 
(PFC303) Milliseconds Matter: Design, Deploy, and Operate Your Application fo...
(PFC303) Milliseconds Matter: Design, Deploy, and Operate Your Application fo...(PFC303) Milliseconds Matter: Design, Deploy, and Operate Your Application fo...
(PFC303) Milliseconds Matter: Design, Deploy, and Operate Your Application fo...
 
GraphLab: Large-Scale Machine Learning on Graphs (BDT204) | AWS re:Invent 2013
GraphLab: Large-Scale Machine Learning on Graphs (BDT204) | AWS re:Invent 2013GraphLab: Large-Scale Machine Learning on Graphs (BDT204) | AWS re:Invent 2013
GraphLab: Large-Scale Machine Learning on Graphs (BDT204) | AWS re:Invent 2013
 
Graph Analytics
Graph AnalyticsGraph Analytics
Graph Analytics
 
Machine learning with Spark
Machine learning with SparkMachine learning with Spark
Machine learning with Spark
 
Amazon Machine Learning
Amazon Machine LearningAmazon Machine Learning
Amazon Machine Learning
 

Similar to How MediaMath Built Faster, Scalable Attribution Reporting with Hadoop-Impala

Connecta Event: Big Query och dataanalys med Google Cloud Platform
Connecta Event: Big Query och dataanalys med Google Cloud PlatformConnecta Event: Big Query och dataanalys med Google Cloud Platform
Connecta Event: Big Query och dataanalys med Google Cloud PlatformConnectaDigital
 
Webinar with SnagAJob, HP Vertica and Looker - Data at the speed of busines s...
Webinar with SnagAJob, HP Vertica and Looker - Data at the speed of busines s...Webinar with SnagAJob, HP Vertica and Looker - Data at the speed of busines s...
Webinar with SnagAJob, HP Vertica and Looker - Data at the speed of busines s...Looker
 
Big Data Hadoop Briefing Hosted by Cisco, WWT and MapR: MapR Overview Present...
Big Data Hadoop Briefing Hosted by Cisco, WWT and MapR: MapR Overview Present...Big Data Hadoop Briefing Hosted by Cisco, WWT and MapR: MapR Overview Present...
Big Data Hadoop Briefing Hosted by Cisco, WWT and MapR: MapR Overview Present...ervogler
 
AWS Initiate Day Manchester 2019 – AWS Big Data Meets AI
AWS Initiate Day Manchester 2019 – AWS Big Data Meets AIAWS Initiate Day Manchester 2019 – AWS Big Data Meets AI
AWS Initiate Day Manchester 2019 – AWS Big Data Meets AIAmazon Web Services
 
See who is using MemSQL
See who is using MemSQLSee who is using MemSQL
See who is using MemSQLjenjermain
 
AWS Initiate Day Dublin 2019 – Big Data Meets AI
AWS Initiate Day Dublin 2019 – Big Data Meets AIAWS Initiate Day Dublin 2019 – Big Data Meets AI
AWS Initiate Day Dublin 2019 – Big Data Meets AIAmazon Web Services
 
Digital Business Transformation in the Streaming Era
Digital Business Transformation in the Streaming EraDigital Business Transformation in the Streaming Era
Digital Business Transformation in the Streaming EraAttunity
 
Driving the On-Demand Economy with Predictive Analytics
Driving the On-Demand Economy with Predictive AnalyticsDriving the On-Demand Economy with Predictive Analytics
Driving the On-Demand Economy with Predictive AnalyticsSingleStore
 
Stream Processing as Game Changer for Big Data and Internet of Things by Kai ...
Stream Processing as Game Changer for Big Data and Internet of Things by Kai ...Stream Processing as Game Changer for Big Data and Internet of Things by Kai ...
Stream Processing as Game Changer for Big Data and Internet of Things by Kai ...Big Data Spain
 
Streaming Analytics Comparison of Open Source Frameworks, Products, Cloud Ser...
Streaming Analytics Comparison of Open Source Frameworks, Products, Cloud Ser...Streaming Analytics Comparison of Open Source Frameworks, Products, Cloud Ser...
Streaming Analytics Comparison of Open Source Frameworks, Products, Cloud Ser...Kai Wähner
 
Webinar: iPaaS in the Enterprise - What to Look for in a Cloud Integration Pl...
Webinar: iPaaS in the Enterprise - What to Look for in a Cloud Integration Pl...Webinar: iPaaS in the Enterprise - What to Look for in a Cloud Integration Pl...
Webinar: iPaaS in the Enterprise - What to Look for in a Cloud Integration Pl...SnapLogic
 
Big Data Everywhere Chicago: Platfora - Practices for Customer Analytics on H...
Big Data Everywhere Chicago: Platfora - Practices for Customer Analytics on H...Big Data Everywhere Chicago: Platfora - Practices for Customer Analytics on H...
Big Data Everywhere Chicago: Platfora - Practices for Customer Analytics on H...BigDataEverywhere
 
Apache Kylin and Use Cases - 2018 Big Data Spain
Apache Kylin and Use Cases - 2018 Big Data SpainApache Kylin and Use Cases - 2018 Big Data Spain
Apache Kylin and Use Cases - 2018 Big Data SpainLuke Han
 
Driving the On-Demand Economy with Predictive Analytics
Driving the On-Demand Economy with Predictive AnalyticsDriving the On-Demand Economy with Predictive Analytics
Driving the On-Demand Economy with Predictive AnalyticsSingleStore
 
Hadoop and the Relational Database: The Best of Both Worlds
Hadoop and the Relational Database: The Best of Both WorldsHadoop and the Relational Database: The Best of Both Worlds
Hadoop and the Relational Database: The Best of Both WorldsInside Analysis
 
CDS Overview (May 2015)
CDS Overview (May 2015)CDS Overview (May 2015)
CDS Overview (May 2015)Karim Lalji
 
Top Trends in Building Data Lakes for Machine Learning and AI
Top Trends in Building Data Lakes for Machine Learning and AI Top Trends in Building Data Lakes for Machine Learning and AI
Top Trends in Building Data Lakes for Machine Learning and AI Holden Ackerman
 
Data & Analytics - Session 1 - Big Data Analytics
Data & Analytics - Session 1 -  Big Data AnalyticsData & Analytics - Session 1 -  Big Data Analytics
Data & Analytics - Session 1 - Big Data AnalyticsAmazon Web Services
 
Power to the People: A Stack to Empower Every User to Make Data-Driven Decisions
Power to the People: A Stack to Empower Every User to Make Data-Driven DecisionsPower to the People: A Stack to Empower Every User to Make Data-Driven Decisions
Power to the People: A Stack to Empower Every User to Make Data-Driven DecisionsLooker
 

Similar to How MediaMath Built Faster, Scalable Attribution Reporting with Hadoop-Impala (20)

Connecta Event: Big Query och dataanalys med Google Cloud Platform
Connecta Event: Big Query och dataanalys med Google Cloud PlatformConnecta Event: Big Query och dataanalys med Google Cloud Platform
Connecta Event: Big Query och dataanalys med Google Cloud Platform
 
Webinar with SnagAJob, HP Vertica and Looker - Data at the speed of busines s...
Webinar with SnagAJob, HP Vertica and Looker - Data at the speed of busines s...Webinar with SnagAJob, HP Vertica and Looker - Data at the speed of busines s...
Webinar with SnagAJob, HP Vertica and Looker - Data at the speed of busines s...
 
Big Data Hadoop Briefing Hosted by Cisco, WWT and MapR: MapR Overview Present...
Big Data Hadoop Briefing Hosted by Cisco, WWT and MapR: MapR Overview Present...Big Data Hadoop Briefing Hosted by Cisco, WWT and MapR: MapR Overview Present...
Big Data Hadoop Briefing Hosted by Cisco, WWT and MapR: MapR Overview Present...
 
Hadoop In The Real World
Hadoop In The Real WorldHadoop In The Real World
Hadoop In The Real World
 
AWS Initiate Day Manchester 2019 – AWS Big Data Meets AI
AWS Initiate Day Manchester 2019 – AWS Big Data Meets AIAWS Initiate Day Manchester 2019 – AWS Big Data Meets AI
AWS Initiate Day Manchester 2019 – AWS Big Data Meets AI
 
See who is using MemSQL
See who is using MemSQLSee who is using MemSQL
See who is using MemSQL
 
AWS Initiate Day Dublin 2019 – Big Data Meets AI
AWS Initiate Day Dublin 2019 – Big Data Meets AIAWS Initiate Day Dublin 2019 – Big Data Meets AI
AWS Initiate Day Dublin 2019 – Big Data Meets AI
 
Digital Business Transformation in the Streaming Era
Digital Business Transformation in the Streaming EraDigital Business Transformation in the Streaming Era
Digital Business Transformation in the Streaming Era
 
Driving the On-Demand Economy with Predictive Analytics
Driving the On-Demand Economy with Predictive AnalyticsDriving the On-Demand Economy with Predictive Analytics
Driving the On-Demand Economy with Predictive Analytics
 
Stream Processing as Game Changer for Big Data and Internet of Things by Kai ...
Stream Processing as Game Changer for Big Data and Internet of Things by Kai ...Stream Processing as Game Changer for Big Data and Internet of Things by Kai ...
Stream Processing as Game Changer for Big Data and Internet of Things by Kai ...
 
Streaming Analytics Comparison of Open Source Frameworks, Products, Cloud Ser...
Streaming Analytics Comparison of Open Source Frameworks, Products, Cloud Ser...Streaming Analytics Comparison of Open Source Frameworks, Products, Cloud Ser...
Streaming Analytics Comparison of Open Source Frameworks, Products, Cloud Ser...
 
Webinar: iPaaS in the Enterprise - What to Look for in a Cloud Integration Pl...
Webinar: iPaaS in the Enterprise - What to Look for in a Cloud Integration Pl...Webinar: iPaaS in the Enterprise - What to Look for in a Cloud Integration Pl...
Webinar: iPaaS in the Enterprise - What to Look for in a Cloud Integration Pl...
 
Big Data Everywhere Chicago: Platfora - Practices for Customer Analytics on H...
Big Data Everywhere Chicago: Platfora - Practices for Customer Analytics on H...Big Data Everywhere Chicago: Platfora - Practices for Customer Analytics on H...
Big Data Everywhere Chicago: Platfora - Practices for Customer Analytics on H...
 
Apache Kylin and Use Cases - 2018 Big Data Spain
Apache Kylin and Use Cases - 2018 Big Data SpainApache Kylin and Use Cases - 2018 Big Data Spain
Apache Kylin and Use Cases - 2018 Big Data Spain
 
Driving the On-Demand Economy with Predictive Analytics
Driving the On-Demand Economy with Predictive AnalyticsDriving the On-Demand Economy with Predictive Analytics
Driving the On-Demand Economy with Predictive Analytics
 
Hadoop and the Relational Database: The Best of Both Worlds
Hadoop and the Relational Database: The Best of Both WorldsHadoop and the Relational Database: The Best of Both Worlds
Hadoop and the Relational Database: The Best of Both Worlds
 
CDS Overview (May 2015)
CDS Overview (May 2015)CDS Overview (May 2015)
CDS Overview (May 2015)
 
Top Trends in Building Data Lakes for Machine Learning and AI
Top Trends in Building Data Lakes for Machine Learning and AI Top Trends in Building Data Lakes for Machine Learning and AI
Top Trends in Building Data Lakes for Machine Learning and AI
 
Data & Analytics - Session 1 - Big Data Analytics
Data & Analytics - Session 1 -  Big Data AnalyticsData & Analytics - Session 1 -  Big Data Analytics
Data & Analytics - Session 1 - Big Data Analytics
 
Power to the People: A Stack to Empower Every User to Make Data-Driven Decisions
Power to the People: A Stack to Empower Every User to Make Data-Driven DecisionsPower to the People: A Stack to Empower Every User to Make Data-Driven Decisions
Power to the People: A Stack to Empower Every User to Make Data-Driven Decisions
 

More from MediaMath

Making Programmatic Work
Making Programmatic WorkMaking Programmatic Work
Making Programmatic WorkMediaMath
 
Conquering the Omni-Channel Challenge
Conquering the Omni-Channel ChallengeConquering the Omni-Channel Challenge
Conquering the Omni-Channel ChallengeMediaMath
 
Automating Mobile Success for the Holiday Season - MediaMath and Millennial M...
Automating Mobile Success for the Holiday Season - MediaMath and Millennial M...Automating Mobile Success for the Holiday Season - MediaMath and Millennial M...
Automating Mobile Success for the Holiday Season - MediaMath and Millennial M...MediaMath
 
iMedia Brand Summit "Reengineering Omni-Channel Marketing to Drive Revenue"
iMedia Brand Summit "Reengineering Omni-Channel Marketing to Drive Revenue"iMedia Brand Summit "Reengineering Omni-Channel Marketing to Drive Revenue"
iMedia Brand Summit "Reengineering Omni-Channel Marketing to Drive Revenue"MediaMath
 
Reengineering the Omni-Channel Opportunity
Reengineering the Omni-Channel OpportunityReengineering the Omni-Channel Opportunity
Reengineering the Omni-Channel OpportunityMediaMath
 
Laying the Groundwork for Programmatic Success - Digiday Retail Summit 2014
Laying the Groundwork for Programmatic Success - Digiday Retail Summit 2014Laying the Groundwork for Programmatic Success - Digiday Retail Summit 2014
Laying the Groundwork for Programmatic Success - Digiday Retail Summit 2014MediaMath
 
Conquering the Omni-Channel Challenge
Conquering the Omni-Channel ChallengeConquering the Omni-Channel Challenge
Conquering the Omni-Channel ChallengeMediaMath
 
Data Story 2: Supply Matters
Data Story 2: Supply MattersData Story 2: Supply Matters
Data Story 2: Supply MattersMediaMath
 
Data Story 1: The Publisher is Not Dead
Data Story 1: The Publisher is Not DeadData Story 1: The Publisher is Not Dead
Data Story 1: The Publisher is Not DeadMediaMath
 

More from MediaMath (9)

Making Programmatic Work
Making Programmatic WorkMaking Programmatic Work
Making Programmatic Work
 
Conquering the Omni-Channel Challenge
Conquering the Omni-Channel ChallengeConquering the Omni-Channel Challenge
Conquering the Omni-Channel Challenge
 
Automating Mobile Success for the Holiday Season - MediaMath and Millennial M...
Automating Mobile Success for the Holiday Season - MediaMath and Millennial M...Automating Mobile Success for the Holiday Season - MediaMath and Millennial M...
Automating Mobile Success for the Holiday Season - MediaMath and Millennial M...
 
iMedia Brand Summit "Reengineering Omni-Channel Marketing to Drive Revenue"
iMedia Brand Summit "Reengineering Omni-Channel Marketing to Drive Revenue"iMedia Brand Summit "Reengineering Omni-Channel Marketing to Drive Revenue"
iMedia Brand Summit "Reengineering Omni-Channel Marketing to Drive Revenue"
 
Reengineering the Omni-Channel Opportunity
Reengineering the Omni-Channel OpportunityReengineering the Omni-Channel Opportunity
Reengineering the Omni-Channel Opportunity
 
Laying the Groundwork for Programmatic Success - Digiday Retail Summit 2014
Laying the Groundwork for Programmatic Success - Digiday Retail Summit 2014Laying the Groundwork for Programmatic Success - Digiday Retail Summit 2014
Laying the Groundwork for Programmatic Success - Digiday Retail Summit 2014
 
Conquering the Omni-Channel Challenge
Conquering the Omni-Channel ChallengeConquering the Omni-Channel Challenge
Conquering the Omni-Channel Challenge
 
Data Story 2: Supply Matters
Data Story 2: Supply MattersData Story 2: Supply Matters
Data Story 2: Supply Matters
 
Data Story 1: The Publisher is Not Dead
Data Story 1: The Publisher is Not DeadData Story 1: The Publisher is Not Dead
Data Story 1: The Publisher is Not Dead
 

Recently uploaded

April 2024 - Crypto Market Report's Analysis
April 2024 - Crypto Market Report's AnalysisApril 2024 - Crypto Market Report's Analysis
April 2024 - Crypto Market Report's Analysismanisha194592
 
Industrialised data - the key to AI success.pdf
Industrialised data - the key to AI success.pdfIndustrialised data - the key to AI success.pdf
Industrialised data - the key to AI success.pdfLars Albertsson
 
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip CallDelhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Callshivangimorya083
 
VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...
VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...
VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...Suhani Kapoor
 
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdfMarket Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdfRachmat Ramadhan H
 
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Callshivangimorya083
 
Carero dropshipping via API with DroFx.pptx
Carero dropshipping via API with DroFx.pptxCarero dropshipping via API with DroFx.pptx
Carero dropshipping via API with DroFx.pptxolyaivanovalion
 
Invezz.com - Grow your wealth with trading signals
Invezz.com - Grow your wealth with trading signalsInvezz.com - Grow your wealth with trading signals
Invezz.com - Grow your wealth with trading signalsInvezz1
 
Midocean dropshipping via API with DroFx
Midocean dropshipping via API with DroFxMidocean dropshipping via API with DroFx
Midocean dropshipping via API with DroFxolyaivanovalion
 
04242024_CCC TUG_Joins and Relationships
04242024_CCC TUG_Joins and Relationships04242024_CCC TUG_Joins and Relationships
04242024_CCC TUG_Joins and Relationshipsccctableauusergroup
 
Call Girls In Mahipalpur O9654467111 Escorts Service
Call Girls In Mahipalpur O9654467111  Escorts ServiceCall Girls In Mahipalpur O9654467111  Escorts Service
Call Girls In Mahipalpur O9654467111 Escorts ServiceSapana Sha
 
BigBuy dropshipping via API with DroFx.pptx
BigBuy dropshipping via API with DroFx.pptxBigBuy dropshipping via API with DroFx.pptx
BigBuy dropshipping via API with DroFx.pptxolyaivanovalion
 
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al BarshaAl Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al BarshaAroojKhan71
 
VIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service Amravati
VIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service AmravatiVIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service Amravati
VIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service AmravatiSuhani Kapoor
 
Brighton SEO | April 2024 | Data Storytelling
Brighton SEO | April 2024 | Data StorytellingBrighton SEO | April 2024 | Data Storytelling
Brighton SEO | April 2024 | Data StorytellingNeil Barnes
 
100-Concepts-of-AI by Anupama Kate .pptx
100-Concepts-of-AI by Anupama Kate .pptx100-Concepts-of-AI by Anupama Kate .pptx
100-Concepts-of-AI by Anupama Kate .pptxAnupama Kate
 
FESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdfFESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdfMarinCaroMartnezBerg
 
Customer Service Analytics - Make Sense of All Your Data.pptx
Customer Service Analytics - Make Sense of All Your Data.pptxCustomer Service Analytics - Make Sense of All Your Data.pptx
Customer Service Analytics - Make Sense of All Your Data.pptxEmmanuel Dauda
 

Recently uploaded (20)

꧁❤ Aerocity Call Girls Service Aerocity Delhi ❤꧂ 9999965857 ☎️ Hard And Sexy ...
꧁❤ Aerocity Call Girls Service Aerocity Delhi ❤꧂ 9999965857 ☎️ Hard And Sexy ...꧁❤ Aerocity Call Girls Service Aerocity Delhi ❤꧂ 9999965857 ☎️ Hard And Sexy ...
꧁❤ Aerocity Call Girls Service Aerocity Delhi ❤꧂ 9999965857 ☎️ Hard And Sexy ...
 
April 2024 - Crypto Market Report's Analysis
April 2024 - Crypto Market Report's AnalysisApril 2024 - Crypto Market Report's Analysis
April 2024 - Crypto Market Report's Analysis
 
Industrialised data - the key to AI success.pdf
Industrialised data - the key to AI success.pdfIndustrialised data - the key to AI success.pdf
Industrialised data - the key to AI success.pdf
 
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip CallDelhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
 
VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...
VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...
VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...
 
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdfMarket Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
 
Delhi 99530 vip 56974 Genuine Escort Service Call Girls in Kishangarh
Delhi 99530 vip 56974 Genuine Escort Service Call Girls in  KishangarhDelhi 99530 vip 56974 Genuine Escort Service Call Girls in  Kishangarh
Delhi 99530 vip 56974 Genuine Escort Service Call Girls in Kishangarh
 
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
 
Carero dropshipping via API with DroFx.pptx
Carero dropshipping via API with DroFx.pptxCarero dropshipping via API with DroFx.pptx
Carero dropshipping via API with DroFx.pptx
 
Invezz.com - Grow your wealth with trading signals
Invezz.com - Grow your wealth with trading signalsInvezz.com - Grow your wealth with trading signals
Invezz.com - Grow your wealth with trading signals
 
Midocean dropshipping via API with DroFx
Midocean dropshipping via API with DroFxMidocean dropshipping via API with DroFx
Midocean dropshipping via API with DroFx
 
04242024_CCC TUG_Joins and Relationships
04242024_CCC TUG_Joins and Relationships04242024_CCC TUG_Joins and Relationships
04242024_CCC TUG_Joins and Relationships
 
Call Girls In Mahipalpur O9654467111 Escorts Service
Call Girls In Mahipalpur O9654467111  Escorts ServiceCall Girls In Mahipalpur O9654467111  Escorts Service
Call Girls In Mahipalpur O9654467111 Escorts Service
 
BigBuy dropshipping via API with DroFx.pptx
BigBuy dropshipping via API with DroFx.pptxBigBuy dropshipping via API with DroFx.pptx
BigBuy dropshipping via API with DroFx.pptx
 
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al BarshaAl Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
 
VIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service Amravati
VIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service AmravatiVIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service Amravati
VIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service Amravati
 
Brighton SEO | April 2024 | Data Storytelling
Brighton SEO | April 2024 | Data StorytellingBrighton SEO | April 2024 | Data Storytelling
Brighton SEO | April 2024 | Data Storytelling
 
100-Concepts-of-AI by Anupama Kate .pptx
100-Concepts-of-AI by Anupama Kate .pptx100-Concepts-of-AI by Anupama Kate .pptx
100-Concepts-of-AI by Anupama Kate .pptx
 
FESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdfFESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdf
 
Customer Service Analytics - Make Sense of All Your Data.pptx
Customer Service Analytics - Make Sense of All Your Data.pptxCustomer Service Analytics - Make Sense of All Your Data.pptx
Customer Service Analytics - Make Sense of All Your Data.pptx
 

How MediaMath Built Faster, Scalable Attribution Reporting with Hadoop-Impala

  • 1. How  MediaMath  Solved  a  Cri1cal  Repor1ng  Problem  with  Impala   ©2014  MEDIAMATH  INC.    1   The  Cloudera  Sessions   June  18,  2014   Ram  Narayanan,  Senior  Director  of  Database  Architecture  &  Opera1ons  
  • 2. Digital  Marke1ng  Pioneer   •  Founded  in  2007   •  Global  technology  company   •  Invented  first  Demand  Side  PlaJorm  (DSP)  for  online  ads     •  Conducts  online  adverNsing  through  real-­‐Nme  bidding  &   programmaNc  buying     About  MediaMath   ©2014  MEDIAMATH  INC.    2  
  • 3. About  MediaMath   Overview  of  Real-­‐Time  Bidding   Real-­‐1me   Auc1on   <30  ms   Adver1ser   (Client)   User      ad       www.cnn.com   ad  
  • 4. About  MediaMath   Overview  or  Real-­‐Time  Bidding   User   www.cnn.com       Purchased!      ad   www.shoes.com       $$   Event  Logs  
  • 5. •  Ad  OpportuniNes:  80-­‐100  billion  per  day     "  1.2  million  opportuniNes  per  second  at  peak   •  We  bid  on  30-­‐40  billion  ads  per  day   •  We  serve  1-­‐2  billion  ads  per  day   •  15-­‐20  million  events  (click,  sale,  online  sign-­‐up)  per  hour   •  2  TB  of  data  daily  (compressed)   "  Note:  This  only  counts  our  wins.  If  we  count  losses,  we  easily  reach  PBs.   About  MediaMath  
  • 6. Which  ad  (impression)  led  to  which  ac1on,  like  a  sale  or  online  signup   •  35-­‐40  billion  recorded  impressions  served  every  30  days   •  15-­‐20  million  events  per  hour   •  Need  to  join  events  with  impressions  2x  per  hour   à  Find  matching  records   à  Perform  complex  sequencing  &  allocaNon  logic   à  Run  aggregaNons  on  results   à  Send  data  to  data  marts     à  Provide  hourly  reporNng  to  clients     The  Repor1ng  AZribu1on  Problem   ©2014  MEDIAMATH  INC.    6  
  • 7. Incumbent  Architecture:     Appliance-­‐based  (Netezza)     Cost:  Expensive  -­‐   Scale:  Non-­‐incremental  scalability  -­‐   Performance:  ReporNng  lag  -­‐   ReporNng  inflexibility   Product  feature  constrained  -­‐   -­‐  
  • 8. To  build  a  data  warehouse  architecture  that  could   perform  hourly  repor1ng  of  aZribu1on  data  at  scale  that   is  affordable  and  easy  to  manage.     Our  goal  
  • 9. " Scalability   Handle  10-­‐50x  scale   " Capability     Ability  to  perform  big  data  joins  at  scale   " Performance   Complete  aggregaNon  in  <60  minutes   " Cost  effec1ve   Cheaper  than  appliance-­‐based  soluNons       ©2014  MEDIAMATH  INC.    9   EvaluaNon  Criteria:  
  • 10. " Hive   Run  Nme:  Took  5-­‐6  hours  to  complete   Stability:  High     " Pig   Run  Nme:  Took  4-­‐5  hours  to  complete   Stability:  High   " Impala  Beta  (0.6)   Run  Nme:  Took  2-­‐3  hours  to  complete   Stability:  Low         Evaluated  OpNons:  Round  1  
  • 11. " Hive:  Post-­‐Tuning  (map  joins,  bucke1ng,  split  size,  etc.)   Run  Nme:  Took  2-­‐3  hours  to  complete   Stability:  High     " Impala  GPA  (1.0)  (L0  compression,  slicing,  tuning,  hw   upgrade)   Run  Nme:  Took  30  minutes  to  complete   Stability:  High         Evaluated  OpNons:  Round  2  
  • 12. Data  Warehouse  Architecture  2011   Bid  Logs   Pixel  Logs   Metadata   Repor1ng     Data   Marts     Repor1ng     Data   Marts     Repor1ng     Data   Marts     Repor1ng     Data   Marts                                       ELT   A   T   T   R   I   B   U   T   I   O   N   Repor ts   Aggr ega1 on   Netezza   2011  
  • 13. Data  Warehouse  Architecture  2011   Bid  Logs   Pixel  Logs   Metadata   Repor1ng     Data   Marts     Repor1ng     Data   Marts     Repor1ng     Data   Marts     Repor1ng     Data   Marts                                       ELT   A   T   T   R   I   B   U   T   I   O   N   Repor ts   Aggr ega1 on                                   Reports  Aggrega1on   Netezza  Hadoop   2013  
  • 14. •  December  2013:  Peak  season   "  New  architecture  accommodated  2x  data  volume  with  unprecedented   scalability  &  stability   •  Present:  We  are  planning  to  add  more  features     "  Considering  moving  some  part  of  aggregaNon  into  Hadoop   Proof:     ©2014  MEDIAMATH  INC.    14  
  • 15. •  Process  ONLY  the  required  data   •  Compress  your  data   •  “Divide  &  Conquer”  your  data  (i.e.  slice  and  dice)   Lessons  Learned  &  Best  Prac1ces   ©2014  MEDIAMATH  INC.    15