SlideShare a Scribd company logo
1 of 20
Download to read offline
IGNITING AUDIENCE MEASUREMENT
AT TIME WARNER CABLE
TIM CASE
Agenda
• Who is Time Warner Cable & Time Warner Cable Media
• What is Audience Measurement?
• Challenges With Legacy Architecture
• Next Generation Architecture
• Lessons Learned
1
Who Am I?
• 10+ years in E-commerce
• Focused on Data Warehousing for the last 5 years
• Certifications
– Cloudera Certified Administrator for Apache Hadoop (CCAH)
– Cloudera Certified Developer for Apache Hadoop (CCDH)
– Teradata Certified Professional
– IBM Certified Specialist - PureData System for Analytics
– Tableau Server Certified Professional
– MicroStrategy Certified Engineering Principal
– Certified ScrumMaster
– Certified SAFe Agilist
• College sports fan – Go Noles!
2
Time Warner Cable & Time Warner Cable Media
Time Warner Cable is among the largest providers of video, high-
speed data and voice services in the U.S., connecting more than
15 million customers to entertainment, information and each
other
• Serves customers in 29 states
• More than 50,000 employees across the U.S.
Time Warner Cable Media, the advertising arm of Time Warner
Cable, provides national, regional and local marketers and
agencies with innovative, strategic and cost effective advertising
solutions.
3
The Audience Measurement platforms enables census reporting of subscriber
viewership and allows us to answer the Five W’s
Who is watching?
– Anonymized demographics, consumer behaviors
What are they watching?
– Station, program information, advertisements
When are they watching?
– Day of week, daypart, time-shifted
Where are they watching?
– Set-top box, TWC TV apps
Why are they watching?
– Program metadata
What is Audience Measurement?
4
Viewership Data
• Set-top box
– Processing more than 500 million events per day
• Largest table is Program Tuning Event Fact
 75 TB of raw data
 180+ Billion records
• TWC TV app (iPad, iPhone, Android, Xbox, etc.)
• Video On Demand (VOD)
Ads Data
• TWC Media and 3rd party spots
Reference Data
• Household demographics
• Program data
• Automotive data
• Political affiliation
5 Heavy
Analytical
Users
200
Audience
Finder
Users
50 Tableau
Consumers
5
Audience Measurement
By the Numbers
• Around 100 Tableau Workbooks
– Authored by the business and IT
• Numerous ad hoc queries
6
Video Viewership Analyzer (VVA)
• Custom application that enables complex audience definition by the user
community
– Date range
– Geography (DMA, Ad Zone or Zip Code)
– Platform (Classic, IPTV)
– Audience Definition
• Daypart
• Station and/or Program
• Demographics (includes line-of-business, propensities, Tribes and automotive)
• Platform usage (VOD, IPTV, high-speed data)
• Custom segmentation
• Output includes ranked list of stations and some high-level metrics
7
Audience Finder
Audience Finder: Reference Program
8
Technology
9
• 3rd Party application ingests raw data and performs anonymization,
correlation and some enrichment/mediation
• TWC ingests files provided by 3rd Party and performs additional enrichment
as well as applying business rules and stitching logic
– Executed in Netezza using SQL and shell scripts
• Two Netezza appliances
– TwinFin 36 used for ELT processing
– TwinFin 72 used for BI and customer-facing workloads
10
Legacy Platform Architecture
Source
Data
TWC Media
Business Logic
Stitching
Filtering
Zombie Logic
Core Logic
Anonymization
Correlation
Mediation
Enrichment
Collection
•Inconsistency
around
reliability and
availability of
source and
reference data
Processing
•Slow catch up
process
•Arch does not
promote speed
to market for
new features
Data Storage
+ Delivery
•Platform
instability
•Does not
support
concurrent
users
Analysis +
Presentation
•Limited
exploration and
interactive
capabilities
Challenges With Legacy Architecture
11
• SLA’s for T-3 and
T-14
• Frequency of
reprocessing
• Reference data
quality
• Duration of
reprocessing
• Team Velocity
when introducing
ETL changes
• Platform
availability
• Query response
times
• Response time
SLA’s during mixed
workload
• User satisfaction
w/ the interface
• Customer
dependency on IT
for changes
Metrics to Assess
Technical Criteria
• Performance
• Supports batch and streaming
• Leverage software engineering patterns
• Open source momentum
• “-ilities”
– Scalability
– Elasticity
– Availability
– Durability
– Extensibility
• Enables DevOps to compliment Agile adoption
– Automated testing
– Test-driven Development (TDD)
– Continuous Integration (CI)
• Strong foundation for Data Lake
12
Data Warehouse
Event Persistence
Hadoop
Visualization
Data Integration
Apache Spark is a more appropriate solution for set-top box processing logic:
Reduces complexity, simplifies code maintenance, improves defect resolution
time, improves run-time.
Can be applied in batch or near real-time with modest changes which positions
for T-x data availability (where ‘x’ is only limited by the availability of reference
data)
Enables use of Agile development principles (test-driven development and
continuous integration) there by Improving time-to-market, code quality, and
radically reducing QA costs and time.
Hadoop/HDFS for storing large historical data positions the organization
to leverage the evolving open source big data analytics technologies
(machine learning, SQL on Hadoop, graph processing, etc.)
Teradata will allow for large volumes of tuning event data to be secure,
easily accessible, and highly available to large numbers of users and at
reasonable cost.
Tableau enables self-service analytics, including advanced algorithms,
against the audience measurement data, then present information to
various consumers in meaningful ways.
Kafka is a high-performance, fault-tolerant, real-time messaging platform that
will allow us to keep a history of tuning events for faster reprocessing. This
component is critical once we are performing near real-time streaming of
events.
13
Technologies Selected
Core Logic
Anonymization
Correlation
Mediation
Enrichment
TWC Media Business
Logic
Stitching
Filtering
Zombie Logic
Initial Nextgen Architecture
Replace MicroStrategy with Tableau to enable self-service
Replace Netezza for customer facing workloads with Teradata, improving
platform stability, enabling sandboxes (e.g., Data Labs) and workload
management tools which assist in managing to performance SLA’s
o Replace 3rd Party application and Netezza ELT with Spark for Collection and
Processing logic (anonymization , correlation, enrichment, filtering,
stitching, & zombie logic)
Source
Data
14
Long-Term Architecture
• Implement an enterprise Data Lake to enable non-Media use cases
• Migrate to Spark Streaming and Kafka to enable near real-time use cases
• Evaluate dedicated infrastructure for more predictable performance
Event
Data
Business Logic
Stitching
Filtering
Zombie Logic
Data Lake
Anonymization
Correlation
Mediation
Enrichment
15
Reference
Data
Lessons Learned
• Have Executive support
• Infrastructure is critical
– Node sizes
– Network
• Leverage the open source community
– Enhancements
– Extensions (Spark Packages)
• Talent is hard to find
– Consider abstractions
16
Partners
17
We’re Hiring!
http://jobs.timewarnercable.com/
Thank You!
tim.case@twcable.com
@timrcase
http://www.linkedin.com/in/timrcase

More Related Content

What's hot

Simplify Cloud Migration to AWS with RISC Network’s Complete App Analysis
Simplify Cloud Migration  to  AWS with RISC Network’s Complete App AnalysisSimplify Cloud Migration  to  AWS with RISC Network’s Complete App Analysis
Simplify Cloud Migration to AWS with RISC Network’s Complete App AnalysisRISC Networks
 
Accenture 2014 AWS re:Invent Enterprise Migration Breakout Session
Accenture 2014 AWS re:Invent Enterprise Migration Breakout SessionAccenture 2014 AWS re:Invent Enterprise Migration Breakout Session
Accenture 2014 AWS re:Invent Enterprise Migration Breakout SessionTom Laszewski
 
Product Management Essentials
Product Management EssentialsProduct Management Essentials
Product Management EssentialsÖmer Demir
 
Cloud Readiness 101: Analyzing and Visualizing Your IT Infrastructure
Cloud Readiness 101: Analyzing and Visualizing Your IT InfrastructureCloud Readiness 101: Analyzing and Visualizing Your IT Infrastructure
Cloud Readiness 101: Analyzing and Visualizing Your IT Infrastructurepanagenda
 
RightScale Webinar: Operationalize Your Enterprise AWS Usage Through an IT Ve...
RightScale Webinar: Operationalize Your Enterprise AWS Usage Through an IT Ve...RightScale Webinar: Operationalize Your Enterprise AWS Usage Through an IT Ve...
RightScale Webinar: Operationalize Your Enterprise AWS Usage Through an IT Ve...RightScale
 
AWS Webcast - Datacenter Migration to AWS
AWS Webcast - Datacenter Migration to AWSAWS Webcast - Datacenter Migration to AWS
AWS Webcast - Datacenter Migration to AWSAmazon Web Services
 
Openshift serverless Solution
Openshift serverless SolutionOpenshift serverless Solution
Openshift serverless SolutionRyan ZhangCheng
 
Best Practices for Data Center Migration Planning - August 2016 Monthly Webin...
Best Practices for Data Center Migration Planning - August 2016 Monthly Webin...Best Practices for Data Center Migration Planning - August 2016 Monthly Webin...
Best Practices for Data Center Migration Planning - August 2016 Monthly Webin...Amazon Web Services
 
Informatica + Hadoop = Best of Both Worlds
Informatica + Hadoop = Best of Both WorldsInformatica + Hadoop = Best of Both Worlds
Informatica + Hadoop = Best of Both WorldsAhmed Tayeh
 
Best Practices for Architecting VDI with Flash Storage
Best Practices for Architecting VDI with Flash StorageBest Practices for Architecting VDI with Flash Storage
Best Practices for Architecting VDI with Flash StorageRyan Snell
 
Applying systems thinking to AWS enterprise application migration
Applying systems thinking to AWS enterprise application migrationApplying systems thinking to AWS enterprise application migration
Applying systems thinking to AWS enterprise application migrationKacy Clarke
 
APAC Confluent Consumer Data Right the Lowdown and the Lessons
APAC Confluent Consumer Data Right the Lowdown and the LessonsAPAC Confluent Consumer Data Right the Lowdown and the Lessons
APAC Confluent Consumer Data Right the Lowdown and the Lessonsconfluent
 
Mass Migration Strategy - A Key Step in the Enterprise Transformation - AWS C...
Mass Migration Strategy - A Key Step in the Enterprise Transformation - AWS C...Mass Migration Strategy - A Key Step in the Enterprise Transformation - AWS C...
Mass Migration Strategy - A Key Step in the Enterprise Transformation - AWS C...AWS Germany
 
AWS Webcast - Informatica - Big Data Solutions Showcase
AWS Webcast - Informatica - Big Data Solutions ShowcaseAWS Webcast - Informatica - Big Data Solutions Showcase
AWS Webcast - Informatica - Big Data Solutions ShowcaseAmazon Web Services
 
Transform Your Mainframe Data for the Cloud with Precisely and Apache Kafka
Transform Your Mainframe Data for the Cloud with Precisely and Apache KafkaTransform Your Mainframe Data for the Cloud with Precisely and Apache Kafka
Transform Your Mainframe Data for the Cloud with Precisely and Apache KafkaPrecisely
 
Building a Graph Database in Neo4j with Spark & Spark SQL to gain new insight...
Building a Graph Database in Neo4j with Spark & Spark SQL to gain new insight...Building a Graph Database in Neo4j with Spark & Spark SQL to gain new insight...
Building a Graph Database in Neo4j with Spark & Spark SQL to gain new insight...DataWorks Summit/Hadoop Summit
 
API Days Singapore
API Days SingaporeAPI Days Singapore
API Days Singaporeconfluent
 

What's hot (20)

Simplify Cloud Migration to AWS with RISC Network’s Complete App Analysis
Simplify Cloud Migration  to  AWS with RISC Network’s Complete App AnalysisSimplify Cloud Migration  to  AWS with RISC Network’s Complete App Analysis
Simplify Cloud Migration to AWS with RISC Network’s Complete App Analysis
 
Accenture 2014 AWS re:Invent Enterprise Migration Breakout Session
Accenture 2014 AWS re:Invent Enterprise Migration Breakout SessionAccenture 2014 AWS re:Invent Enterprise Migration Breakout Session
Accenture 2014 AWS re:Invent Enterprise Migration Breakout Session
 
Product Management Essentials
Product Management EssentialsProduct Management Essentials
Product Management Essentials
 
Cloud Readiness 101: Analyzing and Visualizing Your IT Infrastructure
Cloud Readiness 101: Analyzing and Visualizing Your IT InfrastructureCloud Readiness 101: Analyzing and Visualizing Your IT Infrastructure
Cloud Readiness 101: Analyzing and Visualizing Your IT Infrastructure
 
RightScale Webinar: Operationalize Your Enterprise AWS Usage Through an IT Ve...
RightScale Webinar: Operationalize Your Enterprise AWS Usage Through an IT Ve...RightScale Webinar: Operationalize Your Enterprise AWS Usage Through an IT Ve...
RightScale Webinar: Operationalize Your Enterprise AWS Usage Through an IT Ve...
 
Cloud Migration: Moving to the Cloud
Cloud Migration: Moving to the CloudCloud Migration: Moving to the Cloud
Cloud Migration: Moving to the Cloud
 
AWS Webcast - Datacenter Migration to AWS
AWS Webcast - Datacenter Migration to AWSAWS Webcast - Datacenter Migration to AWS
AWS Webcast - Datacenter Migration to AWS
 
Openshift serverless Solution
Openshift serverless SolutionOpenshift serverless Solution
Openshift serverless Solution
 
Best Practices for Data Center Migration Planning - August 2016 Monthly Webin...
Best Practices for Data Center Migration Planning - August 2016 Monthly Webin...Best Practices for Data Center Migration Planning - August 2016 Monthly Webin...
Best Practices for Data Center Migration Planning - August 2016 Monthly Webin...
 
Cloud Migration
Cloud MigrationCloud Migration
Cloud Migration
 
Informatica + Hadoop = Best of Both Worlds
Informatica + Hadoop = Best of Both WorldsInformatica + Hadoop = Best of Both Worlds
Informatica + Hadoop = Best of Both Worlds
 
Best Practices for Architecting VDI with Flash Storage
Best Practices for Architecting VDI with Flash StorageBest Practices for Architecting VDI with Flash Storage
Best Practices for Architecting VDI with Flash Storage
 
Applying systems thinking to AWS enterprise application migration
Applying systems thinking to AWS enterprise application migrationApplying systems thinking to AWS enterprise application migration
Applying systems thinking to AWS enterprise application migration
 
APAC Confluent Consumer Data Right the Lowdown and the Lessons
APAC Confluent Consumer Data Right the Lowdown and the LessonsAPAC Confluent Consumer Data Right the Lowdown and the Lessons
APAC Confluent Consumer Data Right the Lowdown and the Lessons
 
Enterprise Cloud for your Business Applications
Enterprise Cloud for your Business ApplicationsEnterprise Cloud for your Business Applications
Enterprise Cloud for your Business Applications
 
Mass Migration Strategy - A Key Step in the Enterprise Transformation - AWS C...
Mass Migration Strategy - A Key Step in the Enterprise Transformation - AWS C...Mass Migration Strategy - A Key Step in the Enterprise Transformation - AWS C...
Mass Migration Strategy - A Key Step in the Enterprise Transformation - AWS C...
 
AWS Webcast - Informatica - Big Data Solutions Showcase
AWS Webcast - Informatica - Big Data Solutions ShowcaseAWS Webcast - Informatica - Big Data Solutions Showcase
AWS Webcast - Informatica - Big Data Solutions Showcase
 
Transform Your Mainframe Data for the Cloud with Precisely and Apache Kafka
Transform Your Mainframe Data for the Cloud with Precisely and Apache KafkaTransform Your Mainframe Data for the Cloud with Precisely and Apache Kafka
Transform Your Mainframe Data for the Cloud with Precisely and Apache Kafka
 
Building a Graph Database in Neo4j with Spark & Spark SQL to gain new insight...
Building a Graph Database in Neo4j with Spark & Spark SQL to gain new insight...Building a Graph Database in Neo4j with Spark & Spark SQL to gain new insight...
Building a Graph Database in Neo4j with Spark & Spark SQL to gain new insight...
 
API Days Singapore
API Days SingaporeAPI Days Singapore
API Days Singapore
 

Viewers also liked

MicroStrategy World 2014: Scaling MicroStrategy at eBay
MicroStrategy World 2014: Scaling MicroStrategy at eBayMicroStrategy World 2014: Scaling MicroStrategy at eBay
MicroStrategy World 2014: Scaling MicroStrategy at eBayTim Case
 
AWS re:Invent 2016: Turner's cloud native media supply chain for TNT, TBS, Ad...
AWS re:Invent 2016: Turner's cloud native media supply chain for TNT, TBS, Ad...AWS re:Invent 2016: Turner's cloud native media supply chain for TNT, TBS, Ad...
AWS re:Invent 2016: Turner's cloud native media supply chain for TNT, TBS, Ad...Amazon Web Services
 
Case Study: Time Warner Cable's Formula for Maximizing Adobe Experience Manager
Case Study: Time Warner Cable's Formula for Maximizing Adobe Experience Manager Case Study: Time Warner Cable's Formula for Maximizing Adobe Experience Manager
Case Study: Time Warner Cable's Formula for Maximizing Adobe Experience Manager Mark Kelley
 
CUMULUS Cloud Broadcast Platform
CUMULUS Cloud Broadcast PlatformCUMULUS Cloud Broadcast Platform
CUMULUS Cloud Broadcast PlatformMaria Baker
 
Next Generation Audience Measurement at Spectrum Reach
Next Generation Audience Measurement at Spectrum ReachNext Generation Audience Measurement at Spectrum Reach
Next Generation Audience Measurement at Spectrum ReachTim Case
 
AWS re:Invent 2016: Discovery Channel's Broadcast Workflows and Channel Origi...
AWS re:Invent 2016: Discovery Channel's Broadcast Workflows and Channel Origi...AWS re:Invent 2016: Discovery Channel's Broadcast Workflows and Channel Origi...
AWS re:Invent 2016: Discovery Channel's Broadcast Workflows and Channel Origi...Amazon Web Services
 
MicroStrategy BI Solutions for Retail Industry
MicroStrategy BI Solutions for Retail IndustryMicroStrategy BI Solutions for Retail Industry
MicroStrategy BI Solutions for Retail IndustryBiBoard.Org
 
2. Google Analytics New Interface - Search University 3
2. Google Analytics New Interface - Search University 32. Google Analytics New Interface - Search University 3
2. Google Analytics New Interface - Search University 3Semetis
 
Ancestry Microstrategy World 2015 Presentation
Ancestry Microstrategy World 2015 PresentationAncestry Microstrategy World 2015 Presentation
Ancestry Microstrategy World 2015 PresentationDavid Sanders
 
How to Quickly and Easily Draw Value from Big Data Sources_Q3 symposia(Moa)
How to Quickly and Easily Draw Value  from Big Data Sources_Q3 symposia(Moa)How to Quickly and Easily Draw Value  from Big Data Sources_Q3 symposia(Moa)
How to Quickly and Easily Draw Value from Big Data Sources_Q3 symposia(Moa)Moacyr Passador
 
Making Data Visualization & Analytics accessible to Business Users
Making Data Visualization & Analytics accessible to Business UsersMaking Data Visualization & Analytics accessible to Business Users
Making Data Visualization & Analytics accessible to Business UsersHaroen Vermylen
 
MicroStrategy on Amazon Web Services (AWS) Cloud
MicroStrategy on Amazon Web Services (AWS) CloudMicroStrategy on Amazon Web Services (AWS) Cloud
MicroStrategy on Amazon Web Services (AWS) CloudCCG
 
Yahoo Microstrategy 2008
Yahoo Microstrategy 2008Yahoo Microstrategy 2008
Yahoo Microstrategy 2008Amr Awadallah
 
Hadoop Integration with Microstrategy
Hadoop Integration with Microstrategy Hadoop Integration with Microstrategy
Hadoop Integration with Microstrategy snehal parikh
 
AWS re:Invent 2016: Using AWS to Meet Requirements for Education, Healthcare ...
AWS re:Invent 2016: Using AWS to Meet Requirements for Education, Healthcare ...AWS re:Invent 2016: Using AWS to Meet Requirements for Education, Healthcare ...
AWS re:Invent 2016: Using AWS to Meet Requirements for Education, Healthcare ...Amazon Web Services
 
Microsoft vision & strategy keynote for partners
Microsoft vision & strategy keynote for partnersMicrosoft vision & strategy keynote for partners
Microsoft vision & strategy keynote for partners- Michiel van Vliet -
 

Viewers also liked (20)

MicroStrategy World 2014: Scaling MicroStrategy at eBay
MicroStrategy World 2014: Scaling MicroStrategy at eBayMicroStrategy World 2014: Scaling MicroStrategy at eBay
MicroStrategy World 2014: Scaling MicroStrategy at eBay
 
AWS re:Invent 2016: Turner's cloud native media supply chain for TNT, TBS, Ad...
AWS re:Invent 2016: Turner's cloud native media supply chain for TNT, TBS, Ad...AWS re:Invent 2016: Turner's cloud native media supply chain for TNT, TBS, Ad...
AWS re:Invent 2016: Turner's cloud native media supply chain for TNT, TBS, Ad...
 
Case Study: Time Warner Cable's Formula for Maximizing Adobe Experience Manager
Case Study: Time Warner Cable's Formula for Maximizing Adobe Experience Manager Case Study: Time Warner Cable's Formula for Maximizing Adobe Experience Manager
Case Study: Time Warner Cable's Formula for Maximizing Adobe Experience Manager
 
CUMULUS Cloud Broadcast Platform
CUMULUS Cloud Broadcast PlatformCUMULUS Cloud Broadcast Platform
CUMULUS Cloud Broadcast Platform
 
Next Generation Audience Measurement at Spectrum Reach
Next Generation Audience Measurement at Spectrum ReachNext Generation Audience Measurement at Spectrum Reach
Next Generation Audience Measurement at Spectrum Reach
 
AWS re:Invent 2016: Discovery Channel's Broadcast Workflows and Channel Origi...
AWS re:Invent 2016: Discovery Channel's Broadcast Workflows and Channel Origi...AWS re:Invent 2016: Discovery Channel's Broadcast Workflows and Channel Origi...
AWS re:Invent 2016: Discovery Channel's Broadcast Workflows and Channel Origi...
 
Proiect engleza
Proiect englezaProiect engleza
Proiect engleza
 
MicroStrategy BI Solutions for Retail Industry
MicroStrategy BI Solutions for Retail IndustryMicroStrategy BI Solutions for Retail Industry
MicroStrategy BI Solutions for Retail Industry
 
2. Google Analytics New Interface - Search University 3
2. Google Analytics New Interface - Search University 32. Google Analytics New Interface - Search University 3
2. Google Analytics New Interface - Search University 3
 
Ancestry Microstrategy World 2015 Presentation
Ancestry Microstrategy World 2015 PresentationAncestry Microstrategy World 2015 Presentation
Ancestry Microstrategy World 2015 Presentation
 
How to Quickly and Easily Draw Value from Big Data Sources_Q3 symposia(Moa)
How to Quickly and Easily Draw Value  from Big Data Sources_Q3 symposia(Moa)How to Quickly and Easily Draw Value  from Big Data Sources_Q3 symposia(Moa)
How to Quickly and Easily Draw Value from Big Data Sources_Q3 symposia(Moa)
 
Making Data Visualization & Analytics accessible to Business Users
Making Data Visualization & Analytics accessible to Business UsersMaking Data Visualization & Analytics accessible to Business Users
Making Data Visualization & Analytics accessible to Business Users
 
MicroStrategy on Amazon Web Services (AWS) Cloud
MicroStrategy on Amazon Web Services (AWS) CloudMicroStrategy on Amazon Web Services (AWS) Cloud
MicroStrategy on Amazon Web Services (AWS) Cloud
 
Monetizing public cloud
Monetizing public cloudMonetizing public cloud
Monetizing public cloud
 
Yahoo Microstrategy 2008
Yahoo Microstrategy 2008Yahoo Microstrategy 2008
Yahoo Microstrategy 2008
 
Hadoop Integration with Microstrategy
Hadoop Integration with Microstrategy Hadoop Integration with Microstrategy
Hadoop Integration with Microstrategy
 
Microstrategy Overview
Microstrategy OverviewMicrostrategy Overview
Microstrategy Overview
 
AWS re:Invent 2016: Using AWS to Meet Requirements for Education, Healthcare ...
AWS re:Invent 2016: Using AWS to Meet Requirements for Education, Healthcare ...AWS re:Invent 2016: Using AWS to Meet Requirements for Education, Healthcare ...
AWS re:Invent 2016: Using AWS to Meet Requirements for Education, Healthcare ...
 
Microstrategy
MicrostrategyMicrostrategy
Microstrategy
 
Microsoft vision & strategy keynote for partners
Microsoft vision & strategy keynote for partnersMicrosoft vision & strategy keynote for partners
Microsoft vision & strategy keynote for partners
 

Similar to Igniting Audience Measurement at Time Warner Cable

Vertica Analytics Database general overview
Vertica Analytics Database general overviewVertica Analytics Database general overview
Vertica Analytics Database general overviewStratebi
 
Data Virtualization Journey: How to Grow from Single Project and to Enterpris...
Data Virtualization Journey: How to Grow from Single Project and to Enterpris...Data Virtualization Journey: How to Grow from Single Project and to Enterpris...
Data Virtualization Journey: How to Grow from Single Project and to Enterpris...Denodo
 
The Executive View on Big Data Platform Hosting - Evaluating Hosting Services...
The Executive View on Big Data Platform Hosting - Evaluating Hosting Services...The Executive View on Big Data Platform Hosting - Evaluating Hosting Services...
The Executive View on Big Data Platform Hosting - Evaluating Hosting Services...Chad Lawler
 
What's New in Syncsort's Trillium Line of Data Quality Software - TSS Enterpr...
What's New in Syncsort's Trillium Line of Data Quality Software - TSS Enterpr...What's New in Syncsort's Trillium Line of Data Quality Software - TSS Enterpr...
What's New in Syncsort's Trillium Line of Data Quality Software - TSS Enterpr...Precisely
 
Accelerate Self-Service Analytics with Data Virtualization and Visualization
Accelerate Self-Service Analytics with Data Virtualization and VisualizationAccelerate Self-Service Analytics with Data Virtualization and Visualization
Accelerate Self-Service Analytics with Data Virtualization and VisualizationDenodo
 
Confluent Partner Tech Talk with BearingPoint
Confluent Partner Tech Talk with BearingPointConfluent Partner Tech Talk with BearingPoint
Confluent Partner Tech Talk with BearingPointconfluent
 
Assessing New Databases– Translytical Use Cases
Assessing New Databases– Translytical Use CasesAssessing New Databases– Translytical Use Cases
Assessing New Databases– Translytical Use CasesDATAVERSITY
 
Customer value analysis of big data products
Customer value analysis of big data productsCustomer value analysis of big data products
Customer value analysis of big data productsVikas Sardana
 
ADV Slides: When and How Data Lakes Fit into a Modern Data Architecture
ADV Slides: When and How Data Lakes Fit into a Modern Data ArchitectureADV Slides: When and How Data Lakes Fit into a Modern Data Architecture
ADV Slides: When and How Data Lakes Fit into a Modern Data ArchitectureDATAVERSITY
 
Accelerate Self-Service Analytics with Virtualization and Visualisation (Thai)
Accelerate Self-Service Analytics with Virtualization and Visualisation (Thai)Accelerate Self-Service Analytics with Virtualization and Visualisation (Thai)
Accelerate Self-Service Analytics with Virtualization and Visualisation (Thai)Denodo
 
Neo4j GraphDay Seattle- Sept19- Connected data imperative
Neo4j GraphDay Seattle- Sept19- Connected data imperativeNeo4j GraphDay Seattle- Sept19- Connected data imperative
Neo4j GraphDay Seattle- Sept19- Connected data imperativeNeo4j
 
Logitech - LOGITECH ACCELERATES CLOUD ANALYTICS USING DATA VIRTUALIZATION
Logitech - LOGITECH ACCELERATES CLOUD ANALYTICS USING DATA VIRTUALIZATIONLogitech - LOGITECH ACCELERATES CLOUD ANALYTICS USING DATA VIRTUALIZATION
Logitech - LOGITECH ACCELERATES CLOUD ANALYTICS USING DATA VIRTUALIZATIONAvinash Deshpande
 
Harnessing Big Data_UCLA
Harnessing Big Data_UCLAHarnessing Big Data_UCLA
Harnessing Big Data_UCLAPaul Barsch
 
Qo Introduction V2
Qo Introduction V2Qo Introduction V2
Qo Introduction V2Joe_F
 
How PepsiCo's Big Data Strategy is Disrupting CPG Retail Analytics
How PepsiCo's Big Data Strategy is Disrupting CPG Retail AnalyticsHow PepsiCo's Big Data Strategy is Disrupting CPG Retail Analytics
How PepsiCo's Big Data Strategy is Disrupting CPG Retail AnalyticsHortonworks
 
Maximize cloud and application performance with hundreds of operations bridge...
Maximize cloud and application performance with hundreds of operations bridge...Maximize cloud and application performance with hundreds of operations bridge...
Maximize cloud and application performance with hundreds of operations bridge...Stefan Bergstein
 
Digital Business Transformation for Energy & Utility company
Digital Business Transformation for Energy & Utility companyDigital Business Transformation for Energy & Utility company
Digital Business Transformation for Energy & Utility companyIlham Ahmed
 
Introduction to Data Engineering
Introduction to Data EngineeringIntroduction to Data Engineering
Introduction to Data EngineeringDurga Gadiraju
 
Hadoop Boosts Profits in Media and Telecom Industry
Hadoop Boosts Profits in Media and Telecom IndustryHadoop Boosts Profits in Media and Telecom Industry
Hadoop Boosts Profits in Media and Telecom IndustryDataWorks Summit
 

Similar to Igniting Audience Measurement at Time Warner Cable (20)

Vertica Analytics Database general overview
Vertica Analytics Database general overviewVertica Analytics Database general overview
Vertica Analytics Database general overview
 
Data Virtualization Journey: How to Grow from Single Project and to Enterpris...
Data Virtualization Journey: How to Grow from Single Project and to Enterpris...Data Virtualization Journey: How to Grow from Single Project and to Enterpris...
Data Virtualization Journey: How to Grow from Single Project and to Enterpris...
 
The Executive View on Big Data Platform Hosting - Evaluating Hosting Services...
The Executive View on Big Data Platform Hosting - Evaluating Hosting Services...The Executive View on Big Data Platform Hosting - Evaluating Hosting Services...
The Executive View on Big Data Platform Hosting - Evaluating Hosting Services...
 
What's New in Syncsort's Trillium Line of Data Quality Software - TSS Enterpr...
What's New in Syncsort's Trillium Line of Data Quality Software - TSS Enterpr...What's New in Syncsort's Trillium Line of Data Quality Software - TSS Enterpr...
What's New in Syncsort's Trillium Line of Data Quality Software - TSS Enterpr...
 
Accelerate Self-Service Analytics with Data Virtualization and Visualization
Accelerate Self-Service Analytics with Data Virtualization and VisualizationAccelerate Self-Service Analytics with Data Virtualization and Visualization
Accelerate Self-Service Analytics with Data Virtualization and Visualization
 
Confluent Partner Tech Talk with BearingPoint
Confluent Partner Tech Talk with BearingPointConfluent Partner Tech Talk with BearingPoint
Confluent Partner Tech Talk with BearingPoint
 
Assessing New Databases– Translytical Use Cases
Assessing New Databases– Translytical Use CasesAssessing New Databases– Translytical Use Cases
Assessing New Databases– Translytical Use Cases
 
Customer value analysis of big data products
Customer value analysis of big data productsCustomer value analysis of big data products
Customer value analysis of big data products
 
Hadoop and Your Enterprise Data Warehouse
Hadoop and Your Enterprise Data WarehouseHadoop and Your Enterprise Data Warehouse
Hadoop and Your Enterprise Data Warehouse
 
ADV Slides: When and How Data Lakes Fit into a Modern Data Architecture
ADV Slides: When and How Data Lakes Fit into a Modern Data ArchitectureADV Slides: When and How Data Lakes Fit into a Modern Data Architecture
ADV Slides: When and How Data Lakes Fit into a Modern Data Architecture
 
Accelerate Self-Service Analytics with Virtualization and Visualisation (Thai)
Accelerate Self-Service Analytics with Virtualization and Visualisation (Thai)Accelerate Self-Service Analytics with Virtualization and Visualisation (Thai)
Accelerate Self-Service Analytics with Virtualization and Visualisation (Thai)
 
Neo4j GraphDay Seattle- Sept19- Connected data imperative
Neo4j GraphDay Seattle- Sept19- Connected data imperativeNeo4j GraphDay Seattle- Sept19- Connected data imperative
Neo4j GraphDay Seattle- Sept19- Connected data imperative
 
Logitech - LOGITECH ACCELERATES CLOUD ANALYTICS USING DATA VIRTUALIZATION
Logitech - LOGITECH ACCELERATES CLOUD ANALYTICS USING DATA VIRTUALIZATIONLogitech - LOGITECH ACCELERATES CLOUD ANALYTICS USING DATA VIRTUALIZATION
Logitech - LOGITECH ACCELERATES CLOUD ANALYTICS USING DATA VIRTUALIZATION
 
Harnessing Big Data_UCLA
Harnessing Big Data_UCLAHarnessing Big Data_UCLA
Harnessing Big Data_UCLA
 
Qo Introduction V2
Qo Introduction V2Qo Introduction V2
Qo Introduction V2
 
How PepsiCo's Big Data Strategy is Disrupting CPG Retail Analytics
How PepsiCo's Big Data Strategy is Disrupting CPG Retail AnalyticsHow PepsiCo's Big Data Strategy is Disrupting CPG Retail Analytics
How PepsiCo's Big Data Strategy is Disrupting CPG Retail Analytics
 
Maximize cloud and application performance with hundreds of operations bridge...
Maximize cloud and application performance with hundreds of operations bridge...Maximize cloud and application performance with hundreds of operations bridge...
Maximize cloud and application performance with hundreds of operations bridge...
 
Digital Business Transformation for Energy & Utility company
Digital Business Transformation for Energy & Utility companyDigital Business Transformation for Energy & Utility company
Digital Business Transformation for Energy & Utility company
 
Introduction to Data Engineering
Introduction to Data EngineeringIntroduction to Data Engineering
Introduction to Data Engineering
 
Hadoop Boosts Profits in Media and Telecom Industry
Hadoop Boosts Profits in Media and Telecom IndustryHadoop Boosts Profits in Media and Telecom Industry
Hadoop Boosts Profits in Media and Telecom Industry
 

Recently uploaded

"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii SoldatenkoFwdays
 
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 3652toLead Limited
 
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr LapshynFwdays
 
Key Features Of Token Development (1).pptx
Key  Features Of Token  Development (1).pptxKey  Features Of Token  Development (1).pptx
Key Features Of Token Development (1).pptxLBM Solutions
 
Transcript: New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024BookNet Canada
 
Unblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesUnblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesSinan KOZAK
 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubKalema Edgar
 
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...shyamraj55
 
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsKotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticscarlostorres15106
 
Science&tech:THE INFORMATION AGE STS.pdf
Science&tech:THE INFORMATION AGE STS.pdfScience&tech:THE INFORMATION AGE STS.pdf
Science&tech:THE INFORMATION AGE STS.pdfjimielynbastida
 
Artificial intelligence in the post-deep learning era
Artificial intelligence in the post-deep learning eraArtificial intelligence in the post-deep learning era
Artificial intelligence in the post-deep learning eraDeakin University
 
APIForce Zurich 5 April Automation LPDG
APIForce Zurich 5 April  Automation LPDGAPIForce Zurich 5 April  Automation LPDG
APIForce Zurich 5 April Automation LPDGMarianaLemus7
 
Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsScanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsRizwan Syed
 
New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024BookNet Canada
 
SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024Lorenzo Miniero
 
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Patryk Bandurski
 
Vertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering TipsVertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering TipsMiki Katsuragi
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationSlibray Presentation
 

Recently uploaded (20)

"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko
 
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
 
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
 
Key Features Of Token Development (1).pptx
Key  Features Of Token  Development (1).pptxKey  Features Of Token  Development (1).pptx
Key Features Of Token Development (1).pptx
 
Transcript: New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
 
DMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special EditionDMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special Edition
 
Unblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesUnblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen Frames
 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding Club
 
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
 
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsKotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
 
Science&tech:THE INFORMATION AGE STS.pdf
Science&tech:THE INFORMATION AGE STS.pdfScience&tech:THE INFORMATION AGE STS.pdf
Science&tech:THE INFORMATION AGE STS.pdf
 
Artificial intelligence in the post-deep learning era
Artificial intelligence in the post-deep learning eraArtificial intelligence in the post-deep learning era
Artificial intelligence in the post-deep learning era
 
APIForce Zurich 5 April Automation LPDG
APIForce Zurich 5 April  Automation LPDGAPIForce Zurich 5 April  Automation LPDG
APIForce Zurich 5 April Automation LPDG
 
Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsScanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL Certs
 
New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
 
SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024
 
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
 
Vertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering TipsVertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering Tips
 
Hot Sexy call girls in Panjabi Bagh 🔝 9953056974 🔝 Delhi escort Service
Hot Sexy call girls in Panjabi Bagh 🔝 9953056974 🔝 Delhi escort ServiceHot Sexy call girls in Panjabi Bagh 🔝 9953056974 🔝 Delhi escort Service
Hot Sexy call girls in Panjabi Bagh 🔝 9953056974 🔝 Delhi escort Service
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck Presentation
 

Igniting Audience Measurement at Time Warner Cable

  • 1. IGNITING AUDIENCE MEASUREMENT AT TIME WARNER CABLE TIM CASE
  • 2. Agenda • Who is Time Warner Cable & Time Warner Cable Media • What is Audience Measurement? • Challenges With Legacy Architecture • Next Generation Architecture • Lessons Learned 1
  • 3. Who Am I? • 10+ years in E-commerce • Focused on Data Warehousing for the last 5 years • Certifications – Cloudera Certified Administrator for Apache Hadoop (CCAH) – Cloudera Certified Developer for Apache Hadoop (CCDH) – Teradata Certified Professional – IBM Certified Specialist - PureData System for Analytics – Tableau Server Certified Professional – MicroStrategy Certified Engineering Principal – Certified ScrumMaster – Certified SAFe Agilist • College sports fan – Go Noles! 2
  • 4. Time Warner Cable & Time Warner Cable Media Time Warner Cable is among the largest providers of video, high- speed data and voice services in the U.S., connecting more than 15 million customers to entertainment, information and each other • Serves customers in 29 states • More than 50,000 employees across the U.S. Time Warner Cable Media, the advertising arm of Time Warner Cable, provides national, regional and local marketers and agencies with innovative, strategic and cost effective advertising solutions. 3
  • 5. The Audience Measurement platforms enables census reporting of subscriber viewership and allows us to answer the Five W’s Who is watching? – Anonymized demographics, consumer behaviors What are they watching? – Station, program information, advertisements When are they watching? – Day of week, daypart, time-shifted Where are they watching? – Set-top box, TWC TV apps Why are they watching? – Program metadata What is Audience Measurement? 4
  • 6. Viewership Data • Set-top box – Processing more than 500 million events per day • Largest table is Program Tuning Event Fact  75 TB of raw data  180+ Billion records • TWC TV app (iPad, iPhone, Android, Xbox, etc.) • Video On Demand (VOD) Ads Data • TWC Media and 3rd party spots Reference Data • Household demographics • Program data • Automotive data • Political affiliation 5 Heavy Analytical Users 200 Audience Finder Users 50 Tableau Consumers 5 Audience Measurement By the Numbers
  • 7. • Around 100 Tableau Workbooks – Authored by the business and IT • Numerous ad hoc queries 6 Video Viewership Analyzer (VVA)
  • 8. • Custom application that enables complex audience definition by the user community – Date range – Geography (DMA, Ad Zone or Zip Code) – Platform (Classic, IPTV) – Audience Definition • Daypart • Station and/or Program • Demographics (includes line-of-business, propensities, Tribes and automotive) • Platform usage (VOD, IPTV, high-speed data) • Custom segmentation • Output includes ranked list of stations and some high-level metrics 7 Audience Finder
  • 11. • 3rd Party application ingests raw data and performs anonymization, correlation and some enrichment/mediation • TWC ingests files provided by 3rd Party and performs additional enrichment as well as applying business rules and stitching logic – Executed in Netezza using SQL and shell scripts • Two Netezza appliances – TwinFin 36 used for ELT processing – TwinFin 72 used for BI and customer-facing workloads 10 Legacy Platform Architecture Source Data TWC Media Business Logic Stitching Filtering Zombie Logic Core Logic Anonymization Correlation Mediation Enrichment
  • 12. Collection •Inconsistency around reliability and availability of source and reference data Processing •Slow catch up process •Arch does not promote speed to market for new features Data Storage + Delivery •Platform instability •Does not support concurrent users Analysis + Presentation •Limited exploration and interactive capabilities Challenges With Legacy Architecture 11 • SLA’s for T-3 and T-14 • Frequency of reprocessing • Reference data quality • Duration of reprocessing • Team Velocity when introducing ETL changes • Platform availability • Query response times • Response time SLA’s during mixed workload • User satisfaction w/ the interface • Customer dependency on IT for changes Metrics to Assess
  • 13. Technical Criteria • Performance • Supports batch and streaming • Leverage software engineering patterns • Open source momentum • “-ilities” – Scalability – Elasticity – Availability – Durability – Extensibility • Enables DevOps to compliment Agile adoption – Automated testing – Test-driven Development (TDD) – Continuous Integration (CI) • Strong foundation for Data Lake 12
  • 14. Data Warehouse Event Persistence Hadoop Visualization Data Integration Apache Spark is a more appropriate solution for set-top box processing logic: Reduces complexity, simplifies code maintenance, improves defect resolution time, improves run-time. Can be applied in batch or near real-time with modest changes which positions for T-x data availability (where ‘x’ is only limited by the availability of reference data) Enables use of Agile development principles (test-driven development and continuous integration) there by Improving time-to-market, code quality, and radically reducing QA costs and time. Hadoop/HDFS for storing large historical data positions the organization to leverage the evolving open source big data analytics technologies (machine learning, SQL on Hadoop, graph processing, etc.) Teradata will allow for large volumes of tuning event data to be secure, easily accessible, and highly available to large numbers of users and at reasonable cost. Tableau enables self-service analytics, including advanced algorithms, against the audience measurement data, then present information to various consumers in meaningful ways. Kafka is a high-performance, fault-tolerant, real-time messaging platform that will allow us to keep a history of tuning events for faster reprocessing. This component is critical once we are performing near real-time streaming of events. 13 Technologies Selected
  • 15. Core Logic Anonymization Correlation Mediation Enrichment TWC Media Business Logic Stitching Filtering Zombie Logic Initial Nextgen Architecture Replace MicroStrategy with Tableau to enable self-service Replace Netezza for customer facing workloads with Teradata, improving platform stability, enabling sandboxes (e.g., Data Labs) and workload management tools which assist in managing to performance SLA’s o Replace 3rd Party application and Netezza ELT with Spark for Collection and Processing logic (anonymization , correlation, enrichment, filtering, stitching, & zombie logic) Source Data 14
  • 16. Long-Term Architecture • Implement an enterprise Data Lake to enable non-Media use cases • Migrate to Spark Streaming and Kafka to enable near real-time use cases • Evaluate dedicated infrastructure for more predictable performance Event Data Business Logic Stitching Filtering Zombie Logic Data Lake Anonymization Correlation Mediation Enrichment 15 Reference Data
  • 17. Lessons Learned • Have Executive support • Infrastructure is critical – Node sizes – Network • Leverage the open source community – Enhancements – Extensions (Spark Packages) • Talent is hard to find – Consider abstractions 16