SlideShare a Scribd company logo
1 of 23
Targeted Marketing
How Marketing Companies can use Big Data to Target Customers
Based on Location & Certain Demographics
Confidential – Restricted Slide 2
Who @m I?
Confidential – Restricted Slide 3
Who we @re?
BA, short for Business Analytics, is the technology used by companies to investigate business performance by
analyzing their data. BA encompasses three major areas Business Intelligence (BI), Enterprise Performance
Management (EPM) and Big Data Analytics. BA in this instance indicates that we will go beyond EPM and also
address topics/technologies that pertain to BI and Big Data.
Wikipedia describes soapbox as a raised platform on which one stands to make an impromptu speech, often
about a political subject. The term originates from the days when speakers would elevate themselves by
standing on a wooden crate originally used for shipment of soap or other dry goods. I want this is be not
only a place for me to share tips and tricks, but also a platform where industry practitioners can come to
share their views and expertise.
I am not an expert! I say this because I have had the pleasure through the years of working with people I would
consider experts in specific products. Instead, I see myself as a data integration specialist or enthusiast if you
will. I am someone who is passionate about the field and have made considerable investment in the space.
If you are like me or find the thought of maintaining a website a bit daunting, this may be your opportunity to
share your thoughts and educate others.
So if you have a topic that you would like to contribute, please let me know at BASoapbox.com.
Confidential – Restricted Slide 4
The Abstract
This presentation will show how an outdoor advertising company used the Oracle Big Data environment
to provide real time statistics and high value insights to their customers. Using data from providers such
as Pinsight Media and Perconix data from Acxiom, they are able to accurately show the demographics
of the consumers in the viewshed of their billboards and other digital assets. With the need to get
useful information out of the terabyte of data they were receiving, our client use Oracle BDCS,
specifically, Hive to create external tables connecting to flat files or MongoDB and Impala to analyze the
data.
The data was then loaded into Oracle DBCS for to be accessed by OACS for further analysis and
dashboarding.
In the session, I will:
 Go over the overall objective of the project
 Go over some of the key Hive and Impala queries
 Show how ODICS was instrumental with the automation, but also with the interaction
with MongoDB
 Go over some of the lessons learned
Confidential – Restricted Slide 5
The Big Question
Can we identify by audience size and segment the consumer population exposed to a billboard by
time of day and day of week?
12:00 AM 11:59 PM7:00 AM 12:00 PM 5:00 PM
Out of the billion of aggregated anonymized observed devices in a geographical location per day:
1. We need curated assets (billboards)
2. We need to get a total count of the devices we observe within the viewshed of our assets
(billboards)
3. We need to take a look at home and secondary locations to define the segmentation of the
consumers near our assets (billboards)
Confidential – Restricted Slide 6
Viewshed Impressions
Once a viewshed is constructed, an
impression then becomes the
intersection of consumers with the
asset.
Wikipedia describes viewshed as the geographical area that is visible from a location. It includes all
surrounding points that are in line-of-sight with that location and excludes points that are beyond
the horizon or obstructed by terrain and other features (e.g., buildings, trees).
Confidential – Restricted Slide 7
A Few Key Terms…
Geofence as
Polygon
Pivot
Dwell Time
Route
Lat/Lon
Asset
(Digital Billboard)
x
Lat/Lon
Lat/Lon
Lat/LonLat/Lon
Lat/Lon
Lat/Lon
Visit
Term Definition
Pivot
When a user cross a polygon or geofence
boundary (i.e. New Jersey to New York City).
Dwell Time
When a user cross a polygon or geofence
boundary. New Jersey to New York City. SoHo
to LES. Outside to Inside.
Visit
dwell time at a location that is material to
intent
Geofence
A designated boundary around a geometry
that, if crossed, initiates a notification.
Geofences are often used in real-time route
Web applications.
Polygon
On a map, a closed shape defined by a
connected sequence of x,y coordinate pairs,
where the first and last coordinate pair are the
same and all other pairs are unique.
Confidential – Restricted Slide 8
Final Answer
9 AM to 10
AM Work
Learn
Shop & Eat
Play
Audience
Identity
Audience
Behavioral
Graph
Audience
Size
Time
• Friday 9 AM to 10 AM
Audience Size
• 10,432 impressions
Audience Segmentation
• 42% college crowd
• 38 % mobile mixers
• 20% city mixers
For impression seen at certain time, say between 9 AM to 10 AM, we are able to travel
backward and forward in time to understand home location type and secondary location
type. This all informs audience segmentation.
Confidential – Restricted Slide 9
Primary Data Sources
• Owned by Sprint
• Parquet file representing the observed population pivoting through
Assets’ viewsheds.
• Contains FIPS/Block Group data for each viewshed impression
FIPS / Block Group
• Data which will provide personal characteristics such as demographics, income, ethnicity, etc. to give us
profile of the users that are passing through each asset.
• PersonicX is a segmentation system which places U.S. households into one of 70 distinctive segments
and 21 life stage groups based upon specific consumer behavior and demographic characteristics.
• Asset inventory are maintained in a MongoDB application called GSP
• GSP holds Point of Interest (POI) data, asset data (e.g., location of asset, latitude, and longitude),
geospatial data, real estate, wireless, transit, and taxonomy
• Output data from Big Data Cloud Service (BDCS) will be loaded back into GSP
Confidential – Restricted Slide 10
Data Flow
+Big Data Manager
ODI to orchestrate Big Data Manager (BDM). BDM will be used to move Pinsight data from AWS S3
to BDCS
+Big Data Manager
ODI to orchestrate Big Data Manager (BDM). BDM will be used to move files from Acxiom to BDCS.
ODI to orchestrate mongodump and mongoimport functions to load Asset information from
MongoDB to BDCS.
Confidential – Restricted Slide 11
Big Data Manager – Data Pull from S3 to HDFS
Confidential – Restricted Slide 12
Oracle Data Integrator (ODI)
• ODI was used in the following
ways on the project:
• As a means to orchestrate
certain processes
• Performing E-L-T type
integrations
• The use of ODI made it easier to
not only streamline, but
automate certain processes.
ODI Process Orchestration
• This insert shows how we can have ODI perform
the follow orchestration, in a streamlined and
automated fashion:
1. Call the EDQ Workflow that will do the IMS data
cleansing.
2. If the process fails, then email the responsible
parties. Otherwise, call the EDQ Workflow that will
export the IMS (cleansed) data from EDQ to staging
DB or flat file format
3. Email responsible parties that all processes ran
successfully
Confidential – Restricted Slide 14
The Data Lake
Big Data Cloud Service (BDCS) was used as a central data repository, divided into ponds for raw
data, canonical model and semantic layer. Large cellular data from Pinsight, consumer insights
from Acxiom and asset data from MongoDB were loaded into the Hive table in the raw data pond.
 Data Ingestion Pond
 Data is stored as-is
 Recent History available on HDFS
 Older file moved to Oracle Object Storage
 Common Meanings
 Additional standardized field and values
 Optimized storage format
 Ideal layer for exploratory data analysis (EDA)
 Business Ready
 Finalized Model
 Data ready for consumption
 Conforming to the business’ needs
Acxiom MongoDB Pinsight
Confidential – Restricted Slide 15
Big Data Cloud Service (BDCS)
Confidential – Restricted Slide 16
Pinsight Sample File
Confidential – Restricted Slide 17
Acxiom Sample File (Demographics)
FIPS / Block Group
Confidential – Restricted Slide 18
Acxiom Sample File 1. ROW FORMAT SERDE "org.apache.hadoop.hive.serde2.RegexSerDe" Since the
file has a fixed format, we use a SerDe to tell hive to use the RegexSerDe class
to serialize and deserialize.
2. "input.regex" is used to deserialize the rows read from the table data. So this
regex pattern is applied to the row value read from the file to split up into
different columns defined in the meta data for this hive table.
3. "output.format.string" is used to serialize the rows being written out to this
table data. This value is used as a format to generate a row value (from its
column values) that is to be written back to the output file for this hive table.
Regex Description
. any character except for line break
{n} n number of repetition of code
that precedes it
Confidential – Restricted Slide 19
Insert Query
Confidential – Restricted Slide 20
Impala Query
Confidential – Restricted Slide 21
Loading Fact Table p_acxperconicsfact
Confidential – Restricted Slide 22
Output to MongoDB
Confidential – Restricted Slide 23
The Final Output

More Related Content

What's hot

Digital Transformation and the Journey to a Highly Connected Enterprise
Digital Transformation and the Journey to a Highly Connected EnterpriseDigital Transformation and the Journey to a Highly Connected Enterprise
Digital Transformation and the Journey to a Highly Connected EnterpriseNeo4j
 
Using Regular Expressions in Document Management Data Capture and Indexing
Using Regular Expressions in Document Management Data Capture and IndexingUsing Regular Expressions in Document Management Data Capture and Indexing
Using Regular Expressions in Document Management Data Capture and IndexingSandy Schiele
 
2014 10 09 Top reasons to use IBM BigInsights as your Big Data Hadoop system
2014 10 09 Top reasons to use IBM BigInsights as your Big Data Hadoop system2014 10 09 Top reasons to use IBM BigInsights as your Big Data Hadoop system
2014 10 09 Top reasons to use IBM BigInsights as your Big Data Hadoop systemToby Woolfe
 
Data Virtualization: Introduction and Business Value (UK)
Data Virtualization: Introduction and Business Value (UK)Data Virtualization: Introduction and Business Value (UK)
Data Virtualization: Introduction and Business Value (UK)Denodo
 
Requirements document for big data use cases
Requirements document for big data use casesRequirements document for big data use cases
Requirements document for big data use casesAllied Consultants
 
GraphChain
GraphChainGraphChain
GraphChainsopekmir
 
Best Practices: Data Virtualization Perspectives and Best Practices
Best Practices: Data Virtualization Perspectives and Best PracticesBest Practices: Data Virtualization Perspectives and Best Practices
Best Practices: Data Virtualization Perspectives and Best PracticesDenodo
 
How Semantics Solves Big Data Challenges
How Semantics Solves Big Data ChallengesHow Semantics Solves Big Data Challenges
How Semantics Solves Big Data ChallengesDATAVERSITY
 
Future of Data Strategy
Future of Data StrategyFuture of Data Strategy
Future of Data StrategyDenodo
 
Data Catalog in Denodo Platform 7.0: Creating a Data Marketplace with Data Vi...
Data Catalog in Denodo Platform 7.0: Creating a Data Marketplace with Data Vi...Data Catalog in Denodo Platform 7.0: Creating a Data Marketplace with Data Vi...
Data Catalog in Denodo Platform 7.0: Creating a Data Marketplace with Data Vi...Denodo
 
How do You Graph
How do You GraphHow do You Graph
How do You GraphBen Krug
 
Big Data: Technical Introduction to BigSheets for InfoSphere BigInsights
Big Data:  Technical Introduction to BigSheets for InfoSphere BigInsightsBig Data:  Technical Introduction to BigSheets for InfoSphere BigInsights
Big Data: Technical Introduction to BigSheets for InfoSphere BigInsightsCynthia Saracco
 
From Knowledge Graphs to AI-powered SEO: Using taxonomies, schemas and knowle...
From Knowledge Graphs to AI-powered SEO: Using taxonomies, schemas and knowle...From Knowledge Graphs to AI-powered SEO: Using taxonomies, schemas and knowle...
From Knowledge Graphs to AI-powered SEO: Using taxonomies, schemas and knowle...Connected Data World
 
Self-service consumption Data Catalog
Self-service consumption Data CatalogSelf-service consumption Data Catalog
Self-service consumption Data CatalogDenodo
 
[db tech showcase Tokyo 2018] #dbts2018 #B38 『Big Data and the Multi-model Da...
[db tech showcase Tokyo 2018] #dbts2018 #B38 『Big Data and the Multi-model Da...[db tech showcase Tokyo 2018] #dbts2018 #B38 『Big Data and the Multi-model Da...
[db tech showcase Tokyo 2018] #dbts2018 #B38 『Big Data and the Multi-model Da...Insight Technology, Inc.
 
MongoDB on Financial Services Sector
MongoDB on Financial Services SectorMongoDB on Financial Services Sector
MongoDB on Financial Services SectorNorberto Leite
 
LEI.INFO and The ideas for LEI system
LEI.INFO and The ideas for LEI systemLEI.INFO and The ideas for LEI system
LEI.INFO and The ideas for LEI systemsopekmir
 
Structured Document Search and Retrieval
Structured Document Search and RetrievalStructured Document Search and Retrieval
Structured Document Search and RetrievalOptum
 
Bringing Geospatial Business Intelligence to the Enterprise
Bringing Geospatial Business Intelligenceto the EnterpriseBringing Geospatial Business Intelligenceto the Enterprise
Bringing Geospatial Business Intelligence to the Enterprisemkarren
 

What's hot (20)

Digital Transformation and the Journey to a Highly Connected Enterprise
Digital Transformation and the Journey to a Highly Connected EnterpriseDigital Transformation and the Journey to a Highly Connected Enterprise
Digital Transformation and the Journey to a Highly Connected Enterprise
 
Using Regular Expressions in Document Management Data Capture and Indexing
Using Regular Expressions in Document Management Data Capture and IndexingUsing Regular Expressions in Document Management Data Capture and Indexing
Using Regular Expressions in Document Management Data Capture and Indexing
 
2014 10 09 Top reasons to use IBM BigInsights as your Big Data Hadoop system
2014 10 09 Top reasons to use IBM BigInsights as your Big Data Hadoop system2014 10 09 Top reasons to use IBM BigInsights as your Big Data Hadoop system
2014 10 09 Top reasons to use IBM BigInsights as your Big Data Hadoop system
 
Data Virtualization: Introduction and Business Value (UK)
Data Virtualization: Introduction and Business Value (UK)Data Virtualization: Introduction and Business Value (UK)
Data Virtualization: Introduction and Business Value (UK)
 
Requirements document for big data use cases
Requirements document for big data use casesRequirements document for big data use cases
Requirements document for big data use cases
 
GraphChain
GraphChainGraphChain
GraphChain
 
Best Practices: Data Virtualization Perspectives and Best Practices
Best Practices: Data Virtualization Perspectives and Best PracticesBest Practices: Data Virtualization Perspectives and Best Practices
Best Practices: Data Virtualization Perspectives and Best Practices
 
How Semantics Solves Big Data Challenges
How Semantics Solves Big Data ChallengesHow Semantics Solves Big Data Challenges
How Semantics Solves Big Data Challenges
 
Future of Data Strategy
Future of Data StrategyFuture of Data Strategy
Future of Data Strategy
 
Data Catalog in Denodo Platform 7.0: Creating a Data Marketplace with Data Vi...
Data Catalog in Denodo Platform 7.0: Creating a Data Marketplace with Data Vi...Data Catalog in Denodo Platform 7.0: Creating a Data Marketplace with Data Vi...
Data Catalog in Denodo Platform 7.0: Creating a Data Marketplace with Data Vi...
 
DATA WAREHOUSING AND DATA MINING
DATA WAREHOUSING AND DATA MININGDATA WAREHOUSING AND DATA MINING
DATA WAREHOUSING AND DATA MINING
 
How do You Graph
How do You GraphHow do You Graph
How do You Graph
 
Big Data: Technical Introduction to BigSheets for InfoSphere BigInsights
Big Data:  Technical Introduction to BigSheets for InfoSphere BigInsightsBig Data:  Technical Introduction to BigSheets for InfoSphere BigInsights
Big Data: Technical Introduction to BigSheets for InfoSphere BigInsights
 
From Knowledge Graphs to AI-powered SEO: Using taxonomies, schemas and knowle...
From Knowledge Graphs to AI-powered SEO: Using taxonomies, schemas and knowle...From Knowledge Graphs to AI-powered SEO: Using taxonomies, schemas and knowle...
From Knowledge Graphs to AI-powered SEO: Using taxonomies, schemas and knowle...
 
Self-service consumption Data Catalog
Self-service consumption Data CatalogSelf-service consumption Data Catalog
Self-service consumption Data Catalog
 
[db tech showcase Tokyo 2018] #dbts2018 #B38 『Big Data and the Multi-model Da...
[db tech showcase Tokyo 2018] #dbts2018 #B38 『Big Data and the Multi-model Da...[db tech showcase Tokyo 2018] #dbts2018 #B38 『Big Data and the Multi-model Da...
[db tech showcase Tokyo 2018] #dbts2018 #B38 『Big Data and the Multi-model Da...
 
MongoDB on Financial Services Sector
MongoDB on Financial Services SectorMongoDB on Financial Services Sector
MongoDB on Financial Services Sector
 
LEI.INFO and The ideas for LEI system
LEI.INFO and The ideas for LEI systemLEI.INFO and The ideas for LEI system
LEI.INFO and The ideas for LEI system
 
Structured Document Search and Retrieval
Structured Document Search and RetrievalStructured Document Search and Retrieval
Structured Document Search and Retrieval
 
Bringing Geospatial Business Intelligence to the Enterprise
Bringing Geospatial Business Intelligenceto the EnterpriseBringing Geospatial Business Intelligenceto the Enterprise
Bringing Geospatial Business Intelligence to the Enterprise
 

Similar to Targeted Marketing: How Marketing Companies can use Big Data to Target Customers Based on Location & Certain Demographics

The Value of Customer Insights & Analytics in a Modern Retail Environment
The Value of Customer Insights & Analytics in a Modern Retail EnvironmentThe Value of Customer Insights & Analytics in a Modern Retail Environment
The Value of Customer Insights & Analytics in a Modern Retail EnvironmentDenodo
 
OLAP Cubes in Datawarehousing
OLAP Cubes in DatawarehousingOLAP Cubes in Datawarehousing
OLAP Cubes in DatawarehousingPrithwis Mukerjee
 
Running Data Platforms Like Products
Running Data Platforms Like ProductsRunning Data Platforms Like Products
Running Data Platforms Like ProductsVMware Tanzu
 
Cis 555 Week 4 Assignment 2 Automated Teller Machine (Atm)...
Cis 555 Week 4 Assignment 2 Automated Teller Machine (Atm)...Cis 555 Week 4 Assignment 2 Automated Teller Machine (Atm)...
Cis 555 Week 4 Assignment 2 Automated Teller Machine (Atm)...Karen Thompson
 
Dr. Christian Kurze from Denodo, "Data Virtualization: Fulfilling the Promise...
Dr. Christian Kurze from Denodo, "Data Virtualization: Fulfilling the Promise...Dr. Christian Kurze from Denodo, "Data Virtualization: Fulfilling the Promise...
Dr. Christian Kurze from Denodo, "Data Virtualization: Fulfilling the Promise...Dataconomy Media
 
Augmentation, Collaboration, Governance: Defining the Future of Self-Service BI
Augmentation, Collaboration, Governance: Defining the Future of Self-Service BIAugmentation, Collaboration, Governance: Defining the Future of Self-Service BI
Augmentation, Collaboration, Governance: Defining the Future of Self-Service BIDenodo
 
Why Data Virtualization? An Introduction
Why Data Virtualization? An IntroductionWhy Data Virtualization? An Introduction
Why Data Virtualization? An IntroductionDenodo
 
Bridging the Last Mile: Getting Data to the People Who Need It (APAC)
Bridging the Last Mile: Getting Data to the People Who Need It (APAC)Bridging the Last Mile: Getting Data to the People Who Need It (APAC)
Bridging the Last Mile: Getting Data to the People Who Need It (APAC)Denodo
 
Data Virtualization: From Zero to Hero
Data Virtualization: From Zero to HeroData Virtualization: From Zero to Hero
Data Virtualization: From Zero to HeroDenodo
 
Denodo Partner Connect: A Review of the Top 5 Differentiated Use Cases for th...
Denodo Partner Connect: A Review of the Top 5 Differentiated Use Cases for th...Denodo Partner Connect: A Review of the Top 5 Differentiated Use Cases for th...
Denodo Partner Connect: A Review of the Top 5 Differentiated Use Cases for th...Denodo
 
Advanced Analytics and Machine Learning with Data Virtualization
Advanced Analytics and Machine Learning with Data VirtualizationAdvanced Analytics and Machine Learning with Data Virtualization
Advanced Analytics and Machine Learning with Data VirtualizationDenodo
 
Product Keynote: Denodo 8.0 - A Logical Data Fabric for the Intelligent Enter...
Product Keynote: Denodo 8.0 - A Logical Data Fabric for the Intelligent Enter...Product Keynote: Denodo 8.0 - A Logical Data Fabric for the Intelligent Enter...
Product Keynote: Denodo 8.0 - A Logical Data Fabric for the Intelligent Enter...Denodo
 
Data Architecture Process in a BI environment
Data Architecture Process in a BI environmentData Architecture Process in a BI environment
Data Architecture Process in a BI environmentSasha Citino
 
IBM Cognos tutorial - ABC LEARN
IBM Cognos tutorial - ABC LEARNIBM Cognos tutorial - ABC LEARN
IBM Cognos tutorial - ABC LEARNabclearnn
 
Empowering your Enterprise with a Self-Service Data Marketplace (EMEA)
Empowering your Enterprise with a Self-Service Data Marketplace (EMEA)Empowering your Enterprise with a Self-Service Data Marketplace (EMEA)
Empowering your Enterprise with a Self-Service Data Marketplace (EMEA)Denodo
 
Analyst Webinar: Enabling a Customer Data Platform Using Data Virtualization
Analyst Webinar: Enabling a Customer Data Platform Using Data VirtualizationAnalyst Webinar: Enabling a Customer Data Platform Using Data Virtualization
Analyst Webinar: Enabling a Customer Data Platform Using Data VirtualizationDenodo
 
MongoDB .local Toronto 2019: MongoDB – Powering the new age data demands
MongoDB .local Toronto 2019: MongoDB – Powering the new age data demandsMongoDB .local Toronto 2019: MongoDB – Powering the new age data demands
MongoDB .local Toronto 2019: MongoDB – Powering the new age data demandsMongoDB
 
Denodo DataFest 2016: Data Science: Operationalizing Analytical Models in Rea...
Denodo DataFest 2016: Data Science: Operationalizing Analytical Models in Rea...Denodo DataFest 2016: Data Science: Operationalizing Analytical Models in Rea...
Denodo DataFest 2016: Data Science: Operationalizing Analytical Models in Rea...Denodo
 

Similar to Targeted Marketing: How Marketing Companies can use Big Data to Target Customers Based on Location & Certain Demographics (20)

BigData Analysis
BigData AnalysisBigData Analysis
BigData Analysis
 
The Value of Customer Insights & Analytics in a Modern Retail Environment
The Value of Customer Insights & Analytics in a Modern Retail EnvironmentThe Value of Customer Insights & Analytics in a Modern Retail Environment
The Value of Customer Insights & Analytics in a Modern Retail Environment
 
OLAP Cubes in Datawarehousing
OLAP Cubes in DatawarehousingOLAP Cubes in Datawarehousing
OLAP Cubes in Datawarehousing
 
Running Data Platforms Like Products
Running Data Platforms Like ProductsRunning Data Platforms Like Products
Running Data Platforms Like Products
 
Cis 555 Week 4 Assignment 2 Automated Teller Machine (Atm)...
Cis 555 Week 4 Assignment 2 Automated Teller Machine (Atm)...Cis 555 Week 4 Assignment 2 Automated Teller Machine (Atm)...
Cis 555 Week 4 Assignment 2 Automated Teller Machine (Atm)...
 
Dr. Christian Kurze from Denodo, "Data Virtualization: Fulfilling the Promise...
Dr. Christian Kurze from Denodo, "Data Virtualization: Fulfilling the Promise...Dr. Christian Kurze from Denodo, "Data Virtualization: Fulfilling the Promise...
Dr. Christian Kurze from Denodo, "Data Virtualization: Fulfilling the Promise...
 
Augmentation, Collaboration, Governance: Defining the Future of Self-Service BI
Augmentation, Collaboration, Governance: Defining the Future of Self-Service BIAugmentation, Collaboration, Governance: Defining the Future of Self-Service BI
Augmentation, Collaboration, Governance: Defining the Future of Self-Service BI
 
Why Data Virtualization? An Introduction
Why Data Virtualization? An IntroductionWhy Data Virtualization? An Introduction
Why Data Virtualization? An Introduction
 
Bridging the Last Mile: Getting Data to the People Who Need It (APAC)
Bridging the Last Mile: Getting Data to the People Who Need It (APAC)Bridging the Last Mile: Getting Data to the People Who Need It (APAC)
Bridging the Last Mile: Getting Data to the People Who Need It (APAC)
 
Data Virtualization: From Zero to Hero
Data Virtualization: From Zero to HeroData Virtualization: From Zero to Hero
Data Virtualization: From Zero to Hero
 
Denodo Partner Connect: A Review of the Top 5 Differentiated Use Cases for th...
Denodo Partner Connect: A Review of the Top 5 Differentiated Use Cases for th...Denodo Partner Connect: A Review of the Top 5 Differentiated Use Cases for th...
Denodo Partner Connect: A Review of the Top 5 Differentiated Use Cases for th...
 
SAP BI/BW
SAP BI/BWSAP BI/BW
SAP BI/BW
 
Advanced Analytics and Machine Learning with Data Virtualization
Advanced Analytics and Machine Learning with Data VirtualizationAdvanced Analytics and Machine Learning with Data Virtualization
Advanced Analytics and Machine Learning with Data Virtualization
 
Product Keynote: Denodo 8.0 - A Logical Data Fabric for the Intelligent Enter...
Product Keynote: Denodo 8.0 - A Logical Data Fabric for the Intelligent Enter...Product Keynote: Denodo 8.0 - A Logical Data Fabric for the Intelligent Enter...
Product Keynote: Denodo 8.0 - A Logical Data Fabric for the Intelligent Enter...
 
Data Architecture Process in a BI environment
Data Architecture Process in a BI environmentData Architecture Process in a BI environment
Data Architecture Process in a BI environment
 
IBM Cognos tutorial - ABC LEARN
IBM Cognos tutorial - ABC LEARNIBM Cognos tutorial - ABC LEARN
IBM Cognos tutorial - ABC LEARN
 
Empowering your Enterprise with a Self-Service Data Marketplace (EMEA)
Empowering your Enterprise with a Self-Service Data Marketplace (EMEA)Empowering your Enterprise with a Self-Service Data Marketplace (EMEA)
Empowering your Enterprise with a Self-Service Data Marketplace (EMEA)
 
Analyst Webinar: Enabling a Customer Data Platform Using Data Virtualization
Analyst Webinar: Enabling a Customer Data Platform Using Data VirtualizationAnalyst Webinar: Enabling a Customer Data Platform Using Data Virtualization
Analyst Webinar: Enabling a Customer Data Platform Using Data Virtualization
 
MongoDB .local Toronto 2019: MongoDB – Powering the new age data demands
MongoDB .local Toronto 2019: MongoDB – Powering the new age data demandsMongoDB .local Toronto 2019: MongoDB – Powering the new age data demands
MongoDB .local Toronto 2019: MongoDB – Powering the new age data demands
 
Denodo DataFest 2016: Data Science: Operationalizing Analytical Models in Rea...
Denodo DataFest 2016: Data Science: Operationalizing Analytical Models in Rea...Denodo DataFest 2016: Data Science: Operationalizing Analytical Models in Rea...
Denodo DataFest 2016: Data Science: Operationalizing Analytical Models in Rea...
 

More from Ray Février

Analytics and Lakehouse Integration Options for Oracle Applications
Analytics and Lakehouse Integration Options for Oracle ApplicationsAnalytics and Lakehouse Integration Options for Oracle Applications
Analytics and Lakehouse Integration Options for Oracle ApplicationsRay Février
 
Strengthen your Foundations
Strengthen your FoundationsStrengthen your Foundations
Strengthen your FoundationsRay Février
 
FDMEE 11.1.2.4.200 Partner Meeting - May 2016
FDMEE 11.1.2.4.200 Partner Meeting - May 2016FDMEE 11.1.2.4.200 Partner Meeting - May 2016
FDMEE 11.1.2.4.200 Partner Meeting - May 2016Ray Février
 
Loading Smartlists into PBCS using FDMEE
Loading Smartlists into PBCS using FDMEELoading Smartlists into PBCS using FDMEE
Loading Smartlists into PBCS using FDMEERay Février
 
EPM, ERP, Cloud and On-Premise – All options explained - OOW CON9532
EPM, ERP, Cloud and On-Premise – All options explained - OOW CON9532EPM, ERP, Cloud and On-Premise – All options explained - OOW CON9532
EPM, ERP, Cloud and On-Premise – All options explained - OOW CON9532Ray Février
 
HPCM Management Ledger & FDMEE: The Perfect Partnership?
HPCM Management Ledger & FDMEE: The Perfect Partnership?HPCM Management Ledger & FDMEE: The Perfect Partnership?
HPCM Management Ledger & FDMEE: The Perfect Partnership?Ray Février
 

More from Ray Février (6)

Analytics and Lakehouse Integration Options for Oracle Applications
Analytics and Lakehouse Integration Options for Oracle ApplicationsAnalytics and Lakehouse Integration Options for Oracle Applications
Analytics and Lakehouse Integration Options for Oracle Applications
 
Strengthen your Foundations
Strengthen your FoundationsStrengthen your Foundations
Strengthen your Foundations
 
FDMEE 11.1.2.4.200 Partner Meeting - May 2016
FDMEE 11.1.2.4.200 Partner Meeting - May 2016FDMEE 11.1.2.4.200 Partner Meeting - May 2016
FDMEE 11.1.2.4.200 Partner Meeting - May 2016
 
Loading Smartlists into PBCS using FDMEE
Loading Smartlists into PBCS using FDMEELoading Smartlists into PBCS using FDMEE
Loading Smartlists into PBCS using FDMEE
 
EPM, ERP, Cloud and On-Premise – All options explained - OOW CON9532
EPM, ERP, Cloud and On-Premise – All options explained - OOW CON9532EPM, ERP, Cloud and On-Premise – All options explained - OOW CON9532
EPM, ERP, Cloud and On-Premise – All options explained - OOW CON9532
 
HPCM Management Ledger & FDMEE: The Perfect Partnership?
HPCM Management Ledger & FDMEE: The Perfect Partnership?HPCM Management Ledger & FDMEE: The Perfect Partnership?
HPCM Management Ledger & FDMEE: The Perfect Partnership?
 

Recently uploaded

ANCHORING SCRIPT FOR A CULTURAL EVENT.docx
ANCHORING SCRIPT FOR A CULTURAL EVENT.docxANCHORING SCRIPT FOR A CULTURAL EVENT.docx
ANCHORING SCRIPT FOR A CULTURAL EVENT.docxNikitaBankoti2
 
Re-membering the Bard: Revisiting The Compleat Wrks of Wllm Shkspr (Abridged)...
Re-membering the Bard: Revisiting The Compleat Wrks of Wllm Shkspr (Abridged)...Re-membering the Bard: Revisiting The Compleat Wrks of Wllm Shkspr (Abridged)...
Re-membering the Bard: Revisiting The Compleat Wrks of Wllm Shkspr (Abridged)...Hasting Chen
 
Chiulli_Aurora_Oman_Raffaele_Beowulf.pptx
Chiulli_Aurora_Oman_Raffaele_Beowulf.pptxChiulli_Aurora_Oman_Raffaele_Beowulf.pptx
Chiulli_Aurora_Oman_Raffaele_Beowulf.pptxraffaeleoman
 
Presentation on Engagement in Book Clubs
Presentation on Engagement in Book ClubsPresentation on Engagement in Book Clubs
Presentation on Engagement in Book Clubssamaasim06
 
Mohammad_Alnahdi_Oral_Presentation_Assignment.pptx
Mohammad_Alnahdi_Oral_Presentation_Assignment.pptxMohammad_Alnahdi_Oral_Presentation_Assignment.pptx
Mohammad_Alnahdi_Oral_Presentation_Assignment.pptxmohammadalnahdi22
 
SaaStr Workshop Wednesday w/ Lucas Price, Yardstick
SaaStr Workshop Wednesday w/ Lucas Price, YardstickSaaStr Workshop Wednesday w/ Lucas Price, Yardstick
SaaStr Workshop Wednesday w/ Lucas Price, Yardsticksaastr
 
AWS Data Engineer Associate (DEA-C01) Exam Dumps 2024.pdf
AWS Data Engineer Associate (DEA-C01) Exam Dumps 2024.pdfAWS Data Engineer Associate (DEA-C01) Exam Dumps 2024.pdf
AWS Data Engineer Associate (DEA-C01) Exam Dumps 2024.pdfSkillCertProExams
 
Governance and Nation-Building in Nigeria: Some Reflections on Options for Po...
Governance and Nation-Building in Nigeria: Some Reflections on Options for Po...Governance and Nation-Building in Nigeria: Some Reflections on Options for Po...
Governance and Nation-Building in Nigeria: Some Reflections on Options for Po...Kayode Fayemi
 
Report Writing Webinar Training
Report Writing Webinar TrainingReport Writing Webinar Training
Report Writing Webinar TrainingKylaCullinane
 
Thirunelveli call girls Tamil escorts 7877702510
Thirunelveli call girls Tamil escorts 7877702510Thirunelveli call girls Tamil escorts 7877702510
Thirunelveli call girls Tamil escorts 7877702510Vipesco
 
Air breathing and respiratory adaptations in diver animals
Air breathing and respiratory adaptations in diver animalsAir breathing and respiratory adaptations in diver animals
Air breathing and respiratory adaptations in diver animalsaqsarehman5055
 
If this Giant Must Walk: A Manifesto for a New Nigeria
If this Giant Must Walk: A Manifesto for a New NigeriaIf this Giant Must Walk: A Manifesto for a New Nigeria
If this Giant Must Walk: A Manifesto for a New NigeriaKayode Fayemi
 
Introduction to Prompt Engineering (Focusing on ChatGPT)
Introduction to Prompt Engineering (Focusing on ChatGPT)Introduction to Prompt Engineering (Focusing on ChatGPT)
Introduction to Prompt Engineering (Focusing on ChatGPT)Chameera Dedduwage
 
Dreaming Marissa Sánchez Music Video Treatment
Dreaming Marissa Sánchez Music Video TreatmentDreaming Marissa Sánchez Music Video Treatment
Dreaming Marissa Sánchez Music Video Treatmentnswingard
 
Busty Desi⚡Call Girls in Sector 51 Noida Escorts >༒8448380779 Escort Service-...
Busty Desi⚡Call Girls in Sector 51 Noida Escorts >༒8448380779 Escort Service-...Busty Desi⚡Call Girls in Sector 51 Noida Escorts >༒8448380779 Escort Service-...
Busty Desi⚡Call Girls in Sector 51 Noida Escorts >༒8448380779 Escort Service-...Delhi Call girls
 
BDSM⚡Call Girls in Sector 93 Noida Escorts >༒8448380779 Escort Service
BDSM⚡Call Girls in Sector 93 Noida Escorts >༒8448380779 Escort ServiceBDSM⚡Call Girls in Sector 93 Noida Escorts >༒8448380779 Escort Service
BDSM⚡Call Girls in Sector 93 Noida Escorts >༒8448380779 Escort ServiceDelhi Call girls
 
BDSM⚡Call Girls in Sector 97 Noida Escorts >༒8448380779 Escort Service
BDSM⚡Call Girls in Sector 97 Noida Escorts >༒8448380779 Escort ServiceBDSM⚡Call Girls in Sector 97 Noida Escorts >༒8448380779 Escort Service
BDSM⚡Call Girls in Sector 97 Noida Escorts >༒8448380779 Escort ServiceDelhi Call girls
 
Dreaming Music Video Treatment _ Project & Portfolio III
Dreaming Music Video Treatment _ Project & Portfolio IIIDreaming Music Video Treatment _ Project & Portfolio III
Dreaming Music Video Treatment _ Project & Portfolio IIINhPhngng3
 
VVIP Call Girls Nalasopara : 9892124323, Call Girls in Nalasopara Services
VVIP Call Girls Nalasopara : 9892124323, Call Girls in Nalasopara ServicesVVIP Call Girls Nalasopara : 9892124323, Call Girls in Nalasopara Services
VVIP Call Girls Nalasopara : 9892124323, Call Girls in Nalasopara ServicesPooja Nehwal
 

Recently uploaded (20)

ANCHORING SCRIPT FOR A CULTURAL EVENT.docx
ANCHORING SCRIPT FOR A CULTURAL EVENT.docxANCHORING SCRIPT FOR A CULTURAL EVENT.docx
ANCHORING SCRIPT FOR A CULTURAL EVENT.docx
 
Re-membering the Bard: Revisiting The Compleat Wrks of Wllm Shkspr (Abridged)...
Re-membering the Bard: Revisiting The Compleat Wrks of Wllm Shkspr (Abridged)...Re-membering the Bard: Revisiting The Compleat Wrks of Wllm Shkspr (Abridged)...
Re-membering the Bard: Revisiting The Compleat Wrks of Wllm Shkspr (Abridged)...
 
Chiulli_Aurora_Oman_Raffaele_Beowulf.pptx
Chiulli_Aurora_Oman_Raffaele_Beowulf.pptxChiulli_Aurora_Oman_Raffaele_Beowulf.pptx
Chiulli_Aurora_Oman_Raffaele_Beowulf.pptx
 
Presentation on Engagement in Book Clubs
Presentation on Engagement in Book ClubsPresentation on Engagement in Book Clubs
Presentation on Engagement in Book Clubs
 
Mohammad_Alnahdi_Oral_Presentation_Assignment.pptx
Mohammad_Alnahdi_Oral_Presentation_Assignment.pptxMohammad_Alnahdi_Oral_Presentation_Assignment.pptx
Mohammad_Alnahdi_Oral_Presentation_Assignment.pptx
 
SaaStr Workshop Wednesday w/ Lucas Price, Yardstick
SaaStr Workshop Wednesday w/ Lucas Price, YardstickSaaStr Workshop Wednesday w/ Lucas Price, Yardstick
SaaStr Workshop Wednesday w/ Lucas Price, Yardstick
 
AWS Data Engineer Associate (DEA-C01) Exam Dumps 2024.pdf
AWS Data Engineer Associate (DEA-C01) Exam Dumps 2024.pdfAWS Data Engineer Associate (DEA-C01) Exam Dumps 2024.pdf
AWS Data Engineer Associate (DEA-C01) Exam Dumps 2024.pdf
 
Governance and Nation-Building in Nigeria: Some Reflections on Options for Po...
Governance and Nation-Building in Nigeria: Some Reflections on Options for Po...Governance and Nation-Building in Nigeria: Some Reflections on Options for Po...
Governance and Nation-Building in Nigeria: Some Reflections on Options for Po...
 
Report Writing Webinar Training
Report Writing Webinar TrainingReport Writing Webinar Training
Report Writing Webinar Training
 
ICT role in 21st century education and it's challenges.pdf
ICT role in 21st century education and it's challenges.pdfICT role in 21st century education and it's challenges.pdf
ICT role in 21st century education and it's challenges.pdf
 
Thirunelveli call girls Tamil escorts 7877702510
Thirunelveli call girls Tamil escorts 7877702510Thirunelveli call girls Tamil escorts 7877702510
Thirunelveli call girls Tamil escorts 7877702510
 
Air breathing and respiratory adaptations in diver animals
Air breathing and respiratory adaptations in diver animalsAir breathing and respiratory adaptations in diver animals
Air breathing and respiratory adaptations in diver animals
 
If this Giant Must Walk: A Manifesto for a New Nigeria
If this Giant Must Walk: A Manifesto for a New NigeriaIf this Giant Must Walk: A Manifesto for a New Nigeria
If this Giant Must Walk: A Manifesto for a New Nigeria
 
Introduction to Prompt Engineering (Focusing on ChatGPT)
Introduction to Prompt Engineering (Focusing on ChatGPT)Introduction to Prompt Engineering (Focusing on ChatGPT)
Introduction to Prompt Engineering (Focusing on ChatGPT)
 
Dreaming Marissa Sánchez Music Video Treatment
Dreaming Marissa Sánchez Music Video TreatmentDreaming Marissa Sánchez Music Video Treatment
Dreaming Marissa Sánchez Music Video Treatment
 
Busty Desi⚡Call Girls in Sector 51 Noida Escorts >༒8448380779 Escort Service-...
Busty Desi⚡Call Girls in Sector 51 Noida Escorts >༒8448380779 Escort Service-...Busty Desi⚡Call Girls in Sector 51 Noida Escorts >༒8448380779 Escort Service-...
Busty Desi⚡Call Girls in Sector 51 Noida Escorts >༒8448380779 Escort Service-...
 
BDSM⚡Call Girls in Sector 93 Noida Escorts >༒8448380779 Escort Service
BDSM⚡Call Girls in Sector 93 Noida Escorts >༒8448380779 Escort ServiceBDSM⚡Call Girls in Sector 93 Noida Escorts >༒8448380779 Escort Service
BDSM⚡Call Girls in Sector 93 Noida Escorts >༒8448380779 Escort Service
 
BDSM⚡Call Girls in Sector 97 Noida Escorts >༒8448380779 Escort Service
BDSM⚡Call Girls in Sector 97 Noida Escorts >༒8448380779 Escort ServiceBDSM⚡Call Girls in Sector 97 Noida Escorts >༒8448380779 Escort Service
BDSM⚡Call Girls in Sector 97 Noida Escorts >༒8448380779 Escort Service
 
Dreaming Music Video Treatment _ Project & Portfolio III
Dreaming Music Video Treatment _ Project & Portfolio IIIDreaming Music Video Treatment _ Project & Portfolio III
Dreaming Music Video Treatment _ Project & Portfolio III
 
VVIP Call Girls Nalasopara : 9892124323, Call Girls in Nalasopara Services
VVIP Call Girls Nalasopara : 9892124323, Call Girls in Nalasopara ServicesVVIP Call Girls Nalasopara : 9892124323, Call Girls in Nalasopara Services
VVIP Call Girls Nalasopara : 9892124323, Call Girls in Nalasopara Services
 

Targeted Marketing: How Marketing Companies can use Big Data to Target Customers Based on Location & Certain Demographics

  • 1. Targeted Marketing How Marketing Companies can use Big Data to Target Customers Based on Location & Certain Demographics
  • 2. Confidential – Restricted Slide 2 Who @m I?
  • 3. Confidential – Restricted Slide 3 Who we @re? BA, short for Business Analytics, is the technology used by companies to investigate business performance by analyzing their data. BA encompasses three major areas Business Intelligence (BI), Enterprise Performance Management (EPM) and Big Data Analytics. BA in this instance indicates that we will go beyond EPM and also address topics/technologies that pertain to BI and Big Data. Wikipedia describes soapbox as a raised platform on which one stands to make an impromptu speech, often about a political subject. The term originates from the days when speakers would elevate themselves by standing on a wooden crate originally used for shipment of soap or other dry goods. I want this is be not only a place for me to share tips and tricks, but also a platform where industry practitioners can come to share their views and expertise. I am not an expert! I say this because I have had the pleasure through the years of working with people I would consider experts in specific products. Instead, I see myself as a data integration specialist or enthusiast if you will. I am someone who is passionate about the field and have made considerable investment in the space. If you are like me or find the thought of maintaining a website a bit daunting, this may be your opportunity to share your thoughts and educate others. So if you have a topic that you would like to contribute, please let me know at BASoapbox.com.
  • 4. Confidential – Restricted Slide 4 The Abstract This presentation will show how an outdoor advertising company used the Oracle Big Data environment to provide real time statistics and high value insights to their customers. Using data from providers such as Pinsight Media and Perconix data from Acxiom, they are able to accurately show the demographics of the consumers in the viewshed of their billboards and other digital assets. With the need to get useful information out of the terabyte of data they were receiving, our client use Oracle BDCS, specifically, Hive to create external tables connecting to flat files or MongoDB and Impala to analyze the data. The data was then loaded into Oracle DBCS for to be accessed by OACS for further analysis and dashboarding. In the session, I will:  Go over the overall objective of the project  Go over some of the key Hive and Impala queries  Show how ODICS was instrumental with the automation, but also with the interaction with MongoDB  Go over some of the lessons learned
  • 5. Confidential – Restricted Slide 5 The Big Question Can we identify by audience size and segment the consumer population exposed to a billboard by time of day and day of week? 12:00 AM 11:59 PM7:00 AM 12:00 PM 5:00 PM Out of the billion of aggregated anonymized observed devices in a geographical location per day: 1. We need curated assets (billboards) 2. We need to get a total count of the devices we observe within the viewshed of our assets (billboards) 3. We need to take a look at home and secondary locations to define the segmentation of the consumers near our assets (billboards)
  • 6. Confidential – Restricted Slide 6 Viewshed Impressions Once a viewshed is constructed, an impression then becomes the intersection of consumers with the asset. Wikipedia describes viewshed as the geographical area that is visible from a location. It includes all surrounding points that are in line-of-sight with that location and excludes points that are beyond the horizon or obstructed by terrain and other features (e.g., buildings, trees).
  • 7. Confidential – Restricted Slide 7 A Few Key Terms… Geofence as Polygon Pivot Dwell Time Route Lat/Lon Asset (Digital Billboard) x Lat/Lon Lat/Lon Lat/LonLat/Lon Lat/Lon Lat/Lon Visit Term Definition Pivot When a user cross a polygon or geofence boundary (i.e. New Jersey to New York City). Dwell Time When a user cross a polygon or geofence boundary. New Jersey to New York City. SoHo to LES. Outside to Inside. Visit dwell time at a location that is material to intent Geofence A designated boundary around a geometry that, if crossed, initiates a notification. Geofences are often used in real-time route Web applications. Polygon On a map, a closed shape defined by a connected sequence of x,y coordinate pairs, where the first and last coordinate pair are the same and all other pairs are unique.
  • 8. Confidential – Restricted Slide 8 Final Answer 9 AM to 10 AM Work Learn Shop & Eat Play Audience Identity Audience Behavioral Graph Audience Size Time • Friday 9 AM to 10 AM Audience Size • 10,432 impressions Audience Segmentation • 42% college crowd • 38 % mobile mixers • 20% city mixers For impression seen at certain time, say between 9 AM to 10 AM, we are able to travel backward and forward in time to understand home location type and secondary location type. This all informs audience segmentation.
  • 9. Confidential – Restricted Slide 9 Primary Data Sources • Owned by Sprint • Parquet file representing the observed population pivoting through Assets’ viewsheds. • Contains FIPS/Block Group data for each viewshed impression FIPS / Block Group • Data which will provide personal characteristics such as demographics, income, ethnicity, etc. to give us profile of the users that are passing through each asset. • PersonicX is a segmentation system which places U.S. households into one of 70 distinctive segments and 21 life stage groups based upon specific consumer behavior and demographic characteristics. • Asset inventory are maintained in a MongoDB application called GSP • GSP holds Point of Interest (POI) data, asset data (e.g., location of asset, latitude, and longitude), geospatial data, real estate, wireless, transit, and taxonomy • Output data from Big Data Cloud Service (BDCS) will be loaded back into GSP
  • 10. Confidential – Restricted Slide 10 Data Flow +Big Data Manager ODI to orchestrate Big Data Manager (BDM). BDM will be used to move Pinsight data from AWS S3 to BDCS +Big Data Manager ODI to orchestrate Big Data Manager (BDM). BDM will be used to move files from Acxiom to BDCS. ODI to orchestrate mongodump and mongoimport functions to load Asset information from MongoDB to BDCS.
  • 11. Confidential – Restricted Slide 11 Big Data Manager – Data Pull from S3 to HDFS
  • 12. Confidential – Restricted Slide 12 Oracle Data Integrator (ODI) • ODI was used in the following ways on the project: • As a means to orchestrate certain processes • Performing E-L-T type integrations • The use of ODI made it easier to not only streamline, but automate certain processes.
  • 13. ODI Process Orchestration • This insert shows how we can have ODI perform the follow orchestration, in a streamlined and automated fashion: 1. Call the EDQ Workflow that will do the IMS data cleansing. 2. If the process fails, then email the responsible parties. Otherwise, call the EDQ Workflow that will export the IMS (cleansed) data from EDQ to staging DB or flat file format 3. Email responsible parties that all processes ran successfully
  • 14. Confidential – Restricted Slide 14 The Data Lake Big Data Cloud Service (BDCS) was used as a central data repository, divided into ponds for raw data, canonical model and semantic layer. Large cellular data from Pinsight, consumer insights from Acxiom and asset data from MongoDB were loaded into the Hive table in the raw data pond.  Data Ingestion Pond  Data is stored as-is  Recent History available on HDFS  Older file moved to Oracle Object Storage  Common Meanings  Additional standardized field and values  Optimized storage format  Ideal layer for exploratory data analysis (EDA)  Business Ready  Finalized Model  Data ready for consumption  Conforming to the business’ needs Acxiom MongoDB Pinsight
  • 15. Confidential – Restricted Slide 15 Big Data Cloud Service (BDCS)
  • 16. Confidential – Restricted Slide 16 Pinsight Sample File
  • 17. Confidential – Restricted Slide 17 Acxiom Sample File (Demographics) FIPS / Block Group
  • 18. Confidential – Restricted Slide 18 Acxiom Sample File 1. ROW FORMAT SERDE "org.apache.hadoop.hive.serde2.RegexSerDe" Since the file has a fixed format, we use a SerDe to tell hive to use the RegexSerDe class to serialize and deserialize. 2. "input.regex" is used to deserialize the rows read from the table data. So this regex pattern is applied to the row value read from the file to split up into different columns defined in the meta data for this hive table. 3. "output.format.string" is used to serialize the rows being written out to this table data. This value is used as a format to generate a row value (from its column values) that is to be written back to the output file for this hive table. Regex Description . any character except for line break {n} n number of repetition of code that precedes it
  • 19. Confidential – Restricted Slide 19 Insert Query
  • 20. Confidential – Restricted Slide 20 Impala Query
  • 21. Confidential – Restricted Slide 21 Loading Fact Table p_acxperconicsfact
  • 22. Confidential – Restricted Slide 22 Output to MongoDB
  • 23. Confidential – Restricted Slide 23 The Final Output

Editor's Notes

  1. FIPS county code. The Federal Information Processing Standard Publication 6-4 (FIPS 6-4) was a five-digit Federal Information Processing Standards code which uniquely identified counties and county equivalents in the United States, certain U.S. possessions, and certain freely associated states. The FIPS State Code is a unique two-digit numeric code used to identify the 50 states, the District of Columbia, and the outlying areas of the United States. The FIPS County Code is a three-digit numeric code that, when used with the FIPS State Code, provides a unique identifier for each county and statistically equivalent entity of the 50 states, the District of Columbia, and the possessions and associated areas of the United States. Census tracts are the basic county subdivision used by the Census Bureau for census collection purposes and contain an average of 1,500 households.  The tract code is made up of a 4-byte prefix and a 2-byte suffix (the next element) often separated by a decimal.  Here, the tract element is provided as one 6-byte element. A subdivision of a census tract, a block group is another county subdivision used by the Census Bureau for census collection purposes and refers to an area that contains an average of 500 households.
  2. FIPS county code. The Federal Information Processing Standard Publication 6-4 (FIPS 6-4) was a five-digit Federal Information Processing Standards code which uniquely identified counties and county equivalents in the United States, certain U.S. possessions, and certain freely associated states. The FIPS State Code is a unique two-digit numeric code used to identify the 50 states, the District of Columbia, and the outlying areas of the United States. The FIPS County Code is a three-digit numeric code that, when used with the FIPS State Code, provides a unique identifier for each county and statistically equivalent entity of the 50 states, the District of Columbia, and the possessions and associated areas of the United States. Census tracts are the basic county subdivision used by the Census Bureau for census collection purposes and contain an average of 1,500 households.  The tract code is made up of a 4-byte prefix and a 2-byte suffix (the next element) often separated by a decimal.  Here, the tract element is provided as one 6-byte element. A subdivision of a census tract, a block group is another county subdivision used by the Census Bureau for census collection purposes and refers to an area that contains an average of 500 households.
  3. Create a data platform in Big Data Cloud Service (BDCS) that will do the following: Data hub for data integration from Pinsight, Acxiom, GSP (MongoDB) and IMS Serve as landing of raw data, enriched and modeled as work in progress, and finalized for consumption Cleanse the data, especially IMS data that is coming from a DB2 database Creating some semblance of Master Data Management Providing beginnings of ad-hoc data model for analytics Large data sets from cellular and GPS sources are needed for audience size by location Large data sets from consumer insight data and social media are needed for audience segment from size and location POI and geofencing data sources are needed to understand origin and destination segmentation Billboard inventory is needed for associating billboard with origin, destination, POI, and neighborhood
  4. Create a data platform in Big Data Cloud Service (BDCS) that will do the following: Data hub for data integration from Pinsight, Acxiom, GSP (MongoDB) and IMS Serve as landing of raw data, enriched and modeled as work in progress, and finalized for consumption Cleanse the data, especially IMS data that is coming from a DB2 database Creating some semblance of Master Data Management Providing beginnings of ad-hoc data model for analytics Large data sets from cellular and GPS sources are needed for audience size by location Large data sets from consumer insight data and social media are needed for audience segment from size and location POI and geofencing data sources are needed to understand origin and destination segmentation Billboard inventory is needed for associating billboard with origin, destination, POI, and neighborhood
  5. The FIPS State Code is a unique two-digit numeric code used to identify the 50 states, the District of Columbia, and the outlying areas of the United States. The FIPS County Code is a three-digit numeric code that, when used with the FIPS State Code, provides a unique identifier for each county and statistically equivalent entity of the 50 states, the District of Columbia, and the possessions and associated areas of the United States. Census tracts are the basic county subdivision used by the Census Bureau for census collection purposes and contain an average of 1,500 households.  The tract code is made up of a 4-byte prefix and a 2-byte suffix (the next element) often separated by a decimal.  Here, the tract element is provided as one 6-byte element. A subdivision of a census tract, a block group is another county subdivision used by the Census Bureau for census collection purposes and refers to an area that contains an average of 500 households.
  6. FIPS county code. The Federal Information Processing Standard Publication 6-4 (FIPS 6-4) was a five-digit Federal Information Processing Standards code which uniquely identified counties and county equivalents in the United States, certain U.S. possessions, and certain freely associated states. The FIPS State Code is a unique two-digit numeric code used to identify the 50 states, the District of Columbia, and the outlying areas of the United States. The FIPS County Code is a three-digit numeric code that, when used with the FIPS State Code, provides a unique identifier for each county and statistically equivalent entity of the 50 states, the District of Columbia, and the possessions and associated areas of the United States. Census tracts are the basic county subdivision used by the Census Bureau for census collection purposes and contain an average of 1,500 households.  The tract code is made up of a 4-byte prefix and a 2-byte suffix (the next element) often separated by a decimal.  Here, the tract element is provided as one 6-byte element. A subdivision of a census tract, a block group is another county subdivision used by the Census Bureau for census collection purposes and refers to an area that contains an average of 500 households.