Jim Forrester
Director, Information Services
jim.forrester@lifeway.com
@jrforrester
HADOOP @ LIFEWAY
ABOUT LIFEWAY
• Founded in 1891
• 4,200+ employees
• One of the world’s largest providers of Christian products
ABOUT LIFEWAY (CONT.)
• Operate more than 185 LifeWay Christian Stores across the United States
• Trade publishing through B&H; direct publishing and events through LifeWay Resources
OUR ISSUES
• Long ETL process in EDW
• Disjointed data warehouse
[Diagram: Business Intelligence Technology and Integration Flowchart, dated 1/24/2014. It shows the Enterprise Data Warehouse (EDW) on BISQLPROD, with SSIS load packages run from BISQLPROD2, alongside source systems (JDA, Aptify, Oracle eBus, Maximizer, UID, WebSphere Commerce for LifeWay.com, static and .txt files) and reporting and campaign tools (Business Objects, SAS, Unica, Epiphany, SSRS).]
OUR ISSUES (CONT.)
• Analysts spend 80% of their time gathering data sources
• Data is limited to structured sources
• A lot of HiPPOs (highest-paid person's opinions)
“Without data you’re just another person with an opinion.”
– W. Edwards Deming
WHY HADOOP?
• ETL -> ELT
• Centralized schema development
• Analysts get to be analysts
• More and different types of data
• “Win with data” – new analytics culture
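The ETL -> ELT shift can be illustrated with a small sketch. This is not LifeWay's pipeline: it uses Python's built-in sqlite3 module as a stand-in for Hive, and the table names, columns, and sample rows are all hypothetical. The point it shows is the ordering: raw rows are landed untouched first, and the typing and cleansing happen afterwards, in SQL, inside the same store.

```python
import sqlite3

# Hypothetical raw point-of-sale rows, as they might arrive from a source system.
RAW_ROWS = [
    ("1001", "2014-01-24", " 19.99"),
    ("1002", "2014-01-24", "5.00 "),
    ("1003", "2014-01-25", "not-a-number"),  # bad record, still kept in the raw layer
]


def load_raw(conn: sqlite3.Connection) -> None:
    """Extract + Load: land the source rows as-is, every column as text."""
    conn.execute(
        "CREATE TABLE raw_sales (order_id TEXT, sale_date TEXT, amount TEXT)"
    )
    conn.executemany("INSERT INTO raw_sales VALUES (?, ?, ?)", RAW_ROWS)


def transform_in_place(conn: sqlite3.Connection) -> None:
    """Transform: build a typed table with SQL, after the data has been loaded."""
    conn.execute(
        """
        CREATE TABLE sales AS
        SELECT CAST(order_id AS INTEGER)   AS order_id,
               sale_date,
               CAST(TRIM(amount) AS REAL)  AS amount
        FROM raw_sales
        WHERE TRIM(amount) GLOB '[0-9]*.[0-9]*'  -- keep only numeric-looking amounts
        """
    )


def daily_totals(conn: sqlite3.Connection) -> list:
    """Query the transformed table the way an analyst would."""
    return conn.execute(
        "SELECT sale_date, ROUND(SUM(amount), 2) FROM sales "
        "GROUP BY sale_date ORDER BY sale_date"
    ).fetchall()


if __name__ == "__main__":
    connection = sqlite3.connect(":memory:")
    load_raw(connection)
    transform_in_place(connection)
    print(daily_totals(connection))
```

Because the raw table is preserved, a transformation can be revised and re-run later without going back to the source system, which is one common argument for ELT.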
FIRST STEPS
• POCs on several distributions
• Ran POCs on a 5-node VM cluster
• Chose Hortonworks
    • Pure-play distribution
    • Engineering expertise
    • Support model
INVESTMENT
• Started with a 12-node physical cluster
• Cisco UCS hardware to meet infrastructure standards
• Training for 1 administrator and 2 developers
• Two-week professional services engagement
    • Got environment up
    • Successful ETL offload for two data marts
    • Templated framework for remaining data marts
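The slides do not describe what the templated framework looked like. As one possible reading, a minimal sketch follows in which a single HiveQL statement template is rendered once per data mart. The template text, the `staging` database, and the mart, table, and column names are all assumptions made for illustration.

```python
from string import Template
from typing import Dict, List

# Hypothetical HiveQL template: one offload statement per data mart table.
OFFLOAD_TEMPLATE = Template(
    "INSERT OVERWRITE TABLE ${mart}.${target}\n"
    "SELECT ${columns}\n"
    "FROM staging.${source}"
)

# Hypothetical data mart definitions; adding a mart means adding an entry here.
DATA_MARTS: List[Dict[str, object]] = [
    {
        "mart": "sales_mart",
        "target": "daily_sales",
        "source": "pos_transactions",
        "columns": ["store_id", "sale_date", "amount"],
    },
    {
        "mart": "inventory_mart",
        "target": "stock_levels",
        "source": "inventory_snapshots",
        "columns": ["store_id", "item_id", "on_hand"],
    },
]


def render_offload(mart: str, target: str, source: str, columns: List[str]) -> str:
    """Fill the shared template with one data mart's specifics."""
    return OFFLOAD_TEMPLATE.substitute(
        mart=mart, target=target, source=source, columns=", ".join(columns)
    )


if __name__ == "__main__":
    for definition in DATA_MARTS:
        print(render_offload(**definition))
        print()
```

The design idea is that the first two offloads establish the pattern, and each remaining data mart then becomes a configuration entry rather than a new hand-written job.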
EMPLOYEE SKILLS
• SQL skills translate well to Hive
• Pig can be picked up quickly through training
• Forward-thinking DBA for administration
HADOOP ECOSYSTEM TECHNOLOGIES
• Currently using:
    • Hive
    • Pig
    • Sqoop
    • Oozie
    • Ranger
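To make the role of one of these tools concrete, the sketch below assembles a Sqoop import command, the kind of call typically used to copy a relational table into HDFS. The flags shown (`--connect`, `--username`, `--password-file`, `--table`, `--target-dir`, `--num-mappers`) are standard Sqoop import options; the JDBC URL, user, paths, and table name are hypothetical and do not come from the presentation. The script only builds and prints the command, it does not run it.

```python
import shlex
from typing import List


def build_sqoop_import(
    jdbc_url: str,
    username: str,
    password_file: str,
    table: str,
    target_dir: str,
    num_mappers: int = 4,
) -> List[str]:
    """Return the argument list for a `sqoop import` of a single table."""
    return [
        "sqoop", "import",
        "--connect", jdbc_url,
        "--username", username,
        "--password-file", password_file,  # keeps the password off the command line
        "--table", table,
        "--target-dir", target_dir,
        "--num-mappers", str(num_mappers),  # number of parallel map tasks
    ]


if __name__ == "__main__":
    # All values below are placeholders for illustration.
    command = build_sqoop_import(
        jdbc_url="jdbc:sqlserver://edw.example.com:1433;databaseName=EDW",
        username="etl_user",
        password_file="/user/etl_user/.sqoop_password",
        table="sales",
        target_dir="/data/raw/edw/sales",
    )
    print(" ".join(shlex.quote(part) for part in command))
```

In a scheduled pipeline, a command like this would usually be wrapped in a workflow action (for example in Oozie) rather than run by hand.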
TYPES OF DATA
• Structured Systems
    • ERP
    • Logistics
    • POS
    • Merchandising
• Unstructured
    • Wi-Fi analytics
    • Price API
    • Clickstream
    • International weather
    • CPI
    • Census
ARCHITECTURE
[Diagram: Enterprise Data Hub, shown with cloud data, structured data, and unstructured data, and with apps connected through an API.]
ARCHITECTURE (CONT.)
BUSINESS COLLABORATION
• Socializing impacts of Hadoop
    • Data gathering time for analysts
    • New data sets
    • Improved schemas
• Successful implementation of Hadoop
    • Lays the groundwork for new BI&A tooling
    • Creates an Agile BI framework
USE CASES
• Segmentation/Targeting of customers
• Omnichannel customer views
• Pricing optimization
• Product clustering
• Supply chain optimization
• Cannibalization of products and customers
• Fraud detection on AP
NEXT STEPS
• Continued growth of cluster
• HA/DR planning
• Cloud vs On-premise
• EMC Isilon
FUTURE OF HADOOP AT LIFEWAY
• Data science
• Machine learning
• Process optimization
• Heat mapping for store optimization
• Event log and sensor aggregation – predicting failure

(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service
 
{Pooja: 9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...
{Pooja:  9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...{Pooja:  9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...
{Pooja: 9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...
 

Hadoop @ LifeWay

  • 1. Jim Forrester Director, Information Services jim.forrester@lifeway.com @jrforrester HADOOP @ LIFEWAY
  • 2. ABOUT LIFEWAY • Founded in 1891 • 4,200+ employees • One of the world’s largest providers of Christian products
  • 3. ABOUT LIFEWAY (CONT.) • Operate more than 185 LifeWay Christian Stores across the United States • Trade publishing through B&H – direct publishing and events through LifeWay Resources
  • 4. OUR ISSUES • Long ETL process in EDW • Disjointed data warehouse [Diagram: "Business Intelligence Technology and Integration Flowchart" (1/24/2014), showing data flows among the Enterprise Data Warehouse (EDW, BISQLPROD), Business Objects, SAS, Unica, Epiphany, Maximizer, JDA, Aptify, Oracle eBus, WebSphere Commerce for LifeWay.com, UID/Customer Insight, and numerous SQL Server, Sybase, Oracle, DB2, MySQL, Access, and .txt file sources]
  • 5. OUR ISSUES (CONT.) • Analysts spending 80% of time gathering data sources • Data is limited to structured sources • A lot of HiPPOs “Without data you’re just another person with an opinion.” – W. Edwards Deming
  • 6. WHY HADOOP? • ETL -> ELT • Centralized schema development • Analysts get to be analysts • More and different types of data • “Win with data” – new analytics culture
  • 7. FIRST STEPS • POCs on several distributions • Ran POCs on a 5-node VM cluster • Chose Hortonworks • Pure-play distribution • Engineering expertise • Support model
  • 8. INVESTMENT • Started with a 12-node physical cluster • Cisco UCS hardware to meet infrastructure standards • Training for 1 administrator and 2 developers • Two-week professional services engagement • Got environment up • Successful ETL offload for two data marts • Templated framework for remaining data marts
  • 9. EMPLOYEE SKILLS • SQL skills translate well for Hive • Pig can be picked up quickly through training • Forward-thinking DBA for administration
  • 10. HADOOP ECOSYSTEM TECHNOLOGIES • Currently using: • Hive • Pig • Sqoop • Oozie • Ranger
  • 11. TYPES OF DATA • Structured Systems • ERP • Logistics • POS • Merchandising • Unstructured • Wi-Fi analytics • Price API • Clickstream • International weather • CPI • Census
  • 14. BUSINESS COLLABORATION • Socializing impacts of Hadoop • Data gathering time for analysts • New data sets • Improved schemas • Successful implementation of Hadoop • Lays the groundwork for new BI&A tooling • Creates an Agile BI framework
  • 15. USE CASES • Segmentation/Targeting of customers • Omnichannel customer views • Pricing optimization • Product clustering • Supply chain optimization • Cannibalization of products and customers • Fraud detection on AP
  • 16. NEXT STEPS • Continued growth of cluster • HA/DR planning • Cloud vs On-premise • EMC Isilon
  • 17. FUTURE OF HADOOP AT LIFEWAY • Data science • Machine learning • Process optimization • Heat mapping for store optimization • Event log and sensor aggregation – predicting failure
  • 18. Jim Forrester Director, Information Services jim.forrester@lifeway.com @jrforrester HADOOP @ LIFEWAY