SlideShare a Scribd company logo
1 of 20
Download to read offline
Q. The CDO Agenda,
how can Data Architecture help?
Phill Radley,
Chief Data Architect
26 / October / 2016
Data Management
Specialist Group
1/25
Answer
• A lot
• A bit
• Not much
The architects answer…
It depends
…..State of the business
…..State of enterprise architecture
……What’s going on externally
Do you have or need a CDO ?
AGENDA
Framing View of the Industry
Discuss the CDO role
The organisation of BT & IT Challenges
Examples of what we’re doing with data
(in the absence of a CDO)
The Long View of Big Data
Data “Bigness” =
 ( Volume, Velocity, Variety)
1990 Y2K
Mainframe (1st Platform)
1960 09
First Research cluster   Production Cluster
HAAS = Hadoop as a Service
14
Proprietary, Monolithic
Batch, Interactive
COBOL/ISAM/IDMS
Linked Record sets Client-Server Applications +
RDBMS
(2nd Platform )
OPEN ! 3GL, 4GL
PC & Servers
on premise
RELATIONAL
1606
scale out infrastructure
(3rd Platform)
Clusters, Data hub, pipelines
Mobile
Social
Big Data
Cloud
?
cost/performance
VVV crunch
What does a Chief Data Office(r) do ?
Evangelise
• Culture change to “data driven organisation”
• Self-Service Data & Analytics
Centralise ( tackle the silo problem )
“Year 1: Build the House”
“Year 2: Throw Open the doors”
Facilitate
• Educate
• Design Pattern Cookbook for the Enterprise
• Briefings – All Hands Calls, Leadership Team Mtgs, Hackathons….
• Tooling
• SKOOL on GITHUB (tool to simplify transferring tables from Oracle to Hadoop)
“Understanding the Chief Data Officer”
O’Reilly – Julie Steele of Silicon Valley Data Science
A Good CDO Role Model ?
Joy Bonaguro
City of San Francisco
data.sfgov.org
Cataloguing Data Assets
Facilitating data sharing
Building enabling infrastructure
BT Group Structure 1/Apr/2016
Customers
Chief Architects Office Enterprise Architecture
Data Architecture
For BT Group
~ 90K FTE in 61 countries, serving 180 countries
Research & Innovation
Legacy Systems Architecture in each BT Business Unit
Analytics
Data
Warehouse
ESB
CRM
Service Management
Network Management
Networks
& IT
Customers
 
• Hundreds of systems in each business unit grouped
into 3 operational areas (CRM/Service Mgt/Network Mgt)
• Data Warehouse per business unit
• Client – Server applications running on
servers in BT Data Centres (~ 35K hosts)
• Mainframe applications (in Openreach)
• Total Storage ~ 25PB
• Lots of event / time series data
– Network Alarms & Telemetry
– Netflow Traffic Events, Security events
– Call Detail Records, web clicks,
– mobile handset data (GPS, Apps, browsing..)
• Business Unit CIOs manage IT investment roadmap, each business
unit deploys a “stack release” quarterly
Field Engineers
Challenges - Complexity
Example from BT Global Services
Design for Release 17 of
Repair Systems for 1 product family
Where’s the Master Data ?
Which flows are data replication ?
Which flows are transactional ?
x 70 Similar “system stacks”
x 4 Releases / yr
Data Replication pinball
Revenue
Assurance
Customer
Details
Archive
Order
Mgt.
CRM
Billing
25K
20K 75K
10K
10K
10K
35K
Challenges – Risk & Compliance
Challenge – Agility Opportunity - Scaling
What does Data architecture do…? 1. Sort the basics
Adopt/Adapt a framework
Establish Lists(systems, data landscape….)
DAMA DMBOK.. TOGAF…
What does Data architecture do…? 2. Develop Vision


 

CRM
Hive
Meta
Store
RDBMS
Web/APP
Server
 Map 
Reduce
code

BI Tools
Tableau, Zoomdata…
(HIVE TABLE ACCESS)
HDFS
Impala
+ Sentry
Wrangling & Discovery
Data Science
Datameer, HUE…
(HDFS FILE ACCESS)
Flume
Golden
Gate
 

ERP
RDBMS
Web/APP
Server
 Map 
Reduce
code

sqoop
 

DW
RDBMS
Web/APP
Server
 Map 
Reduce
code

sqoop







1. Event Ingestion from
Networks/IT/Web servers
Collection with flume agents
landing in HDFS files 2. DB Table transfer using sqoop
(map/reduce) jobs, landing in HDFS files
Active
Directory
FILES
TABLES
snapshotCDC snapshot

Data
Scientists
SQL
analysts
business
users
What does Data architecture do…? 3A. Build the data house
• Following a presentation to the TSO Leadership team Dec 2013 an initial inovestment in
a production cluster was agreed backed by a plan to launch in Feb 2014
• 60 nodes optimised for Hadoop map/reduce deployed in BT Data Centre in Sheffield
(6TB local disks, 1:1 core:spindle ratio, 8GB for JVM per map/reduce slot
• Existing linux 3rd line team tasked with running basic (Min. Viable Product) Hadoop
Cluster as a shared service platform
BT HaaS Release 1: 60 Nodes ~ 2 PB Feb 2014 Linux 3rd Line  Hadoop Admin
What does Data architecture do…? 3B. Build the data house
HAAS Platform
Hadoop Cluster B (Openreach only)
Order form
(SharePoint)

script

email
Active
Directory
Tennant
“Project Owner”

User
admin
Standard
User Admin
Process
Hadoop
Cluster A HAASA AP 00307_12126
HIVE
HDFS
sentry
Job queue
HUE Impala
Flume
BI Server
Create
Hadoop
Features
“HAASA AP 00307_12126
Is ready for you to use”
existing
Business APP
12126 .
Oracle
DB
APP extends footprint in HaaS
http FS
Kerberos
Datameer
Analytics
Review
Board
Platform
Admin
ARB
User Access
Systems Access
Sqoop
Create
Security
Group
HAASA AP 00101_2029
Faults
4369
Orders
3531
CRM
2029
 hree existing business applications (CRM, Orders, Faults) extended into HaaS 
RDBMS
Customer
Table
RDBMS
Orders
Table
RDBMS
Faults
Table
T_CustomerHive DB
HAASA
AP 00101_2029
sqoop
V_Customer
HAASA AP 00202_3531
T_OrdersHive DB
HAASA
AP 0202_3531
sqoop
V_Orders
HAASA AP 00303_4369
T_FaultsHive DB
HAASA
AP 0303_4369
sqoop
V_Faults
Business
Data
Stewards

Business Analysts / Data Scientists

CRM

Orders

Faults
Governing Access to Data on the Platform ** WIP **
1. Browse & select data
2. Get Steward Approval
3. Create VIEWs & GRANTs
4. Recommend joins/ Views
Data Catalogue
(Million Table Meta-store)
Cloudera
“Resident”
Solution
Architect
What does Data architecture do…? 3. Educate
BT HaaS Cookbook
snip.bt.com/haascook
Design patterns to
ease project on boarding
included in “Learning Pathways”
Research & Innovation
Data Scientists
Dec 2015 3rd BT Data Science Week
(50 @ Adastral)
Business Awareness
Sep 2014
UK Hadoop User Group
(200 @ BT Centre)
IT Operations
Jan 2014
RESOPS training week
(Research + IT Ops Adastral)
Architecture
Hadoop Summit Mar 2014
(Doug Cutting- Cloudera+BT)
Big Data Data Centre of Excellence
Cardiff / Bangalore
20 designers / developers
working on > 50 opportunities & projects
published open source “skool” utility
Q & A
Phill Radley
Chief Data Architect
phillip.radley@bt.com

More Related Content

What's hot

Designing Fast Data Architecture for Big Data using Logical Data Warehouse a...
Designing Fast Data Architecture for Big Data  using Logical Data Warehouse a...Designing Fast Data Architecture for Big Data  using Logical Data Warehouse a...
Designing Fast Data Architecture for Big Data using Logical Data Warehouse a...Denodo
 
Denodo DataFest 2017: Outpace Your Competition with Real-Time Responses
Denodo DataFest 2017: Outpace Your Competition with Real-Time ResponsesDenodo DataFest 2017: Outpace Your Competition with Real-Time Responses
Denodo DataFest 2017: Outpace Your Competition with Real-Time ResponsesDenodo
 
Architecture for Real-Time and Batch Big Data Analytics
Architecture for Real-Time and Batch Big Data AnalyticsArchitecture for Real-Time and Batch Big Data Analytics
Architecture for Real-Time and Batch Big Data AnalyticsNir Rubinstein
 
Seamless, Real-Time Data Integration with Connect
Seamless, Real-Time Data Integration with ConnectSeamless, Real-Time Data Integration with Connect
Seamless, Real-Time Data Integration with ConnectPrecisely
 
Denodo DataFest 2016: Big Data Virtualization in the Cloud
Denodo DataFest 2016: Big Data Virtualization in the CloudDenodo DataFest 2016: Big Data Virtualization in the Cloud
Denodo DataFest 2016: Big Data Virtualization in the CloudDenodo
 
Apache Kafka® and the Data Mesh
Apache Kafka® and the Data MeshApache Kafka® and the Data Mesh
Apache Kafka® and the Data MeshConfluentInc1
 
Data Lakehouse, Data Mesh, and Data Fabric (r1)
Data Lakehouse, Data Mesh, and Data Fabric (r1)Data Lakehouse, Data Mesh, and Data Fabric (r1)
Data Lakehouse, Data Mesh, and Data Fabric (r1)James Serra
 
Big Data Management: What's New, What's Different, and What You Need To Know
Big Data Management: What's New, What's Different, and What You Need To KnowBig Data Management: What's New, What's Different, and What You Need To Know
Big Data Management: What's New, What's Different, and What You Need To KnowSnapLogic
 
Data platform architecture
Data platform architectureData platform architecture
Data platform architectureSudheer Kondla
 
Ibm machine learning for z os
Ibm machine learning for z osIbm machine learning for z os
Ibm machine learning for z osCuneyt Goksu
 
Modernize & Automate Analytics Data Pipelines
Modernize & Automate Analytics Data PipelinesModernize & Automate Analytics Data Pipelines
Modernize & Automate Analytics Data PipelinesCarole Gunst
 
Where does Fast Data Strategy Fit within IT Projects
Where does Fast Data Strategy Fit within IT ProjectsWhere does Fast Data Strategy Fit within IT Projects
Where does Fast Data Strategy Fit within IT ProjectsDenodo
 
Big-Data Server Farm Architecture
Big-Data Server Farm Architecture Big-Data Server Farm Architecture
Big-Data Server Farm Architecture Jordan Chung
 
Hitachi Data Systems Hadoop Solution
Hitachi Data Systems Hadoop SolutionHitachi Data Systems Hadoop Solution
Hitachi Data Systems Hadoop SolutionHitachi Vantara
 
Modernizing Data Management Through Metadata
Modernizing Data Management Through MetadataModernizing Data Management Through Metadata
Modernizing Data Management Through MetadataMANTA
 
Data Mesh Part 4 Monolith to Mesh
Data Mesh Part 4 Monolith to MeshData Mesh Part 4 Monolith to Mesh
Data Mesh Part 4 Monolith to MeshJeffrey T. Pollock
 
Architecture of Big Data Solutions
Architecture of Big Data SolutionsArchitecture of Big Data Solutions
Architecture of Big Data SolutionsGuido Schmutz
 
Denodo DataFest 2017: Edge Computing: Collecting vs. Connecting to Streaming ...
Denodo DataFest 2017: Edge Computing: Collecting vs. Connecting to Streaming ...Denodo DataFest 2017: Edge Computing: Collecting vs. Connecting to Streaming ...
Denodo DataFest 2017: Edge Computing: Collecting vs. Connecting to Streaming ...Denodo
 
New World Hadoop Architectures (& What Problems They Really Solve) for Oracle...
New World Hadoop Architectures (& What Problems They Really Solve) for Oracle...New World Hadoop Architectures (& What Problems They Really Solve) for Oracle...
New World Hadoop Architectures (& What Problems They Really Solve) for Oracle...Rittman Analytics
 
Presto – Today and Beyond – The Open Source SQL Engine for Querying all Data...
Presto – Today and Beyond – The Open Source SQL Engine for Querying all Data...Presto – Today and Beyond – The Open Source SQL Engine for Querying all Data...
Presto – Today and Beyond – The Open Source SQL Engine for Querying all Data...Dipti Borkar
 

What's hot (20)

Designing Fast Data Architecture for Big Data using Logical Data Warehouse a...
Designing Fast Data Architecture for Big Data  using Logical Data Warehouse a...Designing Fast Data Architecture for Big Data  using Logical Data Warehouse a...
Designing Fast Data Architecture for Big Data using Logical Data Warehouse a...
 
Denodo DataFest 2017: Outpace Your Competition with Real-Time Responses
Denodo DataFest 2017: Outpace Your Competition with Real-Time ResponsesDenodo DataFest 2017: Outpace Your Competition with Real-Time Responses
Denodo DataFest 2017: Outpace Your Competition with Real-Time Responses
 
Architecture for Real-Time and Batch Big Data Analytics
Architecture for Real-Time and Batch Big Data AnalyticsArchitecture for Real-Time and Batch Big Data Analytics
Architecture for Real-Time and Batch Big Data Analytics
 
Seamless, Real-Time Data Integration with Connect
Seamless, Real-Time Data Integration with ConnectSeamless, Real-Time Data Integration with Connect
Seamless, Real-Time Data Integration with Connect
 
Denodo DataFest 2016: Big Data Virtualization in the Cloud
Denodo DataFest 2016: Big Data Virtualization in the CloudDenodo DataFest 2016: Big Data Virtualization in the Cloud
Denodo DataFest 2016: Big Data Virtualization in the Cloud
 
Apache Kafka® and the Data Mesh
Apache Kafka® and the Data MeshApache Kafka® and the Data Mesh
Apache Kafka® and the Data Mesh
 
Data Lakehouse, Data Mesh, and Data Fabric (r1)
Data Lakehouse, Data Mesh, and Data Fabric (r1)Data Lakehouse, Data Mesh, and Data Fabric (r1)
Data Lakehouse, Data Mesh, and Data Fabric (r1)
 
Big Data Management: What's New, What's Different, and What You Need To Know
Big Data Management: What's New, What's Different, and What You Need To KnowBig Data Management: What's New, What's Different, and What You Need To Know
Big Data Management: What's New, What's Different, and What You Need To Know
 
Data platform architecture
Data platform architectureData platform architecture
Data platform architecture
 
Ibm machine learning for z os
Ibm machine learning for z osIbm machine learning for z os
Ibm machine learning for z os
 
Modernize & Automate Analytics Data Pipelines
Modernize & Automate Analytics Data PipelinesModernize & Automate Analytics Data Pipelines
Modernize & Automate Analytics Data Pipelines
 
Where does Fast Data Strategy Fit within IT Projects
Where does Fast Data Strategy Fit within IT ProjectsWhere does Fast Data Strategy Fit within IT Projects
Where does Fast Data Strategy Fit within IT Projects
 
Big-Data Server Farm Architecture
Big-Data Server Farm Architecture Big-Data Server Farm Architecture
Big-Data Server Farm Architecture
 
Hitachi Data Systems Hadoop Solution
Hitachi Data Systems Hadoop SolutionHitachi Data Systems Hadoop Solution
Hitachi Data Systems Hadoop Solution
 
Modernizing Data Management Through Metadata
Modernizing Data Management Through MetadataModernizing Data Management Through Metadata
Modernizing Data Management Through Metadata
 
Data Mesh Part 4 Monolith to Mesh
Data Mesh Part 4 Monolith to MeshData Mesh Part 4 Monolith to Mesh
Data Mesh Part 4 Monolith to Mesh
 
Architecture of Big Data Solutions
Architecture of Big Data SolutionsArchitecture of Big Data Solutions
Architecture of Big Data Solutions
 
Denodo DataFest 2017: Edge Computing: Collecting vs. Connecting to Streaming ...
Denodo DataFest 2017: Edge Computing: Collecting vs. Connecting to Streaming ...Denodo DataFest 2017: Edge Computing: Collecting vs. Connecting to Streaming ...
Denodo DataFest 2017: Edge Computing: Collecting vs. Connecting to Streaming ...
 
New World Hadoop Architectures (& What Problems They Really Solve) for Oracle...
New World Hadoop Architectures (& What Problems They Really Solve) for Oracle...New World Hadoop Architectures (& What Problems They Really Solve) for Oracle...
New World Hadoop Architectures (& What Problems They Really Solve) for Oracle...
 
Presto – Today and Beyond – The Open Source SQL Engine for Querying all Data...
Presto – Today and Beyond – The Open Source SQL Engine for Querying all Data...Presto – Today and Beyond – The Open Source SQL Engine for Querying all Data...
Presto – Today and Beyond – The Open Source SQL Engine for Querying all Data...
 

Viewers also liked (9)

Beyond SQL: Managing Events and Relationships in Social Care
Beyond SQL: Managing Events and Relationships in Social CareBeyond SQL: Managing Events and Relationships in Social Care
Beyond SQL: Managing Events and Relationships in Social Care
 
The CDO Challenge 24-11-16
The CDO Challenge 24-11-16The CDO Challenge 24-11-16
The CDO Challenge 24-11-16
 
Nick Keen Data governance in the Environment Agency
Nick Keen   Data governance in the Environment AgencyNick Keen   Data governance in the Environment Agency
Nick Keen Data governance in the Environment Agency
 
Moving from 3rd Normal Form to a web enabled world 22-9-15
Moving from 3rd Normal Form to a web enabled world   22-9-15Moving from 3rd Normal Form to a web enabled world   22-9-15
Moving from 3rd Normal Form to a web enabled world 22-9-15
 
Nigel Turner data governance is not boring
Nigel Turner   data governance is not boringNigel Turner   data governance is not boring
Nigel Turner data governance is not boring
 
John Stuart-Clarke - beginning the data governance journey - 8th june 2016
John Stuart-Clarke - beginning the data governance journey - 8th june 2016John Stuart-Clarke - beginning the data governance journey - 8th june 2016
John Stuart-Clarke - beginning the data governance journey - 8th june 2016
 
Michael Bironneau Data governance and the IoT
Michael Bironneau   Data governance and the IoTMichael Bironneau   Data governance and the IoT
Michael Bironneau Data governance and the IoT
 
Big Data Analytics, Dave Shuttleworth - 22-9-15
Big Data Analytics, Dave Shuttleworth - 22-9-15Big Data Analytics, Dave Shuttleworth - 22-9-15
Big Data Analytics, Dave Shuttleworth - 22-9-15
 
Nicola Askham Key concepts in data governance
Nicola Askham   Key concepts in data governanceNicola Askham   Key concepts in data governance
Nicola Askham Key concepts in data governance
 

Similar to The CDO Agenda: how data architecture can help?

Presentation racsig 090730
Presentation racsig 090730Presentation racsig 090730
Presentation racsig 090730maclean liu
 
Deutsche Telekom on Big Data
Deutsche Telekom on Big DataDeutsche Telekom on Big Data
Deutsche Telekom on Big DataDataWorks Summit
 
DAMA & Denodo Webinar: Modernizing Data Architecture Using Data Virtualization
DAMA & Denodo Webinar: Modernizing Data Architecture Using Data Virtualization DAMA & Denodo Webinar: Modernizing Data Architecture Using Data Virtualization
DAMA & Denodo Webinar: Modernizing Data Architecture Using Data Virtualization Denodo
 
IBMHadoopofferingTechline-Systems2015
IBMHadoopofferingTechline-Systems2015IBMHadoopofferingTechline-Systems2015
IBMHadoopofferingTechline-Systems2015Daniela Zuppini
 
Dell Digital Transformation Through AI and Data Analytics Webinar
Dell Digital Transformation Through AI and  Data Analytics WebinarDell Digital Transformation Through AI and  Data Analytics Webinar
Dell Digital Transformation Through AI and Data Analytics WebinarBill Wong
 
Webinar future dataintegration-datamesh-and-goldengatekafka
Webinar future dataintegration-datamesh-and-goldengatekafkaWebinar future dataintegration-datamesh-and-goldengatekafka
Webinar future dataintegration-datamesh-and-goldengatekafkaJeffrey T. Pollock
 
Cloud Computing Architecture Primer
Cloud Computing Architecture PrimerCloud Computing Architecture Primer
Cloud Computing Architecture PrimerIlham Ahmed
 
2016 August POWER Up Your Insights - IBM System Summit Mumbai
2016 August POWER Up Your Insights - IBM System Summit Mumbai2016 August POWER Up Your Insights - IBM System Summit Mumbai
2016 August POWER Up Your Insights - IBM System Summit MumbaiAnand Haridass
 
Best Practices For Building and Operating A Managed Data Lake - StampedeCon 2016
Best Practices For Building and Operating A Managed Data Lake - StampedeCon 2016Best Practices For Building and Operating A Managed Data Lake - StampedeCon 2016
Best Practices For Building and Operating A Managed Data Lake - StampedeCon 2016StampedeCon
 
Logical Data Lakes: From Single Purpose to Multipurpose Data Lakes (APAC)
Logical Data Lakes: From Single Purpose to Multipurpose Data Lakes (APAC)Logical Data Lakes: From Single Purpose to Multipurpose Data Lakes (APAC)
Logical Data Lakes: From Single Purpose to Multipurpose Data Lakes (APAC)Denodo
 
L'architettura di classe enterprise di nuova generazione - Massimo Brignoli
L'architettura di classe enterprise di nuova generazione - Massimo BrignoliL'architettura di classe enterprise di nuova generazione - Massimo Brignoli
L'architettura di classe enterprise di nuova generazione - Massimo BrignoliData Driven Innovation
 
Big Data, Simple and Fast: Addressing the Shortcomings of Hadoop
Big Data, Simple and Fast: Addressing the Shortcomings of HadoopBig Data, Simple and Fast: Addressing the Shortcomings of Hadoop
Big Data, Simple and Fast: Addressing the Shortcomings of HadoopHazelcast
 
DM Radio Webinar: Adopting a Streaming-Enabled Architecture
DM Radio Webinar: Adopting a Streaming-Enabled ArchitectureDM Radio Webinar: Adopting a Streaming-Enabled Architecture
DM Radio Webinar: Adopting a Streaming-Enabled ArchitectureDATAVERSITY
 
Big Data Session 1.pptx
Big Data Session 1.pptxBig Data Session 1.pptx
Big Data Session 1.pptxElsonPaul2
 
ADV Slides: When and How Data Lakes Fit into a Modern Data Architecture
ADV Slides: When and How Data Lakes Fit into a Modern Data ArchitectureADV Slides: When and How Data Lakes Fit into a Modern Data Architecture
ADV Slides: When and How Data Lakes Fit into a Modern Data ArchitectureDATAVERSITY
 
Building a Single Logical Data Lake: For Advanced Analytics, Data Science, an...
Building a Single Logical Data Lake: For Advanced Analytics, Data Science, an...Building a Single Logical Data Lake: For Advanced Analytics, Data Science, an...
Building a Single Logical Data Lake: For Advanced Analytics, Data Science, an...Denodo
 
SAP on pay as you go model
SAP on pay as you go modelSAP on pay as you go model
SAP on pay as you go modelAjay Kumar Uppal
 

Similar to The CDO Agenda: how data architecture can help? (20)

Presentation racsig 090730
Presentation racsig 090730Presentation racsig 090730
Presentation racsig 090730
 
Deutsche Telekom on Big Data
Deutsche Telekom on Big DataDeutsche Telekom on Big Data
Deutsche Telekom on Big Data
 
DAMA & Denodo Webinar: Modernizing Data Architecture Using Data Virtualization
DAMA & Denodo Webinar: Modernizing Data Architecture Using Data Virtualization DAMA & Denodo Webinar: Modernizing Data Architecture Using Data Virtualization
DAMA & Denodo Webinar: Modernizing Data Architecture Using Data Virtualization
 
IBMHadoopofferingTechline-Systems2015
IBMHadoopofferingTechline-Systems2015IBMHadoopofferingTechline-Systems2015
IBMHadoopofferingTechline-Systems2015
 
Dell Digital Transformation Through AI and Data Analytics Webinar
Dell Digital Transformation Through AI and  Data Analytics WebinarDell Digital Transformation Through AI and  Data Analytics Webinar
Dell Digital Transformation Through AI and Data Analytics Webinar
 
Iotbds v1.0
Iotbds v1.0Iotbds v1.0
Iotbds v1.0
 
Database as a Service - Tutorial @ICDE 2010
Database as a Service - Tutorial @ICDE 2010Database as a Service - Tutorial @ICDE 2010
Database as a Service - Tutorial @ICDE 2010
 
Webinar Data Mesh - Part 3
Webinar Data Mesh - Part 3Webinar Data Mesh - Part 3
Webinar Data Mesh - Part 3
 
Webinar future dataintegration-datamesh-and-goldengatekafka
Webinar future dataintegration-datamesh-and-goldengatekafkaWebinar future dataintegration-datamesh-and-goldengatekafka
Webinar future dataintegration-datamesh-and-goldengatekafka
 
Cloud Computing Architecture Primer
Cloud Computing Architecture PrimerCloud Computing Architecture Primer
Cloud Computing Architecture Primer
 
2016 August POWER Up Your Insights - IBM System Summit Mumbai
2016 August POWER Up Your Insights - IBM System Summit Mumbai2016 August POWER Up Your Insights - IBM System Summit Mumbai
2016 August POWER Up Your Insights - IBM System Summit Mumbai
 
Best Practices For Building and Operating A Managed Data Lake - StampedeCon 2016
Best Practices For Building and Operating A Managed Data Lake - StampedeCon 2016Best Practices For Building and Operating A Managed Data Lake - StampedeCon 2016
Best Practices For Building and Operating A Managed Data Lake - StampedeCon 2016
 
Logical Data Lakes: From Single Purpose to Multipurpose Data Lakes (APAC)
Logical Data Lakes: From Single Purpose to Multipurpose Data Lakes (APAC)Logical Data Lakes: From Single Purpose to Multipurpose Data Lakes (APAC)
Logical Data Lakes: From Single Purpose to Multipurpose Data Lakes (APAC)
 
L'architettura di classe enterprise di nuova generazione - Massimo Brignoli
L'architettura di classe enterprise di nuova generazione - Massimo BrignoliL'architettura di classe enterprise di nuova generazione - Massimo Brignoli
L'architettura di classe enterprise di nuova generazione - Massimo Brignoli
 
Big Data, Simple and Fast: Addressing the Shortcomings of Hadoop
Big Data, Simple and Fast: Addressing the Shortcomings of HadoopBig Data, Simple and Fast: Addressing the Shortcomings of Hadoop
Big Data, Simple and Fast: Addressing the Shortcomings of Hadoop
 
DM Radio Webinar: Adopting a Streaming-Enabled Architecture
DM Radio Webinar: Adopting a Streaming-Enabled ArchitectureDM Radio Webinar: Adopting a Streaming-Enabled Architecture
DM Radio Webinar: Adopting a Streaming-Enabled Architecture
 
Big Data Session 1.pptx
Big Data Session 1.pptxBig Data Session 1.pptx
Big Data Session 1.pptx
 
ADV Slides: When and How Data Lakes Fit into a Modern Data Architecture
ADV Slides: When and How Data Lakes Fit into a Modern Data ArchitectureADV Slides: When and How Data Lakes Fit into a Modern Data Architecture
ADV Slides: When and How Data Lakes Fit into a Modern Data Architecture
 
Building a Single Logical Data Lake: For Advanced Analytics, Data Science, an...
Building a Single Logical Data Lake: For Advanced Analytics, Data Science, an...Building a Single Logical Data Lake: For Advanced Analytics, Data Science, an...
Building a Single Logical Data Lake: For Advanced Analytics, Data Science, an...
 
SAP on pay as you go model
SAP on pay as you go modelSAP on pay as you go model
SAP on pay as you go model
 

Recently uploaded

Mature dropshipping via API with DroFx.pptx
Mature dropshipping via API with DroFx.pptxMature dropshipping via API with DroFx.pptx
Mature dropshipping via API with DroFx.pptxolyaivanovalion
 
04242024_CCC TUG_Joins and Relationships
04242024_CCC TUG_Joins and Relationships04242024_CCC TUG_Joins and Relationships
04242024_CCC TUG_Joins and Relationshipsccctableauusergroup
 
Generative AI on Enterprise Cloud with NiFi and Milvus
Generative AI on Enterprise Cloud with NiFi and MilvusGenerative AI on Enterprise Cloud with NiFi and Milvus
Generative AI on Enterprise Cloud with NiFi and MilvusTimothy Spann
 
定制英国白金汉大学毕业证(UCB毕业证书) 成绩单原版一比一
定制英国白金汉大学毕业证(UCB毕业证书)																			成绩单原版一比一定制英国白金汉大学毕业证(UCB毕业证书)																			成绩单原版一比一
定制英国白金汉大学毕业证(UCB毕业证书) 成绩单原版一比一ffjhghh
 
Kantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdf
Kantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdfKantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdf
Kantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdfSocial Samosa
 
VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...
VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...
VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...Suhani Kapoor
 
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130Suhani Kapoor
 
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Callshivangimorya083
 
Unveiling Insights: The Role of a Data Analyst
Unveiling Insights: The Role of a Data AnalystUnveiling Insights: The Role of a Data Analyst
Unveiling Insights: The Role of a Data AnalystSamantha Rae Coolbeth
 
CebaBaby dropshipping via API with DroFX.pptx
CebaBaby dropshipping via API with DroFX.pptxCebaBaby dropshipping via API with DroFX.pptx
CebaBaby dropshipping via API with DroFX.pptxolyaivanovalion
 
April 2024 - Crypto Market Report's Analysis
April 2024 - Crypto Market Report's AnalysisApril 2024 - Crypto Market Report's Analysis
April 2024 - Crypto Market Report's Analysismanisha194592
 
VIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service Amravati
VIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service AmravatiVIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service Amravati
VIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service AmravatiSuhani Kapoor
 
100-Concepts-of-AI by Anupama Kate .pptx
100-Concepts-of-AI by Anupama Kate .pptx100-Concepts-of-AI by Anupama Kate .pptx
100-Concepts-of-AI by Anupama Kate .pptxAnupama Kate
 
Ukraine War presentation: KNOW THE BASICS
Ukraine War presentation: KNOW THE BASICSUkraine War presentation: KNOW THE BASICS
Ukraine War presentation: KNOW THE BASICSAishani27
 
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM TRACKING WITH GOOGLE ANALYTICS.pptx
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM  TRACKING WITH GOOGLE ANALYTICS.pptxEMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM  TRACKING WITH GOOGLE ANALYTICS.pptx
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM TRACKING WITH GOOGLE ANALYTICS.pptxthyngster
 
Week-01-2.ppt BBB human Computer interaction
Week-01-2.ppt BBB human Computer interactionWeek-01-2.ppt BBB human Computer interaction
Week-01-2.ppt BBB human Computer interactionfulawalesam
 
Call Girls In Mahipalpur O9654467111 Escorts Service
Call Girls In Mahipalpur O9654467111  Escorts ServiceCall Girls In Mahipalpur O9654467111  Escorts Service
Call Girls In Mahipalpur O9654467111 Escorts ServiceSapana Sha
 
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip CallDelhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Callshivangimorya083
 

Recently uploaded (20)

Mature dropshipping via API with DroFx.pptx
Mature dropshipping via API with DroFx.pptxMature dropshipping via API with DroFx.pptx
Mature dropshipping via API with DroFx.pptx
 
04242024_CCC TUG_Joins and Relationships
04242024_CCC TUG_Joins and Relationships04242024_CCC TUG_Joins and Relationships
04242024_CCC TUG_Joins and Relationships
 
Generative AI on Enterprise Cloud with NiFi and Milvus
Generative AI on Enterprise Cloud with NiFi and MilvusGenerative AI on Enterprise Cloud with NiFi and Milvus
Generative AI on Enterprise Cloud with NiFi and Milvus
 
E-Commerce Order PredictionShraddha Kamble.pptx
E-Commerce Order PredictionShraddha Kamble.pptxE-Commerce Order PredictionShraddha Kamble.pptx
E-Commerce Order PredictionShraddha Kamble.pptx
 
定制英国白金汉大学毕业证(UCB毕业证书) 成绩单原版一比一
定制英国白金汉大学毕业证(UCB毕业证书)																			成绩单原版一比一定制英国白金汉大学毕业证(UCB毕业证书)																			成绩单原版一比一
定制英国白金汉大学毕业证(UCB毕业证书) 成绩单原版一比一
 
Kantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdf
Kantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdfKantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdf
Kantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdf
 
VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...
VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...
VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...
 
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130
 
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
 
Unveiling Insights: The Role of a Data Analyst
Unveiling Insights: The Role of a Data AnalystUnveiling Insights: The Role of a Data Analyst
Unveiling Insights: The Role of a Data Analyst
 
CebaBaby dropshipping via API with DroFX.pptx
CebaBaby dropshipping via API with DroFX.pptxCebaBaby dropshipping via API with DroFX.pptx
CebaBaby dropshipping via API with DroFX.pptx
 
April 2024 - Crypto Market Report's Analysis
April 2024 - Crypto Market Report's AnalysisApril 2024 - Crypto Market Report's Analysis
April 2024 - Crypto Market Report's Analysis
 
VIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service Amravati
VIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service AmravatiVIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service Amravati
VIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service Amravati
 
100-Concepts-of-AI by Anupama Kate .pptx
100-Concepts-of-AI by Anupama Kate .pptx100-Concepts-of-AI by Anupama Kate .pptx
100-Concepts-of-AI by Anupama Kate .pptx
 
VIP Call Girls Service Charbagh { Lucknow Call Girls Service 9548273370 } Boo...
VIP Call Girls Service Charbagh { Lucknow Call Girls Service 9548273370 } Boo...VIP Call Girls Service Charbagh { Lucknow Call Girls Service 9548273370 } Boo...
VIP Call Girls Service Charbagh { Lucknow Call Girls Service 9548273370 } Boo...
 
Ukraine War presentation: KNOW THE BASICS
Ukraine War presentation: KNOW THE BASICSUkraine War presentation: KNOW THE BASICS
Ukraine War presentation: KNOW THE BASICS
 
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM TRACKING WITH GOOGLE ANALYTICS.pptx
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM  TRACKING WITH GOOGLE ANALYTICS.pptxEMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM  TRACKING WITH GOOGLE ANALYTICS.pptx
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM TRACKING WITH GOOGLE ANALYTICS.pptx
 
Week-01-2.ppt BBB human Computer interaction
Week-01-2.ppt BBB human Computer interactionWeek-01-2.ppt BBB human Computer interaction
Week-01-2.ppt BBB human Computer interaction
 
Call Girls In Mahipalpur O9654467111 Escorts Service
Call Girls In Mahipalpur O9654467111  Escorts ServiceCall Girls In Mahipalpur O9654467111  Escorts Service
Call Girls In Mahipalpur O9654467111 Escorts Service
 
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip CallDelhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
 

The CDO Agenda: how data architecture can help?

  • 1. Q. The CDO Agenda, how can Data Architecture help? Phill Radley, Chief Data Architect 26 / October / 2016 Data Management Specialist Group 1/25
  • 2. Answer • A lot • A bit • Not much
  • 3. The architects answer… It depends …..State of the business …..State of enterprise architecture ……What’s going on externally Do you have or need a CDO ?
  • 4. AGENDA Framing View of the Industry Discuss the CDO role The organisation of BT & IT Challenges Examples of what we’re doing with data (in the absence of a CDO)
  • 5. The Long View of Big Data Data “Bigness” =  ( Volume, Velocity, Variety) 1990 Y2K Mainframe (1st Platform) 1960 09 First Research cluster   Production Cluster HAAS = Hadoop as a Service 14 Proprietary, Monolithic Batch, Interactive COBOL/ISAM/IDMS Linked Record sets Client-Server Applications + RDBMS (2nd Platform ) OPEN ! 3GL, 4GL PC & Servers on premise RELATIONAL 1606 scale out infrastructure (3rd Platform) Clusters, Data hub, pipelines Mobile Social Big Data Cloud ? cost/performance VVV crunch
  • 6. What does a Chief Data Office(r) do ? Evangelise • Culture change to “data driven organisation” • Self-Service Data & Analytics Centralise ( tackle the silo problem ) “Year 1: Build the House” “Year 2: Throw Open the doors” Facilitate • Educate • Design Pattern Cookbook for the Enterprise • Briefings – All Hands Calls, Leadership Team Mtgs, Hackathons…. • Tooling • SKOOL on GITHUB (tool to simplify transferring tables from Oracle to Hadoop) “Understanding the Chief Data Officer” O’Reilly – Julie Steele of Silicon Valley Data Science
  • 7. A Good CDO Role Model ? Joy Bonaguro City of San Francisco data.sfgov.org Cataloguing Data Assets Facilitating data sharing Building enabling infrastructure
  • 8. BT Group Structure 1/Apr/2016 Customers Chief Architects Office Enterprise Architecture Data Architecture For BT Group ~ 90K FTE in 61 countries, serving 180 countries Research & Innovation
  • 9. Legacy Systems Architecture in each BT Business Unit Analytics Data Warehouse ESB CRM Service Management Network Management Networks & IT Customers   • Hundreds of systems in each business unit grouped into 3 operational areas (CRM/Service Mgt/Network Mgt) • Data Warehouse per business unit • Client – Server applications running on servers in BT Data Centres (~ 35K hosts) • Mainframe applications (in Openreach) • Total Storage ~ 25PB • Lots of event / time series data – Network Alarms & Telemetry – Netflow Traffic Events, Security events – Call Detail Records, web clicks, – mobile handset data (GPS, Apps, browsing..) • Business Unit CIOs manage IT investment roadmap, each business unit deploys a “stack release” quarterly Field Engineers
  • 10. Challenges - Complexity Example from BT Global Services Design for Release 17 of Repair Systems for 1 product family Where’s the Master Data ? Which flows are data replication ? Which flows are transactional ? x 70 Similar “system stacks” x 4 Releases / yr
  • 12. Challenges – Risk & Compliance
  • 13. Challenge – Agility Opportunity - Scaling
  • 14. What does Data architecture do…? 1. Sort the basics Adopt/Adapt a framework Establish Lists(systems, data landscape….) DAMA DMBOK.. TOGAF…
  • 15. What does Data architecture do…? 2. Develop Vision      CRM Hive Meta Store RDBMS Web/APP Server  Map  Reduce code  BI Tools Tableau, Zoomdata… (HIVE TABLE ACCESS) HDFS Impala + Sentry Wrangling & Discovery Data Science Datameer, HUE… (HDFS FILE ACCESS) Flume Golden Gate    ERP RDBMS Web/APP Server  Map  Reduce code  sqoop    DW RDBMS Web/APP Server  Map  Reduce code  sqoop        1. Event Ingestion from Networks/IT/Web servers Collection with flume agents landing in HDFS files 2. DB Table transfer using sqoop (map/reduce) jobs, landing in HDFS files Active Directory FILES TABLES snapshotCDC snapshot  Data Scientists SQL analysts business users
  • 16. What does Data architecture do…? 3A. Build the data house • Following a presentation to the TSO Leadership team Dec 2013 an initial inovestment in a production cluster was agreed backed by a plan to launch in Feb 2014 • 60 nodes optimised for Hadoop map/reduce deployed in BT Data Centre in Sheffield (6TB local disks, 1:1 core:spindle ratio, 8GB for JVM per map/reduce slot • Existing linux 3rd line team tasked with running basic (Min. Viable Product) Hadoop Cluster as a shared service platform BT HaaS Release 1: 60 Nodes ~ 2 PB Feb 2014 Linux 3rd Line  Hadoop Admin
  • 17. What does Data architecture do…? 3B. Build the data house HAAS Platform Hadoop Cluster B (Openreach only) Order form (SharePoint)  script  email Active Directory Tennant “Project Owner”  User admin Standard User Admin Process Hadoop Cluster A HAASA AP 00307_12126 HIVE HDFS sentry Job queue HUE Impala Flume BI Server Create Hadoop Features “HAASA AP 00307_12126 Is ready for you to use” existing Business APP 12126 . Oracle DB APP extends footprint in HaaS http FS Kerberos Datameer Analytics Review Board Platform Admin ARB User Access Systems Access Sqoop Create Security Group
  • 18. HAASA AP 00101_2029 Faults 4369 Orders 3531 CRM 2029  hree existing business applications (CRM, Orders, Faults) extended into HaaS  RDBMS Customer Table RDBMS Orders Table RDBMS Faults Table T_CustomerHive DB HAASA AP 00101_2029 sqoop V_Customer HAASA AP 00202_3531 T_OrdersHive DB HAASA AP 0202_3531 sqoop V_Orders HAASA AP 00303_4369 T_FaultsHive DB HAASA AP 0303_4369 sqoop V_Faults Business Data Stewards  Business Analysts / Data Scientists  CRM  Orders  Faults Governing Access to Data on the Platform ** WIP ** 1. Browse & select data 2. Get Steward Approval 3. Create VIEWs & GRANTs 4. Recommend joins/ Views Data Catalogue (Million Table Meta-store)
  • 19. Cloudera “Resident” Solution Architect What does Data architecture do…? 3. Educate BT HaaS Cookbook snip.bt.com/haascook Design patterns to ease project on boarding included in “Learning Pathways” Research & Innovation Data Scientists Dec 2015 3rd BT Data Science Week (50 @ Adastral) Business Awareness Sep 2014 UK Hadoop User Group (200 @ BT Centre) IT Operations Jan 2014 RESOPS training week (Research + IT Ops Adastral) Architecture Hadoop Summit Mar 2014 (Doug Cutting- Cloudera+BT) Big Data Data Centre of Excellence Cardiff / Bangalore 20 designers / developers working on > 50 opportunities & projects published open source “skool” utility
  • 20. Q & A Phill Radley Chief Data Architect phillip.radley@bt.com