Enterprise Architecture
for Big data projects
Business Architecture
Information Architecture
Infrastructure Architecture
Data Architecture
Integration Architecture
Service Architecture
Solution Architecture
BI Solution
Requireme
nts
Corporate
Strategy
BI
Stakeholders
BI Vision
BI Strategy
BI Mission
Statement
BI Solution
Architectur
e
Business
Architecture
Infrastructure
Architecture
Data
Architecture
Service
Architecture
Integration
Architecture
Information
Architecture
Business Architecture
Business
Architecture
Business Objective
Business Strategy
Business
Capabilities
Business Process
Areas
Ways of Working
Interviews
Requirement
Workshop
Business Structure
Information Architecture
Information
Architecture
Social
Media:Facebook
data from API
Legacy System
Social Media Data:
Twitter
Data Capture
System
Web Application
data
KANBAN Process
ERP
(Manufacturing)
Landing
Space
Staging
Tables
ODC
Denormaliz
ed Column
Families
Row Key
ERP 3NF
Tables
Reference
Data
Data Ownership Data Contracts
Structural
Design
Shared Info
Env.
Flow &
Lineage
Meaning and
Use
Application Architecture
Application
Architecture
noSQL databases: HBASE, Cassandra,
MongoDB
Relational databases (Source /Target):
Oracle,SQL Server,Terradata
Graph databases: Neo4j and Giraph
Hadoop Technologies: HDFS
In memory/MPP/Search:
Shark,Spark,Impala,SAP HANA,
Visualisation/Reporting:
Tableau,Pentaho,qlickview
Data Integration:
(Talend/Informatica/BODI,Pentaho DI)
How All Application Hook Together
Data Warehouse:
(Hive, Relational Datawarehouse,SAP
HANA)
Infrastructure
Architecture
Infrastructur
e
Architecture
Storage
Communication
Data Federation
Data Virtualization
Cloud
Security
HA/DR
Recovery/Backup
Licensing Models
Versioning / Patching
Hardware/ VM
Procurement
Sizing
Production
Test
Development
Data Architecture
Data
Architecture
ETL
Sources
Data Governance
Metadata
Data Quality
Data Profiling
MDM
Big data
Near Real time
Real time
ODS
Reference Data
Physical /logical Data model
Dimensional Modelling
Entity Modelling
Data Warehouse
Data Integration
Architecture
Data
Integration
Architecture
ETL
ETL Subsystem
Sources
Data Governance
Metadata
Data Quality
Data Profiling
MDM
Big data
Near Real time
Real time
ODS
Reference Data
Physical /logical Data model
Dimensional Modelling
Entity Modelling
Data Warehouse
Details Next
Slide
ETL Subsystems
Profilin
g
Extrac
tion
Data
Change
Mgmt.
Error
Event
SchemaAudit
Dimensi
onDeduplicati
on
Data
Cleansin
gSurrogat
e
Generati
on
SCD
Mgmt.
Fact
Update
Dimension
& Fact
Conformity
Hierarch
y Mgmt.
Surrogat
e Mgmt.
Bridge
Table
Sorting
Job
Schedule
rLineage
Dependenci
esParallel
Processin
g
Bulk
Load
TransformExtract Load
Recovery
&
Restart
Security
Systems
Package
Versionin
g
Complianc
e
Metadata
Reposito
ryRecovery
&
Restart
Problem
Escalatio
n
Package
Lifecycle
Workflow
Monitori
ng
Backup
Systems
Manage
Integration
Architecture
Integration
Architecture
Security (Cloud Security, Data security,
Top 10)
Portal Integration
Coarse grain Integration (Web Services)
Social Media API
Distributed Hadoop Java Customization
Infrastructure AWS API publication-
consumption
Service Oriented Architecture
Big data Analysis Java Components
Data Analysis javascript Libraries
Real time data feed API (node.js)
3rd party Visualisation API like Adobe Flex
Analytics API
Integration Governance Framework
Other 3rd party system Integration API
Process Modelling: BPM/BPEL components
Supporting Platforms Integration
Service Architecture
Service
Architecture
Compliance
Backup and Recovery
Disaster Recovery and High Availability
Service Library
Defects/ Fixes/Impact Management
Upgrades/Maintenance
Service Operations
Change Requests CR, Request For Change
RFC
Release Policy
Service Transition
Service Design Package
Capacity Plan
Service Level Agreement
Service Design
Recovery Time Objective/ Recovery Point
Objective
Service Strategy

Enterprise architecture for big data projects

Editor's Notes

  • #4 Put Enterprise Architecture paradigm ADM framework into BI and Business Artifacts for Each process.
  • #5 Put Enterprise Architecture paradigm ADM framework into BI and Business Artifacts for Each process.