SlideShare a Scribd company logo
1 of 14
Download to read offline
Architecting Data for Integration and Longevity 
Jonathan Hamilton Solórzano, UCLA ORIS
Today’s Objectives 
 
Provide background context for data integration projects within UCLA Research Administration, and in administration more generally 
 
Illustrate business problems driving the need for these integrations 
 
Provide a high level overview of lessons learned and considerations when implementing data services 
 
Specifically, discuss the architecture and utility of a federated data services approach
Information Flow: ORA 
Campus 
Raw info. 
ORA 
Research Rules 
UCLA 
Central & Shared Data 
 
ORA applications relate to research 
 
Learn what PI’s are doing to assist & govern 
 
Campus-facing systems gather data in a “friendly” way 
 
Internal systems standardize and determine actions 
 
Cross-reference other campus data sources 
 
Cross-reference internally against other systems
The Need 
 
Why can’t the systems just talk to each other? 
 
Why does it take so long when they do? 
 
Why are we locked into this old vendor system? 
Source Data 
System 3 
System 2 
System 1
Establishing the Data Architecture 
 
Identify Systems of Record 
 
Learn “business rules” for the systems of record 
 
Streamline and update business processes 
 
Design data architecture 
 
Degree of Normalization/Modeling Approach 
 
Requirements for Data Versioning 
 
Required Remote Source Data (internal/external) 
 
Assess and Plan for Data Quality 
 
Downstream Data Flow
Business Rules 
 
Understand Business Logic vs. Application Logic. Customized off-the-shelf vendor applications typically bring their own “business” logic from their prior customers or target use case. Work with business users to separate application behavior that they “work around” versus what they actually “work with.” 
 
Document Business Logic. In this way the application documentation is also the business documentation, and vice versa. 
 
Implement Business Logic. Where possible, pull specific business logic upstream from the vendor implementation by leveraging vendor APIs. Operate the business logic against your data domain.
Architected Data Delivery 
Current ArchitectureTarget ArchitectureService WrappersOperational Data StoreApplication APIsApplication UIService WrappersBusiness Logic ServicesExternal Data WarehouseTransactional DatabasesTransactional DatabasesApplication APIsORA Data WarehouseOperational Data StoreData ServicesBusiness Logic ServicesExternal Data WarehouseApplication UI 
 
Widely varying administrative processes demand unique transactional systems 
 
Organic application deployment results in a hodgepodge approach to data delivery 
 
Implementing a consistent N-tier approach will streamline the architecture and facilitate future development
User Experience Under the Hood 
 
Campus users interact with their transactional system and the cross-cutting data access system 
 
Data access interfaces consume a “smart” management service 
 
Management service implements interfaces against API or periodic external snapshot depending on need
Serving Federated Data 
Source System DataTransactional DBApplication APIOperational Data StoreData ServiceAccessPresent (Data API) Transform (Canonical Model) AccessAccessXML/JSON Data RepresentationPeriodic Refresh 
 
Presentation of data in JSON or XML for downstream interfaces provides performance and reusability 
 
Data service transforms all source data to a consistent canonical model regardless of the source data structure 
 
Data accessors implement a single interface against source of choice returning data in the canonical type 
 
Business logic becomes decoupled from source data schema structures, improving reusability and longevity
Investing in Longevity 
 
Significant investment to implement canonical data service 
 
Define the canonical data model 
 
Implement transforms to (and potentially from) the canonical model against the transactional system 
 
Implement transforms from the canonical model to other transactional or representational data models 
 
Significant savings for downstream development efforts 
 
All data consumption becomes an iterative effort, just add another representation to the canonical model 
 
All business logic can be implemented against the canonical model 
 
Allows changing out source transactional systems much more easily which reduces vendor lock-in
Additional Technical Considerations 
 
Implementation Details 
 
Iterative, phased approach 
 
Cross-pollination in project implementation teams 
 
Connection Architecture 
 
Connection hardening 
 
Data authorization and access control 
 
Underlying Infrastructure 
 
Server and Storage Stack 
 
Cloud services? (Data Security) 
 
The Future 
 
iPaaSand iSaaS
Beyond Technical Architecture 
People and Organization 
 
Defining business canonical data model as a collaboration 
 
Agreement on downstream data usage 
 
Communicating change to system consumers (i.e. campus users) 
Processes 
 
Information Security Compliance 
 
Data Change SLA 
 
Increased governance on source system changes 
 
Data Dictionary updates
Key Take-Aways 
1. 
Understand your users and how they think about data 
2. 
Build your internal data structure against only that understanding 
3. 
Actively determine the degree of normalization and versioning in that data structure 
4. 
Bridge your specific implementations to your internal data structure 
5. 
Serve your data from this internal consistent structure
Further Reading 
 
Information Security Office and Plan at https://www.itsecurity.ucla.edu/plan/ 
 
Campus Data Warehouse information at https://www.it.ucla.edu/accounts/get-access/qdb-access 
 
Gartner Articles: 
 
Altman, Ross et. al. “Gartner G00212138: MDM, SOA, and BPM: Alphabet Soup or a Toolkit to Address Critical Data Management Issues?” Gartner Technical Professional Advice. 27 May 2011; refreshed October 2013. 
 
Selvage, Mei. “Gartner G00250365: Data Integration Decision Point,” Gartner Technical Professional Advice. 4 April 2013.

More Related Content

What's hot

Ryan-FINALProjectAbstract-v1
Ryan-FINALProjectAbstract-v1Ryan-FINALProjectAbstract-v1
Ryan-FINALProjectAbstract-v1Kevin Ryan
 
Chapter 5 data resource management
Chapter 5 data resource managementChapter 5 data resource management
Chapter 5 data resource managementAG RD
 
Data Federation/EII Uses And Abuses
Data Federation/EII Uses And AbusesData Federation/EII Uses And Abuses
Data Federation/EII Uses And Abusesmark madsen
 
Eight styles of data integration
Eight styles of data integrationEight styles of data integration
Eight styles of data integrationSteve Sobotincic
 
Data integration ppt-bhawani nandan prasad - iim calcutta
Data integration ppt-bhawani nandan prasad - iim calcuttaData integration ppt-bhawani nandan prasad - iim calcutta
Data integration ppt-bhawani nandan prasad - iim calcuttaBhawani N Prasad
 
Enterprise Information Integration at LondonMet
Enterprise Information Integration at LondonMetEnterprise Information Integration at LondonMet
Enterprise Information Integration at LondonMetPaul Walk
 
Big Data Fabric Capability Maturity Model
Big Data Fabric Capability Maturity ModelBig Data Fabric Capability Maturity Model
Big Data Fabric Capability Maturity ModelRoss Collins
 
What is Informatica Powercenter
What is Informatica PowercenterWhat is Informatica Powercenter
What is Informatica PowercenterBigClasses Com
 
Business Intelligence: Data Warehouses
Business Intelligence: Data WarehousesBusiness Intelligence: Data Warehouses
Business Intelligence: Data WarehousesMichael Lamont
 
Master Data Management
Master Data ManagementMaster Data Management
Master Data ManagementMoniqueO Opris
 
Content Centric Applications
Content Centric ApplicationsContent Centric Applications
Content Centric ApplicationsNetApp
 
Master data management
Master data managementMaster data management
Master data managementZahra Mansoori
 

What's hot (19)

Data Flux
Data FluxData Flux
Data Flux
 
Ryan-FINALProjectAbstract-v1
Ryan-FINALProjectAbstract-v1Ryan-FINALProjectAbstract-v1
Ryan-FINALProjectAbstract-v1
 
Chapter 5 data resource management
Chapter 5 data resource managementChapter 5 data resource management
Chapter 5 data resource management
 
Data Federation/EII Uses And Abuses
Data Federation/EII Uses And AbusesData Federation/EII Uses And Abuses
Data Federation/EII Uses And Abuses
 
Enterprise Information Integration
Enterprise Information IntegrationEnterprise Information Integration
Enterprise Information Integration
 
Eight styles of data integration
Eight styles of data integrationEight styles of data integration
Eight styles of data integration
 
Data integration ppt-bhawani nandan prasad - iim calcutta
Data integration ppt-bhawani nandan prasad - iim calcuttaData integration ppt-bhawani nandan prasad - iim calcutta
Data integration ppt-bhawani nandan prasad - iim calcutta
 
Chap05 data resource mgt
Chap05 data resource mgtChap05 data resource mgt
Chap05 data resource mgt
 
Enterprise Information Integration at LondonMet
Enterprise Information Integration at LondonMetEnterprise Information Integration at LondonMet
Enterprise Information Integration at LondonMet
 
Big Data Fabric Capability Maturity Model
Big Data Fabric Capability Maturity ModelBig Data Fabric Capability Maturity Model
Big Data Fabric Capability Maturity Model
 
What is Informatica Powercenter
What is Informatica PowercenterWhat is Informatica Powercenter
What is Informatica Powercenter
 
Industrialization of IT and Operations
Industrialization of IT and OperationsIndustrialization of IT and Operations
Industrialization of IT and Operations
 
Chapter 11
Chapter 11Chapter 11
Chapter 11
 
Business Intelligence: Data Warehouses
Business Intelligence: Data WarehousesBusiness Intelligence: Data Warehouses
Business Intelligence: Data Warehouses
 
Master Data Management
Master Data ManagementMaster Data Management
Master Data Management
 
Content Centric Applications
Content Centric ApplicationsContent Centric Applications
Content Centric Applications
 
FlowOverview2
FlowOverview2FlowOverview2
FlowOverview2
 
IBM_Insight_2015
IBM_Insight_2015IBM_Insight_2015
IBM_Insight_2015
 
Master data management
Master data managementMaster data management
Master data management
 

Viewers also liked

EUGANGS - Κοινωνιοψυχολογικες προσεγγίσεις
EUGANGS - Κοινωνιοψυχολογικες προσεγγίσειςEUGANGS - Κοινωνιοψυχολογικες προσεγγίσεις
EUGANGS - Κοινωνιοψυχολογικες προσεγγίσειςGeorge Diamandis
 
Proceso para crear una cuenta en twitter
Proceso para crear una cuenta en twitterProceso para crear una cuenta en twitter
Proceso para crear una cuenta en twitterFranklinIsmalS
 
BCN Biosciences I-corps@nih 121014
BCN Biosciences I-corps@nih 121014 BCN Biosciences I-corps@nih 121014
BCN Biosciences I-corps@nih 121014 Stanford University
 

Viewers also liked (6)

Relative pronouns
Relative pronounsRelative pronouns
Relative pronouns
 
EUGANGS - Κοινωνιοψυχολογικες προσεγγίσεις
EUGANGS - Κοινωνιοψυχολογικες προσεγγίσειςEUGANGS - Κοινωνιοψυχολογικες προσεγγίσεις
EUGANGS - Κοινωνιοψυχολογικες προσεγγίσεις
 
Proceso para crear una cuenta en twitter
Proceso para crear una cuenta en twitterProceso para crear una cuenta en twitter
Proceso para crear una cuenta en twitter
 
Asclepix I-Corps@NIH 121014
Asclepix I-Corps@NIH 121014Asclepix I-Corps@NIH 121014
Asclepix I-Corps@NIH 121014
 
BCN Biosciences I-corps@nih 121014
BCN Biosciences I-corps@nih 121014 BCN Biosciences I-corps@nih 121014
BCN Biosciences I-corps@nih 121014
 
Doc2
Doc2Doc2
Doc2
 

Similar to t2_4-architecting-data-for-integration-and-longevity

Why Data Virtualization? An Introduction by Denodo
Why Data Virtualization? An Introduction by DenodoWhy Data Virtualization? An Introduction by Denodo
Why Data Virtualization? An Introduction by DenodoJusto Hidalgo
 
Integrating SIS’s with Salesforce: An Accidental Integrator’s Guide
Integrating SIS’s with Salesforce: An Accidental Integrator’s GuideIntegrating SIS’s with Salesforce: An Accidental Integrator’s Guide
Integrating SIS’s with Salesforce: An Accidental Integrator’s GuideSalesforce.org
 
Data Mesh in Azure using Cloud Scale Analytics (WAF)
Data Mesh in Azure using Cloud Scale Analytics (WAF)Data Mesh in Azure using Cloud Scale Analytics (WAF)
Data Mesh in Azure using Cloud Scale Analytics (WAF)Nathan Bijnens
 
The New Enterprise Alphabet - .Net, XML And XBRL
The New Enterprise Alphabet - .Net, XML And XBRLThe New Enterprise Alphabet - .Net, XML And XBRL
The New Enterprise Alphabet - .Net, XML And XBRLJorgen Thelin
 
Data Mesh using Microsoft Fabric
Data Mesh using Microsoft FabricData Mesh using Microsoft Fabric
Data Mesh using Microsoft FabricNathan Bijnens
 
Data Models [DATABASE SYSTEMS: Design, Implementation, and Management]
Data Models [DATABASE SYSTEMS: Design, Implementation, and Management]Data Models [DATABASE SYSTEMS: Design, Implementation, and Management]
Data Models [DATABASE SYSTEMS: Design, Implementation, and Management]Usman Tariq
 
Pysyvästi laadukasta masterdataa SmartMDM:n avulla
Pysyvästi laadukasta masterdataa SmartMDM:n avullaPysyvästi laadukasta masterdataa SmartMDM:n avulla
Pysyvästi laadukasta masterdataa SmartMDM:n avullaBilot
 
Data Virtualization: Introduction and Business Value (UK)
Data Virtualization: Introduction and Business Value (UK)Data Virtualization: Introduction and Business Value (UK)
Data Virtualization: Introduction and Business Value (UK)Denodo
 
Data Services and the Modern Data Ecosystem (ASEAN)
Data Services and the Modern Data Ecosystem (ASEAN)Data Services and the Modern Data Ecosystem (ASEAN)
Data Services and the Modern Data Ecosystem (ASEAN)Denodo
 
Information management
Information managementInformation management
Information managementDavid Champeau
 
BI Masterclass slides (Reference Architecture v3)
BI Masterclass slides (Reference Architecture v3)BI Masterclass slides (Reference Architecture v3)
BI Masterclass slides (Reference Architecture v3)Syaifuddin Ismail
 
3._DWH_Architecture__Components.ppt
3._DWH_Architecture__Components.ppt3._DWH_Architecture__Components.ppt
3._DWH_Architecture__Components.pptBsMath3rdsem
 
data-mesh_whitepaper_dec2021.pdf
data-mesh_whitepaper_dec2021.pdfdata-mesh_whitepaper_dec2021.pdf
data-mesh_whitepaper_dec2021.pdfssuser18927d
 
Process management seminar
Process management seminarProcess management seminar
Process management seminarapurva_naik
 
How to Get Cloud Architecture and Design Right the First Time
How to Get Cloud Architecture and Design Right the First TimeHow to Get Cloud Architecture and Design Right the First Time
How to Get Cloud Architecture and Design Right the First TimeDavid Linthicum
 
Denodo Partner Connect: A Review of the Top 5 Differentiated Use Cases for th...
Denodo Partner Connect: A Review of the Top 5 Differentiated Use Cases for th...Denodo Partner Connect: A Review of the Top 5 Differentiated Use Cases for th...
Denodo Partner Connect: A Review of the Top 5 Differentiated Use Cases for th...Denodo
 
Understanding Metadata: Why it's essential to your big data solution and how ...
Understanding Metadata: Why it's essential to your big data solution and how ...Understanding Metadata: Why it's essential to your big data solution and how ...
Understanding Metadata: Why it's essential to your big data solution and how ...Zaloni
 

Similar to t2_4-architecting-data-for-integration-and-longevity (20)

Why Data Virtualization? An Introduction by Denodo
Why Data Virtualization? An Introduction by DenodoWhy Data Virtualization? An Introduction by Denodo
Why Data Virtualization? An Introduction by Denodo
 
Integrating SIS’s with Salesforce: An Accidental Integrator’s Guide
Integrating SIS’s with Salesforce: An Accidental Integrator’s GuideIntegrating SIS’s with Salesforce: An Accidental Integrator’s Guide
Integrating SIS’s with Salesforce: An Accidental Integrator’s Guide
 
Data Mesh in Azure using Cloud Scale Analytics (WAF)
Data Mesh in Azure using Cloud Scale Analytics (WAF)Data Mesh in Azure using Cloud Scale Analytics (WAF)
Data Mesh in Azure using Cloud Scale Analytics (WAF)
 
Data Mesh
Data MeshData Mesh
Data Mesh
 
The New Enterprise Alphabet - .Net, XML And XBRL
The New Enterprise Alphabet - .Net, XML And XBRLThe New Enterprise Alphabet - .Net, XML And XBRL
The New Enterprise Alphabet - .Net, XML And XBRL
 
Data Mesh using Microsoft Fabric
Data Mesh using Microsoft FabricData Mesh using Microsoft Fabric
Data Mesh using Microsoft Fabric
 
Data Models [DATABASE SYSTEMS: Design, Implementation, and Management]
Data Models [DATABASE SYSTEMS: Design, Implementation, and Management]Data Models [DATABASE SYSTEMS: Design, Implementation, and Management]
Data Models [DATABASE SYSTEMS: Design, Implementation, and Management]
 
Pysyvästi laadukasta masterdataa SmartMDM:n avulla
Pysyvästi laadukasta masterdataa SmartMDM:n avullaPysyvästi laadukasta masterdataa SmartMDM:n avulla
Pysyvästi laadukasta masterdataa SmartMDM:n avulla
 
Data Virtualization: Introduction and Business Value (UK)
Data Virtualization: Introduction and Business Value (UK)Data Virtualization: Introduction and Business Value (UK)
Data Virtualization: Introduction and Business Value (UK)
 
Data models
Data modelsData models
Data models
 
Data Services and the Modern Data Ecosystem (ASEAN)
Data Services and the Modern Data Ecosystem (ASEAN)Data Services and the Modern Data Ecosystem (ASEAN)
Data Services and the Modern Data Ecosystem (ASEAN)
 
Information management
Information managementInformation management
Information management
 
BI Masterclass slides (Reference Architecture v3)
BI Masterclass slides (Reference Architecture v3)BI Masterclass slides (Reference Architecture v3)
BI Masterclass slides (Reference Architecture v3)
 
3._DWH_Architecture__Components.ppt
3._DWH_Architecture__Components.ppt3._DWH_Architecture__Components.ppt
3._DWH_Architecture__Components.ppt
 
data-mesh_whitepaper_dec2021.pdf
data-mesh_whitepaper_dec2021.pdfdata-mesh_whitepaper_dec2021.pdf
data-mesh_whitepaper_dec2021.pdf
 
Process management seminar
Process management seminarProcess management seminar
Process management seminar
 
Enterprise Deployments & SOA
Enterprise Deployments & SOAEnterprise Deployments & SOA
Enterprise Deployments & SOA
 
How to Get Cloud Architecture and Design Right the First Time
How to Get Cloud Architecture and Design Right the First TimeHow to Get Cloud Architecture and Design Right the First Time
How to Get Cloud Architecture and Design Right the First Time
 
Denodo Partner Connect: A Review of the Top 5 Differentiated Use Cases for th...
Denodo Partner Connect: A Review of the Top 5 Differentiated Use Cases for th...Denodo Partner Connect: A Review of the Top 5 Differentiated Use Cases for th...
Denodo Partner Connect: A Review of the Top 5 Differentiated Use Cases for th...
 
Understanding Metadata: Why it's essential to your big data solution and how ...
Understanding Metadata: Why it's essential to your big data solution and how ...Understanding Metadata: Why it's essential to your big data solution and how ...
Understanding Metadata: Why it's essential to your big data solution and how ...
 

t2_4-architecting-data-for-integration-and-longevity

  • 1. Architecting Data for Integration and Longevity Jonathan Hamilton Solórzano, UCLA ORIS
  • 2. Today’s Objectives  Provide background context for data integration projects within UCLA Research Administration, and in administration more generally  Illustrate business problems driving the need for these integrations  Provide a high level overview of lessons learned and considerations when implementing data services  Specifically, discuss the architecture and utility of a federated data services approach
  • 3. Information Flow: ORA Campus Raw info. ORA Research Rules UCLA Central & Shared Data  ORA applications relate to research  Learn what PI’s are doing to assist & govern  Campus-facing systems gather data in a “friendly” way  Internal systems standardize and determine actions  Cross-reference other campus data sources  Cross-reference internally against other systems
  • 4. The Need  Why can’t the systems just talk to each other?  Why does it take so long when they do?  Why are we locked into this old vendor system? Source Data System 3 System 2 System 1
  • 5. Establishing the Data Architecture  Identify Systems of Record  Learn “business rules” for the systems of record  Streamline and update business processes  Design data architecture  Degree of Normalization/Modeling Approach  Requirements for Data Versioning  Required Remote Source Data (internal/external)  Assess and Plan for Data Quality  Downstream Data Flow
  • 6. Business Rules  Understand Business Logic vs. Application Logic. Customized off-the-shelf vendor applications typically bring their own “business” logic from their prior customers or target use case. Work with business users to separate application behavior that they “work around” versus what they actually “work with.”  Document Business Logic. In this way the application documentation is also the business documentation, and vice versa.  Implement Business Logic. Where possible, pull specific business logic upstream from the vendor implementation by leveraging vendor APIs. Operate the business logic against your data domain.
  • 7. Architected Data Delivery Current ArchitectureTarget ArchitectureService WrappersOperational Data StoreApplication APIsApplication UIService WrappersBusiness Logic ServicesExternal Data WarehouseTransactional DatabasesTransactional DatabasesApplication APIsORA Data WarehouseOperational Data StoreData ServicesBusiness Logic ServicesExternal Data WarehouseApplication UI  Widely varying administrative processes demand unique transactional systems  Organic application deployment results in a hodgepodge approach to data delivery  Implementing a consistent N-tier approach will streamline the architecture and facilitate future development
  • 8. User Experience Under the Hood  Campus users interact with their transactional system and the cross-cutting data access system  Data access interfaces consume a “smart” management service  Management service implements interfaces against API or periodic external snapshot depending on need
  • 9. Serving Federated Data Source System DataTransactional DBApplication APIOperational Data StoreData ServiceAccessPresent (Data API) Transform (Canonical Model) AccessAccessXML/JSON Data RepresentationPeriodic Refresh  Presentation of data in JSON or XML for downstream interfaces provides performance and reusability  Data service transforms all source data to a consistent canonical model regardless of the source data structure  Data accessors implement a single interface against source of choice returning data in the canonical type  Business logic becomes decoupled from source data schema structures, improving reusability and longevity
  • 10. Investing in Longevity  Significant investment to implement canonical data service  Define the canonical data model  Implement transforms to (and potentially from) the canonical model against the transactional system  Implement transforms from the canonical model to other transactional or representational data models  Significant savings for downstream development efforts  All data consumption becomes an iterative effort, just add another representation to the canonical model  All business logic can be implemented against the canonical model  Allows changing out source transactional systems much more easily which reduces vendor lock-in
  • 11. Additional Technical Considerations  Implementation Details  Iterative, phased approach  Cross-pollination in project implementation teams  Connection Architecture  Connection hardening  Data authorization and access control  Underlying Infrastructure  Server and Storage Stack  Cloud services? (Data Security)  The Future  iPaaSand iSaaS
  • 12. Beyond Technical Architecture People and Organization  Defining business canonical data model as a collaboration  Agreement on downstream data usage  Communicating change to system consumers (i.e. campus users) Processes  Information Security Compliance  Data Change SLA  Increased governance on source system changes  Data Dictionary updates
  • 13. Key Take-Aways 1. Understand your users and how they think about data 2. Build your internal data structure against only that understanding 3. Actively determine the degree of normalization and versioning in that data structure 4. Bridge your specific implementations to your internal data structure 5. Serve your data from this internal consistent structure
  • 14. Further Reading  Information Security Office and Plan at https://www.itsecurity.ucla.edu/plan/  Campus Data Warehouse information at https://www.it.ucla.edu/accounts/get-access/qdb-access  Gartner Articles:  Altman, Ross et. al. “Gartner G00212138: MDM, SOA, and BPM: Alphabet Soup or a Toolkit to Address Critical Data Management Issues?” Gartner Technical Professional Advice. 27 May 2011; refreshed October 2013.  Selvage, Mei. “Gartner G00250365: Data Integration Decision Point,” Gartner Technical Professional Advice. 4 April 2013.