SlideShare a Scribd company logo
1 of 78
Download to read offline
Copyright 2013 by Data Blueprint
Data Systems Integration & Business Value Part 2: Cloud-based Integration
All organizations are prepared to benefit from aspects of the cloud.
These benefits accrue when cloud-hosted datasets share three
attributes. They must be of:
1. Higher quality data than those data residing outside of the cloud;
2. Lower volume (1/5 the size of data collections) than similar
collections residing outside of the cloud; and
3. Increased share-ability than data residing outside the cloud.
Increases in capacity utilization, improved IT flexibility and
responsiveness, as well as the forecast decreases in cost accruing
to cloud-based computing are all possible after these first three
conditions have been met. Necessary investments in data
engineering can help organizations to save even more money by
reducing the amount of resources required to perform their duties
and increasing the effectiveness of their duties & decision-making.
This webinar will show you how to recognize the opportunities,
‘size up’ the required investment, and properly supervise your
efforts to take advantage of the opportunities presented by the
cloud.
Date: August 13, 2013
Time: 2:00 PM ET/11:00 AM PT
Presenter: Peter Aiken, Ph.D.
1
Copyright 2013 by Data Blueprint
Executive Editor at DATAVERSITY.net
2
Shannon Kempe
Copyright 2013 by Data Blueprint
Commonly Asked Questions
1) Will I get copies of the
slides after the event?
2) Is this being recorded so I
can view it afterwards?
3
Copyright 2013 by Data Blueprint
Get Social With Us!
Live Twitter Feed
Join the conversation!
Follow us:
@datablueprint
@paiken
Ask questions and submit your
comments: #dataed
Like Us on Facebook
www.facebook.com/datablueprint
Post questions and comments
Find industry news, insightful content
and event updates.
Join the Group
Data Management & Business
Intelligence
Ask questions, gain insights and
collaborate with fellow data
management professionals
4
Copyright 2013 by Data Blueprint
5
Peter Aiken, PhD
• 25+ years of experience in data
management
• Multiple international awards &
recognition
• Founder, Data Blueprint (datablueprint.com)
• Associate Professor of IS, VCU (vcu.edu)
• President, DAMA International (dama.org)
• 8 books and dozens of articles
• Experienced w/ 500+ data
management practices in 20 countries
• Multi-year immersions with
organizations as diverse as the
US DoD, Nokia, Deutsche Bank,
Wells Fargo, and the Commonwealth
of Virginia
2
The Case for the
Chief Data Officer
Recasting the C-Suite to Leverage
Your MostValuable Asset
Peter Aiken and
Michael Gorman
Data Systems Integration & Business
Value Part 2: Cloud-based Integration
Presented by Peter Aiken, Ph.D.
10124 W. Broad Street, Suite C
Glen Allen, Virginia 23060
804.521.4056
Copyright 2013 by Data Blueprint
Anticipated Business Value of Cloud-based Integration
7
• Increased Automation and Storage Capacity
– Virtually unlimited capacity & flexible storage
– Easy to upgrade & Up-to-date software
– Automated file synching & backups
• Affordability
– Pay as you go
– Usage is scaled to fit needs
• Agility, Scalability and Flexibility
– Access from anywhere & collaborate
– Data is always current
• Free up IT Hours & Staff
– Cloud provider takes care of maintenance
• Ease of Use
– Easy to use & automated
Copyright 2013 by Data Blueprint
Prerequisites to Cloud-based Integration
• Organizational investments in the cloud will be useless from
a data perspective unless:
– Data governance, architecture, quality, development practices are
sufficiently mature
– You must understand your data architecture and strategy in order to
evaluate various cloud options
– Data must be reengineered to be
• Less
• Better quality
• More shareable
– for the cloud
– Failure to do these will
result in more business
value for the cloud
vendors/service providers
and less for your
organization
8
Copyright 2013 by Data Blueprint
1. Data Management: Contextual Overview
2. Necessary Data Management Functions
(Prerequisites)
- Data Governance
- Data Architecture
- Data Development
- Data Quality
3. Understanding Cloud-based
Technologies
4. Cloud-based Benefits
5. Cloud-based Integration
- Cleaner
- Smaller
- Shareable
6. Take Aways, References and Q&A
Tweeting now:
#dataed
Outline: Cloud-based Integration
9
1. Data Management: Contextual Overview
2. Necessary Data Management Functions
(Prerequisites)
- Data Governance
- Data Architecture
- Data Development
- Data Quality
3. Understanding Cloud-based
Technologies
4. Cloud-based Benefits
5. Cloud-based Integration
- Cleaner
- Smaller
- Shareable
6. Take Aways, References and Q&A
Copyright 2013 by Data Blueprint
1. Data Management: Contextual Overview
2. Necessary Data Management Functions
(Prerequisites)
- Data Governance
- Data Architecture
- Data Development
- Data Quality
3. Understanding Cloud-based
Technologies
4. Cloud-based Benefits
5. Cloud-based Integration
- Cleaner
- Smaller
- Shareable
6. Take Aways, References and Q&A
Tweeting now:
#dataed
Outline: Cloud-based Integration
10
Data Program
Coordination
Feedback
Data
Development
Copyright 2013 by Data Blueprint
Standard
Data
Five Integrated DM Practice Areas
Organizational Strategies
Goals
Business
Data
Business Value
Application
Models &
Designs
Implementation
Direction
Guidance
11
Organizational
Data Integration
Data
Stewardship
Data Support
Operations
Data
Asset Use
Integrated
Models
Leverage data in organizational activities
Data management
processes and
infrastructure
Combining multiple
assets to produce
extra value
Organizational-entity
subject area data
integration
Provide reliable data
access
Achieve sharing of data within a
business area
Copyright 2013 by Data Blueprint
Five Integrated DM Practice Areas
Manage data coherently.
Share data across boundaries.
Assign responsibilities for data.
Engineer data delivery systems.
Maintain data availability.
Data Program
Coordination
Organizational Data
Integration
Data Stewardship Data Development
Data Support
Operations
12
Copyright 2013 by Data Blueprint
Hierarchy of Data Management Practices (after Maslow)
• 5 Data management
practices areas /
data management
basics ...
• ... are necessary but
insufficient
prerequisites to
organizational data
leveraging
applications that is
self actualizing data
or advanced data
practices Basic Data Management Practices
– Data Program Management
– Organizational Data Integration
– Data Stewardship
– Data Development
– Data Support Operations
http://3.bp.blogspot.com/-ptl-9mAieuQ/T-idBt1YFmI/AAAAAAAABgw/Ib-nVkMmMEQ/s1600/maslows_hierarchy_of_needs.png
Advanced
Data
Practices
• MDM
• Mining
• Big Data
• Analytics
• Warehousing
• SOA
Cloud
• Data Management Body of Knowledge
(DMBOK)
– Published by DAMA International, the
professional association for
Data Managers (40 chapters worldwide)
– Organized around primary data management
functions focused around data delivery to the
organization and several environmental elements
• Certified Data Management Professional
(CDMP)
– Series of 3 exams by DAMA International and
ICCP
– Membership in a distinct group of
fellow professionals
– Recognition for specialized knowledge in a
choice of 17 specialty areas
– For more information, please visit:
• www.dama.org, www.iccp.org
Copyright 2013 by Data Blueprint
DAMA DM BoK & CDMP
14
Copyright 2013 by Data Blueprint
Series Context
• Certain systems are more data
focused than others. Usually
their primary focus is on
accomplishing integration of
disparate data. In these cases,
failure is most often attributable
to the adoption of a single
technological pillar (silver bullet).
The three webinars in the Data
Systems Integration and Business Value
series are designed to illustrate that
good systems development more often depends on at least three
DM disciplines (pie wedges) in order to provide a solid foundation.
• Data Systems Integration & Business Value
– Pt. 1: Metadata Practices
– Pt. 2: Cloud-based Integration
– Pt. 3: Warehousing, et al.
15
Uses
Copyright 2013 by Data Blueprint
Part 1: Metadata Take Aways
• Metadata unlocks the value of data, and therefore requires
management attention [Gartner 2011]
• Metadata is the language of data governance
• Metadata defines the essence of integration challenges
Sources
Metadata Governance
Metadata
Engineering
Metadata
Delivery
Metadata Practices
Metadata
Storage
16
Specialized Team Skills
Data
Management
functions
necessary but
insufficient for
metadata-
based
integration
Copyright 2013 by Data Blueprint
Data Management
Body of
Knowledge
17
From The DAMA Guide to the Data Management Body of Knowledge © 2009 by DAMA International
Copyright 2013 by Data Blueprint
1. Data Management: Contextual Overview
2. Necessary Data Management Functions
(Prerequisites)
- Data Governance
- Data Architecture
- Data Development
- Data Quality
3. Understanding Cloud-based
Technologies
4. Cloud-based Benefits
5. Cloud-based Integration
- Cleaner
- Smaller
- Shareable
6. Take Aways, References and Q&A
Tweeting now:
#dataed
Outline: Cloud-based Integration
18
Copyright 2013 by Data Blueprint
1. Data Management: Contextual Overview
2. Necessary Data Management Functions
(Prerequisites)
- Data Governance
- Data Architecture
- Data Development
- Data Quality
3. Understanding Cloud-based
Technologies
4. Cloud-based Benefits
5. Cloud-based Integration
- Cleaner
- Smaller
- Shareable
6. Take Aways, References and Q&A
Tweeting now:
#dataed
Outline: Cloud-based Integration
19
Copyright 2013 by Data Blueprint
Data Management
Body of
Knowledge
20
Data
Management
functions
necessary but
insufficient
for cloud-
based
integration
From The DAMA Guide to the Data Management Body of Knowledge © 2009 by DAMA International
Copyright 2013 by Data Blueprint
21
Data
Governance
From The DAMA Guide to the Data Management Body of Knowledge © 2009 by DAMA International
Copyright 2013 by Data Blueprint
Data Governance
22
From The DAMA Guide to the Data Management Body of Knowledge © 2009 by DAMA International
Copyright 2013 by Data Blueprint
Organizational Data Governance Purpose Statement
• What does data governance
mean to my organization?
– Getting some individuals (whose
opinions matter)
– To form a body (needs a formal
purpose/authority)
– Who will advocate/evangelize for
(not dictate, enforce, rule)
– Increasing scope and rigor of
– Data-centric development
practices
23
Top
Operations
Job
Copyright 2013 by Data Blueprint
Data Governance is a Gateway for IT Projects
24
Top Job
Top
Finance
Job
Top
Information
Technology
Job
Top
Marketing
Job
• Data assets are better foundational building block for IT projects
• CDO coordinates IT investment priorities with Top IT Job
• CDO determines when proposed IT projects are "ready"
Data Governance Organization
Chief
Data
Officer
Copyright 2013 by Data Blueprint
25
Data
Architecture
Management
From The DAMA Guide to the Data Management Body of Knowledge © 2009 by DAMA International
Copyright 2013 by Data Blueprint
Data Architecture Management
26
From The DAMA Guide to the Data Management Body of Knowledge © 2009 by DAMA International
Copyright 2013 by Data Blueprint
Architectural Answers
(Adapted from [Allen & Boynton 1991])
Computers
Human resources
Communication facilities
Software
Management
responsibilities
Policies,
directives,
and rules
Data
27
• Where do they go?
• When are they needed?
• What standards
should be adopted?
• What vendors
should be chosen?
• What rules should govern
the decisions?
• What policies should guide
the process?
• How and why do the components interact?
• Why and how will the changes be implemented?
• What should be managed organization-wide and what should
be managed locally?
Zachman Framework 3.0 - the Enterprise Ontology
Classification
Names
Model
Names
*Horizontal integration lines
areshownforexamplepurposes
only and are not a complete set.
Composite, integrative rela-
tionships connecting every cell
horizontally potentially exist.
Audience
Perspectives
Enterprise
Names
Classification
Names
Audience
Perspectives
C o m p o s i t e I n t e g r a t i o n s
A
l
i
g
n
m
e
n
t
T
r
a
n
s
f
o
r
m
a
t
i
o
n
s
C o m p o s i t e I n t e g r a t i o n s
A
l
i
g
n
m
e
n
t
T
r
a
n
s
f
o
r
m
a
t
i
o
n
s
C o m p o s i t e I n t e g r a t i o n s C o m p o s i t e I n t e g r a t i o n s
A
l
i
g
n
m
e
n
t
T
r
a
n
s
f
o
r
m
a
t
i
o
n
s
A
l
i
g
n
m
e
n
t
T
r
a
n
s
f
o
r
m
a
t
i
o
n
s
A l i g n m e n t
A l i g n m e n t
How Where Who WhenWhat Why
Process
Flows
Distribution
Networks
Responsibility
Assignments
Timing
Cycles
Inventory
Sets
Motivation
Intentions
Operations
Instances
(Implementations)
The
Enterprise
The
Enterprise
Enterprise
Perspective
(Users)
Executive
Perspective
(Business	
 Context
Planners)
Business Mgmt
Perspective
(Business	
 Concept	
 
Owners)
Architect
Perspective
(Business	
 Logic
Designers)
Engineer
Perspective
(Business	
 Physics	
 
Builders)
Technician
Perspective
(Business	
 Component
Implementers)
Scope
Contexts
(Scope	
 Identification	
 
Lists)
Business
Concepts
(Business	
 Definition	
 
Models)
System
Logic
(System
Representation	
 Models)
Technology
Physics
(Technology
Specification	
 Models)
Tool
Components
(Tool	
 Configuration	
 
Models)
e.g. e.g. e.g. e.g. e.g. e.g.
e.g. e.g. e.g. e.g. e.g. e.g.
e.g. e.g. e.g. e.g. e.g. e.g.
e.g. e.g. e.g. e.g. e.g. e.g.
e.g.: primitive e.g.: composite model:
model:
Forecast Sales
Plan Production
Sell Products
Take Orders
Train Employees
Assign Territories
Develop Markets
Maintain Facilities
Repair Products
Record Transctns
Material Supply Ntwk
Product Dist. Ntwk
Voice Comm. Ntwk
Data Comm. Ntwk
Manu. Process Ntwk
Office
 

Wrk
 

Flow
 

Ntwk
Parts Dist. Ntwk
Personnel Dist. Ntwk
etc., etc.
General Mgmt
Product Mgmt
Engineering Design
Manu. Engineering
Accounting
Finance
Transportation
Distribution
Marketing
Sales
Product Cycle
Market Cycle
Planning Cycle
Order Cycle
Employee Cycle
Maint. Cycle
Production Cycle
Sales Cycle
Economic Cycle
Accounting Cycle
Products
Product Types
Warehouses
Parts Bins
Customers
Territories
Orders
Employees
Vehicles
Accounts
New Markets
Revenue Growth
Expns Reduction
Cust Convenience
Customer Satis.
Regulatory Comp.
New Capital
Social Contribution
Increased Yield
Increased Qualitye.g. e.g. e.g. e.g. e.g. e.g.
Operations	
 Transforms
Operations	
 In/Outputs
Operations	
 Locations
Operations	
 Connections
Operations	
 Roles
Operations	
 Work	
 Products
Operations	
 Intervals
Operations	
 Moments
Operations	
 Entities
Operations	
 Relationships
Operations	
 Ends
Operations	
 Means
Process
Instantiations
Distribution
Instantiations
Responsibility
Instantiations
Timing
Instantiations
Inventory
Instantiations
Motivation
Instantiations
List: Timing Types
Business Interval
Business Moment
List: Responsibility Types
Business Role
Business Work Product
List: Distribution Types
Business Location
Business Connection
List: Process Types
Business Transform
Business Input/Output
System Transform
System Input /Output
System Location
System Connection
System Role
System Work Product
System Interval
System Moment
Technology Transform
Technology Input /Output
Technology Location
Technology Connection
Technology Role
Technology Work Product
Technology Interval
Technology Moment
Tool Transform
Tool Input /Output
Tool Location
Tool Connection
Tool Role
Tool Work Product
Tool Interval
Tool Moment
List: Inventory Types
Business Entity
Business Relationship
System Entity
System Relationship
Technology Entity
Technology Relationship
Tool Entity
Tool Relationship
List: Motivation Types
Business End
Business Means
System End
System Means
Technology End
Technology Means
Tool End
Tool Means
Timing	
 IdentificationResponsibility	
 IdentificationDistribution	
 IdentificationProcess	
 Identification
Timing	
 DefinitionResponsibility	
 DefinitionDistribution	
 DefinitionProcess	
 Definition
Process	
 Representation Distribution	
 Representation Responsibility	
 Representation Timing	
 Representation
Process	
 Specification Distribution	
 Specification Responsibility	
 Specification Timing	
 Specification
Inventory	
 Identification
Inventory	
 Definition
Inventory	
 Representation
Inventory	
 Specification
Inventory	
 Configuration Process	
 Configuration Distribution	
 Configuration Responsibility	
 Configuration Timing	
 Configuration
Motivation	
 Identification
Motivation	
 Definition
Motivation	
 Representation
Motivation	
 Specification
Motivation	
 Configuration
Copyright 2013 by Data Blueprint
28
Copyright 2008-2011 John A. Zachman
Copyright 2013 by Data Blueprint
What is an information architecture?
• A structure of data-based information
assets supporting implementation of
organizational strategy (or strategies)
• Most organizations have data assets
that are not supportive of strategies -
i.e., information architectures that are
not helpful
• The really important question is: how
can organizations more effectively
use their information architectures to
support strategy implementation?
29
Classification
Names
Model
Names
*Horizontal integration lines
areshownforexamplepurposes
only and are not a complete set.
Composite, integrative rela-
tionships connecting every cell
horizontally potentially exist.
Audience
Perspectives
Enterprise
Names
Classification
Names
Audience
Perspectives
C o m p o s i t e I n t e g r a t i o n s
A
l
i
g
n
m
e
n
t
T
r
a
n
s
f
o
r
m
a
t
i
o
n
s
C o m p o s i t e I n t e g r a t i o n s
A
l
i
g
n
m
e
n
t
T
r
a
n
s
f
o
r
m
a
t
i
o
n
s
C o m p o s i t e I n t e g r a t i o n s C o m p o s i t e I n t e g r a t i o n s
A
l
i
g
n
m
e
n
t
T
r
a
n
s
f
o
r
m
a
t
i
o
n
s
A
l
i
g
n
m
e
n
t
T
r
a
n
s
f
o
r
m
a
t
i
o
n
s
A l i g n m e n t
A l i g n m e n t
How Where Who WhenWhat Why
Process
Flows
Distribution
Networks
Responsibility
Assignments
Timing
Cycles
Inventory
Sets
Motivation
Intentions
Operations
Instances
(Implementations)
The
Enterprise
The
Enterprise
Enterprise
Perspective
(Users)
Executive
Perspective
(Business	
 Context
Planners)
Business Mgmt
Perspective
(Business	
 Concept	
 
Owners)
Architect
Perspective
(Business	
 Logic
Designers)
Engineer
Perspective
(Business	
 Physics	
 
Builders)
Technician
Perspective
(Business	
 Component
Implementers)
Scope
Contexts
(Scope	
 Identification	
 
Lists)
Business
Concepts
(Business	
 Definition	
 
Models)
System
Logic
(System
Representation	
 Models)
Technology
Physics
(Technology
Specification	
 Models)
Tool
Components
(Tool	
 Configuration	
 
Models)
e.g. e.g. e.g. e.g. e.g. e.g.
e.g. e.g. e.g. e.g. e.g. e.g.
e.g. e.g. e.g. e.g. e.g. e.g.
e.g. e.g. e.g. e.g. e.g. e.g.
e.g.: primitive e.g.: composite model:
model:
Forecast Sales
Plan Production
Sell Products
Take Orders
Train Employees
Assign Territories
Develop Markets
Maintain Facilities
Repair Products
Record Transctns
Material Supply Ntwk
Product Dist. Ntwk
Voice Comm. Ntwk
Data Comm. Ntwk
Manu. Process Ntwk
Office
 

Wrk
 

Flow
 

Ntwk
Parts Dist. Ntwk
Personnel Dist. Ntwk
etc., etc.
General Mgmt
Product Mgmt
Engineering Design
Manu. Engineering
Accounting
Finance
Transportation
Distribution
Marketing
Sales
Product Cycle
Market Cycle
Planning Cycle
Order Cycle
Employee Cycle
Maint. Cycle
Production Cycle
Sales Cycle
Economic Cycle
Accounting Cycle
Products
Product Types
Warehouses
Parts Bins
Customers
Territories
Orders
Employees
Vehicles
Accounts
New Markets
Revenue Growth
Expns Reduction
Cust Convenience
Customer Satis.
Regulatory Comp.
New Capital
Social Contribution
Increased Yield
Increased Qualitye.g. e.g. e.g. e.g. e.g. e.g.
Operations	
 Transforms
Operations	
 In/Outputs
Operations	
 Locations
Operations	
 Connections
Operations	
 Roles
Operations	
 Work	
 Products
Operations	
 Intervals
Operations	
 Moments
Operations	
 Entities
Operations	
 Relationships
Operations	
 Ends
Operations	
 Means
Process
Instantiations
Distribution
Instantiations
Responsibility
Instantiations
Timing
Instantiations
Inventory
Instantiations
Motivation
Instantiations
List: Timing Types
Business Interval
Business Moment
List: Responsibility Types
Business Role
Business Work Product
List: Distribution Types
Business Location
Business Connection
List: Process Types
Business Transform
Business Input/Output
System Transform
System Input /Output
System Location
System Connection
System Role
System Work Product
System Interval
System Moment
Technology Transform
Technology Input /Output
Technology Location
Technology Connection
Technology Role
Technology Work Product
Technology Interval
Technology Moment
Tool Transform
Tool Input /Output
Tool Location
Tool Connection
Tool Role
Tool Work Product
Tool Interval
Tool Moment
List: Inventory Types
Business Entity
Business Relationship
System Entity
System Relationship
Technology Entity
Technology Relationship
Tool Entity
Tool Relationship
List: Motivation Types
Business End
Business Means
System End
System Means
Technology End
Technology Means
Tool End
Tool Means
Timing	
 IdentificationResponsibility	
 IdentificationDistribution	
 IdentificationProcess	
 Identification
Timing	
 DefinitionResponsibility	
 DefinitionDistribution	
 DefinitionProcess	
 Definition
Process	
 Representation Distribution	
 Representation Responsibility	
 Representation Timing	
 Representation
Process	
 Specification Distribution	
 Specification Responsibility	
 Specification Timing	
 Specification
Inventory	
 Identification
Inventory	
 Definition
Inventory	
 Representation
Inventory	
 Specification
Inventory	
 Configuration Process	
 Configuration Distribution	
 Configuration Responsibility	
 Configuration Timing	
 Configuration
Motivation	
 Identification
Motivation	
 Definition
Motivation	
 Representation
Motivation	
 Specification
Motivation	
 Configuration
!

!

 !

!

Copyright 2013 by Data Blueprint
30
Organizational Needs
become instantiated
and integrated into an
Data/Information
Architecture
Informa(on)System)
Requirements
authorizes and
articulates
satisfyspecificorganizationalneeds
Data Architectures produce and are made up of information models that are
developed in response to organizational needs
Copyright 2013 by Data Blueprint
Data Architecture – Better Definition
31
• All organizations have information
architectures
– Some are better understood and
documented (and therefore more
useful to the organization) than
others.
• Common vocabulary expressing
integrated requirements ensuring
that data assets are stored,
arranged, managed, and used in
systems in support of
organizational strategy [Aiken 2010]
Copyright 2013 by Data Blueprint
32
Data
Development
From The DAMA Guide to the Data Management Body of Knowledge © 2009 by DAMA International
Copyright 2013 by Data Blueprint
Data Modeling/Data Development
33
From The DAMA Guide to the Data Management Body of Knowledge © 2009 by DAMA International
Copyright 2013 by Data Blueprint
34
#dataed
Program F
Program E
Program D
Program G
Program H
Program I
Application
domain 2Application
domain 3
Data Development Focus
Copyright 2013 by Data Blueprint
35
#dataed
Data Development has greater Business Value
Copyright 2013 by Data Blueprint
36
Conceptual Logical Physical
Validated
Not Validated
Every change can
be mapped to a
transformation in
this framework!
Data Development Evolution Framework
Copyright 2013 by Data Blueprint
37
Data
Quality
Management
From The DAMA Guide to the Data Management Body of Knowledge © 2009 by DAMA International
Copyright 2013 by Data Blueprint
Data Quality Engineering
38
From The DAMA Guide to the Data Management Body of Knowledge © 2009 by DAMA International
Copyright 2013 by Data Blueprint
Definitions
• Quality Data
– Fit for use meets the requirements of its authors, users,
and administrators (adapted from Martin Eppler)
– Synonymous with information quality, since poor data quality
results in inaccurate information and poor business performance
• Data Quality Management
– Planning, implementation and control activities that apply quality
management techniques to measure, assess, improve, and
ensure data quality
– Entails the "establishment and deployment of roles, responsibilities
concerning the acquisition, maintenance, dissemination, and
disposition of data" http://www2.sas.com/proceedings/sugi29/098-29.pdf
✓ Critical supporting process from change management
✓ Continuous process for defining acceptable levels of data quality to meet business
needs and for ensuring that data quality meets these levels
• Data Quality Engineering
– Recognition that data quality solutions cannot not managed but must be engineered
– Engineering is the application of scientific, economic, social, and practical knowledge in
order to design, build, and maintain solutions to data quality challenges
– Engineering concepts are generally not known and understood within IT or business!
39
Spinach/Popeye story from http://it.toolbox.com/blogs/infosphere/spinach-how-a-data-quality-mistake-created-a-myth-and-a-cartoon-character-10166
Copyright 2013 by Data Blueprint
Quality Dimensions
40
Copyright 2013 by Data Blueprint
Starting
point
for new
system
development
data performance metadata
data architecture
data
architecture and
data models
shared data updated data
corrected
data
architecture
refinements
facts &
meanings
Metadata &
Data Storage
Starting point
for existing
systems
Metadata Refinement
• Correct Structural Defects
• Update Implementation
Metadata Creation
• Define Data Architecture
• Define Data Model Structures
Metadata Structuring
• Implement Data Model Views
• Populate Data Model Views
Data Refinement
• Correct Data Value Defects
• Re-store Data Values
Data Manipulation
• Manipulate Data
• Updata Data
Data Utilization
• Inspect Data
• Present Data
Data Creation
• Create Data
• Verify Data Values
Data Assessment
• Assess Data Values
• Assess Metadata
Extended data life cycle model with metadata sources and uses
41
Copyright 2013 by Data Blueprint
DQE Context & Engineering Concepts
• Can rules be implemented stating that no data can be
corrected unless the source of the error has been
discovered and addressed?
• All data must
be 100%
perfect?
• Pareto
– 80/20 rule
– Not all data
is of equal
Importance
• Scientific,
economic,
social, and
practical
knowledge
42
Copyright 2013 by Data Blueprint
1. Data Management: Contextual Overview
2. Necessary Data Management Functions
(Prerequisites)
- Data Governance
- Data Architecture
- Data Development
- Data Quality
3. Understanding Cloud-based
Technologies
4. Cloud-based Benefits
5. Cloud-based Integration
- Cleaner
- Smaller
- Shareable
6. Take Aways, References and Q&A
Tweeting now:
#dataed
Outline: Cloud-based Integration
43
Copyright 2013 by Data Blueprint
1. Data Management: Contextual Overview
2. Necessary Data Management Functions
(Prerequisites)
- Data Governance
- Data Architecture
- Data Development
- Data Quality
3. Understanding Cloud-based
Technologies
4. Cloud-based Benefits
5. Cloud-based Integration
- Cleaner
- Smaller
- Shareable
6. Take Aways, References and Q&A
Tweeting now:
#dataed
Outline: Cloud-based Integration
44
Copyright 2013 by Data Blueprint
http://visual.ly/amazing-journey-data-cloud
45
Copyright 2013 by Data Blueprint
http://visual.ly/amazing-journey-data-cloud
46
Copyright 2013 by Data Blueprint
http://visual.ly/amazing-journey-data-cloud
47
Copyright 2013 by Data Blueprint
Gartner Five-phase Hype Cycle
http://www.gartner.com/technology/research/methodologies/hype-cycle.jsp
48
Technology Trigger: A potential technology breakthrough kicks things off. Early proof-of-concept stories and media interest
trigger significant publicity. Often no usable products exist and commercial viability is unproven.
Trough of Disillusionment: Interest wanes as experiments and implementations fail to deliver. Producers of the
technology shake out or fail. Investments continue only if the surviving providers improve their products to the
satisfaction of early adopters.
Peak of Inflated Expectations: Early publicity produces a number of
success stories—often accompanied by scores of failures. Some
companies take action; many do not.
Slope of Enlightenment: More instances of how the technology can benefit the
enterprise start to crystallize and become more widely understood. Second- and third-
generation products appear from technology providers. More enterprises fund pilots;
conservative companies remain cautious.
Plateau of Productivity: Mainstream adoption starts to
take off. Criteria for assessing provider viability are more
clearly defined. The technology’s broad market
applicability and relevance are clearly paying off.
Copyright 2013 by Data Blueprint
Gartner Cloud Hype Cycle “While clearly
maturing, cloud
computing
continues to be the
most hyped subject
in IT today.”
49
Copyright 2013 by Data Blueprint
50
• Cloud computing is location-independent
computing, whereby shared servers provide
resources, software, and data to computers
and other devices on demand, as with the
electricity grid.
• Cloud computing is a natural evolution of the
widespread adoption of virtualization, service-
oriented architecture and utility computing.
• Details are abstracted from consumers, who no
longer have need for expertise in, or control over,
the technology infrastructure "in the cloud" that
supports them.
Cloud Computing
Copyright 2013 by Data Blueprint
Five Essential Characteristics of Data Cloud Infrastructure
• Gartner defines "cloud computing" as the set of disciplines,
technologies, and business models used to deliver IT
capabilities (software, platforms, hardware) as an on-
demand, scalable, elastic service.
• Five essential characteristics of cloud computing:
– It uses shared infrastructure
– It provides on-demand
self-service
– It is elastic and scalable
– It is priced by consumption
– It is dynamic and virtualized
51
Copyright 2013 by Data Blueprint
52
Cloud Scalability
Copyright 2013 by Data Blueprint
Cloud Rendering
53
Copyright 2013 by Data Blueprint
Cisco's Ladder to the Cloud
54
Copyright 2013 by Data Blueprint
Cloud Options
55
Copyright 2013 by Data Blueprint
Solving the Big Data Puzzle
56
http://damfoundation.org/2012/06/whats-the-big-deal-about-big-data/
Copyright 2013 by Data Blueprint
1. Data Management: Contextual Overview
2. Necessary Data Management Functions
(Prerequisites)
- Data Governance
- Data Architecture
- Data Development
- Data Quality
3. Understanding Cloud-based
Technologies
4. Cloud-based Benefits
5. Cloud-based Integration
- Cleaner
- Smaller
- Shareable
6. Take Aways, References and Q&A
Tweeting now:
#dataed
Outline
57
Copyright 2013 by Data Blueprint
1. Data Management: Contextual Overview
2. Necessary Data Management Functions
(Prerequisites)
- Data Governance
- Data Architecture
- Data Development
- Data Quality
3. Understanding Cloud-based
Technologies
4. Cloud-based Benefits
5. Cloud-based Integration
- Cleaner
- Smaller
- Shareable
6. Take Aways, References and Q&A
Tweeting now:
#dataed
Outline
58
Copyright 2013 by Data Blueprint
Benefits
59
Copyright 2013 by Data Blueprint
Benefits
60
Copyright 2013 by Data Blueprint
Anticipated Benefits
61
0% 13% 25% 38% 50%
Improve data quality
Reduce installation and maintenance efforts
Reduce implementation efforts
Eliminate manual processes
Reduce time require to collect and prepare data
Apply data governance policies
Copyright 2013 by Data Blueprint
Similar Opportunity
• IT Infrastructure. Your submission should include funding for the timely execution of agency plans
to consolidate data centers developed in FY 2010 (reference FY 2011 passback guidance). In
coordination with the data center consolidations, agencies should evaluate the potential to adopt
cloud computing solutions by analyzing computing alternatives for IT investments in FY 2012.
Agencies will be expected to adopt cloud computing solutions where they represent the best value at
an acceptable level of risk.
• Adopt Light Technologies and
Shared Solutions. We are reducing
our data center footprint by 40
percent by 2015 and shifting the
agency default approach to IT to a
cloud-first policy as part of the 2012
budget process. Consolidating more
than 2,000 government data centers
will save money, increase security
and improve performance.
62
Copyright 2013 by Data Blueprint
1. Data Management: Contextual Overview
2. Necessary Data Management Functions
(Prerequisites)
- Data Governance
- Data Architecture
- Data Development
- Data Quality
3. Understanding Cloud-based
Technologies
4. Cloud-based Benefits
5. Cloud-based Integration
- Cleaner
- Smaller
- Shareable
6. Take Aways, References and Q&A
Tweeting now:
#dataed
Outline: Cloud-based Integration
63
Copyright 2013 by Data Blueprint
1. Data Management: Contextual Overview
2. Necessary Data Management Functions
(Prerequisites)
- Data Governance
- Data Architecture
- Data Development
- Data Quality
3. Understanding Cloud-based
Technologies
4. Cloud-based Benefits
5. Cloud-based Integration
- Cleaner
- Smaller
- Shareable
6. Take Aways, References and Q&A
Tweeting now:
#dataed
Outline: Cloud-based Integration
64
Copyright 2013 by Data Blueprint
Data in the cloud should have three attributes that
data outside the cloud should not have. It should be:
65
Sharable-er
Cleaner
Smaller
Copyright 2013 by Data Blueprint
Aspirational Data in the Cloud
66
Copyright 2013 by Data Blueprint
Effective Cloud Transformation
• Transformation into cloud computing cannot be
done in a manner that benefits organizations
unless data is re-architected – formally with two goals:
– Maximizing effective, organization-wide data sharing; and
– Minimizing organizational data ROT.
• Resulting data volume reduction should be 1/5 what is currently is
– A significant economic motivator.
• All existing organizations have data collections that possess
unique strengths and weaknesses
– Strengths that should be leveraged
– Weaknesses must be addressed
• Neither of these can be accomplished without formal data
rearchitecting prior to cloud loading.
• There are very few who work in the area for a living but my team
has achieved some remarkable successes.
67
Copyright 2013 by Data Blueprint
Transform
68
Problems with forklifting
1. no basis for
decisions made
2. no inclusion of
architecture/
engineering concepts
3. no idea that these
concepts are missing
from the process
Less
Cleaner
More shareable
... data
Getting into the Cloud
Copyright 2013 by Data Blueprint
Data Leverage
• Permits organizations to better manage their sole non-depleteable, non-
degrading, durable, strategic asset - data
– within the organization, and
– with organizational data exchange partners
• Leverage
– Obtained by implementation of data-centric technologies, processes, and human skill
sets
– Increased by elimination of data ROT (redundant, obsolete, or trivial)
• The bigger the organization, the greater potential leverage exists
• Treating data more asset-like simultaneously
1. lowers organizational IT costs and
2. increases organizational knowledge worker productivity
69
Less ROT
Technologies
Process
People
Copyright 2013 by Data Blueprint
The Cloud as a Data Quality Tool
Enterprise Portal
Data DeliveryData Analysis
Quality
Technology
Continuous Improvement
Data Baselining
Statistical Data Control
Cost of Quality Model
Empowerment
Data Reduction
Pattern Analysis
Mathematical Analysis
Schema Validation
Reusability
Logic & Logic Programming
Relational DB Technology
Data Migration Technologies
Statistical Programming Languages
70
Copyright 2013 by Data Blueprint
Fixing Data in the Cloud Using A Glovebox
71
Copyright 2013 by Data Blueprint
72
Conceptual Logical Physical
Validated
Not Validated
Every change can
be mapped to a
transformation in
this framework!
Data Development Evolution Framework

More Related Content

What's hot

Data-Ed Webinar: Data Governance Strategies
Data-Ed Webinar: Data Governance StrategiesData-Ed Webinar: Data Governance Strategies
Data-Ed Webinar: Data Governance StrategiesDATAVERSITY
 
Data-Ed Online Presents: Data Warehouse Strategies
Data-Ed Online Presents: Data Warehouse StrategiesData-Ed Online Presents: Data Warehouse Strategies
Data-Ed Online Presents: Data Warehouse StrategiesDATAVERSITY
 
DataEd Online: Unlock Business Value through Data Governance
DataEd Online: Unlock Business Value through Data GovernanceDataEd Online: Unlock Business Value through Data Governance
DataEd Online: Unlock Business Value through Data GovernanceDATAVERSITY
 
Data-Ed Online: Unlock Business Value through Document & Content Management
Data-Ed Online: Unlock Business Value through Document & Content ManagementData-Ed Online: Unlock Business Value through Document & Content Management
Data-Ed Online: Unlock Business Value through Document & Content ManagementDATAVERSITY
 
Data-Ed Online: Unlock Business Value through Reference & MDM
Data-Ed Online: Unlock Business Value through Reference & MDMData-Ed Online: Unlock Business Value through Reference & MDM
Data-Ed Online: Unlock Business Value through Reference & MDMDATAVERSITY
 
Estimating the Total Costs of Your Cloud Analytics Platform
Estimating the Total Costs of Your Cloud Analytics PlatformEstimating the Total Costs of Your Cloud Analytics Platform
Estimating the Total Costs of Your Cloud Analytics PlatformDATAVERSITY
 
Reference master data management
Reference master data managementReference master data management
Reference master data managementDr. Hamdan Al-Sabri
 
Building an Effective Data & Analytics Operating Model A Data Modernization G...
Building an Effective Data & Analytics Operating Model A Data Modernization G...Building an Effective Data & Analytics Operating Model A Data Modernization G...
Building an Effective Data & Analytics Operating Model A Data Modernization G...Mark Hewitt
 
Information & Data Architecture
Information & Data ArchitectureInformation & Data Architecture
Information & Data ArchitectureSammer Qader
 
Data-Ed Webinar: Data Modeling Fundamentals
Data-Ed Webinar: Data Modeling FundamentalsData-Ed Webinar: Data Modeling Fundamentals
Data-Ed Webinar: Data Modeling FundamentalsDATAVERSITY
 
Emerging Trends in Data Architecture – What’s the Next Big Thing?
Emerging Trends in Data Architecture – What’s the Next Big Thing?Emerging Trends in Data Architecture – What’s the Next Big Thing?
Emerging Trends in Data Architecture – What’s the Next Big Thing?DATAVERSITY
 
Using Data Platforms That Are Fit-For-Purpose
Using Data Platforms That Are Fit-For-PurposeUsing Data Platforms That Are Fit-For-Purpose
Using Data Platforms That Are Fit-For-PurposeDATAVERSITY
 
Building the enterprise data architecture
Building the enterprise data architectureBuilding the enterprise data architecture
Building the enterprise data architectureCosta Pissaris
 
ADV Slides: Comparing the Enterprise Analytic Solutions
ADV Slides: Comparing the Enterprise Analytic SolutionsADV Slides: Comparing the Enterprise Analytic Solutions
ADV Slides: Comparing the Enterprise Analytic SolutionsDATAVERSITY
 
Advanced Analytics: Analytic Platforms Should Be Columnar Orientation
Advanced Analytics: Analytic Platforms Should Be Columnar OrientationAdvanced Analytics: Analytic Platforms Should Be Columnar Orientation
Advanced Analytics: Analytic Platforms Should Be Columnar OrientationDATAVERSITY
 
DMBOK 2.0 and other frameworks including TOGAF & COBIT - keynote from DAMA Au...
DMBOK 2.0 and other frameworks including TOGAF & COBIT - keynote from DAMA Au...DMBOK 2.0 and other frameworks including TOGAF & COBIT - keynote from DAMA Au...
DMBOK 2.0 and other frameworks including TOGAF & COBIT - keynote from DAMA Au...Christopher Bradley
 

What's hot (19)

Data-Ed Webinar: Data Governance Strategies
Data-Ed Webinar: Data Governance StrategiesData-Ed Webinar: Data Governance Strategies
Data-Ed Webinar: Data Governance Strategies
 
Data-Ed Online Presents: Data Warehouse Strategies
Data-Ed Online Presents: Data Warehouse StrategiesData-Ed Online Presents: Data Warehouse Strategies
Data-Ed Online Presents: Data Warehouse Strategies
 
DataEd Online: Unlock Business Value through Data Governance
DataEd Online: Unlock Business Value through Data GovernanceDataEd Online: Unlock Business Value through Data Governance
DataEd Online: Unlock Business Value through Data Governance
 
Enterprise Data Management
Enterprise Data ManagementEnterprise Data Management
Enterprise Data Management
 
Data-Ed Online: Unlock Business Value through Document & Content Management
Data-Ed Online: Unlock Business Value through Document & Content ManagementData-Ed Online: Unlock Business Value through Document & Content Management
Data-Ed Online: Unlock Business Value through Document & Content Management
 
Data-Ed Online: Unlock Business Value through Reference & MDM
Data-Ed Online: Unlock Business Value through Reference & MDMData-Ed Online: Unlock Business Value through Reference & MDM
Data-Ed Online: Unlock Business Value through Reference & MDM
 
Estimating the Total Costs of Your Cloud Analytics Platform
Estimating the Total Costs of Your Cloud Analytics PlatformEstimating the Total Costs of Your Cloud Analytics Platform
Estimating the Total Costs of Your Cloud Analytics Platform
 
Reference master data management
Reference master data managementReference master data management
Reference master data management
 
Master Data Management
Master Data ManagementMaster Data Management
Master Data Management
 
Building an Effective Data & Analytics Operating Model A Data Modernization G...
Building an Effective Data & Analytics Operating Model A Data Modernization G...Building an Effective Data & Analytics Operating Model A Data Modernization G...
Building an Effective Data & Analytics Operating Model A Data Modernization G...
 
Information & Data Architecture
Information & Data ArchitectureInformation & Data Architecture
Information & Data Architecture
 
Data-Ed Webinar: Data Modeling Fundamentals
Data-Ed Webinar: Data Modeling FundamentalsData-Ed Webinar: Data Modeling Fundamentals
Data-Ed Webinar: Data Modeling Fundamentals
 
Big Data Boom
Big Data BoomBig Data Boom
Big Data Boom
 
Emerging Trends in Data Architecture – What’s the Next Big Thing?
Emerging Trends in Data Architecture – What’s the Next Big Thing?Emerging Trends in Data Architecture – What’s the Next Big Thing?
Emerging Trends in Data Architecture – What’s the Next Big Thing?
 
Using Data Platforms That Are Fit-For-Purpose
Using Data Platforms That Are Fit-For-PurposeUsing Data Platforms That Are Fit-For-Purpose
Using Data Platforms That Are Fit-For-Purpose
 
Building the enterprise data architecture
Building the enterprise data architectureBuilding the enterprise data architecture
Building the enterprise data architecture
 
ADV Slides: Comparing the Enterprise Analytic Solutions
ADV Slides: Comparing the Enterprise Analytic SolutionsADV Slides: Comparing the Enterprise Analytic Solutions
ADV Slides: Comparing the Enterprise Analytic Solutions
 
Advanced Analytics: Analytic Platforms Should Be Columnar Orientation
Advanced Analytics: Analytic Platforms Should Be Columnar OrientationAdvanced Analytics: Analytic Platforms Should Be Columnar Orientation
Advanced Analytics: Analytic Platforms Should Be Columnar Orientation
 
DMBOK 2.0 and other frameworks including TOGAF & COBIT - keynote from DAMA Au...
DMBOK 2.0 and other frameworks including TOGAF & COBIT - keynote from DAMA Au...DMBOK 2.0 and other frameworks including TOGAF & COBIT - keynote from DAMA Au...
DMBOK 2.0 and other frameworks including TOGAF & COBIT - keynote from DAMA Au...
 

Viewers also liked

TFF2016, Michael Brandl, Die PSI Richtlinien und ihre Umsetzung in nationals ...
TFF2016, Michael Brandl, Die PSI Richtlinien und ihre Umsetzung in nationals ...TFF2016, Michael Brandl, Die PSI Richtlinien und ihre Umsetzung in nationals ...
TFF2016, Michael Brandl, Die PSI Richtlinien und ihre Umsetzung in nationals ...TourismFastForward
 
TFF2016, Rudi Studer, Smarte Dienstleistungen mit semantischen Technologien
TFF2016, Rudi Studer, Smarte Dienstleistungen mit semantischen TechnologienTFF2016, Rudi Studer, Smarte Dienstleistungen mit semantischen Technologien
TFF2016, Rudi Studer, Smarte Dienstleistungen mit semantischen TechnologienTourismFastForward
 
TFF2016, Elias Kärle, Schema.org im Tourismus
TFF2016, Elias Kärle, Schema.org im TourismusTFF2016, Elias Kärle, Schema.org im Tourismus
TFF2016, Elias Kärle, Schema.org im TourismusTourismFastForward
 
TFF2016, Christian Klingler, PSI-Richtlinie am praktischen Beispiel touuristi...
TFF2016, Christian Klingler, PSI-Richtlinie am praktischen Beispiel touuristi...TFF2016, Christian Klingler, PSI-Richtlinie am praktischen Beispiel touuristi...
TFF2016, Christian Klingler, PSI-Richtlinie am praktischen Beispiel touuristi...TourismFastForward
 
TFF2016, Johann Reisinger, Einsatz eines multimedialen Tourismus Datenmanagem...
TFF2016, Johann Reisinger, Einsatz eines multimedialen Tourismus Datenmanagem...TFF2016, Johann Reisinger, Einsatz eines multimedialen Tourismus Datenmanagem...
TFF2016, Johann Reisinger, Einsatz eines multimedialen Tourismus Datenmanagem...TourismFastForward
 
TFF2016, Hendrik Maat, easybooking
TFF2016, Hendrik Maat, easybookingTFF2016, Hendrik Maat, easybooking
TFF2016, Hendrik Maat, easybookingTourismFastForward
 
TFF2016, Reinhard Lanner, The end of advertisment as we know it
TFF2016, Reinhard Lanner, The end of advertisment as we know itTFF2016, Reinhard Lanner, The end of advertisment as we know it
TFF2016, Reinhard Lanner, The end of advertisment as we know itTourismFastForward
 
Mobility in der Customer Journey
Mobility in der Customer JourneyMobility in der Customer Journey
Mobility in der Customer JourneyTourismFastForward
 
TFF2016, Michael Steingress, Die digitale Customer Journey
TFF2016, Michael Steingress, Die digitale Customer JourneyTFF2016, Michael Steingress, Die digitale Customer Journey
TFF2016, Michael Steingress, Die digitale Customer JourneyTourismFastForward
 
TFF2016, Roland Schegg, Online Hotelvetrieb
TFF2016, Roland Schegg, Online HotelvetriebTFF2016, Roland Schegg, Online Hotelvetrieb
TFF2016, Roland Schegg, Online HotelvetriebTourismFastForward
 
TFF2016, Alexander Schwarz, Airbnb
TFF2016, Alexander Schwarz, AirbnbTFF2016, Alexander Schwarz, Airbnb
TFF2016, Alexander Schwarz, AirbnbTourismFastForward
 

Viewers also liked (12)

TFF2016, Michael Brandl, Die PSI Richtlinien und ihre Umsetzung in nationals ...
TFF2016, Michael Brandl, Die PSI Richtlinien und ihre Umsetzung in nationals ...TFF2016, Michael Brandl, Die PSI Richtlinien und ihre Umsetzung in nationals ...
TFF2016, Michael Brandl, Die PSI Richtlinien und ihre Umsetzung in nationals ...
 
TFF2016, Rudi Studer, Smarte Dienstleistungen mit semantischen Technologien
TFF2016, Rudi Studer, Smarte Dienstleistungen mit semantischen TechnologienTFF2016, Rudi Studer, Smarte Dienstleistungen mit semantischen Technologien
TFF2016, Rudi Studer, Smarte Dienstleistungen mit semantischen Technologien
 
TFF2016, Elias Kärle, Schema.org im Tourismus
TFF2016, Elias Kärle, Schema.org im TourismusTFF2016, Elias Kärle, Schema.org im Tourismus
TFF2016, Elias Kärle, Schema.org im Tourismus
 
TFF2016, Christian Klingler, PSI-Richtlinie am praktischen Beispiel touuristi...
TFF2016, Christian Klingler, PSI-Richtlinie am praktischen Beispiel touuristi...TFF2016, Christian Klingler, PSI-Richtlinie am praktischen Beispiel touuristi...
TFF2016, Christian Klingler, PSI-Richtlinie am praktischen Beispiel touuristi...
 
TFF2016, Johann Reisinger, Einsatz eines multimedialen Tourismus Datenmanagem...
TFF2016, Johann Reisinger, Einsatz eines multimedialen Tourismus Datenmanagem...TFF2016, Johann Reisinger, Einsatz eines multimedialen Tourismus Datenmanagem...
TFF2016, Johann Reisinger, Einsatz eines multimedialen Tourismus Datenmanagem...
 
TFF2016, Hendrik Maat, easybooking
TFF2016, Hendrik Maat, easybookingTFF2016, Hendrik Maat, easybooking
TFF2016, Hendrik Maat, easybooking
 
TFF2016, Reinhard Lanner, The end of advertisment as we know it
TFF2016, Reinhard Lanner, The end of advertisment as we know itTFF2016, Reinhard Lanner, The end of advertisment as we know it
TFF2016, Reinhard Lanner, The end of advertisment as we know it
 
Mobility in der Customer Journey
Mobility in der Customer JourneyMobility in der Customer Journey
Mobility in der Customer Journey
 
TFF2016, Michael Steingress, Die digitale Customer Journey
TFF2016, Michael Steingress, Die digitale Customer JourneyTFF2016, Michael Steingress, Die digitale Customer Journey
TFF2016, Michael Steingress, Die digitale Customer Journey
 
AirBnB
AirBnBAirBnB
AirBnB
 
TFF2016, Roland Schegg, Online Hotelvetrieb
TFF2016, Roland Schegg, Online HotelvetriebTFF2016, Roland Schegg, Online Hotelvetrieb
TFF2016, Roland Schegg, Online Hotelvetrieb
 
TFF2016, Alexander Schwarz, Airbnb
TFF2016, Alexander Schwarz, AirbnbTFF2016, Alexander Schwarz, Airbnb
TFF2016, Alexander Schwarz, Airbnb
 

Similar to Data Systems Integration & Business Value Pt. 2: Cloud

Data Systems Integration & Business Value PT. 3: Warehousing
Data Systems Integration & Business Value PT. 3: Warehousing Data Systems Integration & Business Value PT. 3: Warehousing
Data Systems Integration & Business Value PT. 3: Warehousing Data Blueprint
 
Data-Ed: Data Systems Integration & Business Value Pt. 3: Warehousing
Data-Ed: Data Systems Integration & Business Value Pt. 3: WarehousingData-Ed: Data Systems Integration & Business Value Pt. 3: Warehousing
Data-Ed: Data Systems Integration & Business Value Pt. 3: WarehousingDATAVERSITY
 
Data-Ed: Data Systems Integration & Business Value PT. 1: Metadata
Data-Ed: Data Systems Integration & Business Value PT. 1: MetadataData-Ed: Data Systems Integration & Business Value PT. 1: Metadata
Data-Ed: Data Systems Integration & Business Value PT. 1: MetadataData Blueprint
 
Data-Ed Webinar: Design & Manage Data Structures
Data-Ed Webinar: Design & Manage Data Structures Data-Ed Webinar: Design & Manage Data Structures
Data-Ed Webinar: Design & Manage Data Structures DATAVERSITY
 
Data-Ed: Design and Manage Data Structures
Data-Ed: Design and Manage Data Structures Data-Ed: Design and Manage Data Structures
Data-Ed: Design and Manage Data Structures Data Blueprint
 
Data-Ed Webinar: Data Quality Engineering
Data-Ed Webinar: Data Quality EngineeringData-Ed Webinar: Data Quality Engineering
Data-Ed Webinar: Data Quality EngineeringDATAVERSITY
 
Ensuring Data Quality and Lineage in Cloud Migration - Dan Power
Ensuring Data Quality and Lineage in Cloud Migration - Dan PowerEnsuring Data Quality and Lineage in Cloud Migration - Dan Power
Ensuring Data Quality and Lineage in Cloud Migration - Dan PowerMolly Alexander
 
Trends in Enterprise Advanced Analytics
Trends in Enterprise Advanced AnalyticsTrends in Enterprise Advanced Analytics
Trends in Enterprise Advanced AnalyticsDATAVERSITY
 
Get ahead of the cloud or get left behind
Get ahead of the cloud or get left behindGet ahead of the cloud or get left behind
Get ahead of the cloud or get left behindMatt Mandich
 
Data-Ed: Business Value From MDM
Data-Ed: Business Value From MDM Data-Ed: Business Value From MDM
Data-Ed: Business Value From MDM Data Blueprint
 
Data Management Trends 2022_Shailendra Mruthyunjayappa.pdf
Data Management Trends 2022_Shailendra Mruthyunjayappa.pdfData Management Trends 2022_Shailendra Mruthyunjayappa.pdf
Data Management Trends 2022_Shailendra Mruthyunjayappa.pdfShailendra Mruthyunjayappa
 
Data-Ed Webinar: Data Architecture Requirements
Data-Ed Webinar: Data Architecture RequirementsData-Ed Webinar: Data Architecture Requirements
Data-Ed Webinar: Data Architecture RequirementsDATAVERSITY
 
Data-Ed: Data Architecture Requirements
Data-Ed: Data Architecture Requirements  Data-Ed: Data Architecture Requirements
Data-Ed: Data Architecture Requirements Data Blueprint
 
Data-Ed: Data Architecture Requirements
Data-Ed: Data Architecture Requirements Data-Ed: Data Architecture Requirements
Data-Ed: Data Architecture Requirements Data Blueprint
 
Data-Ed Online: Data Architecture Requirements
Data-Ed Online: Data Architecture RequirementsData-Ed Online: Data Architecture Requirements
Data-Ed Online: Data Architecture RequirementsDATAVERSITY
 
Cloud Data Management: The Future of Data Storage and Management
Cloud Data Management: The Future of Data Storage and ManagementCloud Data Management: The Future of Data Storage and Management
Cloud Data Management: The Future of Data Storage and ManagementFredReynolds2
 
Data Architecture Strategies
Data Architecture StrategiesData Architecture Strategies
Data Architecture StrategiesDATAVERSITY
 
DAS Slides: Cloud-Based Data Warehousing – What’s New and What Stays the Same
DAS Slides: Cloud-Based Data Warehousing – What’s New and What Stays the SameDAS Slides: Cloud-Based Data Warehousing – What’s New and What Stays the Same
DAS Slides: Cloud-Based Data Warehousing – What’s New and What Stays the SameDATAVERSITY
 
Data Governance Maturity Model
Data Governance Maturity ModelData Governance Maturity Model
Data Governance Maturity ModelBasuki Rahmad
 

Similar to Data Systems Integration & Business Value Pt. 2: Cloud (20)

Data Systems Integration & Business Value PT. 3: Warehousing
Data Systems Integration & Business Value PT. 3: Warehousing Data Systems Integration & Business Value PT. 3: Warehousing
Data Systems Integration & Business Value PT. 3: Warehousing
 
Data-Ed: Data Systems Integration & Business Value Pt. 3: Warehousing
Data-Ed: Data Systems Integration & Business Value Pt. 3: WarehousingData-Ed: Data Systems Integration & Business Value Pt. 3: Warehousing
Data-Ed: Data Systems Integration & Business Value Pt. 3: Warehousing
 
Data-Ed: Data Systems Integration & Business Value PT. 1: Metadata
Data-Ed: Data Systems Integration & Business Value PT. 1: MetadataData-Ed: Data Systems Integration & Business Value PT. 1: Metadata
Data-Ed: Data Systems Integration & Business Value PT. 1: Metadata
 
Data-Ed Webinar: Design & Manage Data Structures
Data-Ed Webinar: Design & Manage Data Structures Data-Ed Webinar: Design & Manage Data Structures
Data-Ed Webinar: Design & Manage Data Structures
 
Data-Ed: Design and Manage Data Structures
Data-Ed: Design and Manage Data Structures Data-Ed: Design and Manage Data Structures
Data-Ed: Design and Manage Data Structures
 
Data-Ed Webinar: Data Quality Engineering
Data-Ed Webinar: Data Quality EngineeringData-Ed Webinar: Data Quality Engineering
Data-Ed Webinar: Data Quality Engineering
 
2014 dqe handouts
2014 dqe handouts2014 dqe handouts
2014 dqe handouts
 
Ensuring Data Quality and Lineage in Cloud Migration - Dan Power
Ensuring Data Quality and Lineage in Cloud Migration - Dan PowerEnsuring Data Quality and Lineage in Cloud Migration - Dan Power
Ensuring Data Quality and Lineage in Cloud Migration - Dan Power
 
Trends in Enterprise Advanced Analytics
Trends in Enterprise Advanced AnalyticsTrends in Enterprise Advanced Analytics
Trends in Enterprise Advanced Analytics
 
Get ahead of the cloud or get left behind
Get ahead of the cloud or get left behindGet ahead of the cloud or get left behind
Get ahead of the cloud or get left behind
 
Data-Ed: Business Value From MDM
Data-Ed: Business Value From MDM Data-Ed: Business Value From MDM
Data-Ed: Business Value From MDM
 
Data Management Trends 2022_Shailendra Mruthyunjayappa.pdf
Data Management Trends 2022_Shailendra Mruthyunjayappa.pdfData Management Trends 2022_Shailendra Mruthyunjayappa.pdf
Data Management Trends 2022_Shailendra Mruthyunjayappa.pdf
 
Data-Ed Webinar: Data Architecture Requirements
Data-Ed Webinar: Data Architecture RequirementsData-Ed Webinar: Data Architecture Requirements
Data-Ed Webinar: Data Architecture Requirements
 
Data-Ed: Data Architecture Requirements
Data-Ed: Data Architecture Requirements  Data-Ed: Data Architecture Requirements
Data-Ed: Data Architecture Requirements
 
Data-Ed: Data Architecture Requirements
Data-Ed: Data Architecture Requirements Data-Ed: Data Architecture Requirements
Data-Ed: Data Architecture Requirements
 
Data-Ed Online: Data Architecture Requirements
Data-Ed Online: Data Architecture RequirementsData-Ed Online: Data Architecture Requirements
Data-Ed Online: Data Architecture Requirements
 
Cloud Data Management: The Future of Data Storage and Management
Cloud Data Management: The Future of Data Storage and ManagementCloud Data Management: The Future of Data Storage and Management
Cloud Data Management: The Future of Data Storage and Management
 
Data Architecture Strategies
Data Architecture StrategiesData Architecture Strategies
Data Architecture Strategies
 
DAS Slides: Cloud-Based Data Warehousing – What’s New and What Stays the Same
DAS Slides: Cloud-Based Data Warehousing – What’s New and What Stays the SameDAS Slides: Cloud-Based Data Warehousing – What’s New and What Stays the Same
DAS Slides: Cloud-Based Data Warehousing – What’s New and What Stays the Same
 
Data Governance Maturity Model
Data Governance Maturity ModelData Governance Maturity Model
Data Governance Maturity Model
 

More from DATAVERSITY

Architecture, Products, and Total Cost of Ownership of the Leading Machine Le...
Architecture, Products, and Total Cost of Ownership of the Leading Machine Le...Architecture, Products, and Total Cost of Ownership of the Leading Machine Le...
Architecture, Products, and Total Cost of Ownership of the Leading Machine Le...DATAVERSITY
 
Data at the Speed of Business with Data Mastering and Governance
Data at the Speed of Business with Data Mastering and GovernanceData at the Speed of Business with Data Mastering and Governance
Data at the Speed of Business with Data Mastering and GovernanceDATAVERSITY
 
Exploring Levels of Data Literacy
Exploring Levels of Data LiteracyExploring Levels of Data Literacy
Exploring Levels of Data LiteracyDATAVERSITY
 
Building a Data Strategy – Practical Steps for Aligning with Business Goals
Building a Data Strategy – Practical Steps for Aligning with Business GoalsBuilding a Data Strategy – Practical Steps for Aligning with Business Goals
Building a Data Strategy – Practical Steps for Aligning with Business GoalsDATAVERSITY
 
Make Data Work for You
Make Data Work for YouMake Data Work for You
Make Data Work for YouDATAVERSITY
 
Data Catalogs Are the Answer – What is the Question?
Data Catalogs Are the Answer – What is the Question?Data Catalogs Are the Answer – What is the Question?
Data Catalogs Are the Answer – What is the Question?DATAVERSITY
 
Data Catalogs Are the Answer – What Is the Question?
Data Catalogs Are the Answer – What Is the Question?Data Catalogs Are the Answer – What Is the Question?
Data Catalogs Are the Answer – What Is the Question?DATAVERSITY
 
Data Modeling Fundamentals
Data Modeling FundamentalsData Modeling Fundamentals
Data Modeling FundamentalsDATAVERSITY
 
Showing ROI for Your Analytic Project
Showing ROI for Your Analytic ProjectShowing ROI for Your Analytic Project
Showing ROI for Your Analytic ProjectDATAVERSITY
 
How a Semantic Layer Makes Data Mesh Work at Scale
How a Semantic Layer Makes  Data Mesh Work at ScaleHow a Semantic Layer Makes  Data Mesh Work at Scale
How a Semantic Layer Makes Data Mesh Work at ScaleDATAVERSITY
 
Is Enterprise Data Literacy Possible?
Is Enterprise Data Literacy Possible?Is Enterprise Data Literacy Possible?
Is Enterprise Data Literacy Possible?DATAVERSITY
 
The Data Trifecta – Privacy, Security & Governance Race from Reactivity to Re...
The Data Trifecta – Privacy, Security & Governance Race from Reactivity to Re...The Data Trifecta – Privacy, Security & Governance Race from Reactivity to Re...
The Data Trifecta – Privacy, Security & Governance Race from Reactivity to Re...DATAVERSITY
 
Emerging Trends in Data Architecture – What’s the Next Big Thing?
Emerging Trends in Data Architecture – What’s the Next Big Thing?Emerging Trends in Data Architecture – What’s the Next Big Thing?
Emerging Trends in Data Architecture – What’s the Next Big Thing?DATAVERSITY
 
Data Governance Trends - A Look Backwards and Forwards
Data Governance Trends - A Look Backwards and ForwardsData Governance Trends - A Look Backwards and Forwards
Data Governance Trends - A Look Backwards and ForwardsDATAVERSITY
 
Data Governance Trends and Best Practices To Implement Today
Data Governance Trends and Best Practices To Implement TodayData Governance Trends and Best Practices To Implement Today
Data Governance Trends and Best Practices To Implement TodayDATAVERSITY
 
2023 Trends in Enterprise Analytics
2023 Trends in Enterprise Analytics2023 Trends in Enterprise Analytics
2023 Trends in Enterprise AnalyticsDATAVERSITY
 
Data Strategy Best Practices
Data Strategy Best PracticesData Strategy Best Practices
Data Strategy Best PracticesDATAVERSITY
 
Who Should Own Data Governance – IT or Business?
Who Should Own Data Governance – IT or Business?Who Should Own Data Governance – IT or Business?
Who Should Own Data Governance – IT or Business?DATAVERSITY
 
Data Management Best Practices
Data Management Best PracticesData Management Best Practices
Data Management Best PracticesDATAVERSITY
 
MLOps – Applying DevOps to Competitive Advantage
MLOps – Applying DevOps to Competitive AdvantageMLOps – Applying DevOps to Competitive Advantage
MLOps – Applying DevOps to Competitive AdvantageDATAVERSITY
 

More from DATAVERSITY (20)

Architecture, Products, and Total Cost of Ownership of the Leading Machine Le...
Architecture, Products, and Total Cost of Ownership of the Leading Machine Le...Architecture, Products, and Total Cost of Ownership of the Leading Machine Le...
Architecture, Products, and Total Cost of Ownership of the Leading Machine Le...
 
Data at the Speed of Business with Data Mastering and Governance
Data at the Speed of Business with Data Mastering and GovernanceData at the Speed of Business with Data Mastering and Governance
Data at the Speed of Business with Data Mastering and Governance
 
Exploring Levels of Data Literacy
Exploring Levels of Data LiteracyExploring Levels of Data Literacy
Exploring Levels of Data Literacy
 
Building a Data Strategy – Practical Steps for Aligning with Business Goals
Building a Data Strategy – Practical Steps for Aligning with Business GoalsBuilding a Data Strategy – Practical Steps for Aligning with Business Goals
Building a Data Strategy – Practical Steps for Aligning with Business Goals
 
Make Data Work for You
Make Data Work for YouMake Data Work for You
Make Data Work for You
 
Data Catalogs Are the Answer – What is the Question?
Data Catalogs Are the Answer – What is the Question?Data Catalogs Are the Answer – What is the Question?
Data Catalogs Are the Answer – What is the Question?
 
Data Catalogs Are the Answer – What Is the Question?
Data Catalogs Are the Answer – What Is the Question?Data Catalogs Are the Answer – What Is the Question?
Data Catalogs Are the Answer – What Is the Question?
 
Data Modeling Fundamentals
Data Modeling FundamentalsData Modeling Fundamentals
Data Modeling Fundamentals
 
Showing ROI for Your Analytic Project
Showing ROI for Your Analytic ProjectShowing ROI for Your Analytic Project
Showing ROI for Your Analytic Project
 
How a Semantic Layer Makes Data Mesh Work at Scale
How a Semantic Layer Makes  Data Mesh Work at ScaleHow a Semantic Layer Makes  Data Mesh Work at Scale
How a Semantic Layer Makes Data Mesh Work at Scale
 
Is Enterprise Data Literacy Possible?
Is Enterprise Data Literacy Possible?Is Enterprise Data Literacy Possible?
Is Enterprise Data Literacy Possible?
 
The Data Trifecta – Privacy, Security & Governance Race from Reactivity to Re...
The Data Trifecta – Privacy, Security & Governance Race from Reactivity to Re...The Data Trifecta – Privacy, Security & Governance Race from Reactivity to Re...
The Data Trifecta – Privacy, Security & Governance Race from Reactivity to Re...
 
Emerging Trends in Data Architecture – What’s the Next Big Thing?
Emerging Trends in Data Architecture – What’s the Next Big Thing?Emerging Trends in Data Architecture – What’s the Next Big Thing?
Emerging Trends in Data Architecture – What’s the Next Big Thing?
 
Data Governance Trends - A Look Backwards and Forwards
Data Governance Trends - A Look Backwards and ForwardsData Governance Trends - A Look Backwards and Forwards
Data Governance Trends - A Look Backwards and Forwards
 
Data Governance Trends and Best Practices To Implement Today
Data Governance Trends and Best Practices To Implement TodayData Governance Trends and Best Practices To Implement Today
Data Governance Trends and Best Practices To Implement Today
 
2023 Trends in Enterprise Analytics
2023 Trends in Enterprise Analytics2023 Trends in Enterprise Analytics
2023 Trends in Enterprise Analytics
 
Data Strategy Best Practices
Data Strategy Best PracticesData Strategy Best Practices
Data Strategy Best Practices
 
Who Should Own Data Governance – IT or Business?
Who Should Own Data Governance – IT or Business?Who Should Own Data Governance – IT or Business?
Who Should Own Data Governance – IT or Business?
 
Data Management Best Practices
Data Management Best PracticesData Management Best Practices
Data Management Best Practices
 
MLOps – Applying DevOps to Competitive Advantage
MLOps – Applying DevOps to Competitive AdvantageMLOps – Applying DevOps to Competitive Advantage
MLOps – Applying DevOps to Competitive Advantage
 

Recently uploaded

Search Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdfSearch Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdfRankYa
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr BaganFwdays
 
Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsScanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsRizwan Syed
 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationRidwan Fadjar
 
Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupFlorian Wilhelm
 
Story boards and shot lists for my a level piece
Story boards and shot lists for my a level pieceStory boards and shot lists for my a level piece
Story boards and shot lists for my a level piececharlottematthew16
 
My INSURER PTE LTD - Insurtech Innovation Award 2024
My INSURER PTE LTD - Insurtech Innovation Award 2024My INSURER PTE LTD - Insurtech Innovation Award 2024
My INSURER PTE LTD - Insurtech Innovation Award 2024The Digital Insurer
 
DevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenDevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenHervé Boutemy
 
CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):comworks
 
Install Stable Diffusion in windows machine
Install Stable Diffusion in windows machineInstall Stable Diffusion in windows machine
Install Stable Diffusion in windows machinePadma Pradeep
 
Training state-of-the-art general text embedding
Training state-of-the-art general text embeddingTraining state-of-the-art general text embedding
Training state-of-the-art general text embeddingZilliz
 
Commit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyCommit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyAlfredo García Lavilla
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsMark Billinghurst
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Mark Simos
 
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024BookNet Canada
 
SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024Lorenzo Miniero
 
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsKotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticscarlostorres15106
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii SoldatenkoFwdays
 
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationBeyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationSafe Software
 
What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024Stephanie Beckett
 

Recently uploaded (20)

Search Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdfSearch Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdf
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan
 
Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsScanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL Certs
 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 Presentation
 
Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project Setup
 
Story boards and shot lists for my a level piece
Story boards and shot lists for my a level pieceStory boards and shot lists for my a level piece
Story boards and shot lists for my a level piece
 
My INSURER PTE LTD - Insurtech Innovation Award 2024
My INSURER PTE LTD - Insurtech Innovation Award 2024My INSURER PTE LTD - Insurtech Innovation Award 2024
My INSURER PTE LTD - Insurtech Innovation Award 2024
 
DevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenDevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache Maven
 
CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):
 
Install Stable Diffusion in windows machine
Install Stable Diffusion in windows machineInstall Stable Diffusion in windows machine
Install Stable Diffusion in windows machine
 
Training state-of-the-art general text embedding
Training state-of-the-art general text embeddingTraining state-of-the-art general text embedding
Training state-of-the-art general text embedding
 
Commit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyCommit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easy
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR Systems
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
 
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
 
SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024
 
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsKotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko
 
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationBeyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
 
What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024
 

Data Systems Integration & Business Value Pt. 2: Cloud

  • 1. Copyright 2013 by Data Blueprint Data Systems Integration & Business Value Part 2: Cloud-based Integration All organizations are prepared to benefit from aspects of the cloud. These benefits accrue when cloud-hosted datasets share three attributes. They must be of: 1. Higher quality data than those data residing outside of the cloud; 2. Lower volume (1/5 the size of data collections) than similar collections residing outside of the cloud; and 3. Increased share-ability than data residing outside the cloud. Increases in capacity utilization, improved IT flexibility and responsiveness, as well as the forecast decreases in cost accruing to cloud-based computing are all possible after these first three conditions have been met. Necessary investments in data engineering can help organizations to save even more money by reducing the amount of resources required to perform their duties and increasing the effectiveness of their duties & decision-making. This webinar will show you how to recognize the opportunities, ‘size up’ the required investment, and properly supervise your efforts to take advantage of the opportunities presented by the cloud. Date: August 13, 2013 Time: 2:00 PM ET/11:00 AM PT Presenter: Peter Aiken, Ph.D. 1
  • 2. Copyright 2013 by Data Blueprint Executive Editor at DATAVERSITY.net 2 Shannon Kempe
  • 3. Copyright 2013 by Data Blueprint Commonly Asked Questions 1) Will I get copies of the slides after the event? 2) Is this being recorded so I can view it afterwards? 3
  • 4. Copyright 2013 by Data Blueprint Get Social With Us! Live Twitter Feed Join the conversation! Follow us: @datablueprint @paiken Ask questions and submit your comments: #dataed Like Us on Facebook www.facebook.com/datablueprint Post questions and comments Find industry news, insightful content and event updates. Join the Group Data Management & Business Intelligence Ask questions, gain insights and collaborate with fellow data management professionals 4
  • 5. Copyright 2013 by Data Blueprint 5 Peter Aiken, PhD • 25+ years of experience in data management • Multiple international awards & recognition • Founder, Data Blueprint (datablueprint.com) • Associate Professor of IS, VCU (vcu.edu) • President, DAMA International (dama.org) • 8 books and dozens of articles • Experienced w/ 500+ data management practices in 20 countries • Multi-year immersions with organizations as diverse as the US DoD, Nokia, Deutsche Bank, Wells Fargo, and the Commonwealth of Virginia 2 The Case for the Chief Data Officer Recasting the C-Suite to Leverage Your MostValuable Asset Peter Aiken and Michael Gorman
  • 6. Data Systems Integration & Business Value Part 2: Cloud-based Integration Presented by Peter Aiken, Ph.D. 10124 W. Broad Street, Suite C Glen Allen, Virginia 23060 804.521.4056
  • 7. Copyright 2013 by Data Blueprint Anticipated Business Value of Cloud-based Integration 7 • Increased Automation and Storage Capacity – Virtually unlimited capacity & flexible storage – Easy to upgrade & Up-to-date software – Automated file synching & backups • Affordability – Pay as you go – Usage is scaled to fit needs • Agility, Scalability and Flexibility – Access from anywhere & collaborate – Data is always current • Free up IT Hours & Staff – Cloud provider takes care of maintenance • Ease of Use – Easy to use & automated
  • 8. Copyright 2013 by Data Blueprint Prerequisites to Cloud-based Integration • Organizational investments in the cloud will be useless from a data perspective unless: – Data governance, architecture, quality, development practices are sufficiently mature – You must understand your data architecture and strategy in order to evaluate various cloud options – Data must be reengineered to be • Less • Better quality • More shareable – for the cloud – Failure to do these will result in more business value for the cloud vendors/service providers and less for your organization 8
  • 9. Copyright 2013 by Data Blueprint 1. Data Management: Contextual Overview 2. Necessary Data Management Functions (Prerequisites) - Data Governance - Data Architecture - Data Development - Data Quality 3. Understanding Cloud-based Technologies 4. Cloud-based Benefits 5. Cloud-based Integration - Cleaner - Smaller - Shareable 6. Take Aways, References and Q&A Tweeting now: #dataed Outline: Cloud-based Integration 9 1. Data Management: Contextual Overview 2. Necessary Data Management Functions (Prerequisites) - Data Governance - Data Architecture - Data Development - Data Quality 3. Understanding Cloud-based Technologies 4. Cloud-based Benefits 5. Cloud-based Integration - Cleaner - Smaller - Shareable 6. Take Aways, References and Q&A
  • 10. Copyright 2013 by Data Blueprint 1. Data Management: Contextual Overview 2. Necessary Data Management Functions (Prerequisites) - Data Governance - Data Architecture - Data Development - Data Quality 3. Understanding Cloud-based Technologies 4. Cloud-based Benefits 5. Cloud-based Integration - Cleaner - Smaller - Shareable 6. Take Aways, References and Q&A Tweeting now: #dataed Outline: Cloud-based Integration 10
  • 11. Data Program Coordination Feedback Data Development Copyright 2013 by Data Blueprint Standard Data Five Integrated DM Practice Areas Organizational Strategies Goals Business Data Business Value Application Models & Designs Implementation Direction Guidance 11 Organizational Data Integration Data Stewardship Data Support Operations Data Asset Use Integrated Models Leverage data in organizational activities Data management processes and infrastructure Combining multiple assets to produce extra value Organizational-entity subject area data integration Provide reliable data access Achieve sharing of data within a business area
  • 12. Copyright 2013 by Data Blueprint Five Integrated DM Practice Areas Manage data coherently. Share data across boundaries. Assign responsibilities for data. Engineer data delivery systems. Maintain data availability. Data Program Coordination Organizational Data Integration Data Stewardship Data Development Data Support Operations 12
  • 13. Copyright 2013 by Data Blueprint Hierarchy of Data Management Practices (after Maslow) • 5 Data management practices areas / data management basics ... • ... are necessary but insufficient prerequisites to organizational data leveraging applications that is self actualizing data or advanced data practices Basic Data Management Practices – Data Program Management – Organizational Data Integration – Data Stewardship – Data Development – Data Support Operations http://3.bp.blogspot.com/-ptl-9mAieuQ/T-idBt1YFmI/AAAAAAAABgw/Ib-nVkMmMEQ/s1600/maslows_hierarchy_of_needs.png Advanced Data Practices • MDM • Mining • Big Data • Analytics • Warehousing • SOA Cloud
  • 14. • Data Management Body of Knowledge (DMBOK) – Published by DAMA International, the professional association for Data Managers (40 chapters worldwide) – Organized around primary data management functions focused around data delivery to the organization and several environmental elements • Certified Data Management Professional (CDMP) – Series of 3 exams by DAMA International and ICCP – Membership in a distinct group of fellow professionals – Recognition for specialized knowledge in a choice of 17 specialty areas – For more information, please visit: • www.dama.org, www.iccp.org Copyright 2013 by Data Blueprint DAMA DM BoK & CDMP 14
  • 15. Copyright 2013 by Data Blueprint Series Context • Certain systems are more data focused than others. Usually their primary focus is on accomplishing integration of disparate data. In these cases, failure is most often attributable to the adoption of a single technological pillar (silver bullet). The three webinars in the Data Systems Integration and Business Value series are designed to illustrate that good systems development more often depends on at least three DM disciplines (pie wedges) in order to provide a solid foundation. • Data Systems Integration & Business Value – Pt. 1: Metadata Practices – Pt. 2: Cloud-based Integration – Pt. 3: Warehousing, et al. 15
  • 16. Uses Copyright 2013 by Data Blueprint Part 1: Metadata Take Aways • Metadata unlocks the value of data, and therefore requires management attention [Gartner 2011] • Metadata is the language of data governance • Metadata defines the essence of integration challenges Sources Metadata Governance Metadata Engineering Metadata Delivery Metadata Practices Metadata Storage 16 Specialized Team Skills
  • 17. Data Management functions necessary but insufficient for metadata- based integration Copyright 2013 by Data Blueprint Data Management Body of Knowledge 17 From The DAMA Guide to the Data Management Body of Knowledge © 2009 by DAMA International
  • 18. Copyright 2013 by Data Blueprint 1. Data Management: Contextual Overview 2. Necessary Data Management Functions (Prerequisites) - Data Governance - Data Architecture - Data Development - Data Quality 3. Understanding Cloud-based Technologies 4. Cloud-based Benefits 5. Cloud-based Integration - Cleaner - Smaller - Shareable 6. Take Aways, References and Q&A Tweeting now: #dataed Outline: Cloud-based Integration 18
  • 19. Copyright 2013 by Data Blueprint 1. Data Management: Contextual Overview 2. Necessary Data Management Functions (Prerequisites) - Data Governance - Data Architecture - Data Development - Data Quality 3. Understanding Cloud-based Technologies 4. Cloud-based Benefits 5. Cloud-based Integration - Cleaner - Smaller - Shareable 6. Take Aways, References and Q&A Tweeting now: #dataed Outline: Cloud-based Integration 19
  • 20. Copyright 2013 by Data Blueprint Data Management Body of Knowledge 20 Data Management functions necessary but insufficient for cloud- based integration From The DAMA Guide to the Data Management Body of Knowledge © 2009 by DAMA International
  • 21. Copyright 2013 by Data Blueprint 21 Data Governance From The DAMA Guide to the Data Management Body of Knowledge © 2009 by DAMA International
  • 22. Copyright 2013 by Data Blueprint Data Governance 22 From The DAMA Guide to the Data Management Body of Knowledge © 2009 by DAMA International
  • 23. Copyright 2013 by Data Blueprint Organizational Data Governance Purpose Statement • What does data governance mean to my organization? – Getting some individuals (whose opinions matter) – To form a body (needs a formal purpose/authority) – Who will advocate/evangelize for (not dictate, enforce, rule) – Increasing scope and rigor of – Data-centric development practices 23
  • 24. Top Operations Job Copyright 2013 by Data Blueprint Data Governance is a Gateway for IT Projects 24 Top Job Top Finance Job Top Information Technology Job Top Marketing Job • Data assets are better foundational building block for IT projects • CDO coordinates IT investment priorities with Top IT Job • CDO determines when proposed IT projects are "ready" Data Governance Organization Chief Data Officer
  • 25. Copyright 2013 by Data Blueprint 25 Data Architecture Management From The DAMA Guide to the Data Management Body of Knowledge © 2009 by DAMA International
  • 26. Copyright 2013 by Data Blueprint Data Architecture Management 26 From The DAMA Guide to the Data Management Body of Knowledge © 2009 by DAMA International
  • 27. Copyright 2013 by Data Blueprint Architectural Answers (Adapted from [Allen & Boynton 1991]) Computers Human resources Communication facilities Software Management responsibilities Policies, directives, and rules Data 27 • Where do they go? • When are they needed? • What standards should be adopted? • What vendors should be chosen? • What rules should govern the decisions? • What policies should guide the process? • How and why do the components interact? • Why and how will the changes be implemented? • What should be managed organization-wide and what should be managed locally?
  • 28. Zachman Framework 3.0 - the Enterprise Ontology Classification Names Model Names *Horizontal integration lines areshownforexamplepurposes only and are not a complete set. Composite, integrative rela- tionships connecting every cell horizontally potentially exist. Audience Perspectives Enterprise Names Classification Names Audience Perspectives C o m p o s i t e I n t e g r a t i o n s A l i g n m e n t T r a n s f o r m a t i o n s C o m p o s i t e I n t e g r a t i o n s A l i g n m e n t T r a n s f o r m a t i o n s C o m p o s i t e I n t e g r a t i o n s C o m p o s i t e I n t e g r a t i o n s A l i g n m e n t T r a n s f o r m a t i o n s A l i g n m e n t T r a n s f o r m a t i o n s A l i g n m e n t A l i g n m e n t How Where Who WhenWhat Why Process Flows Distribution Networks Responsibility Assignments Timing Cycles Inventory Sets Motivation Intentions Operations Instances (Implementations) The Enterprise The Enterprise Enterprise Perspective (Users) Executive Perspective (Business Context Planners) Business Mgmt Perspective (Business Concept Owners) Architect Perspective (Business Logic Designers) Engineer Perspective (Business Physics Builders) Technician Perspective (Business Component Implementers) Scope Contexts (Scope Identification Lists) Business Concepts (Business Definition Models) System Logic (System Representation Models) Technology Physics (Technology Specification Models) Tool Components (Tool Configuration Models) e.g. e.g. e.g. e.g. e.g. e.g. e.g. e.g. e.g. e.g. e.g. e.g. e.g. e.g. e.g. e.g. e.g. e.g. e.g. e.g. e.g. e.g. e.g. e.g. e.g.: primitive e.g.: composite model: model: Forecast Sales Plan Production Sell Products Take Orders Train Employees Assign Territories Develop Markets Maintain Facilities Repair Products Record Transctns Material Supply Ntwk Product Dist. Ntwk Voice Comm. Ntwk Data Comm. Ntwk Manu. Process Ntwk Office
  • 31.  

Ntwk Parts Dist. Ntwk Personnel Dist. Ntwk etc., etc. General Mgmt Product Mgmt Engineering Design Manu. Engineering Accounting Finance Transportation Distribution Marketing Sales Product Cycle Market Cycle Planning Cycle Order Cycle Employee Cycle Maint. Cycle Production Cycle Sales Cycle Economic Cycle Accounting Cycle Products Product Types Warehouses Parts Bins Customers Territories Orders Employees Vehicles Accounts New Markets Revenue Growth Expns Reduction Cust Convenience Customer Satis. Regulatory Comp. New Capital Social Contribution Increased Yield Increased Qualitye.g. e.g. e.g. e.g. e.g. e.g. Operations Transforms Operations In/Outputs Operations Locations Operations Connections Operations Roles Operations Work Products Operations Intervals Operations Moments Operations Entities Operations Relationships Operations Ends Operations Means Process Instantiations Distribution Instantiations Responsibility Instantiations Timing Instantiations Inventory Instantiations Motivation Instantiations List: Timing Types Business Interval Business Moment List: Responsibility Types Business Role Business Work Product List: Distribution Types Business Location Business Connection List: Process Types Business Transform Business Input/Output System Transform System Input /Output System Location System Connection System Role System Work Product System Interval System Moment Technology Transform Technology Input /Output Technology Location Technology Connection Technology Role Technology Work Product Technology Interval Technology Moment Tool Transform Tool Input /Output Tool Location Tool Connection Tool Role Tool Work Product Tool Interval Tool Moment List: Inventory Types Business Entity Business Relationship System Entity System Relationship Technology Entity Technology Relationship Tool Entity Tool Relationship List: Motivation Types Business End Business Means System End System Means Technology End Technology Means Tool End Tool Means Timing IdentificationResponsibility IdentificationDistribution IdentificationProcess Identification Timing DefinitionResponsibility DefinitionDistribution DefinitionProcess Definition Process Representation Distribution Representation Responsibility Representation Timing Representation Process Specification Distribution Specification Responsibility Specification Timing Specification Inventory Identification Inventory Definition Inventory Representation Inventory Specification Inventory Configuration Process Configuration Distribution Configuration Responsibility Configuration Timing Configuration Motivation Identification Motivation Definition Motivation Representation Motivation Specification Motivation Configuration Copyright 2013 by Data Blueprint 28 Copyright 2008-2011 John A. Zachman
  • 32. Copyright 2013 by Data Blueprint What is an information architecture? • A structure of data-based information assets supporting implementation of organizational strategy (or strategies) • Most organizations have data assets that are not supportive of strategies - i.e., information architectures that are not helpful • The really important question is: how can organizations more effectively use their information architectures to support strategy implementation? 29 Classification Names Model Names *Horizontal integration lines areshownforexamplepurposes only and are not a complete set. Composite, integrative rela- tionships connecting every cell horizontally potentially exist. Audience Perspectives Enterprise Names Classification Names Audience Perspectives C o m p o s i t e I n t e g r a t i o n s A l i g n m e n t T r a n s f o r m a t i o n s C o m p o s i t e I n t e g r a t i o n s A l i g n m e n t T r a n s f o r m a t i o n s C o m p o s i t e I n t e g r a t i o n s C o m p o s i t e I n t e g r a t i o n s A l i g n m e n t T r a n s f o r m a t i o n s A l i g n m e n t T r a n s f o r m a t i o n s A l i g n m e n t A l i g n m e n t How Where Who WhenWhat Why Process Flows Distribution Networks Responsibility Assignments Timing Cycles Inventory Sets Motivation Intentions Operations Instances (Implementations) The Enterprise The Enterprise Enterprise Perspective (Users) Executive Perspective (Business Context Planners) Business Mgmt Perspective (Business Concept Owners) Architect Perspective (Business Logic Designers) Engineer Perspective (Business Physics Builders) Technician Perspective (Business Component Implementers) Scope Contexts (Scope Identification Lists) Business Concepts (Business Definition Models) System Logic (System Representation Models) Technology Physics (Technology Specification Models) Tool Components (Tool Configuration Models) e.g. e.g. e.g. e.g. e.g. e.g. e.g. e.g. e.g. e.g. e.g. e.g. e.g. e.g. e.g. e.g. e.g. e.g. e.g. e.g. e.g. e.g. e.g. e.g. e.g.: primitive e.g.: composite model: model: Forecast Sales Plan Production Sell Products Take Orders Train Employees Assign Territories Develop Markets Maintain Facilities Repair Products Record Transctns Material Supply Ntwk Product Dist. Ntwk Voice Comm. Ntwk Data Comm. Ntwk Manu. Process Ntwk Office
  • 35.  

Ntwk Parts Dist. Ntwk Personnel Dist. Ntwk etc., etc. General Mgmt Product Mgmt Engineering Design Manu. Engineering Accounting Finance Transportation Distribution Marketing Sales Product Cycle Market Cycle Planning Cycle Order Cycle Employee Cycle Maint. Cycle Production Cycle Sales Cycle Economic Cycle Accounting Cycle Products Product Types Warehouses Parts Bins Customers Territories Orders Employees Vehicles Accounts New Markets Revenue Growth Expns Reduction Cust Convenience Customer Satis. Regulatory Comp. New Capital Social Contribution Increased Yield Increased Qualitye.g. e.g. e.g. e.g. e.g. e.g. Operations Transforms Operations In/Outputs Operations Locations Operations Connections Operations Roles Operations Work Products Operations Intervals Operations Moments Operations Entities Operations Relationships Operations Ends Operations Means Process Instantiations Distribution Instantiations Responsibility Instantiations Timing Instantiations Inventory Instantiations Motivation Instantiations List: Timing Types Business Interval Business Moment List: Responsibility Types Business Role Business Work Product List: Distribution Types Business Location Business Connection List: Process Types Business Transform Business Input/Output System Transform System Input /Output System Location System Connection System Role System Work Product System Interval System Moment Technology Transform Technology Input /Output Technology Location Technology Connection Technology Role Technology Work Product Technology Interval Technology Moment Tool Transform Tool Input /Output Tool Location Tool Connection Tool Role Tool Work Product Tool Interval Tool Moment List: Inventory Types Business Entity Business Relationship System Entity System Relationship Technology Entity Technology Relationship Tool Entity Tool Relationship List: Motivation Types Business End Business Means System End System Means Technology End Technology Means Tool End Tool Means Timing IdentificationResponsibility IdentificationDistribution IdentificationProcess Identification Timing DefinitionResponsibility DefinitionDistribution DefinitionProcess Definition Process Representation Distribution Representation Responsibility Representation Timing Representation Process Specification Distribution Specification Responsibility Specification Timing Specification Inventory Identification Inventory Definition Inventory Representation Inventory Specification Inventory Configuration Process Configuration Distribution Configuration Responsibility Configuration Timing Configuration Motivation Identification Motivation Definition Motivation Representation Motivation Specification Motivation Configuration
  • 36. ! ! ! ! Copyright 2013 by Data Blueprint 30 Organizational Needs become instantiated and integrated into an Data/Information Architecture Informa(on)System) Requirements authorizes and articulates satisfyspecificorganizationalneeds Data Architectures produce and are made up of information models that are developed in response to organizational needs
  • 37. Copyright 2013 by Data Blueprint Data Architecture – Better Definition 31 • All organizations have information architectures – Some are better understood and documented (and therefore more useful to the organization) than others. • Common vocabulary expressing integrated requirements ensuring that data assets are stored, arranged, managed, and used in systems in support of organizational strategy [Aiken 2010]
  • 38. Copyright 2013 by Data Blueprint 32 Data Development From The DAMA Guide to the Data Management Body of Knowledge © 2009 by DAMA International
  • 39. Copyright 2013 by Data Blueprint Data Modeling/Data Development 33 From The DAMA Guide to the Data Management Body of Knowledge © 2009 by DAMA International
  • 40. Copyright 2013 by Data Blueprint 34 #dataed Program F Program E Program D Program G Program H Program I Application domain 2Application domain 3 Data Development Focus
  • 41. Copyright 2013 by Data Blueprint 35 #dataed Data Development has greater Business Value
  • 42. Copyright 2013 by Data Blueprint 36 Conceptual Logical Physical Validated Not Validated Every change can be mapped to a transformation in this framework! Data Development Evolution Framework
  • 43. Copyright 2013 by Data Blueprint 37 Data Quality Management From The DAMA Guide to the Data Management Body of Knowledge © 2009 by DAMA International
  • 44. Copyright 2013 by Data Blueprint Data Quality Engineering 38 From The DAMA Guide to the Data Management Body of Knowledge © 2009 by DAMA International
  • 45. Copyright 2013 by Data Blueprint Definitions • Quality Data – Fit for use meets the requirements of its authors, users, and administrators (adapted from Martin Eppler) – Synonymous with information quality, since poor data quality results in inaccurate information and poor business performance • Data Quality Management – Planning, implementation and control activities that apply quality management techniques to measure, assess, improve, and ensure data quality – Entails the "establishment and deployment of roles, responsibilities concerning the acquisition, maintenance, dissemination, and disposition of data" http://www2.sas.com/proceedings/sugi29/098-29.pdf ✓ Critical supporting process from change management ✓ Continuous process for defining acceptable levels of data quality to meet business needs and for ensuring that data quality meets these levels • Data Quality Engineering – Recognition that data quality solutions cannot not managed but must be engineered – Engineering is the application of scientific, economic, social, and practical knowledge in order to design, build, and maintain solutions to data quality challenges – Engineering concepts are generally not known and understood within IT or business! 39 Spinach/Popeye story from http://it.toolbox.com/blogs/infosphere/spinach-how-a-data-quality-mistake-created-a-myth-and-a-cartoon-character-10166
  • 46. Copyright 2013 by Data Blueprint Quality Dimensions 40
  • 47. Copyright 2013 by Data Blueprint Starting point for new system development data performance metadata data architecture data architecture and data models shared data updated data corrected data architecture refinements facts & meanings Metadata & Data Storage Starting point for existing systems Metadata Refinement • Correct Structural Defects • Update Implementation Metadata Creation • Define Data Architecture • Define Data Model Structures Metadata Structuring • Implement Data Model Views • Populate Data Model Views Data Refinement • Correct Data Value Defects • Re-store Data Values Data Manipulation • Manipulate Data • Updata Data Data Utilization • Inspect Data • Present Data Data Creation • Create Data • Verify Data Values Data Assessment • Assess Data Values • Assess Metadata Extended data life cycle model with metadata sources and uses 41
  • 48. Copyright 2013 by Data Blueprint DQE Context & Engineering Concepts • Can rules be implemented stating that no data can be corrected unless the source of the error has been discovered and addressed? • All data must be 100% perfect? • Pareto – 80/20 rule – Not all data is of equal Importance • Scientific, economic, social, and practical knowledge 42
  • 49. Copyright 2013 by Data Blueprint 1. Data Management: Contextual Overview 2. Necessary Data Management Functions (Prerequisites) - Data Governance - Data Architecture - Data Development - Data Quality 3. Understanding Cloud-based Technologies 4. Cloud-based Benefits 5. Cloud-based Integration - Cleaner - Smaller - Shareable 6. Take Aways, References and Q&A Tweeting now: #dataed Outline: Cloud-based Integration 43
  • 50. Copyright 2013 by Data Blueprint 1. Data Management: Contextual Overview 2. Necessary Data Management Functions (Prerequisites) - Data Governance - Data Architecture - Data Development - Data Quality 3. Understanding Cloud-based Technologies 4. Cloud-based Benefits 5. Cloud-based Integration - Cleaner - Smaller - Shareable 6. Take Aways, References and Q&A Tweeting now: #dataed Outline: Cloud-based Integration 44
  • 51. Copyright 2013 by Data Blueprint http://visual.ly/amazing-journey-data-cloud 45
  • 52. Copyright 2013 by Data Blueprint http://visual.ly/amazing-journey-data-cloud 46
  • 53. Copyright 2013 by Data Blueprint http://visual.ly/amazing-journey-data-cloud 47
  • 54. Copyright 2013 by Data Blueprint Gartner Five-phase Hype Cycle http://www.gartner.com/technology/research/methodologies/hype-cycle.jsp 48 Technology Trigger: A potential technology breakthrough kicks things off. Early proof-of-concept stories and media interest trigger significant publicity. Often no usable products exist and commercial viability is unproven. Trough of Disillusionment: Interest wanes as experiments and implementations fail to deliver. Producers of the technology shake out or fail. Investments continue only if the surviving providers improve their products to the satisfaction of early adopters. Peak of Inflated Expectations: Early publicity produces a number of success stories—often accompanied by scores of failures. Some companies take action; many do not. Slope of Enlightenment: More instances of how the technology can benefit the enterprise start to crystallize and become more widely understood. Second- and third- generation products appear from technology providers. More enterprises fund pilots; conservative companies remain cautious. Plateau of Productivity: Mainstream adoption starts to take off. Criteria for assessing provider viability are more clearly defined. The technology’s broad market applicability and relevance are clearly paying off.
  • 55. Copyright 2013 by Data Blueprint Gartner Cloud Hype Cycle “While clearly maturing, cloud computing continues to be the most hyped subject in IT today.” 49
  • 56. Copyright 2013 by Data Blueprint 50 • Cloud computing is location-independent computing, whereby shared servers provide resources, software, and data to computers and other devices on demand, as with the electricity grid. • Cloud computing is a natural evolution of the widespread adoption of virtualization, service- oriented architecture and utility computing. • Details are abstracted from consumers, who no longer have need for expertise in, or control over, the technology infrastructure "in the cloud" that supports them. Cloud Computing
  • 57. Copyright 2013 by Data Blueprint Five Essential Characteristics of Data Cloud Infrastructure • Gartner defines "cloud computing" as the set of disciplines, technologies, and business models used to deliver IT capabilities (software, platforms, hardware) as an on- demand, scalable, elastic service. • Five essential characteristics of cloud computing: – It uses shared infrastructure – It provides on-demand self-service – It is elastic and scalable – It is priced by consumption – It is dynamic and virtualized 51
  • 58. Copyright 2013 by Data Blueprint 52 Cloud Scalability
  • 59. Copyright 2013 by Data Blueprint Cloud Rendering 53
  • 60. Copyright 2013 by Data Blueprint Cisco's Ladder to the Cloud 54
  • 61. Copyright 2013 by Data Blueprint Cloud Options 55
  • 62. Copyright 2013 by Data Blueprint Solving the Big Data Puzzle 56 http://damfoundation.org/2012/06/whats-the-big-deal-about-big-data/
  • 63. Copyright 2013 by Data Blueprint 1. Data Management: Contextual Overview 2. Necessary Data Management Functions (Prerequisites) - Data Governance - Data Architecture - Data Development - Data Quality 3. Understanding Cloud-based Technologies 4. Cloud-based Benefits 5. Cloud-based Integration - Cleaner - Smaller - Shareable 6. Take Aways, References and Q&A Tweeting now: #dataed Outline 57
  • 64. Copyright 2013 by Data Blueprint 1. Data Management: Contextual Overview 2. Necessary Data Management Functions (Prerequisites) - Data Governance - Data Architecture - Data Development - Data Quality 3. Understanding Cloud-based Technologies 4. Cloud-based Benefits 5. Cloud-based Integration - Cleaner - Smaller - Shareable 6. Take Aways, References and Q&A Tweeting now: #dataed Outline 58
  • 65. Copyright 2013 by Data Blueprint Benefits 59
  • 66. Copyright 2013 by Data Blueprint Benefits 60
  • 67. Copyright 2013 by Data Blueprint Anticipated Benefits 61 0% 13% 25% 38% 50% Improve data quality Reduce installation and maintenance efforts Reduce implementation efforts Eliminate manual processes Reduce time require to collect and prepare data Apply data governance policies
  • 68. Copyright 2013 by Data Blueprint Similar Opportunity • IT Infrastructure. Your submission should include funding for the timely execution of agency plans to consolidate data centers developed in FY 2010 (reference FY 2011 passback guidance). In coordination with the data center consolidations, agencies should evaluate the potential to adopt cloud computing solutions by analyzing computing alternatives for IT investments in FY 2012. Agencies will be expected to adopt cloud computing solutions where they represent the best value at an acceptable level of risk. • Adopt Light Technologies and Shared Solutions. We are reducing our data center footprint by 40 percent by 2015 and shifting the agency default approach to IT to a cloud-first policy as part of the 2012 budget process. Consolidating more than 2,000 government data centers will save money, increase security and improve performance. 62
  • 69. Copyright 2013 by Data Blueprint 1. Data Management: Contextual Overview 2. Necessary Data Management Functions (Prerequisites) - Data Governance - Data Architecture - Data Development - Data Quality 3. Understanding Cloud-based Technologies 4. Cloud-based Benefits 5. Cloud-based Integration - Cleaner - Smaller - Shareable 6. Take Aways, References and Q&A Tweeting now: #dataed Outline: Cloud-based Integration 63
  • 70. Copyright 2013 by Data Blueprint 1. Data Management: Contextual Overview 2. Necessary Data Management Functions (Prerequisites) - Data Governance - Data Architecture - Data Development - Data Quality 3. Understanding Cloud-based Technologies 4. Cloud-based Benefits 5. Cloud-based Integration - Cleaner - Smaller - Shareable 6. Take Aways, References and Q&A Tweeting now: #dataed Outline: Cloud-based Integration 64
  • 71. Copyright 2013 by Data Blueprint Data in the cloud should have three attributes that data outside the cloud should not have. It should be: 65 Sharable-er Cleaner Smaller
  • 72. Copyright 2013 by Data Blueprint Aspirational Data in the Cloud 66
  • 73. Copyright 2013 by Data Blueprint Effective Cloud Transformation • Transformation into cloud computing cannot be done in a manner that benefits organizations unless data is re-architected – formally with two goals: – Maximizing effective, organization-wide data sharing; and – Minimizing organizational data ROT. • Resulting data volume reduction should be 1/5 what is currently is – A significant economic motivator. • All existing organizations have data collections that possess unique strengths and weaknesses – Strengths that should be leveraged – Weaknesses must be addressed • Neither of these can be accomplished without formal data rearchitecting prior to cloud loading. • There are very few who work in the area for a living but my team has achieved some remarkable successes. 67
  • 74. Copyright 2013 by Data Blueprint Transform 68 Problems with forklifting 1. no basis for decisions made 2. no inclusion of architecture/ engineering concepts 3. no idea that these concepts are missing from the process Less Cleaner More shareable ... data Getting into the Cloud
  • 75. Copyright 2013 by Data Blueprint Data Leverage • Permits organizations to better manage their sole non-depleteable, non- degrading, durable, strategic asset - data – within the organization, and – with organizational data exchange partners • Leverage – Obtained by implementation of data-centric technologies, processes, and human skill sets – Increased by elimination of data ROT (redundant, obsolete, or trivial) • The bigger the organization, the greater potential leverage exists • Treating data more asset-like simultaneously 1. lowers organizational IT costs and 2. increases organizational knowledge worker productivity 69 Less ROT Technologies Process People
  • 76. Copyright 2013 by Data Blueprint The Cloud as a Data Quality Tool Enterprise Portal Data DeliveryData Analysis Quality Technology Continuous Improvement Data Baselining Statistical Data Control Cost of Quality Model Empowerment Data Reduction Pattern Analysis Mathematical Analysis Schema Validation Reusability Logic & Logic Programming Relational DB Technology Data Migration Technologies Statistical Programming Languages 70
  • 77. Copyright 2013 by Data Blueprint Fixing Data in the Cloud Using A Glovebox 71
  • 78. Copyright 2013 by Data Blueprint 72 Conceptual Logical Physical Validated Not Validated Every change can be mapped to a transformation in this framework! Data Development Evolution Framework
  • 79. Copyright 2013 by Data Blueprint 73 Data Reengineering for More Shareable Data As-is To-be Technology Independent/ Logical Technology Dependent/ Physical abstraction Other logical as-is data architecture components
  • 80. Copyright 2013 by Data Blueprint 1. Data Management: Contextual Overview 2. Necessary Data Management Functions (Prerequisites) - Data Governance - Data Architecture - Data Development - Data Quality 3. Understanding Cloud-based Technologies 4. Cloud-based Benefits 5. Cloud-based Integration - Cleaner - Smaller - Shareable 6. Take Aways, References and Q&A Tweeting now: #dataed Outline: Cloud-based Integration 74
  • 81. Copyright 2013 by Data Blueprint 1. Data Management: Contextual Overview 2. Necessary Data Management Functions (Prerequisites) - Data Governance - Data Architecture - Data Development - Data Quality 3. Understanding Cloud-based Technologies 4. Cloud-based Benefits 5. Cloud-based Integration - Cleaner - Smaller - Shareable 6. Take Aways, References and Q&A Tweeting now: #dataed Outline: Cloud-based Integration 75
  • 82. Copyright 2013 by Data Blueprint Part 2: Take Aways • Data governance, architecture, quality, development maturity are necessary but insufficient prerequisites to successful data cloud implementation • A variety of cloud options will influence cloud and data architectures in general – You must understand your architecture and strategy in order to evaluate the options • Data must be reengineered to be – Less – Better quality – More shareable – for the cloud • Failure to do these will result in more business value for the cloud vendors/ service providers and less for your organization
  • 83. Copyright 2013 by Data Blueprint Questions? It’s your turn! Use the chat feature or Twitter (#dataed) to submit your questions to Peter now. 77 + =
  • 84. Data Systems Integration & Business Value Pt. 3: Warehousing September 10, 2013 @ 2:00 PM ET/11:00 AM PT Show me the Money: Monetizing Data Management October 8, 2013 @ 2:00 PM ET/11:00 AM PT Sign up here: www.datablueprint.com/webinar-schedule or www.dataversity.net Copyright 2013 by Data Blueprint Upcoming Events 78