EMBARCADERO TECHNOLOGIESEMBARCADERO TECHNOLOGIES
Build a Collaborative Data Architecture
Ron Huizenga
Senior Product Manager – ER/Studio
EMBARCADERO TECHNOLOGIES
Agenda
• What’s happening in the world of data?
– Data trends, data usage, data thirst
• Collaboration defined
– Enabling collaboration
• The need for architecture
• What is data architecture?
• Model based data architecture
– For solution development
– To mitigate organizational data landscape complexity
• Business driven data architecture
• Concluding remarks
2
EMBARCADERO TECHNOLOGIES
Increasing volumes,
velocity, and variety of
Enterprise Data
30% - 50% year/year
growth
Decreasing % of
enterprise data which is
effectively utilized
5% of all Enterprise data
fully utilized
Increased risk from data
misunderstanding and
non-compliance
$600bn/annual cost for
data clean-up in U.S.
Enterprise Data Trends
EMBARCADERO TECHNOLOGIES
Quenching the Thirst - Big Data?
• Volume
• Velocity
• Variety
• Veracity
EMBARCADERO TECHNOLOGIES
Business Stakeholders’ Data Usage
5
Suspect that business stakeholders
INTERPRET DATA INCORRECTLY
Yes,
frequently
14%
Yes,
occasionally
67%
No, never
9%
I don’t know
10%
Suspect that business stakeholders make decisions
USING THE WRONG DATA?
Yes,
frequently
11%
Yes,
occasionally
64%
No, never
13%
I don’t know
12%
EMBARCADERO TECHNOLOGIES
Data Model Usage & Understanding
6
13%
3%
16%
19%
31%
18%
0% 5% 10% 15% 20% 25% 30% 35%
We don’t use data models
Other
Our data team does most data
models but developers also build
them as needed
Our database administrators own
data modeling
Developers develop their own data
models
We have a data modeling team that
is responsible for data models
What is your organization’s approach to data modeling?
How well does your organization’s technology leadership team
understand the value of using data models?
Completely
understand
20%
Understand
somewhat
60%
Don’t
understand
17%
I don’t know
3%
87%
EMBARCADERO TECHNOLOGIES
Collaboration
• Collaborate
– to work jointly with others or together especially in an
intellectual endeavor
– to cooperate with or willingly assist an enemy of one's country
and especially an occupying force
– to cooperate with an agency or instrumentality with which one
is not immediately connected
• Collaborative
– produced or conducted by two or more parties working
together
7
EMBARCADERO TECHNOLOGIES
Communication is Critical
8
EMBARCADERO TECHNOLOGIES
Communication – Full Definition
9
Technically correct
Functionally useless
EMBARCADERO TECHNOLOGIES
Communication – Simple Definition
• Communication
– The act or process of using words, sounds, signs, or behaviors to
express or exchange information or to express your ideas,
thoughts, feelings, etc., to someone else
– A message that is given to someone : a letter, telephone call,
etc.
• Communications
– The ways of sending information to people by using technology
10
EMBARCADERO TECHNOLOGIES
The need for architecture?
Evolution:
• 38 years of construction
• 147 builders
• No Blueprints
• No Planning
Result:
• 7 stories
• 65 doors to blank walls
• 13 staircases abandoned
• 24 skylights in floors
• 160 rooms, 950 doors
• 47 fireplaces, 17 chimneys
• Miles of hallways
• Secret passages in walls
• 10,000 window panes (all bathrooms are fitted with windows)
3
EMBARCADERO TECHNOLOGIES
Data Architecture (as defined by DAMA)
• A master set of data models and design approaches identifying the strategic data requirements and the
components of data management solutions, usually at an enterprise level. Enterprise data architecture typically
consists of
– 1) an enterprise data model (contextual/subject area, conceptual or logical),
– 2) state transition diagrams depicting the lifecycle of major entities,
– 3) a robust information value chain analysis identifying data stakeholder roles, organizations, processes and applications, and
– 4) data integration architecture identifying how data will flow between applications and databases. The data integration
architecture may divide into
• database architecture
• master data management architecture
• data warehouse / business intelligence architecture
• meta data architecture.
• Some enterprises also include
– 5) lists of controlled domain values (code sets), and
– 6) the responsibility assignments of data stewards to subject areas, entities and code sets.
• The enterprise data architecture is an important part of the larger enterprise architecture that includes business,
process and technology architecture
12
EMBARCADERO TECHNOLOGIES
Data Architecture
• As defined in Wikipedia:
– Data architecture is composed of models, policies, rules or
standards that govern which data is collected, and how it is
stored, arranged, integrated, and put to use in data
systems and in organizations.
13
EMBARCADERO TECHNOLOGIES
ER/Studio Enterprise Team Edition
5
EMBARCADERO TECHNOLOGIES
Key Skill Sets
• Data Design & Management
• ETL and Software Development
• Data Analysis / Stats
• Business Analysis & Discovery
Value Delivered
• Validation
• Integration
• Enrichment
• Usability
Value and the New Lifecycle
15
Discover
Document
(Model)
Integrate
EMBARCADERO TECHNOLOGIES
Data Landscape Complexity
16
• Comprised of:
– Proliferation of disparate systems
– Mismatched departmental solutions
– Many database platforms
– Big data platforms
– ERP, SAAS
– Obsolete legacy systems
• Compounded by:
– Poor decommissioning strategy
– Point-to-point interfaces
– Data warehouse, data marts, ETL …
Data Archaeologist?
EMBARCADERO TECHNOLOGIES
Discovery and Identification Through Models
• Identify candidate data sources
• Reverse engineer data sources into models
• Identify, name and define
• Classify through metadata
• Map “like” items across models
• Data lineage / chain of custody
• Repository
• Collaboration & publishing
17
EMBARCADERO TECHNOLOGIES
Addressing Complexity through Models
• Multi-level sub-models: allow business decomposition
• Reverse engineering: wide variety of platforms including Big Data
• What and where?
– Naming standards
– Universal mappings
• Document and define
– Metadata extensions (attachments)
– Business glossaries
• Data in context: business processes
• Data lineage
• Repository, collaboration & publishing
18
EMBARCADERO TECHNOLOGIES
Automated Naming Standards
19
EMBARCADERO TECHNOLOGIES
ER/Studio: Universal Mappings
• Ability to link “like” or related objects
– Within same model file
– Across separate model files
• Entity/Table level
• Attribute/Column level
20
EMBARCADERO TECHNOLOGIES
ER Studio: Attachment of Metadata extensions
21
EMBARCADERO TECHNOLOGIES
ER/Studio: Data Dictionary
22
EMBARCADERO TECHNOLOGIES
ER/Studio: Extended Notation for MongoDB
23
EMBARCADERO TECHNOLOGIES
Clarify with Business Data Objects
24
EMBARCADERO TECHNOLOGIES
Alternate Perspectives for Different Audiences
25
EMBARCADERO TECHNOLOGIES
Data Lineage
26
EMBARCADERO TECHNOLOGIES
ER/Studio Team Server: Enterprise Collaboration
27
EMBARCADERO TECHNOLOGIES
ER/Studio Team Server – Model Explorer
28
EMBARCADERO TECHNOLOGIES
The Need for Common Understanding
29
EMBARCADERO TECHNOLOGIES
Business Glossary – Why?
• Maximize understanding of the core business concepts
and terminology of the organization
• Minimize misuse of data due to inaccurate understanding
of the business concepts and terms
• Improve alignment of the business organization with the
technology assets (and technology organization)
• Maximize the accuracy of the results to searches for
business concepts, and associated knowledge
30
EMBARCADERO TECHNOLOGIES
Team Server: Glossary/Terms
31
EMBARCADERO TECHNOLOGIES
Enhanced Communication: Glossary Integration
32
EMBARCADERO TECHNOLOGIES
Addressing Governance
33
Data
Governance
Data
Architecture
Management
Data
Development
Database
Operations
Management
Data Security
Management
Reference &
Master Data
Management
Data
Warehousing
& Business
Intelligence
Management
Document &
Content
Management
Metadata
Management
Data Quality
Management
EMBARCADERO TECHNOLOGIES
Collaborative, Business-Driven Data Architecture
• Improve visibility and collaboration with ER/Studio
• Enable more efficient and automated data modeling
• Share models and metadata across the organization
• Establish business glossaries with consistent terms
and definitions
• Build a solid foundation for compliance, data
governance, and master data management
34
EMBARCADERO TECHNOLOGIES
Thank you!
• Learn more about the ER/Studio product family:
http://www.embarcadero.com/data-modeling
• Trial Downloads:
http://www.embarcadero.com/downloads
• To arrange a demo, please contact Embarcadero
Sales: sales@embarcadero.com, (888) 233-2224
35

Building a Collaborative Data Architecture

  • 1.
    EMBARCADERO TECHNOLOGIESEMBARCADERO TECHNOLOGIES Builda Collaborative Data Architecture Ron Huizenga Senior Product Manager – ER/Studio
  • 2.
    EMBARCADERO TECHNOLOGIES Agenda • What’shappening in the world of data? – Data trends, data usage, data thirst • Collaboration defined – Enabling collaboration • The need for architecture • What is data architecture? • Model based data architecture – For solution development – To mitigate organizational data landscape complexity • Business driven data architecture • Concluding remarks 2
  • 3.
    EMBARCADERO TECHNOLOGIES Increasing volumes, velocity,and variety of Enterprise Data 30% - 50% year/year growth Decreasing % of enterprise data which is effectively utilized 5% of all Enterprise data fully utilized Increased risk from data misunderstanding and non-compliance $600bn/annual cost for data clean-up in U.S. Enterprise Data Trends
  • 4.
    EMBARCADERO TECHNOLOGIES Quenching theThirst - Big Data? • Volume • Velocity • Variety • Veracity
  • 5.
    EMBARCADERO TECHNOLOGIES Business Stakeholders’Data Usage 5 Suspect that business stakeholders INTERPRET DATA INCORRECTLY Yes, frequently 14% Yes, occasionally 67% No, never 9% I don’t know 10% Suspect that business stakeholders make decisions USING THE WRONG DATA? Yes, frequently 11% Yes, occasionally 64% No, never 13% I don’t know 12%
  • 6.
    EMBARCADERO TECHNOLOGIES Data ModelUsage & Understanding 6 13% 3% 16% 19% 31% 18% 0% 5% 10% 15% 20% 25% 30% 35% We don’t use data models Other Our data team does most data models but developers also build them as needed Our database administrators own data modeling Developers develop their own data models We have a data modeling team that is responsible for data models What is your organization’s approach to data modeling? How well does your organization’s technology leadership team understand the value of using data models? Completely understand 20% Understand somewhat 60% Don’t understand 17% I don’t know 3% 87%
  • 7.
    EMBARCADERO TECHNOLOGIES Collaboration • Collaborate –to work jointly with others or together especially in an intellectual endeavor – to cooperate with or willingly assist an enemy of one's country and especially an occupying force – to cooperate with an agency or instrumentality with which one is not immediately connected • Collaborative – produced or conducted by two or more parties working together 7
  • 8.
  • 9.
    EMBARCADERO TECHNOLOGIES Communication –Full Definition 9 Technically correct Functionally useless
  • 10.
    EMBARCADERO TECHNOLOGIES Communication –Simple Definition • Communication – The act or process of using words, sounds, signs, or behaviors to express or exchange information or to express your ideas, thoughts, feelings, etc., to someone else – A message that is given to someone : a letter, telephone call, etc. • Communications – The ways of sending information to people by using technology 10
  • 11.
    EMBARCADERO TECHNOLOGIES The needfor architecture? Evolution: • 38 years of construction • 147 builders • No Blueprints • No Planning Result: • 7 stories • 65 doors to blank walls • 13 staircases abandoned • 24 skylights in floors • 160 rooms, 950 doors • 47 fireplaces, 17 chimneys • Miles of hallways • Secret passages in walls • 10,000 window panes (all bathrooms are fitted with windows) 3
  • 12.
    EMBARCADERO TECHNOLOGIES Data Architecture(as defined by DAMA) • A master set of data models and design approaches identifying the strategic data requirements and the components of data management solutions, usually at an enterprise level. Enterprise data architecture typically consists of – 1) an enterprise data model (contextual/subject area, conceptual or logical), – 2) state transition diagrams depicting the lifecycle of major entities, – 3) a robust information value chain analysis identifying data stakeholder roles, organizations, processes and applications, and – 4) data integration architecture identifying how data will flow between applications and databases. The data integration architecture may divide into • database architecture • master data management architecture • data warehouse / business intelligence architecture • meta data architecture. • Some enterprises also include – 5) lists of controlled domain values (code sets), and – 6) the responsibility assignments of data stewards to subject areas, entities and code sets. • The enterprise data architecture is an important part of the larger enterprise architecture that includes business, process and technology architecture 12
  • 13.
    EMBARCADERO TECHNOLOGIES Data Architecture •As defined in Wikipedia: – Data architecture is composed of models, policies, rules or standards that govern which data is collected, and how it is stored, arranged, integrated, and put to use in data systems and in organizations. 13
  • 14.
  • 15.
    EMBARCADERO TECHNOLOGIES Key SkillSets • Data Design & Management • ETL and Software Development • Data Analysis / Stats • Business Analysis & Discovery Value Delivered • Validation • Integration • Enrichment • Usability Value and the New Lifecycle 15 Discover Document (Model) Integrate
  • 16.
    EMBARCADERO TECHNOLOGIES Data LandscapeComplexity 16 • Comprised of: – Proliferation of disparate systems – Mismatched departmental solutions – Many database platforms – Big data platforms – ERP, SAAS – Obsolete legacy systems • Compounded by: – Poor decommissioning strategy – Point-to-point interfaces – Data warehouse, data marts, ETL … Data Archaeologist?
  • 17.
    EMBARCADERO TECHNOLOGIES Discovery andIdentification Through Models • Identify candidate data sources • Reverse engineer data sources into models • Identify, name and define • Classify through metadata • Map “like” items across models • Data lineage / chain of custody • Repository • Collaboration & publishing 17
  • 18.
    EMBARCADERO TECHNOLOGIES Addressing Complexitythrough Models • Multi-level sub-models: allow business decomposition • Reverse engineering: wide variety of platforms including Big Data • What and where? – Naming standards – Universal mappings • Document and define – Metadata extensions (attachments) – Business glossaries • Data in context: business processes • Data lineage • Repository, collaboration & publishing 18
  • 19.
  • 20.
    EMBARCADERO TECHNOLOGIES ER/Studio: UniversalMappings • Ability to link “like” or related objects – Within same model file – Across separate model files • Entity/Table level • Attribute/Column level 20
  • 21.
    EMBARCADERO TECHNOLOGIES ER Studio:Attachment of Metadata extensions 21
  • 22.
  • 23.
  • 24.
    EMBARCADERO TECHNOLOGIES Clarify withBusiness Data Objects 24
  • 25.
  • 26.
  • 27.
    EMBARCADERO TECHNOLOGIES ER/Studio TeamServer: Enterprise Collaboration 27
  • 28.
    EMBARCADERO TECHNOLOGIES ER/Studio TeamServer – Model Explorer 28
  • 29.
    EMBARCADERO TECHNOLOGIES The Needfor Common Understanding 29
  • 30.
    EMBARCADERO TECHNOLOGIES Business Glossary– Why? • Maximize understanding of the core business concepts and terminology of the organization • Minimize misuse of data due to inaccurate understanding of the business concepts and terms • Improve alignment of the business organization with the technology assets (and technology organization) • Maximize the accuracy of the results to searches for business concepts, and associated knowledge 30
  • 31.
  • 32.
  • 33.
    EMBARCADERO TECHNOLOGIES Addressing Governance 33 Data Governance Data Architecture Management Data Development Database Operations Management DataSecurity Management Reference & Master Data Management Data Warehousing & Business Intelligence Management Document & Content Management Metadata Management Data Quality Management
  • 34.
    EMBARCADERO TECHNOLOGIES Collaborative, Business-DrivenData Architecture • Improve visibility and collaboration with ER/Studio • Enable more efficient and automated data modeling • Share models and metadata across the organization • Establish business glossaries with consistent terms and definitions • Build a solid foundation for compliance, data governance, and master data management 34
  • 35.
    EMBARCADERO TECHNOLOGIES Thank you! •Learn more about the ER/Studio product family: http://www.embarcadero.com/data-modeling • Trial Downloads: http://www.embarcadero.com/downloads • To arrange a demo, please contact Embarcadero Sales: sales@embarcadero.com, (888) 233-2224 35