SlideShare a Scribd company logo
1 of 131
Data Governance
CHAPTER-3
INTRODUCTION
1
1. Introduction
Data Governance (DG) is defined as the exercise of authority and control (planning, monitoring, and enforcement) over
the management of data assets.
The purpose of Data Governance is to ensure that data is managed
properly, according to policies and best practices
The scope and focus of a particular data governance program will
depend on organizational needs, but most programs include:
ī‚§ Strategy
ī‚§ Policy
ī‚§ Standards & Quality
ī‚§ Oversight
ī‚§ Compliance
ī‚§ Issue Management
ī‚§ Data Management Projects
ī‚§ Data Asset Valuation
Data Governance
2. Goals
Within an organization, data management
goals include:
ī‚§ Understanding and supporting the
information needs of the enterprise and its
stakeholders, including customers,
employees, and business partners
ī‚§ Capturing, storing, protecting, and ensuring
the integrity of data assets
ī‚§ Ensuring the quality of data and
information
ī‚§ Ensuring the privacy and confidentiality of
stakeholder data
ī‚§ Preventing unauthorized or inappropriate
access, manipulation, or use of data and
information
ī‚§ Ensuring data can be used effectively to
add value to the enterprise
Data Governance
Business, Application, Information and Technology.
BAIT
According to the DMBoK, which is not a component of a Data Management strategy?
â€ĸIdentifying individuals for Data Management roles
(Correct)
â€ĸA compelling vision for Data Management
â€ĸDescriptions of Data Management roles and organizations, along with a summary of
their responsibilities and decision rights
â€ĸA summary business case for Data Management with selected examples
â€ĸA draft implementation roadmap with projects and action items
(Incorrect)
Information Governance and Data Governance should be?
â€ĸManaged as a single function
(Correct)
â€ĸManaged as separate functions
â€ĸManaged as integrated functions, with Data Governance reporting to Information
Governance
â€ĸManaged as integrated functions, with Information Governance reporting to Data
Governance
â€ĸManaged by the Chief Information Office
(Incorrect)
Data Governance
SMART is an acronym for objectives in projects and programs. SMART stands for?
â€ĸSpecific, Manageable, Agile, Realistic, Topical
â€ĸSystems, Management, Architecture, Resources, Technology
â€ĸSpecific, Measurable, Achievable, Robust, Timely
â€ĸStructured, Manageable, Accurate, Robust, Tested
â€ĸSpecific, Measurable, Achievable, Realistic, Timely
(Correct)
Which of the following is NOT an approach to data valuation?
â€ĸCost of obtaining and storing data
â€ĸCost of replacing data if it were lost
â€ĸWhat the data could be sold for
â€ĸExpected revenue from innovative uses of data
â€ĸEnterprise Data Modeling
(Correct)
Storage & Operations
Atomicity: All operations are performed, or none of them is, so that if one part of the transaction fails,then the entire
transaction fails.
â€ĸConsistency: The transaction must meet all rules defined by the system at all times and must void half-completed
transactions.
â€ĸIsolation: Each transaction is independent unto itself.
â€ĸDurability: Once complete, the transaction cannot be undone.
Basically Available: The system guarantees some level of availability to the data even when there are node failures. The
data may be stale, but the system will still give and accept responses.
â€ĸSoft State: The data is in a constant state of flux; while a response may be given, the data is not guaranteed to be current.
â€ĸEventual Consistency: The data will eventually be consistent through all nodes and in all databases, but not every
transaction will be consistent at every moment.
The CAP Theorem (or Brewer’s Theorem) was developed in response to a shift toward more distributed systems (Brewer, 2000).
Consistency
Availability
Partition Tolerance
Storage & Operations
The CAP Theorem states that at most two of the three properties can exist in any shared-data
system. This is usually stated with a ‘pick two’ statement,
Storage & Operations
In practice, row-oriented storage layouts are well suited for Online Transaction Processing (OLTP)-like workloads, which are
more heavily loaded with interactive transactions. Column-oriented storage layouts are well suited for Online Analytical
Processing (OLAP)-like workloads (e.g., data warehouses) which typically involve a smaller number of highly complex queries
over all data (possibly terabytes).
Triplestore : A data entity composed of subject-predicate-object is known as a triplestore.
A triplestore is a purpose-built database for the storage and retrieval of triples in the form of
1. Subject
2. Predicate
3. Object expressions
Sharding - Sharding is a process where small chunks of the database are isolated and can be updated independently of
other shards, so replication is merely a file copy. Because the shards are small, refreshes/overwrites may be optimal.
Storage & Operations
Data Interoperability & Integration
Data orchestration : Automates processes related to managing data, such as bringing data together from
multiple sources, combining it, and preparing it for data analysis. It can also include tasks like provisioning
resources and monitoring
SCSI or Small Computer System Interface
HDD Evolution
ATA or Advanced Technology Attachment
SATA or Serial Advanced Technology Attachment
SDD Evolution
Data Interoperability & Integration
Data Interoperability & Integration
1.Introduction
Data Governance
1.1 Business Drivers
The most common driver for data governance is often regulatory compliance, especially for heavily regulated industries,
such as financial services and healthcare.
ī‚§ Data governance is not an end in itself.
ī‚§ It needs to align directly with organizational strategy.
ī‚§ The more clearly it helps solve organizational problems, the more likely people will change behaviors and adopt
governance practices.
ī‚§ Drivers for data governance most often focus on reducing risks or improving processes.
Reducing Risks
General risk management
Data security
Privacy
Improving Processes
Regulatory compliance
Data quality improvement
Metadata management
Efficiency in development projects
Vendor management
Business drivers for data governance within an organization are to be aligned
with overall business strategy.
Data Governance
ī‚§ The goal of Data Governance is to enable an organization to manage data as an asset.
ī‚§ DG provides the principles, policy, processes, framework, metrics, and oversight to manage data as an asset and to
guide data management activities at all levels.
ī‚§ To achieve this overall goal, a DG program must be:
ī‚§ Sustainable – Must be sticky, not like a project that has an end date
ī‚§ Embedded – Incorporated into development methods of software, processes, MDM, risk management
ī‚§ Measured – Measure start point and plan for measurable improvements (Financial and value wise)
ī‚§ Leadership & Strategy – Require visionary and committed leadership
ī‚§ Business driven – Must govern the IT decisions related to data and business
ī‚§ Shared responsibility – Responsibility between data stewards and Tech data mgmt. professionals
ī‚§ Multi-layers – Occurs both enterprise and local levels or levels in between.
ī‚§ Framework-based – Requires operating framework that defines accountabilities & interactions.
ī‚§ Principle-based - articulate a core set of principles and best practices as part of policy work. Guiding principles are the
foundation of DG activities
Sustainable Embedded Measured
ī‚§ The following principles, developed since the early 2000s, can help set a strong foundation for data governance
1.2 Goals and Principles
Data Governance
1.3 Essential Concepts
Data governance represents an inherent separation of duty between oversight and execution
Data Governance and Data Management
1.3.1 Data-centric Organization
1.3.2 Data Governance Organization
Data Governance
Data Governance Organization Parts
Data Governance
Typical Data Governance Committees / Bodies
Data Governance
1.3.3 Data Governance Operating Model Types
Data Governance
Obstacles for Establishing Enterprise DG & DM vision:
ī‚§ Existing Culture
ī‚§ Internal politics
ī‚§ Ambiguity about ownership
ī‚§ Budgetary competition
ī‚§ Legacy systems
Data-centric Organization
data stewards are responsible for defining and implementing
policies and procedures for the day-to-day operational and
administrative management of systems and data — including
the intake, storage, processing, and transmission of data to
internal and external systems.
Data Governance
Data Custodian & Data Steward
Activities and accountabilities
1.3.4 Data Stewardship
Data Stewardship is the most common label to describe accountability
and responsibility for data and processes that ensure effective control
and use of data assets.
Data Governance
1.3.5 Types of Data Stewards
ī‚§ Chief Data Stewards may chair data governance bodies in lieu of the CDO or may act as a CDO in a virtual (committee-
based) or distributed data governance organization. They may also be Executive Sponsors.
ī‚§ Executive Data Stewards are senior managers who serve on a Data Governance Council.
ī‚§ Enterprise Data Stewards have oversight of a data domain across business functions.
ī‚§ Business Data Stewards are business professionals, most often recognized subject matter experts, accountable for a
subset of data. They work with stakeholders to define and control data.
ī‚§ A Data Owner is a business Data Steward, who has approval authority for decisions about data within their domain.
ī‚§ Technical Data Stewards are IT professionals operating within one of the Knowledge Areas, such as Data Integration
Specialists, Database Administrators, Business Intelligence Specialists, Data Quality Analysts or Metadata Administrators.
ī‚§ Coordinating Data Stewards lead and represent teams of business and technical Data Stewards in discussions across
teams and with executive Data Stewards. Coordinating Data Stewards are particularly important in large organizations.
(MCIT Role)
Data Governance
1.3.6 Data Policies
ī‚§ Data policies describe the ‘what’ of data governances (what to do and what not to do), while standards and
procedures describe ‘how’ to do data governance. There should be relatively few data policies, and they should be
stated briefly and directly.
ī‚§ Data only brings value when it is used.
ī‚§ The data is not to be viewed as temporary means to achieve results or merely as a business by-product
ī‚§ Data policies are directives (Official or authoritative instructions) that codify principles and management intent into
fundamental rules governing the creation, acquisition, integrity, security, quality, and use of data and information.
1.3.7 Data Asset Valuation
Data Governance
1.3.7 Data Asset Valuation
ī‚§ Data asset valuation is the process of understanding and calculating the economic value of data to an organization.
ī‚§ Most phases of the data lifecycle involve costs (including acquiring, storing, administering, and disposing of data)
ī‚§ Data only brings value when it is used.
ACTIVITIES
2
Data Governance
2. Data Governance Activities
2.1 Define Data Governance for the Organization (that support business strategy & goals)
2.2 Perform readiness assessment (4 Subs)
2.3 Perform Discovery and Business Alignment
2.4 Develop Organizational Touch Points
2.5 Develop Data Governance Strategy
2.6 Define the DG Operating Framework
2.7 Develop Goals, Principles, and Policies
2.8 Underwrite Data Management Projects
2.9 Engage Change Management
2.10 Engage in Issue Management
2.11 Assess Regulatory Compliance Requirements
2.12 Implement Data Governance
2.13 Sponsor Data Standards and Procedures
2.14 Develop a Business Glossary
2.15 Coordinate with Architecture Groups
2.16 Sponsor Data Asset Valuation
2.17 Embed Data Governance
DGs
17 Activities
Data Governance
Successful data governance requires a clear understanding of what is being governed and who is
being governed, as well as who is governing.
1. What is being governed ?
2. Who is being governed ?
3. Who is governing ?
2.1 Define Data Governance for the Organization (that support business strategy & goals)
Data governance is most effective when it is an enterprise
effort, rather than isolated to a particular functional area.
Data Governance
2.2 Perform readiness assessment (4 typical assessments)
1. Data Management Maturity (Measure current data management capabilities & capacity)
2. Capacity to change (Required for change behavior to adapt & will help identify potential resistance points)
3. Collaborative readiness
4. Business alignment
Data Governance
2.3 Perform Discovery and Business Alignment
Discovery activity will identify and assess the effectiveness of existing policies and guidelines –
1. What risks they address,
2. What behaviors they encourage, and
3. How well they have been implemented
ī‚§ Discovery can also identify opportunities for DG to improve the usefulness of data and content.
ī‚§ Business alignment attaches business benefits to DG program elements.
ī‚§ Data Quality analysis is part of discovery
ī‚§ Assessment of data management practices is another key aspect of the data governance discovery
process.
ī‚§ Derive a list of DG requirements from the discovery and alignment activities. For example, if regulatory
risks generate a financial concern to the business, then specify DG activities that support risk
management. These requirements will drive DG strategy and tactics.
Data Governance
2.4 Develop Organizational Touch Points
Touch points that support alignment and
cohesiveness of an enterprise data governance and
data management approach in areas outside the
direct authority of the Chief Data Officer (CDO).
ī‚§ Procurement and Contracts
ī‚§ Budget and Funding
ī‚§ Budget and Funding
ī‚§ Regulatory Compliance
ī‚§ SDLC / development framework
The touch points that the CDO influences support the
organization’s cohesiveness in managing its data, therefore,
increasing its nimbleness to use its data.
In essence, this is a vision of how DG will be perceived by the
organization.
Data Governance
2.5 Develop Data Governance Strategy
ī‚§ DG strategy should be defined comprehensively and articulated in relation to the overall business strategy, as well as
to data management and IT strategies.
ī‚§ It should be implemented iteratively as the pieces are developed and approved.
ī‚§ The deliverables for the DG Strategy include the following:
1. The Charter (Identify Business drivers, vision, mission, principles for DG including readiness, challenges, etc.)
2. Operating framework and accountabilities (Structure & responsibilities of DG activities)
3. Implementation Roadmap (Timeframe for policies, directives, standards, procedures, roll outs, etc.)
4. Plan for Operational success (Describe a target state for sustainable DG activities)
Data Governance
2.6 Define DG Operating Framework
Consider the following areas when constructing an organization’s operating model.
1. Value of data to the organization (Commercial data,
Open data & Data as Operational lubricant)
2. Business model (Decentralized business vs. centralized,
local vs. international) Links with specific IT strategy,
Data Architecture, and application integration functions
should be reflected in the target operating framework
design. (As per adjacent figure)
3. Cultural factors (Acceptance & Resistance to change)
4. Impact of regulation ( Highly regulated organizations
will have a different mindset and operating model of DG
than those less regulated. There may be links to the Risk
Management group or Legal as well)
Data Governance
2.6 Define DG Operating Framework
Layers of data governance are often part of the solution.
This means determining where accountability should
reside for stewardship activities, who owns the data, etc.
The operating model also defines the interaction
between the governance organization and the people
responsible for data management projects or initiatives,
the engagement of change management activities to
introduce this new program, and the model for issue
management resolution pathways through governance.
The example is illustrative. This kind of artifact must be
customized to meet the needs of a specific organization.
Data Governance
2.7 Develop Goals, Principles, and Policies
Development of goals, principles, and policies derived from the Data Governance Strategy will guide the organization into
the desired future state.
Goals, principles, and policies are typically drafted by either by data management professionals, business policy staff, or a
combination, under the auspices of data governance. Next, Data Stewards and management review and refine them.
Then, the Data Governance Council (or similar body) conducts the final review, revision, and adoption.
Policies may take different shapes, as in the following examples:
ī‚§ The Data Governance Office (DGO) will certify data for use by the organization.
ī‚§ Business owners will be approved by the Data Governance Office.
ī‚§ Business owners will designate Data Stewards from their business capability areas. The Data Stewards will have day-to-
day responsibility for coordinating data governance activities.
ī‚§ Whenever possible, standardized reporting and/or dashboards/scorecards will be made available to serve the majority
of business needs.
ī‚§ Certified Users will be granted access to Certified Data for ad hoc /non-standard reporting.
ī‚§ All certified data will be evaluated on a regular basis to assess its accuracy, completeness, consistency, accessibility,
uniqueness, compliance, and efficiency.
Data policies must be effectively communicated, monitored, enforced, and periodically re-evaluated. The Data Governance
Council may delegate this authority to the Data Stewardship Steering Committee.
Data Governance
2.8 Underwrite Data Management Projects
ī‚§ Initiatives to improve data management capabilities provide enterprise-wide benefits
ī‚§ These usually require cross-functional sponsorship or visibility from the DGC
ī‚§ Data management projects may be considered part of the overall IT project portfolio
ī‚§ Data management activity in other projects must be accommodated by the internal SDLC, service delivery management,
other Information Technology Infrastructure Library (ITIL) components, and PMO processes
Data Governance
2.9 Engage Change Management
Organizational Change Management (OCM) is the vehicle for bringing about change in an organization’s systems and processes.
Organizations should create a team responsible for:
ī‚§ Planning
ī‚§ Training
ī‚§ Influencing systems development
ī‚§ Policy implementation
ī‚§ Communications
A change management program supporting formal Data Governance should focus communications on:
ī‚§ Promoting the value of data assets:
ī‚§ Monitoring and acting on feedback about data governance activities
ī‚§ Implementing data management training
ī‚§ Measuring the effects of change management on in five key areas
ī‚§ Implementing new metrics and KPIs
1. Awareness of the need to change
2. Desire to participate and support the change
3. Knowledge about how to change
4. Ability to implement new skills and behaviors
5. Reinforcement to keep the change in place
Data Governance
2.10 Engage in Issue Management
Issue management is the process for identifying, quantifying, prioritizing, and resolving data governance-related issues, including:
ī‚§ Authority
ī‚§ Change management escalations
ī‚§ Compliance
ī‚§ Conflicts
ī‚§ Conformance
ī‚§ Contracts
ī‚§ Data security and identity
ī‚§ Data quality
A Data Governance Scorecard can be used to identify trends
related to issues, such as where within the organization they occur,
what their root causes are, etc. Issues that cannot be resolved by
the DGC should be escalated to corporate governance and / or
management.
Data Issue Escalation Path
Data Governance
2.11 Assess Regulatory Compliance Requirements
Regulatory compliance is often the initial reason for implementing data governance
Several global regulations have significant implications on data management practices. For example:
ī‚§ Accounting Standards (The Government Accounting Standards Board)
ī‚§ BCBS (Basel committee on Banking Supervision) and Basel II
ī‚§ CPG 235
ī‚§ PCI-DSS The Payment Card Industry Data Security Standards (PCI-DSS)
ī‚§ OGC Standards (Open Geospatial consortium), etc.
Data governance organizations work with other business and technical leadership to evaluate the implications of regulations.
The organization must determine, for example,
ī‚§ In what ways is a regulation relevant to the organization?
ī‚§ What constitutes compliance? What policies and procedures will be required to achieve compliance?
ī‚§ When is compliance required? How and when is compliance monitored?
ī‚§ Can the organization adopt industry standards to achieve compliance?
ī‚§ How is compliance demonstrated?
ī‚§ What is the risk of and penalty for non-compliance?
ī‚§ How is non-compliance identified and reported? How is non-compliance managed and rectified?
Data Governance
2.12 Implement Data Governance
ī‚§ Data governance cannot be implemented overnight. It requires planning
ī‚§ Implementation of DG includes many complex activities that need to be coordinated
ī‚§ It is best to create an implementation roadmap that illustrates the timeframes for and relationship between different
activities
ī‚§ In a federated DG organization, implementation in various lines of business can occur on different schedules, based on
their level of engagement and maturity, as well as funding
Prioritized DG activities in the early stages include the following:
ī‚§ Defining data governance procedures required to meet high priority goals
ī‚§ Establishing a business glossary and documenting terminology and standards
ī‚§ Coordinating with Enterprise Architecture and Data Architecture teams
ī‚§ Assigning financial value to data assets to enable better decision-making and to increase understanding of the role that
data plays in organizational success
Data Governance
2.13 Sponsor Data Standards and Procedures
ī‚§ A standard is defined as “something set up and established by authority as a rule for the measure of quantity, weight,
extent, value, or quality.”
ī‚§ Creating or adopting standards is often a politicized process and these goals get lost
ī‚§ DG standards define, how a field must be populated, rules governing the relationships between fields, detailed
documentation of acceptable and unacceptable values, format, etc.
ī‚§ Data standards should be reviewed, approved and adopted by the DGC, or a delegated workgroup, such as a Data
Standards Steering Committee
Examples of concepts that can be standardized within the Data Management Knowledge Areas include:
1. Data Architecture
2. Data Modeling and Design
3. Data Storage and Operations
4. Data Security
5. Data Integration
6. Documents and Content
7. Reference and Master Data
8. Data Warehousing and Business Intelligence
9. Metadata
10. Data Quality
11. Big Data & Data Science
Data Governance
2.14 Develop a Business Glossary
ī‚§ Data Stewards are generally responsible for business glossary content
ī‚§ A glossary is necessary because people use words differently
1. Enable common understanding of the core business concepts and terminology
2. Reduce the risk that data will be misused due to inconsistent understanding of the business concepts
3. Improve the alignment between technology assets and the business organization
4. Maximize search capability and enable access to documented institutional knowledge
Business glossaries have the following objectives:
A business glossary is not merely a list of terms and definitions. Each term will also be associated with other
valuable Metadata: synonyms, metrics, lineage, business rules, the steward responsible for the term, etc.
Data Governance
2.15 Coordinate with Architecture Groups
ī‚§ The Data Governance Council (DGC) sponsors and approves data architecture artifacts, such as a business-oriented
enterprise data model.
ī‚§ The DGC may appoint or interact with an Enterprise Data Architecture Steering Committee or Architecture Review Board
(ARB) to oversee the program and its iterative projects.
ī‚§ The enterprise data model should be developed and maintained jointly by data architects and Data Stewards working
together in subject area teams.
ī‚§ The enterprise data model should be reviewed, approved, and formally adopted by the DGC.
ī‚§ This approved model must align with key business strategies, processes, organizations, and systems
Data Governance
2.16 Sponsor Data Asset Valuation
ī‚§ Data and information are assets because they have or can create value
ī‚§ organizations finding it challenging to put monetary value on data
Data Governance
2.17 Embed Data Governance
ī‚§ One goal of the data governance organization is to embed in a range of processes behaviors related to managing
data as an asset
ī‚§ In order to deepen the organization’s understanding of data governance in general, its application locally, and to
learn from each other, create a Data Governance Community of Interest.
ī‚§ This type of community is particularly useful in the first years of governance, and will likely taper off as the DG
operations become mature.
Data Governance
TOOLS & TECHNIQUES
3
Data governance is fundamentally about organizational behavior.
This is not a problem that can be solved through technology.
Data Governance
3.1 Online Presence / Websites
3.2 Business Glossary
3.3 Workflow Tools
3.4 Document Management Tools
3.5 Data Governance Scorecards
Data Governance
IMPLEMENTATION GUIDELINES
4
Once the data governance program is defined, an operating plan is developed, and then an implementation roadmap
prepared with the support of information gathered in the data maturity assessment (see Chapter 15).
The organization can begin to implement processes and policies.
Most rollout strategies are incremental, either applying DG first to a large effort, such as MDM, or by a region or division.
Rarely is DG deployed enterprise-wide as a first effort.
Data Governance
4.1 Organization and Culture
ī‚§ Effective and long-lasting data governance programs require a cultural shift in organizational thinking and behavior about data
ī‚§ No matter how precise or exotic the data governance strategy, ignoring culture will diminish chances for success.
ī‚§ Focus on managing change must be part of the implementation strategy.
Data Governance
4.2 Adjustment and Communication
ī‚§ Data Governance programs are implemented incrementally within the context of a wider business and data
management strategy.
ī‚§ Tools required to manage and communicate changes include:
â€ĸ Business / DG strategy map
â€ĸ DG roadmap
â€ĸ Ongoing business case for DG
â€ĸ DG metrics
Data Governance
METRICS
5
Data Governance
ī‚§ DG program must be able to measure progress and success through metrics that demonstrate how DG participants have
added business value and attained objectives.
ī‚§ Measure progress of the rollout of data governance, compliance with the data governance requirements, and the value
data governance is bringing to the organization.
5 Metrics
ī‚§ Performance of policies
and processes
ī‚§ Conformance to
standards and procedures
Value Effectiveness Sustainability
ī‚§ Contributions to business
objectives
ī‚§ Reduction of risk
ī‚§ Improved efficiency in operations
ī‚§ Achievement of goals and
objectives
ī‚§ Extent stewards are using the
relevant tools
ī‚§ Effectiveness of communication
ī‚§ Effectiveness of education/training
ī‚§ Speed of change adoption
Data Governance
Data Architecture
CHAPTER-4
Data Architecture
INTRODUCTION
1
Data Architecture
1.Introduction
Architecture refers to an organized arrangement of component elements intended to optimize the function, performance,
feasibility, cost, and aesthetics of an overall structure or system.
Architecture practice is carried out at different levels within an organization (enterprise, domain, project, etc.) and with
different areas of focus (infrastructure, application, and data).
The three form the essential components of Data Architecture:
1. Data Architecture outcomes, such models, definitions and data flows on
various levels, usually referred as Data Architecture artifacts
2. Data Architecture activities, to form, deploy and fulfill Data Architecture
intentions
3. Data Architecture behavior, such as collaborations, mindsets, and skills
among the various roles that affect the enterprise’s Data Architecture
Data Architecture
The discipline of Enterprise Architecture encompasses domain architectures, including business,
data, application, and technology.
1.Introduction
ī‚§ The most detailed Data Architecture design document is a formal enterprise data model, containing data names,
ī‚§ comprehensive data and Metadata definitions, conceptual and logical entities and relationships, and business rules.
ī‚§ Architecture is most valuable when it fully supports the needs of the entire enterprise
ī‚§ Enterprise Data Architecture enables consistent data standardization and integration across the enterprise
Data Architecture
.
Identifying the data needs of the enterprise (regardless of structure), and designing and maintaining the master blueprints to
meet those needs. Using master blueprints to guide data integration, control data assets, and align data investments with
business strategy
Data Architecture Definition :
Data Architecture
1.1 Business Drivers
The goal of Data Architecture is to be a bridge between business strategy and technology execution. As part of Enterprise
Architecture, Data Architects:
ī‚§ Strategically prepare organizations to quickly evolve their products, services, and data to take advantage of business
opportunities inherent in emerging technologies
ī‚§ Translate business needs into data and system requirements so that processes consistently have the data they require
ī‚§ Manage complex data and information delivery throughout the enterprise
ī‚§ Facilitate alignment between Business and IT
ī‚§ Act as agents for change, transformation, and agility
Data Architecture
1.2 Data Architecture Outcomes and Practices
Primary Data Architecture outcomes include:
ī‚§ Data storage and processing requirements
ī‚§ Designs of structures and plans that meet the current
and long-term data requirements of the enterprise
To reach Data architecture goals, Data Architects define and
maintain specifications that:
ī‚§ Define the current state of data in the organization
ī‚§ Provide a standard business vocabulary for data and
components
ī‚§ Align Data Architecture with enterprise strategy and
business architecture
ī‚§ Express strategic data requirements
ī‚§ Outline high-level integrated designs to meet these
requirements
Data Architecture
1.2 Data Architecture Outcomes and Practices
An overall Data Architecture practice includes:
ī‚§ Using Data Architecture artifacts (master blueprints) to define data requirements, guide data integration, control data assets,
and align data investments with business strategy
ī‚§ Collaborating with, learning from and influencing various stakeholders that are engaged with improving the business or IT
systems development
ī‚§ Using Data Architecture to establish the semantics of an enterprise, via a common business vocabulary
Data Architecture
1.3 Essential Concepts
1.3.1 Enterprise Architecture Domains
ī‚§ Data Architecture operates in context of other
architecture domains, including business,
application, and technical architecture.
ī‚§ Each domain influences and put constraints on
the other domains.
1.3.2 Enterprise Architecture Frameworks
ī‚§ An architecture framework is a foundational
structure used to develop a broad range of related
architectures.
ī‚§ Architectural frameworks provide ways of thinking
about and understanding architecture.
Data Security
1.3.6.1 The Four A’s
1. Access
2. Audit
3. Authentication &
4. Authorization
Additional ‘E’ as Entitlement is added to the 4 ‘A’s
There are four main methods of encryption:
1. Hash (Message Digest 5 (MD5) and Secure Hashing Algorithm (SHA)
2. Symmetric
3. Private-key (Both the sender and the recipient must have the key to read the original data)
4. Public-key – The sender and the receiver have different keys
Document & Content Management
Managing the lifecycle of documents and records includes:
â€ĸInventory: Identification of existing and newly created documents / records.
â€ĸPolicy: Creation, approval, and enforcement of documents / records policies, including a document /records retention policy.
â€ĸClassification of documents / records.
â€ĸStorage: Short- and long-term storage of physical and electronic documents / records.
â€ĸRetrieval and Circulation: Allowing access to and circulation of documents / records in accordance with policies, security and
control standards, and legal requirements.
â€ĸPreservation and Disposal: Archiving and destroying documents / records according to organizational needs, statutes, and
regulations.
Generally Acceptable Recordkeeping PrinciplesÂŽ (GARP)45
Document & Content Management
Electronic Discovery Reference Model
ANSI Standard 859 has three levels of control of data, based on the criticality of the data and the perceived harm that would
occur if data were corrupted or otherwise unavailable:
1. Formal
2. Revision, and
3. Custody
Document & Content Management
Information Governance Frameworks
Information Governance Reference Model (IGRM)
Document & Content Management
Definition of an Ontology : A set of concepts and categories in a subject area or domain that shows
their properties and the relations between them.
Ontology asks What while metaphysics asks How?
Document & Content Management
Ontology is the study of? Being & Existence
Which is the best example of a Taxonomy? The Dewey Decimal System for libraries
A goal of 'Document and Content Management' is to ensure effective and efficient retrieval and use of:
Data & Information in unstructured formats.
Data lineage includes the data origin, what happens to it and where it moves over time. Data lineage gives
visibility while greatly simplifying the ability to trace errors back to the root cause in a data analytics process.
A process that tracks the movement of evidence through its collection, safeguarding, and analysis lifecycle
by documenting each person who handled the evidence, the date/time it was collected or transferred, and the
purpose for the transfer. (Chain of Custody)
Data Architecture
GARP- Generally Acceptable Recordkeeping Principles
ANSI 859 - XXXXXXXXXXXXXXXXXXXXX
Taxonomy: An umbrella term for any classification or controlled vocabulary is:
The MoSCoW method is a four-step approach to prioritizing which project requirements will provide the
best return on investment (ROI). MoSCoW stands for must have, should have, could have and will not
have -- the o's were added to make the acronym more pronounceable.
Data Architecture
The most meaningful relationship Label is: An order is composed of order lines
A Data steward is one who is :
1. Effective communicator
2. He / She works in association with the Data Owner to protect and enhance the data assets under his or her control
3. He / she works collaboratively across the organization with data stakeholders and others identifying data problems and issues
4. Is a recognized subject matter expert in the data subject area / business domain that he or she is responsible for
Data Architecture
Data Architecture
Data Architecture
Data Architecture
Data Architecture
Data Architecture
Data Architecture
Data Architecture
Data Architecture
Data Architecture
Data Governance_Notes.pptx
Data Governance_Notes.pptx
Data Governance_Notes.pptx
Data Governance_Notes.pptx
Data Governance_Notes.pptx
Data Governance_Notes.pptx
Data Governance_Notes.pptx
Data Governance_Notes.pptx
Data Governance_Notes.pptx
Data Governance_Notes.pptx
Data Governance_Notes.pptx
Data Governance_Notes.pptx
Data Governance_Notes.pptx
Data Governance_Notes.pptx
Data Governance_Notes.pptx
Data Governance_Notes.pptx
Data Governance_Notes.pptx
Data Governance_Notes.pptx
Data Governance_Notes.pptx
Data Governance_Notes.pptx
Data Governance_Notes.pptx
Data Governance_Notes.pptx
Data Governance_Notes.pptx
Data Governance_Notes.pptx
Data Governance_Notes.pptx
Data Governance_Notes.pptx
Data Governance_Notes.pptx
Data Governance_Notes.pptx
Data Governance_Notes.pptx
Data Governance_Notes.pptx
Data Governance_Notes.pptx
Data Governance_Notes.pptx
Data Governance_Notes.pptx
Data Governance_Notes.pptx
Data Governance_Notes.pptx
Data Governance_Notes.pptx
Data Governance_Notes.pptx
Data Governance_Notes.pptx
Data Governance_Notes.pptx
Data Governance_Notes.pptx
Data Governance_Notes.pptx
Data Governance_Notes.pptx
Data Governance_Notes.pptx
Data Governance_Notes.pptx
Data Governance_Notes.pptx
Data Governance_Notes.pptx
Data Governance_Notes.pptx
Data Governance_Notes.pptx
Data Governance_Notes.pptx
Data Governance_Notes.pptx
Data Governance_Notes.pptx
Data Governance_Notes.pptx
Data Governance_Notes.pptx
Data Governance_Notes.pptx

More Related Content

What's hot

Capability Model_Data Governance
Capability Model_Data GovernanceCapability Model_Data Governance
Capability Model_Data GovernanceSteve Novak
 
CDMP Overview Professional Information Management Certification
CDMP Overview Professional Information Management CertificationCDMP Overview Professional Information Management Certification
CDMP Overview Professional Information Management CertificationChristopher Bradley
 
DAMA Feb2015 Mastering Master Data
DAMA Feb2015 Mastering Master DataDAMA Feb2015 Mastering Master Data
DAMA Feb2015 Mastering Master DataMary Levins, PMP
 
Glossaries, Dictionaries, and Catalogs Result in Data Governance
Glossaries, Dictionaries, and Catalogs Result in Data GovernanceGlossaries, Dictionaries, and Catalogs Result in Data Governance
Glossaries, Dictionaries, and Catalogs Result in Data GovernanceDATAVERSITY
 
How to Make a Data Governance Program that Lasts
How to Make a Data Governance Program that LastsHow to Make a Data Governance Program that Lasts
How to Make a Data Governance Program that LastsDATAVERSITY
 
Data Governance
Data GovernanceData Governance
Data GovernanceRob Lux
 
Data Architecture Best Practices for Advanced Analytics
Data Architecture Best Practices for Advanced AnalyticsData Architecture Best Practices for Advanced Analytics
Data Architecture Best Practices for Advanced AnalyticsDATAVERSITY
 
Data Quality Best Practices
Data Quality Best PracticesData Quality Best Practices
Data Quality Best PracticesDATAVERSITY
 
Data Management Best Practices
Data Management Best PracticesData Management Best Practices
Data Management Best PracticesDATAVERSITY
 
How to Implement Data Governance Best Practice
How to Implement Data Governance Best PracticeHow to Implement Data Governance Best Practice
How to Implement Data Governance Best PracticeDATAVERSITY
 
DMBOK 2.0 and other frameworks including TOGAF & COBIT - keynote from DAMA Au...
DMBOK 2.0 and other frameworks including TOGAF & COBIT - keynote from DAMA Au...DMBOK 2.0 and other frameworks including TOGAF & COBIT - keynote from DAMA Au...
DMBOK 2.0 and other frameworks including TOGAF & COBIT - keynote from DAMA Au...Christopher Bradley
 
How to Build & Sustain a Data Governance Operating Model
How to Build & Sustain a Data Governance Operating Model How to Build & Sustain a Data Governance Operating Model
How to Build & Sustain a Data Governance Operating Model DATUM LLC
 
Collibra - Forrester Presentation : Data Governance 2.0
Collibra - Forrester Presentation : Data Governance 2.0Collibra - Forrester Presentation : Data Governance 2.0
Collibra - Forrester Presentation : Data Governance 2.0Guillaume LE GALIARD
 
Data Modeling & Metadata Management
Data Modeling & Metadata ManagementData Modeling & Metadata Management
Data Modeling & Metadata ManagementDATAVERSITY
 
Introduction to Data Governance
Introduction to Data GovernanceIntroduction to Data Governance
Introduction to Data GovernanceJohn Bao Vuu
 
Data Governance Best Practices
Data Governance Best PracticesData Governance Best Practices
Data Governance Best PracticesDATAVERSITY
 
Data Management vs Data Strategy
Data Management vs Data StrategyData Management vs Data Strategy
Data Management vs Data StrategyDATAVERSITY
 
Business Drivers Behind Data Governance
Business Drivers Behind Data GovernanceBusiness Drivers Behind Data Governance
Business Drivers Behind Data GovernancePrecisely
 
Data-Ed Webinar: Data Quality Engineering
Data-Ed Webinar: Data Quality EngineeringData-Ed Webinar: Data Quality Engineering
Data-Ed Webinar: Data Quality EngineeringDATAVERSITY
 
Data Management is Data Governance
Data Management is Data GovernanceData Management is Data Governance
Data Management is Data GovernanceDATAVERSITY
 

What's hot (20)

Capability Model_Data Governance
Capability Model_Data GovernanceCapability Model_Data Governance
Capability Model_Data Governance
 
CDMP Overview Professional Information Management Certification
CDMP Overview Professional Information Management CertificationCDMP Overview Professional Information Management Certification
CDMP Overview Professional Information Management Certification
 
DAMA Feb2015 Mastering Master Data
DAMA Feb2015 Mastering Master DataDAMA Feb2015 Mastering Master Data
DAMA Feb2015 Mastering Master Data
 
Glossaries, Dictionaries, and Catalogs Result in Data Governance
Glossaries, Dictionaries, and Catalogs Result in Data GovernanceGlossaries, Dictionaries, and Catalogs Result in Data Governance
Glossaries, Dictionaries, and Catalogs Result in Data Governance
 
How to Make a Data Governance Program that Lasts
How to Make a Data Governance Program that LastsHow to Make a Data Governance Program that Lasts
How to Make a Data Governance Program that Lasts
 
Data Governance
Data GovernanceData Governance
Data Governance
 
Data Architecture Best Practices for Advanced Analytics
Data Architecture Best Practices for Advanced AnalyticsData Architecture Best Practices for Advanced Analytics
Data Architecture Best Practices for Advanced Analytics
 
Data Quality Best Practices
Data Quality Best PracticesData Quality Best Practices
Data Quality Best Practices
 
Data Management Best Practices
Data Management Best PracticesData Management Best Practices
Data Management Best Practices
 
How to Implement Data Governance Best Practice
How to Implement Data Governance Best PracticeHow to Implement Data Governance Best Practice
How to Implement Data Governance Best Practice
 
DMBOK 2.0 and other frameworks including TOGAF & COBIT - keynote from DAMA Au...
DMBOK 2.0 and other frameworks including TOGAF & COBIT - keynote from DAMA Au...DMBOK 2.0 and other frameworks including TOGAF & COBIT - keynote from DAMA Au...
DMBOK 2.0 and other frameworks including TOGAF & COBIT - keynote from DAMA Au...
 
How to Build & Sustain a Data Governance Operating Model
How to Build & Sustain a Data Governance Operating Model How to Build & Sustain a Data Governance Operating Model
How to Build & Sustain a Data Governance Operating Model
 
Collibra - Forrester Presentation : Data Governance 2.0
Collibra - Forrester Presentation : Data Governance 2.0Collibra - Forrester Presentation : Data Governance 2.0
Collibra - Forrester Presentation : Data Governance 2.0
 
Data Modeling & Metadata Management
Data Modeling & Metadata ManagementData Modeling & Metadata Management
Data Modeling & Metadata Management
 
Introduction to Data Governance
Introduction to Data GovernanceIntroduction to Data Governance
Introduction to Data Governance
 
Data Governance Best Practices
Data Governance Best PracticesData Governance Best Practices
Data Governance Best Practices
 
Data Management vs Data Strategy
Data Management vs Data StrategyData Management vs Data Strategy
Data Management vs Data Strategy
 
Business Drivers Behind Data Governance
Business Drivers Behind Data GovernanceBusiness Drivers Behind Data Governance
Business Drivers Behind Data Governance
 
Data-Ed Webinar: Data Quality Engineering
Data-Ed Webinar: Data Quality EngineeringData-Ed Webinar: Data Quality Engineering
Data-Ed Webinar: Data Quality Engineering
 
Data Management is Data Governance
Data Management is Data GovernanceData Management is Data Governance
Data Management is Data Governance
 

Similar to Data Governance_Notes.pptx

Enterprise Data Governance for Financial Institutions
Enterprise Data Governance for Financial InstitutionsEnterprise Data Governance for Financial Institutions
Enterprise Data Governance for Financial InstitutionsSheldon McCarthy
 
Importance of Data Governance
Importance of Data GovernanceImportance of Data Governance
Importance of Data GovernanceHTS Hosting
 
The Key Reason Why Your DG Program is Failing
The Key Reason Why Your DG Program is FailingThe Key Reason Why Your DG Program is Failing
The Key Reason Why Your DG Program is FailingCCG
 
Data Governance Maturity Levels
Data Governance Maturity LevelsData Governance Maturity Levels
Data Governance Maturity LevelsSowmya Kandregula
 
chapter1-220725121543-7c158b33.pdf
chapter1-220725121543-7c158b33.pdfchapter1-220725121543-7c158b33.pdf
chapter1-220725121543-7c158b33.pdfMahmoudSOLIMAN380726
 
Chapter 1: The Importance of Data Assets
Chapter 1: The Importance of Data AssetsChapter 1: The Importance of Data Assets
Chapter 1: The Importance of Data AssetsAhmed Alorage
 
chapter3-220725142737-bf613658.pdf
chapter3-220725142737-bf613658.pdfchapter3-220725142737-bf613658.pdf
chapter3-220725142737-bf613658.pdfMahmoudSOLIMAN380726
 
Chapter 3: Data Governance
Chapter 3: Data Governance Chapter 3: Data Governance
Chapter 3: Data Governance Ahmed Alorage
 
Data mining & data warehousing (ppt)
Data mining & data warehousing (ppt)Data mining & data warehousing (ppt)
Data mining & data warehousing (ppt)Harish Chand
 
Defining and Applying Data Governance in Today’s Business Environment
Defining and Applying Data Governance in Today’s Business EnvironmentDefining and Applying Data Governance in Today’s Business Environment
Defining and Applying Data Governance in Today’s Business EnvironmentCaserta
 
ACCOUNTING-IT-APP-MIdterm Topic-Bigdata.pdf
ACCOUNTING-IT-APP-MIdterm Topic-Bigdata.pdfACCOUNTING-IT-APP-MIdterm Topic-Bigdata.pdf
ACCOUNTING-IT-APP-MIdterm Topic-Bigdata.pdfJerichoGerance
 
Metadata Repositories in Health Care - Master Data Management Approach to Met...
Metadata Repositories in Health Care - Master Data Management Approach to Met...Metadata Repositories in Health Care - Master Data Management Approach to Met...
Metadata Repositories in Health Care - Master Data Management Approach to Met...Health Informatics New Zealand
 
Data Governance Intro.pptx
Data Governance Intro.pptxData Governance Intro.pptx
Data Governance Intro.pptxBHARATH KUNAMNENI
 
From DQ to DG
From DQ to DGFrom DQ to DG
From DQ to DGJorge Garcia
 
How to Structure the Data Organization
How to Structure the Data OrganizationHow to Structure the Data Organization
How to Structure the Data OrganizationRobyn Bollhorst
 
Decoding the Role of a Data Engineer.pdf
Decoding the Role of a Data Engineer.pdfDecoding the Role of a Data Engineer.pdf
Decoding the Role of a Data Engineer.pdfDatavalley.ai
 
Handling and Processing Big Data
Handling and Processing Big DataHandling and Processing Big Data
Handling and Processing Big DataUmair Shafique
 
Information architecture overview
Information architecture overviewInformation architecture overview
Information architecture overviewJames M. Dey
 

Similar to Data Governance_Notes.pptx (20)

Enterprise Data Governance for Financial Institutions
Enterprise Data Governance for Financial InstitutionsEnterprise Data Governance for Financial Institutions
Enterprise Data Governance for Financial Institutions
 
Importance of Data Governance
Importance of Data GovernanceImportance of Data Governance
Importance of Data Governance
 
The Key Reason Why Your DG Program is Failing
The Key Reason Why Your DG Program is FailingThe Key Reason Why Your DG Program is Failing
The Key Reason Why Your DG Program is Failing
 
Data Governance Maturity Levels
Data Governance Maturity LevelsData Governance Maturity Levels
Data Governance Maturity Levels
 
chapter1-220725121543-7c158b33.pdf
chapter1-220725121543-7c158b33.pdfchapter1-220725121543-7c158b33.pdf
chapter1-220725121543-7c158b33.pdf
 
Chapter 1: The Importance of Data Assets
Chapter 1: The Importance of Data AssetsChapter 1: The Importance of Data Assets
Chapter 1: The Importance of Data Assets
 
chapter3-220725142737-bf613658.pdf
chapter3-220725142737-bf613658.pdfchapter3-220725142737-bf613658.pdf
chapter3-220725142737-bf613658.pdf
 
Chapter 3: Data Governance
Chapter 3: Data Governance Chapter 3: Data Governance
Chapter 3: Data Governance
 
Data mining & data warehousing (ppt)
Data mining & data warehousing (ppt)Data mining & data warehousing (ppt)
Data mining & data warehousing (ppt)
 
Defining and Applying Data Governance in Today’s Business Environment
Defining and Applying Data Governance in Today’s Business EnvironmentDefining and Applying Data Governance in Today’s Business Environment
Defining and Applying Data Governance in Today’s Business Environment
 
ACCOUNTING-IT-APP-MIdterm Topic-Bigdata.pdf
ACCOUNTING-IT-APP-MIdterm Topic-Bigdata.pdfACCOUNTING-IT-APP-MIdterm Topic-Bigdata.pdf
ACCOUNTING-IT-APP-MIdterm Topic-Bigdata.pdf
 
Critical Success Factors
Critical Success FactorsCritical Success Factors
Critical Success Factors
 
Metadata Repositories in Health Care - Master Data Management Approach to Met...
Metadata Repositories in Health Care - Master Data Management Approach to Met...Metadata Repositories in Health Care - Master Data Management Approach to Met...
Metadata Repositories in Health Care - Master Data Management Approach to Met...
 
Data Governance Intro.pptx
Data Governance Intro.pptxData Governance Intro.pptx
Data Governance Intro.pptx
 
From DQ to DG
From DQ to DGFrom DQ to DG
From DQ to DG
 
How to Structure the Data Organization
How to Structure the Data OrganizationHow to Structure the Data Organization
How to Structure the Data Organization
 
Decoding the Role of a Data Engineer.pdf
Decoding the Role of a Data Engineer.pdfDecoding the Role of a Data Engineer.pdf
Decoding the Role of a Data Engineer.pdf
 
BD1.pptx
BD1.pptxBD1.pptx
BD1.pptx
 
Handling and Processing Big Data
Handling and Processing Big DataHandling and Processing Big Data
Handling and Processing Big Data
 
Information architecture overview
Information architecture overviewInformation architecture overview
Information architecture overview
 

Recently uploaded

PKS-TGC-1084-630 - Stage 1 Proposal.pptx
PKS-TGC-1084-630 - Stage 1 Proposal.pptxPKS-TGC-1084-630 - Stage 1 Proposal.pptx
PKS-TGC-1084-630 - Stage 1 Proposal.pptxPramod Kumar Srivastava
 
20240419 - Measurecamp Amsterdam - SAM.pdf
20240419 - Measurecamp Amsterdam - SAM.pdf20240419 - Measurecamp Amsterdam - SAM.pdf
20240419 - Measurecamp Amsterdam - SAM.pdfHuman37
 
DBA Basics: Getting Started with Performance Tuning.pdf
DBA Basics: Getting Started with Performance Tuning.pdfDBA Basics: Getting Started with Performance Tuning.pdf
DBA Basics: Getting Started with Performance Tuning.pdfJohn Sterrett
 
04242024_CCC TUG_Joins and Relationships
04242024_CCC TUG_Joins and Relationships04242024_CCC TUG_Joins and Relationships
04242024_CCC TUG_Joins and Relationshipsccctableauusergroup
 
Effects of Smartphone Addiction on the Academic Performances of Grades 9 to 1...
Effects of Smartphone Addiction on the Academic Performances of Grades 9 to 1...Effects of Smartphone Addiction on the Academic Performances of Grades 9 to 1...
Effects of Smartphone Addiction on the Academic Performances of Grades 9 to 1...limedy534
 
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024thyngster
 
RA-11058_IRR-COMPRESS Do 198 series of 1998
RA-11058_IRR-COMPRESS Do 198 series of 1998RA-11058_IRR-COMPRESS Do 198 series of 1998
RA-11058_IRR-COMPRESS Do 198 series of 1998YohFuh
 
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM TRACKING WITH GOOGLE ANALYTICS.pptx
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM  TRACKING WITH GOOGLE ANALYTICS.pptxEMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM  TRACKING WITH GOOGLE ANALYTICS.pptx
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM TRACKING WITH GOOGLE ANALYTICS.pptxthyngster
 
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.ppt
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.pptdokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.ppt
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.pptSonatrach
 
Brighton SEO | April 2024 | Data Storytelling
Brighton SEO | April 2024 | Data StorytellingBrighton SEO | April 2024 | Data Storytelling
Brighton SEO | April 2024 | Data StorytellingNeil Barnes
 
9654467111 Call Girls In Munirka Hotel And Home Service
9654467111 Call Girls In Munirka Hotel And Home Service9654467111 Call Girls In Munirka Hotel And Home Service
9654467111 Call Girls In Munirka Hotel And Home ServiceSapana Sha
 
How we prevented account sharing with MFA
How we prevented account sharing with MFAHow we prevented account sharing with MFA
How we prevented account sharing with MFAAndrei Kaleshka
 
vip Sarai Rohilla Call Girls 9999965857 Call or WhatsApp Now Book
vip Sarai Rohilla Call Girls 9999965857 Call or WhatsApp Now Bookvip Sarai Rohilla Call Girls 9999965857 Call or WhatsApp Now Book
vip Sarai Rohilla Call Girls 9999965857 Call or WhatsApp Now Bookmanojkuma9823
 
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...dajasot375
 
Call Girls in Defence Colony Delhi đŸ’¯Call Us 🔝8264348440🔝
Call Girls in Defence Colony Delhi đŸ’¯Call Us 🔝8264348440🔝Call Girls in Defence Colony Delhi đŸ’¯Call Us 🔝8264348440🔝
Call Girls in Defence Colony Delhi đŸ’¯Call Us 🔝8264348440🔝soniya singh
 
办į†(Vancouveræ¯•ä¸šč¯äšĻ)加æ‹ŋ大渊å“ĨåŽå˛›å¤§å­Ļæ¯•ä¸šč¯æˆįģŠå•åŽŸį‰ˆä¸€æ¯”一
办į†(Vancouveræ¯•ä¸šč¯äšĻ)加æ‹ŋ大渊å“ĨåŽå˛›å¤§å­Ļæ¯•ä¸šč¯æˆįģŠå•åŽŸį‰ˆä¸€æ¯”一办į†(Vancouveræ¯•ä¸šč¯äšĻ)加æ‹ŋ大渊å“ĨåŽå˛›å¤§å­Ļæ¯•ä¸šč¯æˆįģŠå•åŽŸį‰ˆä¸€æ¯”一
办į†(Vancouveræ¯•ä¸šč¯äšĻ)加æ‹ŋ大渊å“ĨåŽå˛›å¤§å­Ļæ¯•ä¸šč¯æˆįģŠå•åŽŸį‰ˆä¸€æ¯”一F La
 
Kantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdf
Kantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdfKantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdf
Kantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdfSocial Samosa
 

Recently uploaded (20)

PKS-TGC-1084-630 - Stage 1 Proposal.pptx
PKS-TGC-1084-630 - Stage 1 Proposal.pptxPKS-TGC-1084-630 - Stage 1 Proposal.pptx
PKS-TGC-1084-630 - Stage 1 Proposal.pptx
 
20240419 - Measurecamp Amsterdam - SAM.pdf
20240419 - Measurecamp Amsterdam - SAM.pdf20240419 - Measurecamp Amsterdam - SAM.pdf
20240419 - Measurecamp Amsterdam - SAM.pdf
 
DBA Basics: Getting Started with Performance Tuning.pdf
DBA Basics: Getting Started with Performance Tuning.pdfDBA Basics: Getting Started with Performance Tuning.pdf
DBA Basics: Getting Started with Performance Tuning.pdf
 
E-Commerce Order PredictionShraddha Kamble.pptx
E-Commerce Order PredictionShraddha Kamble.pptxE-Commerce Order PredictionShraddha Kamble.pptx
E-Commerce Order PredictionShraddha Kamble.pptx
 
Call Girls in Saket 99530🔝 56974 Escort Service
Call Girls in Saket 99530🔝 56974 Escort ServiceCall Girls in Saket 99530🔝 56974 Escort Service
Call Girls in Saket 99530🔝 56974 Escort Service
 
04242024_CCC TUG_Joins and Relationships
04242024_CCC TUG_Joins and Relationships04242024_CCC TUG_Joins and Relationships
04242024_CCC TUG_Joins and Relationships
 
Effects of Smartphone Addiction on the Academic Performances of Grades 9 to 1...
Effects of Smartphone Addiction on the Academic Performances of Grades 9 to 1...Effects of Smartphone Addiction on the Academic Performances of Grades 9 to 1...
Effects of Smartphone Addiction on the Academic Performances of Grades 9 to 1...
 
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024
 
RA-11058_IRR-COMPRESS Do 198 series of 1998
RA-11058_IRR-COMPRESS Do 198 series of 1998RA-11058_IRR-COMPRESS Do 198 series of 1998
RA-11058_IRR-COMPRESS Do 198 series of 1998
 
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM TRACKING WITH GOOGLE ANALYTICS.pptx
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM  TRACKING WITH GOOGLE ANALYTICS.pptxEMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM  TRACKING WITH GOOGLE ANALYTICS.pptx
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM TRACKING WITH GOOGLE ANALYTICS.pptx
 
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.ppt
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.pptdokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.ppt
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.ppt
 
Brighton SEO | April 2024 | Data Storytelling
Brighton SEO | April 2024 | Data StorytellingBrighton SEO | April 2024 | Data Storytelling
Brighton SEO | April 2024 | Data Storytelling
 
9654467111 Call Girls In Munirka Hotel And Home Service
9654467111 Call Girls In Munirka Hotel And Home Service9654467111 Call Girls In Munirka Hotel And Home Service
9654467111 Call Girls In Munirka Hotel And Home Service
 
How we prevented account sharing with MFA
How we prevented account sharing with MFAHow we prevented account sharing with MFA
How we prevented account sharing with MFA
 
vip Sarai Rohilla Call Girls 9999965857 Call or WhatsApp Now Book
vip Sarai Rohilla Call Girls 9999965857 Call or WhatsApp Now Bookvip Sarai Rohilla Call Girls 9999965857 Call or WhatsApp Now Book
vip Sarai Rohilla Call Girls 9999965857 Call or WhatsApp Now Book
 
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...
 
Call Girls in Defence Colony Delhi đŸ’¯Call Us 🔝8264348440🔝
Call Girls in Defence Colony Delhi đŸ’¯Call Us 🔝8264348440🔝Call Girls in Defence Colony Delhi đŸ’¯Call Us 🔝8264348440🔝
Call Girls in Defence Colony Delhi đŸ’¯Call Us 🔝8264348440🔝
 
Deep Generative Learning for All - The Gen AI Hype (Spring 2024)
Deep Generative Learning for All - The Gen AI Hype (Spring 2024)Deep Generative Learning for All - The Gen AI Hype (Spring 2024)
Deep Generative Learning for All - The Gen AI Hype (Spring 2024)
 
办į†(Vancouveræ¯•ä¸šč¯äšĻ)加æ‹ŋ大渊å“ĨåŽå˛›å¤§å­Ļæ¯•ä¸šč¯æˆįģŠå•åŽŸį‰ˆä¸€æ¯”一
办į†(Vancouveræ¯•ä¸šč¯äšĻ)加æ‹ŋ大渊å“ĨåŽå˛›å¤§å­Ļæ¯•ä¸šč¯æˆįģŠå•åŽŸį‰ˆä¸€æ¯”一办į†(Vancouveræ¯•ä¸šč¯äšĻ)加æ‹ŋ大渊å“ĨåŽå˛›å¤§å­Ļæ¯•ä¸šč¯æˆįģŠå•åŽŸį‰ˆä¸€æ¯”一
办į†(Vancouveræ¯•ä¸šč¯äšĻ)加æ‹ŋ大渊å“ĨåŽå˛›å¤§å­Ļæ¯•ä¸šč¯æˆįģŠå•åŽŸį‰ˆä¸€æ¯”一
 
Kantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdf
Kantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdfKantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdf
Kantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdf
 

Data Governance_Notes.pptx

  • 3. 1. Introduction Data Governance (DG) is defined as the exercise of authority and control (planning, monitoring, and enforcement) over the management of data assets. The purpose of Data Governance is to ensure that data is managed properly, according to policies and best practices The scope and focus of a particular data governance program will depend on organizational needs, but most programs include: ī‚§ Strategy ī‚§ Policy ī‚§ Standards & Quality ī‚§ Oversight ī‚§ Compliance ī‚§ Issue Management ī‚§ Data Management Projects ī‚§ Data Asset Valuation Data Governance
  • 4. 2. Goals Within an organization, data management goals include: ī‚§ Understanding and supporting the information needs of the enterprise and its stakeholders, including customers, employees, and business partners ī‚§ Capturing, storing, protecting, and ensuring the integrity of data assets ī‚§ Ensuring the quality of data and information ī‚§ Ensuring the privacy and confidentiality of stakeholder data ī‚§ Preventing unauthorized or inappropriate access, manipulation, or use of data and information ī‚§ Ensuring data can be used effectively to add value to the enterprise Data Governance Business, Application, Information and Technology. BAIT According to the DMBoK, which is not a component of a Data Management strategy? â€ĸIdentifying individuals for Data Management roles (Correct) â€ĸA compelling vision for Data Management â€ĸDescriptions of Data Management roles and organizations, along with a summary of their responsibilities and decision rights â€ĸA summary business case for Data Management with selected examples â€ĸA draft implementation roadmap with projects and action items (Incorrect) Information Governance and Data Governance should be? â€ĸManaged as a single function (Correct) â€ĸManaged as separate functions â€ĸManaged as integrated functions, with Data Governance reporting to Information Governance â€ĸManaged as integrated functions, with Information Governance reporting to Data Governance â€ĸManaged by the Chief Information Office (Incorrect)
  • 5. Data Governance SMART is an acronym for objectives in projects and programs. SMART stands for? â€ĸSpecific, Manageable, Agile, Realistic, Topical â€ĸSystems, Management, Architecture, Resources, Technology â€ĸSpecific, Measurable, Achievable, Robust, Timely â€ĸStructured, Manageable, Accurate, Robust, Tested â€ĸSpecific, Measurable, Achievable, Realistic, Timely (Correct) Which of the following is NOT an approach to data valuation? â€ĸCost of obtaining and storing data â€ĸCost of replacing data if it were lost â€ĸWhat the data could be sold for â€ĸExpected revenue from innovative uses of data â€ĸEnterprise Data Modeling (Correct)
  • 6. Storage & Operations Atomicity: All operations are performed, or none of them is, so that if one part of the transaction fails,then the entire transaction fails. â€ĸConsistency: The transaction must meet all rules defined by the system at all times and must void half-completed transactions. â€ĸIsolation: Each transaction is independent unto itself. â€ĸDurability: Once complete, the transaction cannot be undone. Basically Available: The system guarantees some level of availability to the data even when there are node failures. The data may be stale, but the system will still give and accept responses. â€ĸSoft State: The data is in a constant state of flux; while a response may be given, the data is not guaranteed to be current. â€ĸEventual Consistency: The data will eventually be consistent through all nodes and in all databases, but not every transaction will be consistent at every moment.
  • 7. The CAP Theorem (or Brewer’s Theorem) was developed in response to a shift toward more distributed systems (Brewer, 2000). Consistency Availability Partition Tolerance Storage & Operations
  • 8. The CAP Theorem states that at most two of the three properties can exist in any shared-data system. This is usually stated with a ‘pick two’ statement, Storage & Operations
  • 9. In practice, row-oriented storage layouts are well suited for Online Transaction Processing (OLTP)-like workloads, which are more heavily loaded with interactive transactions. Column-oriented storage layouts are well suited for Online Analytical Processing (OLAP)-like workloads (e.g., data warehouses) which typically involve a smaller number of highly complex queries over all data (possibly terabytes). Triplestore : A data entity composed of subject-predicate-object is known as a triplestore. A triplestore is a purpose-built database for the storage and retrieval of triples in the form of 1. Subject 2. Predicate 3. Object expressions Sharding - Sharding is a process where small chunks of the database are isolated and can be updated independently of other shards, so replication is merely a file copy. Because the shards are small, refreshes/overwrites may be optimal. Storage & Operations
  • 10. Data Interoperability & Integration Data orchestration : Automates processes related to managing data, such as bringing data together from multiple sources, combining it, and preparing it for data analysis. It can also include tasks like provisioning resources and monitoring SCSI or Small Computer System Interface HDD Evolution ATA or Advanced Technology Attachment SATA or Serial Advanced Technology Attachment SDD Evolution
  • 11. Data Interoperability & Integration
  • 12. Data Interoperability & Integration
  • 14. 1.1 Business Drivers The most common driver for data governance is often regulatory compliance, especially for heavily regulated industries, such as financial services and healthcare. ī‚§ Data governance is not an end in itself. ī‚§ It needs to align directly with organizational strategy. ī‚§ The more clearly it helps solve organizational problems, the more likely people will change behaviors and adopt governance practices. ī‚§ Drivers for data governance most often focus on reducing risks or improving processes. Reducing Risks General risk management Data security Privacy Improving Processes Regulatory compliance Data quality improvement Metadata management Efficiency in development projects Vendor management Business drivers for data governance within an organization are to be aligned with overall business strategy. Data Governance
  • 15. ī‚§ The goal of Data Governance is to enable an organization to manage data as an asset. ī‚§ DG provides the principles, policy, processes, framework, metrics, and oversight to manage data as an asset and to guide data management activities at all levels. ī‚§ To achieve this overall goal, a DG program must be: ī‚§ Sustainable – Must be sticky, not like a project that has an end date ī‚§ Embedded – Incorporated into development methods of software, processes, MDM, risk management ī‚§ Measured – Measure start point and plan for measurable improvements (Financial and value wise) ī‚§ Leadership & Strategy – Require visionary and committed leadership ī‚§ Business driven – Must govern the IT decisions related to data and business ī‚§ Shared responsibility – Responsibility between data stewards and Tech data mgmt. professionals ī‚§ Multi-layers – Occurs both enterprise and local levels or levels in between. ī‚§ Framework-based – Requires operating framework that defines accountabilities & interactions. ī‚§ Principle-based - articulate a core set of principles and best practices as part of policy work. Guiding principles are the foundation of DG activities Sustainable Embedded Measured ī‚§ The following principles, developed since the early 2000s, can help set a strong foundation for data governance 1.2 Goals and Principles Data Governance
  • 16. 1.3 Essential Concepts Data governance represents an inherent separation of duty between oversight and execution Data Governance and Data Management 1.3.1 Data-centric Organization 1.3.2 Data Governance Organization Data Governance
  • 17. Data Governance Organization Parts Data Governance
  • 18. Typical Data Governance Committees / Bodies Data Governance
  • 19. 1.3.3 Data Governance Operating Model Types Data Governance
  • 20. Obstacles for Establishing Enterprise DG & DM vision: ī‚§ Existing Culture ī‚§ Internal politics ī‚§ Ambiguity about ownership ī‚§ Budgetary competition ī‚§ Legacy systems Data-centric Organization data stewards are responsible for defining and implementing policies and procedures for the day-to-day operational and administrative management of systems and data — including the intake, storage, processing, and transmission of data to internal and external systems. Data Governance
  • 21. Data Custodian & Data Steward Activities and accountabilities 1.3.4 Data Stewardship Data Stewardship is the most common label to describe accountability and responsibility for data and processes that ensure effective control and use of data assets. Data Governance
  • 22. 1.3.5 Types of Data Stewards ī‚§ Chief Data Stewards may chair data governance bodies in lieu of the CDO or may act as a CDO in a virtual (committee- based) or distributed data governance organization. They may also be Executive Sponsors. ī‚§ Executive Data Stewards are senior managers who serve on a Data Governance Council. ī‚§ Enterprise Data Stewards have oversight of a data domain across business functions. ī‚§ Business Data Stewards are business professionals, most often recognized subject matter experts, accountable for a subset of data. They work with stakeholders to define and control data. ī‚§ A Data Owner is a business Data Steward, who has approval authority for decisions about data within their domain. ī‚§ Technical Data Stewards are IT professionals operating within one of the Knowledge Areas, such as Data Integration Specialists, Database Administrators, Business Intelligence Specialists, Data Quality Analysts or Metadata Administrators. ī‚§ Coordinating Data Stewards lead and represent teams of business and technical Data Stewards in discussions across teams and with executive Data Stewards. Coordinating Data Stewards are particularly important in large organizations. (MCIT Role) Data Governance
  • 23. 1.3.6 Data Policies ī‚§ Data policies describe the ‘what’ of data governances (what to do and what not to do), while standards and procedures describe ‘how’ to do data governance. There should be relatively few data policies, and they should be stated briefly and directly. ī‚§ Data only brings value when it is used. ī‚§ The data is not to be viewed as temporary means to achieve results or merely as a business by-product ī‚§ Data policies are directives (Official or authoritative instructions) that codify principles and management intent into fundamental rules governing the creation, acquisition, integrity, security, quality, and use of data and information. 1.3.7 Data Asset Valuation Data Governance
  • 24. 1.3.7 Data Asset Valuation ī‚§ Data asset valuation is the process of understanding and calculating the economic value of data to an organization. ī‚§ Most phases of the data lifecycle involve costs (including acquiring, storing, administering, and disposing of data) ī‚§ Data only brings value when it is used.
  • 26. 2. Data Governance Activities 2.1 Define Data Governance for the Organization (that support business strategy & goals) 2.2 Perform readiness assessment (4 Subs) 2.3 Perform Discovery and Business Alignment 2.4 Develop Organizational Touch Points 2.5 Develop Data Governance Strategy 2.6 Define the DG Operating Framework 2.7 Develop Goals, Principles, and Policies 2.8 Underwrite Data Management Projects 2.9 Engage Change Management 2.10 Engage in Issue Management 2.11 Assess Regulatory Compliance Requirements 2.12 Implement Data Governance 2.13 Sponsor Data Standards and Procedures 2.14 Develop a Business Glossary 2.15 Coordinate with Architecture Groups 2.16 Sponsor Data Asset Valuation 2.17 Embed Data Governance DGs 17 Activities Data Governance
  • 27. Successful data governance requires a clear understanding of what is being governed and who is being governed, as well as who is governing. 1. What is being governed ? 2. Who is being governed ? 3. Who is governing ? 2.1 Define Data Governance for the Organization (that support business strategy & goals) Data governance is most effective when it is an enterprise effort, rather than isolated to a particular functional area. Data Governance
  • 28. 2.2 Perform readiness assessment (4 typical assessments) 1. Data Management Maturity (Measure current data management capabilities & capacity) 2. Capacity to change (Required for change behavior to adapt & will help identify potential resistance points) 3. Collaborative readiness 4. Business alignment Data Governance
  • 29. 2.3 Perform Discovery and Business Alignment Discovery activity will identify and assess the effectiveness of existing policies and guidelines – 1. What risks they address, 2. What behaviors they encourage, and 3. How well they have been implemented ī‚§ Discovery can also identify opportunities for DG to improve the usefulness of data and content. ī‚§ Business alignment attaches business benefits to DG program elements. ī‚§ Data Quality analysis is part of discovery ī‚§ Assessment of data management practices is another key aspect of the data governance discovery process. ī‚§ Derive a list of DG requirements from the discovery and alignment activities. For example, if regulatory risks generate a financial concern to the business, then specify DG activities that support risk management. These requirements will drive DG strategy and tactics. Data Governance
  • 30. 2.4 Develop Organizational Touch Points Touch points that support alignment and cohesiveness of an enterprise data governance and data management approach in areas outside the direct authority of the Chief Data Officer (CDO). ī‚§ Procurement and Contracts ī‚§ Budget and Funding ī‚§ Budget and Funding ī‚§ Regulatory Compliance ī‚§ SDLC / development framework The touch points that the CDO influences support the organization’s cohesiveness in managing its data, therefore, increasing its nimbleness to use its data. In essence, this is a vision of how DG will be perceived by the organization. Data Governance
  • 31. 2.5 Develop Data Governance Strategy ī‚§ DG strategy should be defined comprehensively and articulated in relation to the overall business strategy, as well as to data management and IT strategies. ī‚§ It should be implemented iteratively as the pieces are developed and approved. ī‚§ The deliverables for the DG Strategy include the following: 1. The Charter (Identify Business drivers, vision, mission, principles for DG including readiness, challenges, etc.) 2. Operating framework and accountabilities (Structure & responsibilities of DG activities) 3. Implementation Roadmap (Timeframe for policies, directives, standards, procedures, roll outs, etc.) 4. Plan for Operational success (Describe a target state for sustainable DG activities) Data Governance
  • 32. 2.6 Define DG Operating Framework Consider the following areas when constructing an organization’s operating model. 1. Value of data to the organization (Commercial data, Open data & Data as Operational lubricant) 2. Business model (Decentralized business vs. centralized, local vs. international) Links with specific IT strategy, Data Architecture, and application integration functions should be reflected in the target operating framework design. (As per adjacent figure) 3. Cultural factors (Acceptance & Resistance to change) 4. Impact of regulation ( Highly regulated organizations will have a different mindset and operating model of DG than those less regulated. There may be links to the Risk Management group or Legal as well) Data Governance
  • 33. 2.6 Define DG Operating Framework Layers of data governance are often part of the solution. This means determining where accountability should reside for stewardship activities, who owns the data, etc. The operating model also defines the interaction between the governance organization and the people responsible for data management projects or initiatives, the engagement of change management activities to introduce this new program, and the model for issue management resolution pathways through governance. The example is illustrative. This kind of artifact must be customized to meet the needs of a specific organization. Data Governance
  • 34. 2.7 Develop Goals, Principles, and Policies Development of goals, principles, and policies derived from the Data Governance Strategy will guide the organization into the desired future state. Goals, principles, and policies are typically drafted by either by data management professionals, business policy staff, or a combination, under the auspices of data governance. Next, Data Stewards and management review and refine them. Then, the Data Governance Council (or similar body) conducts the final review, revision, and adoption. Policies may take different shapes, as in the following examples: ī‚§ The Data Governance Office (DGO) will certify data for use by the organization. ī‚§ Business owners will be approved by the Data Governance Office. ī‚§ Business owners will designate Data Stewards from their business capability areas. The Data Stewards will have day-to- day responsibility for coordinating data governance activities. ī‚§ Whenever possible, standardized reporting and/or dashboards/scorecards will be made available to serve the majority of business needs. ī‚§ Certified Users will be granted access to Certified Data for ad hoc /non-standard reporting. ī‚§ All certified data will be evaluated on a regular basis to assess its accuracy, completeness, consistency, accessibility, uniqueness, compliance, and efficiency. Data policies must be effectively communicated, monitored, enforced, and periodically re-evaluated. The Data Governance Council may delegate this authority to the Data Stewardship Steering Committee. Data Governance
  • 35. 2.8 Underwrite Data Management Projects ī‚§ Initiatives to improve data management capabilities provide enterprise-wide benefits ī‚§ These usually require cross-functional sponsorship or visibility from the DGC ī‚§ Data management projects may be considered part of the overall IT project portfolio ī‚§ Data management activity in other projects must be accommodated by the internal SDLC, service delivery management, other Information Technology Infrastructure Library (ITIL) components, and PMO processes Data Governance
  • 36. 2.9 Engage Change Management Organizational Change Management (OCM) is the vehicle for bringing about change in an organization’s systems and processes. Organizations should create a team responsible for: ī‚§ Planning ī‚§ Training ī‚§ Influencing systems development ī‚§ Policy implementation ī‚§ Communications A change management program supporting formal Data Governance should focus communications on: ī‚§ Promoting the value of data assets: ī‚§ Monitoring and acting on feedback about data governance activities ī‚§ Implementing data management training ī‚§ Measuring the effects of change management on in five key areas ī‚§ Implementing new metrics and KPIs 1. Awareness of the need to change 2. Desire to participate and support the change 3. Knowledge about how to change 4. Ability to implement new skills and behaviors 5. Reinforcement to keep the change in place Data Governance
  • 37. 2.10 Engage in Issue Management Issue management is the process for identifying, quantifying, prioritizing, and resolving data governance-related issues, including: ī‚§ Authority ī‚§ Change management escalations ī‚§ Compliance ī‚§ Conflicts ī‚§ Conformance ī‚§ Contracts ī‚§ Data security and identity ī‚§ Data quality A Data Governance Scorecard can be used to identify trends related to issues, such as where within the organization they occur, what their root causes are, etc. Issues that cannot be resolved by the DGC should be escalated to corporate governance and / or management. Data Issue Escalation Path Data Governance
  • 38. 2.11 Assess Regulatory Compliance Requirements Regulatory compliance is often the initial reason for implementing data governance Several global regulations have significant implications on data management practices. For example: ī‚§ Accounting Standards (The Government Accounting Standards Board) ī‚§ BCBS (Basel committee on Banking Supervision) and Basel II ī‚§ CPG 235 ī‚§ PCI-DSS The Payment Card Industry Data Security Standards (PCI-DSS) ī‚§ OGC Standards (Open Geospatial consortium), etc. Data governance organizations work with other business and technical leadership to evaluate the implications of regulations. The organization must determine, for example, ī‚§ In what ways is a regulation relevant to the organization? ī‚§ What constitutes compliance? What policies and procedures will be required to achieve compliance? ī‚§ When is compliance required? How and when is compliance monitored? ī‚§ Can the organization adopt industry standards to achieve compliance? ī‚§ How is compliance demonstrated? ī‚§ What is the risk of and penalty for non-compliance? ī‚§ How is non-compliance identified and reported? How is non-compliance managed and rectified? Data Governance
  • 39. 2.12 Implement Data Governance ī‚§ Data governance cannot be implemented overnight. It requires planning ī‚§ Implementation of DG includes many complex activities that need to be coordinated ī‚§ It is best to create an implementation roadmap that illustrates the timeframes for and relationship between different activities ī‚§ In a federated DG organization, implementation in various lines of business can occur on different schedules, based on their level of engagement and maturity, as well as funding Prioritized DG activities in the early stages include the following: ī‚§ Defining data governance procedures required to meet high priority goals ī‚§ Establishing a business glossary and documenting terminology and standards ī‚§ Coordinating with Enterprise Architecture and Data Architecture teams ī‚§ Assigning financial value to data assets to enable better decision-making and to increase understanding of the role that data plays in organizational success Data Governance
  • 40. 2.13 Sponsor Data Standards and Procedures ī‚§ A standard is defined as “something set up and established by authority as a rule for the measure of quantity, weight, extent, value, or quality.” ī‚§ Creating or adopting standards is often a politicized process and these goals get lost ī‚§ DG standards define, how a field must be populated, rules governing the relationships between fields, detailed documentation of acceptable and unacceptable values, format, etc. ī‚§ Data standards should be reviewed, approved and adopted by the DGC, or a delegated workgroup, such as a Data Standards Steering Committee Examples of concepts that can be standardized within the Data Management Knowledge Areas include: 1. Data Architecture 2. Data Modeling and Design 3. Data Storage and Operations 4. Data Security 5. Data Integration 6. Documents and Content 7. Reference and Master Data 8. Data Warehousing and Business Intelligence 9. Metadata 10. Data Quality 11. Big Data & Data Science Data Governance
  • 41. 2.14 Develop a Business Glossary ī‚§ Data Stewards are generally responsible for business glossary content ī‚§ A glossary is necessary because people use words differently 1. Enable common understanding of the core business concepts and terminology 2. Reduce the risk that data will be misused due to inconsistent understanding of the business concepts 3. Improve the alignment between technology assets and the business organization 4. Maximize search capability and enable access to documented institutional knowledge Business glossaries have the following objectives: A business glossary is not merely a list of terms and definitions. Each term will also be associated with other valuable Metadata: synonyms, metrics, lineage, business rules, the steward responsible for the term, etc. Data Governance
  • 42. 2.15 Coordinate with Architecture Groups ī‚§ The Data Governance Council (DGC) sponsors and approves data architecture artifacts, such as a business-oriented enterprise data model. ī‚§ The DGC may appoint or interact with an Enterprise Data Architecture Steering Committee or Architecture Review Board (ARB) to oversee the program and its iterative projects. ī‚§ The enterprise data model should be developed and maintained jointly by data architects and Data Stewards working together in subject area teams. ī‚§ The enterprise data model should be reviewed, approved, and formally adopted by the DGC. ī‚§ This approved model must align with key business strategies, processes, organizations, and systems Data Governance
  • 43. 2.16 Sponsor Data Asset Valuation ī‚§ Data and information are assets because they have or can create value ī‚§ organizations finding it challenging to put monetary value on data Data Governance
  • 44. 2.17 Embed Data Governance ī‚§ One goal of the data governance organization is to embed in a range of processes behaviors related to managing data as an asset ī‚§ In order to deepen the organization’s understanding of data governance in general, its application locally, and to learn from each other, create a Data Governance Community of Interest. ī‚§ This type of community is particularly useful in the first years of governance, and will likely taper off as the DG operations become mature. Data Governance
  • 45. TOOLS & TECHNIQUES 3 Data governance is fundamentally about organizational behavior. This is not a problem that can be solved through technology. Data Governance
  • 46. 3.1 Online Presence / Websites 3.2 Business Glossary 3.3 Workflow Tools 3.4 Document Management Tools 3.5 Data Governance Scorecards Data Governance
  • 47. IMPLEMENTATION GUIDELINES 4 Once the data governance program is defined, an operating plan is developed, and then an implementation roadmap prepared with the support of information gathered in the data maturity assessment (see Chapter 15). The organization can begin to implement processes and policies. Most rollout strategies are incremental, either applying DG first to a large effort, such as MDM, or by a region or division. Rarely is DG deployed enterprise-wide as a first effort. Data Governance
  • 48. 4.1 Organization and Culture ī‚§ Effective and long-lasting data governance programs require a cultural shift in organizational thinking and behavior about data ī‚§ No matter how precise or exotic the data governance strategy, ignoring culture will diminish chances for success. ī‚§ Focus on managing change must be part of the implementation strategy. Data Governance
  • 49. 4.2 Adjustment and Communication ī‚§ Data Governance programs are implemented incrementally within the context of a wider business and data management strategy. ī‚§ Tools required to manage and communicate changes include: â€ĸ Business / DG strategy map â€ĸ DG roadmap â€ĸ Ongoing business case for DG â€ĸ DG metrics Data Governance
  • 51. ī‚§ DG program must be able to measure progress and success through metrics that demonstrate how DG participants have added business value and attained objectives. ī‚§ Measure progress of the rollout of data governance, compliance with the data governance requirements, and the value data governance is bringing to the organization. 5 Metrics ī‚§ Performance of policies and processes ī‚§ Conformance to standards and procedures Value Effectiveness Sustainability ī‚§ Contributions to business objectives ī‚§ Reduction of risk ī‚§ Improved efficiency in operations ī‚§ Achievement of goals and objectives ī‚§ Extent stewards are using the relevant tools ī‚§ Effectiveness of communication ī‚§ Effectiveness of education/training ī‚§ Speed of change adoption Data Governance
  • 54. 1.Introduction Architecture refers to an organized arrangement of component elements intended to optimize the function, performance, feasibility, cost, and aesthetics of an overall structure or system. Architecture practice is carried out at different levels within an organization (enterprise, domain, project, etc.) and with different areas of focus (infrastructure, application, and data). The three form the essential components of Data Architecture: 1. Data Architecture outcomes, such models, definitions and data flows on various levels, usually referred as Data Architecture artifacts 2. Data Architecture activities, to form, deploy and fulfill Data Architecture intentions 3. Data Architecture behavior, such as collaborations, mindsets, and skills among the various roles that affect the enterprise’s Data Architecture Data Architecture The discipline of Enterprise Architecture encompasses domain architectures, including business, data, application, and technology.
  • 55. 1.Introduction ī‚§ The most detailed Data Architecture design document is a formal enterprise data model, containing data names, ī‚§ comprehensive data and Metadata definitions, conceptual and logical entities and relationships, and business rules. ī‚§ Architecture is most valuable when it fully supports the needs of the entire enterprise ī‚§ Enterprise Data Architecture enables consistent data standardization and integration across the enterprise Data Architecture . Identifying the data needs of the enterprise (regardless of structure), and designing and maintaining the master blueprints to meet those needs. Using master blueprints to guide data integration, control data assets, and align data investments with business strategy Data Architecture Definition :
  • 56. Data Architecture 1.1 Business Drivers The goal of Data Architecture is to be a bridge between business strategy and technology execution. As part of Enterprise Architecture, Data Architects: ī‚§ Strategically prepare organizations to quickly evolve their products, services, and data to take advantage of business opportunities inherent in emerging technologies ī‚§ Translate business needs into data and system requirements so that processes consistently have the data they require ī‚§ Manage complex data and information delivery throughout the enterprise ī‚§ Facilitate alignment between Business and IT ī‚§ Act as agents for change, transformation, and agility
  • 57. Data Architecture 1.2 Data Architecture Outcomes and Practices Primary Data Architecture outcomes include: ī‚§ Data storage and processing requirements ī‚§ Designs of structures and plans that meet the current and long-term data requirements of the enterprise To reach Data architecture goals, Data Architects define and maintain specifications that: ī‚§ Define the current state of data in the organization ī‚§ Provide a standard business vocabulary for data and components ī‚§ Align Data Architecture with enterprise strategy and business architecture ī‚§ Express strategic data requirements ī‚§ Outline high-level integrated designs to meet these requirements
  • 58. Data Architecture 1.2 Data Architecture Outcomes and Practices An overall Data Architecture practice includes: ī‚§ Using Data Architecture artifacts (master blueprints) to define data requirements, guide data integration, control data assets, and align data investments with business strategy ī‚§ Collaborating with, learning from and influencing various stakeholders that are engaged with improving the business or IT systems development ī‚§ Using Data Architecture to establish the semantics of an enterprise, via a common business vocabulary
  • 59. Data Architecture 1.3 Essential Concepts 1.3.1 Enterprise Architecture Domains ī‚§ Data Architecture operates in context of other architecture domains, including business, application, and technical architecture. ī‚§ Each domain influences and put constraints on the other domains. 1.3.2 Enterprise Architecture Frameworks ī‚§ An architecture framework is a foundational structure used to develop a broad range of related architectures. ī‚§ Architectural frameworks provide ways of thinking about and understanding architecture.
  • 60. Data Security 1.3.6.1 The Four A’s 1. Access 2. Audit 3. Authentication & 4. Authorization Additional ‘E’ as Entitlement is added to the 4 ‘A’s There are four main methods of encryption: 1. Hash (Message Digest 5 (MD5) and Secure Hashing Algorithm (SHA) 2. Symmetric 3. Private-key (Both the sender and the recipient must have the key to read the original data) 4. Public-key – The sender and the receiver have different keys
  • 61. Document & Content Management Managing the lifecycle of documents and records includes: â€ĸInventory: Identification of existing and newly created documents / records. â€ĸPolicy: Creation, approval, and enforcement of documents / records policies, including a document /records retention policy. â€ĸClassification of documents / records. â€ĸStorage: Short- and long-term storage of physical and electronic documents / records. â€ĸRetrieval and Circulation: Allowing access to and circulation of documents / records in accordance with policies, security and control standards, and legal requirements. â€ĸPreservation and Disposal: Archiving and destroying documents / records according to organizational needs, statutes, and regulations. Generally Acceptable Recordkeeping PrinciplesÂŽ (GARP)45
  • 62. Document & Content Management Electronic Discovery Reference Model
  • 63. ANSI Standard 859 has three levels of control of data, based on the criticality of the data and the perceived harm that would occur if data were corrupted or otherwise unavailable: 1. Formal 2. Revision, and 3. Custody Document & Content Management
  • 64. Information Governance Frameworks Information Governance Reference Model (IGRM) Document & Content Management
  • 65. Definition of an Ontology : A set of concepts and categories in a subject area or domain that shows their properties and the relations between them. Ontology asks What while metaphysics asks How? Document & Content Management Ontology is the study of? Being & Existence Which is the best example of a Taxonomy? The Dewey Decimal System for libraries A goal of 'Document and Content Management' is to ensure effective and efficient retrieval and use of: Data & Information in unstructured formats. Data lineage includes the data origin, what happens to it and where it moves over time. Data lineage gives visibility while greatly simplifying the ability to trace errors back to the root cause in a data analytics process. A process that tracks the movement of evidence through its collection, safeguarding, and analysis lifecycle by documenting each person who handled the evidence, the date/time it was collected or transferred, and the purpose for the transfer. (Chain of Custody)
  • 66. Data Architecture GARP- Generally Acceptable Recordkeeping Principles ANSI 859 - XXXXXXXXXXXXXXXXXXXXX Taxonomy: An umbrella term for any classification or controlled vocabulary is: The MoSCoW method is a four-step approach to prioritizing which project requirements will provide the best return on investment (ROI). MoSCoW stands for must have, should have, could have and will not have -- the o's were added to make the acronym more pronounceable.
  • 67. Data Architecture The most meaningful relationship Label is: An order is composed of order lines A Data steward is one who is : 1. Effective communicator 2. He / She works in association with the Data Owner to protect and enhance the data assets under his or her control 3. He / she works collaboratively across the organization with data stakeholders and others identifying data problems and issues 4. Is a recognized subject matter expert in the data subject area / business domain that he or she is responsible for