More Related Content Similar to Getting Started with Data Governance? Use Process Models! (20) More from DATAVERSITY (20) Getting Started with Data Governance? Use Process Models!1. © 2016 IDERA, Inc. All rights reserved.
Proprietary and confidential.
© 2017 IDERA, Inc. All rights reserved.
Proprietary and confidential.
GETTING STARTED WITH DATA
GOVERNANCE?
Use Process Models
Kim Brushaber, IDERA, Senior Product Manager
2. 2© 2017 IDERA, Inc. All rights reserved. Proprietary and confidential.
WHAT IS DATA GOVERNANCE
The official definition
3. “
3© 2017 IDERA, Inc. All rights reserved. Proprietary and confidential.
Data Governance is a system of decision rights and
accountabilities for information-related processes, executed
according to agreed-upon models which describe who can
take what actions with what information, and when, under
what circumstances, using what methods.
– Data Governance Institue
4. “
4© 2017 IDERA, Inc. All rights reserved. Proprietary and confidential.
Over 90% of all the data in the world was created in the
past 2 years.
- IBM
5. “
5© 2017 IDERA, Inc. All rights reserved. Proprietary and confidential.
Around 100 hours of video are uploaded to YouTube
every minute and it would take you around 15 years
to watch every video uploaded by users in one day
- YouTube
6. 6© 2017 IDERA, Inc. All rights reserved. Proprietary and confidential.
WHY IS DATA GOVERNANCE IMPORTANT
Things you should be thinking about
7. 7© 2016 IDERA, Inc. All rights reserved. Proprietary and confidential. 7© 2017 IDERA, Inc. All rights reserved. Proprietary and confidential.
REGULATORY STANDARDS INFLUENCING DATA GOVERNANCE
Data Governance is essential for companies working in highly regulated
industries
Sarbanes-Oxley (Accounting and Finance)
Basel I, II and III (Banking)
HIPAA (Healthcare)
GDPR (Data Protection)
8. 8© 2016 IDERA, Inc. All rights reserved. Proprietary and confidential. 8© 2017 IDERA, Inc. All rights reserved. Proprietary and confidential.
5 TYPES OF DATA
BIG DATA – Predictive Analytics
FAST DATA – Information that can be quickly analyzed (e.g. coupon upon
checkout)
DARK DATA – Information that you can’t easily access (e.g. Videos)
LOST DATA – Information that is collected but never reviewed
NEW DATA – Information that you could have but aren’t harvesting
* Discussed in the Forbes Article from March 2016 - https://www.forbes.com/sites/michaelkanellos/2016/03/11/the-five-different-types-of-big-data
9. 9© 2016 IDERA, Inc. All rights reserved. Proprietary and confidential. 9© 2017 IDERA, Inc. All rights reserved. Proprietary and confidential.
COMPANY DATASETS TO CONSIDER
Marketing Analytics/Demographics
Product Information
Regulated Information
Operational Data
Financial Data
HR Data
Legal Data
10. 10© 2016 IDERA, Inc. All rights reserved. Proprietary and confidential. 10© 2017 IDERA, Inc. All rights reserved. Proprietary and confidential.
BENEFITS OF DATA GOVERNANCE
Increasing consistency and confidence in making data decision
Decreasing the risk of regulatory fines
Improving data security
Maximizing the revenue generation potential of data
Designating accountability for information quality
Enabling better data planning
Reducing data redundancy
11. 11© 2016 IDERA, Inc. All rights reserved. Proprietary and confidential. 11© 2017 IDERA, Inc. All rights reserved. Proprietary and confidential.
POOR DATA GOVERNANCE CAN RESULT IN:
Lawsuits
Regulatory Fines
Security Breaches
Data Regulated risks that can be expensive and damaging to a
company’s reputation
Legal Discovery – allowing too much information to be handed over
12. 12© 2016 IDERA, Inc. All rights reserved. Proprietary and confidential. 12© 2017 IDERA, Inc. All rights reserved. Proprietary and confidential.
CHALLENGES WHEN DATA IS NOT UNDERSTOOD
Data is thrown into a data lake waiting to be used one day
Data is thrown out and discovered what is needed later
If you don’t have Data Governance, increasing the scope and
scale of data just breeds confusion
13. 13© 2016 IDERA, Inc. All rights reserved. Proprietary and confidential. 13© 2017 IDERA, Inc. All rights reserved. Proprietary and confidential.
WHY IS THERE SO MUCH ANGST?
Outside regulations don’t provide guidance on how to handle the
data, leaving companies to figure it out for themselves
Most companies compensate by archiving all of their data on central
file servers without understanding what they have or need (leaving
themselves open to greater risk)
Companies tend to ignore data points that live outside their firewalls
In most organizations, data quality is siloed and poor to begin with
Data become fragmented, inconsistent and redundant
14. “
14© 2017 IDERA, Inc. All rights reserved. Proprietary and confidential.
Google alone processes on average over 40 thousand search
queries per second, making it almost 4 billion in a single day
- InternetLiveStats.com
15. “
15© 2017 IDERA, Inc. All rights reserved. Proprietary and confidential.
The number of Bits of information stored in the
digital universe is thought to have exceeded the
number of stars in the physical universe in 2007.
- Computerworld
16. 16© 2017 IDERA, Inc. All rights reserved. Proprietary and confidential.
PILLARS OF DATA GOVERNANCE
The foundation of a good Data Governance Program
17. 17© 2016 IDERA, Inc. All rights reserved. Proprietary and confidential. 17© 2017 IDERA, Inc. All rights reserved. Proprietary and confidential.
DATA GOVERNANCE PILLARS
Data Governance
DataQuality
DataDefinitions
DataAccess
Data ArchitectureDataLineage
DataModeling
18. 18© 2016 IDERA, Inc. All rights reserved. Proprietary and confidential. 18© 2017 IDERA, Inc. All rights reserved. Proprietary and confidential.
DATA GOVERNANCE PILLARS
Data Quality
Data Definitions
Data Lineage
Data Modeling
Data Access
19. 19© 2016 IDERA, Inc. All rights reserved. Proprietary and confidential. 19© 2017 IDERA, Inc. All rights reserved. Proprietary and confidential.
DATA QUALITY QUESTIONS
How can you improve and maintain the quality of your data?
How do you measure the quality of your data?
What is the current condition of your data?
How trustworthy is your data?
How accurate does your data need to be?
How well does the data align with your corporate and regulatory
policies?
How do you identify issues with your data?
How do you fix your data once you determine it is broken?
How do we develop strong data quality parameters that are
consistent and repeatable?
20. 20© 2016 IDERA, Inc. All rights reserved. Proprietary and confidential. 20© 2017 IDERA, Inc. All rights reserved. Proprietary and confidential.
DATA GOVERNANCE PILLARS
Data Quality
Data Definitions
Data Lineage
Data Modeling
Data Access
21. 21© 2016 IDERA, Inc. All rights reserved. Proprietary and confidential. 21© 2017 IDERA, Inc. All rights reserved. Proprietary and confidential.
DATA DEFINITION QUESTIONS
How do you define your data?
How is your data mapped?
What does your data mean?
Do you have consistent definitions across your organization?
Are you in alignment with terms and lexicons?
How do you find the right elements to interact with?
22. 22© 2016 IDERA, Inc. All rights reserved. Proprietary and confidential. 22© 2017 IDERA, Inc. All rights reserved. Proprietary and confidential.
DATA GOVERNANCE PILLARS
Data Quality
Data Definitions
Data Lineage
Data Modeling
Data Access
23. 23© 2016 IDERA, Inc. All rights reserved. Proprietary and confidential. 23© 2017 IDERA, Inc. All rights reserved. Proprietary and confidential.
DATA LINEAGE QUESTIONS
What happens to your data over time?
How is your data used?
What can the data be used for?
Where can the data be used?
What does the data produce?
What does the data consume?
What rules does it follow?
What associations are there?
24. 24© 2016 IDERA, Inc. All rights reserved. Proprietary and confidential. 24© 2017 IDERA, Inc. All rights reserved. Proprietary and confidential.
DATA GOVERNANCE PILLARS
Data Quality
Data Definitions
Data Lineage
Data Modeling
Data Access
25. 25© 2016 IDERA, Inc. All rights reserved. Proprietary and confidential. 25© 2017 IDERA, Inc. All rights reserved. Proprietary and confidential.
DATA MODELING QUESTIONS
What does your data look like?
What controls and audits are put in place to ensure compliance?
What meta data needs to be captured?
Are there places you can reduce redundancy?
Is your data consistent?
26. 26© 2016 IDERA, Inc. All rights reserved. Proprietary and confidential. 26© 2017 IDERA, Inc. All rights reserved. Proprietary and confidential.
DATA GOVERNANCE PILLARS
Data Quality
Data Definitions
Data Lineage
Data Modeling
Data Access
27. 27© 2016 IDERA, Inc. All rights reserved. Proprietary and confidential. 27© 2017 IDERA, Inc. All rights reserved. Proprietary and confidential.
DATA ACCESS QUESTIONS
Who can access your data?
How is your data protected?
How is your data stored?
How is your data managed?
Who can influence your data?
28. “
28© 2017 IDERA, Inc. All rights reserved. Proprietary and confidential.
If you burned all of the data created in just one
day onto DVDs you could stack them on top of
each other and reach the moon – twice.
- Computerworld
29. “
29© 2017 IDERA, Inc. All rights reserved. Proprietary and confidential.
This year, there will be over 1.2 billion smart phones
in the world (which are stuffed full of sensors and
data collection features) and the growth is predicted
to continue.
- ZDNet
30. 30© 2017 IDERA, Inc. All rights reserved. Proprietary and confidential.
HOW PROCESS MODELS CAN HELP
Visualizing what is happening with your data
31. 31© 2016 IDERA, Inc. All rights reserved. Proprietary and confidential. 31© 2017 IDERA, Inc. All rights reserved. Proprietary and confidential.
IDERA ER/STUDIO BUSINESS ARCHITECT ELEMENTS
32. 32© 2016 IDERA, Inc. All rights reserved. Proprietary and confidential. 32© 2017 IDERA, Inc. All rights reserved. Proprietary and confidential.
PROCESS MODELS FOR
Who Can Access Which Data?
Who Can Make Decisions?
Who Is Accountable for Which Information?
Who Can Act On The Data?
When Can They Take These Actions?
Which Info Can You Use?
33. 33© 2016 IDERA, Inc. All rights reserved. Proprietary and confidential. 33© 2017 IDERA, Inc. All rights reserved. Proprietary and confidential.
WHO CAN ACCESS WHICH DATA?
34. 34© 2016 IDERA, Inc. All rights reserved. Proprietary and confidential. 34© 2017 IDERA, Inc. All rights reserved. Proprietary and confidential.
PROCESS MODELS FOR
Who Can Access Which Data?
Who Can Make Decisions?
Who Is Accountable for Which Information?
Who Can Act On The Data?
When Can They Take These Actions?
Which Info Can You Use?
35. 35© 2016 IDERA, Inc. All rights reserved. Proprietary and confidential. 35© 2017 IDERA, Inc. All rights reserved. Proprietary and confidential.
WHO CAN MAKE DECISIONS
36. 36© 2016 IDERA, Inc. All rights reserved. Proprietary and confidential. 36© 2017 IDERA, Inc. All rights reserved. Proprietary and confidential.
PROCESS MODELS FOR
Who Can Access Which Data?
Who Can Make Decisions?
Who Is Accountable for Which Information?
Who Can Act On The Data?
When Can They Take These Actions?
Which Info Can You Use?
37. 37© 2016 IDERA, Inc. All rights reserved. Proprietary and confidential. 37© 2017 IDERA, Inc. All rights reserved. Proprietary and confidential.
DETERMINE WHO IS ACCOUNTABLE/RESPONSIBLE FOR
Accuracy
Accessibility
Consistency
Completeness
Updating/Cleansing
38. 38© 2016 IDERA, Inc. All rights reserved. Proprietary and confidential. 38© 2017 IDERA, Inc. All rights reserved. Proprietary and confidential.
WHO IS ACCOUNTABLE FOR WHICH INFORMATION
39. 39© 2016 IDERA, Inc. All rights reserved. Proprietary and confidential. 39© 2017 IDERA, Inc. All rights reserved. Proprietary and confidential.
PROCESS MODELS FOR
Who Can Access Which Data?
Who Can Make Decisions?
Who Is Accountable for Which Information?
Who Can Act On The Data?
When Can They Take These Actions?
Which Info Can You Use?
40. 40© 2016 IDERA, Inc. All rights reserved. Proprietary and confidential. 40© 2017 IDERA, Inc. All rights reserved. Proprietary and confidential.
WHO CAN ACT ON THE DATA?
41. 41© 2016 IDERA, Inc. All rights reserved. Proprietary and confidential. 41© 2017 IDERA, Inc. All rights reserved. Proprietary and confidential.
PROCESS MODELS FOR
Who Can Access Which Data?
Who Can Make Decisions?
Who Is Accountable for Which Information?
Who Can Act On The Data?
When Can They Take These Actions?
Which Info Can You Use?
42. 42© 2016 IDERA, Inc. All rights reserved. Proprietary and confidential. 42© 2017 IDERA, Inc. All rights reserved. Proprietary and confidential.
WHEN CAN YOU TAKE THESE ACTIONS?
43. 43© 2016 IDERA, Inc. All rights reserved. Proprietary and confidential. 43© 2017 IDERA, Inc. All rights reserved. Proprietary and confidential.
PROCESS MODELS FOR
Who Can Access Which Data?
Who Can Make Decisions?
Who Is Accountable for Which Information?
Who Can Act On The Data?
When Can They Take These Actions?
Which Info Can You Use?
44. 44© 2016 IDERA, Inc. All rights reserved. Proprietary and confidential. 44© 2017 IDERA, Inc. All rights reserved. Proprietary and confidential.
WHICH INFO CAN YOU USE
45. “
45© 2017 IDERA, Inc. All rights reserved. Proprietary and confidential.
Every 2 days we create as much information as we did
from the beginning of time until 2013
- Techcrunch
46. 46© 2017 IDERA, Inc. All rights reserved. Proprietary and confidential.
MORE DATA GOVERNANCE PROCESS MODELS
Woohoo! More Diagrams!
47. 47© 2016 IDERA, Inc. All rights reserved. Proprietary and confidential. 47© 2017 IDERA, Inc. All rights reserved. Proprietary and confidential.
PROCESSES MUST BE DEFINED CONCERNING HOW DATA IS:
Stored
Mapped
Archived
Backed Up
Protected from mishaps, theft or attack
48. 48© 2016 IDERA, Inc. All rights reserved. Proprietary and confidential. 48© 2017 IDERA, Inc. All rights reserved. Proprietary and confidential.
HOW IS DATA STORED?
49. 49© 2016 IDERA, Inc. All rights reserved. Proprietary and confidential. 49© 2017 IDERA, Inc. All rights reserved. Proprietary and confidential.
PROCESSES MUST BE DEFINED CONCERNING HOW DATA IS:
Stored
Mapped
Archived
Backed Up
Protected from mishaps, theft or attack
50. 50© 2016 IDERA, Inc. All rights reserved. Proprietary and confidential. 50© 2017 IDERA, Inc. All rights reserved. Proprietary and confidential.
HOW IS DATA MAPPED?
51. 51© 2016 IDERA, Inc. All rights reserved. Proprietary and confidential. 51© 2017 IDERA, Inc. All rights reserved. Proprietary and confidential.
PROCESSES MUST BE DEFINED CONCERNING HOW DATA IS:
Stored
Mapped
Archived
Backed Up
Protected from mishaps, theft or attack
52. 52© 2016 IDERA, Inc. All rights reserved. Proprietary and confidential. 52© 2017 IDERA, Inc. All rights reserved. Proprietary and confidential.
HOW IS DATA ARCHIVED?
53. 53© 2016 IDERA, Inc. All rights reserved. Proprietary and confidential. 53© 2017 IDERA, Inc. All rights reserved. Proprietary and confidential.
PROCESSES MUST BE DEFINED CONCERNING HOW DATA IS:
Stored
Mapped
Archived
Backed Up
Protected from mishaps, theft or attack
54. 54© 2016 IDERA, Inc. All rights reserved. Proprietary and confidential. 54© 2017 IDERA, Inc. All rights reserved. Proprietary and confidential.
HOW IS DATA BACKED UP?
55. 55© 2016 IDERA, Inc. All rights reserved. Proprietary and confidential. 55© 2017 IDERA, Inc. All rights reserved. Proprietary and confidential.
PROCESSES MUST BE DEFINED CONCERNING HOW DATA IS:
Stored
Mapped
Archived
Backed Up
Protected from mishaps, theft or attack
56. 56© 2016 IDERA, Inc. All rights reserved. Proprietary and confidential. 56© 2017 IDERA, Inc. All rights reserved. Proprietary and confidential.
HOW IS DATA PROTECTED FROM MISHAPS OR THREATS?
57. “
57© 2017 IDERA, Inc. All rights reserved. Proprietary and confidential.
Big data has been used to predict crimes before they
happen – a “predictive policing” trial in California
was able to identify areas where crime will occur
three times more accurately than existing methods
of forecasting.
- BusinessInsider
58. 58© 2017 IDERA, Inc. All rights reserved. Proprietary and confidential.
CONCLUSION
Summing it all up!
59. 59© 2016 IDERA, Inc. All rights reserved. Proprietary and confidential. 59© 2017 IDERA, Inc. All rights reserved. Proprietary and confidential.
CONCLUSION
Data Governance is already a necessity for regulated industries
Data Governance will become more essential as data continues to
grow in organizations
Implementing good Data Governance Practices aren’t easy but
Business Process Models can help get you started
60. “
60© 2017 IDERA, Inc. All rights reserved. Proprietary and confidential.
It’s expected that by 2020 the amount of digital
information in existence will have grown from 3.2
zettabytes today to 40 zettbytes
- IBM
61. “
61© 2017 IDERA, Inc. All rights reserved. Proprietary and confidential.
The NSA is thought to analyze 1.6% of all global
internet traffic – around 30 petabytes (30
million gigabytes) every day.
- CNET
62. 62© 2016 IDERA, Inc. All rights reserved. Proprietary and confidential. 62© 2017 IDERA, Inc. All rights reserved. Proprietary and confidential.
THANKS!
Any questions?
You can find me on Twitter at:
Kim Brushaber
@Brushaber_IDERA