The first step towards understanding what data assets mean for your organization is understanding what those assets mean for each other. Metadata—literally, data about data—is one of many Data Management disciplines inherent in good systems development, and is perhaps the most mislabeled and misunderstood out of the lot. Understanding Metadata and its associated technologies as more than just straightforward technological tools can provide powerful insight into the efficiency of organizational practices, and can also enable you to combine more sophisticated Data Management techniques in support of larger and more complex business initiatives.
In this webinar, we will:
Illustrate how to leverage Metadata in support of your business strategy
Discuss foundational Metadata concepts based on “The DAMA Guide to the Data Management Body of Knowledge” (DAMA DMBOK)
Enumerate guiding principles for and lessons previously learned from Metadata and its practical uses
Metadata Strategies
1. Peter Aiken, Ph.D.
Metadata
• DAMA International President 2009-2013 / 2018
• DAMA International Achievement Award 2001
(with Dr. E. F. "Ted" Codd)
• DAMA International Community Award 2005
Peter Aiken, Ph.D.
• I've been doing this a long time
• My work is recognized as useful
• Associate Professor of IS (vcu.edu)
• Founder, Data Blueprint (datablueprint.com)
• DAMA International (dama.org)
• 10 books and dozens of articles
• Experienced w/ 500+ data
management practices worldwide
• Multi-year immersions
– US DoD (DISA/Army/Marines/DLA)
– Nokia
– Deutsche Bank
– Wells Fargo
– Walmart
– …
Copyright 2018 by Data Blueprint Slide # !2
PETER AIKEN WITH JUANITA BILLINGS
FOREWORD BY JOHN BOTTEGA
MONETIZING
DATA MANAGEMENT
Unlocking the Value in Your Organization’s
Most Important Asset.
The Case for the
Chief Data Officer
Recasting the C-Suite to Leverage
Your Most Valuable Asset
Peter Aiken and
Michael Gorman
2. 1. In the context of data management
2. What is it and why is it important?
3. Major types & subject areas
4. Benefits, application & sources
5. Implementation considerations
6. Guiding principles & building blocks
7. Specific teachable example
8. Take Aways, References and Q&A
Metadata
Uses
Reuses
What is data management?
Sources
Data
Engineering
Data
Delivery
Data
Storage
Specialized Team Skills
Data Governance
Understanding the current
and future data needs of an
enterprise and making that
data effective and efficient in
supporting
business activities
Aiken, P, Allen, M. D., Parker, B., Mattia, A.,
"Measuring Data Management's Maturity:
A Community's Self-Assessment"
IEEE Computer (research feature April 2007)
Data management practices connect
data sources and uses in an
organized and efficient manner
• Engineering
• Storage
• Delivery
• Governance
When executed,
engineering, storage, and
delivery implement governance
Note: this diagram does not depict data reuse well
3.
What is data management?
Sources
Data
Engineering
Data
Delivery
Data
Storage
Resources
(optimized for reuse)
Data Governance
Analytic Insight
Specialized Team Skills
Maslow's Hierarchy of Needs
4. You can accomplish Advanced Data Practices without becoming proficient in the Foundational Data Practices; however, this will:
• Take longer
• Cost more
• Deliver less
• Present greater risk
(with thanks to Tom DeMarco)
Data Management Practices Hierarchy
Advanced
Data
Practices
• MDM
• Mining
• Big Data
• Analytics
• Warehousing
• SOA
Foundational Data Practices
Data Platform/Architecture
Data Governance
Data Quality
Data Operations
Data Management Strategy
Technologies
Capabilities
DMM℠ Structure of
5 Integrated
DM Practice Areas
Data architecture
implementation
Data
Governance
Data
Management
Strategy
Data
Operations
Platform
Architecture
Supporting
Processes
Maintain fit-for-purpose data,
efficiently and effectively
Manage data coherently
Manage data assets professionally
Data life cycle
management
Organizational support
Data
Quality
5. Data Management Strategy is often the weakest link
[Figure: the DMM practice-area diagram from the previous slide, repeated with scores overlaid: four areas marked 3 and one marked 1]
Data Management Body of Knowledge (DMBoK)
Meta data, Meta-data, or metadata
• In the history of language, whenever two words are pasted together to form a combined concept, a hyphen initially links them
• With the passage of time, the hyphen is lost. The argument can be made that that time has passed
• There is a trademark on the term "metadata," but it has not been enforced
• So, the term is "metadata"
Data
About
Data
Sources
Uses
Metadata Governance
Metadata
Engineering
Metadata
Delivery
Metadata
Storage
Specialized Team Skills
Metadata Practices
• If metadata is data then what technologies and techniques
should we use to manage it?
• Data management technologies and techniques
10. Definitions
• Metadata is
– Everywhere in every data management activity and integral
to all IT systems and applications.
– To data what data is to real life. Data reflects real life transactions, events,
objects, relationships, etc. Metadata reflects data transactions, events,
objects, relations, etc.
– The data that describe the structure and workings of an
organization’s use of information, and which describe the
systems it uses to manage that information.
[quote from David Hay's book, page 4]
• Data describing various facets of a data asset, for the purpose
of improving its usability throughout its life cycle [Gartner 2010]
• Metadata unlocks the value of data, and therefore requires
management attention [Gartner 2011]
• Metadata Management is
– The set of processes that ensure proper creation, storage, integration, and
control to support associated use of metadata
Defining Metadata
Metadata is any
combination of
any circle and the
data in the center
that unlocks the
value of the data!
Adapted from Brad Melton
[Diagram: Data at the center, surrounded by six circles: Who, What, Where, When, Why, How]
13. Data
Library Metadata Example
Libraries can operate
efficiently through careful
use of metadata (Card
Catalog)
Who: Author
What: Title
Where: Shelf Location
When: Publication Date
A small amount of
metadata (Card Catalog)
unlocks the value of a large
amount of data (the
Library)
[Diagram: Library Book at the center, surrounded by Who, What, Where, When, Why, How]
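The card catalog idea can be sketched in a few lines of Python. This is only an illustration, not something from the webinar: the `CatalogCard` class, its field names, and the sample books are all hypothetical. It shows a small metadata index answering a "where is it?" question without opening a single book.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class CatalogCard:
    """One card: metadata about a book, not the book itself (hypothetical schema)."""
    author: str            # Who
    title: str             # What
    shelf_location: str    # Where
    publication_year: int  # When


# Made-up sample cards for illustration only.
CATALOG = [
    CatalogCard("Ada Example", "Gardening Basics", "A-12", 2001),
    CatalogCard("Ada Example", "Advanced Gardening", "A-13", 2005),
    CatalogCard("Ben Sample", "Cooking at Home", "B-04", 1998),
]


def find_shelves(catalog, author):
    """Return the shelf location of every book by the given author."""
    return [card.shelf_location for card in catalog if card.author == author]


print(find_shelves(CATALOG, "Ada Example"))
```

The lookup touches only the cards, which is the point of the slide: the catalog is tiny compared with the library it describes.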
Outlook Example
"Outlook" metadata is used
to navigate/manage email
Who: "To" & "From"
What: "Subject"
How: "Priority"
Where: "USERID/Inbox", "USERID/Personal"
Why: "Body"
When: "Sent" & "Received"
• Find the important stuff/weed out junk
• Organize for future access/Outlook rules
• Imagine how managing e-mail (already non-trivial) would change if Outlook did not make use of metadata
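A mail rule of the kind described above can be sketched as follows. This is an illustrative Python model, not Outlook's actual API: the `Message` class, the `apply_rule` function, and the sample messages are assumptions made for the example. The rule sorts mail using only the "From" and "Where" metadata and never reads the body.

```python
from dataclasses import dataclass


@dataclass
class Message:
    """A simplified e-mail record (hypothetical fields mirroring the slide)."""
    sender: str     # Who: "From"
    recipient: str  # Who: "To"
    subject: str    # What
    priority: str   # How
    folder: str     # Where
    body: str       # Why (the content; the rule below never inspects it)


def apply_rule(messages, sender_contains, target_folder):
    """Move every message whose sender matches into target_folder; return the count."""
    moved = 0
    for msg in messages:
        if sender_contains.lower() in msg.sender.lower():
            msg.folder = target_folder
            moved += 1
    return moved


# Made-up messages for illustration only.
INBOX = [
    Message("boss@example.com", "me@example.com", "Budget", "High", "USERID/Inbox", "..."),
    Message("newsletter@example.com", "me@example.com", "Weekly digest", "Low", "USERID/Inbox", "..."),
    Message("friend@example.com", "me@example.com", "Lunch?", "Normal", "USERID/Inbox", "..."),
]

moved = apply_rule(INBOX, "newsletter", "USERID/Personal")
print(moved, [m.folder for m in INBOX])
```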
14. Hedy Lamarr
• Google celebrated her
101st Birthday on 11/8/2015
– Tablets for fizzy drinks
– Improved stop light design
• Invention of “frequency
hopping” radio
– By jumping from one radio
frequency to another rapidly,
only a receiver that shares the
key can find the transmission
– Prevent interference with the radio
guidance controls of torpedoes
• U.S. Patent 2,292,387 (w/ George Antheil)
• Associated traffic analysis
– Looking at other elements of a communication
when you don’t know the actual content
– Time/duration of a message
– Location of transmitters
– Detect specific operator “fists”
– Identify the operator and you could identify a
specific ship or military unit, locate it with
direction finding, and then track its activity over time
https://theconversation.com/how-wwi-codebreakers-taught-your-gas-meter-to-snitch-on-you-29924
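The traffic analysis steps listed above (time, location, operator "fist", tracking over time) can be illustrated with a short Python sketch. Everything here is hypothetical: the `Intercept` record, the `track_operators` function, and the sample log are invented for the example and assume the operator has already been labeled by ear.

```python
from collections import defaultdict
from dataclasses import dataclass


@dataclass(frozen=True)
class Intercept:
    """Metadata about one intercepted message; the content is unknown."""
    hour: int          # time the message was sent
    duration_min: int  # duration of the message
    location: str      # transmitter location from direction finding
    fist: str          # label for the operator's distinctive keying style


def track_operators(intercepts):
    """Group sightings by operator fist, ordered by time."""
    tracks = defaultdict(list)
    for item in sorted(intercepts, key=lambda i: i.hour):
        tracks[item.fist].append((item.hour, item.location))
    return dict(tracks)


# Made-up observations for illustration only.
LOG = [
    Intercept(hour=14, duration_min=3, location="grid C4", fist="operator-1"),
    Intercept(hour=2, duration_min=5, location="grid A1", fist="operator-1"),
    Intercept(hour=9, duration_min=1, location="grid F7", fist="operator-2"),
    Intercept(hour=8, duration_min=4, location="grid B2", fist="operator-1"),
]

for fist, sightings in track_operators(LOG).items():
    print(fist, sightings)
```

No message content is used anywhere, yet the output is a movement history per operator.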
Why Metadata Matters
• They know you rang a phone sex service at 2:24 am and spoke for 18
minutes. But they don't know what you talked about.
• They know you called the suicide prevention hotline from the Golden Gate
Bridge. But the topic of the call remains a secret.
• They know you spoke with an HIV testing service, then your doctor, then
your health insurance company in the same hour. But they don't know what
was discussed.
• They know you received a call from the local NRA office while it was
having a campaign against gun legislation, and then called your senators
and congressional representatives immediately after. But the content of
those calls remains safe from government intrusion.
• They know you called a gynecologist, spoke for a half hour, and then
called the local Planned Parenthood's number later that day. But nobody
knows what you spoke about.
– https://www.eff.org/deeplinks/2013/06/why-metadata-matters
Typically Managed Architectures
• Process Architecture
– Arrangement of inputs -> transformations = value -> outputs
– Typical elements: Functions, activities, workflow, events, cycles, products, procedures
• Systems Architecture
– Applications, software components, interfaces, projects
• Business Architecture
– Goals, strategies, roles, organizational structure, location(s)
• Security Architecture
– Arrangement of security controls in relation to IT Architecture
• Technical Architecture/Tarchitecture
– Relation of software capabilities/technology stack
– Structure of the technology infrastructure of an enterprise, solution or system
– Typical elements: Networks, hardware, software platforms, standards/protocols
• Data/Information Architecture
– Arrangement of data assets supporting organizational strategy
– Typical elements: specifications expressed as entities, relationships, attributes,
definitions, values, vocabularies
Investing in Metadata?
• How can IT staff convince managers to plan,
budget, and apply resources for metadata
management?
• What is metadata
and why is it
important?
• What technologies
are involved?
• Internet and intranet
technologies are
part of the answer
and will get the
immediate attention of management.
23. Metadata …
• Isn't
– Is not a noun
– One person's data is another's metadata
• Is more of a verb?
– Represents a use of existing facts, rather than a type of data itself
• A gerund
– a form that is derived from
a verb but that functions as a noun
– e.g., "asking" in "Do you mind my asking you?"
– Describes a use of data - not a type of data
• Describes the use of some attributes
of data to understand or manage that
same data from a different
(usually higher) level of abstraction
• Value proposition
– Is this data worth including within the scope of our metadata practices?
Keep the proper focus
• Wrong question:
– Is this metadata?
• Right question:
– Should we include this
data item within the
scope of our
metadata
practices?
24. Definition of Bed
Entity: BED
Data Asset Type: Principal Data Entity
Purpose: This is a substructure within the Room
substructure of the Facility Location. It contains
information about beds within rooms.
Source: Maintenance Manual for File and Table
Data (Software Version 3.0, Release 3.1)
Attributes: Bed.Description
Bed.Status
Bed.Sex.To.Be.Assigned
Bed.Reserve.Reason
Associations: >0-+ Room
Status: Validated
Purpose statement incorporates motivations
A purpose statement describing
– Why the organization is maintaining information about this business concept;
– Sources of information about it;
– A partial list of the attributes or characteristics of the entity; and
– Associations with other data items;
this one is read as "One room contains zero or many beds."
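The BED definition above can be captured as structured metadata. The following Python sketch is one possible representation, not the format used in the source system: the `EntityDefinition` and `Association` classes and the `read_association` helper are hypothetical, while the field values are copied from the slide.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass(frozen=True)
class Association:
    """A 'container holds members' link; max_count=None means 'many'."""
    owner: str
    member: str
    min_count: int
    max_count: Optional[int]


@dataclass(frozen=True)
class EntityDefinition:
    name: str
    asset_type: str
    purpose: str
    source: str
    attributes: List[str]
    associations: List[Association]
    status: str


BED = EntityDefinition(
    name="BED",
    asset_type="Principal Data Entity",
    purpose=("This is a substructure within the Room substructure of the "
             "Facility Location. It contains information about beds within rooms."),
    source=("Maintenance Manual for File and Table Data "
            "(Software Version 3.0, Release 3.1)"),
    attributes=["Bed.Description", "Bed.Status",
                "Bed.Sex.To.Be.Assigned", "Bed.Reserve.Reason"],
    associations=[Association(owner="Room", member="Bed", min_count=0, max_count=None)],
    status="Validated",
)


def read_association(a):
    """Render an association as a plain-English sentence."""
    lower = "zero" if a.min_count == 0 else str(a.min_count)
    upper = "many" if a.max_count is None else str(a.max_count)
    return f"One {a.owner.lower()} contains {lower} or {upper} {a.member.lower()}s."


print(read_association(BED.associations[0]))
```

Holding the definition as data rather than free text means the same record can feed documentation, validation, and a repository.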
25. Metadata Implementation Phases
For 1 manageable business problem!
[Bar chart, horizontal axis 0 to 2500]
Manage Positions (2%)
Plan Careers (~5%)
Administer Training (~5%)
Plan Successions (~5%)
Manage Competencies (20%)
Recruit Workforce (62%)
Develop Workforce (29.9%)
Administer Workforce (28.8%)
Compensate Employees (23.7%)
Monitor Workplace (8.1%)
Define Business (4.4%)
Target System (3.9%)
EDI Manager (.9%)
Target System Tools (.3%)
Administer Workforce
Metadata Uses
27. Build Your Own Metadata Repository
Sample
Low-tech
Repository
FTI Metadata Model
33. Example: iTunes Metadata
• Insert a recently purchased CD
• iTunes can:
– Count the number of tracks (25)
– Determine the length of each track
• When connected to the Internet, iTunes connects to the Gracenote(.com) Media Database and retrieves:
– CD Name
– Artist
– Track Names
– Genre
– Artwork
• Sure would be a pain to type in all this information
34. • To organize iTunes
– I create a "New Smart Playlist" for Artist containing "Miles Davis"
• Notice I didn't get the desired results
• I already had another Miles Davis recording in iTunes
• Must fine-tune the request to get the desired results
– Album contains "The complete birth of the cool"
• Now I can move the playlist "Miles Davis" to a folder
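The smart playlist steps above amount to filtering tracks on their metadata fields. The sketch below models that in Python; it is not how iTunes is implemented, and the `smart_playlist` function, the track dictionaries, and the placeholder track and album names are assumptions for illustration.

```python
# Made-up library: two Miles Davis recordings plus one unrelated track.
TRACKS = [
    {"name": "Track 1", "artist": "Miles Davis", "album": "The Complete Birth of the Cool"},
    {"name": "Track 2", "artist": "Miles Davis", "album": "Another Album"},
    {"name": "Track 3", "artist": "Someone Else", "album": "Different Record"},
]


def smart_playlist(tracks, **rules):
    """Keep tracks where every named field contains its rule text (case-insensitive)."""
    return [
        track for track in tracks
        if all(text.lower() in track[field].lower() for field, text in rules.items())
    ]


# First attempt: the artist rule alone also matches the older recording.
broad = smart_playlist(TRACKS, artist="Miles Davis")

# Fine-tuned request: add the album rule to get only the new CD.
narrow = smart_playlist(TRACKS, artist="Miles Davis",
                        album="The complete birth of the cool")

print(len(broad), len(narrow))
```

Adding a second rule narrows the result in the same way the slide describes, and nothing in the filter depends on the audio itself.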
35. • The same:
– Interface
– Processing
– Data Structures
• are applied to
– Podcasts
– Movies
– Books
– .pdf files
• Economies of scale are enormous
36. Metadata Take Aways
• 'Data about data'
• Metadata unlocks the value of data, and therefore requires
management attention [Gartner 2011]
• Metadata is less about what and more about how
• Metadata is the language of data governance
• Metadata defines the essence of integration challenges
• Should we include this data item within the scope of our
metadata practices?
Sources
Uses
Metadata Governance
Metadata
Engineering
Metadata
Delivery
Metadata Practices
Metadata
Storage
Specialized Team Skills
39. It’s your turn!
Use the chat
feature or Twitter
(#dataed) to submit
your questions now!
Questions?
Data Architecture Summit (DAS)
Data Architecture Bootcamp
October 8, 2018 @ 8:00 AM CT
October Webinar:
Data Quality Strategies: Rethinking Data Quality
October 9, 2018 @ 2:00 PM ET
November Webinar:
Data Architecture v Data Modeling
November 12, 2018 @ 2:00 PM ET
Sign up for webinars at: www.datablueprint.com/webinar-schedule or at www.dataversity.net
Upcoming Events
Brought to you by:
40. 10124 W. Broad Street, Suite C
Glen Allen, Virginia 23060
804.521.4056