METADATA
MATTERS
Metadata and Taxonomies for Organizing
your Content - April 29, 2015
ABOUT AIIM
▪ AIIM (Association for Information and Image Management) is the
global community of information professionals. Our mission is to help
you and your organization survive and thrive in this era of Information
Chaos by solving these 4 key business problems:
▪ How do we manage the risk of growing volumes of content?
▪ How do we automate our content-intensive business processes?
▪ How do we use content to better engage and collaborate?
▪ How do we gain business insight from all of this information?
▪ www.aiim.org
29-Apr-15 © 2015 Precision Content Authoring Solutions Inc.
ABOUT AIIM TORONTO
▪ The First Canadian Chapter serves
▪ Toronto
▪ Montreal, and
▪ Ottawa
▪ Brings together members for education and networking
▪ Looking for volunteers to help with running the chapter
ABOUT YOUR PRESENTER
29-Apr-15 © 2015 Ascan Information Architects Limited
▪ Rob Hanna, ECMs
▪ President of Precision Content Authoring Solutions
Inc. and a director of AIIM First Canadian Chapter
▪ Expert in structured authoring and content
management practices and technology
▪ Instructor at the University of Toronto School of
Continuing Studies – Metadata and Controlled
Vocabularies
WHAT IS METADATA?
And how does it relate to content?
WHAT IS CONTENT?
Data Information
Content Knowledge
METADATA DEFINED
▪ Coined in the 1960s by Jack Myers
▪ Data about Data
▪ Stuff about Stuff
▪ Essential properties stored within the content or external to the content
that identify and define context, history, and management of the
content
METADATA IS INFORMATION ABOUT A
RESOURCE
APPLICATION OF METADATA
▪ Metadata is
▪ applied to all structured and unstructured content in a corpus
▪ visible to the user or it can be hidden from view
▪ both machine-driven and manually entered
▪ internal or external to the content
▪ mandatory, optional, or conditional
MANY FORMS OF METADATA
▪ Corporate metadata is structured data about content
▪ Metadata is relational or hierarchical
▪ Metadata may take the form of
▪ Rich-text or binary
▪ Plain-text
▪ Controlled values/pick-lists/lookup values
▪ Syntax-encoded values
▪ date/time (e.g., yyyy-mm-dd hh:mm:ss)
▪ financial ($0.00, -$0.00)
▪ numeric - integer/floating values (#,###)
▪ boolean (true/false)
▪ special (phone numbers, postal codes, or social insurance numbers)
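The syntax-encoded forms listed above lend themselves to automated checks. The following is a minimal sketch, assuming Python and a handful of hypothetical validation rules (the regular expressions and function names are illustrative, not part of any standard or product):

```python
import re
from datetime import datetime
from decimal import Decimal

# Hypothetical patterns for illustration; real systems define their own rules.
CANADIAN_POSTAL_CODE = re.compile(r"^[A-Z]\d[A-Z] ?\d[A-Z]\d$")
MONEY = re.compile(r"^-?\$\d+\.\d{2}$")


def parse_timestamp(value: str) -> datetime:
    """Parse a 'yyyy-mm-dd hh:mm:ss' value; raises ValueError if malformed."""
    return datetime.strptime(value, "%Y-%m-%d %H:%M:%S")


def parse_money(value: str) -> Decimal:
    """Parse '$0.00' or '-$0.00' into a Decimal; raises ValueError if malformed."""
    if not MONEY.match(value):
        raise ValueError(f"not a financial value: {value!r}")
    return Decimal(value.replace("$", ""))


def parse_boolean(value: str) -> bool:
    """Accept only the literal strings 'true' and 'false'."""
    if value not in ("true", "false"):
        raise ValueError(f"not a boolean value: {value!r}")
    return value == "true"


def is_postal_code(value: str) -> bool:
    """Check the letter-digit-letter digit-letter-digit shape of a postal code."""
    return bool(CANADIAN_POSTAL_CODE.match(value))
```

Rejecting malformed values at entry time keeps the metadata usable later as structured data.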
MANY ROLES OF METADATA
▪ The primary role of metadata is to facilitate the identification, retrieval,
and processing of content in any media.
▪ Secondarily, metadata may also
▪ appear as content to the content consumer, and
▪ serve as corporate structured data for analysis and business intelligence.
METADATA IS THE
SOUP CAN
Content is the soup
METADATA ISN’T
THE MESSAGE
▪ Twitter post (118 chars)
▪ Twitter status message metadata (1,938 chars)
{"id"=>12296272736,
 "text"=>
  "An early look at Annotations: http://groups.google.com/group/twitter-api-announce/browse_thread/thread/fa5da2608865453",
 "created_at"=>"Fri Apr 16 17:55:46 +0000 2010",
 "in_reply_to_user_id"=>nil,
 "in_reply_to_screen_name"=>nil,
 "in_reply_to_status_id"=>nil,
 "favorited"=>false,
 "truncated"=>false,
 "user"=>
  {"id"=>6253282,
   "screen_name"=>"twitterapi",
   "name"=>"Twitter API",
   "description"=>
    "The Real Twitter API. I tweet about API changes, service issues and happily answer questions about Twitter and our API. Don't get an answer? It's on my website.",
   "url"=>"http://apiwiki.twitter.com",
   "location"=>"San Francisco, CA",
   "profile_background_color"=>"c1dfee",
   "profile_background_image_url"=>
    "http://a3.twimg.com/profile_background_images/59931895/twitterapi-background-new.png",
   "profile_background_tile"=>false,
   "profile_image_url"=>"http://a3.twimg.com/profile_images/689684365/api_normal.png",
   "profile_link_color"=>"0000ff",
   "profile_sidebar_border_color"=>"87bc44",
   "profile_sidebar_fill_color"=>"e0ff92",
   "profile_text_color"=>"000000",
   "created_at"=>"Wed May 23 06:01:13 +0000 2007",
   "contributors_enabled"=>true,
   "favourites_count"=>1,
   "statuses_count"=>1628,
   "friends_count"=>13,
   "time_zone"=>"Pacific Time (US & Canada)",
   "utc_offset"=>-28800,
   "lang"=>"en",
   "protected"=>false,
   "followers_count"=>100581,
   "geo_enabled"=>true,
   "notifications"=>false,
   "following"=>true,
   "verified"=>true},
 "contributors"=>[3191321],
 "geo"=>nil,
 "coordinates"=>nil,
 "place"=>
  {"id"=>"2b6ff8c22edd9576",
   "url"=>"http://api.twitter.com/1/geo/id/2b6ff8c22ed9576.json",
   "name"=>"SoMa",
   "full_name"=>"SoMa, San Francisco",
   "place_type"=>"neighborhood",
   "country_code"=>"US",
   "country"=>"The United States of America",
   "bounding_box"=>
    {"coordinates"=>
      [[[-122.42284884, 37.76893497],
        [-122.3964, 37.76893497],
        [-122.3964, 37.78752897],
        [-122.42284884, 37.78752897]]],
     "type"=>"Polygon"}},
 "source"=>"web"}
An early look at Annotations:
http://groups.google.com/group/twitter-api-announce/browse_thread/thread/fa5da2608865453
WHY METADATA
MATTERS
The collection and use of metadata is
controversial because metadata can be revealing
even when viewed apart from the content it describes.
Electronic Frontier Foundation
30 December 2013
▪ They know you rang a phone sex service at 2:24 am and spoke for 18 minutes. But they don’t know what you talked about.
▪ They know you called the suicide prevention hotline from the Golden Gate Bridge. But the topic of the call remains a secret.
▪ They know you spoke with an HIV testing service, then your doctor, then your health insurance company in the same hour. But they don’t know what was discussed.
TYPES OF
METADATA
The Library of Congress states that metadata
consists of
• Descriptive Metadata
• Administrative Metadata, and
• Structural Metadata
DESCRIPTIVE METADATA
And how it is applied through classification
▪ Classification is the ordering of entities (things or concepts) into groups
or classes on the basis of their similarity
▪ an activity that we do every day
▪ metadata and controlled vocabularies are tools that can be used for
classification
THINKING ABOUT CLASSIFICATION
analyst brake market stapler
seat traders alternator investor
calculators scissors engine pedal
dashboard pen backers marker
tape profit starter ruler prospects
THINKING ABOUT CLASSIFICATION
How many words can you
memorize in 20 seconds?
analyst brake market stapler
dashboard pen backer marker
seat trader alternator investor
pedal calculator scissors engine
tape profit starter ruler prospect
THINKING ABOUT CLASSIFICATION
1. Filter out all of the noise
analyst brake market stapler
dashboard pen backer marker
seat trader alternator investor
pedal calculator scissors engine
tape profit starter ruler prospect
THINKING ABOUT CLASSIFICATION
2. Break into smaller groupings
dashboard alternator pedal brake seat engine starter
marker stapler scissors tape pen calculator ruler
analyst market backer investor trader profit prospect
THINKING ABOUT CLASSIFICATION
3. Organize words by similarities
Car parts: dashboard alternator pedal brake seat engine starter
Office supplies: marker stapler scissors tape pen calculator ruler
Stock market: analyst market backer investor trader profit prospect
THINKING ABOUT CLASSIFICATION
4. Classify and label groups
THINKING ABOUT CLASSIFICATION
Stock market    Office supplies    Car parts
analyst         stapler            brake
market          calculator         seat
trader          scissors           dashboard
investor        pen                engine
backer          marker             alternator
profit          tape               starter
prospect        ruler              pedal
How well did you do?
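The exercise above can also be expressed as a lookup: once every term has been assigned to a labelled class, classifying a new list of words is mechanical. A minimal sketch in Python follows, using the three groups from the slide; the `classify` function and the "Unclassified" fallback label are illustrative assumptions:

```python
from collections import defaultdict

# Groups and terms as shown on the slide above.
GROUPS = {
    "Stock market": ["analyst", "market", "trader", "investor",
                     "backer", "profit", "prospect"],
    "Office supplies": ["stapler", "calculator", "scissors", "pen",
                        "marker", "tape", "ruler"],
    "Car parts": ["brake", "seat", "dashboard", "engine",
                  "alternator", "starter", "pedal"],
}

# Invert the groups so each term points at its class label.
CLASS_OF = {term: label for label, terms in GROUPS.items() for term in terms}


def classify(words):
    """Sort words into labelled groups; unknown words go to 'Unclassified'."""
    result = defaultdict(list)
    for word in words:
        result[CLASS_OF.get(word, "Unclassified")].append(word)
    return dict(result)


print(classify(["pen", "brake", "analyst", "spoon"]))
```

Words that fall into "Unclassified" are a signal that the scheme needs a new class or a new term.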
THINKING ABOUT CLASSIFICATION
Vegetables    Computer parts    Instruments
peas          hard drive        violin
endive        sound card        harp
carrots       monitor           piano
spinach       mouse             trumpet
celery        processor         cello
broccoli      flash drive       flute
tomato        keyboard          guitar
Now how many words can you
memorize in 20 seconds?
CONTROLLED VOCABULARIES
▪ Some metadata requires a classification, that is, a controlled list of values
or terms, to define it. For example:
▪ Film rating: G, PG, 14A, 18A, R, A
▪ eBay seller location:
▪ Control is exercised over modifications to the list
Controlled vocabularies defined
▪ A list of terms
▪ All terms in a controlled vocabulary must
have an unambiguous, non-redundant
definition. (Source: ANSI/NISO Z39.19-2005)
Controlled Vocabularies
What is a controlled vocabulary?
Why use controlled vocabularies?
Types of controlled vocabularies
BRIDGING BOUNDARIES -
WHICH TERM IS “RIGHT”?
Accessible parking spaces
Accessible permit parking
Disabled permit parking
Designated disabled parking spaces
Handicapped parking
Disabled parking spaces
TOWARDS A COMMON
VOCABULARY
Accessible parking spaces
Accessible permit parking
Disabled permit parking
Designated disabled parking spaces
Handicapped parking
Disabled parking spaces
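One way to move towards a common vocabulary is to map every variant to a single preferred term. The sketch below assumes Python and, purely for illustration, treats "Accessible parking spaces" as the preferred term; the slides do not say which term should win, so that choice and the `normalize` function are hypothetical:

```python
# Assumption: the preferred term is chosen here only for illustration.
PREFERRED_TERM = "Accessible parking spaces"

# Variant terms as listed on the slide above.
VARIANTS = [
    "Accessible parking spaces",
    "Accessible permit parking",
    "Disabled permit parking",
    "Designated disabled parking spaces",
    "Handicapped parking",
    "Disabled parking spaces",
]

# Case-insensitive lookup from any variant to the preferred term.
LOOKUP = {term.casefold(): PREFERRED_TERM for term in VARIANTS}


def normalize(term: str) -> str:
    """Return the preferred term, or raise ValueError for unknown terms."""
    key = " ".join(term.split()).casefold()  # collapse stray whitespace
    try:
        return LOOKUP[key]
    except KeyError:
        raise ValueError(f"{term!r} is not in the controlled vocabulary") from None
```

Raising an error for unknown terms, rather than accepting them silently, reflects the idea that control is exercised over modifications to the list.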
CARD SORTING
Techniques for developing controlled
vocabularies
MANAGING CONTROLLED
VOCABULARIES
TYPES OF CLASSIFICATION SCHEMES
▪ Subject
▪ Identify content topics
▪ Organization Structure
▪ Depicts business units
▪ Functional
▪ Defined by business processes
SUBJECT TAXONOMIES
▪ Describes the topic of the resource
▪ Structured from broad to narrow / general to specific
▪ Often stable over time
SUBJECT
CLASSIFICATION
Source: http://popchartlab.com/products/the-very-very-many-varieties-of-beer
ORGANIZATION CLASSIFICATION
▪ Shows business unit relationships
▪ Can be used to identify:
▪ Ownership of content
▪ Maintenance responsibilities
▪ A person’s place in the organization
▪ Often changes
ORGANIZATIONAL
CLASSIFICATION
FUNCTIONAL CLASSIFICATION
▪ Describes the breakdown of business processes
▪ Function – Activity – Task
▪ Stable in nature unless new processes or functions are introduced
FUNCTIONAL
CLASSIFICATION
Source: http://www.iskouk.org/conf2009/papers/milne_ISKOUK2009.pdf
TAXONOMIES
▪ Types of taxonomies
▪ Lists
▪ Trees
▪ Hierarchies and polyhierarchies
▪ Matrices, and
▪ System maps
TAXONOMY TYPES
▪ List style taxonomy
TAXONOMY TYPES
▪ Simple tree style taxonomy
TAXONOMY TYPES
▪ Classical hierarchical style taxonomy
TAXONOMY TYPES
▪ Polyhierarchical style taxonomy
TAXONOMY TYPES
▪ Matrix style taxonomy
▪ With 3 facets
TAXONOMY TYPES
▪ System map style taxonomy
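The difference between a hierarchy and a polyhierarchy can be shown with a small data structure. This is a minimal sketch in Python with hypothetical terms: each term maps to its broader term(s), and a term with more than one broader term makes the taxonomy polyhierarchical.

```python
# Hypothetical terms for illustration; each term maps to its broader term(s).
BROADER = {
    "Vehicles": [],
    "Cars": ["Vehicles"],
    "Car parts": ["Cars"],
    "Engine": ["Car parts"],
    "Office supplies": [],
    "Electronics": [],
    "Calculator": ["Office supplies", "Electronics"],  # two parents
}


def ancestors(term):
    """Collect every broader term reachable from the given term."""
    found = set()
    stack = list(BROADER.get(term, []))
    while stack:
        parent = stack.pop()
        if parent not in found:
            found.add(parent)
            stack.extend(BROADER.get(parent, []))
    return found


def is_polyhierarchical(taxonomy):
    """True if any term has more than one broader term."""
    return any(len(parents) > 1 for parents in taxonomy.values())
```

In a simple tree every term has at most one broader term, so `is_polyhierarchical` would return False.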
ADMINISTRATIVE
METADATA
For managing the content
ADMINISTRATIVE METADATA
▪ Information about the metadata record itself – its creation,
modification, relationship to other records, etc.
▪ Audit trails may capture the date and time when a file’s title was changed.
▪ Common subsets of administrative metadata are:
▪ Rights Management: metadata that deals with intellectual property rights
▪ Preservation: information needed to archive / preserve a resource
Source: Understanding Metadata – NISO 2004
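As a rough illustration of an audit trail, the sketch below records the date and time of a title change alongside the old and new values. It assumes Python; the `Record` and `AuditEntry` classes and their field names are hypothetical and do not reflect any particular content management system:

```python
from dataclasses import dataclass, field
from datetime import datetime, timezone


@dataclass
class AuditEntry:
    field_name: str
    old_value: str
    new_value: str
    changed_at: datetime


@dataclass
class Record:
    title: str
    rights: str = ""        # rights management metadata (illustrative)
    preservation: str = ""  # preservation metadata (illustrative)
    audit_trail: list = field(default_factory=list)

    def rename(self, new_title, when=None):
        """Change the title and log when the change happened."""
        when = when or datetime.now(timezone.utc)
        self.audit_trail.append(AuditEntry("title", self.title, new_title, when))
        self.title = new_title
```

Here the audit entries are administrative metadata about the record itself, applied automatically rather than typed in by the user.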
SEPARATION OF STATUS METADATA
▪ Much of the administrative metadata is applied automatically by the
system
▪ Other administrative metadata may live with the workflow rather than
the record itself
STRUCTURAL METADATA
Defining the structure of a resource
ABOUT STRUCTURAL METADATA
▪ Describes the structure of a resource
▪ Book
▪ Document
▪ Website
▪ Table of contents
▪ Site map
▪ Internal structure
WHAT IS XML?
▪ XML (eXtensible Markup Language) is an open
standard for the exchange of information
▪ first published as a W3C working draft in 1996 and as a W3C Recommendation in 1998
▪ to encode electronic documents readable by
▪ humans, and
▪ machines
▪ for a multitude of applications ranging from
▪ corporate financial reporting applications, to
▪ Microsoft Word
XML IS
EVERYWHERE
XML defines meaningful data structures for
documents and data. It is a human-readable
file format used to power
• manufacturing assembly lines
• medical devices
• military applications, and
• many other things.
XML is used throughout the Web. It underpins
data formats handled by smart phones and web browsers.
WHAT ARE MARKUP LANGUAGES?
▪ pre-date desktop publishing and the Internet
▪ tell computers how to handle data
▪ such as how to render electronic content on a page
▪ categorized as either
▪ presentation, or
▪ semantic markup
PRESENTATION MARKUP
▪ With electronic presentation markup, we mark up the
paragraph and italicize the citation for publication
▪ This is typical of web pages using hypertext markup (HTML)
The Cancer Journal: The Journal of Principles & Practice of
Oncology provides an integrated view of modern oncology across
all disciplines.
<p><i>The Cancer Journal: The Journal of Principles &amp; Practice
of Oncology</i> provides an integrated view of modern oncology
across <i>all</i> disciplines.</p>
The Cancer Journal: The Journal of Principles & Practice of Oncology provides an
integrated view of modern oncology across all disciplines.
SEMANTIC MARKUP
▪ With semantic markup, we mark up the content to describe the meaning
of the text
▪ Publishing stylesheets interpret the meaning from the markup and apply
appropriate styles specific to the publishing context
The Cancer Journal: The Journal of Principles & Practice of
Oncology provides an integrated view of modern oncology across
all disciplines.
<intro><cite>The Cancer Journal: The Journal of Principles &amp;
Practice of Oncology</cite> provides an integrated view of
modern oncology across <em>all</em> disciplines.</intro>
The Cancer Journal: The Journal of Principles & Practice of Oncology provides an
integrated view of modern oncology across all disciplines.
The Cancer Journal: The Journal of Principles & Practice of Oncology provides an
integrated view of modern oncology across all disciplines.
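To illustrate how one semantically marked-up source can be styled differently per publishing context, here is a minimal sketch in Python using the standard library's `xml.etree.ElementTree`. The two style tables stand in for publishing stylesheets and are assumptions made for this example, as is the `render` function:

```python
import xml.etree.ElementTree as ET
from html import escape

SOURCE = (
    "<intro><cite>The Cancer Journal: The Journal of Principles &amp; "
    "Practice of Oncology</cite> provides an integrated view of "
    "modern oncology across <em>all</em> disciplines.</intro>"
)

# Hypothetical "stylesheets": each maps a semantic element to output markup.
WEB_STYLE = {"intro": ("<p>", "</p>"), "cite": ("<i>", "</i>"), "em": ("<b>", "</b>")}
PLAIN_STYLE = {"intro": ("", ""), "cite": ('"', '"'), "em": ("*", "*")}


def render(element, style, escape_text=lambda s: s):
    """Walk the element tree and wrap each element as the style dictates."""
    open_mark, close_mark = style.get(element.tag, ("", ""))
    parts = [open_mark, escape_text(element.text or "")]
    for child in element:
        parts.append(render(child, style, escape_text))
        parts.append(escape_text(child.tail or ""))
    parts.append(close_mark)
    return "".join(parts)


root = ET.fromstring(SOURCE)
web_output = render(root, WEB_STYLE, escape)  # escape text for HTML output
plain_output = render(root, PLAIN_STYLE)
```

The source never changes; only the style table does. Adding a new output format means adding a new table rather than re-marking the content.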
SEMANTIC MARKUP
▪ Using semantic markup, we can
▪ disambiguate content
▪ search based on meaning
▪ connect to other content, and
▪ reuse or substitute new text.
MULTI-CHANNEL
PUBLISHING
▪ Supports complex, multi-channel
publishing to many common
output formats
▪ Add new formats or styles easily
INTELLIGENT CONTENT
▪ Content that is
▪ not limited to one
▪ purpose
▪ technology, or
▪ output
▪ structurally rich and semantically aware, making it
▪ discoverable
▪ reusable
▪ reconfigurable, and
▪ adaptable.
INTEROPERABILITY OF
METADATA
Demonstration
Communicating the
benefits
Demonstrating interoperability
with business examples
Keywords: Fort York; children; soldier; history
Creator: Jose San Juan
Asset Credit: City of Toronto
Headline: A British soldier in historical red uniform salutes children at Fort York
Communicating the
benefits
Demonstrate reuse with
business examples
▪ Write Headline once using DAL or Adobe CS: “A British soldier in historical red uniform salutes children at Fort York”
▪ Reuse Headline during design, as alt-tag for screen readers (to comply with AODA)
▪ Reuse Headline to search for files in DAL
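The reuse scenario above can be sketched in a few lines of Python. The metadata values come from the slide; the file name, the `img_tag` and `search` functions, and the dictionary layout are hypothetical stand-ins for whatever the asset library actually provides:

```python
from html import escape

# Metadata values from the slide above, entered once.
asset = {
    "Keywords": ["Fort York", "children", "soldier", "history"],
    "Creator": "Jose San Juan",
    "Asset Credit": "City of Toronto",
    "Headline": "A British soldier in historical red uniform "
                "salutes children at Fort York",
}


def img_tag(src, metadata):
    """Reuse the Headline as alt text for screen readers."""
    return f'<img src="{escape(src)}" alt="{escape(metadata["Headline"])}">'


def search(assets, query):
    """Reuse the Headline and Keywords to find matching assets."""
    needle = query.casefold()
    return [
        a for a in assets
        if needle in a["Headline"].casefold()
        or any(needle in k.casefold() for k in a["Keywords"])
    ]


print(img_tag("fort-york.jpg", asset))  # hypothetical file name
```

Because the Headline is stored once as metadata, the alt text and the search index cannot drift apart.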
USE OF STANDARDS
Why are they important?
“Let me tell you how dangerous it is to design a
classification scheme. It’s very dangerous. I have
suffered.
People attribute all kinds of motives to you. Apart from
that, if anything goes wrong, they will pounce upon
you.”
– Melvil Dewey
Dublin Core Metadata Standard
International Press Telecommunications Council – Photo Metadata
Adobe XMP – Extensible Metadata Platform
Rules for Archival Description
DUBLIN CORE
▪ maintains a vocabulary of metadata properties and encoding schemes
▪ core set of 15 properties for use in describing resources:
Contributor
Coverage
Creator
Date
Description
Format
Identifier
Language
Publisher
Relation
Rights
Source
Subject
Title
Type
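As an illustration, the 15 properties can be used to describe this presentation itself. The sketch below assumes Python and serializes a record to XML using the Dublin Core elements namespace; the `metadata` wrapper element and the `to_dublin_core_xml` function are hypothetical choices for the example:

```python
import xml.etree.ElementTree as ET

DC_NS = "http://purl.org/dc/elements/1.1/"  # Dublin Core elements namespace

DC_PROPERTIES = {
    "contributor", "coverage", "creator", "date", "description",
    "format", "identifier", "language", "publisher", "relation",
    "rights", "source", "subject", "title", "type",
}


def to_dublin_core_xml(record):
    """Serialize a dict of Dublin Core properties; reject unknown names."""
    unknown = set(record) - DC_PROPERTIES
    if unknown:
        raise ValueError(f"not Dublin Core properties: {sorted(unknown)}")
    ET.register_namespace("dc", DC_NS)
    root = ET.Element("metadata")  # hypothetical wrapper element
    for name, value in record.items():
        ET.SubElement(root, f"{{{DC_NS}}}{name}").text = value
    return ET.tostring(root, encoding="unicode")


xml_text = to_dublin_core_xml({
    "title": "Metadata Matters",
    "creator": "Rob Hanna",
    "date": "2015-04-29",
})
print(xml_text)
```

All 15 properties are optional and repeatable in Dublin Core, so the record above is valid even though it uses only three of them.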
ISO METADATA STANDARDS
▪ ISO 23081 – Metadata for Records
▪ Recommendations for metadata required to manage records
▪ Metadata about the record itself
▪ Metadata about the business rules or policies and mandates
▪ Metadata about the agents
▪ Metadata about business activities or processes
▪ Metadata about records management processes
ISO 2788 – DEVELOPMENT OF
MONOLINGUAL THESAURI
• Latest edition published in 1986
• Media- and language-agnostic
• Applicable across both broad and narrow
subject areas and describes how to deal with
multiple domains
• Intended to ensure consistency of practice
across different agencies
• Provides recommendations rather than
mandatory instructions
• Outlines optional procedures for many special
cases where a standard approach may not be
applicable
QUESTIONS?
Rob Hanna
Contact me through
• www.linkedin.com/in/singlesourceror
• rob@precisioncontent.com
WHO IS PRECISION CONTENT
AUTHORING SOLUTIONS INC.?
▪ We help organizations across North America make their information
easier to use
▪ Our solutions consist of
▪ Content strategy
▪ Detailed information architecture
▪ Content lifecycle design and development
▪ Turn-key content transformation
▪ Tools selection and development
▪ Multi-channel publishing
▪ www.precisioncontent.com

 
TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024Lonnie McRorey
 
2024 April Patch Tuesday
2024 April Patch Tuesday2024 April Patch Tuesday
2024 April Patch TuesdayIvanti
 
Time Series Foundation Models - current state and future directions
Time Series Foundation Models - current state and future directionsTime Series Foundation Models - current state and future directions
Time Series Foundation Models - current state and future directionsNathaniel Shimoni
 
Modern Roaming for Notes and Nomad – Cheaper Faster Better Stronger
Modern Roaming for Notes and Nomad – Cheaper Faster Better StrongerModern Roaming for Notes and Nomad – Cheaper Faster Better Stronger
Modern Roaming for Notes and Nomad – Cheaper Faster Better Strongerpanagenda
 
Genislab builds better products and faster go-to-market with Lean project man...
Genislab builds better products and faster go-to-market with Lean project man...Genislab builds better products and faster go-to-market with Lean project man...
Genislab builds better products and faster go-to-market with Lean project man...Farhan Tariq
 
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024BookNet Canada
 
Generative Artificial Intelligence: How generative AI works.pdf
Generative Artificial Intelligence: How generative AI works.pdfGenerative Artificial Intelligence: How generative AI works.pdf
Generative Artificial Intelligence: How generative AI works.pdfIngrid Airi González
 
A Framework for Development in the AI Age
A Framework for Development in the AI AgeA Framework for Development in the AI Age
A Framework for Development in the AI AgeCprime
 
From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .Alan Dix
 
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptxThe Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptxLoriGlavin3
 
The Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsThe Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsPixlogix Infotech
 
Generative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information DevelopersGenerative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information DevelopersRaghuram Pandurangan
 
Rise of the Machines: Known As Drones...
Rise of the Machines: Known As Drones...Rise of the Machines: Known As Drones...
Rise of the Machines: Known As Drones...Rick Flair
 
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxThe Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxLoriGlavin3
 
Long journey of Ruby standard library at RubyConf AU 2024
Long journey of Ruby standard library at RubyConf AU 2024Long journey of Ruby standard library at RubyConf AU 2024
Long journey of Ruby standard library at RubyConf AU 2024Hiroshi SHIBATA
 
Digital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptxDigital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptxLoriGlavin3
 
(How to Program) Paul Deitel, Harvey Deitel-Java How to Program, Early Object...
(How to Program) Paul Deitel, Harvey Deitel-Java How to Program, Early Object...(How to Program) Paul Deitel, Harvey Deitel-Java How to Program, Early Object...
(How to Program) Paul Deitel, Harvey Deitel-Java How to Program, Early Object...AliaaTarek5
 
How to Effectively Monitor SD-WAN and SASE Environments with ThousandEyes
How to Effectively Monitor SD-WAN and SASE Environments with ThousandEyesHow to Effectively Monitor SD-WAN and SASE Environments with ThousandEyes
How to Effectively Monitor SD-WAN and SASE Environments with ThousandEyesThousandEyes
 

Recently uploaded (20)

How to write a Business Continuity Plan
How to write a Business Continuity PlanHow to write a Business Continuity Plan
How to write a Business Continuity Plan
 
Why device, WIFI, and ISP insights are crucial to supporting remote Microsoft...
Why device, WIFI, and ISP insights are crucial to supporting remote Microsoft...Why device, WIFI, and ISP insights are crucial to supporting remote Microsoft...
Why device, WIFI, and ISP insights are crucial to supporting remote Microsoft...
 
TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024
 
2024 April Patch Tuesday
2024 April Patch Tuesday2024 April Patch Tuesday
2024 April Patch Tuesday
 
Time Series Foundation Models - current state and future directions
Time Series Foundation Models - current state and future directionsTime Series Foundation Models - current state and future directions
Time Series Foundation Models - current state and future directions
 
Modern Roaming for Notes and Nomad – Cheaper Faster Better Stronger
Modern Roaming for Notes and Nomad – Cheaper Faster Better StrongerModern Roaming for Notes and Nomad – Cheaper Faster Better Stronger
Modern Roaming for Notes and Nomad – Cheaper Faster Better Stronger
 
Genislab builds better products and faster go-to-market with Lean project man...
Genislab builds better products and faster go-to-market with Lean project man...Genislab builds better products and faster go-to-market with Lean project man...
Genislab builds better products and faster go-to-market with Lean project man...
 
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
 
Generative Artificial Intelligence: How generative AI works.pdf
Generative Artificial Intelligence: How generative AI works.pdfGenerative Artificial Intelligence: How generative AI works.pdf
Generative Artificial Intelligence: How generative AI works.pdf
 
A Framework for Development in the AI Age
A Framework for Development in the AI AgeA Framework for Development in the AI Age
A Framework for Development in the AI Age
 
From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .
 
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptxThe Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
 
The Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsThe Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and Cons
 
Generative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information DevelopersGenerative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information Developers
 
Rise of the Machines: Known As Drones...
Rise of the Machines: Known As Drones...Rise of the Machines: Known As Drones...
Rise of the Machines: Known As Drones...
 
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxThe Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
 
Long journey of Ruby standard library at RubyConf AU 2024
Long journey of Ruby standard library at RubyConf AU 2024Long journey of Ruby standard library at RubyConf AU 2024
Long journey of Ruby standard library at RubyConf AU 2024
 
Digital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptxDigital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptx
 
(How to Program) Paul Deitel, Harvey Deitel-Java How to Program, Early Object...
(How to Program) Paul Deitel, Harvey Deitel-Java How to Program, Early Object...(How to Program) Paul Deitel, Harvey Deitel-Java How to Program, Early Object...
(How to Program) Paul Deitel, Harvey Deitel-Java How to Program, Early Object...
 
How to Effectively Monitor SD-WAN and SASE Environments with ThousandEyes
How to Effectively Monitor SD-WAN and SASE Environments with ThousandEyesHow to Effectively Monitor SD-WAN and SASE Environments with ThousandEyes
How to Effectively Monitor SD-WAN and SASE Environments with ThousandEyes
 

Metadata matters

  • 1. METADATA MATTERS Metadata and Taxonomies for Organizing your Content - April 29, 2015
  • 2. ABOUT AIIM ▪ AIIM (Association for Information and Image Management) is the global community of information professionals. Our mission is to help you and your organization survive and thrive in this era of Information Chaos by solving these 4 key business problems: ▪ How do we manage the risk of growing volumes of content? ▪ How do we automate our content-intensive business processes? ▪ How do we use content to better engage and collaborate? ▪ How do we gain business insight from all of this information? ▪ www.aiim.org
  • 3. ABOUT AIIM TORONTO ▪ The First Canadian Chapter serves ▪ Toronto ▪ Montreal, and ▪ Ottawa ▪ Brings together members for education and networking ▪ Looking for volunteers to help with running the chapter
  • 4. ABOUT YOUR PRESENTER ▪ Rob Hanna, ECMs ▪ President of Precision Content Authoring Solutions Inc. and a director of AIIM First Canadian Chapter ▪ Expert in structured authoring and content management practices and technology ▪ Instructor at the University of Toronto School of Continuing Studies – Metadata and Controlled Vocabularies
  • 5. WHAT IS METADATA? And how does it relate to content?
  • 6. WHAT IS CONTENT? Data Information ContentKnowledge
  • 7. METADATA DEFINED ▪ Coined in the 1960s by Jack Myers ▪ Data about Data ▪ Stuff about Stuff ▪ Essential properties stored within the content or external to the content that identify and define context, history, and management of the content
  • 9. APPLICATION OF METADATA ▪ Metadata is ▪ applied to all structured and unstructured content in a corpus ▪ visible to the user or it can be hidden from view ▪ both machine-driven and manually entered ▪ internal or external to the content ▪ mandatory, optional, or conditional
  • 10. MANY FORMS OF METADATA ▪ Corporate metadata is structured data about content ▪ Metadata is relational or hierarchical ▪ Metadata may take the form of ▪ Rich-text or binary ▪ Plain-text ▪ Controlled values/pick-lists/lookup values ▪ Syntax encoded values ▪ date/time (e.g., yyyy-mm-dd hh:mm:ss) ▪ financial ($0.00, -$0.00) ▪ numeric - integer/floating values (#,###) ▪ boolean (true/false) ▪ special (phone numbers, postal codes, or social insurance numbers) Metadata
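The syntax-encoded values listed on slide 10 can be checked mechanically before a metadata record is saved. The following is a minimal sketch in Python, not part of the original deck; the function names and the exact patterns (for example, allowing the currency amount with or without thousands separators) are assumptions made for illustration.

```python
import re
from datetime import datetime

# Assumed pattern for the "yyyy-mm-dd hh:mm:ss" form shown on the slide.
DATETIME_SHAPE = re.compile(r"\d{4}-\d{2}-\d{2} \d{2}:\d{2}:\d{2}")
# Assumed pattern for "$0.00" / "-$0.00", with optional thousands separators.
CURRENCY_SHAPE = re.compile(r"-?\$(?:\d{1,3}(?:,\d{3})+|\d+)\.\d{2}")


def is_datetime(value):
    """True if value has the yyyy-mm-dd hh:mm:ss shape and is a real date/time."""
    if not DATETIME_SHAPE.fullmatch(value):
        return False
    try:
        datetime.strptime(value, "%Y-%m-%d %H:%M:%S")
    except ValueError:
        return False  # right shape, impossible date (e.g. February 30)
    return True


def is_currency(value):
    """True if value looks like $0.00 or -$0.00."""
    return CURRENCY_SHAPE.fullmatch(value) is not None


def is_boolean(value):
    """True if value is 'true' or 'false', ignoring case."""
    return value.lower() in {"true", "false"}
```

A record-entry form could call these checks on each field and reject the record when any of them returns False.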
  • 11. MANY ROLES OF METADATA ▪ The primary role of metadata is to facilitate the identification, retrieval, and processing of content in any media. ▪ Secondarily, metadata may also ▪ appear as content to the content consumer, and ▪ serve as corporate structured data for analysis and business intelligence. Metadata
  • 12. METADATA IS THE SOUP CAN Content is the soup
  • 13. METADATA ISN’T THE MESSAGE ▪ Twitter post (118 chars) ▪ Twitter status message metadata (1,938 chars) {"id"=>12296272736, "text"=> "An early look at Annotations: http://groups.google.com/group/twitter-api-announce/browse_thread/thread/fa5da2608865453", "created_at"=>"Fri Apr 16 17:55:46 +0000 2010", "in_reply_to_user_id"=>nil, "in_reply_to_screen_name"=>nil, "in_reply_to_status_id"=>nil, "favorited"=>false, "truncated"=>false, "user"=> {"id"=>6253282, "screen_name"=>"twitterapi", "name"=>"Twitter API", "description"=> "The Real Twitter API. I tweet about API changes, service issues and happily answer questions about Twitter and our API. Don't get an answer? It's on my website.", "url"=>"http://apiwiki.twitter.com", "location"=>"San Francisco, CA", "profile_background_color"=>"c1dfee", "profile_background_image_url"=> "http://a3.twimg.com/profile_background_images/59931895/twitterapi-background-new.png", "profile_background_tile"=>false, "profile_image_url"=>"http://a3.twimg.com/profile_images/689684365/api_normal.png", "profile_link_color"=>"0000ff", "profile_sidebar_border_color"=>"87bc44", "profile_sidebar_fill_color"=>"e0ff92", "profile_text_color"=>"000000", "created_at"=>"Wed May 23 06:01:13 +0000 2007", "contributors_enabled"=>true, "favourites_count"=>1, "statuses_count"=>1628, "friends_count"=>13, "time_zone"=>"Pacific Time (US & Canada)", "utc_offset"=>-28800, "lang"=>"en", "protected"=>false, "followers_count"=>100581, "geo_enabled"=>true, "notifications"=>false, "following"=>true, "verified"=>true}, "contributors"=>[3191321], "geo"=>nil, "coordinates"=>nil, "place"=> {"id"=>"2b6ff8c22edd9576", "url"=>"http://api.twitter.com/1/geo/id/2b6ff8c22ed9576.json", "name"=>"SoMa", "full_name"=>"SoMa, San Francisco", "place_type"=>"neighborhood", "country_code"=>"US", "country"=>"The United States of America", "bounding_box"=> {"coordinates"=> [[[-122.42284884, 37.76893497], [-122.3964, 37.76893497], [-122.3964, 37.78752897], [-122.42284884, 37.78752897]]], "type"=>"Polygon"}}, "source"=> "web"} An early look at Annotations: http://groups.google.com/group/twitter-api-announce/browse_thread/thread/fa5da2608865453
  • 14. WHY METADATA MATTERS Collection and use of metadata has been known to be controversial when viewed out of context of the content it carries. ▪ They know you rang a phone sex service at 2:24 am and spoke for 18 minutes. But they don’t know what you talked about. ▪ They know you called the suicide prevention hotline from the Golden Gate Bridge. But the topic of the call remains a secret. ▪ They know you spoke with an HIV testing service, then your doctor, then your health insurance company in the same hour. But they don’t know what was discussed. Source: Electronic Frontier Foundation, 30 December 2013
  • 15. TYPES OF METADATA The Library of Congress states that metadata consists of ▪ Descriptive Metadata ▪ Administrative Metadata, and ▪ Structural Metadata
  • 16. DESCRIPTIVE METADATA And how it is applied through classification
  • 17. THINKING ABOUT CLASSIFICATION ▪ Classification is the ordering of entities (things or concepts) into groups or classes on the basis of their similarity ▪ an activity that we do every day ▪ metadata and controlled vocabularies are tools that can be used for classification
  • 18. THINKING ABOUT CLASSIFICATION How many words can you memorize in 20 seconds? analyst brake market stapler seat traders alternator investor calculators scissors engine pedal dashboard pen backers marker tape profit starter ruler prospects
  • 19. THINKING ABOUT CLASSIFICATION 1. Filter out all of the noise: analyst brake market stapler dashboard pen backer marker seat trader alternator investor pedal calculator scissors engine tape profit starter ruler prospect
  • 20. THINKING ABOUT CLASSIFICATION 2. Break into smaller groupings: analyst brake market stapler dashboard pen backer marker seat trader alternator investor pedal calculator scissors engine tape profit starter ruler prospect
  • 21. THINKING ABOUT CLASSIFICATION 3. Organize words by similarities: dashboard alternator pedal brake seat engine starter / marker stapler scissors tape pen calculator ruler / analyst market backer investor trader profit prospect
  • 22. THINKING ABOUT CLASSIFICATION 4. Classify and label groups: Car parts (dashboard alternator pedal brake seat engine starter) / Office supplies (marker stapler scissors tape pen calculator ruler) / Stock market (analyst market backer investor trader profit prospect)
  • 23. THINKING ABOUT CLASSIFICATION How well did you do? Stock market: analyst, market, trader, investor, backer, profit, prospect / Office supplies: stapler, calculator, scissors, pen, marker, tape, ruler / Car parts: brake, seat, dashboard, engine, alternator, starter, pedal
  • 24. THINKING ABOUT CLASSIFICATION Now how many words can you memorize in 20 seconds? Vegetables: peas, endive, carrots, spinach, celery, broccoli, tomato / Computer parts: hard drive, sound card, monitor, mouse, processor, flash drive, keyboard / Instruments: violin, harp, piano, trumpet, cello, flute, guitar
  • 25. CONTROLLED VOCABULARIES ▪ Some metadata requires a classification or a controlled list of values or terms to define it, for example: ▪ Film rating: G, PG, 14A, 18A, R, A ▪ eBay seller location: ▪ Control is exercised over modifications to the list
  • 26. Controlled vocabularies defined ▪ A list of terms ▪ All terms in a controlled vocabulary must have an unambiguous, non-redundant definition. (Source: ANSI/NISO Z39.19-2005) Controlled Vocabularies What is a controlled vocabulary? Why use controlled vocabularies? Types of controlled vocabularies
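A controlled vocabulary can be enforced at the point of data entry. The sketch below, in Python, uses the film rating list from slide 25 as the vocabulary; the function name and the normalization rule (trim whitespace, ignore case) are assumptions for illustration and are not taken from the deck.

```python
# Controlled list of film ratings, copied from slide 25.
FILM_RATINGS = ("G", "PG", "14A", "18A", "R", "A")


def normalize_rating(raw):
    """Return the controlled term for raw input, or raise if it is not in the list.

    Hypothetical helper: input is trimmed and upper-cased before lookup so
    that ' pg ' and 'PG' resolve to the same, single term.
    """
    term = raw.strip().upper()
    if term not in FILM_RATINGS:
        allowed = ", ".join(FILM_RATINGS)
        raise ValueError(f"{raw!r} is not a controlled term; expected one of: {allowed}")
    return term
```

Because the list lives in one place, a change to the vocabulary is made once and applies everywhere the check is used.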
  • 27. BRIDGING BOUNDARIES - WHICH TERM IS “RIGHT”? Accessible parking spaces Accessible permit parking Disabled permit parking Designated disabled parking spaces Handicapped parking Disabled parking spaces
  • 28. TOWARDS A COMMON VOCABULARY Accessible parking spaces Accessible permit parking Disabled permit parking Designated disabled parking spaces Handicapped parking Disabled parking spaces
  • 29. CARD SORTING Techniques for developing controlled vocabularies
  • 31. TYPES OF CLASSIFICATION SCHEMES ▪ Subject ▪ Identify content topics ▪ Organization Structure ▪ Depicts business units ▪ Functional ▪ Defined by business processes
  • 32. SUBJECT TAXONOMIES ▪ Describes the topic of the resource ▪ Structured from broad to narrow / general to specific ▪ Often stable over time
  • 34. ORGANIZATION CLASSIFICATION ▪ Shows business unit relationships ▪ Can be used to identify: ▪ Ownership of content ▪ Maintenance responsibilities ▪ A person’s place in the organization ▪ Changes frequently
  • 36. FUNCTIONAL CLASSIFICATION ▪ Describes the breakdown of business processes ▪ Function – Activity - Task ▪ Stable in nature unless new processes or functions are introduced Taxonomy
  • 38. TAXONOMIES ▪ Types of taxonomies ▪ Lists ▪ Trees ▪ Hierarchies and polyhierarchies ▪ Matrices, and ▪ System maps
  • 39. TAXONOMY TYPES ▪ List style taxonomy
  • 40. TAXONOMY TYPES ▪ Simple tree style taxonomy Taxonomy Types
  • 41. TAXONOMY TYPES ▪ Classical hierarchical style taxonomy
  • 43. TAXONOMY TYPES ▪ Matrix style taxonomy ▪ With 3 facets
  • 44. TAXONOMY TYPES ▪ System map style taxonomy
  • 46. ADMINISTRATIVE METADATA ▪ Information about the metadata record itself – its creation, modification, relationship to other records, etc. ▪ Audit trails may capture the date and time when a file’s title was changed. ▪ Common subsets of administrative metadata are: ▪ Rights Management: metadata that deals with intellectual property rights ▪ Preservation: information needed to archive / preserve a resource Source: Understanding Metadata – NISO 2004
  • 47. SEPARATION OF STATUS METADATA ▪ Much of the administrative metadata is applied automatically by the system ▪ Other administrative metadata may live with the workflow rather than the record itself
  • 48. STRUCTURAL METADATA Defining the structure of a resource
  • 49. ABOUT STRUCTURAL METADATA ▪ Describe the structure of a resource ▪ Book ▪ Document ▪ Website ▪ Table of contents ▪ Site map ▪ Internal structure
  • 50. WHAT IS XML? ▪ XML (eXtensible Markup Language) is an open standard for the exchange of information ▪ first published as a working draft in 1996 by the W3C (XML 1.0 became a W3C Recommendation in 1998) ▪ to encode electronic documents readable by ▪ humans, and ▪ machines ▪ for a multitude of applications ranging from ▪ corporate financial reporting applications, to ▪ Microsoft Word
  • 51. XML IS EVERYWHERE XML defines meaningful data structures for documents and data. It is a human-readable file format used to power ▪ manufacturing assembly lines ▪ medical devices ▪ military applications, and ▪ many other things. XML is the language of the Web. It enables smart phones and web browsers.
  • 52. WHAT ARE MARKUP LANGUAGES? ▪ pre-date desktop publishing and the Internet ▪ tell computers how to handle data ▪ such as how to render electronic content on a page ▪ categorized as either ▪ presentation, or ▪ semantic markup
  • 53. PRESENTATION MARKUP ▪ With electronic presentation markup, we mark up the paragraph and italicize the citation for publication ▪ This is typical of web pages using hypertext markup (HTML) The Cancer Journal: The Journal of Principles & Practice of Oncology provides an integrated view of modern oncology across all disciplines. <p><i>The Cancer Journal: The Journal of Principles &amp; Practice of Oncology</i> provides an integrated view of modern oncology across <i>all</i> disciplines.</p> The Cancer Journal: The Journal of Principles & Practice of Oncology provides an integrated view of modern oncology across all disciplines.
  • 54. SEMANTIC MARKUP ▪ With semantic markup, we mark up the content to describe the meaning of the text ▪ Publishing stylesheets interpret the meaning from the markup and apply appropriate styles specific to the publishing context The Cancer Journal: The Journal of Principles & Practice of Oncology provides an integrated view of modern oncology across all disciplines. <intro><cite>The Cancer Journal: The Journal of Principles &amp; Practice of Oncology</cite> provides an integrated view of modern oncology across <em>all</em> disciplines.</intro> The Cancer Journal: The Journal of Principles & Practice of Oncology provides an integrated view of modern oncology across all disciplines. The Cancer Journal: The Journal of Principles & Practice of Oncology provides an integrated view of modern oncology across all disciplines.
  • 55. SEMANTIC MARKUP ▪ Using semantic markup, we can ▪ disambiguate content ▪ search based on meaning ▪ connect to other content, and ▪ reuse or substitute new text.
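The idea that publishing stylesheets interpret semantic markup differently per output can be sketched in a few lines of Python using the standard library's `xml.etree.ElementTree`. This is an illustration only: the two style tables, the `render` function, and the choice of wrappers are hypothetical and stand in for real stylesheets. The sample text is the `<intro>` example from slide 54, with the ampersand escaped as `&amp;` so that it parses as well-formed XML.

```python
import xml.etree.ElementTree as ET
from html import escape

SOURCE = (
    "<intro><cite>The Cancer Journal: The Journal of Principles &amp; "
    "Practice of Oncology</cite> provides an integrated view of modern "
    "oncology across <em>all</em> disciplines.</intro>"
)

# Hypothetical "stylesheets": semantic element name -> (opening, closing) wrapper.
HTML_STYLE = {"cite": ("<i>", "</i>"), "em": ("<b>", "</b>")}
TEXT_STYLE = {"cite": ('"', '"'), "em": ("*", "*")}


def render(elem, style, encode=lambda s: s):
    """Recursively render elem, wrapping each element per the style table."""
    opening, closing = style.get(elem.tag, ("", ""))
    parts = [opening, encode(elem.text or "")]
    for child in elem:
        parts.append(render(child, style, encode))
        parts.append(encode(child.tail or ""))  # text that follows the child
    parts.append(closing)
    return "".join(parts)


intro = ET.fromstring(SOURCE)
as_html = render(intro, HTML_STYLE, lambda s: escape(s, quote=False))
as_text = render(intro, TEXT_STYLE)

# Searching by meaning: find every citation, whatever it looks like on the page.
citations = [cite.text for cite in intro.iter("cite")]
```

The same source produces italic and bold HTML in one channel and quoted, asterisk-marked plain text in another, and the `citations` lookup works without knowing how a citation is styled.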
  • 56. MULTI-CHANNEL PUBLISHING ▪ Supports complex, multi-channel publishing to many common output formats ▪ Add new formats or styles easily
  • 57. INTELLIGENT CONTENT ▪ Content that is ▪ not limited to one ▪ purpose ▪ technology, or ▪ output ▪ structurally rich and semantically aware, making it ▪ discoverable ▪ reusable ▪ reconfigurable, and ▪ adaptable.
  • 59. Communicating the benefits Demonstrating interoperability with business examples Keywords Fort York; children, soldier, history Creator Jose San Juan Asset Credit City of Toronto Headline A British soldier in historical red uniform salutes children at Fort York
  • 60. Communicating the benefits Demonstrate reuse with business examples ▪ Write Headline once using DAL or Adobe CS: “A British soldier in historical red uniform salutes children at Fort York” ▪ Reuse Headline during design, as alt-tag for screen readers (to comply with AODA) ▪ Reuse Headline to search for files in DAL
  • 61. USE OF STANDARDS Why are they important?
  • 62. “Let me tell you how dangerous it is to design a classification scheme. It’s very dangerous. I have suffered. People attribute all kinds of motives to you. Apart from that, if anything goes wrong, they will pounce upon you.” – Melvil Dewey
  • 63. Dublin Core Metadata Standard International Press Telecommunications Council – Photo Metadata Adobe XMP – Extensible Metadata Platform Rules for Archival Description
  • 64. DUBLIN CORE ▪ maintains a vocabulary of metadata properties and encoding schemes ▪ core set of 15 properties for use in describing resources: Metadata Contributor Coverage Creator Date Description Format Identifier Language Publisher Relation Rights Source Subject Title Type
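As a rough sketch of how the 15 core properties might be used, the Python below serializes a small record as XML in the Dublin Core elements namespace (`http://purl.org/dc/elements/1.1/`). The `to_dublin_core` function, the `<metadata>` wrapper element, and the mapping of the Fort York photo fields from slide 59 onto Dublin Core properties (Headline to title, Keywords to subject, Asset Credit to rights) are assumptions for illustration, not something the deck specifies.

```python
import xml.etree.ElementTree as ET

DC_NS = "http://purl.org/dc/elements/1.1/"

# The 15 core properties listed on slide 64, in lower case as element names.
DC_ELEMENTS = {
    "contributor", "coverage", "creator", "date", "description",
    "format", "identifier", "language", "publisher", "relation",
    "rights", "source", "subject", "title", "type",
}


def to_dublin_core(record):
    """Build an XML element from {property: [values]}, rejecting unknown properties."""
    unknown = set(record) - DC_ELEMENTS
    if unknown:
        raise ValueError(f"not Dublin Core properties: {sorted(unknown)}")
    ET.register_namespace("dc", DC_NS)
    root = ET.Element("metadata")  # hypothetical wrapper element
    for name in sorted(record):
        for value in record[name]:  # properties may repeat
            ET.SubElement(root, f"{{{DC_NS}}}{name}").text = value
    return root


# Assumed mapping of the slide 59 photo fields onto Dublin Core properties.
photo = {
    "title": ["A British soldier in historical red uniform salutes children at Fort York"],
    "creator": ["Jose San Juan"],
    "subject": ["Fort York", "children", "soldier", "history"],
    "rights": ["City of Toronto"],
}
xml_text = ET.tostring(to_dublin_core(photo), encoding="unicode")
```

Rejecting unknown property names keeps locally invented fields out of records that other systems expect to be plain Dublin Core.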
  • 65. ISO METADATA STANDARDS ▪ ISO 23081 – Metadata for Records ▪ Recommendations for metadata required to manage records ▪ Metadata about the record itself ▪ Metadata about the business rules or policies and mandates ▪ Metadata about the agents ▪ Metadata about business activities or processes ▪ Metadata about records management processes Metadata
  • 66. ISO 2788 – DEVELOPMENT OF MONOLINGUAL THESAURI • Latest edition published in 1986 • Media- and Language-Agnostic • Applicable across both broad and narrow subject areas and describes how to deal with multiple domains • Intended to ensure consistency of practice across different agencies • Provides recommendations rather than mandatory instructions • Outlines optional procedures for many special cases where a standard approach may not be applicable Thesaurus
  • 67. QUESTIONS? Rob Hanna Contact me through • www.linkedin.com/in/singlesourceror • rob@precisioncontent.com
  • 68. WHO IS PRECISION CONTENT AUTHORING SOLUTIONS INC.? ▪ We help organizations across North America make their information easier to use ▪ Our solutions consist of ▪ Content strategy ▪ Detailed information architecture ▪ Content lifecycle design and development ▪ Turn-key content transformation ▪ Tools selection and development ▪ Multi-channel publishing ▪ www.precisioncontent.com