Tracking Workflow with
Embedded Metadata
Clay Helberg
SPSS Inc
Characteristics of Documents

• Who created it
• What project(s) use it
• When it was last changed
• Why it was changed
• What its current status is
Metadata

• Answers to these questions stored in the system
Two Approaches to Metadata

• External system
  – Metadata stored in a database, a CMS, or some other system separately
    from the document content
  – Pros: Optimized for searching, available in canned systems
  – Cons: Overhead of separate systems, data aren’t attached to what they
    describe
• Internal (embedded) system
  – Metadata stored directly in the documents themselves
  – Pros: Data are attached directly to what they describe, easy to
    edit/update
  – Cons: Not as widely supported in canned systems, might have to build
    your own support tools
Our Metadata Scheme

• We chose an embedded system
  – Cost of canned CMS was prohibitive
  – Already had many CMS features in our source control system
  – Allowed us to design the system to suit our needs
  – Allowed us to grow the system incrementally
Metadata Structure

• We break our metadata into two basic types:
  – Attribute metadata, things that tend to be fixed for a given piece
    of content
     • Owner, project, reference documents
  – Status metadata, things that tend to change over time
     • Revisions, write status and review status
Attribute Metadata

• This is encapsulated in a <metadata> element near the top of
  each piece of content
• Metadata contains information on the document or fragment:
  – The document’s owner
  – Any technical contacts for the document (engineers, statisticians)
  – Reference documents relevant to the material (designs, functional
    specs, primary sources)
  – General revision information that would affect the whole
    document or fragment
Attribute Metadata Example
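The example on this slide was an image and is not reproduced here. As a stand-in, the following is a minimal sketch of what a topic with attribute metadata might look like and how a tool could read it. Only the `<metadata>` element name comes from the slides; the child element names (`owner`, `project`, `contact`, `refdoc`), the `role` attribute, and all values are assumptions made for illustration.

```python
import xml.etree.ElementTree as ET

# Hypothetical topic: only the <metadata> element name comes from the slides.
# Child element names, attributes, and values are illustrative assumptions.
TOPIC_XML = """
<topic id="t-example">
  <metadata>
    <owner>author1</owner>
    <project>ExampleProject</project>
    <contact role="engineer">engineer1</contact>
    <contact role="statistician">statistician1</contact>
    <refdoc>functional-spec.doc</refdoc>
  </metadata>
  <title>Example topic</title>
  <para>Body text.</para>
</topic>
"""


def read_attribute_metadata(xml_text):
    """Return the attribute metadata of one topic as a plain dict."""
    root = ET.fromstring(xml_text)
    meta = root.find("metadata")
    if meta is None:
        raise ValueError("topic has no <metadata> element")
    return {
        "owner": meta.findtext("owner"),
        "project": meta.findtext("project"),
        "contacts": [(c.get("role"), c.text) for c in meta.findall("contact")],
        "refdocs": [r.text for r in meta.findall("refdoc")],
    }


attribute_metadata = read_attribute_metadata(TOPIC_XML)
```

Because the metadata travels inside the content, a script like this needs nothing but the source file to answer "who owns this" or "which specs does it depend on".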
Status Metadata

• Status metadata appears inline in the document in a
  <revision> element
• Revisions give information on various aspects of a document
  change:
  – Who made the change
  – When it was made
  – The version of the project to which the change applies
  – A comment describing the change
  – Writing status (to do, writing, ready for edit, complete, deferred)
  – Review status (pending, complete)
Status Metadata Example
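The example on this slide was also an image. The sketch below shows one way an inline `<revision>` element could carry the fields listed above, plus a small check of the status values. The `<revision>` element name and the status values come from the slides; the attribute names (`author`, `date`, `version`, `writestatus`, `reviewstatus`), the choice to store the comment as element text, and the placeholder values are assumptions.

```python
import xml.etree.ElementTree as ET

# Status values are taken from the slides; attribute names are assumptions.
WRITE_STATUSES = {"to do", "writing", "ready for edit", "complete", "deferred"}
REVIEW_STATUSES = {"pending", "complete"}

# Hypothetical section with one inline revision (placeholder values).
SECTION_XML = """
<section>
  <title>Example section</title>
  <revision author="author1" date="2000-01-01" version="1.0"
            writestatus="ready for edit" reviewstatus="pending">Updated the procedure steps.</revision>
  <para>Body text.</para>
</section>
"""


def check_revision(rev):
    """Return a list of problems found in one <revision> element."""
    problems = []
    for name in ("author", "date", "version"):
        if not rev.get(name):
            problems.append(f"missing {name}")
    if rev.get("writestatus") not in WRITE_STATUSES:
        problems.append("unknown write status")
    if rev.get("reviewstatus") not in REVIEW_STATUSES:
        problems.append("unknown review status")
    return problems


revisions = ET.fromstring(SECTION_XML).findall("revision")
revision_problems = [check_revision(r) for r in revisions]
```

In a DTD-based setup the allowed status values would more likely be enforced by the DTD itself as enumerated attribute values; the check here only illustrates the constraint.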
Tools for Managing Metadata

• Edit tags
  – Metadata is part of the DTD, so we can edit the tags directly as
    necessary
  – But this proved awkward and error-prone
• Specialized tools
  – AE customizations to streamline the process
     • Remind authors to add metadata when they create/change
       content
     • Custom dialog to make editing metadata easy
     • Auto-complete as much as possible to minimize overhead
AE Customizations: Display of Metadata

• Highlighted in green
• Menu item allows hiding/showing of metadata
• (Demo)
Showing/Hiding Metadata
AE Customizations: Entering Attribute Metadata

• When new content is created, author is prompted and basic
  metadata is created automatically
• Author must enter information on technical contacts or
  reference documents
New Metadata Prompt
AE Customizations: Entering Status Metadata

• When content is changed, the author is prompted on save to add
  a revision element for each topic changed
• Custom dialog auto-inserts author’s ID and today’s date
• Author supplies the rest of the information for each topic
  changed
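The auto-fill behavior described above can be sketched outside the editor as a small helper. This is not the AE customization itself, only an illustration of the idea: the tool supplies the author ID and date, and the author supplies the rest. The function name, the attribute names, and the default review status of "pending" are assumptions.

```python
import xml.etree.ElementTree as ET
from datetime import date


def new_revision(author_id, version, comment, write_status, today=None):
    """Build a <revision> element, pre-filling the author ID and the date.

    The caller (standing in for the dialog) supplies version, comment, and
    write status; review status starts as "pending" (an assumed default).
    """
    today = today or date.today()
    rev = ET.Element("revision", {
        "author": author_id,
        "date": today.isoformat(),
        "version": version,
        "writestatus": write_status,
        "reviewstatus": "pending",
    })
    rev.text = comment
    return rev


# Usage with placeholder values; the date is fixed so the result is repeatable.
example_revision = new_revision(
    "author1", "1.0", "Fixed a typo.", "writing", today=date(2000, 1, 2)
)
example_markup = ET.tostring(example_revision, encoding="unicode")
```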
Custom Dialogs for Status Metadata
AE Customizations: Metadata Reports

• Menu item to get summaries of the metadata for a document
  or project
• Summary report is dynamic HTML
• Report can be sorted and filtered based on any of the revision
  attributes
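The sorting and filtering behind such a report can be sketched as follows. The actual report is dynamic HTML generated from the editor; this sketch only shows the data step, gathering every revision in a document and then filtering and sorting on any attribute. The element and attribute names and the sample data are assumptions.

```python
import xml.etree.ElementTree as ET

# Hypothetical document with three revisions (placeholder values).
DOC_XML = """
<document>
  <topic id="t1">
    <revision author="author1" date="2000-01-05" version="1.0"
              writestatus="complete" reviewstatus="pending">First draft.</revision>
  </topic>
  <topic id="t2">
    <revision author="author2" date="2000-01-03" version="1.0"
              writestatus="writing" reviewstatus="pending">Started topic.</revision>
    <revision author="author1" date="2000-01-09" version="1.1"
              writestatus="ready for edit" reviewstatus="pending">Added examples.</revision>
  </topic>
</document>
"""


def collect_revisions(root):
    """Flatten every <revision> into a dict that also records its topic id."""
    rows = []
    for topic in root.iter("topic"):
        for rev in topic.iter("revision"):
            row = dict(rev.attrib)
            row["topic"] = topic.get("id")
            row["comment"] = (rev.text or "").strip()
            rows.append(row)
    return rows


def report(rows, sort_by="date", **filters):
    """Keep rows matching every filter, then sort on one attribute."""
    kept = [r for r in rows if all(r.get(k) == v for k, v in filters.items())]
    return sorted(kept, key=lambda r: r.get(sort_by, ""))


rows = collect_revisions(ET.fromstring(DOC_XML))
by_author1 = report(rows, sort_by="date", author="author1")
```

Storing dates in ISO form (an assumption here) lets a plain string sort put revisions in chronological order.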
HTML Metadata Report
HTML Metadata Report Options
AE Customizations: Metadata Reports

• We can also produce PDF output which includes revision
  information in one of two formats:
  – Inline revision information
  – Summary table at the top of each document
Custom Print Dialog with Metadata Options
Inline Revision Information
Revision Summary Table
To do

• Still on the wish list:
   – Automatic notification of content changes
      • To owner of document (if author of change is not owner)
      • To owners of documents or projects that contain references to
        that content
   – Automatic notification of status changes
      • Notify editors when revisions are marked “ready for edit”
      • Notify production staff when documents are marked “complete”
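Since these items were still on the wish list, nothing below reflects an existing tool. It is only a sketch of the routing rules as stated in the bullets above, with hypothetical function names and recipient lists passed in by the caller.

```python
def content_change_recipients(owner, change_author, referencing_owners):
    """Who to notify when content changes.

    The owner is notified only if someone else made the change; owners of
    documents or projects that reference the content are always notified.
    """
    recipients = set(referencing_owners)
    if change_author != owner:
        recipients.add(owner)
    return sorted(recipients)


def status_change_recipients(write_status, editors, production_staff):
    """Who to notify when a write status changes."""
    if write_status == "ready for edit":
        return sorted(editors)
    if write_status == "complete":
        return sorted(production_staff)
    return []


# Usage with placeholder IDs.
changed_by_other = content_change_recipients("author1", "author2", ["author3"])
changed_by_owner = content_change_recipients("author1", "author1", ["author3"])
```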
Questions?

• Contact info:
  Clay Helberg
  SPSS Inc.
  chelberg@spss.com