Good metadata is critical to helping people find information. Metadata can be used to enhance search tools, drive navigation and relate documents to one another. Unfortunately, manually adding metadata to content is cumbersome for small batches of content and impractical or impossible for large content sets.
Enterprise Knowledge understands the difficulty and importance of maintaining metadata. In this session, we will share six ways to simplify or automate metadata management, even on extremely large content sets. We will share the tools and techniques we have used with our clients to make metadata management possible, and provide real-world examples of how these techniques can be applied to your content.
Today, I want to share some ways that our customers have solved the metadata management problem.
Properly tagged content makes a number of things possible:
It improves findability.
It aids in the management of content.
It provides context that helps identify relationships within a large corpus of information.
Many of my clients understand the importance of metadata, but are frustrated with the effort it takes to keep it up to date.
I hear things like:
“My search would be great if only I had the budget to get all of my content tagged.”
“My users are just going to have to be disappointed. I cannot afford to tag my content to do things right.”
In many cases, manual tagging should be the last option, not the only option.
It amazes me how often we look at the content our clients are trying to tag and realize that there is implied metadata that they could be taking advantage of.
Implied metadata is very cheap and requires no manual intervention.
The authors on staff had specific areas of responsibility. For example, one author might be responsible for covering advances in Knowledge Management. We can use linked metadata to tag that author's stories as related to Knowledge Management.
The great thing about linked metadata is that administrators only need to manage the relationships as opposed to the individual pieces of content. This approach tends to scale quite well.
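To make the idea concrete, here is a minimal sketch of linked metadata in Python. The author IDs, topic names, and the `Story` class are hypothetical and exist only for illustration; the point is that an administrator maintains the author-to-topic relationships in one place, and every story inherits its tags from them.

```python
from dataclasses import dataclass, field

# Hypothetical author-to-topic relationships. This mapping is the only
# thing an administrator has to maintain.
AUTHOR_TOPICS = {
    "jsmith": {"Knowledge Management"},
    "mlee": {"Enterprise Search", "Taxonomy"},
}


@dataclass
class Story:
    title: str
    author: str
    tags: set = field(default_factory=set)


def apply_linked_metadata(story, author_topics):
    """Add the topics linked to the story's author; existing tags are kept."""
    story.tags |= author_topics.get(story.author, set())
    return story


stories = [
    Story("Capturing tacit knowledge", "jsmith"),
    Story("Tuning search relevancy", "mlee"),
    Story("Office move update", "unknown"),  # no relationship, so no tags
]
for story in stories:
    apply_linked_metadata(story, AUTHOR_TOPICS)
```

If an author's area of responsibility changes, only the mapping needs to be updated and the tagging can be re-run across the whole content set.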
Entity enrichment can get more expensive. The good news is that there are a number of vendors here at the conference that specialize in this capability. I encourage all of you to stop by their booths to learn more about how these tools work.
It is important to note that there is no single best way to do auto-categorization. Everyone’s content is different. Pilot solutions with one or more vendors to see which solution is best for you.
Content with consistent structure is a great candidate for Pattern Matching. Analyze content to find patterns that can be exploited to extract metadata. Good candidates include forms and standard contracts.
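As a rough illustration, the sketch below applies pattern matching to a standard contract using Python regular expressions. The field labels ("Contract No.", "Effective Date:", "Vendor:") and the sample text are assumptions made up for this example; real patterns would come from analyzing your own forms and contracts.

```python
import re

# Hypothetical patterns for a contract template with consistent labels.
CONTRACT_PATTERNS = {
    "contract_number": re.compile(r"Contract No\.\s*([A-Z]{2}-\d{4})"),
    "effective_date": re.compile(r"Effective Date:\s*(\d{4}-\d{2}-\d{2})"),
    "vendor": re.compile(r"Vendor:\s*(.+)"),
}


def extract_metadata(text, patterns):
    """Return a dict holding the first match found for each named pattern."""
    metadata = {}
    for name, pattern in patterns.items():
        match = pattern.search(text)
        if match:
            metadata[name] = match.group(1).strip()
    return metadata


sample = """Contract No. AB-1234
Effective Date: 2015-03-01
Vendor: Example Supplies Inc.
Terms: Net 30
"""

contract_metadata = extract_metadata(sample, CONTRACT_PATTERNS)
```

Fields whose pattern does not match are simply left out of the result, which makes it easy to flag documents that deviate from the expected structure.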
Search results by definition have something in common. Batch metadata management allows the tagger to update multiple pieces of content at the same time.
The best way to describe this is to think of Filters in Google Mail. You can search for a term in your mail, select one or more emails, and add a filter (actually a tag).
Batch Metadata Management is best for topical requirements where completeness is not critical.
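The search-then-tag workflow can be sketched in a few lines of Python. The in-memory document list and the simple substring search are stand-ins chosen for illustration; in practice the search step would be your search engine and the tagging step would write back to your content repository.

```python
def search(documents, term):
    """Return the documents whose text contains the term (case-insensitive)."""
    term = term.lower()
    return [doc for doc in documents if term in doc["text"].lower()]


def batch_tag(results, tag):
    """Add one tag to every document in a result set; return how many were tagged."""
    for doc in results:
        doc.setdefault("tags", set()).add(tag)
    return len(results)


# Hypothetical content set.
documents = [
    {"id": 1, "text": "Taxonomy design workshop notes"},
    {"id": 2, "text": "Quarterly budget review"},
    {"id": 3, "text": "Updating the taxonomy for the intranet"},
]

# One search, one action, and every matching document is tagged.
tagged_count = batch_tag(search(documents, "taxonomy"), "Taxonomy")
```

Because the tag only reaches documents the search happens to return, anything the query misses stays untagged, which is why this approach fits cases where completeness is not critical.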