The document discusses semantic metadata and how it can be used to improve content discovery and integration. It defines semantic metadata as a consistent, rules-based information layer that exposes the meaning of data so computers can perform more sophisticated tasks. Tagging content with terms from controlled vocabularies and taxonomies allows computers to better understand relationships and provide more precise search results. The benefits of semantic metadata include precision in discovery, computable context links between related content, and insights into collection gaps and trends.
Slides for the class, From Pattern Matching to Knowledge Discovery Using Text Mining and Visualization Techniques, presented June 13, 2010, at the Special Libraries Association 2010 annual meeting.
The whitepaper addresses the challenges in the data–driven organizations, medical research and health care. It summarizes how the context-enabled and semantic enrichment can transform the traditional method to search optimum data. 3RDi has advanced content enrichment with Named Entity Recognition, Semantic similarity, Content classification and Content summarization. Get the right data at the right time that helps medical researchers and health care practitioners.
Slides for the class, From Pattern Matching to Knowledge Discovery Using Text Mining and Visualization Techniques, presented June 13, 2010, at the Special Libraries Association 2010 annual meeting.
The whitepaper addresses the challenges in the data–driven organizations, medical research and health care. It summarizes how the context-enabled and semantic enrichment can transform the traditional method to search optimum data. 3RDi has advanced content enrichment with Named Entity Recognition, Semantic similarity, Content classification and Content summarization. Get the right data at the right time that helps medical researchers and health care practitioners.
Semantic Web in Action: Ontology-driven information search, integration and a...Amit Sheth
Amit Sheth's Keynote talk given at: “Semantic Web in Action: Ontology-driven information search, integration and analysis,” Net Object Days 2003 and MATES03, Erfurt, Germany, September 23, 2003. http://knoesis.org
Note: slides 51-55 have audio.
Identifying Security Risks Using Auto-Tagging and Text AnalyticsEnterprise Knowledge
On Thursday, November 10, Joe Hilger and Sara Duane spoke at Text Analytics Forum about identifying secure and confidential information using auto-tagging. Information security continues to grow in importance in today's society. We hear stories all of the time about hackers accessing private information from companies and government agencies. Every organization struggles with employees who store confidential information on insecure network drives or cloud drives. Joe and Sara did a project with a federal research organization that used auto-tagging and text analytics to identify confidential information that needed to be moved to a secure location. During the presentation, we shared the approach we took to identify this information and how we made sure that the tagging and text analytics were accurate. Attendees learned best practices for designing a taxonomy for auto-tagging and tuning auto-tagging as well as ways to identify confidential information across the enterprise.
Content Management, Metadata and Semantic WebAmit Sheth
Keynote given at NetObjectDays conference, Erfurt, September 11, 2001.
One of the earliest keynotes discussing commercial semantic web technologies, semantic web applications (including semantic search, semantic targeting, semantic content management). Prof. Sheth started a Semantic Web company Taalee, Inc. in 1999 (Product was MediaAnywhere A/V search engine),that merged to become Voquette in 2001 (product was called SCORE), Semagix in 2004 (product was called Semagix Freedom), and then Fortent in 2006 (products included Know Your Customers). Additional details can be found in U.S. Patent #6311194, 30 Oct. 2001 (filed 2000).
Note: the commercial system used "WorldModel" as at the time, business customers were not yet warm to "Ontology" - the concept/intent is the same. More recent information at http://knoesis.org
Amit P. Sheth, “Relationships at the Heart of Semantic Web: Modeling, Discovering, Validating and Exploiting Complex Semantic Relationships,” Keynote at the 29th Conference on Current Trends in Theory and Practice of Informatics (SOFSEM 2002), Milovy, Czech Republic, November 22–29, 2002.
Keynote: http://www.sofsem.cz/sofsem02/keynote.html
Related paper: http://knoesis.wright.edu/?q=node/2063
The presentation I gave at the 2007 Semantic Technology Conference. Declarative programming” has become the latest buzzword to describe languages that abstractly define systems requirements (the what) and leave the implementation (the how) to be determined by an independent process. This makes the semantics (meaning) of declarative data elements even more critical as these systems are shared between organizations. This presentation: (1) Provides a background of declarative programming (2) Describes why understanding the semantic aspects of declarative systems is critical to cost-effective software development.
Semantic search helps business people find answers to pressing questions by wading through oceans of information to find nuggets of meaningful information. In this presentation we’ll discuss how semantic search and content analysis technologies are starting to appear in the marketplace today. We’ll provide a recap of what semantic search is and what the key benefits are, then we’ll answer the following questions:
• Is semantic search a feature, an application, or enterprise system?
• How can I add semantic search to my existing work processes?
• Will I need to replace my existing content technologies?
• What will I need to do to prepare my content for semantic search?
• Is semantic search just for documents or can I search my data too?
• Can I use semantic search to find information on the internet and other public data sources?
• Are there standards to consider?
SEMANTIC NETWORK BASED MECHANISMS FOR KNOWLEDGE ACQUISITIONcscpconf
This article summarizes research work started with the SeiPro2S (Semantically Enhanced Intellectual Property Protection System) system designed to protect resources from the unauthorized use of intellectual property. The system implements semantic network as a structure of knowledge representation and a new idea of semantic compression. As the author proved that semantic compression is viable concept for English, he decided to focus on potential applications. An algorithm is presented that employing semantic network WiSENet for knowledge acquisition with flexible rules that yield high precision results. Developed algorithm is implemented as a Finite State Automaton with advanced methods for triggering desired actions. Detailed discussion is given with description of devised algorithm, usage examples and results of experiments.
Semantic Web in Action: Ontology-driven information search, integration and a...Amit Sheth
Amit Sheth's Keynote talk given at: “Semantic Web in Action: Ontology-driven information search, integration and analysis,” Net Object Days 2003 and MATES03, Erfurt, Germany, September 23, 2003. http://knoesis.org
Note: slides 51-55 have audio.
Identifying Security Risks Using Auto-Tagging and Text AnalyticsEnterprise Knowledge
On Thursday, November 10, Joe Hilger and Sara Duane spoke at Text Analytics Forum about identifying secure and confidential information using auto-tagging. Information security continues to grow in importance in today's society. We hear stories all of the time about hackers accessing private information from companies and government agencies. Every organization struggles with employees who store confidential information on insecure network drives or cloud drives. Joe and Sara did a project with a federal research organization that used auto-tagging and text analytics to identify confidential information that needed to be moved to a secure location. During the presentation, we shared the approach we took to identify this information and how we made sure that the tagging and text analytics were accurate. Attendees learned best practices for designing a taxonomy for auto-tagging and tuning auto-tagging as well as ways to identify confidential information across the enterprise.
Content Management, Metadata and Semantic WebAmit Sheth
Keynote given at NetObjectDays conference, Erfurt, September 11, 2001.
One of the earliest keynotes discussing commercial semantic web technologies, semantic web applications (including semantic search, semantic targeting, semantic content management). Prof. Sheth started a Semantic Web company Taalee, Inc. in 1999 (Product was MediaAnywhere A/V search engine),that merged to become Voquette in 2001 (product was called SCORE), Semagix in 2004 (product was called Semagix Freedom), and then Fortent in 2006 (products included Know Your Customers). Additional details can be found in U.S. Patent #6311194, 30 Oct. 2001 (filed 2000).
Note: the commercial system used "WorldModel" as at the time, business customers were not yet warm to "Ontology" - the concept/intent is the same. More recent information at http://knoesis.org
Amit P. Sheth, “Relationships at the Heart of Semantic Web: Modeling, Discovering, Validating and Exploiting Complex Semantic Relationships,” Keynote at the 29th Conference on Current Trends in Theory and Practice of Informatics (SOFSEM 2002), Milovy, Czech Republic, November 22–29, 2002.
Keynote: http://www.sofsem.cz/sofsem02/keynote.html
Related paper: http://knoesis.wright.edu/?q=node/2063
The presentation I gave at the 2007 Semantic Technology Conference. Declarative programming” has become the latest buzzword to describe languages that abstractly define systems requirements (the what) and leave the implementation (the how) to be determined by an independent process. This makes the semantics (meaning) of declarative data elements even more critical as these systems are shared between organizations. This presentation: (1) Provides a background of declarative programming (2) Describes why understanding the semantic aspects of declarative systems is critical to cost-effective software development.
Semantic search helps business people find answers to pressing questions by wading through oceans of information to find nuggets of meaningful information. In this presentation we’ll discuss how semantic search and content analysis technologies are starting to appear in the marketplace today. We’ll provide a recap of what semantic search is and what the key benefits are, then we’ll answer the following questions:
• Is semantic search a feature, an application, or enterprise system?
• How can I add semantic search to my existing work processes?
• Will I need to replace my existing content technologies?
• What will I need to do to prepare my content for semantic search?
• Is semantic search just for documents or can I search my data too?
• Can I use semantic search to find information on the internet and other public data sources?
• Are there standards to consider?
SEMANTIC NETWORK BASED MECHANISMS FOR KNOWLEDGE ACQUISITIONcscpconf
This article summarizes research work started with the SeiPro2S (Semantically Enhanced Intellectual Property Protection System) system designed to protect resources from the unauthorized use of intellectual property. The system implements semantic network as a structure of knowledge representation and a new idea of semantic compression. As the author proved that semantic compression is viable concept for English, he decided to focus on potential applications. An algorithm is presented that employing semantic network WiSENet for knowledge acquisition with flexible rules that yield high precision results. Developed algorithm is implemented as a Finite State Automaton with advanced methods for triggering desired actions. Detailed discussion is given with description of devised algorithm, usage examples and results of experiments.
This presentation, part of our Redefining the IT Lexicon (applied innovation series), introduces a new term related to the field of Semantic technology. That term is Semantic Intelligence. This briefing was provided by Semantech's InnovationWorx division.
Securing Your Digital Assets slides NYC July 14, 2015Darrell W. Gunter
Cyber Security is an issue for all companies and everyone. Our executive panel will explore the definition of cyber security and the key steps to creating a culture and program to protect your assets.
Darrell W. Gunter, CEO of Gunter Media Group, Inc. leads a panel of industry experts on the subject of "Chunking" book content to extend your content's value. The panel was represented Robert Kasher of First Source, Mike Shannon of Packback books and Laura Dawson of Bowker.
Barbara Meyers Ford provides a foundation of what is going on in Social Media for scholarly publishers. Her panel includes Darrell Gunter of Gunter Media Group and Bill Jackson Assistant Professor, Dept of Microbiology and Molecular Genetics of Medical College of Wisconsin
How Does Advertising in the Professional Scholarly Publishing Industry Work?Darrell W. Gunter
Join Darrell W. Gunter and his panelist Carr Davis Co-CEO, ASC Partners, LLC and Luis Portero, VP Sales, Advertising, Elsevier explore advertising and how it will develop over the next decade.
This presentation to the ASIDIC spring meeting provided a Case study from the American Association of Cancer Research and how they improved their peer review process utilizing the Collexis Reviewer Finder application.
In this presentation to the ASIDIC conference Darrell draws the correlation between the neighborhood grocery and the internet. Very interesting comparison
3. Definition The Semantic Web provides a common framework that allows data to be shared and reused across application, enterprise, and community boundaries. --W3C Semantic Web Activity Definition
4. Beyond Documents The Semantic Web requires us to go beyond documents and think of our content as data. For example: 1 practice guideline = 1 document OR 1 practice guideline = 312 distinct pieces of data This comes more naturally to industries that have traditionally dealt with uniform data (finance, travel)
6. If the airlines treated their data the way publishers did… This Week’s Departures (PDF, 45K) This Week’s Arrivals (PDF, 52K)
7. The Semantic Layer The semantic layer is an evolution of traditional web <meta> data. It is a consistent, rules-based information layer for computer logic parsing. It is a method for exposing the meaning of data so the computer can perform more sophisticated cognitive tasks.
8. Parallel Data For Humans: The Narrative Layer Chapter 23: Numbness, Tingling, and Sensory Loss Normal somatic sensation reflects a continuous monitoring process, little of which reaches consciousness under ordinary conditions. By contrast, disordered sensation, particularly when experienced as painful, is alarming and… For Computers: The Semantic Layer <semantics controlvocab=“UMLS”> <tag> <root-term termID="28648">sensation disorders</root-term> <sub-term termID="180">classification</sub-term> <sub-term termID="6138">terminology</sub-term> </tag> <tag> <root-term termID="39923">sensory testing</root-term> </tag> </semantics>
10. Order of Complexity Less Complex Term listSimple set of words used in text Controlled vocabulary Uses only approved terms Taxonomy Includes structural hierarchy (parent/child) Ontology Limitless relationship types defined in system More Complex
11. Taxonomy as Semantic Foundation The taxonomy is the framework for the semantic layer and semantic tagging—crucial for concept normalization and hierarchies Industry standard taxonomies facilitate integration Taxonomies are living creatures—they should be actively managed by an expert team (e.g. Silverchair Cortex is updated every day)
12. Normalization Authors use different terminology in different books, journal articles, and even in the same book. A semantic layer with a controlled vocabulary will normalize these differences and make user-data connections smarter. This is especially pertinent in health care.
13. From a Previous Example For Humans Chapter 23: Numbness, Tingling, and Sensory Loss Normal somatic sensation reflects a continuous monitoring process, little of which reaches consciousness under ordinary conditions. By contrast, disordered sensation, particularly when experienced as painful, is alarming and… For Computers <semantics controlvocab=“UMLS”> <tag> <root-term termID="28648">sensation disorders</root-term>… “disordered sensation” = 215 PubMed results “sensation disorders” = 112,577 PubMed results (raw search) = 76,826 PubMed results (MeSH major topic search)
14. More Need for Normalization Synonyms (newborn = neonate) Acronyms (GHB = gamma hydroxybutyrate) Shorthand (c diff =clostridium difficile) Bonus:You can use a semantic normalization web service in your search without tagging your content.
15. Contextual Integration By using a shared vocabulary or taxonomy, you can more easily integrate your varied content (journals, books, videos, images, training). Current taxonomies in health care include: MeSH, SNOMED, ICD-10, Read Codes, Silverchair Cortex, (and about 100 more). The Unified Medical Language System (UMLS) is a place to start for health care integrations.
17. Semantic Tagging Tagging is the insertion of semantic information in the XML, whose smallest unit is called a tag. Tagging can also be placed in database tables and header files if the content is inaccessible (such as images and videos). Tagging should be done at the smallest “atomic” level of data possible
18. Who Tags, and How? Human indexers are the most accurate taggers for high-value content, but computer routines can help them tag or tag extremely formulaic content. At Silverchair, we run an automated routine to place obvious tags and medical editors apply the rest. Community tagging/author tagging seems attractive, but can be risky due to inconsistency.
21. Precision in Discovery! Precision in answering user queries is a key component of an application’s usability and user satisfaction rating. The semantic layer provides an application with a concise guide to the content in a language it can understand. It can now provide more accurate results.
22. Example A user wants to know about the mortality of necrotizing fasciitis.
23. Computable Context Links Create a rich matrix of contextual linking for your users using the semantic layer. These links never have to be updated by a person—semantics enable instantaneous, automated relationships whenever new content is added.
26. Collection Intelligence Content Where are the topic gaps in your collections? Where is your content complete? Semantic reports give a unified view to integrated sites and can help guide collection development. Trends How are certain topics trending among your user groups? What topics are of greatest interest and value to your users?
27. Next Wave of SEO Discovery tools (intelligent agents, virtual research assistants) will give greater weight to content they can understand. Don’t let your collections be part of the “dark web”—expose your content through your semantic layer. Semantics have the potential to dramatically enhance federated search.
28.
29. Ask Publishers and Aggregators About What Semantic Metadata They Can Provide Many publishers are enriching content with semantic metadata now, and many more will Ask what kind of metadata is available to support your applications
30. Thank You! Thane Kerner CEO Silverchair thanek@silverchair.com www.silverchair.com