Deploying Semantic Technologies for Digital Publishing: A Case Study from Logos Bible Software (Presentation Transcript)
Sean Boisen (sean@logos.com)
Slides at: http://semanticbible.org/other/presentations/2007-SemTech/
Outline
Background: application and motivation
Scope and Overview
Technical Challenges:
Reification for provenance data
Converting legacy data
Tools for knowledge extension
Future directions
Who Am I?
19 years with BBN Technologies
Information extraction, human language technology
Scientist, technology manager
Semantic Web hobbyist
Senior Information Architect at Logos
One-man semantic band
The Importance of the Bible as a Semantic Domain
The most widely distributed book
35M Bibles and Testaments in 2005
The most widely translated work
> 2000 languages
41 languages at www.biblegateway.com
Spans 1000s of years of ancient history
Logos Bible Software
High-end desktop digital library
> 7000 titles
Resources in a dozen languages
Users in 180 countries
Extensive cross-indexing and hyperlinking
Leading publisher and developer of digital resources for Bible study
Logos Value
Digital library with hyperlinked references and citations
Information integration for navigation, search
Support for original languages
Search
New content to enrich Bible study
The Bible Knowledgebase (BK)
A machine-readable knowledgebase of semantically-organized Bible data
In OWL
Linked to Biblical texts
Search, navigation, visualization
Relationships support discovery and exploration
Reusable content (unlike prose)
Integration framework for library resources (future)
Today: named people and places, and their relationships
Layer knowledge: first entities, then relationships
Be conservative in what we assert and provide references as evidence
Try to avoid philosophy and focus on end-user value
The Semantic Value Proposition
Identify and disambiguate entities (beyond names)
30 people named Zechariah
Jesus’ disciple: Peter, Simon, Simeon, Cephas …
Judah: person, tribe, territory
Link reference information to passages for background
Provide a rich set of relationships to encourage exploration and discovery
Provide consistent cross-resource indexing
Leverage third-party tools
Provide scalability
Avoid reinventing the wheel
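The disambiguation point above (one person with several names, one name shared by several entities) can be illustrated with a small name index. This is only a sketch under stated assumptions: the entity ids and the `build_name_index` and `candidates` helpers are hypothetical, invented for illustration, and are not taken from the BK itself.

```python
from collections import defaultdict

# Hypothetical entity records: (entity id, kind, names).
# The ids are invented for illustration; they are not BK identifiers.
ENTITIES = [
    ("person/Peter", "person", ["Peter", "Simon", "Simeon", "Cephas"]),
    ("person/Judah", "person", ["Judah"]),
    ("tribe/Judah", "tribe", ["Judah"]),
    ("territory/Judah", "territory", ["Judah"]),
]


def build_name_index(entities):
    """Map each lower-cased surface name to the set of entity ids bearing it."""
    index = defaultdict(set)
    for entity_id, _kind, names in entities:
        for name in names:
            index[name.lower()].add(entity_id)
    return index


def candidates(index, name):
    """Return the sorted entity ids that a name could refer to."""
    return sorted(index.get(name.lower(), ()))


NAME_INDEX = build_name_index(ENTITIES)
```

Looking up "Cephas" yields a single entity, while "Judah" yields three candidates (person, tribe, territory). That second case is where a search or passage guide would need context to choose among them.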
User Benefits
Disambiguation makes search work better
Passage guide displays relevant entities to provide background information
Relationships encourage browsing and exploration
Visualization makes complex information easier to grasp
Development Tools
Ontology development and instance creation with Protégé
Legacy data conversion and data merging through XSLT
Storage in Sesame
Some integration code in Python for loading and querying RDF
TBD
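The slide mentions Python integration code for loading and querying RDF but does not show it. As a rough stand-in, the sketch below holds triples in a plain Python set and answers wildcard pattern queries. It does not use Sesame or any RDF library, and the `bk:` names and the `match` helper are assumptions made up for illustration.

```python
# Hypothetical triples; the "bk:" names are invented for illustration
# and are not actual BK identifiers or properties.
TRIPLES = {
    ("bk:Aaron", "rdf:type", "bk:Human"),
    ("bk:Moses", "rdf:type", "bk:Human"),
    ("bk:Aaron", "bk:siblingOf", "bk:Moses"),
}


def match(triples, s=None, p=None, o=None):
    """Return the triples matching a pattern, sorted; None is a wildcard."""
    return sorted(
        t for t in triples
        if (s is None or t[0] == s)
        and (p is None or t[1] == p)
        and (o is None or t[2] == o)
    )
```

For example, `match(TRIPLES, p="rdf:type")` returns both type statements, and `match(TRIPLES, s="bk:Aaron", o="bk:Moses")` returns the single sibling statement. A real deployment would send equivalent queries to the triple store rather than scan a set.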
Most Important BK Classes
> 60 classes in all (not counting reified relationships)
Many upper classes are not instantiated
General coordination of class names with SUMO
But not true re-use
BK Classes for Places
BK Abstract Classes
BK Instances
~100k triples
~3000 people instances
Aaron to Zurishaddai
Names (various languages)
~20k passage references for assertions
90 cities, other places
Ethnicities, belief systems, languages, social roles, organizations
Major BK Relationships
Domain | Property | Range
Human | Family relationships | Human
Human | Knows, collaborates, antagonist, enemy | Human
Human | Member of | Group
Human | Native, resident, visited place | Region
Human | (attributes) | Social role, Ethnicity, Belief
Region | Subregion | Region
Region | Geolocation data | Latitude, longitude, etc.
And inverse relationships …
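The mention of inverse relationships can be made concrete with a short sketch: given a table of property pairs, each stated triple also yields the triple in the opposite direction. OWL expresses this declaratively with `owl:inverseOf`. The property names and the `add_inverses` helper below are hypothetical, and the talk does not say how the BK materializes inverses.

```python
# Hypothetical property pairs; these names are invented for illustration.
INVERSE_OF = {
    "bk:memberOf": "bk:hasMember",
    "bk:subregionOf": "bk:hasSubregion",
}


def add_inverses(triples, inverse_of):
    """Return the triples plus the inverse of every triple whose property is paired."""
    # Make the pairing symmetric so either direction can be stated.
    both = dict(inverse_of)
    both.update({v: k for k, v in inverse_of.items()})
    result = set(triples)
    for s, p, o in triples:
        if p in both:
            result.add((o, both[p], s))
    return result
```

Applying `add_inverses` to a set that already contains both directions leaves it unchanged, so the step is safe to repeat after each data load.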
Challenge: Assertions about Properties
Provenance is important to the domain and application
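One common way to attach provenance to an assertion is standard RDF reification: a statement node carries `rdf:subject`, `rdf:predicate`, and `rdf:object`, plus extra properties such as a passage reference. The sketch below assumes that vocabulary. The BK may instead model reified relationships with its own classes, and `bk:siblingOf`, `bk:evidence`, and the `reify` helper are hypothetical names used only for illustration.

```python
def reify(statement_id, subject, predicate, obj, passage):
    """Return the triples describing one statement plus its passage reference."""
    node = f"_:{statement_id}"  # blank-node style label for the statement
    return [
        (node, "rdf:type", "rdf:Statement"),
        (node, "rdf:subject", subject),
        (node, "rdf:predicate", predicate),
        (node, "rdf:object", obj),
        (node, "bk:evidence", passage),  # hypothetical provenance property
    ]


# Illustrative use: record which passage supports a sibling assertion.
EXAMPLE = reify("s1", "bk:Aaron", "bk:siblingOf", "bk:Moses", "Exodus 4:14")
```

The cost is visible in the output: one asserted relationship becomes five triples, which is part of why provenance is listed here as a challenge.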
Presented May 24, 2007 at the Semantic Technology Conference.
This talk describes an effort at Logos Research Systems to build a semantic knowledgebase encompassing general background information about entities and relationships from the Bible (one of the world's most popular collections of information). The scope includes people, places, belief systems, ethnic attributes, social roles, as well as family and other inter-personal relationships, places visited, etc. This Bible Knowledgebase (BK) will be used to support knowledge discovery and visualization in both desktop and web-server configurations for Logos' products. It will also provide an integration framework for Logos' substantial digital library (more than 7000 titles from over 100 different publishers). The project is a good example of what it takes to move a real-world, knowledge-intensive application into a Semantic Web framework.