ConSense : http://consense-project.com
The number of unstructured electronic documents in enterprise environments is growing rapidly. This presentation illustrates an approach to assist the enterprise-wide lifecycle management of electronic documents by utilizing the context in which knowledge workers access documents. From this context data we propose to deduce semantic relations among documents and business-domain-specific entities, which can be combined into a semantic network. Querying the resulting network may allow more efficient discovery and filtering of unstructured documents.
6. Motivation: Rapidly rising Amount of (Document-based) unstructured Information in Enterprise Environments. Impact on Personal Information Management: 1 High Effort to locate required Information; 2 Potential Redundancy; 3 Orphaned Documents; 4 Outdated Document Versions.
7. Motivation: Rapidly rising Amount of (Document-based) unstructured Information in Enterprise Environments. Impact on Personal Information Management: 1 High Effort to locate required Information; 2 Potential Redundancy; 3 Orphaned Documents; 4 Outdated Document Versions. Additional Impact on Enterprise Content Management (numerous Users, multitude of Repositories, no coordinating Instance): 5 Conflicting Changes; 6 Unclear Responsibilities; 7 Costly Process Changes.
8. Documents are semantically related to Business Entities. Relation labels shown: works on the same project as; knows; is an author of; relates to; has to review documents; was created during process step; is aware of; has role; is responsible for.
9. (Meta-)Information as a Source for semantic Relations maintenance operations concurrently open documents provenance contextual personal access and usage collaborative access and usage compliance status user classification static administrative access rights static file attributes inherent content Personal Domain Enterprise Domain
10. Existing Research Approaches Activity-based Relation Building maintenance operations concurrently open documents provenance contextual personal access and usage collaborative access and usage compliance status user classification static administrative access rights static file attributes Content-based Relation Building inherent content Personal Domain Enterprise Domain
11. Activity-based Relation Building in Multi-User Environments Workspace User A Workspace User B Document A Document B Document B Document C
12. Activity-based Relation Building in Multi-User Environments Workspace User A Workspace User B Document A Document B Document B Document C
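The workspace example on these slides can be illustrated with a small sketch. It assumes that "activity-based relation building" means relating documents that are open in the same workspace, and that a document open in two workspaces (Document B) can link otherwise unrelated documents. The function names and the data layout are hypothetical and not taken from ConSense.

```python
from collections import defaultdict
from itertools import combinations

# Hypothetical snapshot: the documents each user currently has open.
workspaces = {
    "User A": {"Document A", "Document B"},
    "User B": {"Document B", "Document C"},
}

def co_open_relations(workspaces):
    """Count, per unordered document pair, how many workspaces have both open."""
    counts = defaultdict(int)
    for docs in workspaces.values():
        for left, right in combinations(sorted(docs), 2):
            counts[(left, right)] += 1
    return dict(counts)

def linked_via_shared_document(workspaces):
    """Find pairs never open together but connected through a shared document."""
    direct = co_open_relations(workspaces)
    neighbours = defaultdict(set)
    for left, right in direct:
        neighbours[left].add(right)
        neighbours[right].add(left)
    indirect = set()
    for shared, docs in neighbours.items():
        for left, right in combinations(sorted(docs), 2):
            if (left, right) not in direct:
                indirect.add((left, right, shared))
    return indirect

direct = co_open_relations(workspaces)
indirect = linked_via_shared_document(workspaces)
```

In this sketch Document A and Document C are never open in the same workspace, yet they become candidates for a relation because both co-occur with Document B.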
13. Relation Building in Multi-User Environments Document opened Document A + static meta-information Document opened and edited Document B + static meta-information Email sent to Document B
14. Relation Building in Multi-User Environments Document opened Document A + static meta-information has understanding on document content A Document opened and edited Document B is an author of has to report to + static meta-information is aware of B Email sent to Document B
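One way to read the slide above is as a set of heuristic rules that map observed events to relations. The sketch below is only an illustration under that reading: the `Event` type, the action names, and the exact event-to-relation mapping are assumptions, while the relation labels are the ones shown on the slide.

```python
from dataclasses import dataclass
from typing import Optional

@dataclass(frozen=True)
class Event:
    user: str
    action: str                      # assumed vocabulary: "opened", "edited", "emailed"
    document: str
    recipient: Optional[str] = None  # only used for "emailed"

def derive_relations(events):
    """Apply assumed heuristic rules and return (subject, relation, object) triples."""
    relations = set()
    for e in events:
        if e.action == "opened":
            relations.add((e.user, "has understanding on document content", e.document))
        elif e.action == "edited":
            relations.add((e.user, "is an author of", e.document))
        elif e.action == "emailed" and e.recipient:
            # Assumption: sending a document makes the recipient aware of it
            # and hints at a reporting relation between sender and recipient.
            relations.add((e.recipient, "is aware of", e.document))
            relations.add((e.user, "has to report to", e.recipient))
    return relations

events = [
    Event("A", "opened", "Document A"),
    Event("B", "edited", "Document B"),
    Event("B", "emailed", "Document B", recipient="A"),
]
relations = derive_relations(events)
```

Keeping the rules in one function makes it easy to see that each relation is only as reliable as the heuristic that produced it.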
15. Resulting relations can be unreliable, aging and continuous. Relation labels shown: works on the same project as; knows; is an author of; relates to; has to review documents; was created during process step; has role; is aware of; is responsible for.
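The slide does not say how unreliable or aging relations are handled. As a purely illustrative assumption, a relation could carry a continuous confidence value that decays with age and is raised again when the relation is observed once more. The half-life, the reinforcement weight, and both function names below are hypothetical.

```python
def relation_confidence(initial, age_days, half_life_days=90.0):
    """Exponential decay: confidence halves every half_life_days (assumed model)."""
    return initial * 0.5 ** (age_days / half_life_days)

def reinforce(current, observation_weight=0.2):
    """Move confidence towards 1.0 when the relation is observed again."""
    return current + (1.0 - current) * observation_weight

aged = relation_confidence(0.8, age_days=90)  # one half-life has passed
refreshed = reinforce(0.5)
```

A query could then filter or rank relations by this value instead of treating every derived relation as equally trustworthy.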
20. Seeding External Business Entities Enterprise Applications User Activity (Document Usage) User Activity (Document Usage) Filesystem Outlook Filesystem Outlook
21. ConSense: Architecture Overview Central Semantic Store Enterprise Applications User Activity (Document Usage) User Activity (Document Usage) Filesystem Outlook Filesystem Outlook
22. Addressing Scalability and Privacy Issues FOAF Dublin Core ConSense Ontology Client Ontology Subset Client Ontology Subset merge merge foaf:knows foaf:knows cs:visitedUrl cs:responsibleFor cs:responsibleFor Central Semantic Store cs:subVersionOf
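The merge step on this slide might look roughly like the sketch below. It assumes that each client publishes only a subset of its statements to the central store and that privacy-sensitive predicates such as cs:visitedUrl stay on the client. That policy, the `SHAREABLE` set, and the function names are assumptions made for illustration; only the predicate names come from the slide.

```python
# Assumed policy: predicates a client is allowed to publish centrally.
SHAREABLE = {"foaf:knows", "cs:responsibleFor", "cs:subVersionOf"}

def client_subset(triples, shareable=SHAREABLE):
    """Keep only the triples whose predicate may leave the client."""
    return {t for t in triples if t[1] in shareable}

def merge_into_central(central, client_id, triples):
    """Add the shareable triples as quads, using the client id as graph name."""
    for s, p, o in client_subset(triples):
        central.add((s, p, o, client_id))
    return central

client_a = {
    ("alice", "foaf:knows", "bob"),
    ("alice", "cs:visitedUrl", "http://example.org/"),
    ("alice", "cs:responsibleFor", "report.doc"),
}
client_b = {
    ("bob", "foaf:knows", "alice"),
    ("report-v2.doc", "cs:subVersionOf", "report.doc"),
}

central = set()
merge_into_central(central, "client-a", client_a)
merge_into_central(central, "client-b", client_b)
```

Storing the client id as a fourth element keeps the origin of each statement, and filtering before the merge limits both the size of the central store and the personal data it receives.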
30. Research Overview Actual Business Domain: Knowledge workers use, create and collaborate with documents in business processes. Context Sensors: Client-side software plugins track the context of business-relevant document usage. Heuristic business rules are applied, resulting in semantic relationships among documents, persons, products, processes and services. Semantic Virtualization: High-level semantic information on document interrelations is stored in a central semantic RDF quad store. Domain Business Ontology Rules Semantic Store Task-specific Information: The semantic relationships are used to proactively supply knowledge workers with information and documents related to their current task context.
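The task-specific lookup described in the last step can be sketched as a one-hop query over quads. This is a minimal in-memory stand-in, not the RDF quad store itself; the quad layout (subject, predicate, object, source), the sample data, and the `related_to` function are hypothetical.

```python
# Hypothetical contents of the central store: (subject, predicate, object, source).
store = [
    ("alice", "cs:responsibleFor", "report.doc", "client-a"),
    ("report-v2.doc", "cs:subVersionOf", "report.doc", "client-b"),
    ("alice", "foaf:knows", "bob", "client-a"),
]

def related_to(store, entity):
    """Return sorted (predicate, other entity, source) for quads mentioning entity."""
    hits = []
    for s, p, o, source in store:
        if s == entity:
            hits.append((p, o, source))
        elif o == entity:
            hits.append((p, s, source))
    return sorted(hits)

# Task context: the knowledge worker is currently editing report.doc.
suggestions = related_to(store, "report.doc")
```

Here the lookup surfaces the responsible person and a newer version of the document, which is the kind of proactive hint the slide describes.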