Expressive Query Answering For Semantic Wikis (20min)

Expressive Query Answering For Semantic Wikis Jie Bao, Rensselaer Polytechnic Institute baojie@cs.rpi.edu, http://www.cs.rpi.edu/~baojie

Semantic Wiki as a Data Store May 10, 2011 2

Semantic Media Wiki (SMW) Low-cost solution for light-weight semantic applications Dozens of extensions to build apps. Integrated environment for modeling and querying SMW-ML (Modeling language): subclass/subproperty SMW-QL (Query language): disjunctive query with subquery (detailed SMW expressivity in the backup slides) May 10, 2011 3

However, we often need more expressivity Modeling Inverse property: “has author” <-> “author of” Transitive property: “part of” … Query Negation: find cities that are not capitals Counting: find professors who advise more than 5 students May 10, 2011 4

Desired Expressivity Balance between expressiveness and simplicity Modeling Language: OWL Prime [1] rdfs:subClassOf, subPropertyOf, domain, range owl:TransitiveProperty, SymmetricProperty, FunctionalProperty, InverseFunctionalProperty, inverseOf owl:sameAs, equivalentClass, equivalentProperty Query Language: SMW-QL, plus Negation as failure Cardinality (aggregation) May 10, 2011 5 [1] http://www.w3.org/2007/OWL/wiki/OwlPrime

Formalization Note: Semantic Wiki is NOT an open world (as oppose to OWL) Formalizing OWL Prime with CWA using datalog Descriptive, closed-world semantics Well-understood complexity and mature tool support May 10, 2011 6

SMW-ML+ [[Domain::C]] [[Range::C]] [[Type::Transitive]] [[Type::Symmetric]] [[Type::Functional]] [[Type::InverseFunctional]] [[Inverse of::Q]] C(x) :- P(x,y) C(y) :- P(x,y) P(x,y) :- P(x,z), P(z,y) P(x,y) :- P(y,x) SameAs(x,y) :- P(z,x),P(z,y) SameAs(x,y) :- P(x,z),P(y,z) Q(x,y) :- P(y,x) May 10, 2011 7 On page “Property:P” Not owl:sameAs!

Translation Rules for SMW-QL {{#ask: [[Category:City]] [[capital of::+]] }} result(x) :- City(x), capital_of(x, y) . May 10, 2011 8 Other constructs: for conjunction, disjunction, subquery, property chain etc, see backup slides

SMW-QL+ : Negations {{#askplus: [[<>Category:C]] [[Category:D]] }} {{#askplus: [[Category:C]] [[<>P::+]] }} result(x) :- D(x), not C(x) . result(x) :- C(x), #count{x: P(x,y)}<=0 . Why not “C(x), not P(x,y)” ? May 10, 2011 9

SMW-QL+: (Non)qualified Cardinality {{#askplus: [[>=3#P::+]] }} {{#askplus: [[>=3#P:: <q>[[Category:D]]</q>]] }} result(x) :- thing(x), #count{x: P(x,y)}>=3 . result(x) :- thing(x), #count{x: P(x,y),D(x)}>=3 . May 10, 2011 10 For safeness

Implementation Using DLV as the reasoner Other LP solvers may be used as well Two work modes File-based: reasoning based on a static dump (snapshot) of wiki semantic data. Database-based: reasoning based on a shadow database via ODBC; Real-time changes of instance data will be updated. Optimization Caching May 10, 2011 11 Download: http://www.mediawiki.org/wiki/Extension:SemanticQueryPlus

Example: May 10, 2011 12 Inverse property Caching Transitive property

Scalability: Data Complexity Test machine: 2 * Xeon 5365 Quad 3.0GHz 1333MHz /16G / 2 * 1TB Dataset: part of DBLP, 10,396 pages, 100,736 triples May 10, 2011 13 {{#askplus: [[Category:Person]] }} Almost linear

Scalability: Query Complexity May 10, 2011 14 {{#askplus: [[Knows::<q>[[Knows::<q>[[Knows::<q>…</q>]]</q>]]</q>]] }} Almost constant ,[object Object],[object Object]

Some other work on SMW by us Semantic History – tracking provenance of semantics http://www.mediawiki.org/wiki/Extension:SemanticHistory Tetherless Map – query-based map generation http://www.mediawiki.org/wiki/Extension:Tetherless_Map DBLP Import – bibtex to semantic wikihttp://www.mediawiki.org/wiki/Extension:DBLP_Import Array Extension – operate on arrayshttp://www.mediawiki.org/wiki/Extension:ArrayExtension RDFa Extension – RDFa <-> Wikihttp://www.mediawiki.org/wiki/Extension:ArrayExtension Joint work with Li Ding, Jin Zheng, Rui Huang May 10, 2011 16

Summary Formalizing SMW using datalog allows us to extend SMW for an expressive subset of OWL. implement a SMW query engine that is scalable good for typical uses. analyze the reasoning complexity of SMW (not mentioned in the talk) Future Work Incremental reasoning Customized reasoning rules SPARQL <-> SMW-QL+ translations May 10, 2011 17

Expressivity (SMW 1.5.4) SMW-ML (Modeling Language) category instantiation e.g., [[Category:C]] property instantiation e.g., [[P::v]] subclass, e.g., [[Category:C]] (on a category page) subproperty, e.g., [[Subpropetyof:Property:P]] (on a property page) SMW-QL (Query Language) conjunction: e.g., [[Category:C]][[P::v]] disjunction: e.g., [[Category:C]] or [[P::v]], [[A||B]] or [[P::v||w]] property chain: e.g., [[P.Q::v]] property wildcat: e.g., [[P::+]] subquery: e.g., [[P::<q>[[Category:C]]</q>]] inverse property e.g., [[-P::v]] value comparison, e.g. [[P::>3]][[P::<7]][[P::!5]] May 10, 2011 19

Translation Rules for SMW-ML Subproperty Subclass Class instance Property instance Redirection P(x,y) :- Q(x,y) . C(x) :- D(x) . C(a) . P(a,b) . a=b. May 10, 2011 20

Translation Rules for SMW-QL result(x) :- _tmp0(x). _tmp0(x) :- A(x), p3(x,x0), x0=category:B. _tmp0(x) :- p(x,x2), p1(x2,x3), p2(x3,x1), _tmp9(x1). _tmp9(x1) :- _tmp12(x1). _tmp12(x1) :- D(x1). _tmp12(x1) :- p1(x1,x4), x4=SomePage. _tmp9(x1) :- thing(x), x !=v. _tmp9(x1) :- E(x1). {{#ask: [[Category:A]][[p3::category:B]] or [[p.p1.p2:: <q> [[Category:D]] or [[p1::<q>[[SomePage]]</q>]] </q> ||!v ||<q>[[Category:E]]</q> ]] }} Conjunction Property chain Disjunction Inequality Subquery May 10, 2011 21

Theoretical Complexity May 10, 2011 22 Recall that L  NL  P  NP

Expressive Query Answering For Semantic Wikis (20min)

Expressive Query Answering For Semantic Wikis (20min)

Recommended

Recommended

More Related Content

What's hot

What's hot (20)

Viewers also liked

Viewers also liked (6)

Similar to Expressive Query Answering For Semantic Wikis (20min)

Similar to Expressive Query Answering For Semantic Wikis (20min) (20)

More from Jie Bao

More from Jie Bao (20)

Recently uploaded

Recently uploaded (20)

Expressive Query Answering For Semantic Wikis (20min)

Editor's Notes