SlideShare a Scribd company logo
1 of 16
Keyword-basedContext-aware Selection of Natural Language Query-Patterns Giorgio Orsi, LetiziaTanca and Eugenio Zimeo EDBT Conference – Uppsala March 23rd 2011
Background:Cardiovascular diseases 2 March 16, 2011 SAFE – EDBT Conference Courtesy of American Heart Association: Heart Disease & Stroke Statististics (2009)
3 March 16, 2011 SAFE – EDBT Conference Background:Emergency rescue of people with CVD (1) emergency (2) rescue (3) on-site assistance missing information time constraints limited technology (5) surgey preparation (4) transport to hospital (6) surgery
Positioning:which information access paradigm? 4 March 16, 2011 SAFE – EDBT Conference Form-based: IR-style: NLP queries: Keyword search Schema-less: graph patterns: too rigid, application flow does not always “covers” the users needs. interpretation of keywords, output are documents and not tuples. non-trivial NL analysis takes time and shallow analysis is too imprecise. good! if keywords are interpreted ,[object Object]
still affected by uncertainty,[object Object]
Approach:Query Patterns 6 March 22, 2011 SAFE – EDBT Conference <nlquery id=“Q23">      <sentence>            …       </sentence>       <variables>            …       </variables>       <formalQuery>             <query>                 …            </query>            <resources>      …  </resources>       </formalQuery> </nlquery> <query>      select ?name ?formula where {       ?x rdf:typedomain:Substance. ?y rdf:typedomain:Substance. ?x domain:subName ?n1. ?x domain:formula ?formula. {       ?x domain:interacts ?y. } ?y domain:subName ?n2 FILTER (?n2 = '<fvarref=“v1"/>') } </query> <resources>       <res modelRef="&domain#Substance" />       <res modelRef=“&domain#Additive" /> <res modelRef="&domain#Molecule" />      <res modelRef="&domain#Pharmacon" />       <res modelRef="&domain#interacts" /> <res modelRef="&domain#foodPresence" /> </resources> <sentence description=“pharmacological interactions">      <fixed> show the substances and their formulas            which are known to interact with       </fixed>      <varref=“v1"/> </sentence> <variables>       <variable id=“v1" label=“pharmacon name" type=“xsd:string"/> </variables>
Approach:Keyword to Ontology Matching LOnt: ontological terms (labels) ontology  controlled vocabulary keywords: search terms (e.g., patient, drug) parameters (e.g., “John Doe”, “49.5 Kg”) online keyword suggestion auto-completion semantically-related terms frequently-used terms 7 March 22, 2011 SAFE – EDBT Conference S: suggested keywords K: input keywords LOnt = {…, heart stroke, heart failure, 	  CPR, resuscitation, …} Intended input: <heart stroke> Intended input: <CPR> S = Ø S = {CPR, heart massage, …} S = {heart stroke, heart failure, …} S = {resuscitation, …} K = Ø K = {heart stroke, CPR} K = {heart stroke} input = <he…  Input = <c…  input = “”
8 March 22, 2011 SAFE – EDBT Conference Approach:Pertinence Construct S by picking nterms t from LOnt related to the keywords already chosen (those in the set K) S = f( freq( t ), pert( t, K ) )  LR1 = {drug, pharmaceutical, medicinal} K = {       , ascriptin} drug LR2 = {disease, condition, illness, sickness} input=Ø LR3 = {treats, cures, heals} R4 R3 R5 R1 R2 LR4 = {name} string string 0.5 0.5 0.25 LR5 = {code} 1.0 0.25 pertinence computation: phase 1: best-match decoration phase 2: neighbors decoration phase 3: pertinence combination assuming n = 6… S = {           treats, cures, heals,              name, disease,              condition       }
9 March 22, 2011 SAFE – EDBT Conference Approach:Ranking the Query Patterns naïve approach Rank by average pertinence of the formal resources in the pattern rkgp=𝑟𝑖∈𝑅𝑃(𝑝)𝑝𝑒𝑟𝑡(𝑟𝑖, 𝐾)𝑅𝑃(𝑝)   normalized approach use the number of resources directly associated to a keyword and mentioned in the pattern rkgnormp=𝜃×rkgp   𝜃= 1−𝑅𝑃(𝑝)𝑅𝑃(𝑝)×𝑅𝑃(𝑝)K+𝑅𝑃(𝑝)  
Approach:Focus by Context-Awareness 10 March 22, 2011 SAFE – EDBT Conference all role situation topic anamn treat doctor para-md rescue ER patology pharm relevant areas definition keyword suggestion pattern-ranking query-answering chem natural CeV CaV
10 people without a previous experience of the systems 50 natural-language query patterns metric  access time (aT) Break down metrics: aT = thT + kpT + srT + qeT + coT Thinking time (thT) pertinence computation (kpT) Scoring and ranking (srT) query execution time (qeT) communication time (coT) Experimentation:Experimental settings 11 March 22, 2011 SAFE – EDBT Conference
Experimentation:Validation (1) 5 query patterns (randomly selected from the pool) to each user. The “right” queries were found:  in 65 % of cases on top of the list in 25 % of cases at the second position in 8 % cases of the first result page In some cases, the testers were not able to formulate the right query using the form-based system. 12 March 22, 2011 SAFE – EDBT Conference
Experimentation:Validation (2) 13 March 22, 2011 SAFE – EDBT Conference 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19
Summary:Conclusion and future work now… novel paradigm for keyword-based search context-aware and semantic ranking of query patterns fast and precise information access 14 March 22, 2011 SAFE – EDBT Conference future… automatic definition of query patterns automatic definition of natural language descriptions automatic definition of relevant areas
Q & A
Two implementations: Maemo Linux on Nokia Smartphones N810 and N900 Web based on OpenLaszlo and enterprise technologies Experimental testbed in a client/server environment Web-based SAFE vs form-based system provided by a hospital Experimentation:Testbeds 16 March 22, 2011 SAFE – EDBT Conference

More Related Content

What's hot

Introduction to computer_lec_09_fall_2018_python_lec_103
Introduction to computer_lec_09_fall_2018_python_lec_103Introduction to computer_lec_09_fall_2018_python_lec_103
Introduction to computer_lec_09_fall_2018_python_lec_103Ramadan Babers, PhD
 
Placement oriented data structures
Placement oriented data structuresPlacement oriented data structures
Placement oriented data structuresLovelyn Rose
 
RuleML2015: Rule Generalization Strategies in Incremental Learning of Disjunc...
RuleML2015: Rule Generalization Strategies in Incremental Learning of Disjunc...RuleML2015: Rule Generalization Strategies in Incremental Learning of Disjunc...
RuleML2015: Rule Generalization Strategies in Incremental Learning of Disjunc...RuleML
 
RuleML2015 - Tutorial - Powerful Practical Semantic Rules in Rulelog - Funda...
RuleML2015 - Tutorial -  Powerful Practical Semantic Rules in Rulelog - Funda...RuleML2015 - Tutorial -  Powerful Practical Semantic Rules in Rulelog - Funda...
RuleML2015 - Tutorial - Powerful Practical Semantic Rules in Rulelog - Funda...RuleML
 
Introduction to unidimensional item response model
Introduction to unidimensional item response modelIntroduction to unidimensional item response model
Introduction to unidimensional item response modelSumit Das
 
Creation of Software Focusing on Patent Analysis
Creation of Software Focusing on Patent AnalysisCreation of Software Focusing on Patent Analysis
Creation of Software Focusing on Patent AnalysisIRJET Journal
 
EPSAHypACone2005
EPSAHypACone2005EPSAHypACone2005
EPSAHypACone2005Adam Cone
 
How does the QA team prepare test cases for Data Warehouse (BI) projects?
How does the QA team prepare test cases for Data Warehouse (BI) projects?How does the QA team prepare test cases for Data Warehouse (BI) projects?
How does the QA team prepare test cases for Data Warehouse (BI) projects?Rakesh Hansalia
 
Using Item Response Theory to Improve Assessment
Using Item Response Theory to Improve AssessmentUsing Item Response Theory to Improve Assessment
Using Item Response Theory to Improve AssessmentNathan Thompson
 
Information Extraction from the Web - Algorithms and Tools
Information Extraction from the Web - Algorithms and ToolsInformation Extraction from the Web - Algorithms and Tools
Information Extraction from the Web - Algorithms and ToolsBenjamin Habegger
 
Analysing-MMPLs
Analysing-MMPLsAnalysing-MMPLs
Analysing-MMPLsmiso_uam
 
IIT-TUDA at SemEval-2016 Task 5: Beyond Sentiment Lexicon: Combining Domain ...
IIT-TUDA at SemEval-2016 Task 5: Beyond Sentiment Lexicon: Combining Domain ...IIT-TUDA at SemEval-2016 Task 5: Beyond Sentiment Lexicon: Combining Domain ...
IIT-TUDA at SemEval-2016 Task 5: Beyond Sentiment Lexicon: Combining Domain ...Alexander Panchenko
 

What's hot (14)

Introduction to computer_lec_09_fall_2018_python_lec_103
Introduction to computer_lec_09_fall_2018_python_lec_103Introduction to computer_lec_09_fall_2018_python_lec_103
Introduction to computer_lec_09_fall_2018_python_lec_103
 
Lecture 02 - DSA
Lecture 02 - DSALecture 02 - DSA
Lecture 02 - DSA
 
Placement oriented data structures
Placement oriented data structuresPlacement oriented data structures
Placement oriented data structures
 
RuleML2015: Rule Generalization Strategies in Incremental Learning of Disjunc...
RuleML2015: Rule Generalization Strategies in Incremental Learning of Disjunc...RuleML2015: Rule Generalization Strategies in Incremental Learning of Disjunc...
RuleML2015: Rule Generalization Strategies in Incremental Learning of Disjunc...
 
RuleML2015 - Tutorial - Powerful Practical Semantic Rules in Rulelog - Funda...
RuleML2015 - Tutorial -  Powerful Practical Semantic Rules in Rulelog - Funda...RuleML2015 - Tutorial -  Powerful Practical Semantic Rules in Rulelog - Funda...
RuleML2015 - Tutorial - Powerful Practical Semantic Rules in Rulelog - Funda...
 
Introduction to unidimensional item response model
Introduction to unidimensional item response modelIntroduction to unidimensional item response model
Introduction to unidimensional item response model
 
Creation of Software Focusing on Patent Analysis
Creation of Software Focusing on Patent AnalysisCreation of Software Focusing on Patent Analysis
Creation of Software Focusing on Patent Analysis
 
EPSAHypACone2005
EPSAHypACone2005EPSAHypACone2005
EPSAHypACone2005
 
How does the QA team prepare test cases for Data Warehouse (BI) projects?
How does the QA team prepare test cases for Data Warehouse (BI) projects?How does the QA team prepare test cases for Data Warehouse (BI) projects?
How does the QA team prepare test cases for Data Warehouse (BI) projects?
 
Using Item Response Theory to Improve Assessment
Using Item Response Theory to Improve AssessmentUsing Item Response Theory to Improve Assessment
Using Item Response Theory to Improve Assessment
 
Information Extraction from the Web - Algorithms and Tools
Information Extraction from the Web - Algorithms and ToolsInformation Extraction from the Web - Algorithms and Tools
Information Extraction from the Web - Algorithms and Tools
 
Analysing-MMPLs
Analysing-MMPLsAnalysing-MMPLs
Analysing-MMPLs
 
IIT-TUDA at SemEval-2016 Task 5: Beyond Sentiment Lexicon: Combining Domain ...
IIT-TUDA at SemEval-2016 Task 5: Beyond Sentiment Lexicon: Combining Domain ...IIT-TUDA at SemEval-2016 Task 5: Beyond Sentiment Lexicon: Combining Domain ...
IIT-TUDA at SemEval-2016 Task 5: Beyond Sentiment Lexicon: Combining Domain ...
 
Item Response Theory (IRT)
Item Response Theory (IRT)Item Response Theory (IRT)
Item Response Theory (IRT)
 

Viewers also liked

Invited Iceis Tanca Orsi
Invited Iceis Tanca OrsiInvited Iceis Tanca Orsi
Invited Iceis Tanca OrsiGiorgio Orsi
 
ROSeAnn: Reconciling Opinions of Semantic Annotators VLDB 2014
ROSeAnn: Reconciling Opinions of Semantic Annotators VLDB 2014ROSeAnn: Reconciling Opinions of Semantic Annotators VLDB 2014
ROSeAnn: Reconciling Opinions of Semantic Annotators VLDB 2014Giorgio Orsi
 
SAE: Structured Aspect Extraction
SAE: Structured Aspect ExtractionSAE: Structured Aspect Extraction
SAE: Structured Aspect ExtractionGiorgio Orsi
 
Web Data Extraction Como2010
Web Data Extraction Como2010Web Data Extraction Como2010
Web Data Extraction Como2010Giorgio Orsi
 
Joint Repairs for Web Wrappers
Joint Repairs for Web WrappersJoint Repairs for Web Wrappers
Joint Repairs for Web WrappersGiorgio Orsi
 

Viewers also liked (8)

Invited Iceis Tanca Orsi
Invited Iceis Tanca OrsiInvited Iceis Tanca Orsi
Invited Iceis Tanca Orsi
 
ROSeAnn: Reconciling Opinions of Semantic Annotators VLDB 2014
ROSeAnn: Reconciling Opinions of Semantic Annotators VLDB 2014ROSeAnn: Reconciling Opinions of Semantic Annotators VLDB 2014
ROSeAnn: Reconciling Opinions of Semantic Annotators VLDB 2014
 
Semantic Data Box
Semantic Data BoxSemantic Data Box
Semantic Data Box
 
Orsi PersDB11
Orsi PersDB11Orsi PersDB11
Orsi PersDB11
 
SAE: Structured Aspect Extraction
SAE: Structured Aspect ExtractionSAE: Structured Aspect Extraction
SAE: Structured Aspect Extraction
 
Table Recognition
Table RecognitionTable Recognition
Table Recognition
 
Web Data Extraction Como2010
Web Data Extraction Como2010Web Data Extraction Como2010
Web Data Extraction Como2010
 
Joint Repairs for Web Wrappers
Joint Repairs for Web WrappersJoint Repairs for Web Wrappers
Joint Repairs for Web Wrappers
 

Similar to SAFE EDBT 2011

Swertz Molgenis Bosc2009
Swertz Molgenis Bosc2009Swertz Molgenis Bosc2009
Swertz Molgenis Bosc2009bosc
 
Unstructured data processing webinar 06272016
Unstructured data processing webinar 06272016Unstructured data processing webinar 06272016
Unstructured data processing webinar 06272016George Roth
 
The Logical Model Designer - Binding Information Models to Terminology
The Logical Model Designer - Binding Information Models to TerminologyThe Logical Model Designer - Binding Information Models to Terminology
The Logical Model Designer - Binding Information Models to TerminologySnow Owl
 
NLP Data Cleansing Based on Linguistic Ontology Constraints
NLP Data Cleansing Based on Linguistic Ontology ConstraintsNLP Data Cleansing Based on Linguistic Ontology Constraints
NLP Data Cleansing Based on Linguistic Ontology ConstraintsDimitris Kontokostas
 
Linking data without common identifiers
Linking data without common identifiersLinking data without common identifiers
Linking data without common identifiersLars Marius Garshol
 
Heart Disease Prediction Using Data Mining Techniques
Heart Disease Prediction Using Data Mining TechniquesHeart Disease Prediction Using Data Mining Techniques
Heart Disease Prediction Using Data Mining TechniquesIJRES Journal
 
A Distributed Architecture System for Recognizing Textual Entailment
A Distributed Architecture System for Recognizing Textual EntailmentA Distributed Architecture System for Recognizing Textual Entailment
A Distributed Architecture System for Recognizing Textual EntailmentFaculty of Computer Science
 
Untangling Healthcare With Spark and Dataflow - PhillyETE 2016
Untangling Healthcare With Spark and Dataflow - PhillyETE 2016Untangling Healthcare With Spark and Dataflow - PhillyETE 2016
Untangling Healthcare With Spark and Dataflow - PhillyETE 2016Ryan Brush
 
Distortion Based Algorithms For Privacy Preserving Frequent Item Set Mining
Distortion Based Algorithms For Privacy Preserving Frequent Item Set Mining Distortion Based Algorithms For Privacy Preserving Frequent Item Set Mining
Distortion Based Algorithms For Privacy Preserving Frequent Item Set Mining IJDKP
 
Catalog of formulae for forensic genetics ppt
Catalog of formulae for forensic genetics pptCatalog of formulae for forensic genetics ppt
Catalog of formulae for forensic genetics pptNikita Khromov-Borisov
 
Local and Global Learning Method for Question Answering Approach
Local and Global Learning Method for Question Answering ApproachLocal and Global Learning Method for Question Answering Approach
Local and Global Learning Method for Question Answering ApproachIRJET Journal
 
Cec2010 araujo santamaria
Cec2010 araujo santamariaCec2010 araujo santamaria
Cec2010 araujo santamariaLourdes Araujo
 
Heart Attack Prediction using Machine Learning
Heart Attack Prediction using Machine LearningHeart Attack Prediction using Machine Learning
Heart Attack Prediction using Machine Learningmohdshoaibuddin1
 
WISS QA Do it yourself Question answering over Linked Data
WISS QA Do it yourself Question answering over Linked DataWISS QA Do it yourself Question answering over Linked Data
WISS QA Do it yourself Question answering over Linked DataAndre Freitas
 
Cross-lingual Information Retrieval
Cross-lingual Information RetrievalCross-lingual Information Retrieval
Cross-lingual Information RetrievalShadi Saleh
 
Myanmar Named Entity Recognition with Hidden Markov Model
Myanmar Named Entity Recognition with Hidden Markov ModelMyanmar Named Entity Recognition with Hidden Markov Model
Myanmar Named Entity Recognition with Hidden Markov Modelijtsrd
 
An introduction to system-oriented evaluation in Information Retrieval
An introduction to system-oriented evaluation in Information RetrievalAn introduction to system-oriented evaluation in Information Retrieval
An introduction to system-oriented evaluation in Information RetrievalMounia Lalmas-Roelleke
 
HL7 & HL7 CDA: The Implementation of Thailand's Healthcare Messaging Exchange...
HL7 & HL7 CDA: The Implementation of Thailand's Healthcare Messaging Exchange...HL7 & HL7 CDA: The Implementation of Thailand's Healthcare Messaging Exchange...
HL7 & HL7 CDA: The Implementation of Thailand's Healthcare Messaging Exchange...Nawanan Theera-Ampornpunt
 

Similar to SAFE EDBT 2011 (20)

Swertz Molgenis Bosc2009
Swertz Molgenis Bosc2009Swertz Molgenis Bosc2009
Swertz Molgenis Bosc2009
 
Unstructured data processing webinar 06272016
Unstructured data processing webinar 06272016Unstructured data processing webinar 06272016
Unstructured data processing webinar 06272016
 
EB-eye Back End
EB-eye Back EndEB-eye Back End
EB-eye Back End
 
The Logical Model Designer - Binding Information Models to Terminology
The Logical Model Designer - Binding Information Models to TerminologyThe Logical Model Designer - Binding Information Models to Terminology
The Logical Model Designer - Binding Information Models to Terminology
 
NLP Data Cleansing Based on Linguistic Ontology Constraints
NLP Data Cleansing Based on Linguistic Ontology ConstraintsNLP Data Cleansing Based on Linguistic Ontology Constraints
NLP Data Cleansing Based on Linguistic Ontology Constraints
 
Linking data without common identifiers
Linking data without common identifiersLinking data without common identifiers
Linking data without common identifiers
 
Heart Disease Prediction Using Data Mining Techniques
Heart Disease Prediction Using Data Mining TechniquesHeart Disease Prediction Using Data Mining Techniques
Heart Disease Prediction Using Data Mining Techniques
 
A Distributed Architecture System for Recognizing Textual Entailment
A Distributed Architecture System for Recognizing Textual EntailmentA Distributed Architecture System for Recognizing Textual Entailment
A Distributed Architecture System for Recognizing Textual Entailment
 
Untangling Healthcare With Spark and Dataflow - PhillyETE 2016
Untangling Healthcare With Spark and Dataflow - PhillyETE 2016Untangling Healthcare With Spark and Dataflow - PhillyETE 2016
Untangling Healthcare With Spark and Dataflow - PhillyETE 2016
 
Distortion Based Algorithms For Privacy Preserving Frequent Item Set Mining
Distortion Based Algorithms For Privacy Preserving Frequent Item Set Mining Distortion Based Algorithms For Privacy Preserving Frequent Item Set Mining
Distortion Based Algorithms For Privacy Preserving Frequent Item Set Mining
 
Catalog of formulae for forensic genetics ppt
Catalog of formulae for forensic genetics pptCatalog of formulae for forensic genetics ppt
Catalog of formulae for forensic genetics ppt
 
Local and Global Learning Method for Question Answering Approach
Local and Global Learning Method for Question Answering ApproachLocal and Global Learning Method for Question Answering Approach
Local and Global Learning Method for Question Answering Approach
 
210 2008 using text mining to classify requests and prepare semiautomatic ans...
210 2008 using text mining to classify requests and prepare semiautomatic ans...210 2008 using text mining to classify requests and prepare semiautomatic ans...
210 2008 using text mining to classify requests and prepare semiautomatic ans...
 
Cec2010 araujo santamaria
Cec2010 araujo santamariaCec2010 araujo santamaria
Cec2010 araujo santamaria
 
Heart Attack Prediction using Machine Learning
Heart Attack Prediction using Machine LearningHeart Attack Prediction using Machine Learning
Heart Attack Prediction using Machine Learning
 
WISS QA Do it yourself Question answering over Linked Data
WISS QA Do it yourself Question answering over Linked DataWISS QA Do it yourself Question answering over Linked Data
WISS QA Do it yourself Question answering over Linked Data
 
Cross-lingual Information Retrieval
Cross-lingual Information RetrievalCross-lingual Information Retrieval
Cross-lingual Information Retrieval
 
Myanmar Named Entity Recognition with Hidden Markov Model
Myanmar Named Entity Recognition with Hidden Markov ModelMyanmar Named Entity Recognition with Hidden Markov Model
Myanmar Named Entity Recognition with Hidden Markov Model
 
An introduction to system-oriented evaluation in Information Retrieval
An introduction to system-oriented evaluation in Information RetrievalAn introduction to system-oriented evaluation in Information Retrieval
An introduction to system-oriented evaluation in Information Retrieval
 
HL7 & HL7 CDA: The Implementation of Thailand's Healthcare Messaging Exchange...
HL7 & HL7 CDA: The Implementation of Thailand's Healthcare Messaging Exchange...HL7 & HL7 CDA: The Implementation of Thailand's Healthcare Messaging Exchange...
HL7 & HL7 CDA: The Implementation of Thailand's Healthcare Messaging Exchange...
 

More from Giorgio Orsi

Web Data Extraction: A Crash Course
Web Data Extraction: A Crash CourseWeb Data Extraction: A Crash Course
Web Data Extraction: A Crash CourseGiorgio Orsi
 
Fairhair.ai – alan turing institute june '17 (public)
Fairhair.ai – alan turing institute june '17 (public)Fairhair.ai – alan turing institute june '17 (public)
Fairhair.ai – alan turing institute june '17 (public)Giorgio Orsi
 
wadar_poster_final
wadar_poster_finalwadar_poster_final
wadar_poster_finalGiorgio Orsi
 
Query Rewriting and Optimization for Ontological Databases
Query Rewriting and Optimization for Ontological DatabasesQuery Rewriting and Optimization for Ontological Databases
Query Rewriting and Optimization for Ontological DatabasesGiorgio Orsi
 
Deos 2014 - Welcome
Deos 2014 - WelcomeDeos 2014 - Welcome
Deos 2014 - WelcomeGiorgio Orsi
 
Heuristic Ranking in Tightly Coupled Probabilistic Description Logics
Heuristic Ranking in Tightly Coupled Probabilistic Description LogicsHeuristic Ranking in Tightly Coupled Probabilistic Description Logics
Heuristic Ranking in Tightly Coupled Probabilistic Description LogicsGiorgio Orsi
 
Datalog and its Extensions for Semantic Web Databases
Datalog and its Extensions for Semantic Web DatabasesDatalog and its Extensions for Semantic Web Databases
Datalog and its Extensions for Semantic Web DatabasesGiorgio Orsi
 
AMBER WWW 2012 Poster
AMBER WWW 2012 PosterAMBER WWW 2012 Poster
AMBER WWW 2012 PosterGiorgio Orsi
 
AMBER WWW 2012 (Demonstration)
AMBER WWW 2012 (Demonstration)AMBER WWW 2012 (Demonstration)
AMBER WWW 2012 (Demonstration)Giorgio Orsi
 
OPAL: a passe-partout for web forms - WWW 2012 (Demonstration)
OPAL: a passe-partout for web forms - WWW 2012 (Demonstration)OPAL: a passe-partout for web forms - WWW 2012 (Demonstration)
OPAL: a passe-partout for web forms - WWW 2012 (Demonstration)Giorgio Orsi
 
Querying UML Class Diagrams - FoSSaCS 2012
Querying UML Class Diagrams - FoSSaCS 2012Querying UML Class Diagrams - FoSSaCS 2012
Querying UML Class Diagrams - FoSSaCS 2012Giorgio Orsi
 
OPAL: automated form understanding for the deep web - WWW 2012
OPAL: automated form understanding for the deep web - WWW 2012OPAL: automated form understanding for the deep web - WWW 2012
OPAL: automated form understanding for the deep web - WWW 2012Giorgio Orsi
 
Nyaya: Semantic data markets: a flexible environment for knowledge management...
Nyaya: Semantic data markets: a flexible environment for knowledge management...Nyaya: Semantic data markets: a flexible environment for knowledge management...
Nyaya: Semantic data markets: a flexible environment for knowledge management...Giorgio Orsi
 
The Diadem Ontology
The Diadem OntologyThe Diadem Ontology
The Diadem OntologyGiorgio Orsi
 

More from Giorgio Orsi (20)

Web Data Extraction: A Crash Course
Web Data Extraction: A Crash CourseWeb Data Extraction: A Crash Course
Web Data Extraction: A Crash Course
 
Fairhair.ai – alan turing institute june '17 (public)
Fairhair.ai – alan turing institute june '17 (public)Fairhair.ai – alan turing institute june '17 (public)
Fairhair.ai – alan turing institute june '17 (public)
 
diadem-vldb-2015
diadem-vldb-2015diadem-vldb-2015
diadem-vldb-2015
 
wadar_poster_final
wadar_poster_finalwadar_poster_final
wadar_poster_final
 
Query Rewriting and Optimization for Ontological Databases
Query Rewriting and Optimization for Ontological DatabasesQuery Rewriting and Optimization for Ontological Databases
Query Rewriting and Optimization for Ontological Databases
 
Deos 2014 - Welcome
Deos 2014 - WelcomeDeos 2014 - Welcome
Deos 2014 - Welcome
 
Perv a ds-rr13
Perv a ds-rr13Perv a ds-rr13
Perv a ds-rr13
 
Heuristic Ranking in Tightly Coupled Probabilistic Description Logics
Heuristic Ranking in Tightly Coupled Probabilistic Description LogicsHeuristic Ranking in Tightly Coupled Probabilistic Description Logics
Heuristic Ranking in Tightly Coupled Probabilistic Description Logics
 
Datalog and its Extensions for Semantic Web Databases
Datalog and its Extensions for Semantic Web DatabasesDatalog and its Extensions for Semantic Web Databases
Datalog and its Extensions for Semantic Web Databases
 
AMBER WWW 2012 Poster
AMBER WWW 2012 PosterAMBER WWW 2012 Poster
AMBER WWW 2012 Poster
 
AMBER WWW 2012 (Demonstration)
AMBER WWW 2012 (Demonstration)AMBER WWW 2012 (Demonstration)
AMBER WWW 2012 (Demonstration)
 
DIADEM WWW 2012
DIADEM WWW 2012DIADEM WWW 2012
DIADEM WWW 2012
 
OPAL: a passe-partout for web forms - WWW 2012 (Demonstration)
OPAL: a passe-partout for web forms - WWW 2012 (Demonstration)OPAL: a passe-partout for web forms - WWW 2012 (Demonstration)
OPAL: a passe-partout for web forms - WWW 2012 (Demonstration)
 
Querying UML Class Diagrams - FoSSaCS 2012
Querying UML Class Diagrams - FoSSaCS 2012Querying UML Class Diagrams - FoSSaCS 2012
Querying UML Class Diagrams - FoSSaCS 2012
 
OPAL: automated form understanding for the deep web - WWW 2012
OPAL: automated form understanding for the deep web - WWW 2012OPAL: automated form understanding for the deep web - WWW 2012
OPAL: automated form understanding for the deep web - WWW 2012
 
Nyaya: Semantic data markets: a flexible environment for knowledge management...
Nyaya: Semantic data markets: a flexible environment for knowledge management...Nyaya: Semantic data markets: a flexible environment for knowledge management...
Nyaya: Semantic data markets: a flexible environment for knowledge management...
 
The Diadem Ontology
The Diadem OntologyThe Diadem Ontology
The Diadem Ontology
 
Diadem 1.0
Diadem 1.0Diadem 1.0
Diadem 1.0
 
Oxpath vldb
Oxpath vldbOxpath vldb
Oxpath vldb
 
Gottlob ICDE 2011
Gottlob ICDE 2011Gottlob ICDE 2011
Gottlob ICDE 2011
 

SAFE EDBT 2011

  • 1. Keyword-basedContext-aware Selection of Natural Language Query-Patterns Giorgio Orsi, LetiziaTanca and Eugenio Zimeo EDBT Conference – Uppsala March 23rd 2011
  • 2. Background:Cardiovascular diseases 2 March 16, 2011 SAFE – EDBT Conference Courtesy of American Heart Association: Heart Disease & Stroke Statististics (2009)
  • 3. 3 March 16, 2011 SAFE – EDBT Conference Background:Emergency rescue of people with CVD (1) emergency (2) rescue (3) on-site assistance missing information time constraints limited technology (5) surgey preparation (4) transport to hospital (6) surgery
  • 4.
  • 5.
  • 6. Approach:Query Patterns 6 March 22, 2011 SAFE – EDBT Conference <nlquery id=“Q23"> <sentence> … </sentence> <variables> … </variables> <formalQuery> <query> … </query> <resources> … </resources> </formalQuery> </nlquery> <query> select ?name ?formula where { ?x rdf:typedomain:Substance. ?y rdf:typedomain:Substance. ?x domain:subName ?n1. ?x domain:formula ?formula. { ?x domain:interacts ?y. } ?y domain:subName ?n2 FILTER (?n2 = '<fvarref=“v1"/>') } </query> <resources> <res modelRef="&domain#Substance" /> <res modelRef=“&domain#Additive" /> <res modelRef="&domain#Molecule" /> <res modelRef="&domain#Pharmacon" /> <res modelRef="&domain#interacts" /> <res modelRef="&domain#foodPresence" /> </resources> <sentence description=“pharmacological interactions"> <fixed> show the substances and their formulas which are known to interact with </fixed> <varref=“v1"/> </sentence> <variables> <variable id=“v1" label=“pharmacon name" type=“xsd:string"/> </variables>
  • 7. Approach:Keyword to Ontology Matching LOnt: ontological terms (labels) ontology  controlled vocabulary keywords: search terms (e.g., patient, drug) parameters (e.g., “John Doe”, “49.5 Kg”) online keyword suggestion auto-completion semantically-related terms frequently-used terms 7 March 22, 2011 SAFE – EDBT Conference S: suggested keywords K: input keywords LOnt = {…, heart stroke, heart failure, CPR, resuscitation, …} Intended input: <heart stroke> Intended input: <CPR> S = Ø S = {CPR, heart massage, …} S = {heart stroke, heart failure, …} S = {resuscitation, …} K = Ø K = {heart stroke, CPR} K = {heart stroke} input = <he… Input = <c… input = “”
  • 8. 8 March 22, 2011 SAFE – EDBT Conference Approach:Pertinence Construct S by picking nterms t from LOnt related to the keywords already chosen (those in the set K) S = f( freq( t ), pert( t, K ) ) LR1 = {drug, pharmaceutical, medicinal} K = { , ascriptin} drug LR2 = {disease, condition, illness, sickness} input=Ø LR3 = {treats, cures, heals} R4 R3 R5 R1 R2 LR4 = {name} string string 0.5 0.5 0.25 LR5 = {code} 1.0 0.25 pertinence computation: phase 1: best-match decoration phase 2: neighbors decoration phase 3: pertinence combination assuming n = 6… S = { treats, cures, heals, name, disease, condition }
  • 9. 9 March 22, 2011 SAFE – EDBT Conference Approach:Ranking the Query Patterns naïve approach Rank by average pertinence of the formal resources in the pattern rkgp=𝑟𝑖∈𝑅𝑃(𝑝)𝑝𝑒𝑟𝑡(𝑟𝑖, 𝐾)𝑅𝑃(𝑝)   normalized approach use the number of resources directly associated to a keyword and mentioned in the pattern rkgnormp=𝜃×rkgp   𝜃= 1−𝑅𝑃(𝑝)𝑅𝑃(𝑝)×𝑅𝑃(𝑝)K+𝑅𝑃(𝑝)  
  • 10. Approach:Focus by Context-Awareness 10 March 22, 2011 SAFE – EDBT Conference all role situation topic anamn treat doctor para-md rescue ER patology pharm relevant areas definition keyword suggestion pattern-ranking query-answering chem natural CeV CaV
  • 11. 10 people without a previous experience of the systems 50 natural-language query patterns metric  access time (aT) Break down metrics: aT = thT + kpT + srT + qeT + coT Thinking time (thT) pertinence computation (kpT) Scoring and ranking (srT) query execution time (qeT) communication time (coT) Experimentation:Experimental settings 11 March 22, 2011 SAFE – EDBT Conference
  • 12. Experimentation:Validation (1) 5 query patterns (randomly selected from the pool) to each user. The “right” queries were found: in 65 % of cases on top of the list in 25 % of cases at the second position in 8 % cases of the first result page In some cases, the testers were not able to formulate the right query using the form-based system. 12 March 22, 2011 SAFE – EDBT Conference
  • 13. Experimentation:Validation (2) 13 March 22, 2011 SAFE – EDBT Conference 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19
  • 14. Summary:Conclusion and future work now… novel paradigm for keyword-based search context-aware and semantic ranking of query patterns fast and precise information access 14 March 22, 2011 SAFE – EDBT Conference future… automatic definition of query patterns automatic definition of natural language descriptions automatic definition of relevant areas
  • 15. Q & A
  • 16. Two implementations: Maemo Linux on Nokia Smartphones N810 and N900 Web based on OpenLaszlo and enterprise technologies Experimental testbed in a client/server environment Web-based SAFE vs form-based system provided by a hospital Experimentation:Testbeds 16 March 22, 2011 SAFE – EDBT Conference

Editor's Notes

  1. Cardiovascular diseases kill more than cancer and they generate more direct and indirect costs.
  2. This happens mainly because information doesn’t flow as expected during emergency situations.Patient unconscious.Doctors and paramedics act mainly on the basis of statistical and not actual knowledge.Motivated by these facts, a hospital asked us to find a way to overcome the problem of limited information.Hoping that this would have been resolved by straightforward technology transfer and easy money for the researchers involved we accepted the job. Unfortunately the problem wasn’t so easy and we actually had to do research to provide a solution. Problem Solved? No. Now the information flows but we still need to access it effectively.