SlideShare a Scribd company logo
1 of 14
Download to read offline
Handling Dynamicity and Temporality of
Web Data
Hady Elsahar
hadyelsahar@gmail.com
Jean Monnet University
Saint-Étienne, France
First try with Question Answering
Weet it : Natural language interface for Linked Data (ElSahar et al. ‘11 )
● Most of the current knowledge bases focus on static facts and ignore
the temporal dimension of facts.
● Aspects of temporality and Dynamicity of Datasets :
○ Aspect 1 : Many facts are valid only during a particular time period.
○ Aspect 2 : New extracted facts can contradict with, verify or modify new ones
○ Aspect 3 : Some Facts are collectively induced from a series of Events
Handling Dynamicity of Data
Challenges and Motivations (1) :
Stephen Hawking
Many facts are valid only during a particular time
period.
Use Case : Questions about Temporal facts
● Who is first Wife of Stephen Hawiking ?
● Who is the 10th President of France ?
● Who is the past CEO of google ?
Extraction and Represenation of Temporal data
Extraction and representation of Temporal Facts and Events
❏ Representation :
❏ Keeping the last updated fact is not enough (DBpedia)
❏ Higher order fact (Erdal and Weikum ‘11)
❏ f1:Bill_Clinton isPresidentOf USA.
❏ f2:f1 startedOnDate 20-01-1993
❏ Wikidata Qualifiers (Vrandečić ‘12)
❏ Temporal fact and event extraction:
❏ Free Text and structured data from wikipedia (patterns and pattern induction)
(Erdal and Weikum ‘11)
Annotation of temporal facts in documents for Question answering
SemEval-2015 Task 5: QA TempEval
SemEval-2015 Task 5: QA TempEval
Question Examples in the Evaluation Dataset :
Yes / No:
● “Did the the Indonesian stock market rise again after it’s last fall ?
List:
● “What happened after the crash?”
● “What happened between the crash and yesterday?”
When (Factoid):
● “When did the Oscar ceremony end yesterday ?”
Applications ?
Challenges and Motivations (2) :
Stephen Hawking
In Highly dynamic datasets, new extracted facts
can contradict with, verify or modify new ones.
Existing facts New Extracted Fact
Matt Smith
is dbo:starring of
■ dbr:Womb_(film)
■ dbr:Lost_River_(film)
■ dbr:Bert_and_Dickie
■ dbr:The_Science_of_Doctor_Who
“Matt Smith is the doctor”
(Matt Smith, occupation, Medicine)
confidence : 0.1
(Frank Sinatra, profession, Singer) confidence : 0.9
(Jared leto, influenced_by, Frank Sinatra) confidence : 0.8
● People influenced by Writers are probably writers as well
● people are probably born at the same place of their siblings
Challenges and Motivations (2) :
Stephen Hawking
In Highly dynamic datasets, new extracted facts
can contradict with, verify or modify new ones.
Evaluation of new facts using Link prediction
Link Prediction
● Add new facts without extra knowledge
● Assess the validity of an unknown fact
Embedding Models for knowledge bases
TransE : Modeling Relations as Translations (Bordes et al. ’13):
● Modeling Facts as translations between vectors of entities
VSubject
+ VRelation
≅ VObject
● distance is used to Quantify confidence in facts
● Training objective: Find the representations that Minimizes distances across all true facts and
maximize across “corrupted” facts ( s’ , o’ ):
Other Embedding Models:
● Structured Embeddings (SE) (Bordes et al ‘11 )
● Collective Matrix Factorization (RESCAL) (Nickel et al., ’11)
● Neural Tensor Networks (socher et al. ‘13)
● TATEC (Garcia-Duran et al., ’14)
Embedding Models for Text + Knowledge bases:
● Joint Learning of Words and Meaning Representations (Bordes et al. ‘12)
● Knowledge Graph and Text Jointly Embedding (Wang et al ‘14)
Link prediction using Embedding Models
Applications ?
● Verification of new Extracted Facts
● Completeness of new added datasets
● Modeing literals dataypes (length, date ..etc ) not only relations and
entities.
Embedding Models other benefits ? (collaboration potential)
● Entity Disambiguation for Fact Extraction and QA (Bordes et al. ‘12)
● Paraphrase Detection for Questions, (PARALEX) (Fader et al. ‘13)
Challenges and Motivations (3) :
Reasoning with more than one supporting
facts ● Reasoning about positions (ex: Geo Data)
● Reasoning about Counts
● Reasoning about sizesFact 1 : 55 passengers crammed into the smuggler’s boat.
Fact 2 : The boat made it to the Greek island.
Question : Where are the passengers ?
Stephen Hawking
Facts induced from a series of Events
● Towards AI-Complete QA: A Set of Prerequisite Toy Tasks (Wetson et al ‘15)
● Memory Networks (Wetson et al ‘14)

More Related Content

Viewers also liked

Smart Comm Marketing Report
Smart Comm Marketing ReportSmart Comm Marketing Report
Smart Comm Marketing ReportBrian DeMontfort
 
Starting a portfolio
Starting a portfolioStarting a portfolio
Starting a portfolioMaven
 
Clipping El Observador Solitario 14/11/11 @ IED Barcelona
Clipping El Observador Solitario 14/11/11 @ IED BarcelonaClipping El Observador Solitario 14/11/11 @ IED Barcelona
Clipping El Observador Solitario 14/11/11 @ IED BarcelonaIED Barcelona
 
Clipping Hola.com 14/11/11 @ IED Barcelona
Clipping Hola.com 14/11/11 @ IED BarcelonaClipping Hola.com 14/11/11 @ IED Barcelona
Clipping Hola.com 14/11/11 @ IED BarcelonaIED Barcelona
 
120313 wb mpresentation_rotterdam2012
120313 wb mpresentation_rotterdam2012120313 wb mpresentation_rotterdam2012
120313 wb mpresentation_rotterdam2012ralphbrieskorn
 
Understanding project management qualifications
Understanding project management qualificationsUnderstanding project management qualifications
Understanding project management qualificationsMaven
 
Data center dynamics ver. 1.0
Data center dynamics ver. 1.0Data center dynamics ver. 1.0
Data center dynamics ver. 1.0Hajime Arimoto
 
SES - Plush Search
SES - Plush SearchSES - Plush Search
SES - Plush SearchPhilip James
 
Trabajo investigación udh 2015
Trabajo investigación udh 2015Trabajo investigación udh 2015
Trabajo investigación udh 2015oscar rivera
 
. Net Training Institute in Noida/NCR
. Net Training Institute in Noida/NCR. Net Training Institute in Noida/NCR
. Net Training Institute in Noida/NCRTech Mentro
 
Perfil Profesional J.Carlos Nesta
Perfil Profesional J.Carlos NestaPerfil Profesional J.Carlos Nesta
Perfil Profesional J.Carlos NestaJ. Carlos Nesta
 

Viewers also liked (20)

Smart Comm Marketing Report
Smart Comm Marketing ReportSmart Comm Marketing Report
Smart Comm Marketing Report
 
Starting a portfolio
Starting a portfolioStarting a portfolio
Starting a portfolio
 
AFMS April 2014 - Louisiana Resources & Issues
AFMS April 2014 - Louisiana Resources & IssuesAFMS April 2014 - Louisiana Resources & Issues
AFMS April 2014 - Louisiana Resources & Issues
 
Clipping El Observador Solitario 14/11/11 @ IED Barcelona
Clipping El Observador Solitario 14/11/11 @ IED BarcelonaClipping El Observador Solitario 14/11/11 @ IED Barcelona
Clipping El Observador Solitario 14/11/11 @ IED Barcelona
 
Frase maleïda
Frase maleïdaFrase maleïda
Frase maleïda
 
Clipping Hola.com 14/11/11 @ IED Barcelona
Clipping Hola.com 14/11/11 @ IED BarcelonaClipping Hola.com 14/11/11 @ IED Barcelona
Clipping Hola.com 14/11/11 @ IED Barcelona
 
120313 wb mpresentation_rotterdam2012
120313 wb mpresentation_rotterdam2012120313 wb mpresentation_rotterdam2012
120313 wb mpresentation_rotterdam2012
 
Understanding project management qualifications
Understanding project management qualificationsUnderstanding project management qualifications
Understanding project management qualifications
 
Data center dynamics ver. 1.0
Data center dynamics ver. 1.0Data center dynamics ver. 1.0
Data center dynamics ver. 1.0
 
STC PMC Newsletter 2011-04
STC PMC Newsletter 2011-04STC PMC Newsletter 2011-04
STC PMC Newsletter 2011-04
 
SES - Plush Search
SES - Plush SearchSES - Plush Search
SES - Plush Search
 
Ds 011 201100000002
Ds 011 201100000002Ds 011 201100000002
Ds 011 201100000002
 
Trabajo investigación udh 2015
Trabajo investigación udh 2015Trabajo investigación udh 2015
Trabajo investigación udh 2015
 
All about
All aboutAll about
All about
 
Synthetic division
Synthetic divisionSynthetic division
Synthetic division
 
. Net Training Institute in Noida/NCR
. Net Training Institute in Noida/NCR. Net Training Institute in Noida/NCR
. Net Training Institute in Noida/NCR
 
LOGA State of the Industry: Houston, TX
LOGA State of the Industry: Houston, TXLOGA State of the Industry: Houston, TX
LOGA State of the Industry: Houston, TX
 
Perfil Profesional J.Carlos Nesta
Perfil Profesional J.Carlos NestaPerfil Profesional J.Carlos Nesta
Perfil Profesional J.Carlos Nesta
 
Currency book
Currency bookCurrency book
Currency book
 
INGLES V
INGLES VINGLES V
INGLES V
 

Similar to WDAqua introduction presentation

From Web Data to Knowledge: on the Complementarity of Human and Artificial In...
From Web Data to Knowledge: on the Complementarity of Human and Artificial In...From Web Data to Knowledge: on the Complementarity of Human and Artificial In...
From Web Data to Knowledge: on the Complementarity of Human and Artificial In...Stefan Dietze
 
Big Data Talent in Academic and Industry R&D
Big Data Talent in Academic and Industry R&DBig Data Talent in Academic and Industry R&D
Big Data Talent in Academic and Industry R&DUniversity of Washington
 
Broad Data (India 2015)
Broad Data (India 2015)Broad Data (India 2015)
Broad Data (India 2015)James Hendler
 
Session 01 designing and scoping a data science project
Session 01 designing and scoping a data science projectSession 01 designing and scoping a data science project
Session 01 designing and scoping a data science projectbodaceacat
 
Session 01 designing and scoping a data science project
Session 01 designing and scoping a data science projectSession 01 designing and scoping a data science project
Session 01 designing and scoping a data science projectSara-Jayne Terp
 
Mapping (big) data science (15 dec2014)대학(원)생
Mapping (big) data science (15 dec2014)대학(원)생Mapping (big) data science (15 dec2014)대학(원)생
Mapping (big) data science (15 dec2014)대학(원)생Han Woo PARK
 
Kdd 2014 tutorial bringing structure to text - chi
Kdd 2014 tutorial   bringing structure to text - chiKdd 2014 tutorial   bringing structure to text - chi
Kdd 2014 tutorial bringing structure to text - chiBarbara Starr
 
Data science training institute in hyderabad
Data science training institute in hyderabadData science training institute in hyderabad
Data science training institute in hyderabadKelly Technologies
 
Top (10) challenging problems in data mining
Top (10) challenging problems  in data miningTop (10) challenging problems  in data mining
Top (10) challenging problems in data miningAhmedasbasb
 
Semantic Web Investigation within Big Data Context
Semantic Web Investigation within Big Data ContextSemantic Web Investigation within Big Data Context
Semantic Web Investigation within Big Data ContextMurad Daryousse
 
Lecture Case Studies
Lecture Case StudiesLecture Case Studies
Lecture Case StudiesFleur Deken
 
Spark Social Media
Spark Social Media Spark Social Media
Spark Social Media suresh sood
 
Generating educational assessment items from Linked Open Data
Generating educational assessment items from Linked Open DataGenerating educational assessment items from Linked Open Data
Generating educational assessment items from Linked Open Datachessmu
 
Designing at the Intersection of HCI & AI: Misinformation & Crowdsourced Anno...
Designing at the Intersection of HCI & AI: Misinformation & Crowdsourced Anno...Designing at the Intersection of HCI & AI: Misinformation & Crowdsourced Anno...
Designing at the Intersection of HCI & AI: Misinformation & Crowdsourced Anno...Matthew Lease
 
Nicholas Jewell MedicReS World Congress 2014
Nicholas Jewell MedicReS World Congress 2014Nicholas Jewell MedicReS World Congress 2014
Nicholas Jewell MedicReS World Congress 2014MedicReS
 
Data Analysis for Qualitative Research
Data Analysis for Qualitative Research Data Analysis for Qualitative Research
Data Analysis for Qualitative Research Ashley Casey
 
Turning Learning into Numbers - A Learning Analytics Framework
Turning Learning into Numbers - A Learning Analytics FrameworkTurning Learning into Numbers - A Learning Analytics Framework
Turning Learning into Numbers - A Learning Analytics FrameworkHendrik Drachsler
 
What knowledge bases know (and what they don't)
What knowledge bases know (and what they don't)What knowledge bases know (and what they don't)
What knowledge bases know (and what they don't)srazniewski
 
Data Science - An emerging Stream of Science with its Spreading Reach & Impact
Data Science - An emerging Stream of Science with its Spreading Reach & ImpactData Science - An emerging Stream of Science with its Spreading Reach & Impact
Data Science - An emerging Stream of Science with its Spreading Reach & ImpactDr. Sunil Kr. Pandey
 

Similar to WDAqua introduction presentation (20)

From Web Data to Knowledge: on the Complementarity of Human and Artificial In...
From Web Data to Knowledge: on the Complementarity of Human and Artificial In...From Web Data to Knowledge: on the Complementarity of Human and Artificial In...
From Web Data to Knowledge: on the Complementarity of Human and Artificial In...
 
Big Data Talent in Academic and Industry R&D
Big Data Talent in Academic and Industry R&DBig Data Talent in Academic and Industry R&D
Big Data Talent in Academic and Industry R&D
 
Broad Data (India 2015)
Broad Data (India 2015)Broad Data (India 2015)
Broad Data (India 2015)
 
Session 01 designing and scoping a data science project
Session 01 designing and scoping a data science projectSession 01 designing and scoping a data science project
Session 01 designing and scoping a data science project
 
Session 01 designing and scoping a data science project
Session 01 designing and scoping a data science projectSession 01 designing and scoping a data science project
Session 01 designing and scoping a data science project
 
Mapping (big) data science (15 dec2014)대학(원)생
Mapping (big) data science (15 dec2014)대학(원)생Mapping (big) data science (15 dec2014)대학(원)생
Mapping (big) data science (15 dec2014)대학(원)생
 
Kdd 2014 tutorial bringing structure to text - chi
Kdd 2014 tutorial   bringing structure to text - chiKdd 2014 tutorial   bringing structure to text - chi
Kdd 2014 tutorial bringing structure to text - chi
 
Data science training institute in hyderabad
Data science training institute in hyderabadData science training institute in hyderabad
Data science training institute in hyderabad
 
Top (10) challenging problems in data mining
Top (10) challenging problems  in data miningTop (10) challenging problems  in data mining
Top (10) challenging problems in data mining
 
Semantic Web Investigation within Big Data Context
Semantic Web Investigation within Big Data ContextSemantic Web Investigation within Big Data Context
Semantic Web Investigation within Big Data Context
 
Intro to Data Science Concepts
Intro to Data Science ConceptsIntro to Data Science Concepts
Intro to Data Science Concepts
 
Lecture Case Studies
Lecture Case StudiesLecture Case Studies
Lecture Case Studies
 
Spark Social Media
Spark Social Media Spark Social Media
Spark Social Media
 
Generating educational assessment items from Linked Open Data
Generating educational assessment items from Linked Open DataGenerating educational assessment items from Linked Open Data
Generating educational assessment items from Linked Open Data
 
Designing at the Intersection of HCI & AI: Misinformation & Crowdsourced Anno...
Designing at the Intersection of HCI & AI: Misinformation & Crowdsourced Anno...Designing at the Intersection of HCI & AI: Misinformation & Crowdsourced Anno...
Designing at the Intersection of HCI & AI: Misinformation & Crowdsourced Anno...
 
Nicholas Jewell MedicReS World Congress 2014
Nicholas Jewell MedicReS World Congress 2014Nicholas Jewell MedicReS World Congress 2014
Nicholas Jewell MedicReS World Congress 2014
 
Data Analysis for Qualitative Research
Data Analysis for Qualitative Research Data Analysis for Qualitative Research
Data Analysis for Qualitative Research
 
Turning Learning into Numbers - A Learning Analytics Framework
Turning Learning into Numbers - A Learning Analytics FrameworkTurning Learning into Numbers - A Learning Analytics Framework
Turning Learning into Numbers - A Learning Analytics Framework
 
What knowledge bases know (and what they don't)
What knowledge bases know (and what they don't)What knowledge bases know (and what they don't)
What knowledge bases know (and what they don't)
 
Data Science - An emerging Stream of Science with its Spreading Reach & Impact
Data Science - An emerging Stream of Science with its Spreading Reach & ImpactData Science - An emerging Stream of Science with its Spreading Reach & Impact
Data Science - An emerging Stream of Science with its Spreading Reach & Impact
 

Recently uploaded

Understanding Accommodations and Modifications
Understanding  Accommodations and ModificationsUnderstanding  Accommodations and Modifications
Understanding Accommodations and ModificationsMJDuyan
 
Unit-IV- Pharma. Marketing Channels.pptx
Unit-IV- Pharma. Marketing Channels.pptxUnit-IV- Pharma. Marketing Channels.pptx
Unit-IV- Pharma. Marketing Channels.pptxVishalSingh1417
 
Magic bus Group work1and 2 (Team 3).pptx
Magic bus Group work1and 2 (Team 3).pptxMagic bus Group work1and 2 (Team 3).pptx
Magic bus Group work1and 2 (Team 3).pptxdhanalakshmis0310
 
SKILL OF INTRODUCING THE LESSON MICRO SKILLS.pptx
SKILL OF INTRODUCING THE LESSON MICRO SKILLS.pptxSKILL OF INTRODUCING THE LESSON MICRO SKILLS.pptx
SKILL OF INTRODUCING THE LESSON MICRO SKILLS.pptxAmanpreet Kaur
 
Key note speaker Neum_Admir Softic_ENG.pdf
Key note speaker Neum_Admir Softic_ENG.pdfKey note speaker Neum_Admir Softic_ENG.pdf
Key note speaker Neum_Admir Softic_ENG.pdfAdmir Softic
 
microwave assisted reaction. General introduction
microwave assisted reaction. General introductionmicrowave assisted reaction. General introduction
microwave assisted reaction. General introductionMaksud Ahmed
 
Introduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The BasicsIntroduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The BasicsTechSoup
 
1029 - Danh muc Sach Giao Khoa 10 . pdf
1029 -  Danh muc Sach Giao Khoa 10 . pdf1029 -  Danh muc Sach Giao Khoa 10 . pdf
1029 - Danh muc Sach Giao Khoa 10 . pdfQucHHunhnh
 
Unit-V; Pricing (Pharma Marketing Management).pptx
Unit-V; Pricing (Pharma Marketing Management).pptxUnit-V; Pricing (Pharma Marketing Management).pptx
Unit-V; Pricing (Pharma Marketing Management).pptxVishalSingh1417
 
Application orientated numerical on hev.ppt
Application orientated numerical on hev.pptApplication orientated numerical on hev.ppt
Application orientated numerical on hev.pptRamjanShidvankar
 
Unit-IV; Professional Sales Representative (PSR).pptx
Unit-IV; Professional Sales Representative (PSR).pptxUnit-IV; Professional Sales Representative (PSR).pptx
Unit-IV; Professional Sales Representative (PSR).pptxVishalSingh1417
 
Holdier Curriculum Vitae (April 2024).pdf
Holdier Curriculum Vitae (April 2024).pdfHoldier Curriculum Vitae (April 2024).pdf
Holdier Curriculum Vitae (April 2024).pdfagholdier
 
This PowerPoint helps students to consider the concept of infinity.
This PowerPoint helps students to consider the concept of infinity.This PowerPoint helps students to consider the concept of infinity.
This PowerPoint helps students to consider the concept of infinity.christianmathematics
 
Python Notes for mca i year students osmania university.docx
Python Notes for mca i year students osmania university.docxPython Notes for mca i year students osmania university.docx
Python Notes for mca i year students osmania university.docxRamakrishna Reddy Bijjam
 
Activity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdfActivity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdfciinovamais
 
Seal of Good Local Governance (SGLG) 2024Final.pptx
Seal of Good Local Governance (SGLG) 2024Final.pptxSeal of Good Local Governance (SGLG) 2024Final.pptx
Seal of Good Local Governance (SGLG) 2024Final.pptxnegromaestrong
 
Making communications land - Are they received and understood as intended? we...
Making communications land - Are they received and understood as intended? we...Making communications land - Are they received and understood as intended? we...
Making communications land - Are they received and understood as intended? we...Association for Project Management
 
1029-Danh muc Sach Giao Khoa khoi 6.pdf
1029-Danh muc Sach Giao Khoa khoi  6.pdf1029-Danh muc Sach Giao Khoa khoi  6.pdf
1029-Danh muc Sach Giao Khoa khoi 6.pdfQucHHunhnh
 
Mixin Classes in Odoo 17 How to Extend Models Using Mixin Classes
Mixin Classes in Odoo 17  How to Extend Models Using Mixin ClassesMixin Classes in Odoo 17  How to Extend Models Using Mixin Classes
Mixin Classes in Odoo 17 How to Extend Models Using Mixin ClassesCeline George
 
How to Give a Domain for a Field in Odoo 17
How to Give a Domain for a Field in Odoo 17How to Give a Domain for a Field in Odoo 17
How to Give a Domain for a Field in Odoo 17Celine George
 

Recently uploaded (20)

Understanding Accommodations and Modifications
Understanding  Accommodations and ModificationsUnderstanding  Accommodations and Modifications
Understanding Accommodations and Modifications
 
Unit-IV- Pharma. Marketing Channels.pptx
Unit-IV- Pharma. Marketing Channels.pptxUnit-IV- Pharma. Marketing Channels.pptx
Unit-IV- Pharma. Marketing Channels.pptx
 
Magic bus Group work1and 2 (Team 3).pptx
Magic bus Group work1and 2 (Team 3).pptxMagic bus Group work1and 2 (Team 3).pptx
Magic bus Group work1and 2 (Team 3).pptx
 
SKILL OF INTRODUCING THE LESSON MICRO SKILLS.pptx
SKILL OF INTRODUCING THE LESSON MICRO SKILLS.pptxSKILL OF INTRODUCING THE LESSON MICRO SKILLS.pptx
SKILL OF INTRODUCING THE LESSON MICRO SKILLS.pptx
 
Key note speaker Neum_Admir Softic_ENG.pdf
Key note speaker Neum_Admir Softic_ENG.pdfKey note speaker Neum_Admir Softic_ENG.pdf
Key note speaker Neum_Admir Softic_ENG.pdf
 
microwave assisted reaction. General introduction
microwave assisted reaction. General introductionmicrowave assisted reaction. General introduction
microwave assisted reaction. General introduction
 
Introduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The BasicsIntroduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The Basics
 
1029 - Danh muc Sach Giao Khoa 10 . pdf
1029 -  Danh muc Sach Giao Khoa 10 . pdf1029 -  Danh muc Sach Giao Khoa 10 . pdf
1029 - Danh muc Sach Giao Khoa 10 . pdf
 
Unit-V; Pricing (Pharma Marketing Management).pptx
Unit-V; Pricing (Pharma Marketing Management).pptxUnit-V; Pricing (Pharma Marketing Management).pptx
Unit-V; Pricing (Pharma Marketing Management).pptx
 
Application orientated numerical on hev.ppt
Application orientated numerical on hev.pptApplication orientated numerical on hev.ppt
Application orientated numerical on hev.ppt
 
Unit-IV; Professional Sales Representative (PSR).pptx
Unit-IV; Professional Sales Representative (PSR).pptxUnit-IV; Professional Sales Representative (PSR).pptx
Unit-IV; Professional Sales Representative (PSR).pptx
 
Holdier Curriculum Vitae (April 2024).pdf
Holdier Curriculum Vitae (April 2024).pdfHoldier Curriculum Vitae (April 2024).pdf
Holdier Curriculum Vitae (April 2024).pdf
 
This PowerPoint helps students to consider the concept of infinity.
This PowerPoint helps students to consider the concept of infinity.This PowerPoint helps students to consider the concept of infinity.
This PowerPoint helps students to consider the concept of infinity.
 
Python Notes for mca i year students osmania university.docx
Python Notes for mca i year students osmania university.docxPython Notes for mca i year students osmania university.docx
Python Notes for mca i year students osmania university.docx
 
Activity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdfActivity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdf
 
Seal of Good Local Governance (SGLG) 2024Final.pptx
Seal of Good Local Governance (SGLG) 2024Final.pptxSeal of Good Local Governance (SGLG) 2024Final.pptx
Seal of Good Local Governance (SGLG) 2024Final.pptx
 
Making communications land - Are they received and understood as intended? we...
Making communications land - Are they received and understood as intended? we...Making communications land - Are they received and understood as intended? we...
Making communications land - Are they received and understood as intended? we...
 
1029-Danh muc Sach Giao Khoa khoi 6.pdf
1029-Danh muc Sach Giao Khoa khoi  6.pdf1029-Danh muc Sach Giao Khoa khoi  6.pdf
1029-Danh muc Sach Giao Khoa khoi 6.pdf
 
Mixin Classes in Odoo 17 How to Extend Models Using Mixin Classes
Mixin Classes in Odoo 17  How to Extend Models Using Mixin ClassesMixin Classes in Odoo 17  How to Extend Models Using Mixin Classes
Mixin Classes in Odoo 17 How to Extend Models Using Mixin Classes
 
How to Give a Domain for a Field in Odoo 17
How to Give a Domain for a Field in Odoo 17How to Give a Domain for a Field in Odoo 17
How to Give a Domain for a Field in Odoo 17
 

WDAqua introduction presentation

  • 1. Handling Dynamicity and Temporality of Web Data Hady Elsahar hadyelsahar@gmail.com Jean Monnet University Saint-Étienne, France
  • 2. First try with Question Answering Weet it : Natural language interface for Linked Data (ElSahar et al. ‘11 )
  • 3. ● Most of the current knowledge bases focus on static facts and ignore the temporal dimension of facts. ● Aspects of temporality and Dynamicity of Datasets : ○ Aspect 1 : Many facts are valid only during a particular time period. ○ Aspect 2 : New extracted facts can contradict with, verify or modify new ones ○ Aspect 3 : Some Facts are collectively induced from a series of Events Handling Dynamicity of Data
  • 4. Challenges and Motivations (1) : Stephen Hawking Many facts are valid only during a particular time period. Use Case : Questions about Temporal facts ● Who is first Wife of Stephen Hawiking ? ● Who is the 10th President of France ? ● Who is the past CEO of google ?
  • 5. Extraction and Represenation of Temporal data Extraction and representation of Temporal Facts and Events ❏ Representation : ❏ Keeping the last updated fact is not enough (DBpedia) ❏ Higher order fact (Erdal and Weikum ‘11) ❏ f1:Bill_Clinton isPresidentOf USA. ❏ f2:f1 startedOnDate 20-01-1993 ❏ Wikidata Qualifiers (Vrandečić ‘12) ❏ Temporal fact and event extraction: ❏ Free Text and structured data from wikipedia (patterns and pattern induction) (Erdal and Weikum ‘11)
  • 6. Annotation of temporal facts in documents for Question answering SemEval-2015 Task 5: QA TempEval
  • 7. SemEval-2015 Task 5: QA TempEval Question Examples in the Evaluation Dataset : Yes / No: ● “Did the the Indonesian stock market rise again after it’s last fall ? List: ● “What happened after the crash?” ● “What happened between the crash and yesterday?” When (Factoid): ● “When did the Oscar ceremony end yesterday ?” Applications ?
  • 8. Challenges and Motivations (2) : Stephen Hawking In Highly dynamic datasets, new extracted facts can contradict with, verify or modify new ones. Existing facts New Extracted Fact Matt Smith is dbo:starring of ■ dbr:Womb_(film) ■ dbr:Lost_River_(film) ■ dbr:Bert_and_Dickie ■ dbr:The_Science_of_Doctor_Who “Matt Smith is the doctor” (Matt Smith, occupation, Medicine) confidence : 0.1
  • 9. (Frank Sinatra, profession, Singer) confidence : 0.9 (Jared leto, influenced_by, Frank Sinatra) confidence : 0.8 ● People influenced by Writers are probably writers as well ● people are probably born at the same place of their siblings Challenges and Motivations (2) : Stephen Hawking In Highly dynamic datasets, new extracted facts can contradict with, verify or modify new ones.
  • 10. Evaluation of new facts using Link prediction Link Prediction ● Add new facts without extra knowledge ● Assess the validity of an unknown fact
  • 11. Embedding Models for knowledge bases TransE : Modeling Relations as Translations (Bordes et al. ’13): ● Modeling Facts as translations between vectors of entities VSubject + VRelation ≅ VObject ● distance is used to Quantify confidence in facts ● Training objective: Find the representations that Minimizes distances across all true facts and maximize across “corrupted” facts ( s’ , o’ ):
  • 12. Other Embedding Models: ● Structured Embeddings (SE) (Bordes et al ‘11 ) ● Collective Matrix Factorization (RESCAL) (Nickel et al., ’11) ● Neural Tensor Networks (socher et al. ‘13) ● TATEC (Garcia-Duran et al., ’14) Embedding Models for Text + Knowledge bases: ● Joint Learning of Words and Meaning Representations (Bordes et al. ‘12) ● Knowledge Graph and Text Jointly Embedding (Wang et al ‘14) Link prediction using Embedding Models
  • 13. Applications ? ● Verification of new Extracted Facts ● Completeness of new added datasets ● Modeing literals dataypes (length, date ..etc ) not only relations and entities. Embedding Models other benefits ? (collaboration potential) ● Entity Disambiguation for Fact Extraction and QA (Bordes et al. ‘12) ● Paraphrase Detection for Questions, (PARALEX) (Fader et al. ‘13)
  • 14. Challenges and Motivations (3) : Reasoning with more than one supporting facts ● Reasoning about positions (ex: Geo Data) ● Reasoning about Counts ● Reasoning about sizesFact 1 : 55 passengers crammed into the smuggler’s boat. Fact 2 : The boat made it to the Greek island. Question : Where are the passengers ? Stephen Hawking Facts induced from a series of Events ● Towards AI-Complete QA: A Set of Prerequisite Toy Tasks (Wetson et al ‘15) ● Memory Networks (Wetson et al ‘14)