COMP80122: final presentation




temporal expressions identification in biomedical texts

Michele Filannino

Manchester, 29/02/2012
presentation temporal expressions




where we are


■ Computer science
  ●   natural language processing
      ▶   information extraction
           ★ temporal expressions extraction




                                                    29/02/2012, Michele Filannino   2 / 23




temporal expression definition


       ■ a natural language phrase that denotes a temporal
         entity: an interval or an instant (Ferro et al.)1
          ●   She has been at work for more than a month
          ●   He wrapped up a three-hour meeting with the Iraqi
              president in Baghdad today.




1 L. Ferro, I. Mani, B. Sundheim, and G. Wilson, “TIDES temporal annotation
guidelines, v. 1.0.2,” MITRE, 2001




why?

■ user’s perspective
   ●   temporal aspects of events and entities provide a
       natural mechanism for organising information.

■ machine’s perspective
   ●   improvements in
        ▶   question answering, summarisation, browsing







why clinical domain?

 ■ diagnosis explanation
 ■ disease progression
   modelling

 ■ analysis of effectiveness of
   treatment







scientific interest
[Bar chart: Google Scholar results per year for the query “temporal expressions” AND “clinical”]

   year      2000  2001  2002  2003  2004  2005  2006  2007  2008  2009  2010  2011  2012
   results     10    12    18    16    15    25    46    38    43    46    49    61    10

Source: Google Scholar (last update 27/02/2012)




temporal forms 1

       ■ time or date references
          ●   11pm, February 14th, 2005

       ■ time references that anchor on another time
          ●   one hour after midnight, two weeks before Christmas

       ■ durations
          ●   few months, two days, five years

       ■ recurring times
          ●   every third month, twice in the hour

1 J. Poveda, M. Surdeanu, and J. Turmo, “An analysis of Bootstrapping for the
Recognition of Temporal Expressions”, 2009




temporal forms 1

       ■ context-dependent times
          ●   today, last year

       ■ vague references
          ●   somewhere in the middle of June, the near future

       ■ times indicated by an event
          ●   the day S. Berlusconi resigned
                ▶    an event is considered a cover term for situations that
                     happen or occur

1 J. Poveda, M. Surdeanu, and J. Turmo, “An analysis of Bootstrapping for the
Recognition of Temporal Expressions”, 2009




methodology
■ annotation
  ●   recognition
      ▶   automatically detect and delimit expressions
      ▶   mostly machine-learning techniques
  ●   normalisation
      ▶   assign attribute values to all the recognised
          expressions
      ▶   using a shared and formal format
      ▶   mostly rule-based techniques
■ reasoning or searching
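
The two annotation steps above can be sketched in a few lines. This is only a toy illustration: the recogniser here is a regular expression, swapped in for the machine-learning techniques the slide mentions, and it covers nothing but simple durations. The names `recognise`, `normalise`, `NUMBER_WORDS` and `UNIT_CODES` are hypothetical, and the output follows the ISO 8601 style duration values (such as `P2D`) used in TIMEX3 annotations.

```python
import re

# Hypothetical lookup tables for this sketch; a real system needs far more coverage.
NUMBER_WORDS = {"one": 1, "two": 2, "three": 3, "four": 4, "five": 5,
                "six": 6, "seven": 7, "eight": 8, "nine": 9, "ten": 10}
UNIT_CODES = {"day": "D", "week": "W", "month": "M", "year": "Y"}

DURATION_RE = re.compile(
    r"\b(\d+|" + "|".join(NUMBER_WORDS) + r")\s+(day|week|month|year)s?\b",
    re.IGNORECASE,
)


def recognise(text):
    """Recognition step: return (start, end, surface) for each simple duration."""
    return [(m.start(), m.end(), m.group(0)) for m in DURATION_RE.finditer(text)]


def normalise(surface):
    """Normalisation step: map a recognised duration to a value such as 'P2D'.

    Returns None for anything the rule does not cover (e.g. 'few months').
    """
    m = DURATION_RE.fullmatch(surface.strip())
    if m is None:
        return None
    quantity, unit = m.group(1).lower(), m.group(2).lower()
    n = int(quantity) if quantity.isdigit() else NUMBER_WORDS[quantity]
    return f"P{n}{UNIT_CODES[unit]}"


if __name__ == "__main__":
    sentence = "The patient was discharged after two days and reviewed 3 weeks later."
    for start, end, surface in recognise(sentence):
        print(start, end, surface, normalise(surface))
```

Keeping recognition and normalisation as separate functions mirrors the split on the slide, so either step could be replaced independently, for example by a trained sequence labeller for recognition.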




example: raw text



       That means Unisys must pay about $100 million in interest every
       quarter, on top of $27 million in dividends on preferred stock.




Source: TRIOS TimeBank v.0.1




example: recognition


       That means Unisys must <ev>pay</ev> about $100 million in
       interest <te>every quarter</te>, on top of $27 million in
       dividends on preferred stock.




Source: TRIOS TimeBank v.0.1
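
As a rough sketch of how the inline markup in this example could be consumed, the snippet below pulls out the `<ev>` and `<te>` spans and recovers the raw sentence. The function names `tagged_spans` and `strip_tags` are hypothetical, and the regular expression assumes the tags are never nested.

```python
import re

TAGGED = ("That means Unisys must <ev>pay</ev> about $100 million in "
          "interest <te>every quarter</te>, on top of $27 million in "
          "dividends on preferred stock.")

# Matches <ev>...</ev> or <te>...</te>; assumes tags are not nested.
SPAN_RE = re.compile(r"<(ev|te)>(.*?)</\1>")


def tagged_spans(text):
    """Return (tag, surface) pairs in order of appearance."""
    return [(m.group(1), m.group(2)) for m in SPAN_RE.finditer(text)]


def strip_tags(text):
    """Remove the inline markup, leaving the raw sentence."""
    return SPAN_RE.sub(r"\2", text)


if __name__ == "__main__":
    print(tagged_spans(TAGGED))
    print(strip_tags(TAGGED))
```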




example: normalisation
       That means Unisys must <EVENT eid="e110" ...>pay</EVENT>
       about $100 million in interest <TIMEX3 tid="t256" type="SET"
       value="P1Q" temporalFunction="false"
       functionInDocument="NONE" quant="every">every quarter</TIMEX3>,
       on top of $27 million in dividends on preferred stock.
       <TLINK lid="l32" relType="BEFORE" relatedToEvent="e110"
       eventID="e107"/>
       <TLINK lid="l26" relType="OVERLAP" eventID="e110"
       relatedToTime="t256"/>


Source: TRIOS TimeBank v.0.1
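
A minimal sketch of reading the normalised attributes back out of such an annotation, using Python's standard `xml.etree.ElementTree`. The `<s>` wrapper element and the function name `extract_timexes` are assumptions made for this illustration; the `EVENT` and `TLINK` elements are left out because the slide elides some of their attributes.

```python
import xml.etree.ElementTree as ET

# The TIMEX3 element is copied from the example; the <s> wrapper is hypothetical
# and only there to make the fragment a well-formed XML document.
FRAGMENT = (
    '<s>That means Unisys must pay about $100 million in interest '
    '<TIMEX3 tid="t256" type="SET" value="P1Q" temporalFunction="false" '
    'functionInDocument="NONE" quant="every">every quarter</TIMEX3>, '
    'on top of $27 million in dividends on preferred stock.</s>'
)


def extract_timexes(xml_text):
    """Return one dict per TIMEX3 element with its text and key attributes."""
    root = ET.fromstring(xml_text)
    return [
        {
            "text": "".join(el.itertext()),
            "tid": el.get("tid"),
            "type": el.get("type"),
            "value": el.get("value"),
            "quant": el.get("quant"),
        }
        for el in root.iter("TIMEX3")
    ]


if __name__ == "__main__":
    for timex in extract_timexes(FRAGMENT):
        print(timex)
```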




lack of corpora








my contributions
■ built the first timex corpus from all the freely
  available timexes
   ●   {timex, type, normalised_value, utterance_reference}
   ●   2822 different timexes

■ built a normaliser
   ●   as TRIOS’ extension (University of Rochester)
   ●   accuracy improved from 62.57% to 71.66%
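
One possible way to represent a corpus entry and to score a normaliser against it is sketched below. The field names follow the tuple listed above; everything else is an assumption for illustration: the class name `TimexRecord`, the `accuracy` function, the toy lookup normaliser, and in particular the definition of accuracy as exact match on the normalised value, which the slide does not spell out. The gold values for the two durations are illustrative, not taken from the corpus.

```python
from dataclasses import dataclass
from typing import Callable, Iterable, Optional


@dataclass(frozen=True)
class TimexRecord:
    timex: str                                  # surface expression
    type: str                                   # e.g. DATE, DURATION, SET
    normalised_value: str                       # gold normalised value
    utterance_reference: Optional[str] = None   # utterance date, if known


def accuracy(records: Iterable[TimexRecord],
             normaliser: Callable[[str, Optional[str]], Optional[str]]) -> float:
    """Fraction of records whose predicted value equals the gold value exactly.

    Exact-match scoring is an assumption of this sketch.
    """
    records = list(records)
    if not records:
        return 0.0
    correct = sum(
        1 for r in records
        if normaliser(r.timex, r.utterance_reference) == r.normalised_value
    )
    return correct / len(records)


# Illustrative records; only the "every quarter" values come from the slides.
SAMPLE = [
    TimexRecord("two days", "DURATION", "P2D"),
    TimexRecord("five years", "DURATION", "P5Y"),
    TimexRecord("every quarter", "SET", "P1Q"),
]


def toy_normaliser(timex, utterance_reference=None):
    """Hypothetical lookup normaliser that knows two of the three samples."""
    return {"two days": "P2D", "every quarter": "P1Q"}.get(timex)


if __name__ == "__main__":
    print(f"{accuracy(SAMPLE, toy_normaliser):.2%}")
```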





human mistakes
 utterance    expression               type       annotation
 -            three years before       DATE       FUTURE_REF
 26/09/2011   this morning             DATE       1998-02-06TMO
 -            two decades              DURATION   P20Y
 -            the summer of 1862       DATE       FUTURE_REF
 -            centuries                DURATION   PXE
 -            the last half of ‘80s    DATE       198
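
Some annotation mistakes of this kind could be flagged automatically. The sketch below implements one hypothetical consistency check, not a method from the presentation: it flags a row when the expression contains an explicit four-digit year that does not appear in the annotated value. It catches only that single pattern and leaves the other rows for manual review.

```python
import re

# Four-digit years from 1000 to 2099; the range is an assumption of this sketch.
YEAR_RE = re.compile(r"\b(1[0-9]{3}|20[0-9]{2})\b")


def suspicious(expression, value):
    """True when a year written in the expression is missing from the value."""
    years = YEAR_RE.findall(expression)
    return any(year not in value for year in years)


# (expression, type, annotation) rows copied from the table above.
ROWS = [
    ("three years before", "DATE", "FUTURE_REF"),
    ("two decades", "DURATION", "P20Y"),
    ("the summer of 1862", "DATE", "FUTURE_REF"),
    ("centuries", "DURATION", "PXE"),
]

FLAGGED = [expr for expr, _type, value in ROWS if suspicious(expr, value)]

if __name__ == "__main__":
    print(FLAGGED)
```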






my to-do list
 ✓ study the literature

 ✓ build a corpus of timexes

 ✓ build a normaliser

 ■ release my timexes corpus freely
 ■ literature review

 [Timeline, days 0 to 30: 22 days elapsed, 8 days remaining]

Thank you.
