• Share
  • Email
  • Embed
  • Like
  • Save
  • Private Content
Bootstrapping Language Associated with Biomedical Entities - TREC 2007 Genomics Talk - Edgar Meij (Univ. of Amsterdam)
 

Bootstrapping Language Associated with Biomedical Entities - TREC 2007 Genomics Talk - Edgar Meij (Univ. of Amsterdam)

on

  • 1,321 views

The TREC Genomics 2007 task included recognizing topic-specific entities in the returned answer snippets. To tackle this problem, we designed and implemented a novel data-driven approach by combining ...

The TREC Genomics 2007 task included recognizing topic-specific entities in the returned answer snippets. To tackle this problem, we designed and implemented a novel data-driven approach by combining information extraction with language modeling techniques. Instead of using an exhaustive list of all possible instances for an entity, we look at the language usage around each entity type and use that as a classifier by comparing it with the language models of the snippets. E.g., given the entity type "genes", our approach can measure the gene-iness of a piece of text.

In the presentation we describe the underlying algorithm, which works as follows. Given a list of possible entity types, it first uses Hearst patterns to extract instances of each type. To extract more, we look for new syntactic patterns around the instances and use them as input for a bootstrapping method, in which new instances and patterns are discovered iteratively. Afterwards, all discovered instances and patterns are used to find the sentences in the collection which are most on par with the entity type. A language model is generated from these sentences by sampling i.i.d. Then, at retrieval time, we use the language model of the topic's entity type to rerank the found snippets.

Statistics

Views

Total Views
1,321
Views on SlideShare
1,306
Embed Views
15

Actions

Likes
1
Downloads
0
Comments
0

3 Embeds 15

http://edgar.meij.pro 11
http://www.linkedin.com 3
http://www.slideshare.net 1

Accessibility

Categories

Upload Details

Uploaded via as Adobe PDF

Usage Rights

© All Rights Reserved

Report content

Flagged as inappropriate Flag as inappropriate
Flag as inappropriate

Select your reason for flagging this presentation as inappropriate.

Cancel
  • Full Name Full Name Comment goes here.
    Are you sure you want to
    Your message goes here
    Processing…
Post Comment
Edit your comment

    Bootstrapping Language Associated with Biomedical Entities - TREC 2007 Genomics Talk - Edgar Meij (Univ. of Amsterdam) Bootstrapping Language Associated with Biomedical Entities - TREC 2007 Genomics Talk - Edgar Meij (Univ. of Amsterdam) Presentation Transcript

    • Introduction Our Approach Results and Discussion Future Work Bootstrapping Language Associated with Biomedical Entities The AID group at TREC Genomics 2007 Edgar Meij and Sophia Katrenko ISLA University of Amsterdam TREC 2007 Edgar Meij and Sophia Katrenko Bootstrapping Language Associated with Biomedical Entities