SlideShare a Scribd company logo
1 of 25
Download to read offline
A GRAPH-BASED CROSS-LINGUAL
      PROJECTION APPROACH FOR
WEAKLY SUPERVISED RELATION EXTRACTION
    The 50th Annual Meeting of the Association for Computational Linguistics
                                  (ACL 2012)
                             July 11th, 2012, Jeju

       Seokhwan Kim (Institute for Infocomm Research)
                 Gary Geunbae Lee (POSTECH)
Contents
• Introduction
• Methods
    Cross-lingual Annotation Projection for Relation Extraction
    Graph-based Projection Approach
• Evaluation
• Conclusions




                                                                   2
Contents
• Introduction
• Methods
    Cross-lingual Annotation Projection for Relation Extraction
    Graph-based Projection Approach
• Evaluation
• Conclusions




                                                                   3
Problem Definition
• Relation Extraction
    To identify semantic relations between a pair of entities

                         Birthplace

       Barack Obama was born in Honolulu           ,   Hawaii    .
              PER                          LOC           LOC



    Considered as a classification problem




                                                                     4
Related Work (1)
• Supervised Learning
    Many supervised machine learning approaches have been
     successfully applied
      • (Kambhatla, 2004; Zhou et al., 2005; Zelenko et al., 2003; Culotta and
        Sorensen, 2004; Bunescu and Mooney, 2005; Zhang et al., 2006)

• Semi-supervised Learning
    To obtain the annotations of unlabeled instances from the seed
     information
      • (Brin, 1999; Riloff and Jones, 1999; Agichtein and Gravano, 2000;
        Sudo et al, 2003; Yangarber, 2003; Stevenson and Greenwood, 2006;
        Zhang, 2004; Chen el al., 2006; Zhou et al., 2009)



                                                                             5
Motivation
• Resources for Relation Extraction
    Supervised/Semi-supervised Approaches
      • Labeled corpora for supervised learning
      • Seed instances for semi-supervised learning
      • Available for only a few languages
           ACE 2003 Multilingual Training Dataset
              • English (252 articles)
              • Chinese (221 articles)
              • Arabic (206 articles)
      • No resources for other languages
           Korean




                                                      6
Related Work (2)
• Self-supervised Learning
    To obtain the annotated dataset without any human effort
    Using the information obtained from external resources
       • Heuristic-based Method (Banko et al., 2007; Banko et al., 2008)
       • Wikipedia-based Methods (Wu and Weld, 2010)

• Cross-lingual Annotation Projection
    To leverage parallel corpora to project the relation annotations on
     the resource-rich source language to the resource-poor target
     language (Kim et al., 2010, Kim et al., 2011)




                                                                           7
Contents
• Introduction
• Methods
    Cross-lingual Annotation Projection for Relation Extraction
    Graph-based Projection Approach
• Implementation
• Evaluation
• Conclusions




                                                               8
Overall Architecture
Annotation                Parallel
                                                     Projection
                          Corpus


         Sentences in                 Sentences in
                 Ls                        Lt



        Preprocessing                Preprocessing
        (POS Tagging,                (POS Tagging,
           Parsing)                     Parsing)




               NER                   Word Alignment




              Relation
                                       Projection
             Extraction



          Annotated                    Annotated
         Sentences in                 Sentences in
                 Ls                        Lt                     9
Direct Projection
                                                                                (Kim et al., 2010)
• Annotation


• Projection




                        fE (<Barack Obama, Honolulu>) = 1
       Barack Obama              was born in        Honolulu          ,    Hawaii         .


    버락 오바마               는       하와이         의      호놀룰루              에서          태어났다
    (beo-rak-o-ba-ma)   (neun)   (ha-wa-i)   (ui)   (ho-nol-rul-ru)   (e-seo)     (tae-eo-nat-da)


                          fK (<버락 오바마, 호놀룰루>) = 1
                                                                                                    10
Limitations of Direct Projection
• Direct projection approach is still vulnerable to the
  erroneous inputs generated by preprocessors
• Main causes of this limitation
    Considering alignment between entity candidates only, not any
     contextual information
    Performed by just a single pass process




                                                                     11
Graph-based Learning
• Semi-supervised learning algorithm
• Defining a graph
    The nodes represent labeled and unlabeled examples in a dataset
    The edges reflect the similarity of examples
• Learning a labeling function in an iterative manner
    It should be close to the given labels on the similar labeled nodes
    It should be smooth on the whole graph
• Related Work
    Graph-based Learning for Relation Extraction (Chen et al, 2006)
    Bilingual projection of POS tagging (Das and Petrov, 2011)


                                                                           12
Graph Construction
• Graph Nodes
   Instance Nodes
      • Defined for all pairs of entity candidates in both languages
      • Each instance node has a soft label vector Y = [y+ y-]
   Context Nodes
      • For identifying the relation descriptors of the positive instances
      • Defined for each trigram which is located between a given entity pair
        which is semantically related
      • Each context node has a soft label vector Y = [y+ y-]


                       <ARG1> was born in <ARG2>



   <ARG1> was born                was born in                born in <ARG2>     13
Graph Construction
• Edge Weights
   Between instance node and context node in the same language
        𝑤 𝑣 𝑖,𝑗 , 𝑢 𝑘
                             1             𝑖𝑓 𝑣 𝑖𝑗 ℎ𝑎𝑠 𝑢 𝑘 𝑎𝑠 𝑎 𝑐𝑜𝑛𝑡𝑒𝑥𝑡𝑢𝑎𝑙 𝑠𝑢𝑏𝑠𝑒𝑞𝑢𝑒𝑛𝑐𝑒,
                           = 0                                𝑜𝑡ℎ𝑒𝑟𝑤𝑖𝑠𝑒.
   Between context nodes in a language
             𝑘,
                                                   |𝑢 𝑘 ∩ 𝑢 𝑙 |
       𝑤(𝑢        𝑢 𝑙)     = 𝐽(𝑢      𝑘,
                                           𝑢 𝑙)   = 𝑘           .
                                                   |𝑢 ∪ 𝑢 𝑙 |

   Between context nodes in source and target languages
                                      𝑐𝑜𝑢𝑛𝑡 𝑢 𝑠𝑘 , 𝑢 𝑙𝑡
       𝑤(𝑢 𝑠𝑘 ,   𝑢 𝑙𝑡 )   =                      𝑘       𝑚
                                                               ,
                                 𝑢𝑡   𝑚 ‍ 𝑐𝑜𝑢𝑛𝑡 𝑢 𝑠 , 𝑢 𝑡




                                                                                          14
Graph Construction
• Example




                                 15
Label Propagation
                                       Initialize T
• Algorithm
    Input
      • A transition matrix T
      • An initial label matrix Y0    Normalize T
    Output
      • The updated label matrix Yt

                                       Initialize Y




                                        Update Y




                                                 16
Label Propagation
• Executed in three phases




               1st phase




                             2nd phase




                                         3rd phase




                                                     17
Contents
• Introduction
• Methods
    Cross-lingual Annotation Projection for Relation Extraction
    Graph-based Projection Approach
• Evaluation
• Conclusions




                                                                   18
Implementation
• Dataset
    English-Korean parallel corpus
       • 266,982 bi-sentence pairs in English and Korean
       • Aligned by GIZA++
• Annotation
    ReVerb (Fader et al., 2011)
       • English Open IE system
• Label Propagation
    Junto Label Propagation Toolkit
• Learning
    Tree kernel-based SVM classifier
       • Shortest path dependency kernel (Bunescu and Mooney, 2005)
       • SVM-Light (Joachims, 1998)

                                                                      19
Evaluation
• Dataset
    Manually annotated Korean dataset
      • Obtained from the Web following Bunescu and Mooney(2007)’s work
      • 500 sentences with manual annotations for four relation types
             Acquisition
             Birthplace
             Inventor Of
             Won Prize

• Evaluation Metrics
    Precision/Recall/F-measure




                                                                          20
Experimental Results
• Direct Projection vs. Graph-based Projection


                   Direct Projection     Graph-based Projection
      Type
                  P       R        F       P       R       F
   Acquisition   51.6    87.7     64.9    55.3    91.2    68.9
   Birthplace    69.8    84.5     76.4    73.8    87.3    80.0
   Inventor of   62.4    85.3     72.1    66.3    89.7    76.3
   Won Prize     73.3    80.5     76.7    76.4    82.9    79.5
      Total      63.9    84.2     72.7    67.7    87.4    76.3




                                                                 21
Experimental Results
• Comparisons to other self-supervised approaches
    Heuristic-based Approach (Banko et al., 2007; Banko et al., 2008)
       • Korean Treebank and Syntactic Heuristics
    Wikipedia-based Approach (Wu and Weld, 2010)
       • Korean Wikipedia articles and Infoboxes


          Approach                    P              R        F

       Heuristic-based              92.31           17.27   29.09

      Wikipedia-based               66.67           66.91   66.79

      Projection-based             67.69            87.41   76.30

                                                                         22
Contents
• Introduction
• Methods
    Cross-lingual Annotation Projection for Relation Extraction
    Graph-based Projection Approach
• Evaluation
• Conclusions




                                                                   23
Conclusion
• Summary
    A graph-based projection approach for relation extraction
       • Label propagation algorithm
       • On a graph that represents the instance and context features of both
         the source and target languages
    Experimental results show that our approach helps to improve the
     performances of relation extraction compared to other approaches
• Future work
    To relieve the high complexity problem of the approach
    To deal with more expanded graph structure to improve the
     extraction performances


                                                                                24
Q&A

More Related Content

What's hot

Ontology Integration and Interoperability (OntoIOp) – Part 1: The Distributed...
Ontology Integration and Interoperability (OntoIOp) – Part 1: The Distributed...Ontology Integration and Interoperability (OntoIOp) – Part 1: The Distributed...
Ontology Integration and Interoperability (OntoIOp) – Part 1: The Distributed...Christoph Lange
 
A System for the Recognition of Handwritten Yorùbá Characters
A System for the Recognition of Handwritten Yorùbá CharactersA System for the Recognition of Handwritten Yorùbá Characters
A System for the Recognition of Handwritten Yorùbá CharactersGuy De Pauw
 
Deep Learning for NLP: An Introduction to Neural Word Embeddings
Deep Learning for NLP: An Introduction to Neural Word EmbeddingsDeep Learning for NLP: An Introduction to Neural Word Embeddings
Deep Learning for NLP: An Introduction to Neural Word EmbeddingsRoelof Pieters
 
NLP State of the Art | BERT
NLP State of the Art | BERTNLP State of the Art | BERT
NLP State of the Art | BERTshaurya uppal
 
BERT: Bidirectional Encoder Representations from Transformers
BERT: Bidirectional Encoder Representations from TransformersBERT: Bidirectional Encoder Representations from Transformers
BERT: Bidirectional Encoder Representations from TransformersLiangqun Lu
 
S-CUBE LP: Executing the HOCL: Concept of a Chemical Interpreter
S-CUBE LP: Executing the HOCL: Concept of a Chemical InterpreterS-CUBE LP: Executing the HOCL: Concept of a Chemical Interpreter
S-CUBE LP: Executing the HOCL: Concept of a Chemical Interpretervirtual-campus
 
Ontology Integration and Interoperability (OntoIOp) – Part 1: The Distributed...
Ontology Integration and Interoperability (OntoIOp) – Part 1: The Distributed...Ontology Integration and Interoperability (OntoIOp) – Part 1: The Distributed...
Ontology Integration and Interoperability (OntoIOp) – Part 1: The Distributed...Christoph Lange
 
Grammarly AI-NLP Club #6 - Sequence Tagging using Neural Networks - Artem Che...
Grammarly AI-NLP Club #6 - Sequence Tagging using Neural Networks - Artem Che...Grammarly AI-NLP Club #6 - Sequence Tagging using Neural Networks - Artem Che...
Grammarly AI-NLP Club #6 - Sequence Tagging using Neural Networks - Artem Che...Grammarly
 
Multitier holistic Approach for urdu Nastaliq Recognition
Multitier holistic Approach for urdu Nastaliq RecognitionMultitier holistic Approach for urdu Nastaliq Recognition
Multitier holistic Approach for urdu Nastaliq RecognitionDr. Syed Hassan Amin
 
Deep Reinforcement Learning with Distributional Semantic Rewards for Abstract...
Deep Reinforcement Learning with Distributional Semantic Rewards for Abstract...Deep Reinforcement Learning with Distributional Semantic Rewards for Abstract...
Deep Reinforcement Learning with Distributional Semantic Rewards for Abstract...Deren Lei
 
A NOVEL APPROACH FOR NAMED ENTITY RECOGNITION ON HINDI LANGUAGE USING RESIDUA...
A NOVEL APPROACH FOR NAMED ENTITY RECOGNITION ON HINDI LANGUAGE USING RESIDUA...A NOVEL APPROACH FOR NAMED ENTITY RECOGNITION ON HINDI LANGUAGE USING RESIDUA...
A NOVEL APPROACH FOR NAMED ENTITY RECOGNITION ON HINDI LANGUAGE USING RESIDUA...kevig
 
Diversified Social Media Retrieval for News Stories
Diversified Social Media Retrieval for News StoriesDiversified Social Media Retrieval for News Stories
Diversified Social Media Retrieval for News StoriesBryan Gummibearehausen
 
VOC real world enterprise needs
VOC real world enterprise needsVOC real world enterprise needs
VOC real world enterprise needsIvan Berlocher
 
Pattern Mining To Unknown Word Extraction (10
Pattern Mining To Unknown Word Extraction (10Pattern Mining To Unknown Word Extraction (10
Pattern Mining To Unknown Word Extraction (10Jason Yang
 
Representation Learning of Vectors of Words and Phrases
Representation Learning of Vectors of Words and PhrasesRepresentation Learning of Vectors of Words and Phrases
Representation Learning of Vectors of Words and PhrasesFelipe Moraes
 
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
BERT: Pre-training of Deep Bidirectional Transformers for Language UnderstandingBERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
BERT: Pre-training of Deep Bidirectional Transformers for Language Understandinggohyunwoong
 

What's hot (20)

Ontology Integration and Interoperability (OntoIOp) – Part 1: The Distributed...
Ontology Integration and Interoperability (OntoIOp) – Part 1: The Distributed...Ontology Integration and Interoperability (OntoIOp) – Part 1: The Distributed...
Ontology Integration and Interoperability (OntoIOp) – Part 1: The Distributed...
 
NLP from scratch
NLP from scratch NLP from scratch
NLP from scratch
 
A System for the Recognition of Handwritten Yorùbá Characters
A System for the Recognition of Handwritten Yorùbá CharactersA System for the Recognition of Handwritten Yorùbá Characters
A System for the Recognition of Handwritten Yorùbá Characters
 
Deep Learning for NLP: An Introduction to Neural Word Embeddings
Deep Learning for NLP: An Introduction to Neural Word EmbeddingsDeep Learning for NLP: An Introduction to Neural Word Embeddings
Deep Learning for NLP: An Introduction to Neural Word Embeddings
 
NLP State of the Art | BERT
NLP State of the Art | BERTNLP State of the Art | BERT
NLP State of the Art | BERT
 
BERT: Bidirectional Encoder Representations from Transformers
BERT: Bidirectional Encoder Representations from TransformersBERT: Bidirectional Encoder Representations from Transformers
BERT: Bidirectional Encoder Representations from Transformers
 
S-CUBE LP: Executing the HOCL: Concept of a Chemical Interpreter
S-CUBE LP: Executing the HOCL: Concept of a Chemical InterpreterS-CUBE LP: Executing the HOCL: Concept of a Chemical Interpreter
S-CUBE LP: Executing the HOCL: Concept of a Chemical Interpreter
 
BERT introduction
BERT introductionBERT introduction
BERT introduction
 
[Paper review] BERT
[Paper review] BERT[Paper review] BERT
[Paper review] BERT
 
Ontology Integration and Interoperability (OntoIOp) – Part 1: The Distributed...
Ontology Integration and Interoperability (OntoIOp) – Part 1: The Distributed...Ontology Integration and Interoperability (OntoIOp) – Part 1: The Distributed...
Ontology Integration and Interoperability (OntoIOp) – Part 1: The Distributed...
 
SPIDER: a System for Paraphrasing - Applicability in Machine Translation Pre-...
SPIDER: a System for Paraphrasing - Applicability in Machine Translation Pre-...SPIDER: a System for Paraphrasing - Applicability in Machine Translation Pre-...
SPIDER: a System for Paraphrasing - Applicability in Machine Translation Pre-...
 
Grammarly AI-NLP Club #6 - Sequence Tagging using Neural Networks - Artem Che...
Grammarly AI-NLP Club #6 - Sequence Tagging using Neural Networks - Artem Che...Grammarly AI-NLP Club #6 - Sequence Tagging using Neural Networks - Artem Che...
Grammarly AI-NLP Club #6 - Sequence Tagging using Neural Networks - Artem Che...
 
Multitier holistic Approach for urdu Nastaliq Recognition
Multitier holistic Approach for urdu Nastaliq RecognitionMultitier holistic Approach for urdu Nastaliq Recognition
Multitier holistic Approach for urdu Nastaliq Recognition
 
Deep Reinforcement Learning with Distributional Semantic Rewards for Abstract...
Deep Reinforcement Learning with Distributional Semantic Rewards for Abstract...Deep Reinforcement Learning with Distributional Semantic Rewards for Abstract...
Deep Reinforcement Learning with Distributional Semantic Rewards for Abstract...
 
A NOVEL APPROACH FOR NAMED ENTITY RECOGNITION ON HINDI LANGUAGE USING RESIDUA...
A NOVEL APPROACH FOR NAMED ENTITY RECOGNITION ON HINDI LANGUAGE USING RESIDUA...A NOVEL APPROACH FOR NAMED ENTITY RECOGNITION ON HINDI LANGUAGE USING RESIDUA...
A NOVEL APPROACH FOR NAMED ENTITY RECOGNITION ON HINDI LANGUAGE USING RESIDUA...
 
Diversified Social Media Retrieval for News Stories
Diversified Social Media Retrieval for News StoriesDiversified Social Media Retrieval for News Stories
Diversified Social Media Retrieval for News Stories
 
VOC real world enterprise needs
VOC real world enterprise needsVOC real world enterprise needs
VOC real world enterprise needs
 
Pattern Mining To Unknown Word Extraction (10
Pattern Mining To Unknown Word Extraction (10Pattern Mining To Unknown Word Extraction (10
Pattern Mining To Unknown Word Extraction (10
 
Representation Learning of Vectors of Words and Phrases
Representation Learning of Vectors of Words and PhrasesRepresentation Learning of Vectors of Words and Phrases
Representation Learning of Vectors of Words and Phrases
 
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
BERT: Pre-training of Deep Bidirectional Transformers for Language UnderstandingBERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
 

Similar to A Graph-based Cross-lingual Projection Approach for Weakly Supervised Relation Extraction

A Cross-lingual Annotation Projection-based Self-supervision Approach for Ope...
A Cross-lingual Annotation Projection-based Self-supervision Approach for Ope...A Cross-lingual Annotation Projection-based Self-supervision Approach for Ope...
A Cross-lingual Annotation Projection-based Self-supervision Approach for Ope...Seokhwan Kim
 
Semi-Supervised Autoencoders for Predicting Sentiment Distributions(第 5 回 De...
 Semi-Supervised Autoencoders for Predicting Sentiment Distributions(第 5 回 De... Semi-Supervised Autoencoders for Predicting Sentiment Distributions(第 5 回 De...
Semi-Supervised Autoencoders for Predicting Sentiment Distributions(第 5 回 De...Ohsawa Goodfellow
 
Word Segmentation and Lexical Normalization for Unsegmented Languages
Word Segmentation and Lexical Normalization for Unsegmented LanguagesWord Segmentation and Lexical Normalization for Unsegmented Languages
Word Segmentation and Lexical Normalization for Unsegmented Languageshs0041
 
Deep Learning and Modern Natural Language Processing (AnacondaCon2019)
Deep Learning and Modern Natural Language Processing (AnacondaCon2019)Deep Learning and Modern Natural Language Processing (AnacondaCon2019)
Deep Learning and Modern Natural Language Processing (AnacondaCon2019)Zachary S. Brown
 
Erlang/OTP for Rubyists
Erlang/OTP for RubyistsErlang/OTP for Rubyists
Erlang/OTP for RubyistsSean Cribbs
 
Engineering Intelligent NLP Applications Using Deep Learning – Part 2
Engineering Intelligent NLP Applications Using Deep Learning – Part 2 Engineering Intelligent NLP Applications Using Deep Learning – Part 2
Engineering Intelligent NLP Applications Using Deep Learning – Part 2 Saurabh Kaushik
 
Towards a Marketplace of Open Source Software Data
Towards a Marketplace of Open Source Software DataTowards a Marketplace of Open Source Software Data
Towards a Marketplace of Open Source Software DataFernando Silva Parreiras
 
Deep Learning勉強会@小町研 "Learning Character-level Representations for Part-of-Sp...
Deep Learning勉強会@小町研 "Learning Character-level Representations for Part-of-Sp...Deep Learning勉強会@小町研 "Learning Character-level Representations for Part-of-Sp...
Deep Learning勉強会@小町研 "Learning Character-level Representations for Part-of-Sp...Yuki Tomo
 
Nltk natural language toolkit overview and application @ PyCon.tw 2012
Nltk  natural language toolkit overview and application @ PyCon.tw 2012Nltk  natural language toolkit overview and application @ PyCon.tw 2012
Nltk natural language toolkit overview and application @ PyCon.tw 2012Jimmy Lai
 
TSD2013 PPT.AUTOMATIC MACHINE TRANSLATION EVALUATION WITH PART-OF-SPEECH INFO...
TSD2013 PPT.AUTOMATIC MACHINE TRANSLATION EVALUATION WITH PART-OF-SPEECH INFO...TSD2013 PPT.AUTOMATIC MACHINE TRANSLATION EVALUATION WITH PART-OF-SPEECH INFO...
TSD2013 PPT.AUTOMATIC MACHINE TRANSLATION EVALUATION WITH PART-OF-SPEECH INFO...Lifeng (Aaron) Han
 
MOST (Newsfromthefront 2010)
MOST (Newsfromthefront 2010)MOST (Newsfromthefront 2010)
MOST (Newsfromthefront 2010)STI International
 
Colloquium talk on modal sense classification using a convolutional neural ne...
Colloquium talk on modal sense classification using a convolutional neural ne...Colloquium talk on modal sense classification using a convolutional neural ne...
Colloquium talk on modal sense classification using a convolutional neural ne...Ana Marasović
 
Tiancheng Zhao - 2017 - Learning Discourse-level Diversity for Neural Dialog...
Tiancheng Zhao - 2017 -  Learning Discourse-level Diversity for Neural Dialog...Tiancheng Zhao - 2017 -  Learning Discourse-level Diversity for Neural Dialog...
Tiancheng Zhao - 2017 - Learning Discourse-level Diversity for Neural Dialog...Association for Computational Linguistics
 
Incremental Difference as Feature for Lipreading
Incremental Difference as Feature for LipreadingIncremental Difference as Feature for Lipreading
Incremental Difference as Feature for LipreadingIDES Editor
 
ESWC-2011: S-Match received 7 years award
ESWC-2011: S-Match received 7 years award ESWC-2011: S-Match received 7 years award
ESWC-2011: S-Match received 7 years award Pavel Shvaiko
 

Similar to A Graph-based Cross-lingual Projection Approach for Weakly Supervised Relation Extraction (20)

A Cross-lingual Annotation Projection-based Self-supervision Approach for Ope...
A Cross-lingual Annotation Projection-based Self-supervision Approach for Ope...A Cross-lingual Annotation Projection-based Self-supervision Approach for Ope...
A Cross-lingual Annotation Projection-based Self-supervision Approach for Ope...
 
Semi-Supervised Autoencoders for Predicting Sentiment Distributions(第 5 回 De...
 Semi-Supervised Autoencoders for Predicting Sentiment Distributions(第 5 回 De... Semi-Supervised Autoencoders for Predicting Sentiment Distributions(第 5 回 De...
Semi-Supervised Autoencoders for Predicting Sentiment Distributions(第 5 回 De...
 
Word Segmentation and Lexical Normalization for Unsegmented Languages
Word Segmentation and Lexical Normalization for Unsegmented LanguagesWord Segmentation and Lexical Normalization for Unsegmented Languages
Word Segmentation and Lexical Normalization for Unsegmented Languages
 
Deep Learning and Modern Natural Language Processing (AnacondaCon2019)
Deep Learning and Modern Natural Language Processing (AnacondaCon2019)Deep Learning and Modern Natural Language Processing (AnacondaCon2019)
Deep Learning and Modern Natural Language Processing (AnacondaCon2019)
 
Erlang/OTP for Rubyists
Erlang/OTP for RubyistsErlang/OTP for Rubyists
Erlang/OTP for Rubyists
 
Engineering Intelligent NLP Applications Using Deep Learning – Part 2
Engineering Intelligent NLP Applications Using Deep Learning – Part 2 Engineering Intelligent NLP Applications Using Deep Learning – Part 2
Engineering Intelligent NLP Applications Using Deep Learning – Part 2
 
Towards a Marketplace of Open Source Software Data
Towards a Marketplace of Open Source Software DataTowards a Marketplace of Open Source Software Data
Towards a Marketplace of Open Source Software Data
 
Deep Learning勉強会@小町研 "Learning Character-level Representations for Part-of-Sp...
Deep Learning勉強会@小町研 "Learning Character-level Representations for Part-of-Sp...Deep Learning勉強会@小町研 "Learning Character-level Representations for Part-of-Sp...
Deep Learning勉強会@小町研 "Learning Character-level Representations for Part-of-Sp...
 
Word embedding
Word embedding Word embedding
Word embedding
 
Nltk natural language toolkit overview and application @ PyCon.tw 2012
Nltk  natural language toolkit overview and application @ PyCon.tw 2012Nltk  natural language toolkit overview and application @ PyCon.tw 2012
Nltk natural language toolkit overview and application @ PyCon.tw 2012
 
Deep learning for nlp
Deep learning for nlpDeep learning for nlp
Deep learning for nlp
 
wordembedding.pptx
wordembedding.pptxwordembedding.pptx
wordembedding.pptx
 
TSD2013 PPT.AUTOMATIC MACHINE TRANSLATION EVALUATION WITH PART-OF-SPEECH INFO...
TSD2013 PPT.AUTOMATIC MACHINE TRANSLATION EVALUATION WITH PART-OF-SPEECH INFO...TSD2013 PPT.AUTOMATIC MACHINE TRANSLATION EVALUATION WITH PART-OF-SPEECH INFO...
TSD2013 PPT.AUTOMATIC MACHINE TRANSLATION EVALUATION WITH PART-OF-SPEECH INFO...
 
MOST (Newsfromthefront 2010)
MOST (Newsfromthefront 2010)MOST (Newsfromthefront 2010)
MOST (Newsfromthefront 2010)
 
Colloquium talk on modal sense classification using a convolutional neural ne...
Colloquium talk on modal sense classification using a convolutional neural ne...Colloquium talk on modal sense classification using a convolutional neural ne...
Colloquium talk on modal sense classification using a convolutional neural ne...
 
Knowledge Extraction
Knowledge ExtractionKnowledge Extraction
Knowledge Extraction
 
Tiancheng Zhao - 2017 - Learning Discourse-level Diversity for Neural Dialog...
Tiancheng Zhao - 2017 -  Learning Discourse-level Diversity for Neural Dialog...Tiancheng Zhao - 2017 -  Learning Discourse-level Diversity for Neural Dialog...
Tiancheng Zhao - 2017 - Learning Discourse-level Diversity for Neural Dialog...
 
Incremental Difference as Feature for Lipreading
Incremental Difference as Feature for LipreadingIncremental Difference as Feature for Lipreading
Incremental Difference as Feature for Lipreading
 
ESWC-2011: S-Match received 7 years award
ESWC-2011: S-Match received 7 years award ESWC-2011: S-Match received 7 years award
ESWC-2011: S-Match received 7 years award
 
Dependency-Based Word Embeddings
Dependency-Based Word EmbeddingsDependency-Based Word Embeddings
Dependency-Based Word Embeddings
 

More from Seokhwan Kim

The Eighth Dialog System Technology Challenge (DSTC8)
The Eighth Dialog System Technology Challenge (DSTC8)The Eighth Dialog System Technology Challenge (DSTC8)
The Eighth Dialog System Technology Challenge (DSTC8)Seokhwan Kim
 
Deep Recurrent Neural Networks with Layer-wise Multi-head Attentions for Punc...
Deep Recurrent Neural Networks with Layer-wise Multi-head Attentions for Punc...Deep Recurrent Neural Networks with Layer-wise Multi-head Attentions for Punc...
Deep Recurrent Neural Networks with Layer-wise Multi-head Attentions for Punc...Seokhwan Kim
 
Dynamic Memory Networks for Dialogue Topic Tracking
Dynamic Memory Networks for Dialogue Topic TrackingDynamic Memory Networks for Dialogue Topic Tracking
Dynamic Memory Networks for Dialogue Topic TrackingSeokhwan Kim
 
The Fifth Dialog State Tracking Challenge (DSTC5)
The Fifth Dialog State Tracking Challenge (DSTC5)The Fifth Dialog State Tracking Challenge (DSTC5)
The Fifth Dialog State Tracking Challenge (DSTC5)Seokhwan Kim
 
Natural Language in Human-Robot Interaction
Natural Language in Human-Robot InteractionNatural Language in Human-Robot Interaction
Natural Language in Human-Robot InteractionSeokhwan Kim
 
Exploring Convolutional and Recurrent Neural Networks in Sequential Labelling...
Exploring Convolutional and Recurrent Neural Networks in Sequential Labelling...Exploring Convolutional and Recurrent Neural Networks in Sequential Labelling...
Exploring Convolutional and Recurrent Neural Networks in Sequential Labelling...Seokhwan Kim
 
The Fourth Dialog State Tracking Challenge (DSTC4)
The Fourth Dialog State Tracking Challenge (DSTC4)The Fourth Dialog State Tracking Challenge (DSTC4)
The Fourth Dialog State Tracking Challenge (DSTC4)Seokhwan Kim
 
Wikification of Concept Mentions within Spoken Dialogues Using Domain Constra...
Wikification of Concept Mentions within Spoken Dialogues Using Domain Constra...Wikification of Concept Mentions within Spoken Dialogues Using Domain Constra...
Wikification of Concept Mentions within Spoken Dialogues Using Domain Constra...Seokhwan Kim
 
Towards Improving Dialogue Topic Tracking Performances with Wikification of C...
Towards Improving Dialogue Topic Tracking Performances with Wikification of C...Towards Improving Dialogue Topic Tracking Performances with Wikification of C...
Towards Improving Dialogue Topic Tracking Performances with Wikification of C...Seokhwan Kim
 
A Composite Kernel Approach for Dialog Topic Tracking with Structured Domain ...
A Composite Kernel Approach for Dialog Topic Tracking with Structured Domain ...A Composite Kernel Approach for Dialog Topic Tracking with Structured Domain ...
A Composite Kernel Approach for Dialog Topic Tracking with Structured Domain ...Seokhwan Kim
 
Sequential Labeling for Tracking Dynamic Dialog States
Sequential Labeling for Tracking Dynamic Dialog StatesSequential Labeling for Tracking Dynamic Dialog States
Sequential Labeling for Tracking Dynamic Dialog StatesSeokhwan Kim
 
Wikipedia-based Kernels for Dialogue Topic Tracking
Wikipedia-based Kernels for Dialogue Topic TrackingWikipedia-based Kernels for Dialogue Topic Tracking
Wikipedia-based Kernels for Dialogue Topic TrackingSeokhwan Kim
 
A Graph-based Cross-lingual Projection Approach for Spoken Language Understan...
A Graph-based Cross-lingual Projection Approach for Spoken Language Understan...A Graph-based Cross-lingual Projection Approach for Spoken Language Understan...
A Graph-based Cross-lingual Projection Approach for Spoken Language Understan...Seokhwan Kim
 
MMR-based active machine learning for Bio named entity recognition
MMR-based active machine learning for Bio named entity recognitionMMR-based active machine learning for Bio named entity recognition
MMR-based active machine learning for Bio named entity recognitionSeokhwan Kim
 
A semi-supervised method for efficient construction of statistical spoken lan...
A semi-supervised method for efficient construction of statistical spoken lan...A semi-supervised method for efficient construction of statistical spoken lan...
A semi-supervised method for efficient construction of statistical spoken lan...Seokhwan Kim
 
A spoken dialog system for electronic program guide information access
A spoken dialog system for electronic program guide information accessA spoken dialog system for electronic program guide information access
A spoken dialog system for electronic program guide information accessSeokhwan Kim
 
An alignment-based approach to semi-supervised relation extraction including ...
An alignment-based approach to semi-supervised relation extraction including ...An alignment-based approach to semi-supervised relation extraction including ...
An alignment-based approach to semi-supervised relation extraction including ...Seokhwan Kim
 
An Alignment-based Pattern Representation Model for Information Extraction
An Alignment-based Pattern Representation Model for Information ExtractionAn Alignment-based Pattern Representation Model for Information Extraction
An Alignment-based Pattern Representation Model for Information ExtractionSeokhwan Kim
 
EPG 정보 검색을 위한 예제 기반 자연어 대화 시스템
EPG 정보 검색을 위한 예제 기반 자연어 대화 시스템EPG 정보 검색을 위한 예제 기반 자연어 대화 시스템
EPG 정보 검색을 위한 예제 기반 자연어 대화 시스템Seokhwan Kim
 

More from Seokhwan Kim (19)

The Eighth Dialog System Technology Challenge (DSTC8)
The Eighth Dialog System Technology Challenge (DSTC8)The Eighth Dialog System Technology Challenge (DSTC8)
The Eighth Dialog System Technology Challenge (DSTC8)
 
Deep Recurrent Neural Networks with Layer-wise Multi-head Attentions for Punc...
Deep Recurrent Neural Networks with Layer-wise Multi-head Attentions for Punc...Deep Recurrent Neural Networks with Layer-wise Multi-head Attentions for Punc...
Deep Recurrent Neural Networks with Layer-wise Multi-head Attentions for Punc...
 
Dynamic Memory Networks for Dialogue Topic Tracking
Dynamic Memory Networks for Dialogue Topic TrackingDynamic Memory Networks for Dialogue Topic Tracking
Dynamic Memory Networks for Dialogue Topic Tracking
 
The Fifth Dialog State Tracking Challenge (DSTC5)
The Fifth Dialog State Tracking Challenge (DSTC5)The Fifth Dialog State Tracking Challenge (DSTC5)
The Fifth Dialog State Tracking Challenge (DSTC5)
 
Natural Language in Human-Robot Interaction
Natural Language in Human-Robot InteractionNatural Language in Human-Robot Interaction
Natural Language in Human-Robot Interaction
 
Exploring Convolutional and Recurrent Neural Networks in Sequential Labelling...
Exploring Convolutional and Recurrent Neural Networks in Sequential Labelling...Exploring Convolutional and Recurrent Neural Networks in Sequential Labelling...
Exploring Convolutional and Recurrent Neural Networks in Sequential Labelling...
 
The Fourth Dialog State Tracking Challenge (DSTC4)
The Fourth Dialog State Tracking Challenge (DSTC4)The Fourth Dialog State Tracking Challenge (DSTC4)
The Fourth Dialog State Tracking Challenge (DSTC4)
 
Wikification of Concept Mentions within Spoken Dialogues Using Domain Constra...
Wikification of Concept Mentions within Spoken Dialogues Using Domain Constra...Wikification of Concept Mentions within Spoken Dialogues Using Domain Constra...
Wikification of Concept Mentions within Spoken Dialogues Using Domain Constra...
 
Towards Improving Dialogue Topic Tracking Performances with Wikification of C...
Towards Improving Dialogue Topic Tracking Performances with Wikification of C...Towards Improving Dialogue Topic Tracking Performances with Wikification of C...
Towards Improving Dialogue Topic Tracking Performances with Wikification of C...
 
A Composite Kernel Approach for Dialog Topic Tracking with Structured Domain ...
A Composite Kernel Approach for Dialog Topic Tracking with Structured Domain ...A Composite Kernel Approach for Dialog Topic Tracking with Structured Domain ...
A Composite Kernel Approach for Dialog Topic Tracking with Structured Domain ...
 
Sequential Labeling for Tracking Dynamic Dialog States
Sequential Labeling for Tracking Dynamic Dialog StatesSequential Labeling for Tracking Dynamic Dialog States
Sequential Labeling for Tracking Dynamic Dialog States
 
Wikipedia-based Kernels for Dialogue Topic Tracking
Wikipedia-based Kernels for Dialogue Topic TrackingWikipedia-based Kernels for Dialogue Topic Tracking
Wikipedia-based Kernels for Dialogue Topic Tracking
 
A Graph-based Cross-lingual Projection Approach for Spoken Language Understan...
A Graph-based Cross-lingual Projection Approach for Spoken Language Understan...A Graph-based Cross-lingual Projection Approach for Spoken Language Understan...
A Graph-based Cross-lingual Projection Approach for Spoken Language Understan...
 
MMR-based active machine learning for Bio named entity recognition
MMR-based active machine learning for Bio named entity recognitionMMR-based active machine learning for Bio named entity recognition
MMR-based active machine learning for Bio named entity recognition
 
A semi-supervised method for efficient construction of statistical spoken lan...
A semi-supervised method for efficient construction of statistical spoken lan...A semi-supervised method for efficient construction of statistical spoken lan...
A semi-supervised method for efficient construction of statistical spoken lan...
 
A spoken dialog system for electronic program guide information access
A spoken dialog system for electronic program guide information accessA spoken dialog system for electronic program guide information access
A spoken dialog system for electronic program guide information access
 
An alignment-based approach to semi-supervised relation extraction including ...
An alignment-based approach to semi-supervised relation extraction including ...An alignment-based approach to semi-supervised relation extraction including ...
An alignment-based approach to semi-supervised relation extraction including ...
 
An Alignment-based Pattern Representation Model for Information Extraction
An Alignment-based Pattern Representation Model for Information ExtractionAn Alignment-based Pattern Representation Model for Information Extraction
An Alignment-based Pattern Representation Model for Information Extraction
 
EPG 정보 검색을 위한 예제 기반 자연어 대화 시스템
EPG 정보 검색을 위한 예제 기반 자연어 대화 시스템EPG 정보 검색을 위한 예제 기반 자연어 대화 시스템
EPG 정보 검색을 위한 예제 기반 자연어 대화 시스템
 

Recently uploaded

EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEarley Information Science
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxMalak Abu Hammad
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)wesley chun
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...Neo4j
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024Rafal Los
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationRadu Cotescu
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024The Digital Insurer
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)Gabriella Davis
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonetsnaman860154
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...Martijn de Jong
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking MenDelhi Call girls
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationMichael W. Hawkins
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Scriptwesley chun
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationSafe Software
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdfhans926745
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsJoaquim Jorge
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘RTylerCroy
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfEnterprise Knowledge
 
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CVKhem
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slidespraypatel2
 

Recently uploaded (20)

EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptx
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonets
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
 
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CV
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slides
 

A Graph-based Cross-lingual Projection Approach for Weakly Supervised Relation Extraction

  • 1. A GRAPH-BASED CROSS-LINGUAL PROJECTION APPROACH FOR WEAKLY SUPERVISED RELATION EXTRACTION The 50th Annual Meeting of the Association for Computational Linguistics (ACL 2012) July 11th, 2012, Jeju Seokhwan Kim (Institute for Infocomm Research) Gary Geunbae Lee (POSTECH)
  • 2. Contents • Introduction • Methods  Cross-lingual Annotation Projection for Relation Extraction  Graph-based Projection Approach • Evaluation • Conclusions 2
  • 3. Contents • Introduction • Methods  Cross-lingual Annotation Projection for Relation Extraction  Graph-based Projection Approach • Evaluation • Conclusions 3
  • 4. Problem Definition • Relation Extraction  To identify semantic relations between a pair of entities Birthplace Barack Obama was born in Honolulu , Hawaii . PER LOC LOC  Considered as a classification problem 4
  • 5. Related Work (1) • Supervised Learning  Many supervised machine learning approaches have been successfully applied • (Kambhatla, 2004; Zhou et al., 2005; Zelenko et al., 2003; Culotta and Sorensen, 2004; Bunescu and Mooney, 2005; Zhang et al., 2006) • Semi-supervised Learning  To obtain the annotations of unlabeled instances from the seed information • (Brin, 1999; Riloff and Jones, 1999; Agichtein and Gravano, 2000; Sudo et al, 2003; Yangarber, 2003; Stevenson and Greenwood, 2006; Zhang, 2004; Chen el al., 2006; Zhou et al., 2009) 5
  • 6. Motivation • Resources for Relation Extraction  Supervised/Semi-supervised Approaches • Labeled corpora for supervised learning • Seed instances for semi-supervised learning • Available for only a few languages  ACE 2003 Multilingual Training Dataset • English (252 articles) • Chinese (221 articles) • Arabic (206 articles) • No resources for other languages  Korean 6
  • 7. Related Work (2) • Self-supervised Learning  To obtain the annotated dataset without any human effort  Using the information obtained from external resources • Heuristic-based Method (Banko et al., 2007; Banko et al., 2008) • Wikipedia-based Methods (Wu and Weld, 2010) • Cross-lingual Annotation Projection  To leverage parallel corpora to project the relation annotations on the resource-rich source language to the resource-poor target language (Kim et al., 2010, Kim et al., 2011) 7
  • 8. Contents • Introduction • Methods  Cross-lingual Annotation Projection for Relation Extraction  Graph-based Projection Approach • Implementation • Evaluation • Conclusions 8
  • 9. Overall Architecture Annotation Parallel Projection Corpus Sentences in Sentences in Ls Lt Preprocessing Preprocessing (POS Tagging, (POS Tagging, Parsing) Parsing) NER Word Alignment Relation Projection Extraction Annotated Annotated Sentences in Sentences in Ls Lt 9
  • 10. Direct Projection (Kim et al., 2010) • Annotation • Projection fE (<Barack Obama, Honolulu>) = 1 Barack Obama was born in Honolulu , Hawaii . 버락 오바마 는 하와이 의 호놀룰루 에서 태어났다 (beo-rak-o-ba-ma) (neun) (ha-wa-i) (ui) (ho-nol-rul-ru) (e-seo) (tae-eo-nat-da) fK (<버락 오바마, 호놀룰루>) = 1 10
  • 11. Limitations of Direct Projection • Direct projection approach is still vulnerable to the erroneous inputs generated by preprocessors • Main causes of this limitation  Considering alignment between entity candidates only, not any contextual information  Performed by just a single pass process 11
  • 12. Graph-based Learning • Semi-supervised learning algorithm • Defining a graph  The nodes represent labeled and unlabeled examples in a dataset  The edges reflect the similarity of examples • Learning a labeling function in an iterative manner  It should be close to the given labels on the similar labeled nodes  It should be smooth on the whole graph • Related Work  Graph-based Learning for Relation Extraction (Chen et al, 2006)  Bilingual projection of POS tagging (Das and Petrov, 2011) 12
  • 13. Graph Construction • Graph Nodes  Instance Nodes • Defined for all pairs of entity candidates in both languages • Each instance node has a soft label vector Y = [y+ y-]  Context Nodes • For identifying the relation descriptors of the positive instances • Defined for each trigram which is located between a given entity pair which is semantically related • Each context node has a soft label vector Y = [y+ y-] <ARG1> was born in <ARG2> <ARG1> was born was born in born in <ARG2> 13
  • 14. Graph Construction • Edge Weights  Between instance node and context node in the same language 𝑤 𝑣 𝑖,𝑗 , 𝑢 𝑘 1 𝑖𝑓 𝑣 𝑖𝑗 ℎ𝑎𝑠 𝑢 𝑘 𝑎𝑠 𝑎 𝑐𝑜𝑛𝑡𝑒𝑥𝑡𝑢𝑎𝑙 𝑠𝑢𝑏𝑠𝑒𝑞𝑢𝑒𝑛𝑐𝑒, = 0 𝑜𝑡ℎ𝑒𝑟𝑤𝑖𝑠𝑒.  Between context nodes in a language 𝑘, |𝑢 𝑘 ∩ 𝑢 𝑙 | 𝑤(𝑢 𝑢 𝑙) = 𝐽(𝑢 𝑘, 𝑢 𝑙) = 𝑘 . |𝑢 ∪ 𝑢 𝑙 |  Between context nodes in source and target languages 𝑐𝑜𝑢𝑛𝑡 𝑢 𝑠𝑘 , 𝑢 𝑙𝑡 𝑤(𝑢 𝑠𝑘 , 𝑢 𝑙𝑡 ) = 𝑘 𝑚 , 𝑢𝑡 𝑚 ‍ 𝑐𝑜𝑢𝑛𝑡 𝑢 𝑠 , 𝑢 𝑡 14
  • 16. Label Propagation Initialize T • Algorithm  Input • A transition matrix T • An initial label matrix Y0 Normalize T  Output • The updated label matrix Yt Initialize Y Update Y 16
  • 17. Label Propagation • Executed in three phases 1st phase 2nd phase 3rd phase 17
  • 18. Contents • Introduction • Methods  Cross-lingual Annotation Projection for Relation Extraction  Graph-based Projection Approach • Evaluation • Conclusions 18
  • 19. Implementation • Dataset  English-Korean parallel corpus • 266,982 bi-sentence pairs in English and Korean • Aligned by GIZA++ • Annotation  ReVerb (Fader et al., 2011) • English Open IE system • Label Propagation  Junto Label Propagation Toolkit • Learning  Tree kernel-based SVM classifier • Shortest path dependency kernel (Bunescu and Mooney, 2005) • SVM-Light (Joachims, 1998) 19
  • 20. Evaluation • Dataset  Manually annotated Korean dataset • Obtained from the Web following Bunescu and Mooney(2007)’s work • 500 sentences with manual annotations for four relation types  Acquisition  Birthplace  Inventor Of  Won Prize • Evaluation Metrics  Precision/Recall/F-measure 20
  • 21. Experimental Results • Direct Projection vs. Graph-based Projection Direct Projection Graph-based Projection Type P R F P R F Acquisition 51.6 87.7 64.9 55.3 91.2 68.9 Birthplace 69.8 84.5 76.4 73.8 87.3 80.0 Inventor of 62.4 85.3 72.1 66.3 89.7 76.3 Won Prize 73.3 80.5 76.7 76.4 82.9 79.5 Total 63.9 84.2 72.7 67.7 87.4 76.3 21
  • 22. Experimental Results • Comparisons to other self-supervised approaches  Heuristic-based Approach (Banko et al., 2007; Banko et al., 2008) • Korean Treebank and Syntactic Heuristics  Wikipedia-based Approach (Wu and Weld, 2010) • Korean Wikipedia articles and Infoboxes Approach P R F Heuristic-based 92.31 17.27 29.09 Wikipedia-based 66.67 66.91 66.79 Projection-based 67.69 87.41 76.30 22
  • 23. Contents • Introduction • Methods  Cross-lingual Annotation Projection for Relation Extraction  Graph-based Projection Approach • Evaluation • Conclusions 23
  • 24. Conclusion • Summary  A graph-based projection approach for relation extraction • Label propagation algorithm • On a graph that represents the instance and context features of both the source and target languages  Experimental results show that our approach helps to improve the performances of relation extraction compared to other approaches • Future work  To relieve the high complexity problem of the approach  To deal with more expanded graph structure to improve the extraction performances 24
  • 25. Q&A