SlideShare a Scribd company logo
An alignment-based Approach to Semi-supervised Relation Extraction
                   Including Multiple Arguments
                                                          Seokhwan Kim, Minwoo Jeong, Gary Geunbae Lee, Kwangil Ko, and Zino Lee
                                                               {megaup, stardust, gblee}@postech.ac.kr, {kik, zino}@alticast.com

  Abstract - We present an alignment-based approach to semi-supervised relation extraction task including more than two arguments. We concentrate
  on improving not only the precision of the extracted result, but also on the coverage of the method. Our relation extraction method is based on an
  alignment-based pattern matching approach which provides more flexibility of the method. In addition, we extract all relationships including two or
  more arguments at once in order to obtain the integrated result with high quality. We present experimental results which indicate the effectiveness of
  our method.

                                                                                                                            Alignment-based Information Extraction
v Information Extraction                                                                                        v Sentence Alignment for Information Extraction                                                                                 w Matrix Computation
w Extracting the defined number of relevant                                                                     w Example                                                                                                                                       M i 1, j 1 sim i
arguments from natural language documents                                                                                the character <ROLE> portrayed by <ACTOR> in the television series <PROGRAM> is
                                                                                                                                                                                                                                                                                                  1, j 1
                                                                                                                                                                                                                                                                M i 1, j gp
w Subtasks                                                                                                                                                                                                                                       M i, j     max
                                                                                                                                                                                                                                                                M i , j 1 gp
 # of arguments                                              subtask                                                                                                                                                                                            0
        1                                           named-entity recognition      character Michael Scofield portrayed by Wentworth Miller in the TV series Prison Break is


                                                                                                                                                                                                                                                              {
                                                                                                                                                                                                                                                                      1, if PTNi = RAWj
        2                                           binary relation extraction w Alignment Matrix
                                                                                                                                                                                                                                                   simi,j =              or PTNi = <label>
   more than 2                                      relation/event extraction         character
                                                                                                the character Michael Scofield portrayed by Wentworth Miller in the TV series Prison Break is
                                                                                                 0      1        1       1         1     1      1      1     1 1 1       1      1      1   1                                                                          0, otherwise
                                                                                                                               <ROLE>       1     1        2      2   2   2   2   2   2   2   2   2                       2     2   2

w Approaches                                                                                                                   portrayed
                                                                                                                                   by
                                                                                                                                            1
                                                                                                                                            1
                                                                                                                                                  1
                                                                                                                                                  1
                                                                                                                                                           2
                                                                                                                                                           2
                                                                                                                                                                  2
                                                                                                                                                                  2
                                                                                                                                                                      3
                                                                                                                                                                      3
                                                                                                                                                                          3
                                                                                                                                                                          4
                                                                                                                                                                              3
                                                                                                                                                                              4
                                                                                                                                                                                  3
                                                                                                                                                                                  4
                                                                                                                                                                                      3
                                                                                                                                                                                      4
                                                                                                                                                                                          3
                                                                                                                                                                                          4
                                                                                                                                                                                              3
                                                                                                                                                                                              4
                                                                                                                                                                                                  3
                                                                                                                                                                                                  4
                                                                                                                                                                                                                          3
                                                                                                                                                                                                                          4
                                                                                                                                                                                                                                3
                                                                                                                                                                                                                                4
                                                                                                                                                                                                                                    3
                                                                                                                                                                                                                                    4
                                                                                                                                                                                                                                                w Trace Back
                                                                                                                              <ACTOR>       1     2        2      3   3   4   5   5   5   5   5   5                       5     5   5
  w Supervised                                                                                                                      in
                                                                                                                                   the
                                                                                                                                            1
                                                                                                                                            1
                                                                                                                                                  2
                                                                                                                                                  2
                                                                                                                                                           2
                                                                                                                                                           2
                                                                                                                                                                  3
                                                                                                                                                                  3
                                                                                                                                                                      3
                                                                                                                                                                      3
                                                                                                                                                                          4
                                                                                                                                                                          4
                                                                                                                                                                              5
                                                                                                                                                                              5
                                                                                                                                                                                  5
                                                                                                                                                                                  5
                                                                                                                                                                                      6
                                                                                                                                                                                      6
                                                                                                                                                                                          6
                                                                                                                                                                                          7
                                                                                                                                                                                              6
                                                                                                                                                                                              7
                                                                                                                                                                                                  6
                                                                                                                                                                                                  7
                                                                                                                                                                                                                          6
                                                                                                                                                                                                                          7
                                                                                                                                                                                                                                6
                                                                                                                                                                                                                                7
                                                                                                                                                                                                                                    6
                                                                                                                                                                                                                                    7
                                                                                                                                                                                                                                                             M i,j                next position
  w Un/Semi-Supervised                                                                                                         television
                                                                                                                                 series
                                                                                                                                            1
                                                                                                                                            1
                                                                                                                                                  2
                                                                                                                                                  2
                                                                                                                                                           2
                                                                                                                                                           2
                                                                                                                                                                  3
                                                                                                                                                                  3
                                                                                                                                                                      3
                                                                                                                                                                      3
                                                                                                                                                                          4
                                                                                                                                                                          4
                                                                                                                                                                              5
                                                                                                                                                                              5
                                                                                                                                                                                  5
                                                                                                                                                                                  5
                                                                                                                                                                                      6
                                                                                                                                                                                      6
                                                                                                                                                                                          7
                                                                                                                                                                                          7
                                                                                                                                                                                              7
                                                                                                                                                                                              7
                                                                                                                                                                                                  7
                                                                                                                                                                                                  8
                                                                                                                                                                                                                          7
                                                                                                                                                                                                                          8
                                                                                                                                                                                                                                7
                                                                                                                                                                                                                                8
                                                                                                                                                                                                                                    7
                                                                                                                                                                                                                                    8                   M i,j-1 +gp                  [i, j-1]
                                                                                                                                                                                                                                                       M i-1,j-1 + simi,j           [i-1, j-1]
                                                                                                                             <PROGRAM>      1     2        3      3   4   4   5   6   6   7   8   8                       9     9   9
                                                                                                                                    is      1     2        3      3   4   4   5   6   6   7   8   8                       9     9   10

                                                                                                                                                                                                                                                        M i-1,j +gp                  [i-1, j]

                                                                       Semi-supervised Relation Extraction Including Multiple Arguments
 v Overall Architecture                                                                                                               v Context Patterns Extraction                                                             v Alignment-based Verification
                                                                                                                                      1) Searching the sentences containing all                                                 w Aligning between two candidate arguments
                                                                                                                                      arguments of each tuple in source documents
                                                          Seed Data
                                                                                                                                      2) Segmenting out subpart of the sentence with                         max{M(A, B)}× 2
                                                                          n arguments
                                                                                                                                                                                         similarity(A,B) =
                                                                                                                                      the window size w                                                    length(A) + length(B)
                                                                                                                                      3) Replacing the parts of arguments in the sub-
   Seed Data    Seed Data              Seed Data           Seed Data   Seed Data          Seed Data         Seed Data
                                                                                                                                                                                      w Tuple clustering based on
                             2 arguments                                        k arguments                        n args
                                                                                                                                      sentence with argument labels
   Extracting   Extracting
                               …       Extracting
                                                      …   Extracting   Extracting
                                                                                    …     Extracting
                                                                                                       …    Extracting


                                                                                                                                      v Relation Extraction based on                    sim(tuple1, tuple2) =
    Context      Context                Context            Context      Context            Context           Context
    Patterns     Patterns               Patterns           Patterns     Patterns           Patterns          Patterns


    Relation     Relation               Relation           Relation     Relation           Relation          Relation
                                                                                                                                      Pairwise Alignment                                                                                        |args|
                                                                                                                                                                                                                                                                                     tuple2i)
                                                                                                                                                                                                                                                i=1 similarity(tuple1i,
   Extraction   Extraction             Extraction         Extraction   Extraction         Extraction        Extraction


                                                                                                                                      w Alignment score
                                                                                                                                                                                                                                                           |arguments|
                                           Validation &                                                                                                       max{M(PTN, RAW)}
                                            Integration
                                                                            Results
                                                                                                                                            score(PTN, RAW) =                                                                   w Selecting the most probable tuple for each
                                                                                              n arguments
                                                                                                                                                                 length(PTN)
                                                                                                                                                                                                                                cluster

                                                                                                                                                Experimental Results
v Experimental Setup
w 930 Korean news documents (13,175 sents) about TV series
w Only a tuple with 4 arguments (CHANNEL, PROGRAM, ACTOR, ROLE) is used as a seed
                                                                                                                                                                                                  v Comparison on the Coverage for
w Each result is collected after the first iteration and evaluated manually
                                                                                                                                                                                                  Various Threshold Values
v Result of the verification                                                                                   v Result of the integration
                                                                                                                                                                                                                         90

                                                                                                                                                                                                                         80

                                      before          after                                                                                       with only
    type of                                                                                                                                                              with all
                                                                                                                                                                                                                         70
                                  verification    verification                                                               type of               binary
   relations                                                                                                                                                          intermediates                                      60
                                |tuples|     P  |tuples|    P                                                               relations             relations
                                                                                                                                                                                                  # of correct results




    (A,R)                         249 36.55        79     73.42                                                                                 |tuples|         P   |tuples|   P
                                                                                                                                                                                                                         50

    (P,R)                          19     52.63    17     58.82                                                           (P,A,R)                   9          77.78     9    88.89                                      40

    (P,A)                          10       60     10       60                                                            (C,P,R)                  11          81.82    16    87.5                                       30
    (C,P)                          12     33.33     6     66.67                                                           (C,P,A)                  12          58.33     9    77.78                                      20
   (P,A,R)                          7     42.86     5       60                                                           (C,P,A,R)                  8          87.5     16    87.5                                                                                             including 2 arguments
   (C,P,R)                         18     55.56    16     81.25                                                                                                                                                          10                                                    including 3 arguments
                                                                                                                                                                                                                                                                               including 4 arguments
   (C,P,A)                          8      62.5     8       75                                                  w th = 0.85                                                                                              0
                                                                                                                                                                                                                         1.00            0.95   0.90         0.85       0.80           0.75            0.70
  (C,P,A,R)                        15       60     14     85.71                                                 w C(Channel), P(Program), A(Actor), R(Role)                                                                                               threshold

More Related Content

What's hot

Ch2 rev[1]
Ch2 rev[1]Ch2 rev[1]
Ch2 rev[1]
wb2008xp
 
Insulin & TMOF_Yeast Comparison
Insulin & TMOF_Yeast ComparisonInsulin & TMOF_Yeast Comparison
Insulin & TMOF_Yeast Comparison
entogenex
 
An Investigation of Self-Interference Reduction Strategy in Correlated SM-OFD...
An Investigation of Self-Interference Reduction Strategy in Correlated SM-OFD...An Investigation of Self-Interference Reduction Strategy in Correlated SM-OFD...
An Investigation of Self-Interference Reduction Strategy in Correlated SM-OFD...
Rosdiadee Nordin
 
Mekong BFP Review
Mekong BFP ReviewMekong BFP Review
[iGEM Workshop] Coming up with a Project
[iGEM Workshop] Coming up with a Project[iGEM Workshop] Coming up with a Project
[iGEM Workshop] Coming up with a Project
igemiitkgp
 
Cosine modulated filter bank transmultiplexer using kaiser window
Cosine modulated filter bank transmultiplexer using kaiser windowCosine modulated filter bank transmultiplexer using kaiser window
Cosine modulated filter bank transmultiplexer using kaiser window
IAEME Publication
 
The Impact of Methods and Techniques on Outcomes from Agile Software Developm...
The Impact of Methods and Techniques on Outcomes from Agile Software Developm...The Impact of Methods and Techniques on Outcomes from Agile Software Developm...
The Impact of Methods and Techniques on Outcomes from Agile Software Developm...
David Parsons
 
Acs2006 Mrm
Acs2006 MrmAcs2006 Mrm
Acs2006 Mrm
jcruzsilva
 
F0543645
F0543645F0543645
F0543645
IOSR Journals
 

What's hot (9)

Ch2 rev[1]
Ch2 rev[1]Ch2 rev[1]
Ch2 rev[1]
 
Insulin & TMOF_Yeast Comparison
Insulin & TMOF_Yeast ComparisonInsulin & TMOF_Yeast Comparison
Insulin & TMOF_Yeast Comparison
 
An Investigation of Self-Interference Reduction Strategy in Correlated SM-OFD...
An Investigation of Self-Interference Reduction Strategy in Correlated SM-OFD...An Investigation of Self-Interference Reduction Strategy in Correlated SM-OFD...
An Investigation of Self-Interference Reduction Strategy in Correlated SM-OFD...
 
Mekong BFP Review
Mekong BFP ReviewMekong BFP Review
Mekong BFP Review
 
[iGEM Workshop] Coming up with a Project
[iGEM Workshop] Coming up with a Project[iGEM Workshop] Coming up with a Project
[iGEM Workshop] Coming up with a Project
 
Cosine modulated filter bank transmultiplexer using kaiser window
Cosine modulated filter bank transmultiplexer using kaiser windowCosine modulated filter bank transmultiplexer using kaiser window
Cosine modulated filter bank transmultiplexer using kaiser window
 
The Impact of Methods and Techniques on Outcomes from Agile Software Developm...
The Impact of Methods and Techniques on Outcomes from Agile Software Developm...The Impact of Methods and Techniques on Outcomes from Agile Software Developm...
The Impact of Methods and Techniques on Outcomes from Agile Software Developm...
 
Acs2006 Mrm
Acs2006 MrmAcs2006 Mrm
Acs2006 Mrm
 
F0543645
F0543645F0543645
F0543645
 

Viewers also liked

Sequential Labeling for Tracking Dynamic Dialog States
Sequential Labeling for Tracking Dynamic Dialog StatesSequential Labeling for Tracking Dynamic Dialog States
Sequential Labeling for Tracking Dynamic Dialog States
Seokhwan Kim
 
A Composite Kernel Approach for Dialog Topic Tracking with Structured Domain ...
A Composite Kernel Approach for Dialog Topic Tracking with Structured Domain ...A Composite Kernel Approach for Dialog Topic Tracking with Structured Domain ...
A Composite Kernel Approach for Dialog Topic Tracking with Structured Domain ...
Seokhwan Kim
 
An Alignment-based Pattern Representation Model for Information Extraction
An Alignment-based Pattern Representation Model for Information ExtractionAn Alignment-based Pattern Representation Model for Information Extraction
An Alignment-based Pattern Representation Model for Information Extraction
Seokhwan Kim
 
Towards Improving Dialogue Topic Tracking Performances with Wikification of C...
Towards Improving Dialogue Topic Tracking Performances with Wikification of C...Towards Improving Dialogue Topic Tracking Performances with Wikification of C...
Towards Improving Dialogue Topic Tracking Performances with Wikification of C...
Seokhwan Kim
 
Wikipedia-based Kernels for Dialogue Topic Tracking
Wikipedia-based Kernels for Dialogue Topic TrackingWikipedia-based Kernels for Dialogue Topic Tracking
Wikipedia-based Kernels for Dialogue Topic Tracking
Seokhwan Kim
 
Natural Language in Human-Robot Interaction
Natural Language in Human-Robot InteractionNatural Language in Human-Robot Interaction
Natural Language in Human-Robot Interaction
Seokhwan Kim
 
AI-powered Chatbots - what they are and where they're going
AI-powered Chatbots - what they are and where they're goingAI-powered Chatbots - what they are and where they're going
AI-powered Chatbots - what they are and where they're going
Keith Klundt
 
Requisitos para elaborar una visión y una misión
Requisitos  para elaborar una visión y una misiónRequisitos  para elaborar una visión y una misión
Requisitos para elaborar una visión y una misión
gissella_zambrano
 

Viewers also liked (8)

Sequential Labeling for Tracking Dynamic Dialog States
Sequential Labeling for Tracking Dynamic Dialog StatesSequential Labeling for Tracking Dynamic Dialog States
Sequential Labeling for Tracking Dynamic Dialog States
 
A Composite Kernel Approach for Dialog Topic Tracking with Structured Domain ...
A Composite Kernel Approach for Dialog Topic Tracking with Structured Domain ...A Composite Kernel Approach for Dialog Topic Tracking with Structured Domain ...
A Composite Kernel Approach for Dialog Topic Tracking with Structured Domain ...
 
An Alignment-based Pattern Representation Model for Information Extraction
An Alignment-based Pattern Representation Model for Information ExtractionAn Alignment-based Pattern Representation Model for Information Extraction
An Alignment-based Pattern Representation Model for Information Extraction
 
Towards Improving Dialogue Topic Tracking Performances with Wikification of C...
Towards Improving Dialogue Topic Tracking Performances with Wikification of C...Towards Improving Dialogue Topic Tracking Performances with Wikification of C...
Towards Improving Dialogue Topic Tracking Performances with Wikification of C...
 
Wikipedia-based Kernels for Dialogue Topic Tracking
Wikipedia-based Kernels for Dialogue Topic TrackingWikipedia-based Kernels for Dialogue Topic Tracking
Wikipedia-based Kernels for Dialogue Topic Tracking
 
Natural Language in Human-Robot Interaction
Natural Language in Human-Robot InteractionNatural Language in Human-Robot Interaction
Natural Language in Human-Robot Interaction
 
AI-powered Chatbots - what they are and where they're going
AI-powered Chatbots - what they are and where they're goingAI-powered Chatbots - what they are and where they're going
AI-powered Chatbots - what they are and where they're going
 
Requisitos para elaborar una visión y una misión
Requisitos  para elaborar una visión y una misiónRequisitos  para elaborar una visión y una misión
Requisitos para elaborar una visión y una misión
 

More from Seokhwan Kim

The Eighth Dialog System Technology Challenge (DSTC8)
The Eighth Dialog System Technology Challenge (DSTC8)The Eighth Dialog System Technology Challenge (DSTC8)
The Eighth Dialog System Technology Challenge (DSTC8)
Seokhwan Kim
 
Deep Recurrent Neural Networks with Layer-wise Multi-head Attentions for Punc...
Deep Recurrent Neural Networks with Layer-wise Multi-head Attentions for Punc...Deep Recurrent Neural Networks with Layer-wise Multi-head Attentions for Punc...
Deep Recurrent Neural Networks with Layer-wise Multi-head Attentions for Punc...
Seokhwan Kim
 
Dynamic Memory Networks for Dialogue Topic Tracking
Dynamic Memory Networks for Dialogue Topic TrackingDynamic Memory Networks for Dialogue Topic Tracking
Dynamic Memory Networks for Dialogue Topic Tracking
Seokhwan Kim
 
The Fifth Dialog State Tracking Challenge (DSTC5)
The Fifth Dialog State Tracking Challenge (DSTC5)The Fifth Dialog State Tracking Challenge (DSTC5)
The Fifth Dialog State Tracking Challenge (DSTC5)
Seokhwan Kim
 
Exploring Convolutional and Recurrent Neural Networks in Sequential Labelling...
Exploring Convolutional and Recurrent Neural Networks in Sequential Labelling...Exploring Convolutional and Recurrent Neural Networks in Sequential Labelling...
Exploring Convolutional and Recurrent Neural Networks in Sequential Labelling...
Seokhwan Kim
 
The Fourth Dialog State Tracking Challenge (DSTC4)
The Fourth Dialog State Tracking Challenge (DSTC4)The Fourth Dialog State Tracking Challenge (DSTC4)
The Fourth Dialog State Tracking Challenge (DSTC4)
Seokhwan Kim
 
Wikification of Concept Mentions within Spoken Dialogues Using Domain Constra...
Wikification of Concept Mentions within Spoken Dialogues Using Domain Constra...Wikification of Concept Mentions within Spoken Dialogues Using Domain Constra...
Wikification of Concept Mentions within Spoken Dialogues Using Domain Constra...
Seokhwan Kim
 
A Graph-based Cross-lingual Projection Approach for Spoken Language Understan...
A Graph-based Cross-lingual Projection Approach for Spoken Language Understan...A Graph-based Cross-lingual Projection Approach for Spoken Language Understan...
A Graph-based Cross-lingual Projection Approach for Spoken Language Understan...
Seokhwan Kim
 
A Graph-based Cross-lingual Projection Approach for Weakly Supervised Relatio...
A Graph-based Cross-lingual Projection Approach for Weakly Supervised Relatio...A Graph-based Cross-lingual Projection Approach for Weakly Supervised Relatio...
A Graph-based Cross-lingual Projection Approach for Weakly Supervised Relatio...
Seokhwan Kim
 
MMR-based active machine learning for Bio named entity recognition
MMR-based active machine learning for Bio named entity recognitionMMR-based active machine learning for Bio named entity recognition
MMR-based active machine learning for Bio named entity recognitionSeokhwan Kim
 
A semi-supervised method for efficient construction of statistical spoken lan...
A semi-supervised method for efficient construction of statistical spoken lan...A semi-supervised method for efficient construction of statistical spoken lan...
A semi-supervised method for efficient construction of statistical spoken lan...
Seokhwan Kim
 
A spoken dialog system for electronic program guide information access
A spoken dialog system for electronic program guide information accessA spoken dialog system for electronic program guide information access
A spoken dialog system for electronic program guide information access
Seokhwan Kim
 
EPG 정보 검색을 위한 예제 기반 자연어 대화 시스템
EPG 정보 검색을 위한 예제 기반 자연어 대화 시스템EPG 정보 검색을 위한 예제 기반 자연어 대화 시스템
EPG 정보 검색을 위한 예제 기반 자연어 대화 시스템Seokhwan Kim
 
A Cross-Lingual Annotation Projection Approach for Relation Detection
A Cross-Lingual Annotation Projection Approach for Relation DetectionA Cross-Lingual Annotation Projection Approach for Relation Detection
A Cross-Lingual Annotation Projection Approach for Relation Detection
Seokhwan Kim
 
A Cross-lingual Annotation Projection-based Self-supervision Approach for Ope...
A Cross-lingual Annotation Projection-based Self-supervision Approach for Ope...A Cross-lingual Annotation Projection-based Self-supervision Approach for Ope...
A Cross-lingual Annotation Projection-based Self-supervision Approach for Ope...
Seokhwan Kim
 

More from Seokhwan Kim (15)

The Eighth Dialog System Technology Challenge (DSTC8)
The Eighth Dialog System Technology Challenge (DSTC8)The Eighth Dialog System Technology Challenge (DSTC8)
The Eighth Dialog System Technology Challenge (DSTC8)
 
Deep Recurrent Neural Networks with Layer-wise Multi-head Attentions for Punc...
Deep Recurrent Neural Networks with Layer-wise Multi-head Attentions for Punc...Deep Recurrent Neural Networks with Layer-wise Multi-head Attentions for Punc...
Deep Recurrent Neural Networks with Layer-wise Multi-head Attentions for Punc...
 
Dynamic Memory Networks for Dialogue Topic Tracking
Dynamic Memory Networks for Dialogue Topic TrackingDynamic Memory Networks for Dialogue Topic Tracking
Dynamic Memory Networks for Dialogue Topic Tracking
 
The Fifth Dialog State Tracking Challenge (DSTC5)
The Fifth Dialog State Tracking Challenge (DSTC5)The Fifth Dialog State Tracking Challenge (DSTC5)
The Fifth Dialog State Tracking Challenge (DSTC5)
 
Exploring Convolutional and Recurrent Neural Networks in Sequential Labelling...
Exploring Convolutional and Recurrent Neural Networks in Sequential Labelling...Exploring Convolutional and Recurrent Neural Networks in Sequential Labelling...
Exploring Convolutional and Recurrent Neural Networks in Sequential Labelling...
 
The Fourth Dialog State Tracking Challenge (DSTC4)
The Fourth Dialog State Tracking Challenge (DSTC4)The Fourth Dialog State Tracking Challenge (DSTC4)
The Fourth Dialog State Tracking Challenge (DSTC4)
 
Wikification of Concept Mentions within Spoken Dialogues Using Domain Constra...
Wikification of Concept Mentions within Spoken Dialogues Using Domain Constra...Wikification of Concept Mentions within Spoken Dialogues Using Domain Constra...
Wikification of Concept Mentions within Spoken Dialogues Using Domain Constra...
 
A Graph-based Cross-lingual Projection Approach for Spoken Language Understan...
A Graph-based Cross-lingual Projection Approach for Spoken Language Understan...A Graph-based Cross-lingual Projection Approach for Spoken Language Understan...
A Graph-based Cross-lingual Projection Approach for Spoken Language Understan...
 
A Graph-based Cross-lingual Projection Approach for Weakly Supervised Relatio...
A Graph-based Cross-lingual Projection Approach for Weakly Supervised Relatio...A Graph-based Cross-lingual Projection Approach for Weakly Supervised Relatio...
A Graph-based Cross-lingual Projection Approach for Weakly Supervised Relatio...
 
MMR-based active machine learning for Bio named entity recognition
MMR-based active machine learning for Bio named entity recognitionMMR-based active machine learning for Bio named entity recognition
MMR-based active machine learning for Bio named entity recognition
 
A semi-supervised method for efficient construction of statistical spoken lan...
A semi-supervised method for efficient construction of statistical spoken lan...A semi-supervised method for efficient construction of statistical spoken lan...
A semi-supervised method for efficient construction of statistical spoken lan...
 
A spoken dialog system for electronic program guide information access
A spoken dialog system for electronic program guide information accessA spoken dialog system for electronic program guide information access
A spoken dialog system for electronic program guide information access
 
EPG 정보 검색을 위한 예제 기반 자연어 대화 시스템
EPG 정보 검색을 위한 예제 기반 자연어 대화 시스템EPG 정보 검색을 위한 예제 기반 자연어 대화 시스템
EPG 정보 검색을 위한 예제 기반 자연어 대화 시스템
 
A Cross-Lingual Annotation Projection Approach for Relation Detection
A Cross-Lingual Annotation Projection Approach for Relation DetectionA Cross-Lingual Annotation Projection Approach for Relation Detection
A Cross-Lingual Annotation Projection Approach for Relation Detection
 
A Cross-lingual Annotation Projection-based Self-supervision Approach for Ope...
A Cross-lingual Annotation Projection-based Self-supervision Approach for Ope...A Cross-lingual Annotation Projection-based Self-supervision Approach for Ope...
A Cross-lingual Annotation Projection-based Self-supervision Approach for Ope...
 

Recently uploaded

Sustainable Logistics for Cost Reduction_ IPLTech Electric's Eco-Friendly Tra...
Sustainable Logistics for Cost Reduction_ IPLTech Electric's Eco-Friendly Tra...Sustainable Logistics for Cost Reduction_ IPLTech Electric's Eco-Friendly Tra...
Sustainable Logistics for Cost Reduction_ IPLTech Electric's Eco-Friendly Tra...
IPLTech Electric
 
Prescriptive analytics BA4206 Anna University PPT
Prescriptive analytics BA4206 Anna University PPTPrescriptive analytics BA4206 Anna University PPT
Prescriptive analytics BA4206 Anna University PPT
Freelance
 
Unlocking WhatsApp Marketing with HubSpot: Integrating Messaging into Your Ma...
Unlocking WhatsApp Marketing with HubSpot: Integrating Messaging into Your Ma...Unlocking WhatsApp Marketing with HubSpot: Integrating Messaging into Your Ma...
Unlocking WhatsApp Marketing with HubSpot: Integrating Messaging into Your Ma...
Niswey
 
Science Around Us Module 2 Matter Around Us
Science Around Us Module 2 Matter Around UsScience Around Us Module 2 Matter Around Us
Science Around Us Module 2 Matter Around Us
PennapaKeavsiri
 
The Most Inspiring Entrepreneurs to Follow in 2024.pdf
The Most Inspiring Entrepreneurs to Follow in 2024.pdfThe Most Inspiring Entrepreneurs to Follow in 2024.pdf
The Most Inspiring Entrepreneurs to Follow in 2024.pdf
thesiliconleaders
 
AI Transformation Playbook: Thinking AI-First for Your Business
AI Transformation Playbook: Thinking AI-First for Your BusinessAI Transformation Playbook: Thinking AI-First for Your Business
AI Transformation Playbook: Thinking AI-First for Your Business
Arijit Dutta
 
❼❷⓿❺❻❷❽❷❼❽ Dpboss Matka Result Satta Matka Guessing Satta Fix jodi Kalyan Fin...
❼❷⓿❺❻❷❽❷❼❽ Dpboss Matka Result Satta Matka Guessing Satta Fix jodi Kalyan Fin...❼❷⓿❺❻❷❽❷❼❽ Dpboss Matka Result Satta Matka Guessing Satta Fix jodi Kalyan Fin...
❼❷⓿❺❻❷❽❷❼❽ Dpboss Matka Result Satta Matka Guessing Satta Fix jodi Kalyan Fin...
❼❷⓿❺❻❷❽❷❼❽ Dpboss Kalyan Satta Matka Guessing Matka Result Main Bazar chart
 
Satta Matka Dpboss Kalyan Matka Results Kalyan Chart
Satta Matka Dpboss Kalyan Matka Results Kalyan ChartSatta Matka Dpboss Kalyan Matka Results Kalyan Chart
Satta Matka Dpboss Kalyan Matka Results Kalyan Chart
Satta Matka Dpboss Kalyan Matka Results
 
Satta Matka Dpboss Kalyan Matka Results Kalyan Chart
Satta Matka Dpboss Kalyan Matka Results Kalyan ChartSatta Matka Dpboss Kalyan Matka Results Kalyan Chart
Satta Matka Dpboss Kalyan Matka Results Kalyan Chart
Satta Matka Dpboss Kalyan Matka Results
 
Call8328958814 satta matka Kalyan result satta guessing
Call8328958814 satta matka Kalyan result satta guessingCall8328958814 satta matka Kalyan result satta guessing
Call8328958814 satta matka Kalyan result satta guessing
➑➌➋➑➒➎➑➑➊➍
 
欧洲杯投注-欧洲杯投注外围盘口-欧洲杯投注盘口app|【​网址​🎉ac22.net🎉​】
欧洲杯投注-欧洲杯投注外围盘口-欧洲杯投注盘口app|【​网址​🎉ac22.net🎉​】欧洲杯投注-欧洲杯投注外围盘口-欧洲杯投注盘口app|【​网址​🎉ac22.net🎉​】
欧洲杯投注-欧洲杯投注外围盘口-欧洲杯投注盘口app|【​网址​🎉ac22.net🎉​】
concepsionchomo153
 
Satta Matka Dpboss Kalyan Matka Results Kalyan Chart
Satta Matka Dpboss Kalyan Matka Results Kalyan ChartSatta Matka Dpboss Kalyan Matka Results Kalyan Chart
Satta Matka Dpboss Kalyan Matka Results Kalyan Chart
Satta Matka Dpboss Kalyan Matka Results
 
Profiles of Iconic Fashion Personalities.pdf
Profiles of Iconic Fashion Personalities.pdfProfiles of Iconic Fashion Personalities.pdf
Profiles of Iconic Fashion Personalities.pdf
TTop Threads
 
Kirill Klip GEM Royalty TNR Gold Copper Presentation
Kirill Klip GEM Royalty TNR Gold Copper PresentationKirill Klip GEM Royalty TNR Gold Copper Presentation
Kirill Klip GEM Royalty TNR Gold Copper Presentation
Kirill Klip
 
1Q24_HYUNDAI CAPITAL SERVICES INC. AND SUBSIDIARIES
1Q24_HYUNDAI CAPITAL SERVICES INC. AND SUBSIDIARIES1Q24_HYUNDAI CAPITAL SERVICES INC. AND SUBSIDIARIES
1Q24_HYUNDAI CAPITAL SERVICES INC. AND SUBSIDIARIES
irhcs
 
State of D2C in India: A Logistics Update
State of D2C in India: A Logistics UpdateState of D2C in India: A Logistics Update
State of D2C in India: A Logistics Update
RedSeer
 
8328958814KALYAN MATKA | MATKA RESULT | KALYAN
8328958814KALYAN MATKA | MATKA RESULT | KALYAN8328958814KALYAN MATKA | MATKA RESULT | KALYAN
8328958814KALYAN MATKA | MATKA RESULT | KALYAN
➑➌➋➑➒➎➑➑➊➍
 
Digital Transformation Frameworks: Driving Digital Excellence
Digital Transformation Frameworks: Driving Digital ExcellenceDigital Transformation Frameworks: Driving Digital Excellence
Digital Transformation Frameworks: Driving Digital Excellence
Operational Excellence Consulting
 
Pro Tips for Effortless Contract Management
Pro Tips for Effortless Contract ManagementPro Tips for Effortless Contract Management
Pro Tips for Effortless Contract Management
Eternity Paralegal Services
 
PM Surya Ghar Muft Bijli Yojana: Online Application, Eligibility, Subsidies &...
PM Surya Ghar Muft Bijli Yojana: Online Application, Eligibility, Subsidies &...PM Surya Ghar Muft Bijli Yojana: Online Application, Eligibility, Subsidies &...
PM Surya Ghar Muft Bijli Yojana: Online Application, Eligibility, Subsidies &...
Ksquare Energy Pvt. Ltd.
 

Recently uploaded (20)

Sustainable Logistics for Cost Reduction_ IPLTech Electric's Eco-Friendly Tra...
Sustainable Logistics for Cost Reduction_ IPLTech Electric's Eco-Friendly Tra...Sustainable Logistics for Cost Reduction_ IPLTech Electric's Eco-Friendly Tra...
Sustainable Logistics for Cost Reduction_ IPLTech Electric's Eco-Friendly Tra...
 
Prescriptive analytics BA4206 Anna University PPT
Prescriptive analytics BA4206 Anna University PPTPrescriptive analytics BA4206 Anna University PPT
Prescriptive analytics BA4206 Anna University PPT
 
Unlocking WhatsApp Marketing with HubSpot: Integrating Messaging into Your Ma...
Unlocking WhatsApp Marketing with HubSpot: Integrating Messaging into Your Ma...Unlocking WhatsApp Marketing with HubSpot: Integrating Messaging into Your Ma...
Unlocking WhatsApp Marketing with HubSpot: Integrating Messaging into Your Ma...
 
Science Around Us Module 2 Matter Around Us
Science Around Us Module 2 Matter Around UsScience Around Us Module 2 Matter Around Us
Science Around Us Module 2 Matter Around Us
 
The Most Inspiring Entrepreneurs to Follow in 2024.pdf
The Most Inspiring Entrepreneurs to Follow in 2024.pdfThe Most Inspiring Entrepreneurs to Follow in 2024.pdf
The Most Inspiring Entrepreneurs to Follow in 2024.pdf
 
AI Transformation Playbook: Thinking AI-First for Your Business
AI Transformation Playbook: Thinking AI-First for Your BusinessAI Transformation Playbook: Thinking AI-First for Your Business
AI Transformation Playbook: Thinking AI-First for Your Business
 
❼❷⓿❺❻❷❽❷❼❽ Dpboss Matka Result Satta Matka Guessing Satta Fix jodi Kalyan Fin...
❼❷⓿❺❻❷❽❷❼❽ Dpboss Matka Result Satta Matka Guessing Satta Fix jodi Kalyan Fin...❼❷⓿❺❻❷❽❷❼❽ Dpboss Matka Result Satta Matka Guessing Satta Fix jodi Kalyan Fin...
❼❷⓿❺❻❷❽❷❼❽ Dpboss Matka Result Satta Matka Guessing Satta Fix jodi Kalyan Fin...
 
Satta Matka Dpboss Kalyan Matka Results Kalyan Chart
Satta Matka Dpboss Kalyan Matka Results Kalyan ChartSatta Matka Dpboss Kalyan Matka Results Kalyan Chart
Satta Matka Dpboss Kalyan Matka Results Kalyan Chart
 
Satta Matka Dpboss Kalyan Matka Results Kalyan Chart
Satta Matka Dpboss Kalyan Matka Results Kalyan ChartSatta Matka Dpboss Kalyan Matka Results Kalyan Chart
Satta Matka Dpboss Kalyan Matka Results Kalyan Chart
 
Call8328958814 satta matka Kalyan result satta guessing
Call8328958814 satta matka Kalyan result satta guessingCall8328958814 satta matka Kalyan result satta guessing
Call8328958814 satta matka Kalyan result satta guessing
 
欧洲杯投注-欧洲杯投注外围盘口-欧洲杯投注盘口app|【​网址​🎉ac22.net🎉​】
欧洲杯投注-欧洲杯投注外围盘口-欧洲杯投注盘口app|【​网址​🎉ac22.net🎉​】欧洲杯投注-欧洲杯投注外围盘口-欧洲杯投注盘口app|【​网址​🎉ac22.net🎉​】
欧洲杯投注-欧洲杯投注外围盘口-欧洲杯投注盘口app|【​网址​🎉ac22.net🎉​】
 
Satta Matka Dpboss Kalyan Matka Results Kalyan Chart
Satta Matka Dpboss Kalyan Matka Results Kalyan ChartSatta Matka Dpboss Kalyan Matka Results Kalyan Chart
Satta Matka Dpboss Kalyan Matka Results Kalyan Chart
 
Profiles of Iconic Fashion Personalities.pdf
Profiles of Iconic Fashion Personalities.pdfProfiles of Iconic Fashion Personalities.pdf
Profiles of Iconic Fashion Personalities.pdf
 
Kirill Klip GEM Royalty TNR Gold Copper Presentation
Kirill Klip GEM Royalty TNR Gold Copper PresentationKirill Klip GEM Royalty TNR Gold Copper Presentation
Kirill Klip GEM Royalty TNR Gold Copper Presentation
 
1Q24_HYUNDAI CAPITAL SERVICES INC. AND SUBSIDIARIES
1Q24_HYUNDAI CAPITAL SERVICES INC. AND SUBSIDIARIES1Q24_HYUNDAI CAPITAL SERVICES INC. AND SUBSIDIARIES
1Q24_HYUNDAI CAPITAL SERVICES INC. AND SUBSIDIARIES
 
State of D2C in India: A Logistics Update
State of D2C in India: A Logistics UpdateState of D2C in India: A Logistics Update
State of D2C in India: A Logistics Update
 
8328958814KALYAN MATKA | MATKA RESULT | KALYAN
8328958814KALYAN MATKA | MATKA RESULT | KALYAN8328958814KALYAN MATKA | MATKA RESULT | KALYAN
8328958814KALYAN MATKA | MATKA RESULT | KALYAN
 
Digital Transformation Frameworks: Driving Digital Excellence
Digital Transformation Frameworks: Driving Digital ExcellenceDigital Transformation Frameworks: Driving Digital Excellence
Digital Transformation Frameworks: Driving Digital Excellence
 
Pro Tips for Effortless Contract Management
Pro Tips for Effortless Contract ManagementPro Tips for Effortless Contract Management
Pro Tips for Effortless Contract Management
 
PM Surya Ghar Muft Bijli Yojana: Online Application, Eligibility, Subsidies &...
PM Surya Ghar Muft Bijli Yojana: Online Application, Eligibility, Subsidies &...PM Surya Ghar Muft Bijli Yojana: Online Application, Eligibility, Subsidies &...
PM Surya Ghar Muft Bijli Yojana: Online Application, Eligibility, Subsidies &...
 

An alignment-based approach to semi-supervised relation extraction including multiple arguments

  • 1. An alignment-based Approach to Semi-supervised Relation Extraction Including Multiple Arguments Seokhwan Kim, Minwoo Jeong, Gary Geunbae Lee, Kwangil Ko, and Zino Lee {megaup, stardust, gblee}@postech.ac.kr, {kik, zino}@alticast.com Abstract - We present an alignment-based approach to semi-supervised relation extraction task including more than two arguments. We concentrate on improving not only the precision of the extracted result, but also on the coverage of the method. Our relation extraction method is based on an alignment-based pattern matching approach which provides more flexibility of the method. In addition, we extract all relationships including two or more arguments at once in order to obtain the integrated result with high quality. We present experimental results which indicate the effectiveness of our method. Alignment-based Information Extraction v Information Extraction v Sentence Alignment for Information Extraction w Matrix Computation w Extracting the defined number of relevant w Example M i 1, j 1 sim i arguments from natural language documents the character <ROLE> portrayed by <ACTOR> in the television series <PROGRAM> is 1, j 1 M i 1, j gp w Subtasks M i, j max M i , j 1 gp # of arguments subtask 0 1 named-entity recognition character Michael Scofield portrayed by Wentworth Miller in the TV series Prison Break is { 1, if PTNi = RAWj 2 binary relation extraction w Alignment Matrix simi,j = or PTNi = <label> more than 2 relation/event extraction character the character Michael Scofield portrayed by Wentworth Miller in the TV series Prison Break is 0 1 1 1 1 1 1 1 1 1 1 1 1 1 1 0, otherwise <ROLE> 1 1 2 2 2 2 2 2 2 2 2 2 2 2 2 w Approaches portrayed by 1 1 1 1 2 2 2 2 3 3 3 4 3 4 3 4 3 4 3 4 3 4 3 4 3 4 3 4 3 4 w Trace Back <ACTOR> 1 2 2 3 3 4 5 5 5 5 5 5 5 5 5 w Supervised in the 1 1 2 2 2 2 3 3 3 3 4 4 5 5 5 5 6 6 6 7 6 7 6 7 6 7 6 7 6 7 M i,j next position w Un/Semi-Supervised television series 1 1 2 2 2 2 3 3 3 3 4 4 5 5 5 5 6 6 7 7 7 7 7 8 7 8 7 8 7 8 M i,j-1 +gp [i, j-1] M i-1,j-1 + simi,j [i-1, j-1] <PROGRAM> 1 2 3 3 4 4 5 6 6 7 8 8 9 9 9 is 1 2 3 3 4 4 5 6 6 7 8 8 9 9 10 M i-1,j +gp [i-1, j] Semi-supervised Relation Extraction Including Multiple Arguments v Overall Architecture v Context Patterns Extraction v Alignment-based Verification 1) Searching the sentences containing all w Aligning between two candidate arguments arguments of each tuple in source documents Seed Data 2) Segmenting out subpart of the sentence with max{M(A, B)}× 2 n arguments similarity(A,B) = the window size w length(A) + length(B) 3) Replacing the parts of arguments in the sub- Seed Data Seed Data Seed Data Seed Data Seed Data Seed Data Seed Data w Tuple clustering based on 2 arguments k arguments n args sentence with argument labels Extracting Extracting … Extracting … Extracting Extracting … Extracting … Extracting v Relation Extraction based on sim(tuple1, tuple2) = Context Context Context Context Context Context Context Patterns Patterns Patterns Patterns Patterns Patterns Patterns Relation Relation Relation Relation Relation Relation Relation Pairwise Alignment |args| tuple2i) i=1 similarity(tuple1i, Extraction Extraction Extraction Extraction Extraction Extraction Extraction w Alignment score |arguments| Validation & max{M(PTN, RAW)} Integration Results score(PTN, RAW) = w Selecting the most probable tuple for each n arguments length(PTN) cluster Experimental Results v Experimental Setup w 930 Korean news documents (13,175 sents) about TV series w Only a tuple with 4 arguments (CHANNEL, PROGRAM, ACTOR, ROLE) is used as a seed v Comparison on the Coverage for w Each result is collected after the first iteration and evaluated manually Various Threshold Values v Result of the verification v Result of the integration 90 80 before after with only type of with all 70 verification verification type of binary relations intermediates 60 |tuples| P |tuples| P relations relations # of correct results (A,R) 249 36.55 79 73.42 |tuples| P |tuples| P 50 (P,R) 19 52.63 17 58.82 (P,A,R) 9 77.78 9 88.89 40 (P,A) 10 60 10 60 (C,P,R) 11 81.82 16 87.5 30 (C,P) 12 33.33 6 66.67 (C,P,A) 12 58.33 9 77.78 20 (P,A,R) 7 42.86 5 60 (C,P,A,R) 8 87.5 16 87.5 including 2 arguments (C,P,R) 18 55.56 16 81.25 10 including 3 arguments including 4 arguments (C,P,A) 8 62.5 8 75 w th = 0.85 0 1.00 0.95 0.90 0.85 0.80 0.75 0.70 (C,P,A,R) 15 60 14 85.71 w C(Channel), P(Program), A(Actor), R(Role) threshold