OpineSum Entailment-based self-training for abstractive opinion summarization_변현정.pdf

•

0 likes•43 views

오늘은 의견 요약 분야에서의 흥미로운 발전에 대해 이야기하고자 합니다. 특히, 제품이나 장소에 대한 수백 개의 리뷰를 요약하는 것은 중요하고도 어려운 과제인데요, 최근에 이를 위한 새로운 자가학습 접근법, 'OpineSum'이 소개되었습니다. 1. 의견 요약의 중요성 일반적으로 제품이나 장소에 대한 리뷰는 많은 양으로 존재합니다. 이러한 리뷰들을 요약하는 것은 사용자가 정보를 빠르게 파악하는 데 도움을 주며, 의사결정 과정을 간소화할 수 있습니다. 2. 기존의 접근 방식과 한계 뉴스 분야에서의 추상적 요약은 수백만 개의 뉴스 기사와 함께 제공되는 인간 작성 요약을 통해 훈련된 감독 시스템에 의해 큰 진전을 보였습니다. 하지만 의견 텍스트의 경우, 이러한 대규모 데이터셋이 드물게 존재합니다. 3. OpineSum의 소개 이러한 문제를 해결하기 위해, 'OpineSum'이라는 새로운 자가학습 접근법이 제안되었습니다. 이 방법은 텍스트 함축의 새로운 응용을 사용하여 여러 리뷰에서의 의견 합의를 포착하는 요약을 구축합니다. 4. OpineSum의 작동 방식 OpineSum은 대규모에서 은근한 표준 요약을 얻을 수 있으며, 비지도 및 소수샷 추상적 요약 시스템 훈련에 사용할 수 있습니다. 이 방법은 SOTA 달성했습니다. OpineSum은 의견 요약의 새로운 지평을 열고, 대규모 데이터가 부족한 상황에서도 효과적인 요약을 생성할 수 있는 방법을 제시합니다. 이러한 발전은 의견 요약 기술의 미래에 큰 영향을 미칠 것으로 기대됩니다. 오늘 논문 리뷰를 위해 자연어 처리 변현정님이 자세한 리뷰를 도와주셨습니다 많은 관심 미리 감사드립니다! https://youtu.be/gqJCWyYPtXQ

Data & Analytics

OpineSum: Entailment-based self-training
for abstractive opinion summarization
Annie Louis, Joshua Maynez, Google Research. 2022

Experiments
Conclusions
Introduction
Related Work
01
02
03
04

Introduction
• OpineSum
 Abstractive Opinion summarization : Entailment-base self-training
 Textual entailment and capture the consensus of opinions
 Train both unsupervised and few-shot learning

• Unsupervised neural networks
 Auto-encoder : input data Encoder Focus
 VAEs(Variation Auto Encoders) : Generate Decoder
• Self-training methods
• Unlabeled data features, auto labeling, pretraining and fine-tunning
• Few-shot learning : k-way n-shot , Similarity learning
Related Work

• Defining review consensus
 find out how often the collection supports the proposition,
• Extracting propositions
 split review sentences into propositions and use our key units
• Scoring consensus
 , , proposition
• Silver summaries
 MMR(Maximal Marginal Relevance)
Textual entailment to identify consensus

Consensus opinion
• Consensus or common opinion
 the SPACE corpus ( on TripAdvisor.com)
“location is right across balboa park”
: summary-worthy one
* create our silver-standard data

Consensus opinion
• Consensus or common opinion
 Same claim may be expressed
“This hotel is in the heart of Times Square ” ,
‘Hotel’s location is slap bang in the middle of Times Square.’
“The fishis tasty” and ‘The salmon is delicious’

Consensus opinion
• Extracting propositions
 Long review sentences : contain a bunch of different claims
-> split sentences into propositions : as a ‘single claim or fact’
(* conjunctions, period, and comma subject to a length of four)

Consensus opinion
• Scoring consensus
 the M propositions
• textual entailment rela on as P → H (P is a premise and H is a hypothesis)
BERT-large : [CLS] Token (a linear layer to predict three classes: entailment, contradiction and neutral)
 Apache Beam1 pipeline
review
proposition

Consensus opinion
• Silver summaries
• decreasing order of their scores S(mi)
• take the top n as the silver summary sentences
• edundancy removal technique : MMR (maximal marginal relevance)
Highest scoring proposition

Consensus opinion
• Silver summaries
• P1 was extracted from R2
• P2 from R4

• Dataset
 Unlabelled review corpus
 Evaluation dataset containing with human summaries
 SPACE-unlabeled : 1.1 million reviews for 11,000 hotels
 SPACE-eval : human generated summaries for a smaller set of 50 hotels
• Models
 T5 (Text-to-Text Transfer Transformer)
 Pre-trained encoder-decoder models based on T5’s framework
 LongT5 : polynomial scale relationship to input length
Experiments

• Evaluation
 Silver Data (as SPACE-OpineSum) : the unlabelled review corpus
• SPACE items with a minimum of 50 reviews (4,729 items)
• Computed 1.3B entailment predictions
 Self-training
• Unsupervised and few-shot learning abstractive summarization
• Trained LongT5-Large (770M parameters) and LongT5-(Large, XL)
• Compare these systems with prior unsupervised work in the SPACE-eval dataset
• the ROUGE performance on the validation set
Experiments

• Evaluation
 Few-shot Learning
• SPACE-eval (25 total) : training set (15items) and validation set(10 items)
• Reduced learning rate, 1/5th of the standard 1e−4
Experiments

Results
OPINESUM system use self-training only

Conclusion
 simple self-training approach both unsupervised and few-shot
 using the entailment weights (scores) of each proposition
 self-training models could serve as checkpoints

OpineSum Entailment-based self-training for abstractive opinion summarization_변현정.pdf

More from taeseon ryu

RL_UpsideDowntaeseon ryu

Packed Levitated Marker for Entity and Relation Extractiontaeseon ryu

MOReL: Model-Based Offline Reinforcement Learningtaeseon ryu

Scaling Instruction-Finetuned Language Modelstaeseon ryu

Visual prompt tuningtaeseon ryu

mPLUGtaeseon ryu

variBAD, A Very Good Method for Bayes-Adaptive Deep RL via Meta-Learning.pdftaeseon ryu

Reinforced Genetic Algorithm Learning For Optimizing Computation Graphs.pdftaeseon ryu

The Forward-Forward Algorithmtaeseon ryu

Towards Robust and Reproducible Active Learning using Neural Networkstaeseon ryu

BRIO: Bringing Order to Abstractive Summarizationtaeseon ryu

ProximalPolicyOptimizationtaeseon ryu

Dream2Control paper reviewtaeseon ryu

Online Continual Learning on Class Incremental Blurry Task Configuration with...taeseon ryu

[2023] Cut and Learn for Unsupervised Object Detection and Instance Segmentationtaeseon ryu

Unsupervised Neural Machine Translation for Low-Resource Domainstaeseon ryu

PaLM Scaling Language Modeling with Pathways - 230219 (1).pdftaeseon ryu

Distributional RL via Moment Matchingtaeseon ryu

Deep Reinforcement Learning from Human Preferencestaeseon ryu

InternImage: Exploring Large-Scale Vision Foundation Models with Deformable C...taeseon ryu

More from taeseon ryu (20)

RL_UpsideDown

Packed Levitated Marker for Entity and Relation Extraction

MOReL: Model-Based Offline Reinforcement Learning

Scaling Instruction-Finetuned Language Models

Visual prompt tuning

mPLUG

variBAD, A Very Good Method for Bayes-Adaptive Deep RL via Meta-Learning.pdf

Reinforced Genetic Algorithm Learning For Optimizing Computation Graphs.pdf

The Forward-Forward Algorithm

Towards Robust and Reproducible Active Learning using Neural Networks

BRIO: Bringing Order to Abstractive Summarization

ProximalPolicyOptimization

Dream2Control paper review

Online Continual Learning on Class Incremental Blurry Task Configuration with...

[2023] Cut and Learn for Unsupervised Object Detection and Instance Segmentation

Unsupervised Neural Machine Translation for Low-Resource Domains

PaLM Scaling Language Modeling with Pathways - 230219 (1).pdf

Distributional RL via Moment Matching

Deep Reinforcement Learning from Human Preferences

InternImage: Exploring Large-Scale Vision Foundation Models with Deformable C...

OpineSum Entailment-based self-training for abstractive opinion summarization_변현정.pdf

1. OpineSum: Entailment-based self-training for abstractive opinion summarization Annie Louis, Joshua Maynez, Google Research. 2022

2. Experiments Conclusions Introduction Related Work 01 02 03 04

3. Introduction • OpineSum  Abstractive Opinion summarization : Entailment-base self-training  Textual entailment and capture the consensus of opinions  Train both unsupervised and few-shot learning

4. • Unsupervised neural networks  Auto-encoder : input data Encoder Focus  VAEs(Variation Auto Encoders) : Generate Decoder • Self-training methods • Unlabeled data features, auto labeling, pretraining and fine-tunning • Few-shot learning : k-way n-shot , Similarity learning Related Work

5. • Defining review consensus  find out how often the collection supports the proposition, • Extracting propositions  split review sentences into propositions and use our key units • Scoring consensus  , , proposition • Silver summaries  MMR(Maximal Marginal Relevance) Textual entailment to identify consensus

6. Consensus opinion • Consensus or common opinion  the SPACE corpus ( on TripAdvisor.com) “location is right across balboa park” : summary-worthy one * create our silver-standard data

7. Consensus opinion • Consensus or common opinion  Same claim may be expressed “This hotel is in the heart of Times Square ” , ‘Hotel’s location is slap bang in the middle of Times Square.’ “The fishis tasty” and ‘The salmon is delicious’

8. Consensus opinion • Extracting propositions  Long review sentences : contain a bunch of different claims -> split sentences into propositions : as a ‘single claim or fact’ (* conjunctions, period, and comma subject to a length of four)

9. Consensus opinion • Scoring consensus  the M propositions • textual entailment rela on as P → H (P is a premise and H is a hypothesis) BERT-large : [CLS] Token (a linear layer to predict three classes: entailment, contradiction and neutral)  Apache Beam1 pipeline review proposition

10. Consensus opinion • Silver summaries • decreasing order of their scores S(mi) • take the top n as the silver summary sentences • edundancy removal technique : MMR (maximal marginal relevance) Highest scoring proposition

11. Consensus opinion • Silver summaries • P1 was extracted from R2 • P2 from R4

12.

13. • Dataset  Unlabelled review corpus  Evaluation dataset containing with human summaries  SPACE-unlabeled : 1.1 million reviews for 11,000 hotels  SPACE-eval : human generated summaries for a smaller set of 50 hotels • Models  T5 (Text-to-Text Transfer Transformer)  Pre-trained encoder-decoder models based on T5’s framework  LongT5 : polynomial scale relationship to input length Experiments

14. • Evaluation  Silver Data (as SPACE-OpineSum) : the unlabelled review corpus • SPACE items with a minimum of 50 reviews (4,729 items) • Computed 1.3B entailment predictions  Self-training • Unsupervised and few-shot learning abstractive summarization • Trained LongT5-Large (770M parameters) and LongT5-(Large, XL) • Compare these systems with prior unsupervised work in the SPACE-eval dataset • the ROUGE performance on the validation set Experiments

15. • Evaluation  Few-shot Learning • SPACE-eval (25 total) : training set (15items) and validation set(10 items) • Reduced learning rate, 1/5th of the standard 1e−4 Experiments

16. Results OPINESUM system use self-training only

17. Results

18. Conclusion  simple self-training approach both unsupervised and few-shot  using the entailment weights (scores) of each proposition  self-training models could serve as checkpoints

OpineSum Entailment-based self-training for abstractive opinion summarization_변현정.pdf

Recommended

Recommended

More Related Content

More from taeseon ryu

More from taeseon ryu (20)

OpineSum Entailment-based self-training for abstractive opinion summarization_변현정.pdf