2. BLEU
The central idea: the closer a machine translation is to a professional human
translation, the better it is.
brevity penalty: 1 if c > r, otherwise e^(1-r/c), where c is the candidate (system output) length and r is the effective reference length. It penalizes candidates shorter than the reference, since precision alone would reward very short outputs.
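The piecewise definition above can be sketched directly (the function name and argument order are my own; c is the candidate length, r the effective reference length):

```python
import math

def brevity_penalty(c: int, r: int) -> float:
    """BLEU brevity penalty: 1 when the candidate is at least as long
    as the reference (c > r leaves no penalty), else exp(1 - r/c)."""
    if c > r:
        return 1.0
    return math.exp(1 - r / c)

# A candidate half as long as the reference is penalized:
# brevity_penalty(5, 10) -> exp(-1), roughly 0.368
```

Note the asymmetry: overly long candidates are not penalized here; they are already punished by the modified n-gram precision.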
3. ROUGE
ROUGE-N: N-gram based co-occurrence statistics.
ROUGE-L: Longest Common Subsequence (LCS) based statistics. The longest
common subsequence naturally takes sentence-level structure similarity into
account and automatically identifies the longest co-occurring in-sequence
n-grams.
ROUGE-W: Weighted LCS-based statistics that favor consecutive matches.
ROUGE-S: Skip-bigram based co-occurrence statistics. A skip-bigram is any pair
of words in their sentence order, allowing for arbitrary gaps between them.
ROUGE-SU: Skip-bigram plus unigram-based co-occurrence statistics.
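The ROUGE-L variant above can be sketched with the standard dynamic-programming LCS; the function names and the F-measure form (precision/recall over LCS length, as in the ROUGE paper) are my own framing:

```python
def lcs_length(a, b):
    """Length of the longest common subsequence of sequences a and b,
    via the classic O(len(a) * len(b)) dynamic program."""
    m, n = len(a), len(b)
    dp = [[0] * (n + 1) for _ in range(m + 1)]
    for i in range(m):
        for j in range(n):
            if a[i] == b[j]:
                dp[i + 1][j + 1] = dp[i][j] + 1
            else:
                dp[i + 1][j + 1] = max(dp[i][j + 1], dp[i + 1][j])
    return dp[m][n]

def rouge_l(candidate: str, reference: str, beta: float = 1.0) -> float:
    """ROUGE-L F-measure: combine LCS-based precision and recall."""
    cand, ref = candidate.split(), reference.split()
    lcs = lcs_length(cand, ref)
    if lcs == 0:
        return 0.0
    p = lcs / len(cand)   # fraction of candidate covered by the LCS
    r = lcs / len(ref)    # fraction of reference covered by the LCS
    return (1 + beta**2) * p * r / (r + beta**2 * p)

# "the cat on the mat" (5 tokens) is the LCS of these two sentences,
# so precision = recall = 5/6 here.
score = rouge_l("the cat sat on the mat", "the cat is on the mat")
```

Because the LCS only requires in-sequence (not contiguous) matches, word order is rewarded without demanding exact n-gram adjacency.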
4. BLEU measures precision: how many of the words (and/or n-grams) in the
machine-generated summaries appear in the human reference summaries.
ROUGE measures recall: how many of the words (and/or n-grams) in the
human reference summaries appear in the machine-generated summaries.
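The two directions of counting can be shown in one small sketch over clipped unigram overlap (the function name is my own; real BLEU and ROUGE add n-grams, multiple references, and other refinements):

```python
from collections import Counter

def unigram_overlap(candidate: str, reference: str):
    """Clipped unigram overlap, divided both ways:
    by candidate length (BLEU-style precision) and
    by reference length (ROUGE-style recall)."""
    cand = Counter(candidate.split())
    ref = Counter(reference.split())
    overlap = sum(min(n, ref[w]) for w, n in cand.items())
    precision = overlap / sum(cand.values())
    recall = overlap / sum(ref.values())
    return precision, recall

# A short candidate copied from the reference scores perfect precision
# but poor recall: p = 2/2 = 1.0, r = 2/6.
p, r = unigram_overlap("the cat", "the cat sat on the mat")
```

The same overlap count appears in both numerators; only the denominator (whose words we ask about) changes, which is exactly the precision/recall distinction above.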
6. These measures are complementary, as is often the case with precision vs. recall:
if many words from the system results appear in the human references, you will
have high BLEU;
if many words from the human references appear in the system results, you will
have high ROUGE.