This is the paper presentation I gave on HMM-based alignment at IIT Bombay as part of the Topics in NLP course.
The paper treats alignment as an HMM problem, a different approach from the predominantly used IBM models.
3. Roadmap: We Are Here
● Review of Alignment
● HMM-based Alignment
● Results and Examples
4. Review of Alignment
● To translate a French sentence F into an English sentence E, the following expression can be used:
E* = argmax_E P(E|F) = argmax_E P(E) · P(F|E)
● To learn P(F|E), the concept of alignments is used.
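The argmax above can be made concrete with a toy noisy-channel decoder: given a fixed F, score each candidate E by P(E) · P(F|E) and keep the best. All probabilities below are invented purely for illustration.

```python
# Toy noisy-channel decoder: pick the English sentence E maximizing
# P(E) * P(F|E) for one fixed French sentence F.
# The candidate set and all probabilities are invented for illustration.
candidates = {
    # E: (language-model prob P(E), translation prob P(F|E))
    "Peter slept early": (0.020, 0.30),
    "Peter early slept": (0.002, 0.35),
    "early Peter slept": (0.001, 0.25),
}

best = max(candidates, key=lambda e: candidates[e][0] * candidates[e][1])
print(best)  # "Peter slept early": 0.020*0.30 = 0.006 beats the others
```

Note that the fluent word order wins even though its translation probability P(F|E) is not the highest; the language model P(E) tips the balance.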
5. Review of Alignment
● An alignment is a correspondence between E and F indicating which word of F is the translation of which word of E.
● For example: Peter slept early ↔ पीटर जल्दी सोया, with alignment (1, 3, 2): Peter → पीटर (1), slept → सोया (3), early → जल्दी (2).
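The alignment in the example can be stored as a position vector: for each target word at index j we record the source position a_j it aligns to. A minimal sketch:

```python
# The alignment from the slide as a position vector a = (1, 3, 2):
# target word j aligns to source position a_j (1-based, as on the slide).
source = ["पीटर", "जल्दी", "सोया"]      # Hindi, in source order
target = ["Peter", "slept", "early"]     # English
alignment = [1, 3, 2]                    # a_1=1, a_2=3, a_3=2

pairs = [(target[j], source[a - 1]) for j, a in enumerate(alignment)]
print(pairs)  # [('Peter', 'पीटर'), ('slept', 'सोया'), ('early', 'जल्दी')]
```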
6. Alignment Models
Depending on the assumptions made, there are
several possible alignment models:
● IBM Models (1 to 5)
● HMM-based Alignment Models
8. IBM Model 1
● Assumes all alignments are equally likely
● Assumes each source word depends only on
the target word it is aligned to
IBM Model 2
● Assumes alignments are more likely to “lie
along the diagonal”
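IBM Model 2's diagonal preference can be sketched with a distortion term that scores a target position higher the closer it sits to the diagonal of the (source, target) position grid. The exponential form and scale below are illustrative, not the model's exact parameterisation.

```python
import math

# Sketch of an IBM-2-style distortion preference: positions near the
# diagonal of the alignment grid score higher. The exponential form
# and scale are illustrative assumptions, not the paper's exact model.
def distortion(i, j, src_len, tgt_len, scale=1.0):
    """Unnormalised preference for target position j aligning to source position i."""
    return math.exp(-scale * abs(i / src_len - j / tgt_len))

# A point on the diagonal scores higher than one far off it:
on_diag = distortion(2, 2, 4, 4)    # exp(0) = 1.0
off_diag = distortion(1, 4, 4, 4)   # exp(-0.75) ≈ 0.47
print(on_diag > off_diag)  # True
```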
9. Roadmap: We Are Here
● Review of Alignment
● HMM-based Alignment
● Results and Examples
11. HMM-based Alignment
● Assumes the alignment depends only on
○ The previous alignment (not all previous ones)
○ The jump width
● Thus, in this model alignments are relative
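The "relative alignments" idea can be sketched as a transition probability that depends only on the jump width i − i′ between consecutive alignments, not on the absolute positions. The jump counts below are invented for illustration.

```python
from collections import Counter

# Sketch of the HMM alignment assumption: the probability of moving the
# alignment to source position i depends only on the jump width i - i_prev.
# These counts are invented; a real model estimates them from data.
jump_counts = Counter({-1: 5, 0: 20, 1: 60, 2: 10, 3: 5})
total = sum(jump_counts.values())

def p_jump(i, i_prev):
    return jump_counts[i - i_prev] / total

# Same jump width => same probability, regardless of absolute position:
print(p_jump(3, 2) == p_jump(7, 6))  # True (both are jumps of +1)
```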
13. Roadmap: We Are Here
● Review of Alignment
● HMM-based Alignment
● Results and Examples
14. Statistical Results:
Basic Framework
● Models compared:
○ IBM 1
○ IBM 2
○ HMM
● Corpora Used (German to French)
○ Avalanche Bulletins Corpus (News)
○ Verbmobil Corpus (Spoken Dialog)
○ EuTrans Corpus (Travel & Tourism)
15. Statistical Results:
Basic Framework
● Training Process:
○ IBM 1: 10 iterations of EM
○ IBM 2: 5 iterations of Maximum Approximation
○ HMM: 5 iterations of Maximum Approximation
● Metric Used
○ Perplexity (Wikipedia: “a measurement of how well a probability model predicts a sample”)
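Perplexity can be computed as the geometric mean of the inverse probabilities the model assigns to a held-out sample. The sketch below normalises per sentence and uses invented probabilities; the exact normalisation (per word vs. per sentence) varies by setup.

```python
import math

# Perplexity: geometric mean of inverse model probabilities over a sample.
# Normalised per sentence here; probabilities are invented for illustration.
def perplexity(sentence_probs):
    log_sum = sum(math.log(p) for p in sentence_probs)
    return math.exp(-log_sum / len(sentence_probs))

print(round(perplexity([0.1, 0.1, 0.1]), 6))  # 10.0
```

Lower perplexity means the model was less "surprised" by the sample, which is why it serves as the comparison metric across IBM 1, IBM 2, and the HMM.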
20. Intuitive Example:
पीटर घर लौटने पर जल्दी सोया
● IBM 2 stresses diagonal alignments, so it
will find the correct alignment difficult, as
the alignments lie nearly on the inverse
diagonal
● The HMM looks only at previous
alignments and overall jump lengths, and
the correct alignment minimizes the total
jump length
22. Intuitive Example:
पीटर बहुत ही जल्दी सोया
● The HMM model assumes that every source
word has a corresponding target word
● Moreover, empty-word alignments are not
incorporated in the basic HMM model
● To model empty words, an HMM of order 2 is
required
24. Intuitive Example:
पीटर आजकल जल्दी सोता है
● सोता है ↔ sleeps can be handled by the HMM
● आजकल ↔ these days requires multi-word
handling to avoid a translation like “today
tomorrow”
25. References
● HMM-Based Word Alignment in Statistical
Translation (1996) by Stephan Vogel,
Hermann Ney, Christoph Tillmann; COLING
’96, Copenhagen
● The Mathematics of Statistical Machine Translation:
Parameter Estimation (1993) by Peter F. Brown, Stephen A.
Della Pietra, Vincent J. Della Pietra, Robert L. Mercer;
Computational Linguistics