SlideShare a Scribd company logo
1 of 42
Download to read offline
Changes in Moses 
Hieu Hoang 
TAUS 
October 2014
MosesCore 
Year 1 (2012) 
• Easier installation 
– Binary releases 
– Pre-built models 
• Testing and Releases 
– Linux, Mac OSX, Windows 
– 32 and 64-bit 
• Faster training 
– Parallelism at all stages
MosesCore 
Year 2 (2013) 
• Even Easier installation 
– Binary releases 
– Pre-built models 
– Virtual Machines 
– Amazon EC2 
• Refactored Decoder
MosesCore 
Year 2 (2013) 
• Even Easier installation 
– Binary releases 
– Pre-built models 
– Virtual Machines 
– Amazon EC2 
• Refactored Decoder
Why did you Refactor? 
• Feature Function Framework 
– easier to implement new features 
– use sparse features 
• Simplify class structure 
– easier to develop with Moses 
• Delete functionality 
– easier to refactor code 
– very little deletion
Why did you Refactor? 
• Feature Function Framework 
– easier to implement new features 
– use sparse features 
• Simplify class structure 
– easier to develop with Moses 
• Delete functionality 
– easier to refactor code 
– very little deletion
Why did you Refactor? 
• Feature Function Framework 
– easier to implement new features 
– use sparse features 
• Simplify class structure 
– easier to develop with Moses 
• Delete functionality 
– easier to refactor code 
– very little deletion
Specify a Feature Function 
Then…. 
[lmodel-file] 
8 0 3 europarl.en.srilm.gz 
[weight-l] 
0.142 
ini file: 
• New Feature Function 
– New sections 
● [feature-function-file] 
● [weight-?] 
• Custom code 
– Parse ini file 
– Initialize feature function
Adding new Feature Function 
Now…. 
[feature] 
KENLM file=path order=0 
[weight] 
KENLM0= 0.142 
ini file: 
• New Feature Function 
– No new section 
● Line in [feature] section 
● Line in [weight] section 
– Framework 
● parse ini file 
● initialize feature
MosesCore 
Year 3 (2014) 
• Exploit new framework 
– Updatable phrase-table 
– Neural network language model 
– Bilingual language models 
– Transliteration 
• Translation rule properties 
• Syntax decoding
MosesCore 
Year 3 (2014) 
• Exploit new framework 
– Updatable phrase-table 
● Dynamic suffix array 
● Stores training data 
– Extract translation rule on-the-fly 
– Neural network language model 
– Bilingual language models 
– Transliteration 
• Translation rule properties 
• Syntax decoding
MosesCore 
Year 3 (2014) 
• Exploit new framework 
– Updatable phrase-table 
– Neural network language model 
● Continuous space LM 
– Bilingual language models 
– Transliteration 
• Translation rule properties 
• Syntax decoding
MosesCore 
Year 3 (2014) 
• Exploit new framework 
– Updatable phrase-table 
– Neural network language model 
– Bilingual language models 
● Replicate Devlin et al, 2014 
● Large quality gains 
– Transliteration 
• Translation rule properties 
• Syntax decoding
MosesCore 
Year 3 (2014) 
• Exploit new framework 
– Updatable phrase-table 
– Neural network language model 
– Bilingual language models 
– Transliteration 
● Character level translation 
● Learns from parallel data 
● Integrate into decoder 
• Translation rule properties 
• Syntax decoding
MosesCore 
Year 3 (2014) 
• Exploit new framework 
– Updatable phrase-table 
– Neural network language model 
– Bilingual language models 
– Transliteration 
• Translation rule properties 
– Extra information for each rule 
● Context, syntax, domain etc 
• Syntax decoding
MosesCore 
Year 3 (2014) 
• Exploit new framework 
– Updatable phrase-table 
– Neural network language model 
– Bilingual language models 
– Transliteration 
• Translation rule properties 
• Syntax decoding 
– Faster, memory efficient decoding 
– More syntactic models
Technical Breakout 
• Organization and Releases 
– Academic and commercial needs 
– Prevent forks 
– Development/Stable versions 
– Forwards/Backward compatibility 
– Upgradability 
• Features 
• Deployment 
• Future development
Technical Breakout 
• Organization and Releases 
• Features 
• Deployment 
• Future development
Technical Breakout 
• Organization and Releases 
• Features 
• Deployment 
– Platform/Clouds 
– Docker containers 
– Priorities 
– Interaction and data formats 
• Future development
Technical Breakout 
• Organization and Releases 
• Features 
• Deployment 
• Future development 
– User-friendliness 
– End-to-end solution 
– Users
Changes in Moses 
Hieu Hoang 
TAUS 
October 2014 
Thanks for inviting me to come 
Here to tell you a little about the things I’ve 
been doing to Moses 
- over the past 2 years 
- mainly concentrate of the past year 
- but will quickly tell you about things I did 
prior to that 
1
MosesCore 
Year 1 (2012) 
• Easier installation 
– Binary releases 
– Pre-built models 
• Testing and Releases 
– Linux, Mac OSX, Windows 
– 32 and 64-bit 
• Faster training 
– Parallelism at all stages 
In the 1st year 
- picked off the low hanging fruit 
- fixed many of the easy issues that required 
- time & effort 
Made installation easier 
Run a lot of experiments anyway 
- gave some of them away 
- with all the scripts + configuration 
- used to run them 
- students can see how to replicate our 
results 
Lots of testing 
- all major platforms 
Made obvious speed improvements 
2
MosesCore 
Year 2 (2013) 
• Even Easier installation 
– Binary releases 
– Pre-built models 
– Virtual Machines 
– Amazon EC2 
• Refactored Decoder 
In year 2 
- made it even easier to install 
- if you can’t be bother to compile or even 
download the binaries 
- download a virtual machine with moses + 
friends installed 
OR 
rent an amazon server with moses + friends 
installed 
3
MosesCore 
Year 2 (2013) 
• Even Easier installation 
– Binary releases 
– Pre-built models 
– Virtual Machines 
– Amazon EC2 
• Refactored Decoder 
However, the main reason I came here today 
- talk about the major changes I made 
- in decoder 
- and else where 
Makes is easier for us coders 
- add and change things in Moses 
4
Why did you Refactor? 
• Feature Function Framework 
– easier to implement new features 
– use sparse features 
• Simplify class structure 
– easier to develop with Moses 
• Delete functionality 
– easier to refactor code 
– very little deletion 
What is a feature function? 
- something that gives a translation a score 
over the last few years 
- gotten bored with existing features like 
language models and reordering models 
the trend in MT 
- create novel features which give a score to 
a translation 
- like any feature, tries to give bigger scores 
to better models 
New feature function framework 
- designed to make it easy to add new 
features 
5
Why did you Refactor? 
• Feature Function Framework 
– easier to implement new features 
– use sparse features 
• Simplify class structure 
– easier to develop with Moses 
• Delete functionality 
– easier to refactor code 
– very little deletion 
Simplify class structure 
- to make it easier for us to develop with 
Moses 
- Moses has been around for 8 years now 
- everyone has the freedom to add what 
they want 
- no-one is in overall control 
- this way of organising an open-source 
project is great 
- gotten lots of contribution, lots of 
features 
- downside 
- grown organically 
- things are not as well structured as 
they can be 
6
Why did you Refactor? 
• Feature Function Framework 
– easier to implement new features 
– use sparse features 
• Simplify class structure 
– easier to develop with Moses 
• Delete functionality 
– easier to refactor code 
– very little deletion 
Why did I delete things 
- delete very little 
- I’m not the gatekeeper of moses, I don’t 
control it 
- if a functionality was deleted, it’s not a 
comment on usefulness of it 
- purely ‘cos it got in the way of the 
refactoring 
Quickly go thru the last 2 
- before telling you about feature functions 
7
Specify a Feature Function 
Then…. 
[lmodel-file] 
8 0 3 europarl.en.srilm.gz 
[weight-l] 
0.142 
ini file: 
• New Feature Function 
– New sections 
● [feature-function-file] 
● [weight-?] 
• Custom code 
– Parse ini file 
– Initialize feature function 
completely bestoked 
- no framework to help you 
- if you don’t do it right, wont’ work 
8
Adding new Feature Function 
Now…. 
[feature] 
KENLM file=path order=0 
[weight] 
KENLM0= 0.142 
ini file: 
• New Feature Function 
– No new section 
● Line in [feature] section 
● Line in [weight] section 
– Framework 
● parse ini file 
● initialize feature 
Write a class that implements the feature 
function 
The framework does the rest 
- no need to create a custom section in the ini 
file 
or 
- change StaticData class 
or 
- change Paramater class 
9
MosesCore 
Year 3 (2014) 
• Exploit new framework 
– Updatable phrase-table 
– Neural network language model 
– Bilingual language models 
– Transliteration 
• Translation rule properties 
• Syntax decoding
MosesCore 
Year 3 (2014) 
• Exploit new framework 
– Updatable phrase-table 
● Dynamic suffix array 
● Stores training data 
– Extract translation rule on-the-fly 
– Neural network language model 
– Bilingual language models 
– Transliteration 
• Translation rule properties 
• Syntax decoding
MosesCore 
Year 3 (2014) 
• Exploit new framework 
– Updatable phrase-table 
– Neural network language model 
● Continuous space LM 
– Bilingual language models 
– Transliteration 
• Translation rule properties 
• Syntax decoding
MosesCore 
Year 3 (2014) 
• Exploit new framework 
– Updatable phrase-table 
– Neural network language model 
– Bilingual language models 
● Replicate Devlin et al, 2014 
● Large quality gains 
– Transliteration 
• Translation rule properties 
• Syntax decoding
MosesCore 
Year 3 (2014) 
• Exploit new framework 
– Updatable phrase-table 
– Neural network language model 
– Bilingual language models 
– Transliteration 
● Character level translation 
● Learns from parallel data 
● Integrate into decoder 
• Translation rule properties 
• Syntax decoding
MosesCore 
Year 3 (2014) 
• Exploit new framework 
– Updatable phrase-table 
– Neural network language model 
– Bilingual language models 
– Transliteration 
• Translation rule properties 
– Extra information for each rule 
● Context, syntax, domain etc 
• Syntax decoding
MosesCore 
Year 3 (2014) 
• Exploit new framework 
– Updatable phrase-table 
– Neural network language model 
– Bilingual language models 
– Transliteration 
• Translation rule properties 
• Syntax decoding 
– Faster, memory efficient decoding 
– More syntactic models
Technical Breakout 
• Organization and Releases 
– Academic and commercial needs 
– Prevent forks 
– Development/Stable versions 
– Forwards/Backward compatibility 
– Upgradability 
• Features 
• Deployment 
• Future development
Technical Breakout 
• Organization and Releases 
• Features 
• Deployment 
• Future development
Technical Breakout 
• Organization and Releases 
• Features 
• Deployment 
– Platform/Clouds 
– Docker containers 
– Priorities 
– Interaction and data formats 
• Future development
Technical Breakout 
• Organization and Releases 
• Features 
• Deployment 
• Future development 
– User-friendliness 
– End-to-end solution 
– Users
TAUS Moses Industry Roundtable 2014, Changes in Moses, Hieu Hoang, University of Edinburgh

More Related Content

What's hot

OSGi Community Event 2010 - OSGi Technical Update
OSGi Community Event 2010 - OSGi Technical UpdateOSGi Community Event 2010 - OSGi Technical Update
OSGi Community Event 2010 - OSGi Technical Updatemfrancis
 
TokuDB - What You Need to Know
TokuDB - What You Need to KnowTokuDB - What You Need to Know
TokuDB - What You Need to KnowJervin Real
 
WEPA - Webdriver Enhanced Platform for Automation - WEPATest
WEPA - Webdriver Enhanced Platform for Automation - WEPATestWEPA - Webdriver Enhanced Platform for Automation - WEPATest
WEPA - Webdriver Enhanced Platform for Automation - WEPATestFreddy Vega
 
My S Q L Introduction for 1 day training
My S Q L  Introduction for 1 day trainingMy S Q L  Introduction for 1 day training
My S Q L Introduction for 1 day trainingIvan Tu
 
Releases - CFEngine presentation - Configuration Management Camp 2015
Releases - CFEngine presentation - Configuration Management Camp 2015Releases - CFEngine presentation - Configuration Management Camp 2015
Releases - CFEngine presentation - Configuration Management Camp 2015kacfengine
 
A Lap Around PowerShell 3.0
A Lap Around PowerShell 3.0A Lap Around PowerShell 3.0
A Lap Around PowerShell 3.0Sarah Dutkiewicz
 
Version control with subversion
Version control with subversionVersion control with subversion
Version control with subversionxprayc
 
(ATS4-PLAT01) Core Architecture Changes in AEP 9.0 and their Impact on Admini...
(ATS4-PLAT01) Core Architecture Changes in AEP 9.0 and their Impact on Admini...(ATS4-PLAT01) Core Architecture Changes in AEP 9.0 and their Impact on Admini...
(ATS4-PLAT01) Core Architecture Changes in AEP 9.0 and their Impact on Admini...BIOVIA
 
Unleashing the Power: A Lap Around PowerShell 3.0
Unleashing the Power: A Lap Around PowerShell 3.0Unleashing the Power: A Lap Around PowerShell 3.0
Unleashing the Power: A Lap Around PowerShell 3.0Sarah Dutkiewicz
 
Making MySQL Administration a Breeze - A look into a MySQL DBA's toolchest
Making MySQL Administration a Breeze - A look into a MySQL DBA's toolchest Making MySQL Administration a Breeze - A look into a MySQL DBA's toolchest
Making MySQL Administration a Breeze - A look into a MySQL DBA's toolchest Lenz Grimmer
 
Ups and downs of enterprise Java app in a research setting
Ups and downs of enterprise Java app in a research settingUps and downs of enterprise Java app in a research setting
Ups and downs of enterprise Java app in a research settingCsaba Toth
 
Scaling with Symfony - PHP UK
Scaling with Symfony - PHP UKScaling with Symfony - PHP UK
Scaling with Symfony - PHP UKRicard Clau
 
Embedded Systems: Lecture 1: Course Overview
Embedded Systems: Lecture 1: Course OverviewEmbedded Systems: Lecture 1: Course Overview
Embedded Systems: Lecture 1: Course OverviewAhmed El-Arabawy
 
High Concurrency Architecture and Laravel Performance Tuning
High Concurrency Architecture and Laravel Performance TuningHigh Concurrency Architecture and Laravel Performance Tuning
High Concurrency Architecture and Laravel Performance TuningAlbert Chen
 

What's hot (18)

OSGi Community Event 2010 - OSGi Technical Update
OSGi Community Event 2010 - OSGi Technical UpdateOSGi Community Event 2010 - OSGi Technical Update
OSGi Community Event 2010 - OSGi Technical Update
 
TokuDB - What You Need to Know
TokuDB - What You Need to KnowTokuDB - What You Need to Know
TokuDB - What You Need to Know
 
WEPA - Webdriver Enhanced Platform for Automation - WEPATest
WEPA - Webdriver Enhanced Platform for Automation - WEPATestWEPA - Webdriver Enhanced Platform for Automation - WEPATest
WEPA - Webdriver Enhanced Platform for Automation - WEPATest
 
My S Q L Introduction for 1 day training
My S Q L  Introduction for 1 day trainingMy S Q L  Introduction for 1 day training
My S Q L Introduction for 1 day training
 
Releases - CFEngine presentation - Configuration Management Camp 2015
Releases - CFEngine presentation - Configuration Management Camp 2015Releases - CFEngine presentation - Configuration Management Camp 2015
Releases - CFEngine presentation - Configuration Management Camp 2015
 
A Lap Around PowerShell 3.0
A Lap Around PowerShell 3.0A Lap Around PowerShell 3.0
A Lap Around PowerShell 3.0
 
Version control with subversion
Version control with subversionVersion control with subversion
Version control with subversion
 
(ATS4-PLAT01) Core Architecture Changes in AEP 9.0 and their Impact on Admini...
(ATS4-PLAT01) Core Architecture Changes in AEP 9.0 and their Impact on Admini...(ATS4-PLAT01) Core Architecture Changes in AEP 9.0 and their Impact on Admini...
(ATS4-PLAT01) Core Architecture Changes in AEP 9.0 and their Impact on Admini...
 
Unleashing the Power: A Lap Around PowerShell 3.0
Unleashing the Power: A Lap Around PowerShell 3.0Unleashing the Power: A Lap Around PowerShell 3.0
Unleashing the Power: A Lap Around PowerShell 3.0
 
69-kauri
69-kauri69-kauri
69-kauri
 
69-kauri
69-kauri69-kauri
69-kauri
 
69-kauri
69-kauri69-kauri
69-kauri
 
69-kauri
69-kauri69-kauri
69-kauri
 
Making MySQL Administration a Breeze - A look into a MySQL DBA's toolchest
Making MySQL Administration a Breeze - A look into a MySQL DBA's toolchest Making MySQL Administration a Breeze - A look into a MySQL DBA's toolchest
Making MySQL Administration a Breeze - A look into a MySQL DBA's toolchest
 
Ups and downs of enterprise Java app in a research setting
Ups and downs of enterprise Java app in a research settingUps and downs of enterprise Java app in a research setting
Ups and downs of enterprise Java app in a research setting
 
Scaling with Symfony - PHP UK
Scaling with Symfony - PHP UKScaling with Symfony - PHP UK
Scaling with Symfony - PHP UK
 
Embedded Systems: Lecture 1: Course Overview
Embedded Systems: Lecture 1: Course OverviewEmbedded Systems: Lecture 1: Course Overview
Embedded Systems: Lecture 1: Course Overview
 
High Concurrency Architecture and Laravel Performance Tuning
High Concurrency Architecture and Laravel Performance TuningHigh Concurrency Architecture and Laravel Performance Tuning
High Concurrency Architecture and Laravel Performance Tuning
 

Similar to TAUS Moses Industry Roundtable 2014, Changes in Moses, Hieu Hoang, University of Edinburgh

Java for XPages Development
Java for XPages DevelopmentJava for XPages Development
Java for XPages DevelopmentTeamstudio
 
Automated product categorization
Automated product categorizationAutomated product categorization
Automated product categorizationAndreas Loupasakis
 
Automated product categorization
Automated product categorization   Automated product categorization
Automated product categorization Warply
 
Finite State Queries In Lucene
Finite State Queries In LuceneFinite State Queries In Lucene
Finite State Queries In Luceneotisg
 
Infrastructure as Data with Ansible for easier Continuous Delivery
Infrastructure as Data with Ansible for easier Continuous DeliveryInfrastructure as Data with Ansible for easier Continuous Delivery
Infrastructure as Data with Ansible for easier Continuous DeliveryCarlo Bonamico
 
Reading Notes : the practice of programming
Reading Notes : the practice of programmingReading Notes : the practice of programming
Reading Notes : the practice of programmingJuggernaut Liu
 
Hia 1693-effective application-development_in_iib
Hia 1693-effective application-development_in_iibHia 1693-effective application-development_in_iib
Hia 1693-effective application-development_in_iibAndrew Coleman
 
Keynote Yonik Seeley & Steve Rowe lucene solr roadmap
Keynote   Yonik Seeley & Steve Rowe lucene solr roadmapKeynote   Yonik Seeley & Steve Rowe lucene solr roadmap
Keynote Yonik Seeley & Steve Rowe lucene solr roadmaplucenerevolution
 
KEYNOTE: Lucene / Solr road map
KEYNOTE: Lucene / Solr road mapKEYNOTE: Lucene / Solr road map
KEYNOTE: Lucene / Solr road maplucenerevolution
 
Connect 2014 SHOW102: XPages Still No Experience Necessary
Connect 2014 SHOW102: XPages Still No Experience NecessaryConnect 2014 SHOW102: XPages Still No Experience Necessary
Connect 2014 SHOW102: XPages Still No Experience Necessarypanagenda
 
Impact of cloud services on the work of oracle technology experts
Impact of cloud services on the work of oracle technology expertsImpact of cloud services on the work of oracle technology experts
Impact of cloud services on the work of oracle technology expertsAndreas Chatziantoniou
 
Impact of cloud services on the work of oracle technology experts
Impact of cloud services on the work of oracle technology expertsImpact of cloud services on the work of oracle technology experts
Impact of cloud services on the work of oracle technology expertsAndreas Chatziantoniou
 
Mixing d ps building architecture on the cross cutting example
Mixing d ps building architecture on the cross cutting exampleMixing d ps building architecture on the cross cutting example
Mixing d ps building architecture on the cross cutting examplecorehard_by
 
A Day in the Life: Developer Enhancements with Visual Studio 2012
A Day in the Life: Developer Enhancements with Visual Studio 2012A Day in the Life: Developer Enhancements with Visual Studio 2012
A Day in the Life: Developer Enhancements with Visual Studio 2012Imaginet
 
Search Architecture at Evernote: Presented by Christian Kohlschütter, Evernote
Search Architecture at Evernote: Presented by Christian Kohlschütter, EvernoteSearch Architecture at Evernote: Presented by Christian Kohlschütter, Evernote
Search Architecture at Evernote: Presented by Christian Kohlschütter, EvernoteLucidworks
 
An Introduction to MongoDB Ops Manager
An Introduction to MongoDB Ops ManagerAn Introduction to MongoDB Ops Manager
An Introduction to MongoDB Ops ManagerMongoDB
 
CFWheels - Pragmatic, Beautiful Code
CFWheels - Pragmatic, Beautiful CodeCFWheels - Pragmatic, Beautiful Code
CFWheels - Pragmatic, Beautiful Codeindiver
 
Big datatraining.in devops-part1
Big datatraining.in devops-part1Big datatraining.in devops-part1
Big datatraining.in devops-part1MvkZ
 

Similar to TAUS Moses Industry Roundtable 2014, Changes in Moses, Hieu Hoang, University of Edinburgh (20)

Java for XPages Development
Java for XPages DevelopmentJava for XPages Development
Java for XPages Development
 
Automated product categorization
Automated product categorizationAutomated product categorization
Automated product categorization
 
Automated product categorization
Automated product categorization   Automated product categorization
Automated product categorization
 
Finite State Queries In Lucene
Finite State Queries In LuceneFinite State Queries In Lucene
Finite State Queries In Lucene
 
Infrastructure as Data with Ansible for easier Continuous Delivery
Infrastructure as Data with Ansible for easier Continuous DeliveryInfrastructure as Data with Ansible for easier Continuous Delivery
Infrastructure as Data with Ansible for easier Continuous Delivery
 
Reading Notes : the practice of programming
Reading Notes : the practice of programmingReading Notes : the practice of programming
Reading Notes : the practice of programming
 
Hia 1693-effective application-development_in_iib
Hia 1693-effective application-development_in_iibHia 1693-effective application-development_in_iib
Hia 1693-effective application-development_in_iib
 
Keynote Yonik Seeley & Steve Rowe lucene solr roadmap
Keynote   Yonik Seeley & Steve Rowe lucene solr roadmapKeynote   Yonik Seeley & Steve Rowe lucene solr roadmap
Keynote Yonik Seeley & Steve Rowe lucene solr roadmap
 
KEYNOTE: Lucene / Solr road map
KEYNOTE: Lucene / Solr road mapKEYNOTE: Lucene / Solr road map
KEYNOTE: Lucene / Solr road map
 
Connect 2014 SHOW102: XPages Still No Experience Necessary
Connect 2014 SHOW102: XPages Still No Experience NecessaryConnect 2014 SHOW102: XPages Still No Experience Necessary
Connect 2014 SHOW102: XPages Still No Experience Necessary
 
Impact of cloud services on the work of oracle technology experts
Impact of cloud services on the work of oracle technology expertsImpact of cloud services on the work of oracle technology experts
Impact of cloud services on the work of oracle technology experts
 
Impact of cloud services on the work of oracle technology experts
Impact of cloud services on the work of oracle technology expertsImpact of cloud services on the work of oracle technology experts
Impact of cloud services on the work of oracle technology experts
 
Mixing d ps building architecture on the cross cutting example
Mixing d ps building architecture on the cross cutting exampleMixing d ps building architecture on the cross cutting example
Mixing d ps building architecture on the cross cutting example
 
Short-Training asp.net vNext
Short-Training asp.net vNextShort-Training asp.net vNext
Short-Training asp.net vNext
 
A Day in the Life: Developer Enhancements with Visual Studio 2012
A Day in the Life: Developer Enhancements with Visual Studio 2012A Day in the Life: Developer Enhancements with Visual Studio 2012
A Day in the Life: Developer Enhancements with Visual Studio 2012
 
Search Architecture at Evernote: Presented by Christian Kohlschütter, Evernote
Search Architecture at Evernote: Presented by Christian Kohlschütter, EvernoteSearch Architecture at Evernote: Presented by Christian Kohlschütter, Evernote
Search Architecture at Evernote: Presented by Christian Kohlschütter, Evernote
 
PostgreSQL and MySQL
PostgreSQL and MySQLPostgreSQL and MySQL
PostgreSQL and MySQL
 
An Introduction to MongoDB Ops Manager
An Introduction to MongoDB Ops ManagerAn Introduction to MongoDB Ops Manager
An Introduction to MongoDB Ops Manager
 
CFWheels - Pragmatic, Beautiful Code
CFWheels - Pragmatic, Beautiful CodeCFWheels - Pragmatic, Beautiful Code
CFWheels - Pragmatic, Beautiful Code
 
Big datatraining.in devops-part1
Big datatraining.in devops-part1Big datatraining.in devops-part1
Big datatraining.in devops-part1
 

More from TAUS - The Language Data Network

TAUS Global Content Summit Amsterdam 2019 / Beyond MT. A few premature reflec...
TAUS Global Content Summit Amsterdam 2019 / Beyond MT. A few premature reflec...TAUS Global Content Summit Amsterdam 2019 / Beyond MT. A few premature reflec...
TAUS Global Content Summit Amsterdam 2019 / Beyond MT. A few premature reflec...TAUS - The Language Data Network
 
TAUS Global Content Summit Amsterdam 2019 / Measure with DQF, Dace Dzeguze (T...
TAUS Global Content Summit Amsterdam 2019 / Measure with DQF, Dace Dzeguze (T...TAUS Global Content Summit Amsterdam 2019 / Measure with DQF, Dace Dzeguze (T...
TAUS Global Content Summit Amsterdam 2019 / Measure with DQF, Dace Dzeguze (T...TAUS - The Language Data Network
 
TAUS Global Content Summit Amsterdam 2019 / Automatic for the People by Domin...
TAUS Global Content Summit Amsterdam 2019 / Automatic for the People by Domin...TAUS Global Content Summit Amsterdam 2019 / Automatic for the People by Domin...
TAUS Global Content Summit Amsterdam 2019 / Automatic for the People by Domin...TAUS - The Language Data Network
 
TAUS Global Content Summit Amsterdam 2019 / The Quantum Leap: Human Parity, C...
TAUS Global Content Summit Amsterdam 2019 / The Quantum Leap: Human Parity, C...TAUS Global Content Summit Amsterdam 2019 / The Quantum Leap: Human Parity, C...
TAUS Global Content Summit Amsterdam 2019 / The Quantum Leap: Human Parity, C...TAUS - The Language Data Network
 
TAUS Global Content Summit Amsterdam 2019 / Growing Business by Connecting Co...
TAUS Global Content Summit Amsterdam 2019 / Growing Business by Connecting Co...TAUS Global Content Summit Amsterdam 2019 / Growing Business by Connecting Co...
TAUS Global Content Summit Amsterdam 2019 / Growing Business by Connecting Co...TAUS - The Language Data Network
 
Achieving Translation Efficiency and Accuracy for Video Content, Xiao Yuan (P...
Achieving Translation Efficiency and Accuracy for Video Content, Xiao Yuan (P...Achieving Translation Efficiency and Accuracy for Video Content, Xiao Yuan (P...
Achieving Translation Efficiency and Accuracy for Video Content, Xiao Yuan (P...TAUS - The Language Data Network
 
Introduction Innovation Contest Shenzhen by Henri Broekmate (Lionbridge)
Introduction Innovation Contest Shenzhen by Henri Broekmate (Lionbridge)Introduction Innovation Contest Shenzhen by Henri Broekmate (Lionbridge)
Introduction Innovation Contest Shenzhen by Henri Broekmate (Lionbridge)TAUS - The Language Data Network
 
Game Changer for Linguistic Review: Shifting the Paradigm, Klaus Fleischmann...
 Game Changer for Linguistic Review: Shifting the Paradigm, Klaus Fleischmann... Game Changer for Linguistic Review: Shifting the Paradigm, Klaus Fleischmann...
Game Changer for Linguistic Review: Shifting the Paradigm, Klaus Fleischmann...TAUS - The Language Data Network
 
A translation memory P2P trading platform - to make global translation memory...
A translation memory P2P trading platform - to make global translation memory...A translation memory P2P trading platform - to make global translation memory...
A translation memory P2P trading platform - to make global translation memory...TAUS - The Language Data Network
 
Shiyibao — The Most Efficient Translation Feedback System Ever, Guanqing Hao ...
Shiyibao — The Most Efficient Translation Feedback System Ever, Guanqing Hao ...Shiyibao — The Most Efficient Translation Feedback System Ever, Guanqing Hao ...
Shiyibao — The Most Efficient Translation Feedback System Ever, Guanqing Hao ...TAUS - The Language Data Network
 
Stepes – Instant Human Translation Services for the Digital World, Carl Yao (...
Stepes – Instant Human Translation Services for the Digital World, Carl Yao (...Stepes – Instant Human Translation Services for the Digital World, Carl Yao (...
Stepes – Instant Human Translation Services for the Digital World, Carl Yao (...TAUS - The Language Data Network
 
Smart Translation Resource Management: Semantic Matching, Kirk Zhang (Wiitran...
Smart Translation Resource Management: Semantic Matching, Kirk Zhang (Wiitran...Smart Translation Resource Management: Semantic Matching, Kirk Zhang (Wiitran...
Smart Translation Resource Management: Semantic Matching, Kirk Zhang (Wiitran...TAUS - The Language Data Network
 
The Theory and Practice of Computer Aided Translation Training System, Liu Q...
 The Theory and Practice of Computer Aided Translation Training System, Liu Q... The Theory and Practice of Computer Aided Translation Training System, Liu Q...
The Theory and Practice of Computer Aided Translation Training System, Liu Q...TAUS - The Language Data Network
 
How to efficiently use large-scale TMs in translation, Jing Zhang (Tmxmall)
How to efficiently use large-scale TMs in translation, Jing Zhang (Tmxmall)How to efficiently use large-scale TMs in translation, Jing Zhang (Tmxmall)
How to efficiently use large-scale TMs in translation, Jing Zhang (Tmxmall)TAUS - The Language Data Network
 
A use-case for getting MT into your company, Kerstin Berns (berns language c...
 A use-case for getting MT into your company, Kerstin Berns (berns language c... A use-case for getting MT into your company, Kerstin Berns (berns language c...
A use-case for getting MT into your company, Kerstin Berns (berns language c...TAUS - The Language Data Network
 

More from TAUS - The Language Data Network (20)

TAUS Global Content Summit Amsterdam 2019 / Beyond MT. A few premature reflec...
TAUS Global Content Summit Amsterdam 2019 / Beyond MT. A few premature reflec...TAUS Global Content Summit Amsterdam 2019 / Beyond MT. A few premature reflec...
TAUS Global Content Summit Amsterdam 2019 / Beyond MT. A few premature reflec...
 
TAUS Global Content Summit Amsterdam 2019 / Measure with DQF, Dace Dzeguze (T...
TAUS Global Content Summit Amsterdam 2019 / Measure with DQF, Dace Dzeguze (T...TAUS Global Content Summit Amsterdam 2019 / Measure with DQF, Dace Dzeguze (T...
TAUS Global Content Summit Amsterdam 2019 / Measure with DQF, Dace Dzeguze (T...
 
TAUS Global Content Summit Amsterdam 2019 / Automatic for the People by Domin...
TAUS Global Content Summit Amsterdam 2019 / Automatic for the People by Domin...TAUS Global Content Summit Amsterdam 2019 / Automatic for the People by Domin...
TAUS Global Content Summit Amsterdam 2019 / Automatic for the People by Domin...
 
TAUS Global Content Summit Amsterdam 2019 / The Quantum Leap: Human Parity, C...
TAUS Global Content Summit Amsterdam 2019 / The Quantum Leap: Human Parity, C...TAUS Global Content Summit Amsterdam 2019 / The Quantum Leap: Human Parity, C...
TAUS Global Content Summit Amsterdam 2019 / The Quantum Leap: Human Parity, C...
 
TAUS Global Content Summit Amsterdam 2019 / Growing Business by Connecting Co...
TAUS Global Content Summit Amsterdam 2019 / Growing Business by Connecting Co...TAUS Global Content Summit Amsterdam 2019 / Growing Business by Connecting Co...
TAUS Global Content Summit Amsterdam 2019 / Growing Business by Connecting Co...
 
Achieving Translation Efficiency and Accuracy for Video Content, Xiao Yuan (P...
Achieving Translation Efficiency and Accuracy for Video Content, Xiao Yuan (P...Achieving Translation Efficiency and Accuracy for Video Content, Xiao Yuan (P...
Achieving Translation Efficiency and Accuracy for Video Content, Xiao Yuan (P...
 
Introduction Innovation Contest Shenzhen by Henri Broekmate (Lionbridge)
Introduction Innovation Contest Shenzhen by Henri Broekmate (Lionbridge)Introduction Innovation Contest Shenzhen by Henri Broekmate (Lionbridge)
Introduction Innovation Contest Shenzhen by Henri Broekmate (Lionbridge)
 
Game Changer for Linguistic Review: Shifting the Paradigm, Klaus Fleischmann...
 Game Changer for Linguistic Review: Shifting the Paradigm, Klaus Fleischmann... Game Changer for Linguistic Review: Shifting the Paradigm, Klaus Fleischmann...
Game Changer for Linguistic Review: Shifting the Paradigm, Klaus Fleischmann...
 
A translation memory P2P trading platform - to make global translation memory...
A translation memory P2P trading platform - to make global translation memory...A translation memory P2P trading platform - to make global translation memory...
A translation memory P2P trading platform - to make global translation memory...
 
Shiyibao — The Most Efficient Translation Feedback System Ever, Guanqing Hao ...
Shiyibao — The Most Efficient Translation Feedback System Ever, Guanqing Hao ...Shiyibao — The Most Efficient Translation Feedback System Ever, Guanqing Hao ...
Shiyibao — The Most Efficient Translation Feedback System Ever, Guanqing Hao ...
 
Stepes – Instant Human Translation Services for the Digital World, Carl Yao (...
Stepes – Instant Human Translation Services for the Digital World, Carl Yao (...Stepes – Instant Human Translation Services for the Digital World, Carl Yao (...
Stepes – Instant Human Translation Services for the Digital World, Carl Yao (...
 
Farmer Lv (TrueTran)
Farmer Lv (TrueTran)Farmer Lv (TrueTran)
Farmer Lv (TrueTran)
 
Smart Translation Resource Management: Semantic Matching, Kirk Zhang (Wiitran...
Smart Translation Resource Management: Semantic Matching, Kirk Zhang (Wiitran...Smart Translation Resource Management: Semantic Matching, Kirk Zhang (Wiitran...
Smart Translation Resource Management: Semantic Matching, Kirk Zhang (Wiitran...
 
The Theory and Practice of Computer Aided Translation Training System, Liu Q...
 The Theory and Practice of Computer Aided Translation Training System, Liu Q... The Theory and Practice of Computer Aided Translation Training System, Liu Q...
The Theory and Practice of Computer Aided Translation Training System, Liu Q...
 
Translation Technology Showcase in Shenzhen
Translation Technology Showcase in ShenzhenTranslation Technology Showcase in Shenzhen
Translation Technology Showcase in Shenzhen
 
How to efficiently use large-scale TMs in translation, Jing Zhang (Tmxmall)
How to efficiently use large-scale TMs in translation, Jing Zhang (Tmxmall)How to efficiently use large-scale TMs in translation, Jing Zhang (Tmxmall)
How to efficiently use large-scale TMs in translation, Jing Zhang (Tmxmall)
 
SDL Trados Studio 2017, Jocelyn He (SDL)
SDL Trados Studio 2017, Jocelyn He (SDL)SDL Trados Studio 2017, Jocelyn He (SDL)
SDL Trados Studio 2017, Jocelyn He (SDL)
 
How we train post-editors - Yongpeng Wei (Lingosail)
How we train post-editors - Yongpeng Wei (Lingosail)How we train post-editors - Yongpeng Wei (Lingosail)
How we train post-editors - Yongpeng Wei (Lingosail)
 
A use-case for getting MT into your company, Kerstin Berns (berns language c...
 A use-case for getting MT into your company, Kerstin Berns (berns language c... A use-case for getting MT into your company, Kerstin Berns (berns language c...
A use-case for getting MT into your company, Kerstin Berns (berns language c...
 
QE integrated in XTM, by Bob Willans (XTM)
QE integrated in XTM, by Bob Willans (XTM)QE integrated in XTM, by Bob Willans (XTM)
QE integrated in XTM, by Bob Willans (XTM)
 

Recently uploaded

Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsKotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticscarlostorres15106
 
CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):comworks
 
APIForce Zurich 5 April Automation LPDG
APIForce Zurich 5 April  Automation LPDGAPIForce Zurich 5 April  Automation LPDG
APIForce Zurich 5 April Automation LPDGMarianaLemus7
 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubKalema Edgar
 
Making_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptx
Making_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptxMaking_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptx
Making_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptxnull - The Open Security Community
 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationRidwan Fadjar
 
Bluetooth Controlled Car with Arduino.pdf
Bluetooth Controlled Car with Arduino.pdfBluetooth Controlled Car with Arduino.pdf
Bluetooth Controlled Car with Arduino.pdfngoud9212
 
Benefits Of Flutter Compared To Other Frameworks
Benefits Of Flutter Compared To Other FrameworksBenefits Of Flutter Compared To Other Frameworks
Benefits Of Flutter Compared To Other FrameworksSoftradix Technologies
 
SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024Scott Keck-Warren
 
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsKotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsAndrey Dotsenko
 
Understanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitectureUnderstanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitecturePixlogix Infotech
 
costume and set research powerpoint presentation
costume and set research powerpoint presentationcostume and set research powerpoint presentation
costume and set research powerpoint presentationphoebematthew05
 
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024BookNet Canada
 
Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Scott Keck-Warren
 
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr LapshynFwdays
 
Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsScanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsRizwan Syed
 
Science&tech:THE INFORMATION AGE STS.pdf
Science&tech:THE INFORMATION AGE STS.pdfScience&tech:THE INFORMATION AGE STS.pdf
Science&tech:THE INFORMATION AGE STS.pdfjimielynbastida
 

Recently uploaded (20)

Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsKotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
 
Hot Sexy call girls in Panjabi Bagh 🔝 9953056974 🔝 Delhi escort Service
Hot Sexy call girls in Panjabi Bagh 🔝 9953056974 🔝 Delhi escort ServiceHot Sexy call girls in Panjabi Bagh 🔝 9953056974 🔝 Delhi escort Service
Hot Sexy call girls in Panjabi Bagh 🔝 9953056974 🔝 Delhi escort Service
 
CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):
 
APIForce Zurich 5 April Automation LPDG
APIForce Zurich 5 April  Automation LPDGAPIForce Zurich 5 April  Automation LPDG
APIForce Zurich 5 April Automation LPDG
 
The transition to renewables in India.pdf
The transition to renewables in India.pdfThe transition to renewables in India.pdf
The transition to renewables in India.pdf
 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding Club
 
Making_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptx
Making_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptxMaking_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptx
Making_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptx
 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 Presentation
 
Bluetooth Controlled Car with Arduino.pdf
Bluetooth Controlled Car with Arduino.pdfBluetooth Controlled Car with Arduino.pdf
Bluetooth Controlled Car with Arduino.pdf
 
Benefits Of Flutter Compared To Other Frameworks
Benefits Of Flutter Compared To Other FrameworksBenefits Of Flutter Compared To Other Frameworks
Benefits Of Flutter Compared To Other Frameworks
 
SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024
 
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsKotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
 
Understanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitectureUnderstanding the Laravel MVC Architecture
Understanding the Laravel MVC Architecture
 
costume and set research powerpoint presentation
costume and set research powerpoint presentationcostume and set research powerpoint presentation
costume and set research powerpoint presentation
 
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
 
Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024
 
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
 
Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsScanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL Certs
 
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptxE-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
 
Science&tech:THE INFORMATION AGE STS.pdf
Science&tech:THE INFORMATION AGE STS.pdfScience&tech:THE INFORMATION AGE STS.pdf
Science&tech:THE INFORMATION AGE STS.pdf
 

TAUS Moses Industry Roundtable 2014, Changes in Moses, Hieu Hoang, University of Edinburgh

  • 1. Changes in Moses Hieu Hoang TAUS October 2014
  • 2. MosesCore Year 1 (2012) • Easier installation – Binary releases – Pre-built models • Testing and Releases – Linux, Mac OSX, Windows – 32 and 64-bit • Faster training – Parallelism at all stages
  • 3. MosesCore Year 2 (2013) • Even Easier installation – Binary releases – Pre-built models – Virtual Machines – Amazon EC2 • Refactored Decoder
  • 4. MosesCore Year 2 (2013) • Even Easier installation – Binary releases – Pre-built models – Virtual Machines – Amazon EC2 • Refactored Decoder
  • 5. Why did you Refactor? • Feature Function Framework – easier to implement new features – use sparse features • Simplify class structure – easier to develop with Moses • Delete functionality – easier to refactor code – very little deletion
  • 6. Why did you Refactor? • Feature Function Framework – easier to implement new features – use sparse features • Simplify class structure – easier to develop with Moses • Delete functionality – easier to refactor code – very little deletion
  • 7. Why did you Refactor? • Feature Function Framework – easier to implement new features – use sparse features • Simplify class structure – easier to develop with Moses • Delete functionality – easier to refactor code – very little deletion
  • 8. Specify a Feature Function Then…. [lmodel-file] 8 0 3 europarl.en.srilm.gz [weight-l] 0.142 ini file: • New Feature Function – New sections ● [feature-function-file] ● [weight-?] • Custom code – Parse ini file – Initialize feature function
  • 9. Adding new Feature Function Now…. [feature] KENLM file=path order=0 [weight] KENLM0= 0.142 ini file: • New Feature Function – No new section ● Line in [feature] section ● Line in [weight] section – Framework ● parse ini file ● initialize feature
  • 10. MosesCore Year 3 (2014) • Exploit new framework – Updatable phrase-table – Neural network language model – Bilingual language models – Transliteration • Translation rule properties • Syntax decoding
  • 11. MosesCore Year 3 (2014) • Exploit new framework – Updatable phrase-table ● Dynamic suffix array ● Stores training data – Extract translation rule on-the-fly – Neural network language model – Bilingual language models – Transliteration • Translation rule properties • Syntax decoding
  • 12. MosesCore Year 3 (2014) • Exploit new framework – Updatable phrase-table – Neural network language model ● Continuous space LM – Bilingual language models – Transliteration • Translation rule properties • Syntax decoding
  • 13. MosesCore Year 3 (2014) • Exploit new framework – Updatable phrase-table – Neural network language model – Bilingual language models ● Replicate Devlin et al, 2014 ● Large quality gains – Transliteration • Translation rule properties • Syntax decoding
  • 14. MosesCore Year 3 (2014) • Exploit new framework – Updatable phrase-table – Neural network language model – Bilingual language models – Transliteration ● Character level translation ● Learns from parallel data ● Integrate into decoder • Translation rule properties • Syntax decoding
  • 15. MosesCore Year 3 (2014) • Exploit new framework – Updatable phrase-table – Neural network language model – Bilingual language models – Transliteration • Translation rule properties – Extra information for each rule ● Context, syntax, domain etc • Syntax decoding
  • 16. MosesCore Year 3 (2014) • Exploit new framework – Updatable phrase-table – Neural network language model – Bilingual language models – Transliteration • Translation rule properties • Syntax decoding – Faster, memory efficient decoding – More syntactic models
  • 17. Technical Breakout • Organization and Releases – Academic and commercial needs – Prevent forks – Development/Stable versions – Forwards/Backward compatibility – Upgradability • Features • Deployment • Future development
  • 18. Technical Breakout • Organization and Releases • Features • Deployment • Future development
  • 19. Technical Breakout • Organization and Releases • Features • Deployment – Platform/Clouds – Docker containers – Priorities – Interaction and data formats • Future development
  • 20. Technical Breakout • Organization and Releases • Features • Deployment • Future development – User-friendliness – End-to-end solution – Users
  • 21.
  • 22. Changes in Moses Hieu Hoang TAUS October 2014 Thanks for inviting me to come Here to tell you a little about the things I’ve been doing to Moses - over the past 2 years - mainly concentrate of the past year - but will quickly tell you about things I did prior to that 1
  • 23. MosesCore Year 1 (2012) • Easier installation – Binary releases – Pre-built models • Testing and Releases – Linux, Mac OSX, Windows – 32 and 64-bit • Faster training – Parallelism at all stages In the 1st year - picked off the low hanging fruit - fixed many of the easy issues that required - time & effort Made installation easier Run a lot of experiments anyway - gave some of them away - with all the scripts + configuration - used to run them - students can see how to replicate our results Lots of testing - all major platforms Made obvious speed improvements 2
  • 24. MosesCore Year 2 (2013) • Even Easier installation – Binary releases – Pre-built models – Virtual Machines – Amazon EC2 • Refactored Decoder In year 2 - made it even easier to install - if you can’t be bother to compile or even download the binaries - download a virtual machine with moses + friends installed OR rent an amazon server with moses + friends installed 3
  • 25. MosesCore Year 2 (2013) • Even Easier installation – Binary releases – Pre-built models – Virtual Machines – Amazon EC2 • Refactored Decoder However, the main reason I came here today - talk about the major changes I made - in decoder - and else where Makes is easier for us coders - add and change things in Moses 4
  • 26. Why did you Refactor? • Feature Function Framework – easier to implement new features – use sparse features • Simplify class structure – easier to develop with Moses • Delete functionality – easier to refactor code – very little deletion What is a feature function? - something that gives a translation a score over the last few years - gotten bored with existing features like language models and reordering models the trend in MT - create novel features which give a score to a translation - like any feature, tries to give bigger scores to better models New feature function framework - designed to make it easy to add new features 5
  • 27. Why did you Refactor? • Feature Function Framework – easier to implement new features – use sparse features • Simplify class structure – easier to develop with Moses • Delete functionality – easier to refactor code – very little deletion Simplify class structure - to make it easier for us to develop with Moses - Moses has been around for 8 years now - everyone has the freedom to add what they want - no-one is in overall control - this way of organising an open-source project is great - gotten lots of contribution, lots of features - downside - grown organically - things are not as well structured as they can be 6
  • 28. Why did you Refactor? • Feature Function Framework – easier to implement new features – use sparse features • Simplify class structure – easier to develop with Moses • Delete functionality – easier to refactor code – very little deletion Why did I delete things - delete very little - I’m not the gatekeeper of moses, I don’t control it - if a functionality was deleted, it’s not a comment on usefulness of it - purely ‘cos it got in the way of the refactoring Quickly go thru the last 2 - before telling you about feature functions 7
  • 29. Specify a Feature Function Then…. [lmodel-file] 8 0 3 europarl.en.srilm.gz [weight-l] 0.142 ini file: • New Feature Function – New sections ● [feature-function-file] ● [weight-?] • Custom code – Parse ini file – Initialize feature function completely bestoked - no framework to help you - if you don’t do it right, wont’ work 8
  • 30. Adding new Feature Function Now…. [feature] KENLM file=path order=0 [weight] KENLM0= 0.142 ini file: • New Feature Function – No new section ● Line in [feature] section ● Line in [weight] section – Framework ● parse ini file ● initialize feature Write a class that implements the feature function The framework does the rest - no need to create a custom section in the ini file or - change StaticData class or - change Paramater class 9
  • 31. MosesCore Year 3 (2014) • Exploit new framework – Updatable phrase-table – Neural network language model – Bilingual language models – Transliteration • Translation rule properties • Syntax decoding
  • 32. MosesCore Year 3 (2014) • Exploit new framework – Updatable phrase-table ● Dynamic suffix array ● Stores training data – Extract translation rule on-the-fly – Neural network language model – Bilingual language models – Transliteration • Translation rule properties • Syntax decoding
  • 33. MosesCore Year 3 (2014) • Exploit new framework – Updatable phrase-table – Neural network language model ● Continuous space LM – Bilingual language models – Transliteration • Translation rule properties • Syntax decoding
  • 34. MosesCore Year 3 (2014) • Exploit new framework – Updatable phrase-table – Neural network language model – Bilingual language models ● Replicate Devlin et al, 2014 ● Large quality gains – Transliteration • Translation rule properties • Syntax decoding
  • 35. MosesCore Year 3 (2014) • Exploit new framework – Updatable phrase-table – Neural network language model – Bilingual language models – Transliteration ● Character level translation ● Learns from parallel data ● Integrate into decoder • Translation rule properties • Syntax decoding
  • 36. MosesCore Year 3 (2014) • Exploit new framework – Updatable phrase-table – Neural network language model – Bilingual language models – Transliteration • Translation rule properties – Extra information for each rule ● Context, syntax, domain etc • Syntax decoding
  • 37. MosesCore Year 3 (2014) • Exploit new framework – Updatable phrase-table – Neural network language model – Bilingual language models – Transliteration • Translation rule properties • Syntax decoding – Faster, memory efficient decoding – More syntactic models
  • 38. Technical Breakout • Organization and Releases – Academic and commercial needs – Prevent forks – Development/Stable versions – Forwards/Backward compatibility – Upgradability • Features • Deployment • Future development
  • 39. Technical Breakout • Organization and Releases • Features • Deployment • Future development
  • 40. Technical Breakout • Organization and Releases • Features • Deployment – Platform/Clouds – Docker containers – Priorities – Interaction and data formats • Future development
  • 41. Technical Breakout • Organization and Releases • Features • Deployment • Future development – User-friendliness – End-to-end solution – Users