Language Weaver Components



  [Diagram: Server; Web Service or C++ API; Language Weaver Decoder; Specific Translation Engine; Domain Dictionary; Customizer; Language Model]

  Processor: 2 GHz, Pentium 4
  RAM: 2 GB
  Hard Disk Space: 100 MB
  Operating System: Microsoft Windows 2000 Advanced Server SP3 or Microsoft Windows 2003




     Language Weaver should be installed on a dedicated machine
Specific Translation Engine

Improves translation quality using specific data




The specific translation engine is built by processing pre-translated documents.
       LW software “learns” from what it sees in pre-translated documents, so the first step in making the
       translation output more accurate and appropriate for your organization is to give the software your
       organization’s translated documents. This information can come from an existing translation system,
       knowledge bases, human resources documents, technical manuals, etc. Language Weaver will need
       digital versions of both the original document and its translation; the more data we have, the better
       the translation output.

Then the processed documents are “aligned” with the translated versions using a
Language Weaver alignment tool.
        Once the documents are gathered, each document and its respective translation need to be
        aligned at the sentence or segment level. To help automate this process, users can use Language
        Weaver’s alignment tool or a third-party alignment application. This alignment process prepares
        the parallel corpus shown in the graphic on the previous page.
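
As a rough illustration of what a sentence-aligned parallel corpus looks like, the sketch below pairs source and target segments by position. This is a deliberately naive stand-in, not Language Weaver's alignment tool: it assumes both documents have already been split into the same number of segments, and the function name `pair_segments` and the sample sentences are hypothetical.

```python
from typing import List, Tuple


def pair_segments(source: List[str], target: List[str]) -> List[Tuple[str, str]]:
    """Pair source and target segments by position (naive one-to-one alignment)."""
    if len(source) != len(target):
        # Positional pairing is only safe when both sides have equal length.
        raise ValueError("segment counts differ; positional pairing is not safe")
    pairs = []
    for src, tgt in zip(source, target):
        src, tgt = src.strip(), tgt.strip()
        if src and tgt:  # skip pairs where either side is empty
            pairs.append((src, tgt))
    return pairs


# Hypothetical sample data: an original document and its translation.
source_doc = ["The pump is off.", "Press the green button."]
target_doc = ["La bomba está apagada.", "Pulse el botón verde."]

parallel_corpus = pair_segments(source_doc, target_doc)
for src, tgt in parallel_corpus:
    print(f"{src}\t{tgt}")
```

Real documents rarely line up one-to-one, which is why a dedicated alignment tool is used in practice.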

 If needed, the process can be repeated to improve accuracy and relevance.



    Specific translation engines may already exist within the Agency.
Web Service or C++ API

2 interfaces exist for sending and receiving translations



1) Web Service – suggested as a loosely coupled interface
  •   Language Weaver supplies a WSDL for interfacing systems to use as the contract on how
      to use Language Weaver.
  •   Technology agnostic – Web Service communication allows any technology able to adhere
      to the WSDL methods and Simple Object Access Protocol (SOAP) to leverage Language
      Weaver.

2) C++ API – Local or remote, Windows-only interface
  •   Two licenses (local or remote) depending on need
  •   The local license is used for applications or processes running on the same machine as Language
      Weaver (LW). The remote license communicates with LW via port-to-port communication.
  •   Recommended for systems or products embedding LW.
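
For the Web Service interface, a client in any technology only needs to produce a SOAP message that matches the supplied WSDL. The sketch below builds such an envelope with Python's standard library. The service namespace, the `Translate` operation, and its element names are assumptions made for illustration; the real names come from the WSDL that Language Weaver supplies.

```python
import xml.etree.ElementTree as ET

SOAP_NS = "http://schemas.xmlsoap.org/soap/envelope/"  # SOAP 1.1 envelope namespace
SERVICE_NS = "urn:example:translation"  # hypothetical; take the real one from the WSDL


def build_translate_request(text: str, source_lang: str, target_lang: str) -> str:
    """Build a SOAP 1.1 envelope for a hypothetical Translate operation."""
    ET.register_namespace("soap", SOAP_NS)
    envelope = ET.Element(f"{{{SOAP_NS}}}Envelope")
    body = ET.SubElement(envelope, f"{{{SOAP_NS}}}Body")
    # Operation and parameter names below are assumptions, not the vendor's contract.
    operation = ET.SubElement(body, f"{{{SERVICE_NS}}}Translate")
    ET.SubElement(operation, f"{{{SERVICE_NS}}}sourceLanguage").text = source_lang
    ET.SubElement(operation, f"{{{SERVICE_NS}}}targetLanguage").text = target_lang
    ET.SubElement(operation, f"{{{SERVICE_NS}}}text").text = text
    return ET.tostring(envelope, encoding="unicode")


request_xml = build_translate_request("Hello world", "en", "es")
print(request_xml)
```

The resulting string would be sent in an HTTP POST to the service endpoint; that step is omitted here because the endpoint is deployment-specific.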

Input Formats
  •   Both interfaces allow translation of input of any size:
             • Documents
             • Snippets of text
             • A single document
  •   Supported input formats:
             • Plain text, HTML, TMX, XLIFF, PDF, ODF (OpenOffice/MS Office documents)
Other Components

Improve translation fidelity using specific lists and data



Decoder – main processing unit within Language Weaver
  •       The commander of the translation process
  •       Leverages other components (Language Model, Dictionary, etc.) and uses statistical
          analysis to produce translations

Domain Dictionary
  •       A set of domain-specific words (nouns, labels, adjectives)
  •       Bilingual entries that are matched at run-time against the incoming text.
  •       Users can create multiple specialized dictionaries to meet ongoing needs.
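
One way to picture run-time matching of bilingual entries against incoming text is a longest-entry-first lookup, sketched below. This is an assumption about how such matching could work, not a description of Language Weaver's decoder; the function name and the sample dictionary are hypothetical.

```python
import re
from typing import Dict, List, Tuple


def find_dictionary_matches(text: str, dictionary: Dict[str, str]) -> List[Tuple[str, str]]:
    """Return (source term, target term) for each dictionary entry found in text.

    Longer entries are tried first so multi-word terms win over their parts.
    """
    matches = []
    remaining = text.lower()
    for term in sorted(dictionary, key=len, reverse=True):
        pattern = r"\b" + re.escape(term.lower()) + r"\b"
        if re.search(pattern, remaining):
            matches.append((term, dictionary[term]))
            # Blank out the matched span so shorter entries cannot reuse it.
            remaining = re.sub(pattern, " ", remaining)
    return matches


# Hypothetical English-to-Spanish domain dictionary.
domain_dictionary = {
    "pressure valve": "válvula de presión",
    "valve": "válvula",
    "pump": "bomba",
}

hits = find_dictionary_matches(
    "Close the pressure valve before starting the pump.", domain_dictionary
)
print(hits)
```

Here "valve" is not reported separately because the longer entry "pressure valve" already covers it.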

Customizer
      •   A set of domain-specific words (nouns, labels, adjectives)
      •   Allows users to customize Language Weaver’s baseline translation software (on a small
          scale) to a specific subject domain.
      •   Users can create multiple specialized dictionaries to meet ongoing needs.
      •   Customization process is done in-house so that the system can be continuously updated
          and sensitive data stays secure
Suggested Architectures
A Translation Sub-System
[Diagram: Main System; Translation Sub-system; Translation Document Object]

  Content to translate
  Pre-process document
     •   Determine document type
     •   Identify language pair
     •   Perform any cleansing
     •   Assign priority
  Version check
     •   Check translation version to ensure the most recent translation engine is being used
     •   Or ensure that a specific translation engine is being applied
  Language Weaver Translate
  Apply metadata
     •   Submitted by
     •   Topics
     •   Dictionary updates
     •   Status (needs human review, success….)

Provides:
 •   Creates a central point for translation logic and application of specific meta-data
 •   Translated versions stored and displayed when requested
 •   Populates central corpus
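
The steps of this sub-system can be sketched as a small pipeline around a document object. Everything here is illustrative: the `TranslationDocument` fields, the integer engine versions, the order of the steps, and the `stub_engine` function (which stands in for the real Language Weaver call) are assumptions, not part of the product.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List, Optional


@dataclass
class TranslationDocument:
    """Hypothetical 'Translation Document Object' passed between the steps."""
    content: str
    doc_type: str = ""
    language_pair: str = ""
    priority: int = 0
    engine_version: int = 0
    translation: str = ""
    metadata: Dict[str, str] = field(default_factory=dict)


def pre_process(doc: TranslationDocument, language_pair: str, priority: int) -> None:
    # Determine document type (a deliberately crude check for illustration).
    doc.doc_type = "html" if "<html" in doc.content.lower() else "text"
    doc.language_pair = language_pair            # identify language pair
    doc.content = " ".join(doc.content.split())  # cleansing: collapse whitespace
    doc.priority = priority                      # assign priority


def version_check(doc: TranslationDocument, available: List[int],
                  pinned: Optional[int] = None) -> None:
    # Use the most recent engine unless a specific one is requested.
    version = max(available) if pinned is None else pinned
    if version not in available:
        raise LookupError(f"engine version {version} is not installed")
    doc.engine_version = version


def translate(doc: TranslationDocument, engine: Callable[[str, str], str]) -> None:
    doc.translation = engine(doc.content, doc.language_pair)


def apply_metadata(doc: TranslationDocument, submitted_by: str, topics: List[str]) -> None:
    doc.metadata["submitted_by"] = submitted_by
    doc.metadata["topics"] = ", ".join(topics)
    doc.metadata["status"] = "needs human review"


def stub_engine(text: str, language_pair: str) -> str:
    """Placeholder for the real translation call; it only tags the text."""
    return f"[{language_pair}] {text}"


doc = TranslationDocument(content="  Quarterly   report\n")
pre_process(doc, "en-es", priority=1)
version_check(doc, available=[1, 2, 3])
translate(doc, stub_engine)
apply_metadata(doc, submitted_by="analyst", topics=["finance"])
print(doc)
```

Keeping every step behind one object is what gives the main system a single point for translation logic and metadata.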
Suggested Architectures
Automated Batch Process

[Diagram: Content to translate; Queue; Translation Sub-system; Store Translation; Entity Extraction; Post Processing]

  Queue
     •   Determine if document is a duplicate
     •   Identify language pair
     •   Perform any cleansing
     •   Assign priority
  Entity Extraction
     •   Keywords, Hot words
     •   People, Places, Things
     •   Dates
     •   Topics
  Post Processing
     •   Alert specific user
     •   Index translation
     •   Add to corpus



Allows a separate translation factory to run independently of the main system
 •       Creates a central point for translation logic
 •       Translated versions stored and displayed when requested
 •       Populates central corpus and facilitates post-processing of translated content
Negatives
     •   Development of a sub-system requires additional resources and time
     •   Not needed IF:
          • Translation throughput is low (0-10 documents/hour)
          • Latency before the user sees the translation is acceptable
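
The queueing stage of the batch process (duplicate check, cleansing, priority) could look roughly like the sketch below. The class name `TranslationQueue`, the use of a content hash for duplicate detection, and the convention that a lower number means higher priority are all assumptions for illustration.

```python
import hashlib
import heapq
from typing import List, Optional, Set, Tuple


class TranslationQueue:
    """Priority queue that drops duplicate submissions (illustrative only)."""

    def __init__(self) -> None:
        self._heap: List[Tuple[int, int, str, str]] = []
        self._seen: Set[str] = set()
        self._counter = 0  # tie-breaker so equal priorities stay first-in, first-out

    def submit(self, content: str, language_pair: str, priority: int) -> bool:
        """Queue a document; return False if the same content was already queued."""
        cleaned = " ".join(content.split())  # cleansing: collapse whitespace
        digest = hashlib.sha256(
            (language_pair + "\n" + cleaned).encode("utf-8")
        ).hexdigest()
        if digest in self._seen:
            return False
        self._seen.add(digest)
        heapq.heappush(self._heap, (priority, self._counter, language_pair, cleaned))
        self._counter += 1
        return True

    def next_job(self) -> Optional[Tuple[str, str]]:
        """Pop the highest-priority job as (language_pair, content), or None if empty."""
        if not self._heap:
            return None
        _, _, language_pair, content = heapq.heappop(self._heap)
        return language_pair, content


queue = TranslationQueue()
queue.submit("Routine memo", "en-fr", priority=5)
queue.submit("Urgent  notice", "en-fr", priority=1)
duplicate_accepted = queue.submit("Urgent notice", "en-fr", priority=1)
first_job = queue.next_job()
print(duplicate_accepted, first_job)
```

A worker process would call `next_job` in a loop and hand each job to the translation sub-system, which is what lets the factory run independently of the main system.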
Suggested Architectures
Real-Time Batch Process

[Diagram: Content to translate; Language Weaver Translation Web Service; Store Translation]

Simplest implementation
 •   Creates a central point for translation logic
 •   Translated versions stored and displayed when requested
 •   Populates central corpus

Negatives
 •   Latency before the user sees the translated document may be in the neighborhood of 10 sec.
 •   May have scaling issues

This is suggested as the iteration 1 goal since it can be expanded to any other architecture.
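
A minimal sketch of this simplest architecture: call the translation service on request, store the result, and serve the stored copy afterwards. The class name `RealTimeTranslator` and the `fake_service` function are hypothetical; the latter stands in for a call to the Language Weaver translation web service.

```python
from typing import Callable, Dict, Tuple


class RealTimeTranslator:
    """Translate on demand and keep the result for later requests (illustrative)."""

    def __init__(self, call_service: Callable[[str, str], str]) -> None:
        self._call_service = call_service
        self._store: Dict[Tuple[str, str], str] = {}
        self.service_calls = 0  # how many times the service was actually invoked

    def get_translation(self, content: str, language_pair: str) -> str:
        key = (language_pair, content)
        if key not in self._store:
            # Only the first request for a document pays the translation latency.
            self._store[key] = self._call_service(content, language_pair)
            self.service_calls += 1
        return self._store[key]


def fake_service(text: str, language_pair: str) -> str:
    """Placeholder for the real web service call; it only tags the text."""
    return f"[{language_pair}] {text}"


translator = RealTimeTranslator(fake_service)
first = translator.get_translation("Status update", "en-de")
second = translator.get_translation("Status update", "en-de")
print(first, translator.service_calls)
```

Because callers only see `get_translation`, the direct service call can later be swapped for a queue or a full sub-system without changing them.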

More Related Content

What's hot

6 develop web20_with_rad-tim_frnacis_sarika-s
6 develop web20_with_rad-tim_frnacis_sarika-s6 develop web20_with_rad-tim_frnacis_sarika-s
6 develop web20_with_rad-tim_frnacis_sarika-sIBM
 
HP Microsoft SQL Server Data Management Solutions
HP Microsoft SQL Server Data Management SolutionsHP Microsoft SQL Server Data Management Solutions
HP Microsoft SQL Server Data Management SolutionsEduardo Castro
 
Branch repeater technical training presentation 26 oct-12
Branch repeater technical training presentation 26 oct-12Branch repeater technical training presentation 26 oct-12
Branch repeater technical training presentation 26 oct-12Nuno Alves
 
Innovating with Cisco Contact Center Architectures
Innovating with Cisco Contact Center ArchitecturesInnovating with Cisco Contact Center Architectures
Innovating with Cisco Contact Center ArchitecturesCisco Canada
 
Plongée en eaux profondes dans l'architecture du nouvel Exchange 2013
Plongée en eaux profondes dans l'architecture du nouvel Exchange 2013Plongée en eaux profondes dans l'architecture du nouvel Exchange 2013
Plongée en eaux profondes dans l'architecture du nouvel Exchange 2013Microsoft Décideurs IT
 
App-V og UE-V
App-V og UE-VApp-V og UE-V
App-V og UE-Vdkvug
 
Develop Ruby Applications Fast | TubroRuby
Develop Ruby Applications Fast | TubroRubyDevelop Ruby Applications Fast | TubroRuby
Develop Ruby Applications Fast | TubroRubyMichael Findling
 
Dedicated Hosting Service Agreement
Dedicated Hosting Service AgreementDedicated Hosting Service Agreement
Dedicated Hosting Service Agreementwebhostingguy
 
Android for Java Developers
Android for Java DevelopersAndroid for Java Developers
Android for Java DevelopersMarko Gargenta
 
Lotus Sametime 8.5: Using the new Sametime System Console
Lotus Sametime 8.5: Using the new Sametime System ConsoleLotus Sametime 8.5: Using the new Sametime System Console
Lotus Sametime 8.5: Using the new Sametime System Consolejackdowning
 
Android For Java Developers
Android For Java DevelopersAndroid For Java Developers
Android For Java DevelopersMike Wolfson
 
HTML5 and Timed Media Playback
HTML5 and Timed Media PlaybackHTML5 and Timed Media Playback
HTML5 and Timed Media PlaybackSidra Abbasi
 
Cvc2009 Moscow Xd3 Fabian Kienle Final
Cvc2009 Moscow Xd3  Fabian Kienle FinalCvc2009 Moscow Xd3  Fabian Kienle Final
Cvc2009 Moscow Xd3 Fabian Kienle FinalLiudmila Li
 

What's hot (20)

Exchange Server 2013 Architecture Deep Dive, Part 2
Exchange Server 2013 Architecture Deep Dive, Part 2 Exchange Server 2013 Architecture Deep Dive, Part 2
Exchange Server 2013 Architecture Deep Dive, Part 2
 
6 develop web20_with_rad-tim_frnacis_sarika-s
6 develop web20_with_rad-tim_frnacis_sarika-s6 develop web20_with_rad-tim_frnacis_sarika-s
6 develop web20_with_rad-tim_frnacis_sarika-s
 
HP Microsoft SQL Server Data Management Solutions
HP Microsoft SQL Server Data Management SolutionsHP Microsoft SQL Server Data Management Solutions
HP Microsoft SQL Server Data Management Solutions
 
Branch repeater technical training presentation 26 oct-12
Branch repeater technical training presentation 26 oct-12Branch repeater technical training presentation 26 oct-12
Branch repeater technical training presentation 26 oct-12
 
User Group Bi
User Group BiUser Group Bi
User Group Bi
 
Innovating with Cisco Contact Center Architectures
Innovating with Cisco Contact Center ArchitecturesInnovating with Cisco Contact Center Architectures
Innovating with Cisco Contact Center Architectures
 
Plongée en eaux profondes dans l'architecture du nouvel Exchange 2013
Plongée en eaux profondes dans l'architecture du nouvel Exchange 2013Plongée en eaux profondes dans l'architecture du nouvel Exchange 2013
Plongée en eaux profondes dans l'architecture du nouvel Exchange 2013
 
App-V og UE-V
App-V og UE-VApp-V og UE-V
App-V og UE-V
 
Develop Ruby Applications Fast | TubroRuby
Develop Ruby Applications Fast | TubroRubyDevelop Ruby Applications Fast | TubroRuby
Develop Ruby Applications Fast | TubroRuby
 
Dedicated Hosting Service Agreement
Dedicated Hosting Service AgreementDedicated Hosting Service Agreement
Dedicated Hosting Service Agreement
 
Android Deep Dive
Android Deep DiveAndroid Deep Dive
Android Deep Dive
 
Android for Java Developers
Android for Java DevelopersAndroid for Java Developers
Android for Java Developers
 
4
44
4
 
XenApp Presentsation 25022010
XenApp Presentsation 25022010XenApp Presentsation 25022010
XenApp Presentsation 25022010
 
Lotus Sametime 8.5: Using the new Sametime System Console
Lotus Sametime 8.5: Using the new Sametime System ConsoleLotus Sametime 8.5: Using the new Sametime System Console
Lotus Sametime 8.5: Using the new Sametime System Console
 
Server 2008 R2 Yeniliklər
Server 2008 R2 YeniliklərServer 2008 R2 Yeniliklər
Server 2008 R2 Yeniliklər
 
Android For Java Developers
Android For Java DevelopersAndroid For Java Developers
Android For Java Developers
 
Slide
SlideSlide
Slide
 
HTML5 and Timed Media Playback
HTML5 and Timed Media PlaybackHTML5 and Timed Media Playback
HTML5 and Timed Media Playback
 
Cvc2009 Moscow Xd3 Fabian Kienle Final
Cvc2009 Moscow Xd3  Fabian Kienle FinalCvc2009 Moscow Xd3  Fabian Kienle Final
Cvc2009 Moscow Xd3 Fabian Kienle Final
 

Similar to Language Weaver

Voice Enabled Desktop Interaction and Control System (VEDICS).
Voice Enabled Desktop Interaction and Control System (VEDICS).Voice Enabled Desktop Interaction and Control System (VEDICS).
Voice Enabled Desktop Interaction and Control System (VEDICS).AEGIS-ACCESSIBLE Projects
 
Lets Localize Your Plugins
Lets Localize Your PluginsLets Localize Your Plugins
Lets Localize Your Pluginsguesta950cdd8d
 
A Lap Around Silverlight 5
A Lap Around Silverlight 5A Lap Around Silverlight 5
A Lap Around Silverlight 5Frank La Vigne
 
The Spoofax Language Workbench (SPLASH 2010)
The Spoofax Language Workbench (SPLASH 2010)The Spoofax Language Workbench (SPLASH 2010)
The Spoofax Language Workbench (SPLASH 2010)lennartkats
 
Mike Taulty TechDays 2010 Silverlight 4 - What's New?
Mike Taulty TechDays 2010 Silverlight 4 - What's New?Mike Taulty TechDays 2010 Silverlight 4 - What's New?
Mike Taulty TechDays 2010 Silverlight 4 - What's New?ukdpe
 
Soa204 Kawasaki Final
Soa204 Kawasaki FinalSoa204 Kawasaki Final
Soa204 Kawasaki FinalAnush Kumar
 
What’s new in Visual Studio 2012 & .NET 4.5
What’s new in Visual Studio 2012 & .NET 4.5What’s new in Visual Studio 2012 & .NET 4.5
What’s new in Visual Studio 2012 & .NET 4.5Robert MacLean
 
A 10 Point Localisation Plan For Games
A 10 Point Localisation Plan For GamesA 10 Point Localisation Plan For Games
A 10 Point Localisation Plan For GamesShamusd
 
HTML5, Silverlight & Kinect
HTML5, Silverlight & KinectHTML5, Silverlight & Kinect
HTML5, Silverlight & KinectFrank La Vigne
 
DSL, the absolute weapon for the development
DSL, the absolute weapon for the developmentDSL, the absolute weapon for the development
DSL, the absolute weapon for the developmentESUG
 
Introduction to .NET with C# @ university of wayamba
Introduction to .NET with C# @ university of wayambaIntroduction to .NET with C# @ university of wayamba
Introduction to .NET with C# @ university of wayambaPrageeth Sandakalum
 
Parallel language & compilers
Parallel language & compilersParallel language & compilers
Parallel language & compilersdikshagupta111
 
Cvc2009 Moscow Xen App5 Fp1 Fabian Kienle Final
Cvc2009 Moscow Xen App5 Fp1 Fabian Kienle FinalCvc2009 Moscow Xen App5 Fp1 Fabian Kienle Final
Cvc2009 Moscow Xen App5 Fp1 Fabian Kienle FinalLiudmila Li
 
Windows Server 2008 R2 Overview
Windows Server 2008 R2 OverviewWindows Server 2008 R2 Overview
Windows Server 2008 R2 OverviewSteven Wilder
 
Insight into progam execution ppt
Insight into progam execution pptInsight into progam execution ppt
Insight into progam execution pptKeerty Smile
 
Language Engineering With Xtext
Language Engineering With XtextLanguage Engineering With Xtext
Language Engineering With XtextSven Efftinge
 

Similar to Language Weaver (20)

Voice Enabled Desktop Interaction and Control System (VEDICS).
Voice Enabled Desktop Interaction and Control System (VEDICS).Voice Enabled Desktop Interaction and Control System (VEDICS).
Voice Enabled Desktop Interaction and Control System (VEDICS).
 
Lets Localize Your Plugins
Lets Localize Your PluginsLets Localize Your Plugins
Lets Localize Your Plugins
 
A Lap Around Silverlight 5
A Lap Around Silverlight 5A Lap Around Silverlight 5
A Lap Around Silverlight 5
 
Easyling at atc London
Easyling at atc LondonEasyling at atc London
Easyling at atc London
 
The Spoofax Language Workbench (SPLASH 2010)
The Spoofax Language Workbench (SPLASH 2010)The Spoofax Language Workbench (SPLASH 2010)
The Spoofax Language Workbench (SPLASH 2010)
 
Mike Taulty TechDays 2010 Silverlight 4 - What's New?
Mike Taulty TechDays 2010 Silverlight 4 - What's New?Mike Taulty TechDays 2010 Silverlight 4 - What's New?
Mike Taulty TechDays 2010 Silverlight 4 - What's New?
 
Soa204 Kawasaki Final
Soa204 Kawasaki FinalSoa204 Kawasaki Final
Soa204 Kawasaki Final
 
05
0505
05
 
What’s new in Visual Studio 2012 & .NET 4.5
What’s new in Visual Studio 2012 & .NET 4.5What’s new in Visual Studio 2012 & .NET 4.5
What’s new in Visual Studio 2012 & .NET 4.5
 
Subversion and bug tracking
Subversion and bug trackingSubversion and bug tracking
Subversion and bug tracking
 
A 10 Point Localisation Plan For Games
A 10 Point Localisation Plan For GamesA 10 Point Localisation Plan For Games
A 10 Point Localisation Plan For Games
 
HTML5, Silverlight & Kinect
HTML5, Silverlight & KinectHTML5, Silverlight & Kinect
HTML5, Silverlight & Kinect
 
Product internationalization
Product internationalizationProduct internationalization
Product internationalization
 
DSL, the absolute weapon for the development
DSL, the absolute weapon for the developmentDSL, the absolute weapon for the development
DSL, the absolute weapon for the development
 
Introduction to .NET with C# @ university of wayamba
Introduction to .NET with C# @ university of wayambaIntroduction to .NET with C# @ university of wayamba
Introduction to .NET with C# @ university of wayamba
 
Parallel language & compilers
Parallel language & compilersParallel language & compilers
Parallel language & compilers
 
Cvc2009 Moscow Xen App5 Fp1 Fabian Kienle Final
Cvc2009 Moscow Xen App5 Fp1 Fabian Kienle FinalCvc2009 Moscow Xen App5 Fp1 Fabian Kienle Final
Cvc2009 Moscow Xen App5 Fp1 Fabian Kienle Final
 
Windows Server 2008 R2 Overview
Windows Server 2008 R2 OverviewWindows Server 2008 R2 Overview
Windows Server 2008 R2 Overview
 
Insight into progam execution ppt
Insight into progam execution pptInsight into progam execution ppt
Insight into progam execution ppt
 
Language Engineering With Xtext
Language Engineering With XtextLanguage Engineering With Xtext
Language Engineering With Xtext
 

Recently uploaded

Catalogue ONG NUOC PPR DE NHAT .pdf
Catalogue ONG NUOC PPR DE NHAT      .pdfCatalogue ONG NUOC PPR DE NHAT      .pdf
Catalogue ONG NUOC PPR DE NHAT .pdfOrient Homes
 
Tech Startup Growth Hacking 101 - Basics on Growth Marketing
Tech Startup Growth Hacking 101  - Basics on Growth MarketingTech Startup Growth Hacking 101  - Basics on Growth Marketing
Tech Startup Growth Hacking 101 - Basics on Growth MarketingShawn Pang
 
Call Girls Navi Mumbai Just Call 9907093804 Top Class Call Girl Service Avail...
Call Girls Navi Mumbai Just Call 9907093804 Top Class Call Girl Service Avail...Call Girls Navi Mumbai Just Call 9907093804 Top Class Call Girl Service Avail...
Call Girls Navi Mumbai Just Call 9907093804 Top Class Call Girl Service Avail...Dipal Arora
 
GD Birla and his contribution in management
GD Birla and his contribution in managementGD Birla and his contribution in management
GD Birla and his contribution in managementchhavia330
 
VIP Call Girl Jamshedpur Aashi 8250192130 Independent Escort Service Jamshedpur
VIP Call Girl Jamshedpur Aashi 8250192130 Independent Escort Service JamshedpurVIP Call Girl Jamshedpur Aashi 8250192130 Independent Escort Service Jamshedpur
VIP Call Girl Jamshedpur Aashi 8250192130 Independent Escort Service JamshedpurSuhani Kapoor
 
Regression analysis: Simple Linear Regression Multiple Linear Regression
Regression analysis:  Simple Linear Regression Multiple Linear RegressionRegression analysis:  Simple Linear Regression Multiple Linear Regression
Regression analysis: Simple Linear Regression Multiple Linear RegressionRavindra Nath Shukla
 
Progress Report - Oracle Database Analyst Summit
Progress  Report - Oracle Database Analyst SummitProgress  Report - Oracle Database Analyst Summit
Progress Report - Oracle Database Analyst SummitHolger Mueller
 
BEST Call Girls In Old Faridabad ✨ 9773824855 ✨ Escorts Service In Delhi Ncr,
BEST Call Girls In Old Faridabad ✨ 9773824855 ✨ Escorts Service In Delhi Ncr,BEST Call Girls In Old Faridabad ✨ 9773824855 ✨ Escorts Service In Delhi Ncr,
BEST Call Girls In Old Faridabad ✨ 9773824855 ✨ Escorts Service In Delhi Ncr,noida100girls
 
Enhancing and Restoring Safety & Quality Cultures - Dave Litwiller - May 2024...
Enhancing and Restoring Safety & Quality Cultures - Dave Litwiller - May 2024...Enhancing and Restoring Safety & Quality Cultures - Dave Litwiller - May 2024...
Enhancing and Restoring Safety & Quality Cultures - Dave Litwiller - May 2024...Dave Litwiller
 
Call Girls In Sikandarpur Gurgaon ❤️8860477959_Russian 100% Genuine Escorts I...
Call Girls In Sikandarpur Gurgaon ❤️8860477959_Russian 100% Genuine Escorts I...Call Girls In Sikandarpur Gurgaon ❤️8860477959_Russian 100% Genuine Escorts I...
Call Girls In Sikandarpur Gurgaon ❤️8860477959_Russian 100% Genuine Escorts I...lizamodels9
 
0183760ssssssssssssssssssssssssssss00101011 (27).pdf
0183760ssssssssssssssssssssssssssss00101011 (27).pdf0183760ssssssssssssssssssssssssssss00101011 (27).pdf
0183760ssssssssssssssssssssssssssss00101011 (27).pdfRenandantas16
 
RE Capital's Visionary Leadership under Newman Leech
RE Capital's Visionary Leadership under Newman LeechRE Capital's Visionary Leadership under Newman Leech
RE Capital's Visionary Leadership under Newman LeechNewman George Leech
 
Pharma Works Profile of Karan Communications
Pharma Works Profile of Karan CommunicationsPharma Works Profile of Karan Communications
Pharma Works Profile of Karan Communicationskarancommunications
 
Monte Carlo simulation : Simulation using MCSM
Monte Carlo simulation : Simulation using MCSMMonte Carlo simulation : Simulation using MCSM
Monte Carlo simulation : Simulation using MCSMRavindra Nath Shukla
 
M.C Lodges -- Guest House in Jhang.
M.C Lodges --  Guest House in Jhang.M.C Lodges --  Guest House in Jhang.
M.C Lodges -- Guest House in Jhang.Aaiza Hassan
 
Sales & Marketing Alignment: How to Synergize for Success
Sales & Marketing Alignment: How to Synergize for SuccessSales & Marketing Alignment: How to Synergize for Success
Sales & Marketing Alignment: How to Synergize for SuccessAggregage
 
Non Text Magic Studio Magic Design for Presentations L&P.pptx
Non Text Magic Studio Magic Design for Presentations L&P.pptxNon Text Magic Studio Magic Design for Presentations L&P.pptx
Non Text Magic Studio Magic Design for Presentations L&P.pptxAbhayThakur200703
 
A DAY IN THE LIFE OF A SALESMAN / WOMAN
A DAY IN THE LIFE OF A  SALESMAN / WOMANA DAY IN THE LIFE OF A  SALESMAN / WOMAN
A DAY IN THE LIFE OF A SALESMAN / WOMANIlamathiKannappan
 
7.pdf This presentation captures many uses and the significance of the number...
7.pdf This presentation captures many uses and the significance of the number...7.pdf This presentation captures many uses and the significance of the number...
7.pdf This presentation captures many uses and the significance of the number...Paul Menig
 

Recently uploaded (20)

Catalogue ONG NUOC PPR DE NHAT .pdf
Catalogue ONG NUOC PPR DE NHAT      .pdfCatalogue ONG NUOC PPR DE NHAT      .pdf
Catalogue ONG NUOC PPR DE NHAT .pdf
 
Tech Startup Growth Hacking 101 - Basics on Growth Marketing
Tech Startup Growth Hacking 101  - Basics on Growth MarketingTech Startup Growth Hacking 101  - Basics on Growth Marketing
Tech Startup Growth Hacking 101 - Basics on Growth Marketing
 
Call Girls Navi Mumbai Just Call 9907093804 Top Class Call Girl Service Avail...
Call Girls Navi Mumbai Just Call 9907093804 Top Class Call Girl Service Avail...Call Girls Navi Mumbai Just Call 9907093804 Top Class Call Girl Service Avail...
Call Girls Navi Mumbai Just Call 9907093804 Top Class Call Girl Service Avail...
 
GD Birla and his contribution in management
GD Birla and his contribution in managementGD Birla and his contribution in management
GD Birla and his contribution in management
 
VIP Call Girl Jamshedpur Aashi 8250192130 Independent Escort Service Jamshedpur
VIP Call Girl Jamshedpur Aashi 8250192130 Independent Escort Service JamshedpurVIP Call Girl Jamshedpur Aashi 8250192130 Independent Escort Service Jamshedpur
VIP Call Girl Jamshedpur Aashi 8250192130 Independent Escort Service Jamshedpur
 
Regression analysis: Simple Linear Regression Multiple Linear Regression
Regression analysis:  Simple Linear Regression Multiple Linear RegressionRegression analysis:  Simple Linear Regression Multiple Linear Regression
Regression analysis: Simple Linear Regression Multiple Linear Regression
 
Progress Report - Oracle Database Analyst Summit
Progress  Report - Oracle Database Analyst SummitProgress  Report - Oracle Database Analyst Summit
Progress Report - Oracle Database Analyst Summit
 
BEST Call Girls In Old Faridabad ✨ 9773824855 ✨ Escorts Service In Delhi Ncr,
BEST Call Girls In Old Faridabad ✨ 9773824855 ✨ Escorts Service In Delhi Ncr,BEST Call Girls In Old Faridabad ✨ 9773824855 ✨ Escorts Service In Delhi Ncr,
BEST Call Girls In Old Faridabad ✨ 9773824855 ✨ Escorts Service In Delhi Ncr,
 
Best Practices for Implementing an External Recruiting Partnership
Best Practices for Implementing an External Recruiting PartnershipBest Practices for Implementing an External Recruiting Partnership
Best Practices for Implementing an External Recruiting Partnership
 
Enhancing and Restoring Safety & Quality Cultures - Dave Litwiller - May 2024...
Enhancing and Restoring Safety & Quality Cultures - Dave Litwiller - May 2024...Enhancing and Restoring Safety & Quality Cultures - Dave Litwiller - May 2024...
Enhancing and Restoring Safety & Quality Cultures - Dave Litwiller - May 2024...
 
Call Girls In Sikandarpur Gurgaon ❤️8860477959_Russian 100% Genuine Escorts I...
Call Girls In Sikandarpur Gurgaon ❤️8860477959_Russian 100% Genuine Escorts I...Call Girls In Sikandarpur Gurgaon ❤️8860477959_Russian 100% Genuine Escorts I...
Call Girls In Sikandarpur Gurgaon ❤️8860477959_Russian 100% Genuine Escorts I...
 
0183760ssssssssssssssssssssssssssss00101011 (27).pdf
0183760ssssssssssssssssssssssssssss00101011 (27).pdf0183760ssssssssssssssssssssssssssss00101011 (27).pdf
0183760ssssssssssssssssssssssssssss00101011 (27).pdf
 
RE Capital's Visionary Leadership under Newman Leech
RE Capital's Visionary Leadership under Newman LeechRE Capital's Visionary Leadership under Newman Leech
RE Capital's Visionary Leadership under Newman Leech
 
Pharma Works Profile of Karan Communications
Pharma Works Profile of Karan CommunicationsPharma Works Profile of Karan Communications
Pharma Works Profile of Karan Communications
 
Monte Carlo simulation : Simulation using MCSM
Monte Carlo simulation : Simulation using MCSMMonte Carlo simulation : Simulation using MCSM
Monte Carlo simulation : Simulation using MCSM
 
M.C Lodges -- Guest House in Jhang.
M.C Lodges --  Guest House in Jhang.M.C Lodges --  Guest House in Jhang.
M.C Lodges -- Guest House in Jhang.
 
Sales & Marketing Alignment: How to Synergize for Success
Sales & Marketing Alignment: How to Synergize for SuccessSales & Marketing Alignment: How to Synergize for Success
Sales & Marketing Alignment: How to Synergize for Success
 
Non Text Magic Studio Magic Design for Presentations L&P.pptx
Non Text Magic Studio Magic Design for Presentations L&P.pptxNon Text Magic Studio Magic Design for Presentations L&P.pptx
Non Text Magic Studio Magic Design for Presentations L&P.pptx
 
A DAY IN THE LIFE OF A SALESMAN / WOMAN
A DAY IN THE LIFE OF A  SALESMAN / WOMANA DAY IN THE LIFE OF A  SALESMAN / WOMAN
A DAY IN THE LIFE OF A SALESMAN / WOMAN
 
7.pdf This presentation captures many uses and the significance of the number...
7.pdf This presentation captures many uses and the significance of the number...7.pdf This presentation captures many uses and the significance of the number...
7.pdf This presentation captures many uses and the significance of the number...
 

Language Weaver

  • 1. Language Weaver Components Web Service or C++ API Specific Processor: 2 GHz, Pentium 4 Translation Language Weaver Decoder RAM: 2 GB Server Engine Hard Disk Space: 100 MB Operating System: Microsoft Windows 2000 Domain Language Customizer Dictionary Model Advanced Server SP3 or Microsoft Windows 2003 Language Weaver should be installed on a dedicated machine
  • 2. Web Service or C++ API Specific Translation Engine Specific Language Weaver Decoder Translation Engine Improves translation quality using specific data Domain Language Customizer Dictionary Model The specific translation engine is built by processing pre-translated documents. LW software “learns” from what it sees in pre-translated documents, so the first step in making the translation output more accurate and appropriate for your organization is to give the software your organization’s translated documents. This information can come from an existing translation system, knowledge bases, human resources documents, technical manuals, etc. Language Weaver will need digital versions of both the original document and its translation; the more data we have, the better the translation output Then the processed documents are “aligned” with the translated versions using a Language Weaver alignment tool Once the documents are gathered, each document and its respective translation needs to be aligned at the sentence or segment level. To help automate this process, users can use Language Weaver’s alignment tool or a third party alignment application. This alignment process prepares the parallel corpus shown in the graphic on the previous page. If needed the process can be repeated to improve accuracy and relevance. Specific translation engines may already exist within the Agency.
  • 3. Web Service or C++ API
       2 interfaces exist for sending and receiving translations.
       1) Web Service – suggested as a loosely coupled interface
       • Language Weaver supplies a WSDL that interfacing systems use as the contract for how to call Language Weaver.
       • Technology agnostic – Web Service communication allows any technology able to adhere to the WSDL methods and Simple Object Access Protocol (SOAP) to leverage Language Weaver.
       2) C++ API – local or remote, Windows-only interface
       • Two licenses (local or remote) depending on need
       • Local license is used for applications or processes running on the same machine as Language Weaver (LW). Remote license communicates with LW via port-to-port communications.
       • Recommended for systems or products embedding LW.
       Input Formats
       • Both interfaces allow translation of any size:
         • Documents
         • Snippets of text
         • A single document
       • Supported input formats:
         • Plain text, HTML, TMX, XLIFF, PDF, ODF (OpenOffice/MS Office documents)
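To make the Web Service option in slide 3 concrete, here is a minimal sketch of building a SOAP 1.1 request envelope. The slides do not reproduce the WSDL, so the service namespace `urn:example:translation`, the operation name `Translate`, and the element names `sourceLanguage`, `targetLanguage`, and `text` are all placeholders; the real names would come from the WSDL that Language Weaver supplies.

```python
import xml.etree.ElementTree as ET

SOAP_NS = "http://schemas.xmlsoap.org/soap/envelope/"  # SOAP 1.1 envelope namespace
SERVICE_NS = "urn:example:translation"  # hypothetical; the real value comes from the WSDL

def build_translate_request(text, source_lang, target_lang):
    """Build a SOAP envelope for a hypothetical 'Translate' operation."""
    envelope = ET.Element(f"{{{SOAP_NS}}}Envelope")
    body = ET.SubElement(envelope, f"{{{SOAP_NS}}}Body")
    operation = ET.SubElement(body, f"{{{SERVICE_NS}}}Translate")  # hypothetical name
    ET.SubElement(operation, f"{{{SERVICE_NS}}}sourceLanguage").text = source_lang
    ET.SubElement(operation, f"{{{SERVICE_NS}}}targetLanguage").text = target_lang
    ET.SubElement(operation, f"{{{SERVICE_NS}}}text").text = text
    # ElementTree escapes special characters such as '<' in the text for us.
    return ET.tostring(envelope, encoding="unicode")

request_xml = build_translate_request("Hello <world>", "eng", "spa")
```

Because the contract is plain XML over SOAP, any platform that can produce an envelope like this can act as a client, which is the "technology agnostic" point made above.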
  • 4. Other Components
       Improve translation fidelity using specific lists and data.
       Decoder – main processing unit within Language Weaver
       • The commander of the translation process
       • Leverages other components (Language Model, Dictionary, etc.) and uses statistical analysis to produce translations
       Domain Dictionary
       • A set of domain-specific words (nouns, labels, adjectives)
       • Bilingual entries that are matched at run-time against the incoming text.
       • Users can create multiple specialized dictionaries to meet ongoing needs.
       Customizer
       • A set of domain-specific words (nouns, labels, adjectives)
       • Allows users to customize Language Weaver’s baseline translation software (on a small scale) to a specific subject domain.
       • Users can create multiple specialized dictionaries to meet ongoing needs.
       • Customization process is done in-house so that the system can be continuously updated and sensitive data stays secure
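The run-time matching of bilingual Domain Dictionary entries against incoming text could look roughly like the sketch below. The slides do not describe how Language Weaver performs the match, so the longest-term-first, case-insensitive strategy and the function name `find_dictionary_matches` are assumptions made for illustration.

```python
import re

def find_dictionary_matches(text, dictionary):
    """Return (matched_text, target_term) pairs for dictionary terms found in text.

    Longer terms are tried first, so "blood pressure" wins over "pressure"
    when both entries could match at the same place.
    """
    terms = sorted(dictionary, key=len, reverse=True)
    if not terms:
        return []
    pattern = re.compile(
        r"\b(?:" + "|".join(re.escape(t) for t in terms) + r")\b",
        re.IGNORECASE,
    )
    lookup = {term.lower(): target for term, target in dictionary.items()}
    return [(m.group(0), lookup[m.group(0).lower()]) for m in pattern.finditer(text)]

# Two specialized dictionaries, as the slide suggests users can create.
medical = {"blood pressure": "presión arterial", "pressure": "presión"}
legal = {"court order": "orden judicial"}

hits = find_dictionary_matches("Check the patient's blood pressure.", medical)
```

Keeping each domain in its own dictionary lets the caller choose which one applies to a given document instead of merging all terms into one list.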
  • 5. Suggested Architectures: A Translation Sub-System
       Diagram: Main System → Content to translate → Translation sub-system (Language Weaver) → Translation Document Object
       Translation sub-system steps:
       • Pre-process document
         • Determine document type
         • Identify language pair
         • Perform any cleansing
         • Assign priority
       • Version check
         • Check translation version to ensure the most recent translation engine is being used
         • Or ensure that a specific translation engine is being applied.
       • Apply metadata
         • Submitted by
         • Topics
         • Dictionary updates
         • Status (needs human review, success….)
       • Translate
       Sub-system provides:
       • Creates a central point for translation logic and application of specific meta-data
       • Translated versions stored and displayed when requested
       • Populates central corpus
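The pre-process and version-check steps of the translation sub-system can be sketched as a small document object plus two functions. Everything here is hypothetical: the field names, the file-extension rule for document type, the length-based priority rule, and the version strings are illustrative choices, and the language pair is supplied by the caller rather than detected.

```python
from dataclasses import dataclass, field

@dataclass
class TranslationDocument:
    """Hypothetical translation document object carrying the slide's metadata."""
    content: str
    submitted_by: str
    doc_type: str = "unknown"
    language_pair: tuple = ("", "")
    priority: int = 5
    engine_version: str = ""
    status: str = "pending"  # e.g. "needs human review", "success"
    topics: list = field(default_factory=list)

def preprocess(doc, filename, source_lang, target_lang, engine_version):
    """Fill in type, language pair, cleansed content, priority, and engine version."""
    doc.doc_type = filename.rsplit(".", 1)[-1].lower() if "." in filename else "unknown"
    doc.language_pair = (source_lang, target_lang)
    doc.content = " ".join(doc.content.split())  # cleansing: collapse whitespace
    doc.priority = 1 if len(doc.content) < 500 else 5  # assumed rule: short texts first
    doc.engine_version = engine_version
    return doc

def needs_retranslation(doc, current_engine_version):
    """Version check: retranslate if not yet successful or the engine has changed."""
    return doc.status != "success" or doc.engine_version != current_engine_version

doc = preprocess(
    TranslationDocument("  Hello   world \n", submitted_by="alice"),
    "note.TXT", "eng", "spa", engine_version="v2",
)
```

Recording the engine version on the document is what makes the later version check possible: a stored translation can be compared against whichever engine is current, or against a specific engine the caller requires.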
  • 6. Suggested Architectures: Automated Batch Process
       Diagram: Content to translate → Queue → Translation → Post Translation Processing Sub-system → Entity Extraction → Store
       Queue:
       • Determine if document is a duplicate
       • Identify language pair
       • Perform any cleansing
       • Assign priority
       Post Translation Processing Sub-system:
       • Alert specific user
       • Index translation
       • Add to corpus
       Entity Extraction:
       • Keywords, hot words
       • People, places, things
       • Dates
       • Topics
       Allows for a separate translation factory to run independently of the main system
       • Creates a central point for translation logic
       • Translated versions stored and displayed when requested
       • Populates central corpus and facilitates post-processing of translated content
       Negatives
       • Development of a sub-system requires additional resources and time
       • Not needed IF:
         • Translation throughput is slow (0-10 documents/hour)
         • Latency in user seeing translation is acceptable
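The queue stage of the batch architecture (duplicate detection plus priority ordering) might be sketched as follows. The class name `TranslationQueue`, the use of a SHA-256 content hash to detect duplicates, and the "lower number means higher priority" convention are assumptions for illustration, not part of the Language Weaver product.

```python
import hashlib
import heapq
import itertools

class TranslationQueue:
    """Hypothetical in-memory queue: rejects duplicates, serves by priority."""

    def __init__(self):
        self._heap = []
        self._seen = set()
        self._counter = itertools.count()  # tie-breaker keeps FIFO order per priority

    def submit(self, content, language_pair, priority):
        """Queue a document; return False if the same content and pair was already seen."""
        digest = hashlib.sha256(content.encode("utf-8")).hexdigest()
        key = (digest, language_pair)
        if key in self._seen:
            return False
        self._seen.add(key)
        heapq.heappush(self._heap, (priority, next(self._counter), language_pair, content))
        return True

    def next_job(self):
        """Pop the highest-priority (lowest number) job, or None if the queue is empty."""
        if not self._heap:
            return None
        _priority, _order, language_pair, content = heapq.heappop(self._heap)
        return language_pair, content

queue = TranslationQueue()
queue.submit("Routine memo", ("eng", "spa"), priority=5)
queue.submit("Urgent alert", ("eng", "spa"), priority=1)
duplicate_accepted = queue.submit("Routine memo", ("eng", "spa"), priority=5)
first_job = queue.next_job()
```

A worker process running separately from the main system would call `next_job` in a loop, which is what lets the "translation factory" run independently of the main system.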
  • 7. Suggested Architectures: Real-Time Batch Process
       Diagram: Content to translate → Language Weaver Translation Web Service → Store Translation
       Simplest implementation
       • Creates a central point for translation logic
       • Translated versions stored and displayed when requested
       • Populates central corpus
       Negatives
       • The user may wait in the neighborhood of 10 sec. before seeing the translated document.
       • May have scaling issues
       This is suggested as the iteration 1 goal since it can be expanded to any other architecture.
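The real-time flow (translate on request, then store the result so it can be displayed again later) can be sketched in a few lines. The `translate` argument stands in for the call to the Language Weaver Translation Web Service, whose actual operations are defined by a WSDL not shown in these slides; `fake_translate` and the dictionary-based store are placeholders.

```python
def translate_and_store(content, language_pair, translate, store):
    """Return the stored translation if present; otherwise translate and store it."""
    key = (language_pair, content)
    if key not in store:
        store[key] = translate(content, language_pair)  # the slow step (~10 sec. per the slide)
    return store[key]

# Placeholder for the real web service call; it records how often it is invoked.
calls = []

def fake_translate(content, language_pair):
    calls.append(content)
    return f"[{language_pair[0]}->{language_pair[1]}] {content}"

store = {}
first = translate_and_store("Hello", ("eng", "spa"), fake_translate, store)
second = translate_and_store("Hello", ("eng", "spa"), fake_translate, store)
```

Only the first request pays the translation latency. Because the translation call is passed in as a function, the same entry point could later hand work to a queue or a sub-system, in line with the slide's point that this architecture can be expanded.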