presentation on history of MT and how language resources have helped to develop MT (particularly statistical MT) with an emphasis in Pangeanic's experience
This presentation is a part of the MosesCore project that encourages the development and usage of open source machine translation tools, notably the Moses statistical MT toolkit.
MosesCore is supported by the European Commission Grant Number 288487 under the 7th Framework Programme.
For the latest updates, follow us on Twitter - #MosesCore
machine translation manuel herranz PangeaMT TAUS BarcelonaManuel Herranz
how machine translation is about empowering users and how users can be empowered using DIY SMT technology to build their own statistical machine translation solutions
Moses has been the core of most of the developments on the machine translation market nowadays. Despite a thriving community, its open source core means that users need to find ways to adapt it and improve its features. Pangea v3, part of the EU's EXPERT project provides hybrid features based on search engine type recall to improve translation output.
pangeanic hybrid syntax-based approach to machine translation for Japanese, brief history of machine translation, productivity gains with machine translation
Gestión proyectos traducción en la Universitat Autònoma de BarcelonaManuel Herranz
Descripción del funcionamiento de una empresa de traducción, departamentos y procesos, tomando a www.pangeanic.es como ejemplo. Descripción de funciones, normas y flujo de trabajo con un énfasis en los procesos de traducción automática.
Our statistical machine translation platform and hybrid features were presented at the European Commission offices in Luxembourg last Tuesday 22nd September. It is one of the tools that the European Union will consider, among other machine translation commercial solutions, as a tool to help its mandate for CEF (Connecting Europe Facility). Pangeanic’s CEO, Manuel Herranz, presented the current state-of-the-art that PangeaMT version 3 represents. Representatives from the EU were particularly interested in the solid data management features, machine translation engine retraining routines, data cleaning and automated engine training and creation features. One of key features with the new PangeaMT version is the possibility to change translation algorithms and use rule-based systems like Apertium and Thot as well as the default Moses. It is also compatible with 3rd-party calls from other systems. Its powerful API can also provide machine translated output to requests anywhere in the world, although the platform is designed for onsite use at translation companies and organizations. PangeaMT is also compatible with several popular translation formats like ttx, sdlxliff, memoq, memsource, and most xml-based Tikal formats.
Pangeanic presentation at Elia Together Athens - Manuel HerranzManuel Herranz
Our presentation at #Eliatogether in Athens was favored by many attendees. Will disintermediation be a force to reckon with in the translation industry as it has happened in the hotel and travel industries? What is the role of machine translation in all this? How does neural machine translation work?
This presentation is a part of the MosesCore project that encourages the development and usage of open source machine translation tools, notably the Moses statistical MT toolkit.
MosesCore is supported by the European Commission Grant Number 288487 under the 7th Framework Programme.
For the latest updates, follow us on Twitter - #MosesCore
machine translation manuel herranz PangeaMT TAUS BarcelonaManuel Herranz
how machine translation is about empowering users and how users can be empowered using DIY SMT technology to build their own statistical machine translation solutions
Moses has been the core of most of the developments on the machine translation market nowadays. Despite a thriving community, its open source core means that users need to find ways to adapt it and improve its features. Pangea v3, part of the EU's EXPERT project provides hybrid features based on search engine type recall to improve translation output.
pangeanic hybrid syntax-based approach to machine translation for Japanese, brief history of machine translation, productivity gains with machine translation
Gestión proyectos traducción en la Universitat Autònoma de BarcelonaManuel Herranz
Descripción del funcionamiento de una empresa de traducción, departamentos y procesos, tomando a www.pangeanic.es como ejemplo. Descripción de funciones, normas y flujo de trabajo con un énfasis en los procesos de traducción automática.
Our statistical machine translation platform and hybrid features were presented at the European Commission offices in Luxembourg last Tuesday 22nd September. It is one of the tools that the European Union will consider, among other machine translation commercial solutions, as a tool to help its mandate for CEF (Connecting Europe Facility). Pangeanic’s CEO, Manuel Herranz, presented the current state-of-the-art that PangeaMT version 3 represents. Representatives from the EU were particularly interested in the solid data management features, machine translation engine retraining routines, data cleaning and automated engine training and creation features. One of key features with the new PangeaMT version is the possibility to change translation algorithms and use rule-based systems like Apertium and Thot as well as the default Moses. It is also compatible with 3rd-party calls from other systems. Its powerful API can also provide machine translated output to requests anywhere in the world, although the platform is designed for onsite use at translation companies and organizations. PangeaMT is also compatible with several popular translation formats like ttx, sdlxliff, memoq, memsource, and most xml-based Tikal formats.
Pangeanic presentation at Elia Together Athens - Manuel HerranzManuel Herranz
Our presentation at #Eliatogether in Athens was favored by many attendees. Will disintermediation be a force to reckon with in the translation industry as it has happened in the hotel and travel industries? What is the role of machine translation in all this? How does neural machine translation work?
Pangea Machine Translation platform from Pangeanic. A product presentation by Manuel Herranz, Elia Yuste, Andi Frank showcasing the best of automated cleaning cycles, automated engine retraining, machine translation engine creation.
Big Data to SMART Data : Process scenario
Scenario of an implementation of a transformation process of the Data towards exploitable data and representative with treatments of the streaming, the distributed systems, the messages, the storage in an NoSQL environment, a management with an ecosystem Big Data graphic visualization of the data with the technologies:
Apache Storm, Apache Zookeeper, Apache Kafka, Apache Cassandra, Apache Spark and Data-Driven Document.
Wide-spread adoption of MT software by end-users and language service providers is still hindered by the need to train custom MT systems, the lack of training data for some language pairs and domains as well as substantive hardware needs. The EU-funded MMT project addresses these barriers by providing cloud-ready software with a simple installation procedure, very fast setup times for MT systems, instant domain adaptation and integration of new data and high scalability. To address the data gap between the large web companies and the MT industry, MMT is also curating and collecting data from translation stakeholders and the web to improve MT quality for everybody. In this session we present the latest developments in this ongoing project and demo the service.
The Internet-Of-Things (IoT) is no longer a hype, but a reality. Connecting ANY devices, ANY place, ANY thing will transform the way we live. However from an engineers point of view how can he gain benefit from this? Here are some of the key technology trends that will play an important role.
4 European machine translation companies joined forces to build something bigger than themselves: an intelligent platform capable of detecting domain, detecting languages, balancing load with a view to create a marketplace. The project was financed by the European Commission. This is the presentation by Pangeanic in Gala Boston 2018.
A very categorized presentation about big data analytics Various topics like Introduction to Big Data,Hadoop,HDFS Map Reduce, Mahout,K-means Algorithm,H-Base are explained very clearly in simple language for everyone to understand easily.
Hadoop World 2011: Hadoop’s Life in Enterprise Systems - Y Masatani, NTTDataCloudera, Inc.
NTT DATA has been providing Hadoop professional services for enterprise customers for years. In this talk we will categorize Hadoop integration cases based on our experience and illustrate archetypal design practices how Hadoop clusters are deployed into existing infrastructure and services. We will also present enhancement cases motivated by customer’s demand including GPU for big math, HDFS capable storage system, etc.
Shared by Mansoor Mirza
Distributed Computing
What is it?
Why & when we need it?
Comparison with centralized computing
‘MapReduce’ (MR) Framework
Theory and practice
‘MapReduce’ in Action
Using Hadoop
Lab exercises
rNews: Embedding Metadata in On-line News
From the talk at SemTech
Wednesday, June 8, 2011
09:45 AM - 10:35 AM
Level: Business / Non-Technical
Case Study
Location: Yosemite A
The IPTC, a consortium of the world's major news agencies, news publishers and news industry vendors, recently released rNews, a semantic standard for on-line news. rNews uses RDFa to annotate HTML documents with news-specific metadata, to help with search, ad placement, aggregation and the sharing of on-line news. Jayson Lorenzen, a software engineer with Business Wire and one of the IPTC Member organization delegates working on rNews, will give an overview of the IPTC, the rNews standard, why rNews is needed and how the standard was eventually created. The talk will include use cases and live demonstrations of rNews and will end with a call to action for you to participate; rNews is currently at version 0.5 and the IPTC is looking for feedback on how to improve the standard.
(Recent) technology trends and bridges to gap in the localization industryLoctimize GmbH
Slides of the keynote presentation held by Daniel Zielinski during the frist Egyptian conference on Translation, Localization and Interpreting in Cairo on April 16, 2019
Pangeanic Cor-ActivaTM-Neural machine translation Taus Tokyo 2017Manuel Herranz
Presentation of Pangeanic language technologies as a result of EU and national R&D: Cor for web crawling and website translation, linked to Elastic Search-based ActivaTM and NeuralMT
Gestión proyectos traducción - Universitat Autònoma de BarcelonaManuel Herranz
Presentación sobre el modelo de gestión de proyectos en una empresa de traducción, sirviendo www.pangeanic.es como ejemplo. Descripción de departamentos y procesos.
More Related Content
Similar to Panacea presentation - Pangeanic - Budapest
Pangea Machine Translation platform from Pangeanic. A product presentation by Manuel Herranz, Elia Yuste, Andi Frank showcasing the best of automated cleaning cycles, automated engine retraining, machine translation engine creation.
Big Data to SMART Data : Process scenario
Scenario of an implementation of a transformation process of the Data towards exploitable data and representative with treatments of the streaming, the distributed systems, the messages, the storage in an NoSQL environment, a management with an ecosystem Big Data graphic visualization of the data with the technologies:
Apache Storm, Apache Zookeeper, Apache Kafka, Apache Cassandra, Apache Spark and Data-Driven Document.
Wide-spread adoption of MT software by end-users and language service providers is still hindered by the need to train custom MT systems, the lack of training data for some language pairs and domains as well as substantive hardware needs. The EU-funded MMT project addresses these barriers by providing cloud-ready software with a simple installation procedure, very fast setup times for MT systems, instant domain adaptation and integration of new data and high scalability. To address the data gap between the large web companies and the MT industry, MMT is also curating and collecting data from translation stakeholders and the web to improve MT quality for everybody. In this session we present the latest developments in this ongoing project and demo the service.
The Internet-Of-Things (IoT) is no longer a hype, but a reality. Connecting ANY devices, ANY place, ANY thing will transform the way we live. However from an engineers point of view how can he gain benefit from this? Here are some of the key technology trends that will play an important role.
4 European machine translation companies joined forces to build something bigger than themselves: an intelligent platform capable of detecting domain, detecting languages, balancing load with a view to create a marketplace. The project was financed by the European Commission. This is the presentation by Pangeanic in Gala Boston 2018.
A very categorized presentation about big data analytics Various topics like Introduction to Big Data,Hadoop,HDFS Map Reduce, Mahout,K-means Algorithm,H-Base are explained very clearly in simple language for everyone to understand easily.
Hadoop World 2011: Hadoop’s Life in Enterprise Systems - Y Masatani, NTTDataCloudera, Inc.
NTT DATA has been providing Hadoop professional services for enterprise customers for years. In this talk we will categorize Hadoop integration cases based on our experience and illustrate archetypal design practices how Hadoop clusters are deployed into existing infrastructure and services. We will also present enhancement cases motivated by customer’s demand including GPU for big math, HDFS capable storage system, etc.
Shared by Mansoor Mirza
Distributed Computing
What is it?
Why & when we need it?
Comparison with centralized computing
‘MapReduce’ (MR) Framework
Theory and practice
‘MapReduce’ in Action
Using Hadoop
Lab exercises
rNews: Embedding Metadata in On-line News
From the talk at SemTech
Wednesday, June 8, 2011
09:45 AM - 10:35 AM
Level: Business / Non-Technical
Case Study
Location: Yosemite A
The IPTC, a consortium of the world's major news agencies, news publishers and news industry vendors, recently released rNews, a semantic standard for on-line news. rNews uses RDFa to annotate HTML documents with news-specific metadata, to help with search, ad placement, aggregation and the sharing of on-line news. Jayson Lorenzen, a software engineer with Business Wire and one of the IPTC Member organization delegates working on rNews, will give an overview of the IPTC, the rNews standard, why rNews is needed and how the standard was eventually created. The talk will include use cases and live demonstrations of rNews and will end with a call to action for you to participate; rNews is currently at version 0.5 and the IPTC is looking for feedback on how to improve the standard.
(Recent) technology trends and bridges to gap in the localization industryLoctimize GmbH
Slides of the keynote presentation held by Daniel Zielinski during the frist Egyptian conference on Translation, Localization and Interpreting in Cairo on April 16, 2019
Similar to Panacea presentation - Pangeanic - Budapest (20)
Pangeanic Cor-ActivaTM-Neural machine translation Taus Tokyo 2017Manuel Herranz
Presentation of Pangeanic language technologies as a result of EU and national R&D: Cor for web crawling and website translation, linked to Elastic Search-based ActivaTM and NeuralMT
Gestión proyectos traducción - Universitat Autònoma de BarcelonaManuel Herranz
Presentación sobre el modelo de gestión de proyectos en una empresa de traducción, sirviendo www.pangeanic.es como ejemplo. Descripción de departamentos y procesos.
Manuel Herranz presents at TMS Inspiration Days, on Pangeanic's use case, the application of MT to LSPs, the Pangeanic development case. Unveiling feature-rich PangeaMT Saas Power, Pangeanic's v3.
Pangeanic presentation at Japan Translation Federation, detailing history of MT, productivity gains with MT at LSPs, data from Autodesk and CSA, description of PangeaMT system
kerstin bier, localization world barcelona, manuel herranz, mt, pangeanic, sy...Manuel Herranz
Co-presentation by Kerstin Bier and Manuel Herranz in Localization World Barcelona 2011 on the achievement and progress made by a customized PangeaMT engine at Sybase. Initial machine translation implementation, machine translation customization for Sybase, use of client's data for training and productivity results.
UiPath Test Automation using UiPath Test Suite series, part 3DianaGray10
Welcome to UiPath Test Automation using UiPath Test Suite series part 3. In this session, we will cover desktop automation along with UI automation.
Topics covered:
UI automation Introduction,
UI automation Sample
Desktop automation flow
Pradeep Chinnala, Senior Consultant Automation Developer @WonderBotz and UiPath MVP
Deepak Rai, Automation Practice Lead, Boundaryless Group and UiPath MVP
GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...James Anderson
Effective Application Security in Software Delivery lifecycle using Deployment Firewall and DBOM
The modern software delivery process (or the CI/CD process) includes many tools, distributed teams, open-source code, and cloud platforms. Constant focus on speed to release software to market, along with the traditional slow and manual security checks has caused gaps in continuous security as an important piece in the software supply chain. Today organizations feel more susceptible to external and internal cyber threats due to the vast attack surface in their applications supply chain and the lack of end-to-end governance and risk management.
The software team must secure its software delivery process to avoid vulnerability and security breaches. This needs to be achieved with existing tool chains and without extensive rework of the delivery processes. This talk will present strategies and techniques for providing visibility into the true risk of the existing vulnerabilities, preventing the introduction of security issues in the software, resolving vulnerabilities in production environments quickly, and capturing the deployment bill of materials (DBOM).
Speakers:
Bob Boule
Robert Boule is a technology enthusiast with PASSION for technology and making things work along with a knack for helping others understand how things work. He comes with around 20 years of solution engineering experience in application security, software continuous delivery, and SaaS platforms. He is known for his dynamic presentations in CI/CD and application security integrated in software delivery lifecycle.
Gopinath Rebala
Gopinath Rebala is the CTO of OpsMx, where he has overall responsibility for the machine learning and data processing architectures for Secure Software Delivery. Gopi also has a strong connection with our customers, leading design and architecture for strategic implementations. Gopi is a frequent speaker and well-known leader in continuous delivery and integrating security into software delivery.
Connector Corner: Automate dynamic content and events by pushing a buttonDianaGray10
Here is something new! In our next Connector Corner webinar, we will demonstrate how you can use a single workflow to:
Create a campaign using Mailchimp with merge tags/fields
Send an interactive Slack channel message (using buttons)
Have the message received by managers and peers along with a test email for review
But there’s more:
In a second workflow supporting the same use case, you’ll see:
Your campaign sent to target colleagues for approval
If the “Approve” button is clicked, a Jira/Zendesk ticket is created for the marketing design team
But—if the “Reject” button is pushed, colleagues will be alerted via Slack message
Join us to learn more about this new, human-in-the-loop capability, brought to you by Integration Service connectors.
And...
Speakers:
Akshay Agnihotri, Product Manager
Charlie Greenberg, Host
Transcript: Selling digital books in 2024: Insights from industry leaders - T...BookNet Canada
The publishing industry has been selling digital audiobooks and ebooks for over a decade and has found its groove. What’s changed? What has stayed the same? Where do we go from here? Join a group of leading sales peers from across the industry for a conversation about the lessons learned since the popularization of digital books, best practices, digital book supply chain management, and more.
Link to video recording: https://bnctechforum.ca/sessions/selling-digital-books-in-2024-insights-from-industry-leaders/
Presented by BookNet Canada on May 28, 2024, with support from the Department of Canadian Heritage.
Key Trends Shaping the Future of Infrastructure.pdfCheryl Hung
Keynote at DIGIT West Expo, Glasgow on 29 May 2024.
Cheryl Hung, ochery.com
Sr Director, Infrastructure Ecosystem, Arm.
The key trends across hardware, cloud and open-source; exploring how these areas are likely to mature and develop over the short and long-term, and then considering how organisations can position themselves to adapt and thrive.
Search and Society: Reimagining Information Access for Radical FuturesBhaskar Mitra
The field of Information retrieval (IR) is currently undergoing a transformative shift, at least partly due to the emerging applications of generative AI to information access. In this talk, we will deliberate on the sociotechnical implications of generative AI for information access. We will argue that there is both a critical necessity and an exciting opportunity for the IR community to re-center our research agendas on societal needs while dismantling the artificial separation between the work on fairness, accountability, transparency, and ethics in IR and the rest of IR research. Instead of adopting a reactionary strategy of trying to mitigate potential social harms from emerging technologies, the community should aim to proactively set the research agenda for the kinds of systems we should build inspired by diverse explicitly stated sociotechnical imaginaries. The sociotechnical imaginaries that underpin the design and development of information access technologies needs to be explicitly articulated, and we need to develop theories of change in context of these diverse perspectives. Our guiding future imaginaries must be informed by other academic fields, such as democratic theory and critical theory, and should be co-developed with social science scholars, legal scholars, civil rights and social justice activists, and artists, among others.
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...DanBrown980551
Do you want to learn how to model and simulate an electrical network from scratch in under an hour?
Then welcome to this PowSyBl workshop, hosted by Rte, the French Transmission System Operator (TSO)!
During the webinar, you will discover the PowSyBl ecosystem as well as handle and study an electrical network through an interactive Python notebook.
PowSyBl is an open source project hosted by LF Energy, which offers a comprehensive set of features for electrical grid modelling and simulation. Among other advanced features, PowSyBl provides:
- A fully editable and extendable library for grid component modelling;
- Visualization tools to display your network;
- Grid simulation tools, such as power flows, security analyses (with or without remedial actions) and sensitivity analyses;
The framework is mostly written in Java, with a Python binding so that Python developers can access PowSyBl functionalities as well.
What you will learn during the webinar:
- For beginners: discover PowSyBl's functionalities through a quick general presentation and the notebook, without needing any expert coding skills;
- For advanced developers: master the skills to efficiently apply PowSyBl functionalities to your real-world scenarios.
Accelerate your Kubernetes clusters with Varnish CachingThijs Feryn
A presentation about the usage and availability of Varnish on Kubernetes. This talk explores the capabilities of Varnish caching and shows how to use the Varnish Helm chart to deploy it to Kubernetes.
This presentation was delivered at K8SUG Singapore. See https://feryn.eu/presentations/accelerate-your-kubernetes-clusters-with-varnish-caching-k8sug-singapore-28-2024 for more details.
Kubernetes & AI - Beauty and the Beast !?! @KCD Istanbul 2024Tobias Schneck
As AI technology is pushing into IT I was wondering myself, as an “infrastructure container kubernetes guy”, how get this fancy AI technology get managed from an infrastructure operational view? Is it possible to apply our lovely cloud native principals as well? What benefit’s both technologies could bring to each other?
Let me take this questions and provide you a short journey through existing deployment models and use cases for AI software. On practical examples, we discuss what cloud/on-premise strategy we may need for applying it to our own infrastructure to get it to work from an enterprise perspective. I want to give an overview about infrastructure requirements and technologies, what could be beneficial or limiting your AI use cases in an enterprise environment. An interactive Demo will give you some insides, what approaches I got already working for real.
GraphRAG is All You need? LLM & Knowledge GraphGuy Korland
Guy Korland, CEO and Co-founder of FalkorDB, will review two articles on the integration of language models with knowledge graphs.
1. Unifying Large Language Models and Knowledge Graphs: A Roadmap.
https://arxiv.org/abs/2306.08302
2. Microsoft Research's GraphRAG paper and a review paper on various uses of knowledge graphs:
https://www.microsoft.com/en-us/research/blog/graphrag-unlocking-llm-discovery-on-narrative-private-data/
Software Delivery At the Speed of AI: Inflectra Invests In AI-Powered QualityInflectra
In this insightful webinar, Inflectra explores how artificial intelligence (AI) is transforming software development and testing. Discover how AI-powered tools are revolutionizing every stage of the software development lifecycle (SDLC), from design and prototyping to testing, deployment, and monitoring.
Learn about:
• The Future of Testing: How AI is shifting testing towards verification, analysis, and higher-level skills, while reducing repetitive tasks.
• Test Automation: How AI-powered test case generation, optimization, and self-healing tests are making testing more efficient and effective.
• Visual Testing: Explore the emerging capabilities of AI in visual testing and how it's set to revolutionize UI verification.
• Inflectra's AI Solutions: See demonstrations of Inflectra's cutting-edge AI tools like the ChatGPT plugin and Azure Open AI platform, designed to streamline your testing process.
Whether you're a developer, tester, or QA professional, this webinar will give you valuable insights into how AI is shaping the future of software delivery.
16. Compatibility with commercial formats (ttx, sdlxliff, itd)As of May 2009: 487 Billion gigabytes or 1,000,000,000 * 487,000,000,000 = 4,87 x 1020 Estimates Up 50% a year (Oracle) Doubles every 11 hours (IBM)
17.
18. To provide High Q MT for Post-Editing and save time and cost. No Google-type broad TR but domain-specific, user-centric.
19. Lower entry level for MT. Bring democracy and affordability to MT. Bring it to the user, take away from programmer.
20. How? By fostering open-standard geared translation automation strategies
28. 8 Therushfor data (clean) <tusrclang="en-GB"> <tuvxml:lang="EN-GB"> <seg>A system for recovering the methane that is emitted from the manure so that it does not leak into the atmosphere.</seg> </tuv> <tuvxml:lang="FR-FR"> <seg>Systèmepermettant de r€ pérer le méthane qui se dégage de l'engrais naturel d'origineanimale de sortequ'il ne se dissipe pas dansl'atmosphère.</seg> </tuv> <tucreationdate="20090817T114430Z" creationid="APIACCESS" changedate="20110617T141159Z" changeid=“pat"> <tuvxml:lang="EN-US"> <seg>Overall heigtht –<bpt i="1">{43 </bpt> <ept i="1">}</ept>25"; width –<bpt i="2">{43 </bpt> <ept i="2">}</ept>20.1".</seg> </tuv> <tuvxml:lang="ES-EM"> <seg><bpt i="1">{2 </bpt>Altura total - 25"; anchura <ept i="1">}</ept>–<bpt i="2">{43 </bpt> <ept i="2">}</ept><bpt i="3">{2 </bpt>20,1".<ept i="3">}</ept></seg> </tuv> </tu> <tuvxml:lang=“EN-US"> <seg>On 22nd May we decided not to join the group.</seg> <tuvxml:lang=“DE-DE"> <seg>Am 22. </seg> cleaning More cleaning
35. OngoingWork :) - API integration + userdomainbuilding- Audiovisual integration- Releasethecodetousers-> create a community and flavourstoeachsituation; hybridate and create rules- Have more and more companies and institutions use PangeaMT as theirplatform and makeitgrown
36. 2009 2010 Predictions Tech. notthe realm of afew providers 2011 2012 2013 User empowerment 000's of customized MT systems 2014 2015 YEAR 2016 2016 2017 2018