Компиляция и транскрибирование корпуса устной речи с EXMARaLDANLProc.by
Extensible Markup Language for Discourse Annotation
Набор инструментов для работы с видео и аудиофайлами
Разработан в Университете Гамбурга (Universität Hamburg)
Написан в Java
Поддерживается Windows, Mac OS, Linux
Компиляция и транскрибирование корпуса устной речи с EXMARaLDANLProc.by
Extensible Markup Language for Discourse Annotation
Набор инструментов для работы с видео и аудиофайлами
Разработан в Университете Гамбурга (Universität Hamburg)
Написан в Java
Поддерживается Windows, Mac OS, Linux
Компьютерные самоучители по языкам от компании "РЕПЕТИТОР МультиМедиа"Anatoly Zhislin
Компьютерные самоучители по языкам от компании "РЕПЕТИТОР МультиМедиа" адресованы пользователям, родной язык которых русский. Наиболее популярное издание - компьютерный самоучитель "РЕПЕТИТОР English" для так называемых "ложных новичков" (знал, но забыл). Представлены также самоучители для более высоких уровней изучения языка: тренажеры 'Living English' и 'Real American', обучающая программа по основам бизнес-лексики'Business English on VOA', мультимедийная хрестоматия и тренажер по искусству публичных выступлений 'Great Speakers and Great Speeches', интерактивная аудиокнига по рассказам Марка Твена, тренажеры по немецкому языку - серия 'Echtes Deutsch' и др.
Alexandr Serbul "The Rust language for a high-load network service - a quick ...Fwdays
In this talk, we will talk about the evolution of the development of a high-load network cluster for sending push notifications using technologies from Unix / bash and PHP to asynchronous non-blocking multithreaded connections based on Rust / Tokio. Let's talk about the intricacies of Rust development, language features, pitfalls, and ways to quickly learn and use it for web developers with LAMP skills. Let's also talk about Go, Java, and the reasons for our technological decisions.
The talk will be useful for developers wishing to master the latest and popular Rust programming language, functional programming, Haskell ideas with PHP / Python / JavaScript web development experience.
UTICamp-2020. We Translate/Localise into Two Languages: Yes, This Can Be Done...UTICamp
Speaker: Vatslav Yehurnov
Have you been asked to translate the same advertising leaflet into Ukrainian and Russian? Or do yourself want to offer this service and save time? Do you know colleagues who translate similar materials into their languages and want to ensure consistency in translations into multiple languages? There are some quite simple techniques to help you save time, perform an additional quality control session and ensure the uniqueness of your translations into two languages.
The presentation describes practical experience of speeding up translation into two languages: Russian and Ukrainian.
UTICamp-2020. Is Translation Safe Road for Beginner?UTICamp
Speaker: Igor Bukharkov
It is well known that the beginners have difficult time in any industry. However, newbies in translation demand some special attention. How to train a professional from a postgraduate, achieve additional long-term benefits for translation companies and, most importantly, stop young and talented specialists from leaving the industry are some of the key issues we are going to focus on.
Speaker: Valentyna Kozlova
A translator can and should be happy. At least satisfied with themselves. How to reach nirvana when you are being assessed all the time by proofreaders, clients, and readers? We’ll try to figure it out and make up a brief manual, a roadmap to happiness for a translator.
UTICamp-2020. Technical Writing and Translation: the Interaction Between Simp...UTICamp
Speakers: Orlando Chiarello, Tatiana Shvets
The purpose of this workshop is to show the interaction between ASD Simplified Technical English, ASD-STE100 (STE) and the Russian language in technical writing and translation. STE is a controlled language consisting of a set of writing rules and a controlled vocabulary. It has been developed in the early Eighties by and for the aviation industry to help the readers of English-language documentation understand what they read, particularly when these readers are non-native English speakers.
When STE was created, one of its objectives was to make translation easier. If the vocabulary, meanings of words and types of sentence constructions are controlled, it would be easier for translators or translation engines to translate texts written in STE. Even in the absence of a Russian-based controlled language, the advantage in terms of translation and time reduction could be relevant when texts written in STE are translated into Russian, particularly when using machine translation.
An important purpose of this workshop is to demonstrate how the writing and translation process work by means of some practical examples. At the end, results will be discussed.
UTICamp-2020. Is translation automation a myth or a reality?UTICamp
Speaker: Istvan Lengyel
Smaller translation providers face very different realities from the largest companies in the sector. In reality, the differentiator is not as much technology as developing processes together with their customers.
With the proliferation of small and frequent jobs, the industry is already seeing a move away from the “translations with love”, yet many company owners argue that the automation of processes in the translation industry is not possible, or at least not profitable. In this presentation I will talk about the many variations and aspects in translation projects that make automation difficult, and I will also look at the skills and qualities of project managers that will be those colleagues that bring your company to the next level. I will also discuss some of the lesser-known middleware tools that are worth knowing if you are interested in moving towards automation. I make an attempt to give participants both a profile that they should be looking for when hiring a “different” project manager and some recommendations on how to start instilling the culture of automation into their daily work.
UTICamp-2020. From Quality Assurance to Operational ExcellenceUTICamp
Speaker: Manal Amin
It is one thing to achieve compliance with quality standard’s requirements; which is great already. Yet it’s totally different to reach excellence and deliver what is “Desired” not only what is “Required”.
Being one step ahead of your competition through strategic management deployment, performance management, continuous process optimization; and a highly performing, fully engaged team that is what we call Operational Excellence. Investing in long-term change in the organizational culture, so all team members work in putting business strategies into practice; achieving efficient strategic management.
Tackling OpEx is going through (but not limited to):
1- Process Optimization & Continuous Improvement through Six Sigma and DMAIC Methodology
2- Organizational Cultural Enhancement to reach a standard methodology of correcting things when they go wrong
3- Adopting a risk-based thinking mindset that guide business strategies
4- Efficient Performance and Data Management
UTICamp-2020. Step Across The Border and Find Your NicheUTICamp
The document discusses finding niches at the intersection of translation companies and potential clients. It provides examples of how a translation company called Vocal Point developed accessibility solutions for the event industry. It suggests approaches for language service providers to go beyond one-directional translation by considering client perspectives, finding complementary or overlapping business activities, and generating translatable content in niche areas of expertise. Potential models involve offering multilingual support and accessibility solutions to event organizers and pursuing public funding. The key is to establish expertise that can generate content requiring translation services.
UTICamp-2020. Size Does Matter, or Is the translation Industry Restructuring ...UTICamp
Speaker: Nigel Saych
Every linguist knows what the challenge of technological development is bringing to our profession. Nobody needs another presentation entitled ‘The threat of….’. But are the days of ‘Big is beautiful’ still relevant to our profession? Are changes taking place within our industry that suggest we are rethinking the roles of the classic translation agency and of the stand-alone linguist working on the kitchen table?
I have noticed trends in recent years that attract surprisingly little comment. I am aware of a change of direction that is influencing the face of the way we work. It’s not a spectacular change, but it’s creeping in from all directions. I call it my BBB theory. But that needs quite a lot of explanation. All will be revealed…
UTICamp-2020. Preparing for Post-Doom – Diversifying business: Content creati...UTICamp
Speakers: Leonid Glazychev, Fedor Bezrukov
After discussing the doomsday scenario (previous presentation), we would like to present the positive side of things: How can both companies and individuals get prepared and march straight into the doomsday-free future. We will discuss diversification scenarios and synergies between traditional translation/localization and various neighboring areas, such as content creation, video and eLearning, game and multimedia localization, design and page-setting, and others.
We will then talk about short- and long-term potential, advantages and challenges for each of these development areas, and provide concrete examples or case studies. We also hope to introduce you to one or more surprise guests...
Speaker: Oleksandr Bondarenko (Translatel Ltd)
It has been widely acknowledged that PEMT specialists are in great demand at the market. According to a diagnostic survey of the translators’ professional profile (see Bondarenko, Marczak 2019), the PEMT skills are highly estimated by employers and underestimated by employees.
In this talk, Oleksandr will offer a brief review of some of the preliminary findings of a two-year project of configuring MT engines that he has been personally involved in. The talk sheds more light on how MT customizing can influence the translation quality in popular language pairs. The speaker will focus on a workflow, do’s and don’ts, as well as on strategies to reduce time and efforts required for post-editing.
Human MT evaluation statistics will also be provided to better forecast the MT leverage for the definite projects. The speaker will suggest a framework to acquire MT-engines literacy that can bring maximum effect with little cost and little time.
The case study can be instructive both for companies’ executives and for freelancers.
UTICamp-2020. Quality Assurance in TranslationUTICamp
Speaker: Nigel Wheatley
Many a freelancer's heart will sink at the mention of quality assurance (QA). For too many translators it is synonymous with pointless semi-automated checks that have an overwhelming rate of false-positives; but such software "solutions" are not QA at all. Approached correctly, true QA can offer a solution to one of our fundamental marketing problems: how do we persuade potential clients that we are competent, professional and worth our rates. This presentation will look at the various QA options available to individual freelancers and/or small agencies.
Speaker: William Hackett-Jones
Every language is equipped with words and set expressions that are metaphorical. They beautify language, they're emphatic... and they're also a pain to translate - because you cannot translate slang and idioms literally. It's not enough to know the meaning of the component words of an idiom, in order to translate it. Indeed, one needs a decent degree of background knowledge about the cultural aspects of the country whose language you're translating. When we translate dialogue in film, we always try to keep the flavour of the speaker's language, which means finding appropriate slang and set expressions to fit the character.
More Related Content
Similar to UTICamp-2020. Transcription, Subtitles, Translation: Easier Than Ever
Компьютерные самоучители по языкам от компании "РЕПЕТИТОР МультиМедиа"Anatoly Zhislin
Компьютерные самоучители по языкам от компании "РЕПЕТИТОР МультиМедиа" адресованы пользователям, родной язык которых русский. Наиболее популярное издание - компьютерный самоучитель "РЕПЕТИТОР English" для так называемых "ложных новичков" (знал, но забыл). Представлены также самоучители для более высоких уровней изучения языка: тренажеры 'Living English' и 'Real American', обучающая программа по основам бизнес-лексики'Business English on VOA', мультимедийная хрестоматия и тренажер по искусству публичных выступлений 'Great Speakers and Great Speeches', интерактивная аудиокнига по рассказам Марка Твена, тренажеры по немецкому языку - серия 'Echtes Deutsch' и др.
Alexandr Serbul "The Rust language for a high-load network service - a quick ...Fwdays
In this talk, we will talk about the evolution of the development of a high-load network cluster for sending push notifications using technologies from Unix / bash and PHP to asynchronous non-blocking multithreaded connections based on Rust / Tokio. Let's talk about the intricacies of Rust development, language features, pitfalls, and ways to quickly learn and use it for web developers with LAMP skills. Let's also talk about Go, Java, and the reasons for our technological decisions.
The talk will be useful for developers wishing to master the latest and popular Rust programming language, functional programming, Haskell ideas with PHP / Python / JavaScript web development experience.
UTICamp-2020. We Translate/Localise into Two Languages: Yes, This Can Be Done...UTICamp
Speaker: Vatslav Yehurnov
Have you been asked to translate the same advertising leaflet into Ukrainian and Russian? Or do yourself want to offer this service and save time? Do you know colleagues who translate similar materials into their languages and want to ensure consistency in translations into multiple languages? There are some quite simple techniques to help you save time, perform an additional quality control session and ensure the uniqueness of your translations into two languages.
The presentation describes practical experience of speeding up translation into two languages: Russian and Ukrainian.
UTICamp-2020. Is Translation Safe Road for Beginner?UTICamp
Speaker: Igor Bukharkov
It is well known that the beginners have difficult time in any industry. However, newbies in translation demand some special attention. How to train a professional from a postgraduate, achieve additional long-term benefits for translation companies and, most importantly, stop young and talented specialists from leaving the industry are some of the key issues we are going to focus on.
Speaker: Valentyna Kozlova
A translator can and should be happy. At least satisfied with themselves. How to reach nirvana when you are being assessed all the time by proofreaders, clients, and readers? We’ll try to figure it out and make up a brief manual, a roadmap to happiness for a translator.
UTICamp-2020. Technical Writing and Translation: the Interaction Between Simp...UTICamp
Speakers: Orlando Chiarello, Tatiana Shvets
The purpose of this workshop is to show the interaction between ASD Simplified Technical English, ASD-STE100 (STE) and the Russian language in technical writing and translation. STE is a controlled language consisting of a set of writing rules and a controlled vocabulary. It has been developed in the early Eighties by and for the aviation industry to help the readers of English-language documentation understand what they read, particularly when these readers are non-native English speakers.
When STE was created, one of its objectives was to make translation easier. If the vocabulary, meanings of words and types of sentence constructions are controlled, it would be easier for translators or translation engines to translate texts written in STE. Even in the absence of a Russian-based controlled language, the advantage in terms of translation and time reduction could be relevant when texts written in STE are translated into Russian, particularly when using machine translation.
An important purpose of this workshop is to demonstrate how the writing and translation process work by means of some practical examples. At the end, results will be discussed.
UTICamp-2020. Is translation automation a myth or a reality?UTICamp
Speaker: Istvan Lengyel
Smaller translation providers face very different realities from the largest companies in the sector. In reality, the differentiator is not as much technology as developing processes together with their customers.
With the proliferation of small and frequent jobs, the industry is already seeing a move away from the “translations with love”, yet many company owners argue that the automation of processes in the translation industry is not possible, or at least not profitable. In this presentation I will talk about the many variations and aspects in translation projects that make automation difficult, and I will also look at the skills and qualities of project managers that will be those colleagues that bring your company to the next level. I will also discuss some of the lesser-known middleware tools that are worth knowing if you are interested in moving towards automation. I make an attempt to give participants both a profile that they should be looking for when hiring a “different” project manager and some recommendations on how to start instilling the culture of automation into their daily work.
UTICamp-2020. From Quality Assurance to Operational ExcellenceUTICamp
Speaker: Manal Amin
It is one thing to achieve compliance with quality standard’s requirements; which is great already. Yet it’s totally different to reach excellence and deliver what is “Desired” not only what is “Required”.
Being one step ahead of your competition through strategic management deployment, performance management, continuous process optimization; and a highly performing, fully engaged team that is what we call Operational Excellence. Investing in long-term change in the organizational culture, so all team members work in putting business strategies into practice; achieving efficient strategic management.
Tackling OpEx is going through (but not limited to):
1- Process Optimization & Continuous Improvement through Six Sigma and DMAIC Methodology
2- Organizational Cultural Enhancement to reach a standard methodology of correcting things when they go wrong
3- Adopting a risk-based thinking mindset that guide business strategies
4- Efficient Performance and Data Management
UTICamp-2020. Step Across The Border and Find Your NicheUTICamp
The document discusses finding niches at the intersection of translation companies and potential clients. It provides examples of how a translation company called Vocal Point developed accessibility solutions for the event industry. It suggests approaches for language service providers to go beyond one-directional translation by considering client perspectives, finding complementary or overlapping business activities, and generating translatable content in niche areas of expertise. Potential models involve offering multilingual support and accessibility solutions to event organizers and pursuing public funding. The key is to establish expertise that can generate content requiring translation services.
UTICamp-2020. Size Does Matter, or Is the translation Industry Restructuring ...UTICamp
Speaker: Nigel Saych
Every linguist knows what the challenge of technological development is bringing to our profession. Nobody needs another presentation entitled ‘The threat of….’. But are the days of ‘Big is beautiful’ still relevant to our profession? Are changes taking place within our industry that suggest we are rethinking the roles of the classic translation agency and of the stand-alone linguist working on the kitchen table?
I have noticed trends in recent years that attract surprisingly little comment. I am aware of a change of direction that is influencing the face of the way we work. It’s not a spectacular change, but it’s creeping in from all directions. I call it my BBB theory. But that needs quite a lot of explanation. All will be revealed…
UTICamp-2020. Preparing for Post-Doom – Diversifying business: Content creati...UTICamp
Speakers: Leonid Glazychev, Fedor Bezrukov
After discussing the doomsday scenario (previous presentation), we would like to present the positive side of things: How can both companies and individuals get prepared and march straight into the doomsday-free future. We will discuss diversification scenarios and synergies between traditional translation/localization and various neighboring areas, such as content creation, video and eLearning, game and multimedia localization, design and page-setting, and others.
We will then talk about short- and long-term potential, advantages and challenges for each of these development areas, and provide concrete examples or case studies. We also hope to introduce you to one or more surprise guests...
Speaker: Oleksandr Bondarenko (Translatel Ltd)
It has been widely acknowledged that PEMT specialists are in great demand at the market. According to a diagnostic survey of the translators’ professional profile (see Bondarenko, Marczak 2019), the PEMT skills are highly estimated by employers and underestimated by employees.
In this talk, Oleksandr will offer a brief review of some of the preliminary findings of a two-year project of configuring MT engines that he has been personally involved in. The talk sheds more light on how MT customizing can influence the translation quality in popular language pairs. The speaker will focus on a workflow, do’s and don’ts, as well as on strategies to reduce time and efforts required for post-editing.
Human MT evaluation statistics will also be provided to better forecast the MT leverage for the definite projects. The speaker will suggest a framework to acquire MT-engines literacy that can bring maximum effect with little cost and little time.
The case study can be instructive both for companies’ executives and for freelancers.
UTICamp-2020. Quality Assurance in TranslationUTICamp
Speaker: Nigel Wheatley
Many a freelancer's heart will sink at the mention of quality assurance (QA). For too many translators it is synonymous with pointless semi-automated checks that have an overwhelming rate of false-positives; but such software "solutions" are not QA at all. Approached correctly, true QA can offer a solution to one of our fundamental marketing problems: how do we persuade potential clients that we are competent, professional and worth our rates. This presentation will look at the various QA options available to individual freelancers and/or small agencies.
Speaker: William Hackett-Jones
Every language is equipped with words and set expressions that are metaphorical. They beautify language, they're emphatic... and they're also a pain to translate - because you cannot translate slang and idioms literally. It's not enough to know the meaning of the component words of an idiom, in order to translate it. Indeed, one needs a decent degree of background knowledge about the cultural aspects of the country whose language you're translating. When we translate dialogue in film, we always try to keep the flavour of the speaker's language, which means finding appropriate slang and set expressions to fit the character.
UTICamp-2020. Interpreting As Part Of the Official Delegation: Expectations A...UTICamp
Speaker: Andrew Liashenko
You will learn recommendations on how to prepare for interpreting, while being a part of the official state delegation.
You will hear about the importance of the Official programme of visit and some useful information it contains for the interpreter; when, how and where to obtain the programme; or should you even trust it.
There will be personal recommendations on how to prepare for bilaterals, what aspects to consider, and what useful information you can get from YouTube.
The report also unveils some of peculiarities of the official protocol and its importance for the interpreter. We will analyze a few case studies to explain it properly. You will also hear about the peculiarities of being a “personal” interpreter.
Finally, you will have a checklist to consider before entering a room for bilaterals.
UTICamp-2020. Interpreting Oncology: A View from Both SidesUTICamp
Speakers: Dmytro Shapochka, Anna Ivanchenko
A joint presentation from an interpreter and an expert in the field. What does oncology consist of? What kind of work can an interpreter in this field expect? We will tell about the main notions the interpreter should master to achieve communication goals. We’ll also spend some time on the terms with no equivalents and talk about different logic of term formation in different languages.
UTICamp-2020. Conference Trends in a Post-Covid-19 WorldUTICamp
Speaker: Rob Davidson
The international conference market is changing rapidly in the face of sweeping technological, demographic, geopolitical and economic trends – to which we may now add the challenge presented by the global health crisis engendered by the spread of the Coronavirus.
These changes are creating important qualitative and quantitative shifts in the demand for, and supply of, conference services - including interpretation. My presentation will explore these key trends in the international conference market and will examine their consequences for the conference interpreting profession.
Минута истории
https://sonix.ai/history-of-speech-recognition
https://habr.com/ru/post/131945/
Средняя скорость – 1 час материала за 10 минут
Не совсем. Если объем материала до 2-3 минут и скорость речи медленная, возможно, использовать STT нецелесообразно. Если нужен перевод, можно перевести и «со слуха».
Но, если:
объем становится больше
вам нужно контролировать качество перевода (что невозможно без исходника), возможно, подключать МП
вам нужно создать титры
– вам придется сначала создать исходник, и лучше, если он будет изначально в правильном формате, который позволяет экспорт как в Word, так и в титры – как с указанием спикеров, так и без.
Потратьте 3 минуты, воспользуйтесь бесплатной пробной версией любого из лучших сервисов. Вы ничего не теряете, но получите готовый подстрочник.
Что касается стоимости и средней скорости (данные Sonix), данные говорят сами за себя. В нормальном режиме минута речи – около 200 слов. Даже если у вас очень высокая скорость слепого набора, применение STT более чем оправдано (проверено личной практикой).
Средние расценки на биржах фрилансеров в Украине - 10 долларов за час звука (что запредельно ниже плинтуса)
Инструкция здесь https://sonix.ai/remove-metallic-sound-from-audio
Соникс, например, поддерживает мультитрек-где разные голоса записаны на разных дорожках
Совместная работа над одним проектом поддерживается большинством, но на бесплатных планах ограничения почти у всех, на дешевых - у части.
Otter - для транскрипта интервью, онлайн-встреч и совещаний
Качество распознавания – Editing Distance для эталонного текста и неотредактированного распознанного.
Традиционных баллов не будет. Не всем нужны определенные фичи, такие как поддержка русского или украинского языков, возможность напрямую скачивать видео с Youtube/DropBox/GoogleDrive etc. Поэтому выбирайте систему, исходя из нужных именно вам параметров.
CopyLeaks – identical + minor changes
194 слова