SlideShare a Scribd company logo
1 of 28
Download to read offline
MUNICH ARTIFICIAL INTELLIGENCE LABORATORIES
REPORTS FROM THE BATTLEFRONT, OR: BUILD YOUR
OWN SPEECH TO TEXT DATASET FROM SCRATCH IN
30 DAYS
Dmytro Naumov (dima@m-ailabs.bayern)
March 2018
Munich Artificial Intelligence Laboratories GmbH (M-AILABS)
MUNICH ARTIFICIAL INTELLIGENCE LABORATORIES
LET’S START WITH SOME ADVERTISING…
2
MUNICH ARTIFICIAL INTELLIGENCE LABORATORIES
M-AILABS IN A NUTSHELL
• Founded April 1st*
• Designs & develops ML/AI-based software engines
• 100% owner-financed and profitable
• Currently 4 FTE (3x Machine Learning/Neuronal Networks; 1 Administrative)
3
*: Yes, it was in fact founded on April 1st, but the actual registration was on April 3rd because … April 1st 2017 is a Saturday and the registration office doesn’t work on weekends…
MUNICH ARTIFICIAL INTELLIGENCE LABORATORIES
ARTIFICIAL INTELLIGENCE AT THE HEART OF EUROPE…
Our mission is to enable
European organizations to take
advantage of AI & ML without
having to give up control or
know-how
4
VISION: APPLIED AI/ML
We use state-of-the art technologies to develop solutions for, e.g.:
5
MUNICH ARTIFICIAL INTELLIGENCE LABORATORIES
Speech Recognition Speech Synthesis Document Classification Object Recognition
Text Classification Specialized OCR Signal Processing Specialized Translation
MUNICH ARTIFICIAL INTELLIGENCE LABORATORIES
TECHNOLOGIES WE USE (ALL OPEN SOURCE)
• Classical ML-Algorithms: scikit_learn, numpy, scipy, …
• Classical Neural Network Algorithms: scikit_learn, …
• Deep Learning-based NN algorithms: s. below
• NLP in all its incarnations: NLTK, spaCy, GenSim, …
6
Deep Learning
• Torch & PyTorch
• Caffe2
• TensorFlow (less and less)
• Keras 2 (on top of TensorFlow)
MUNICH ARTIFICIAL INTELLIGENCE LABORATORIES
A SELECTION OF OUR DEEP LEARNING-HARDWARE
7
MUNICH ARTIFICIAL INTELLIGENCE LABORATORIES
8
MUNICH ARTIFICIAL INTELLIGENCE LABORATORIES
OK, BACK TO THE TOPIC: SOME BACKGROUND
REFRESHER…
9
MUNICH ARTIFICIAL INTELLIGENCE LABORATORIES
MACHINE LEARNING PROCESS
10
E.g. Raw images

(>1,000,000)
Categorized Data
Real & Simulated
Tagged Data

(E.g.: 100,000 Images)
Trained Model

(Days, weeks,…)
…
Data Labeler Data Scientist Algorithm / Optimization Developer
Optimize Operate
Validate
Accuracy ➠ ➠Collect Data
Establish
Ground Truth
Select
DataSet
Design &
Train Models➠ ➠ ➠ ➠
MUNICH ARTIFICIAL INTELLIGENCE LABORATORIES
DATA COLLECTION & LABELING (ESTABLISHING GROUND TRUTH)
• Thousands of sample data
• Each need to be labeled
• Each label represents a category
• Result: Training material to teach the
Machine our own Knowledge
11
MUNICH ARTIFICIAL INTELLIGENCE LABORATORIES
TASK: DEVELOP A SPEECH-RECOGNITION & SPEECH
SYNTHESIS SOLUTION FOR GERMAN (AND SPANISH, AND …)
12
MUNICH ARTIFICIAL INTELLIGENCE LABORATORIES
“A KINGDOM FOR DATA…”
• Read papers, papers, papers, and more
papers…
• Write an SR-system based on DeepSpeech
(Baidu)
• Train on LJSpeech-data (english)
• … works great! Now to German…
13
• Problems, problems, problems
• No public German voice data available
• Universities / Colleges have data, but…
• … even for testing they want € € €
• … you are not allowed to use it for
commercial purposes
• … and there are other issues as well …
“I have an idea: Let’s create
our own dataset…”
MUNICH ARTIFICIAL INTELLIGENCE LABORATORIES
RAW DATA
• How much data do we need?
• Hundreds of hours per language
• How do we generate them?
• Hire people to read books sentence by
sentence
• … or… checkout LibriVox and split them
sentence by sentence
14
MUNICH ARTIFICIAL INTELLIGENCE LABORATORIES
WOW, MORE DIFFICULT THAN THOUGHT… WHERE IS THE
SENTENCE?
Inhalt
Das grüne Haus Die goldne Spinne
Mariechen und die Sonne
Als es nicht regnen wollte
…
Das grüne Haus
Ja, es ist ein grünes Haus, in dem ich wohne; und alle
Märchen und Geschichten, die in diesem Buche stehn, sind darin
geschrieben worden. Es ist nicht etwa grün angestrichen wie ein
Gartenzaun oder eine …
15
?
?
?
MUNICH ARTIFICIAL INTELLIGENCE LABORATORIES
APPROACH 1: USE A HUMAN TO ESTABLISH GROUND-TRUTH FOR
EACH AUDIO SEQUENCE MANUALLY
• Take the original audio
• Using ‘sox’, split audio files separated by
pauses
• Generate a list of audio-files
• Have a person listen to each audio-file and
assign the original text to it (CSV-file)
16
• Results:
• Quality: excellent
• Speed: … well… it is slow, repetitive and boring
work
• It took very long (at least the same time as the
audio-length) - but normally 2x - 4x the audio
length
• Not really scalable…
MUNICH ARTIFICIAL INTELLIGENCE LABORATORIES
APPROACH 2: USE OWN TOOLS + GOOGLE/APPLE SPEECH API
• Use in-house developed script to split audio files separated by
pauses (analyze the whole audio and decide what is pause and
what is not)
• Generate a list of audio-files
• Use Google and Apple Speech APIs to transcribe these files
• Use in-house developed ‘trqa’ to perform ‘transcription-QA’ in
order to find the “original text”
• Results: extremely QA-heavy, but still doable (‘trqa’ partially
automated recognition, thus requiring only around 20-30%
real QA)
17
MUNICH ARTIFICIAL INTELLIGENCE LABORATORIES
WE ARE ACTUALLY INTERESTED ABOUT THE END OF THE
SENTENCE, NOT BEGINNING
Inhalt
Das grüne Haus Die goldne Spinne
Mariechen und die Sonne
Als es nicht regnen wollte
…
Das grüne Haus
Ja, es ist ein grünes Haus, in dem ich wohne; und alle
Märchen und Geschichten, die in diesem Buche stehn, sind darin
geschrieben worden. Es ist nicht etwa grün angestrichen wie ein
Gartenzaun oder eine …
18
?
?
?
MUNICH ARTIFICIAL INTELLIGENCE LABORATORIES
APPROACH 3: SPLIT FILES BASED ON TEXT-SENTENCES AND
SEARCH FOR THE ENDS OF THOSE TEXTS…
• Split original text into manageable “sentences”
• Make sure that each “sentence” does not result in audio longer than
15 seconds
• Generate audio out of text (TTS)
• Compare generated audio (spectrograms, etc) with begin of original
recording
• … thus finding the end of the original text
• Cut the text and the respective audio from beginning of the
recording
• Rinse, clean, repeat…
19
• Results:
• Quite good and fast results
• Requires manually preparing the original
text (adding/removing some tags)
• Remove “intro” and “outro” from audio-files
• Reliability depends very much on audio-
quality
MUNICH ARTIFICIAL INTELLIGENCE LABORATORIES
BUT WAIT, WHY DOESN’T MY DNN CONVERGE WELL? I HAVE TONS
OF TRAINING DATA, …
• Just aligning texts to audio is not enough
• We also need to do language-specific post-processing, e.g. for numbers, abbreviations,
inflections, etc.
• Problem: some languages are more di"cult than others
20
MUNICH ARTIFICIAL INTELLIGENCE LABORATORIES
POST-PROCESSING (NUMBERS -> TEXT)
21
# de en tr
13 dreizehn (three-ten) thirteen On-üç (ten-three)
27 Sieben-und-zwanzig (seven-and-twenty) twenty-seven Yirmi-yedi (twenty-seven)
127 Hundert-sieben-und-zwanzig (hundred-seven-and-twenty) hundred-twenty-seven Yüz-yirmi-yedi (hundred-twenty-seven)
1127
Tausend-ein-hundert-sieben-und-zwanzig (thousand-one-hundred-
seven-and-twenty)
Eleven-hundred-twenty-seven
Thousand-one-hundred-twenty-seven
Bin-yüz-yirmi-yedi (thousand-hundred-twenty-
seven)
111127
hundert-elf-tausend-ein-hundert-sieben-und-zwanzig (hundred-
eleven-thousand-one-hundred-seven-and-twenty)
Hundred-eleven-thousand-one-hundred-twenty-
seven
Yüz-on-bir-bin-yüz-yirmi-yedi (hundred-ten-
one-thousand-hundred-twenty-seven)
1981
Neun-zehn-hundert-ein-und-achtzig (year between 1100-1999)
Tausend-neun-hundert-ein-und-achtzig (non-year)
Nineteen-hundred-eighty-one (US: sometimes up
til 9999)
Bin-dokuz-yüz-seksen-bir (thousand-nine-
hundred-eighty-one)
9.
“Am 9. Juni” = “Am neunten Juni”
“9. Juni: …” = “Neunter Juni: …”
“9. Haus” = “Neuntes Haus”
ninth Dokuz-uncu
MUNICH ARTIFICIAL INTELLIGENCE LABORATORIES
LEARNINGS, TONS OF LEARNINGS…
• Google/Apple transcriptions are not really good
(crowd-sourced)
• Be careful which tools you use
• When developing tools for data labelers,
concentrate mostly on UX and then on UX and
lastly on UX (for the data labeler)
• Make sure you fulfill UX-related wishes of the
data labelers
22
• Check your original data
• E.g.: audio and text-files must match 100%
• Standardize on sample rate, amplification, channels, audio
file-formats (but document it, so you can replicate during
inference if needed)
• Do NOT throw away data (e.g. punctuations, etc. in text) - do
it during training or preprocessing for a specific training task
• Data preparation & labeling is hard work - do not
underestimate it…
MUNICH ARTIFICIAL INTELLIGENCE LABORATORIES
RESULTS…
23
MUNICH ARTIFICIAL INTELLIGENCE LABORATORIES
24
TRAINING DATA (HOURS:MINS) - BY LANGUAGE (V1.0)
de_DE en_UK en_US es_ES it_IT uk_UK ru_RU fr_FR*
Female 150:56 45:34 63:43 10:37 8:23 10:28 16:04 108:10
Male 39:44 0:00 38:23 72:24 31:45 62:32 30:43 93:32
Mixed 46:42 0:00 00:00 25:33 87:53 0:00 0:00 15:33
TOTAL 237:22 45:34 102:07 108:34 127:40 73:00 46:47 217:15
Quantity +++ + +++ +++ +++ ++ + +++
Quality +++ +++ +++ +++ +++ +++ +++ ++
*: French is still “work-in-progress”
Grand Total: 741:04 (~82GB) - excluding fr_FR
Additionally planned: pl_PL, nl_NL, tr_TR, pt_PT, pt_BR, ar_…, plus some dialects (de_AT, de_CH, de_DE_SX, de_DE_BY, …)
MUNICH ARTIFICIAL INTELLIGENCE LABORATORIES
SO, WHAT? WHAT DOES THIS MEAN FOR YOU?
25
MUNICH ARTIFICIAL INTELLIGENCE LABORATORIES
THE LICENSE
Copyright (c) 2017-2018 MUNICH ARTIFICIAL INTELLIGENCE LABORATORIES GmbH
Redistribution and use in any form, including any commercial use, with or without modification are permitted - bar the exceptions listed below -
provided that the following conditions are met:
1. Redistributions of source data must retain the above copyright notice, this list of conditions and the following disclaimer.
2. Neither the name of the copyright holder nor the names of its contributors may be used to endorse or promote products derived from this downloaded data, source-code or
binary-code without specific prior written permission.
3. ANY USE BY ANY UNIVERSITY, COLLEGE, RESEARCH INSTITUTE OR SIMILAR HIGHER EDUCATION INSTITUTION IN EUROPE, INCLUDING BY MEMBERS OF SUCH
INSTITUTIONS (including but not limited to the students, tutors and teachers at those institutions), REQUIRES A SEPARATE (free-of-charge) LICENSE AND IS NOT COVERED
BY THIS LICENSE AGREEMENT. PLEASE CONTACT US FOR DETAILS AT info@m-ailabs.bayern.
THIS DATA IS PROVIDED BY THE COPYRIGHT HOLDERS AND CONTRIBUTORS "AS IS" AND ANY EXPRESS OR IMPLIED WARRANTIES, INCLUDING, BUT NOT LIMITED TO, THE
IMPLIED WARRANTIES OF MERCHANTABILITY AND FITNESS FOR A PARTICULAR PURPOSE ARE DISCLAIMED. IN NO EVENT SHALL THE COPYRIGHT HOLDER OR
CONTRIBUTORS BE LIABLE FOR ANY DIRECT, INDIRECT, INCIDENTAL, SPECIAL, EXEMPLARY, OR CONSEQUENTIAL DAMAGES (INCLUDING, BUT NOT LIMITED TO,
PROCUREMENT OF SUBSTITUTE GOODS OR SERVICES; LOSS OF USE, DATA, OR PROFITS; OR BUSINESS INTERRUPTION) HOWEVER CAUSED AND ON ANY THEORY OF
LIABILITY, WHETHER IN CONTRACT, STRICT LIABILITY, OR TORT (INCLUDING NEGLIGENCE OR OTHERWISE) ARISING IN ANY WAY OUT OF THE USE OF THIS SOFTWARE and/
or DATA, EVEN IF ADVISED OF THE POSSIBILITY OF SUCH DAMAGE.
26
MUNICH ARTIFICIAL INTELLIGENCE LABORATORIES
WHERE TO GET IT
Available as of today at:
27
http://data.m-ailabs.eu/speech
MUNICH ARTIFICIAL INTELLIGENCE LABORATORIES
… mia san do, um zu helfen
info@m-ailabs.bayern
28

More Related Content

What's hot

Comparative Analysis of Transformer Based Pre-Trained NLP Models
Comparative Analysis of Transformer Based Pre-Trained NLP ModelsComparative Analysis of Transformer Based Pre-Trained NLP Models
Comparative Analysis of Transformer Based Pre-Trained NLP Modelssaurav singla
 
Introduction to Compilers
Introduction to CompilersIntroduction to Compilers
Introduction to CompilersAkhil Kaushik
 
Natural Language Processing
Natural Language ProcessingNatural Language Processing
Natural Language ProcessingYasir Khan
 
Fasttext 20170720 yjy
Fasttext 20170720 yjyFasttext 20170720 yjy
Fasttext 20170720 yjy재연 윤
 
Natural Language Processing and Machine Learning
Natural Language Processing and Machine LearningNatural Language Processing and Machine Learning
Natural Language Processing and Machine LearningKarthik Sankar
 
Part-of-Speech Tagging for Bengali Thesis submitted to Indian ...
Part-of-Speech Tagging for Bengali Thesis submitted to Indian ...Part-of-Speech Tagging for Bengali Thesis submitted to Indian ...
Part-of-Speech Tagging for Bengali Thesis submitted to Indian ...butest
 
Text to Speech PowerPoint
Text to Speech PowerPointText to Speech PowerPoint
Text to Speech PowerPointmatthewmahony
 
Self-Attention with Linear Complexity
Self-Attention with Linear ComplexitySelf-Attention with Linear Complexity
Self-Attention with Linear ComplexitySangwoo Mo
 
Artificial intelligence in speech recognition
Artificial intelligence in speech recognitionArtificial intelligence in speech recognition
Artificial intelligence in speech recognitionRajanivetha G
 
Natural Language Processing
Natural Language ProcessingNatural Language Processing
Natural Language ProcessingVeenaSKumar2
 
Statistical voice conversion with direct waveform modeling
Statistical voice conversion with direct waveform modelingStatistical voice conversion with direct waveform modeling
Statistical voice conversion with direct waveform modelingNU_I_TODALAB
 
natural language processing help at myassignmenthelp.net
natural language processing  help at myassignmenthelp.netnatural language processing  help at myassignmenthelp.net
natural language processing help at myassignmenthelp.netwww.myassignmenthelp.net
 
敏捷式組織架構
敏捷式組織架構敏捷式組織架構
敏捷式組織架構Yves Lin
 
Mitsubishi servo system family catalog
Mitsubishi servo system family catalogMitsubishi servo system family catalog
Mitsubishi servo system family catalogDien Ha The
 

What's hot (20)

Comparative Analysis of Transformer Based Pre-Trained NLP Models
Comparative Analysis of Transformer Based Pre-Trained NLP ModelsComparative Analysis of Transformer Based Pre-Trained NLP Models
Comparative Analysis of Transformer Based Pre-Trained NLP Models
 
UI/UXなUXのお話
UI/UXなUXのお話 UI/UXなUXのお話
UI/UXなUXのお話
 
Introduction to Compilers
Introduction to CompilersIntroduction to Compilers
Introduction to Compilers
 
Language processors
Language processorsLanguage processors
Language processors
 
Natural Language Processing
Natural Language ProcessingNatural Language Processing
Natural Language Processing
 
Fasttext 20170720 yjy
Fasttext 20170720 yjyFasttext 20170720 yjy
Fasttext 20170720 yjy
 
Natural Language Processing and Machine Learning
Natural Language Processing and Machine LearningNatural Language Processing and Machine Learning
Natural Language Processing and Machine Learning
 
Part-of-Speech Tagging for Bengali Thesis submitted to Indian ...
Part-of-Speech Tagging for Bengali Thesis submitted to Indian ...Part-of-Speech Tagging for Bengali Thesis submitted to Indian ...
Part-of-Speech Tagging for Bengali Thesis submitted to Indian ...
 
System Programming Overview
System Programming OverviewSystem Programming Overview
System Programming Overview
 
Text to Speech PowerPoint
Text to Speech PowerPointText to Speech PowerPoint
Text to Speech PowerPoint
 
Self-Attention with Linear Complexity
Self-Attention with Linear ComplexitySelf-Attention with Linear Complexity
Self-Attention with Linear Complexity
 
Artificial intelligence in speech recognition
Artificial intelligence in speech recognitionArtificial intelligence in speech recognition
Artificial intelligence in speech recognition
 
Natural Language Processing
Natural Language ProcessingNatural Language Processing
Natural Language Processing
 
Phases of compiler
Phases of compilerPhases of compiler
Phases of compiler
 
Statistical voice conversion with direct waveform modeling
Statistical voice conversion with direct waveform modelingStatistical voice conversion with direct waveform modeling
Statistical voice conversion with direct waveform modeling
 
natural language processing help at myassignmenthelp.net
natural language processing  help at myassignmenthelp.netnatural language processing  help at myassignmenthelp.net
natural language processing help at myassignmenthelp.net
 
Transition System
Transition SystemTransition System
Transition System
 
敏捷式組織架構
敏捷式組織架構敏捷式組織架構
敏捷式組織架構
 
Acazia Software Company Presentation
Acazia Software Company Presentation Acazia Software Company Presentation
Acazia Software Company Presentation
 
Mitsubishi servo system family catalog
Mitsubishi servo system family catalogMitsubishi servo system family catalog
Mitsubishi servo system family catalog
 

Similar to Build your own speech to text dataset in 30 days

Beyond the Symbols: A 30-minute Overview of NLP
Beyond the Symbols: A 30-minute Overview of NLPBeyond the Symbols: A 30-minute Overview of NLP
Beyond the Symbols: A 30-minute Overview of NLPMENGSAYLOEM1
 
Programming Languages #devcon2013
Programming Languages #devcon2013Programming Languages #devcon2013
Programming Languages #devcon2013Iván Montes
 
Deep Learning with Audio Signals: Prepare, Process, Design, Expect
Deep Learning with Audio Signals: Prepare, Process, Design, ExpectDeep Learning with Audio Signals: Prepare, Process, Design, Expect
Deep Learning with Audio Signals: Prepare, Process, Design, ExpectKeunwoo Choi
 
Ry pyconjp2015 karaoke
Ry pyconjp2015 karaokeRy pyconjp2015 karaoke
Ry pyconjp2015 karaokeRenyuan Lyu
 
TAUS Machine Translation Showcase, The Simplified Guide to Getting Started in...
TAUS Machine Translation Showcase, The Simplified Guide to Getting Started in...TAUS Machine Translation Showcase, The Simplified Guide to Getting Started in...
TAUS Machine Translation Showcase, The Simplified Guide to Getting Started in...TAUS - The Language Data Network
 
Introducción a NLP (Natural Language Processing) en Azure
Introducción a NLP (Natural Language Processing) en AzureIntroducción a NLP (Natural Language Processing) en Azure
Introducción a NLP (Natural Language Processing) en AzurePlain Concepts
 
What – if anything – have we learned from C++? by Bjarne Stroustrup @ Curry O...
What – if anything – have we learned from C++? by Bjarne Stroustrup @ Curry O...What – if anything – have we learned from C++? by Bjarne Stroustrup @ Curry O...
What – if anything – have we learned from C++? by Bjarne Stroustrup @ Curry O...curryon
 
Deep Learning, an interactive introduction for NLP-ers
Deep Learning, an interactive introduction for NLP-ersDeep Learning, an interactive introduction for NLP-ers
Deep Learning, an interactive introduction for NLP-ersRoelof Pieters
 
Introduction To Autumata Theory
 Introduction To Autumata Theory Introduction To Autumata Theory
Introduction To Autumata TheoryAbdul Rehman
 
OWF14 - Big Data : The State of Machine Learning in 2014
OWF14 - Big Data : The State of Machine  Learning in 2014OWF14 - Big Data : The State of Machine  Learning in 2014
OWF14 - Big Data : The State of Machine Learning in 2014Paris Open Source Summit
 
Natural Language Processing Tools for the Digital Humanities
Natural Language Processing Tools for the Digital HumanitiesNatural Language Processing Tools for the Digital Humanities
Natural Language Processing Tools for the Digital HumanitiesXiang Li
 
python presntation 2.pptx
python presntation 2.pptxpython presntation 2.pptx
python presntation 2.pptxArpittripathi45
 

Similar to Build your own speech to text dataset in 30 days (20)

Beyond the Symbols: A 30-minute Overview of NLP
Beyond the Symbols: A 30-minute Overview of NLPBeyond the Symbols: A 30-minute Overview of NLP
Beyond the Symbols: A 30-minute Overview of NLP
 
Programming Languages #devcon2013
Programming Languages #devcon2013Programming Languages #devcon2013
Programming Languages #devcon2013
 
Deep Learning with Audio Signals: Prepare, Process, Design, Expect
Deep Learning with Audio Signals: Prepare, Process, Design, ExpectDeep Learning with Audio Signals: Prepare, Process, Design, Expect
Deep Learning with Audio Signals: Prepare, Process, Design, Expect
 
Sound soft hackday-100905
Sound soft hackday-100905Sound soft hackday-100905
Sound soft hackday-100905
 
Ry pyconjp2015 karaoke
Ry pyconjp2015 karaokeRy pyconjp2015 karaoke
Ry pyconjp2015 karaoke
 
TAUS Machine Translation Showcase, The Simplified Guide to Getting Started in...
TAUS Machine Translation Showcase, The Simplified Guide to Getting Started in...TAUS Machine Translation Showcase, The Simplified Guide to Getting Started in...
TAUS Machine Translation Showcase, The Simplified Guide to Getting Started in...
 
Paradigms
ParadigmsParadigms
Paradigms
 
Lec 01 introduction
Lec 01   introductionLec 01   introduction
Lec 01 introduction
 
Introducción a NLP (Natural Language Processing) en Azure
Introducción a NLP (Natural Language Processing) en AzureIntroducción a NLP (Natural Language Processing) en Azure
Introducción a NLP (Natural Language Processing) en Azure
 
What – if anything – have we learned from C++? by Bjarne Stroustrup @ Curry O...
What – if anything – have we learned from C++? by Bjarne Stroustrup @ Curry O...What – if anything – have we learned from C++? by Bjarne Stroustrup @ Curry O...
What – if anything – have we learned from C++? by Bjarne Stroustrup @ Curry O...
 
Java basics
Java basicsJava basics
Java basics
 
P1 2018 python
P1 2018 pythonP1 2018 python
P1 2018 python
 
Lecture 1 (bce-7)
Lecture   1 (bce-7)Lecture   1 (bce-7)
Lecture 1 (bce-7)
 
Deep Learning, an interactive introduction for NLP-ers
Deep Learning, an interactive introduction for NLP-ersDeep Learning, an interactive introduction for NLP-ers
Deep Learning, an interactive introduction for NLP-ers
 
Introduction To Autumata Theory
 Introduction To Autumata Theory Introduction To Autumata Theory
Introduction To Autumata Theory
 
OWF14 - Big Data : The State of Machine Learning in 2014
OWF14 - Big Data : The State of Machine  Learning in 2014OWF14 - Big Data : The State of Machine  Learning in 2014
OWF14 - Big Data : The State of Machine Learning in 2014
 
P1 2017 python
P1 2017 pythonP1 2017 python
P1 2017 python
 
System softare
System softareSystem softare
System softare
 
Natural Language Processing Tools for the Digital Humanities
Natural Language Processing Tools for the Digital HumanitiesNatural Language Processing Tools for the Digital Humanities
Natural Language Processing Tools for the Digital Humanities
 
python presntation 2.pptx
python presntation 2.pptxpython presntation 2.pptx
python presntation 2.pptx
 

Recently uploaded

Try MyIntelliAccount Cloud Accounting Software As A Service Solution Risk Fre...
Try MyIntelliAccount Cloud Accounting Software As A Service Solution Risk Fre...Try MyIntelliAccount Cloud Accounting Software As A Service Solution Risk Fre...
Try MyIntelliAccount Cloud Accounting Software As A Service Solution Risk Fre...MyIntelliSource, Inc.
 
The Evolution of Karaoke From Analog to App.pdf
The Evolution of Karaoke From Analog to App.pdfThe Evolution of Karaoke From Analog to App.pdf
The Evolution of Karaoke From Analog to App.pdfPower Karaoke
 
What are the features of Vehicle Tracking System?
What are the features of Vehicle Tracking System?What are the features of Vehicle Tracking System?
What are the features of Vehicle Tracking System?Watsoo Telematics
 
What is Fashion PLM and Why Do You Need It
What is Fashion PLM and Why Do You Need ItWhat is Fashion PLM and Why Do You Need It
What is Fashion PLM and Why Do You Need ItWave PLM
 
Unit 1.1 Excite Part 1, class 9, cbse...
Unit 1.1 Excite Part 1, class 9, cbse...Unit 1.1 Excite Part 1, class 9, cbse...
Unit 1.1 Excite Part 1, class 9, cbse...aditisharan08
 
ODSC - Batch to Stream workshop - integration of Apache Spark, Cassandra, Pos...
ODSC - Batch to Stream workshop - integration of Apache Spark, Cassandra, Pos...ODSC - Batch to Stream workshop - integration of Apache Spark, Cassandra, Pos...
ODSC - Batch to Stream workshop - integration of Apache Spark, Cassandra, Pos...Christina Lin
 
buds n tech IT solutions
buds n  tech IT                solutionsbuds n  tech IT                solutions
buds n tech IT solutionsmonugehlot87
 
What is Binary Language? Computer Number Systems
What is Binary Language?  Computer Number SystemsWhat is Binary Language?  Computer Number Systems
What is Binary Language? Computer Number SystemsJheuzeDellosa
 
Implementing Zero Trust strategy with Azure
Implementing Zero Trust strategy with AzureImplementing Zero Trust strategy with Azure
Implementing Zero Trust strategy with AzureDinusha Kumarasiri
 
Cloud Management Software Platforms: OpenStack
Cloud Management Software Platforms: OpenStackCloud Management Software Platforms: OpenStack
Cloud Management Software Platforms: OpenStackVICTOR MAESTRE RAMIREZ
 
Engage Usergroup 2024 - The Good The Bad_The Ugly
Engage Usergroup 2024 - The Good The Bad_The UglyEngage Usergroup 2024 - The Good The Bad_The Ugly
Engage Usergroup 2024 - The Good The Bad_The UglyFrank van der Linden
 
KnowAPIs-UnknownPerf-jaxMainz-2024 (1).pptx
KnowAPIs-UnknownPerf-jaxMainz-2024 (1).pptxKnowAPIs-UnknownPerf-jaxMainz-2024 (1).pptx
KnowAPIs-UnknownPerf-jaxMainz-2024 (1).pptxTier1 app
 
XpertSolvers: Your Partner in Building Innovative Software Solutions
XpertSolvers: Your Partner in Building Innovative Software SolutionsXpertSolvers: Your Partner in Building Innovative Software Solutions
XpertSolvers: Your Partner in Building Innovative Software SolutionsMehedi Hasan Shohan
 
BATTLEFIELD ORM: TIPS, TACTICS AND STRATEGIES FOR CONQUERING YOUR DATABASE
BATTLEFIELD ORM: TIPS, TACTICS AND STRATEGIES FOR CONQUERING YOUR DATABASEBATTLEFIELD ORM: TIPS, TACTICS AND STRATEGIES FOR CONQUERING YOUR DATABASE
BATTLEFIELD ORM: TIPS, TACTICS AND STRATEGIES FOR CONQUERING YOUR DATABASEOrtus Solutions, Corp
 
chapter--4-software-project-planning.ppt
chapter--4-software-project-planning.pptchapter--4-software-project-planning.ppt
chapter--4-software-project-planning.pptkotipi9215
 
The Essentials of Digital Experience Monitoring_ A Comprehensive Guide.pdf
The Essentials of Digital Experience Monitoring_ A Comprehensive Guide.pdfThe Essentials of Digital Experience Monitoring_ A Comprehensive Guide.pdf
The Essentials of Digital Experience Monitoring_ A Comprehensive Guide.pdfkalichargn70th171
 
EY_Graph Database Powered Sustainability
EY_Graph Database Powered SustainabilityEY_Graph Database Powered Sustainability
EY_Graph Database Powered SustainabilityNeo4j
 
Project Based Learning (A.I).pptx detail explanation
Project Based Learning (A.I).pptx detail explanationProject Based Learning (A.I).pptx detail explanation
Project Based Learning (A.I).pptx detail explanationkaushalgiri8080
 
(Genuine) Escort Service Lucknow | Starting ₹,5K To @25k with A/C 🧑🏽‍❤️‍🧑🏻 89...
(Genuine) Escort Service Lucknow | Starting ₹,5K To @25k with A/C 🧑🏽‍❤️‍🧑🏻 89...(Genuine) Escort Service Lucknow | Starting ₹,5K To @25k with A/C 🧑🏽‍❤️‍🧑🏻 89...
(Genuine) Escort Service Lucknow | Starting ₹,5K To @25k with A/C 🧑🏽‍❤️‍🧑🏻 89...gurkirankumar98700
 
Alluxio Monthly Webinar | Cloud-Native Model Training on Distributed Data
Alluxio Monthly Webinar | Cloud-Native Model Training on Distributed DataAlluxio Monthly Webinar | Cloud-Native Model Training on Distributed Data
Alluxio Monthly Webinar | Cloud-Native Model Training on Distributed DataAlluxio, Inc.
 

Recently uploaded (20)

Try MyIntelliAccount Cloud Accounting Software As A Service Solution Risk Fre...
Try MyIntelliAccount Cloud Accounting Software As A Service Solution Risk Fre...Try MyIntelliAccount Cloud Accounting Software As A Service Solution Risk Fre...
Try MyIntelliAccount Cloud Accounting Software As A Service Solution Risk Fre...
 
The Evolution of Karaoke From Analog to App.pdf
The Evolution of Karaoke From Analog to App.pdfThe Evolution of Karaoke From Analog to App.pdf
The Evolution of Karaoke From Analog to App.pdf
 
What are the features of Vehicle Tracking System?
What are the features of Vehicle Tracking System?What are the features of Vehicle Tracking System?
What are the features of Vehicle Tracking System?
 
What is Fashion PLM and Why Do You Need It
What is Fashion PLM and Why Do You Need ItWhat is Fashion PLM and Why Do You Need It
What is Fashion PLM and Why Do You Need It
 
Unit 1.1 Excite Part 1, class 9, cbse...
Unit 1.1 Excite Part 1, class 9, cbse...Unit 1.1 Excite Part 1, class 9, cbse...
Unit 1.1 Excite Part 1, class 9, cbse...
 
ODSC - Batch to Stream workshop - integration of Apache Spark, Cassandra, Pos...
ODSC - Batch to Stream workshop - integration of Apache Spark, Cassandra, Pos...ODSC - Batch to Stream workshop - integration of Apache Spark, Cassandra, Pos...
ODSC - Batch to Stream workshop - integration of Apache Spark, Cassandra, Pos...
 
buds n tech IT solutions
buds n  tech IT                solutionsbuds n  tech IT                solutions
buds n tech IT solutions
 
What is Binary Language? Computer Number Systems
What is Binary Language?  Computer Number SystemsWhat is Binary Language?  Computer Number Systems
What is Binary Language? Computer Number Systems
 
Implementing Zero Trust strategy with Azure
Implementing Zero Trust strategy with AzureImplementing Zero Trust strategy with Azure
Implementing Zero Trust strategy with Azure
 
Cloud Management Software Platforms: OpenStack
Cloud Management Software Platforms: OpenStackCloud Management Software Platforms: OpenStack
Cloud Management Software Platforms: OpenStack
 
Engage Usergroup 2024 - The Good The Bad_The Ugly
Engage Usergroup 2024 - The Good The Bad_The UglyEngage Usergroup 2024 - The Good The Bad_The Ugly
Engage Usergroup 2024 - The Good The Bad_The Ugly
 
KnowAPIs-UnknownPerf-jaxMainz-2024 (1).pptx
KnowAPIs-UnknownPerf-jaxMainz-2024 (1).pptxKnowAPIs-UnknownPerf-jaxMainz-2024 (1).pptx
KnowAPIs-UnknownPerf-jaxMainz-2024 (1).pptx
 
XpertSolvers: Your Partner in Building Innovative Software Solutions
XpertSolvers: Your Partner in Building Innovative Software SolutionsXpertSolvers: Your Partner in Building Innovative Software Solutions
XpertSolvers: Your Partner in Building Innovative Software Solutions
 
BATTLEFIELD ORM: TIPS, TACTICS AND STRATEGIES FOR CONQUERING YOUR DATABASE
BATTLEFIELD ORM: TIPS, TACTICS AND STRATEGIES FOR CONQUERING YOUR DATABASEBATTLEFIELD ORM: TIPS, TACTICS AND STRATEGIES FOR CONQUERING YOUR DATABASE
BATTLEFIELD ORM: TIPS, TACTICS AND STRATEGIES FOR CONQUERING YOUR DATABASE
 
chapter--4-software-project-planning.ppt
chapter--4-software-project-planning.pptchapter--4-software-project-planning.ppt
chapter--4-software-project-planning.ppt
 
The Essentials of Digital Experience Monitoring_ A Comprehensive Guide.pdf
The Essentials of Digital Experience Monitoring_ A Comprehensive Guide.pdfThe Essentials of Digital Experience Monitoring_ A Comprehensive Guide.pdf
The Essentials of Digital Experience Monitoring_ A Comprehensive Guide.pdf
 
EY_Graph Database Powered Sustainability
EY_Graph Database Powered SustainabilityEY_Graph Database Powered Sustainability
EY_Graph Database Powered Sustainability
 
Project Based Learning (A.I).pptx detail explanation
Project Based Learning (A.I).pptx detail explanationProject Based Learning (A.I).pptx detail explanation
Project Based Learning (A.I).pptx detail explanation
 
(Genuine) Escort Service Lucknow | Starting ₹,5K To @25k with A/C 🧑🏽‍❤️‍🧑🏻 89...
(Genuine) Escort Service Lucknow | Starting ₹,5K To @25k with A/C 🧑🏽‍❤️‍🧑🏻 89...(Genuine) Escort Service Lucknow | Starting ₹,5K To @25k with A/C 🧑🏽‍❤️‍🧑🏻 89...
(Genuine) Escort Service Lucknow | Starting ₹,5K To @25k with A/C 🧑🏽‍❤️‍🧑🏻 89...
 
Alluxio Monthly Webinar | Cloud-Native Model Training on Distributed Data
Alluxio Monthly Webinar | Cloud-Native Model Training on Distributed DataAlluxio Monthly Webinar | Cloud-Native Model Training on Distributed Data
Alluxio Monthly Webinar | Cloud-Native Model Training on Distributed Data
 

Build your own speech to text dataset in 30 days

  • 1. MUNICH ARTIFICIAL INTELLIGENCE LABORATORIES REPORTS FROM THE BATTLEFRONT, OR: BUILD YOUR OWN SPEECH TO TEXT DATASET FROM SCRATCH IN 30 DAYS Dmytro Naumov (dima@m-ailabs.bayern) March 2018 Munich Artificial Intelligence Laboratories GmbH (M-AILABS)
  • 2. MUNICH ARTIFICIAL INTELLIGENCE LABORATORIES LET’S START WITH SOME ADVERTISING… 2
  • 3. MUNICH ARTIFICIAL INTELLIGENCE LABORATORIES M-AILABS IN A NUTSHELL • Founded April 1st* • Designs & develops ML/AI-based software engines • 100% owner-financed and profitable • Currently 4 FTE (3x Machine Learning/Neuronal Networks; 1 Administrative) 3 *: Yes, it was in fact founded on April 1st, but the actual registration was on April 3rd because … April 1st 2017 is a Saturday and the registration office doesn’t work on weekends…
  • 4. MUNICH ARTIFICIAL INTELLIGENCE LABORATORIES ARTIFICIAL INTELLIGENCE AT THE HEART OF EUROPE… Our mission is to enable European organizations to take advantage of AI & ML without having to give up control or know-how 4
  • 5. VISION: APPLIED AI/ML We use state-of-the art technologies to develop solutions for, e.g.: 5 MUNICH ARTIFICIAL INTELLIGENCE LABORATORIES Speech Recognition Speech Synthesis Document Classification Object Recognition Text Classification Specialized OCR Signal Processing Specialized Translation
  • 6. MUNICH ARTIFICIAL INTELLIGENCE LABORATORIES TECHNOLOGIES WE USE (ALL OPEN SOURCE) • Classical ML-Algorithms: scikit_learn, numpy, scipy, … • Classical Neural Network Algorithms: scikit_learn, … • Deep Learning-based NN algorithms: s. below • NLP in all its incarnations: NLTK, spaCy, GenSim, … 6 Deep Learning • Torch & PyTorch • Caffe2 • TensorFlow (less and less) • Keras 2 (on top of TensorFlow)
  • 7. MUNICH ARTIFICIAL INTELLIGENCE LABORATORIES A SELECTION OF OUR DEEP LEARNING-HARDWARE 7
  • 9. MUNICH ARTIFICIAL INTELLIGENCE LABORATORIES OK, BACK TO THE TOPIC: SOME BACKGROUND REFRESHER… 9
  • 10. MUNICH ARTIFICIAL INTELLIGENCE LABORATORIES MACHINE LEARNING PROCESS 10 E.g. Raw images
 (>1,000,000) Categorized Data Real & Simulated Tagged Data
 (E.g.: 100,000 Images) Trained Model
 (Days, weeks,…) … Data Labeler Data Scientist Algorithm / Optimization Developer Optimize Operate Validate Accuracy ➠ ➠Collect Data Establish Ground Truth Select DataSet Design & Train Models➠ ➠ ➠ ➠
  • 11. MUNICH ARTIFICIAL INTELLIGENCE LABORATORIES DATA COLLECTION & LABELING (ESTABLISHING GROUND TRUTH) • Thousands of sample data • Each need to be labeled • Each label represents a category • Result: Training material to teach the Machine our own Knowledge 11
  • 12. MUNICH ARTIFICIAL INTELLIGENCE LABORATORIES TASK: DEVELOP A SPEECH-RECOGNITION & SPEECH SYNTHESIS SOLUTION FOR GERMAN (AND SPANISH, AND …) 12
  • 13. MUNICH ARTIFICIAL INTELLIGENCE LABORATORIES “A KINGDOM FOR DATA…” • Read papers, papers, papers, and more papers… • Write an SR-system based on DeepSpeech (Baidu) • Train on LJSpeech-data (english) • … works great! Now to German… 13 • Problems, problems, problems • No public German voice data available • Universities / Colleges have data, but… • … even for testing they want € € € • … you are not allowed to use it for commercial purposes • … and there are other issues as well … “I have an idea: Let’s create our own dataset…”
  • 14. MUNICH ARTIFICIAL INTELLIGENCE LABORATORIES RAW DATA • How much data do we need? • Hundreds of hours per language • How do we generate them? • Hire people to read books sentence by sentence • … or… checkout LibriVox and split them sentence by sentence 14
  • 15. MUNICH ARTIFICIAL INTELLIGENCE LABORATORIES WOW, MORE DIFFICULT THAN THOUGHT… WHERE IS THE SENTENCE? Inhalt Das grüne Haus Die goldne Spinne Mariechen und die Sonne Als es nicht regnen wollte … Das grüne Haus Ja, es ist ein grünes Haus, in dem ich wohne; und alle Märchen und Geschichten, die in diesem Buche stehn, sind darin geschrieben worden. Es ist nicht etwa grün angestrichen wie ein Gartenzaun oder eine … 15 ? ? ?
  • 16. MUNICH ARTIFICIAL INTELLIGENCE LABORATORIES APPROACH 1: USE A HUMAN TO ESTABLISH GROUND-TRUTH FOR EACH AUDIO SEQUENCE MANUALLY • Take the original audio • Using ‘sox’, split audio files separated by pauses • Generate a list of audio-files • Have a person listen to each audio-file and assign the original text to it (CSV-file) 16 • Results: • Quality: excellent • Speed: … well… it is slow, repetitive and boring work • It took very long (at least the same time as the audio-length) - but normally 2x - 4x the audio length • Not really scalable…
  • 17. MUNICH ARTIFICIAL INTELLIGENCE LABORATORIES APPROACH 2: USE OWN TOOLS + GOOGLE/APPLE SPEECH API • Use in-house developed script to split audio files separated by pauses (analyze the whole audio and decide what is pause and what is not) • Generate a list of audio-files • Use Google and Apple Speech APIs to transcribe these files • Use in-house developed ‘trqa’ to perform ‘transcription-QA’ in order to find the “original text” • Results: extremely QA-heavy, but still doable (‘trqa’ partially automated recognition, thus requiring only around 20-30% real QA) 17
  • 18. MUNICH ARTIFICIAL INTELLIGENCE LABORATORIES WE ARE ACTUALLY INTERESTED ABOUT THE END OF THE SENTENCE, NOT BEGINNING Inhalt Das grüne Haus Die goldne Spinne Mariechen und die Sonne Als es nicht regnen wollte … Das grüne Haus Ja, es ist ein grünes Haus, in dem ich wohne; und alle Märchen und Geschichten, die in diesem Buche stehn, sind darin geschrieben worden. Es ist nicht etwa grün angestrichen wie ein Gartenzaun oder eine … 18 ? ? ?
  • 19. MUNICH ARTIFICIAL INTELLIGENCE LABORATORIES APPROACH 3: SPLIT FILES BASED ON TEXT-SENTENCES AND SEARCH FOR THE ENDS OF THOSE TEXTS… • Split original text into manageable “sentences” • Make sure that each “sentence” does not result in audio longer than 15 seconds • Generate audio out of text (TTS) • Compare generated audio (spectrograms, etc) with begin of original recording • … thus finding the end of the original text • Cut the text and the respective audio from beginning of the recording • Rinse, clean, repeat… 19 • Results: • Quite good and fast results • Requires manually preparing the original text (adding/removing some tags) • Remove “intro” and “outro” from audio-files • Reliability depends very much on audio- quality
  • 20. MUNICH ARTIFICIAL INTELLIGENCE LABORATORIES BUT WAIT, WHY DOESN’T MY DNN CONVERGE WELL? I HAVE TONS OF TRAINING DATA, … • Just aligning texts to audio is not enough • We also need to do language-specific post-processing, e.g. for numbers, abbreviations, inflections, etc. • Problem: some languages are more di"cult than others 20
  • 21. MUNICH ARTIFICIAL INTELLIGENCE LABORATORIES POST-PROCESSING (NUMBERS -> TEXT) 21 # de en tr 13 dreizehn (three-ten) thirteen On-üç (ten-three) 27 Sieben-und-zwanzig (seven-and-twenty) twenty-seven Yirmi-yedi (twenty-seven) 127 Hundert-sieben-und-zwanzig (hundred-seven-and-twenty) hundred-twenty-seven Yüz-yirmi-yedi (hundred-twenty-seven) 1127 Tausend-ein-hundert-sieben-und-zwanzig (thousand-one-hundred- seven-and-twenty) Eleven-hundred-twenty-seven Thousand-one-hundred-twenty-seven Bin-yüz-yirmi-yedi (thousand-hundred-twenty- seven) 111127 hundert-elf-tausend-ein-hundert-sieben-und-zwanzig (hundred- eleven-thousand-one-hundred-seven-and-twenty) Hundred-eleven-thousand-one-hundred-twenty- seven Yüz-on-bir-bin-yüz-yirmi-yedi (hundred-ten- one-thousand-hundred-twenty-seven) 1981 Neun-zehn-hundert-ein-und-achtzig (year between 1100-1999) Tausend-neun-hundert-ein-und-achtzig (non-year) Nineteen-hundred-eighty-one (US: sometimes up til 9999) Bin-dokuz-yüz-seksen-bir (thousand-nine- hundred-eighty-one) 9. “Am 9. Juni” = “Am neunten Juni” “9. Juni: …” = “Neunter Juni: …” “9. Haus” = “Neuntes Haus” ninth Dokuz-uncu
  • 22. MUNICH ARTIFICIAL INTELLIGENCE LABORATORIES LEARNINGS, TONS OF LEARNINGS… • Google/Apple transcriptions are not really good (crowd-sourced) • Be careful which tools you use • When developing tools for data labelers, concentrate mostly on UX and then on UX and lastly on UX (for the data labeler) • Make sure you fulfill UX-related wishes of the data labelers 22 • Check your original data • E.g.: audio and text-files must match 100% • Standardize on sample rate, amplification, channels, audio file-formats (but document it, so you can replicate during inference if needed) • Do NOT throw away data (e.g. punctuations, etc. in text) - do it during training or preprocessing for a specific training task • Data preparation & labeling is hard work - do not underestimate it…
  • 23. MUNICH ARTIFICIAL INTELLIGENCE LABORATORIES RESULTS… 23
  • 24. MUNICH ARTIFICIAL INTELLIGENCE LABORATORIES 24 TRAINING DATA (HOURS:MINS) - BY LANGUAGE (V1.0) de_DE en_UK en_US es_ES it_IT uk_UK ru_RU fr_FR* Female 150:56 45:34 63:43 10:37 8:23 10:28 16:04 108:10 Male 39:44 0:00 38:23 72:24 31:45 62:32 30:43 93:32 Mixed 46:42 0:00 00:00 25:33 87:53 0:00 0:00 15:33 TOTAL 237:22 45:34 102:07 108:34 127:40 73:00 46:47 217:15 Quantity +++ + +++ +++ +++ ++ + +++ Quality +++ +++ +++ +++ +++ +++ +++ ++ *: French is still “work-in-progress” Grand Total: 741:04 (~82GB) - excluding fr_FR Additionally planned: pl_PL, nl_NL, tr_TR, pt_PT, pt_BR, ar_…, plus some dialects (de_AT, de_CH, de_DE_SX, de_DE_BY, …)
  • 25. MUNICH ARTIFICIAL INTELLIGENCE LABORATORIES SO, WHAT? WHAT DOES THIS MEAN FOR YOU? 25
  • 26. MUNICH ARTIFICIAL INTELLIGENCE LABORATORIES THE LICENSE Copyright (c) 2017-2018 MUNICH ARTIFICIAL INTELLIGENCE LABORATORIES GmbH Redistribution and use in any form, including any commercial use, with or without modification are permitted - bar the exceptions listed below - provided that the following conditions are met: 1. Redistributions of source data must retain the above copyright notice, this list of conditions and the following disclaimer. 2. Neither the name of the copyright holder nor the names of its contributors may be used to endorse or promote products derived from this downloaded data, source-code or binary-code without specific prior written permission. 3. ANY USE BY ANY UNIVERSITY, COLLEGE, RESEARCH INSTITUTE OR SIMILAR HIGHER EDUCATION INSTITUTION IN EUROPE, INCLUDING BY MEMBERS OF SUCH INSTITUTIONS (including but not limited to the students, tutors and teachers at those institutions), REQUIRES A SEPARATE (free-of-charge) LICENSE AND IS NOT COVERED BY THIS LICENSE AGREEMENT. PLEASE CONTACT US FOR DETAILS AT info@m-ailabs.bayern. THIS DATA IS PROVIDED BY THE COPYRIGHT HOLDERS AND CONTRIBUTORS "AS IS" AND ANY EXPRESS OR IMPLIED WARRANTIES, INCLUDING, BUT NOT LIMITED TO, THE IMPLIED WARRANTIES OF MERCHANTABILITY AND FITNESS FOR A PARTICULAR PURPOSE ARE DISCLAIMED. IN NO EVENT SHALL THE COPYRIGHT HOLDER OR CONTRIBUTORS BE LIABLE FOR ANY DIRECT, INDIRECT, INCIDENTAL, SPECIAL, EXEMPLARY, OR CONSEQUENTIAL DAMAGES (INCLUDING, BUT NOT LIMITED TO, PROCUREMENT OF SUBSTITUTE GOODS OR SERVICES; LOSS OF USE, DATA, OR PROFITS; OR BUSINESS INTERRUPTION) HOWEVER CAUSED AND ON ANY THEORY OF LIABILITY, WHETHER IN CONTRACT, STRICT LIABILITY, OR TORT (INCLUDING NEGLIGENCE OR OTHERWISE) ARISING IN ANY WAY OUT OF THE USE OF THIS SOFTWARE and/ or DATA, EVEN IF ADVISED OF THE POSSIBILITY OF SUCH DAMAGE. 26
  • 27. MUNICH ARTIFICIAL INTELLIGENCE LABORATORIES WHERE TO GET IT Available as of today at: 27 http://data.m-ailabs.eu/speech
  • 28. MUNICH ARTIFICIAL INTELLIGENCE LABORATORIES … mia san do, um zu helfen info@m-ailabs.bayern 28