3. Solutions: Partial transcripts
2019-04-09
• Summaries
– Intersection of output from two ASR systems
– News reports for language models
• OCR errors
– Compare ASR and OCR
– ‘r’ + ‘n’ may look like ‘m’, but do not sound like ‘m’
• Missing fillers/repetitions
– Transducer-based language models
• Divergences from the filed speech
– BERT-style language models can supply synonyms
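
The "intersection of output from two ASR systems" idea can be sketched as follows. This is a minimal illustration, not the method used in the talk: it assumes each system returns a plain-text hypothesis, aligns the two word sequences with Python's standard `difflib.SequenceMatcher`, and keeps only the runs of words on which both agree. The function name `asr_intersection` and the sample sentences are hypothetical.

```python
from difflib import SequenceMatcher


def asr_intersection(hyp_a, hyp_b):
    """Return the runs of words on which two ASR hypotheses agree.

    Assumption: both hypotheses are plain strings; words are compared
    case-insensitively after whitespace tokenization.
    """
    words_a = hyp_a.lower().split()
    words_b = hyp_b.lower().split()

    # autojunk=False turns off difflib's heuristic that ignores very
    # frequent items, which would otherwise discard common words in
    # long transcripts.
    matcher = SequenceMatcher(a=words_a, b=words_b, autojunk=False)

    segments = []
    for block in matcher.get_matching_blocks():
        if block.size > 0:  # the final block is a zero-length sentinel
            segments.append(words_a[block.a:block.a + block.size])
    return segments


if __name__ == "__main__":
    # Hypothetical outputs from two different recognizers.
    system_1 = "the minister said the budget will rise"
    system_2 = "a minister said that budget will rise"
    for segment in asr_intersection(system_1, system_2):
        print(" ".join(segment))
```

Words where the two systems disagree are dropped, so the result is a partial transcript made of agreed segments; whether agreement is a reliable enough signal depends on how independent the two systems' errors are.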