Speech recognition for Riksdag open data

1. Speech recognition for Riksdag open data

2. Challenges: Partial transcripts (2019-04-09)
• Summaries
• OCR errors
• Missing fillers/repetitions
• Divergences from the filed speech

3. Solutions: Partial transcripts
• Summaries
  – Intersection of output from two ASR systems
  – News reports for language models
• OCR errors
  – Compare ASR and OCR
  – ‘r’ + ‘n’ may look like ‘m’, but do not sound like ‘m’
• Missing fillers/repetitions
  – Transducer-based language models
• Divergences from the filed speech
  – BERT-style language models can supply synonyms
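Two of the ideas on the Solutions slide, intersecting the output of two ASR systems and checking OCR text against ASR, can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the method used in the talk: the function names, the `OCR_CONFUSIONS` table, and the example words are hypothetical, and the word alignment is done with the standard-library `difflib.SequenceMatcher` rather than any ASR toolkit.

```python
from difflib import SequenceMatcher

# Hypothetical table of visual confusions: a character sequence and the
# single glyph it can be mistaken for in a scan. Only the slide's example
# ('r' + 'n' looks like 'm') is included.
OCR_CONFUSIONS = {"rn": "m"}


def asr_intersection(hyp_a, hyp_b):
    """Return the runs of words on which two ASR hypotheses agree."""
    a, b = hyp_a.split(), hyp_b.split()
    matcher = SequenceMatcher(a=a, b=b, autojunk=False)
    # get_matching_blocks() ends with a zero-length sentinel; skip it.
    return [a[m.a:m.a + m.size]
            for m in matcher.get_matching_blocks() if m.size > 0]


def visually_equivalent(ocr_word, asr_word, confusions=OCR_CONFUSIONS):
    """True if the two words become identical once look-alike sequences
    are collapsed to the same glyph."""
    def collapse(word):
        for sequence, lookalike in confusions.items():
            word = word.replace(sequence, lookalike)
        return word
    return collapse(ocr_word.lower()) == collapse(asr_word.lower())


def correct_ocr(ocr_word, asr_word):
    """Prefer the ASR word when the OCR word differs from it only by a
    visual confusion; otherwise keep the OCR word unchanged."""
    if ocr_word != asr_word and visually_equivalent(ocr_word, asr_word):
        return asr_word
    return ocr_word


if __name__ == "__main__":
    print(asr_intersection("herr talman jag vill tacka ministern",
                           "herr talman vi vill tacka ministern"))
    print(correct_ocr("bam", "barn"))
```

The design choice here is that ASR acts as the tie-breaker only when the difference is explainable by appearance: ‘barn’ and ‘bam’ look alike but do not sound alike, so an ASR hypothesis of ‘barn’ is taken as evidence that the scanned ‘bam’ is an OCR error.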

4. Opportunities: Language change
• ~70 years of recordings
• Ideal for identifying phonetic changes over time

5. Opportunities: Affected speech
• Rhetorical use of affectations
• “Emotional” speech
• CONS: requires annotation
