Unraveling Multimodality with Large Language Models.pdf
Technology Offer Intro
1. SPEEREO™ SPEECH RECOGNITION
TECHNOLOGY OFFER
AND IMPLEMENTATION PREREQUISITES
everything that anyone needs…
Konstantin Lamin Oleg Maleev Daniel Ischenko
CEO CTO, VP of R&D VP of Business Development
lamin@speereo.com maleev@speereo.com d_ischenko@speereo.com
2. SPEEREO SOFTWARE UK: ABOUT
SPEEREO™ SOFTWARE UK offers developing customized state-of-the-art
speaker independent speech interfaces and licensing of Speereo Speech
Recognition System engine for variety of software and hardware
products.
ADVANTAGES OF SPEECH INTERFACE
Unmatched usability: No more mess of menu buttons and options. Say
predefined words for an immediate access or command execution.
Broad functionality: An opportunity to create complex applications with a
simple user-friendly SPEEREO™ Speech SDK.
Charm of novelty: Takes the user to the fantastic world of future.
3. SPEEREOTM SPEECH RECOGNITION TECHNOLOGY
Automatic Speech Recognition (ASR)
Requirements:
Memory from 700KB
CPU performance from 40(80) MIPS
Speech synthesizer (TTS)
Requirements:
Memory from 500KB(2 Mb)
CPU performance from 40(80) MIPS
Signal compression (1400 bit/sec)
Requirements:
Memory from 200 KB
CPU performance from 40 MIPS
4. SPEEREOTM SPEECH RECOGNITION TECHNOLOGY
SVT key technology competence
Accuracy up to 99,8%*, speaker independent,
continuous speech recognition, noise robustness*,
minimum CPU and memory requirements*, automated
Speech command
recognition transcription system, tuned for car noises.
systems (ASR) Languages – English, Russian, German, Spanish.
Platform – WIN, ARM, MIPS, Embedded.
Russian patent #2103753.
Seven-year experience, multiply end-user software for
Smartphones and PPCs, various software awards.
Our partners: Nokia, Sony-Ericsson, Microsoft, Texas
Speech command
Instruments, ARM and many others.
interface design
CORCP for the Speech Recognition System,
#2000610427
* World best results
5. SPEEREOTM SPEECH RECOGNITION TECHNOLOGY
Speereo™ key technology competence
High quality, minimum requirements.
Languages – English, Spanish, French, Russian,
Text-To-Speech
systems (TTS) German, and more to come.
Platforms – WIN, ARM, MIPS, SHx, Embedded, J2M.
Human state
We can recognize speaker tiredness and emotions.
recognition via voice
Our technology and experience are perfect for car command interfaces
design. We have tuned ASR for car noises for best accuracy results and
extremely high cost effectiveness.
6. SPEEREOTM SPEECH RECOGNITION TECHNOLOGY:
IMPLEMENTATION PREREQUISITES
Prerequisites to speech interface for mobile market are obvious:
Users are not satisfied with touch-screens any more – more and more mobile
device manipulations can be done via voice commands.
Standard everyday operations such as contact search, making and receiving
calls, notes’ taking, etc. should be done by simple command and not by number
of manipulations with screen or stylus, or complex menu.
Speech recognition systems that can offer more or less acceptable quality of
work require too much from hardware (memory, storage, etc.).
No other manufacturer can offer speaker independent speech recognition
but Speereo Speech Recognition Engine.
High quality of noise robustness is unavailable to end-user but by Speereo
products.
7. SPEEREOTM SPEECH RECOGNITION TECHNOLOGY:
IMPLEMENTATION PREREQUISITES
And the main thing is…
If we are to be honest to ourselves, we’ll admit that next big and
revolutionary step for any hardware manufacturer will be a successful
implementation of speech recognition technology or voice interface.
There is nowhere to go with touch-screens and styluses and menus are
products of the past.
Next logical step – offer to end-user the most natural way to interact with
your product – speak to it.
Therefore, first manufacturer who understands that and takes a step into
that direction will take the lead and become that talked about and popular
revolutionary who will win over clientele and developers.
8. SPEEREOTM SPEECH RECOGNITION TECHNOLOGY:
IMPLEMENTATION BENEFITS
SPEEREO™ SPEECH RECOGNITION TECHNOLOGY is an ideal tool for platform
owners and manufacturers. OEM/MID or licensing of SSR benefit manufacturer
in several ways:
Unique voice operated software – SPEEREO™ VOICE TRANSLATOR &
SPEEREO™ VOICE ORGANIZER can be already installed and operative upon
mobile device purchase, giving end users basic voice operated functionality
from the start.
SSR reliability will compliment to device and manufacturer value in consumer
eyes and minds.
SSR SDK will be available to third party developers as part of platform SDK
and will ensure a flow of voice operated applications into the market and onto
mobile devices produced.
9. THANK YOU!
Konstantin Lamin Oleg Maleev Daniel Ischenko
CEO CTO, VP of R&D VP of Business
lamin@speereo.com maleev@speereo.com Development
d_ischenko@speereo.
com
SPEEREO SOFTWARE UK
http://www.speereo.com
Tel./fax +7(812)327-4455