© 2002 IBM Corporation
SPRÅKBANKEN
Technology Platform
IBM WebSphere Voice Server and Toolkit
Språkrådet (the Language Council of Norway), 18 December 2007
“This makes Norway one of the first countries in the world with a freely available speech synthesis program. At the same time, it is clear that this solution does not cover all technical platforms. The Government will encourage vendors and development communities to produce good, freely available solutions for platforms other than Windows.”
“The Government sees it as a challenge to make speech synthesis software available, preferably with free access for those who want it. The software must be distributed through several channels, and when the Internet is used for distribution, it must be possible to store the speech program locally on the computer.”
Heidi Grande Røys, Minister of Government Administration and Reform
Agreement signed 5 January 2007
IBM: Worldwide Voice Team (R&D, LBS)
Boca Raton
Yorktown
Beijing
Tokyo
Seville
Hursley
Rome
Paris
Cairo
Haifa
Böblingen
Prague
Why Voice?
• AT&T, Richard Cox, VP of Voice Enabled Services Research
– Speech is the most natural form of communication. As technology evolves, most people want to communicate with machines as easily as they communicate with each other.
– http://www.research.att.com/news/2002/January/VES.html
• Microsoft
– Voice is the most natural way to communicate. Microsoft Research has a group in Beijing and another in Redmond collaborating to improve spoken-language technologies for both enabling human-computer interaction by voice and enriching human-human voice communication.
– http://research.microsoft.com/speech/
• Christen Krogh, VP Engineering, Opera Software ASA
– "Voice is the most natural and effective way we communicate. In the years to come it will greatly facilitate how we interact with technology," says Christen Krogh, VP Engineering, Opera Software ASA. "By making this technology available today for the wider Web audience, the serious work of voice-enabling the Web can commence."
– http://www.scienceblog.com/community/article2499.html
Speech is the most natural form of interaction
Enable Internet Applications For Access Through Voice
http://www.accuweather.com/weather.vxml
Human-Machine Interaction
Gartner, Steve Cramoysan, Lead Analyst Worldwide on Speech
– “IBM is the fastest growing vendor in Speech market”
– verbal comment, 4Q 2005
•Over Telephones
•With small Devices
•Interaction with PC
IBM Voice Technologies: Selected Key Events
Years marked on the timeline: 1961, 1968, 1969, 1974, 1986, 1992, 1994, 1997, 1999, 2000, 2001, 2002, 2003, 2004, 2005, 2006
Core Technology Research and Invention
– “Shoebox”: 10 digits + 6 commands recognizer
– Speaker Verification System
– Text-to-Speech Synthesis System
– IBM Invention: Hidden Markov Modeling for Speech Recognition
– Real-time PC-Based Isolated Word Dictation (20k)
– Real-Time Continuous Dictation (20k)
Data Entry (Dictation): Arabic, Chinese, French, German, Italian, Japanese, Spanish, US & UK English
– IBM Speech Server Series: Client / Server Solution, Discrete Dictation
– VoiceType Dictation 1.0: PC-based, Discrete Dictation
– ViaVoice 4.0 Gold: Continuous Dictation
– ViaVoice Millennium: >200k words
Interaction Applications: standards-based architectures: >15 languages
– VoiceXML 1.0 specification
– WebSphere Voice Server 1.0
– Embedded ViaVoice 1.0
– X+V: multi-modal standardization
– Automated Contact Center: Natural Dialog Deployed
– eVV: >100k lists (street/city names)
– WVS 2.0: VXML 1.0
– WVS 3.1: Concatenative TTS
– WVS 4.2: VXML 2.0, MRCP 1.0, NLU
– WVS 5.1 + VoiceEnabler: SIP stack
– eVV Multi-Modal Toolkit
– eVV 4.4: free-form commands
Horizon: Information & Communication
– Conversational Biometrics Server
– Audio-Visual Speech Recognition
– Natural-Language Speech-to-Speech Translation
– ViaScribe: Real-Time Video Sub-Titling
– Real-time Broadcast News Transcription
The big picture
Partner Tools
Tools, Utilities
APIs, Plug-Ins
3rd Party & Open Source
Embedded Voice Toolkit
Everyplace Toolkit
Voice Toolkit
WSAD/WSSD or RAD/RWD
Monitoring Workbench
Domino Toolkit
Database Tools
XDE Design Tools
Tivoli, Lotus, DB2, Rational
Software Platform
Multimodal Toolkit
WebSphere Micro Environment Toolkit
LINUX, UNIX/AIX, WINDOWS
WebSphere Voice Server
IVR
CCXML
ASR
VXML
TTS
WebSphere (WAS) foundation
MRCP standard speech interface
MRCP
What is Voice Server?
A WebSphere-based implementation of the industry standard MRCP
• Media Resource Control Protocol (MRCP) is an IETF standard.
• MRCP defines a protocol for accessing speech servers.
• WVS is based on the WebSphere Application Server architecture.
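As a rough illustration of what "a protocol for accessing speech servers" looks like on the wire, the sketch below assembles a text-to-speech request in the style of MRCP version 1, the version listed for WVS 4.2 on the key-events slide. The function name, the request ID, and the SSML sentence are invented for illustration. MRCP version 1 messages are normally carried inside RTSP, which is left out here, so this is not how WebSphere Voice Server itself is invoked.

```python
def build_speak_request(request_id: int, ssml: str) -> bytes:
    """Assemble an MRCPv1-style SPEAK request (illustrative sketch only)."""
    body = ssml.encode("utf-8")
    lines = [
        f"SPEAK {request_id} MRCP/1.0",              # method, request ID, protocol version
        "Content-Type: application/synthesis+ssml",  # the body is SSML markup
        f"Content-Length: {len(body)}",              # body length in bytes, not characters
        "",                                          # blank line ends the header section
    ]
    head = "\r\n".join(lines) + "\r\n"
    return head.encode("ascii") + body


# Hypothetical prompt text; a real application would send its own SSML.
ssml = (
    '<?xml version="1.0"?>'
    '<speak version="1.0" xmlns="http://www.w3.org/2001/10/synthesis" xml:lang="nb-NO">'
    "Velkommen til Språkbanken."
    "</speak>"
)
message = build_speak_request(1, ssml)
print(message.decode("utf-8"))
```

The server side would answer with a status line and, for synthesis, stream the audio over RTP; neither is modelled in this sketch.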
Språkbanken and its associated network
Network diagram: eight WVT (WebSphere Voice Toolkit) labels and one WVS (WebSphere Voice Server) label
Sites: Språkbanken, UiO, UiB/AKSIS, UiT, Medialt, IBM Prague/CTU, IBM Norway, NST++
Voice Technologies for Interaction Applications
http://www.accuweather.com/weather.vxml
Human-Machine Interaction
Telephone
Network
Telephone Gateway
(PSTN, VoIP)
1. Voice Technologies
2. Application Scenarios
Audio
Audio
Audio
Words & Scores
Dialog Voice Technologies
Voice
Application
Browser Verify Speaker
Prompt
Recognize
Proprietary
Proprietary
WVS 5.1 (on WAS)
WAS-based Voice Architecture, Programming Model
Telephone
Network
any VoIP GW
WVR
1. WebSphere Voice Server
2. Programming Model
VoiceEnabler (on WAS) WVS 5.1 (on WAS)
Verify
Prompt
Recognize
Voice team
IBM
Partner
Dialog Server Speech Server
Telephony Gateway
MRCP
SIP
VXML
CCXML
Siplet
VXML
Edge Server
Load Balancer
-workload balancing
-failover
-serviceability
SIP
IP
MRCP
Proprietary
MRCP
Cisco
Genesys
Avaya
VoiceGenie
WVR
Cell
Cell
Some IBM Voice Technology customers
Application Development
• User Interface Design
– How the user interfaces with the application
– How the application delivers information to the user
– “Presentation”
• Application Framework
– Breakdown of application into scalable components
– “Structure”
• Business Logic (a.k.a. “back-end” or data access)
– Database queries based on application logic and user choices
– What the application does inside the “black box”
– “Work”
Note that Structure and Work are orthogonal to the Presentation (which could be graphical, voice, multimodal, or otherwise rendered to the user).
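The separation described on this slide can be sketched in a few lines: one piece of business logic, wired by a small framework function to two interchangeable presentations. All names and the timetable data are invented for illustration and do not come from the toolkit.

```python
from typing import Callable, Dict


# "Work": business logic that knows nothing about how results are presented.
# The timetable is hypothetical sample data.
def lookup_departure(city: str) -> Dict[str, str]:
    timetable = {"Boston": "09:15", "Miami": "14:40"}
    return {"city": city, "time": timetable.get(city, "unknown")}


# "Presentation": two renderings of the same result.
def render_voice(result: Dict[str, str]) -> str:
    return f"The next flight from {result['city']} leaves at {result['time']}."


def render_text(result: Dict[str, str]) -> str:
    return f"{result['city']}: {result['time']}"


# "Structure": the framework connects the logic to whichever presentation is plugged in.
def handle_request(city: str, render: Callable[[Dict[str, str]], str]) -> str:
    return render(lookup_departure(city))


print(handle_request("Boston", render_voice))
print(handle_request("Boston", render_text))
```

Swapping `render_voice` for `render_text` changes only the presentation; the lookup and the wiring stay untouched, which is the sense in which the slide calls them orthogonal.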
Voice User Interface Development Tasks
Dialogue Design
Persona Selection
Iterative Usability
Evaluative Usability
User Experience Research
Refine Persona Prompts
Transcription
Audio Production
Grammar Development
Dictionary Development
Application Development
Recognition Test
Application Testing
Grammar Updates
The steps that go into making a “good” voice application that will provide customer satisfaction
No “coding” necessary until this phase
Post-deployment testing/maintenance
Call Flow Design and Prototype
Voice Toolkit - Overview
Speech User Interface Design
Application Execution Analysis
Audio Prompt Production and Management
Audio File Recording and Analysis
VoiceXML Development and Debugging
Grammar Development and Testing
Voice Portal Development Environment
Pronunciation Validation and Customization
CCXML Development and Validation
Application Simulation and Debugging
VoiceXML Code Generation
MRCP Interface for Grammar Evaluation
One-Stop-Shop Development Solution for Web-Centric Speech Applications
Integration with Rational Tools (RAD/RWD)
A look at the WebSphere Voice Toolkit
Eclipse framework provides default views, icons, layouts, XML editor functionality, etc.
Editor panel allows for multiple files to be edited simultaneously
Navigator panel shows file view of the project
Tabbed views show words with unknown pronunciations or a hierarchical view of your application file
Pronunciation composer enables creation of customized phonetic pronunciations
Informational panels display tasks, warnings, errors, to-dos, file properties, and application logs
Voice Toolkit 6.0 Strategy
• IBM Products/Technology
– Rational (RAD, RWD, RSA)
– WebSphere (WAS, Portal)
– Voice Middleware (WVS, WVAA, WVR)
– Voice Technologies (ASR, TTS)
• Open Standards
– Markup (XML, VoiceXML, CCXML, SRGS, etc.)
– Protocols and connectivity (MRCP, Web, JSP, SIP, etc.)
– Tools (Eclipse, Voice Tools Project, Reusable Dialog Components)
• Ease-of-use and Productivity
– End-to-End integrated development environment
– Graphical modeling
– Code assistance and validation
– Application components
• Partners/Voice Platforms
– Extensibility/Adaptability for other platforms
– Incorporation of additional features and requirements
– Componentization of plugins/tools for bundling by partners and other products
Graphical Enhancements
Communication Flow Builder
Grammar Builder
SIP-based Call Simulator
Multimodal Scenario – Single Field Entry
Miami, Florida
Boston, Massachusetts
9/27/2004
Business Class
Boston,
Massachusetts
Where would you like to travel?
Please say your departure city.
Miami, Florida
Please say your destination city.
Departure date?
September 27th
Seating class?
Business
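The exchange on this slide can be read as a form-filling dialog: the application prompts only for the fields that are still empty. The sketch below is a minimal illustration under that reading. The prompts are taken from the slide, while the field names and the scripted list of answers (standing in for a speech recognizer) are assumptions.

```python
from typing import Dict, Iterator, Optional

# Prompts taken from the slide; the field names are assumptions.
PROMPTS = {
    "departure_city": "Please say your departure city.",
    "destination_city": "Please say your destination city.",
    "departure_date": "Departure date?",
    "seating_class": "Seating class?",
}


def fill_form(form: Dict[str, Optional[str]], answers: Iterator[str]) -> Dict[str, Optional[str]]:
    """Prompt for each empty field in turn; 'answers' stands in for the recognizer."""
    for field, prompt in PROMPTS.items():
        if form.get(field) is None:
            print(f"System: {prompt}")
            form[field] = next(answers)
            print(f"Caller: {form[field]}")
    return form


# The destination is already known (for example, typed on screen),
# so the system skips that prompt.
form = {
    "departure_city": None,
    "destination_city": "Boston, Massachusetts",
    "departure_date": None,
    "seating_class": None,
}
result = fill_form(form, iter(["Miami, Florida", "September 27th", "Business"]))
print(result)
```

In a multimodal application the pre-filled value could come from the keyboard or stylus while the remaining fields are spoken; the loop does not care which modality supplied a value.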
WVS Simulator Architecture
Diagram labels:
VoiceXML Browser
Web Server
HTTP
MRCP Server
MRCP
IVR Gateway
SIP Stack
Voice Toolkit
Virtual Phone
Log Analysis
DB
WAS
SIP Proxy
RTP Audio
ASR
TTS
What are ASR, TTS, SIV, and Voice Dialog?
• Automatic Speech Recognition (Speech-To-Text) is the transformation of speech signals into text and actions
– Goal: most people are recognized speaking naturally in any acoustic environment
– Status: natural sentences, reasonable noise level, wide application range
– Examples: Applications for Banks, Funds, Information Queries
• Speech Synthesis (Text-To-Speech) is the automatic generation of speech signals from text
– Goal: listeners cannot distinguish it from human speech
– Status: “Concatenative TTS” is almost natural, especially when optimized for particular applications
– Examples: General Purpose Out-of-Box Voice, Application and/or Corporate Specific Voices
• Speaker Identity Verification (SIV) verifies the identity of an individual from their speech
– Goal: identify an individual or verify the claim to be a certain individual with highest accuracy
– Status: speaker identity verification is to be introduced to support MRCP V2
– Examples: access to secure applications (phone, PDA), verify through whole interaction
• Voice Dialog models the application's interaction behavior between human and machine
– Goal: speak to an application as if it is a human counterpart
– Status: system can take initiative to request missing elements to complete a transaction
– Examples: Form-Filling Applications (e.g. Bank Transfers), Free-Form Dialogs (e.g. Mutual Funds)
IBM pSeries model 55A
Partitioning
Consolidation
Performance ... licensing
Application support - conversion
Etc.
16 GB memory
Upgrade options without conversion to a new server
Requires little downtime
Requires a new CPU card to be installed (straightforward)
Technology platform - Språkbanken
• WVS:
– HW: x86 or p-series (Power) system with 2 GB RAM (3 GB for "fifth generation voices": higher-quality speech synthesis, some languages only)
– OS: one of the following:
  - AIX 5L v5.3, ML 3
  - Red Hat Enterprise Linux v.3 (Update 3 or 4) or 4 (Update 2)
  - SUSE Linux Enterprise Server v.8, SP 2a or 3
  - Windows 2003 Enterprise or Standard
• VE:
– HW: x86. Caution: VE must run on a different machine from WVS.
– OS: one of the following:
  - Windows 2003 Enterprise or Standard
  - Red Hat Enterprise Linux v.3 (Update 3 or 4) or 4 (Update 2)
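The hardware rules on this slide are simple enough to encode as a pre-installation check. In the sketch below, the `Host` record and the `check_host` function are hypothetical. Only the thresholds (2 GB of RAM, 3 GB for fifth-generation voices) and the rule that VE and WVS need separate machines come from the slide. Operating system versions are not checked.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class Host:
    name: str
    ram_gb: float
    roles: List[str]               # e.g. ["WVS"] or ["VE"]
    fifth_gen_voices: bool = False


def check_host(host: Host) -> List[str]:
    """Return a list of problems; an empty list means the slide's rules are met."""
    problems = []
    if "WVS" in host.roles:
        # 3 GB when fifth-generation voices are used, otherwise 2 GB
        required = 3 if host.fifth_gen_voices else 2
        if host.ram_gb < required:
            problems.append(f"{host.name}: WVS needs {required} GB RAM, found {host.ram_gb}")
    if "WVS" in host.roles and "VE" in host.roles:
        problems.append(f"{host.name}: VE must run on a different machine from WVS")
    return problems


print(check_host(Host("speech1", 2, ["WVS"], fifth_gen_voices=True)))
print(check_host(Host("dialog1", 1, ["VE"])))
```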
IBM language technology on mobile devices
Current projects using IBM WebSphere Voice technology
Further development of the HENRIK voice
The project aims to develop speech technology software for Norwegian language users and to evaluate the value of different systems as support for users with communication, language, speech, reading, and writing difficulties. The project builds on existing technology developed by the company NST in collaboration with IBM, namely the speech synthesis voice called Henrik.
A collaboration between Bredtvet Kompetansesenter, Medialt, Aksis, and IBM
IBM Business Partner
SMUDI
Workshop at IBM Research lab in Prague (preliminary agenda):
– The telephony architecture overview
– Installation of the Voice Enabler
– Installation of the WebSphere Voice Server
– Configuration of the Asterisk PBX
– Installation and use of the Voice Toolkit
– Case study: presentation of the development cycle and usage
Further activities
IBM Research Lab, Prague
Participants
• Medialt
• Språkrådet
• Aksis
• UiO, UiB, NTNU
• IBM Norway
IBM: Worldwide Voice Team (R&D, LBS)
Boca Raton
Yorktown
Beijing
Tokyo
Seville
Hursley
Rome
Paris
Cairo
Haifa
Böblingen
Prague
Oslo

COST-EFFETIVE and Energy Efficient BUILDINGS ptxJIT KUMAR GUPTA
 
8086 Microprocessor Architecture: 16-bit microprocessor
8086 Microprocessor Architecture: 16-bit microprocessor8086 Microprocessor Architecture: 16-bit microprocessor
8086 Microprocessor Architecture: 16-bit microprocessorAshwiniTodkar4
 
Query optimization and processing for advanced database systems
Query optimization and processing for advanced database systemsQuery optimization and processing for advanced database systems
Query optimization and processing for advanced database systemsmeharikiros2
 
1_Introduction + EAM Vocabulary + how to navigate in EAM.pdf
1_Introduction + EAM Vocabulary + how to navigate in EAM.pdf1_Introduction + EAM Vocabulary + how to navigate in EAM.pdf
1_Introduction + EAM Vocabulary + how to navigate in EAM.pdfAldoGarca30
 
Path loss model, OKUMURA Model, Hata Model
Path loss model, OKUMURA Model, Hata ModelPath loss model, OKUMURA Model, Hata Model
Path loss model, OKUMURA Model, Hata ModelDrAjayKumarYadav4
 
Design For Accessibility: Getting it right from the start
Design For Accessibility: Getting it right from the startDesign For Accessibility: Getting it right from the start
Design For Accessibility: Getting it right from the startQuintin Balsdon
 

Recently uploaded (20)

XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
 
Cara Menggugurkan Sperma Yang Masuk Rahim Biyar Tidak Hamil
Cara Menggugurkan Sperma Yang Masuk Rahim Biyar Tidak HamilCara Menggugurkan Sperma Yang Masuk Rahim Biyar Tidak Hamil
Cara Menggugurkan Sperma Yang Masuk Rahim Biyar Tidak Hamil
 
Introduction to Robotics in Mechanical Engineering.pptx
Introduction to Robotics in Mechanical Engineering.pptxIntroduction to Robotics in Mechanical Engineering.pptx
Introduction to Robotics in Mechanical Engineering.pptx
 
Online food ordering system project report.pdf
Online food ordering system project report.pdfOnline food ordering system project report.pdf
Online food ordering system project report.pdf
 
"Lesotho Leaps Forward: A Chronicle of Transformative Developments"
"Lesotho Leaps Forward: A Chronicle of Transformative Developments""Lesotho Leaps Forward: A Chronicle of Transformative Developments"
"Lesotho Leaps Forward: A Chronicle of Transformative Developments"
 
S1S2 B.Arch MGU - HOA1&2 Module 3 -Temple Architecture of Kerala.pptx
S1S2 B.Arch MGU - HOA1&2 Module 3 -Temple Architecture of Kerala.pptxS1S2 B.Arch MGU - HOA1&2 Module 3 -Temple Architecture of Kerala.pptx
S1S2 B.Arch MGU - HOA1&2 Module 3 -Temple Architecture of Kerala.pptx
 
Convergence of Robotics and Gen AI offers excellent opportunities for Entrepr...
Convergence of Robotics and Gen AI offers excellent opportunities for Entrepr...Convergence of Robotics and Gen AI offers excellent opportunities for Entrepr...
Convergence of Robotics and Gen AI offers excellent opportunities for Entrepr...
 
PE 459 LECTURE 2- natural gas basic concepts and properties
PE 459 LECTURE 2- natural gas basic concepts and propertiesPE 459 LECTURE 2- natural gas basic concepts and properties
PE 459 LECTURE 2- natural gas basic concepts and properties
 
NO1 Top No1 Amil Baba In Azad Kashmir, Kashmir Black Magic Specialist Expert ...
NO1 Top No1 Amil Baba In Azad Kashmir, Kashmir Black Magic Specialist Expert ...NO1 Top No1 Amil Baba In Azad Kashmir, Kashmir Black Magic Specialist Expert ...
NO1 Top No1 Amil Baba In Azad Kashmir, Kashmir Black Magic Specialist Expert ...
 
Basic Electronics for diploma students as per technical education Kerala Syll...
Basic Electronics for diploma students as per technical education Kerala Syll...Basic Electronics for diploma students as per technical education Kerala Syll...
Basic Electronics for diploma students as per technical education Kerala Syll...
 
HAND TOOLS USED AT ELECTRONICS WORK PRESENTED BY KOUSTAV SARKAR
HAND TOOLS USED AT ELECTRONICS WORK PRESENTED BY KOUSTAV SARKARHAND TOOLS USED AT ELECTRONICS WORK PRESENTED BY KOUSTAV SARKAR
HAND TOOLS USED AT ELECTRONICS WORK PRESENTED BY KOUSTAV SARKAR
 
UNIT 4 PTRP final Convergence in probability.pptx
UNIT 4 PTRP final Convergence in probability.pptxUNIT 4 PTRP final Convergence in probability.pptx
UNIT 4 PTRP final Convergence in probability.pptx
 
Introduction to Serverless with AWS Lambda
Introduction to Serverless with AWS LambdaIntroduction to Serverless with AWS Lambda
Introduction to Serverless with AWS Lambda
 
Max. shear stress theory-Maximum Shear Stress Theory ​ Maximum Distortional ...
Max. shear stress theory-Maximum Shear Stress Theory ​  Maximum Distortional ...Max. shear stress theory-Maximum Shear Stress Theory ​  Maximum Distortional ...
Max. shear stress theory-Maximum Shear Stress Theory ​ Maximum Distortional ...
 
COST-EFFETIVE and Energy Efficient BUILDINGS ptx
COST-EFFETIVE  and Energy Efficient BUILDINGS ptxCOST-EFFETIVE  and Energy Efficient BUILDINGS ptx
COST-EFFETIVE and Energy Efficient BUILDINGS ptx
 
8086 Microprocessor Architecture: 16-bit microprocessor
8086 Microprocessor Architecture: 16-bit microprocessor8086 Microprocessor Architecture: 16-bit microprocessor
8086 Microprocessor Architecture: 16-bit microprocessor
 
Query optimization and processing for advanced database systems
Query optimization and processing for advanced database systemsQuery optimization and processing for advanced database systems
Query optimization and processing for advanced database systems
 
1_Introduction + EAM Vocabulary + how to navigate in EAM.pdf
1_Introduction + EAM Vocabulary + how to navigate in EAM.pdf1_Introduction + EAM Vocabulary + how to navigate in EAM.pdf
1_Introduction + EAM Vocabulary + how to navigate in EAM.pdf
 
Path loss model, OKUMURA Model, Hata Model
Path loss model, OKUMURA Model, Hata ModelPath loss model, OKUMURA Model, Hata Model
Path loss model, OKUMURA Model, Hata Model
 
Design For Accessibility: Getting it right from the start
Design For Accessibility: Getting it right from the startDesign For Accessibility: Getting it right from the start
Design For Accessibility: Getting it right from the start
 

ibm språkbanken websphere

  • 1. © 2002 IBM Corporation. SPRÅKBANKEN: Technology Platform. IBM WebSphere Voice Server and Toolkit. Språkrådet, 18.12.2007
  • 2. “This makes Norway one of the first countries in the world with a freely available speech synthesis program. At the same time, it is clear that this is not a solution that covers all technical platforms. The Government will encourage vendors and development communities to bring forward freely available, good solutions for platforms other than Windows.” “The Government sees it as a challenge to make speech synthesis software available, preferably with free access for those who want it. The software must be distributed through several channels, and when the Internet is used for distribution, it must be possible to store the speech program locally on the computer.” Heidi Grande Røys, Minister of Government Administration and Reform
  • 3. Agreement signed 5.1.2007
  • 4. IBM: Worldwide Voice Team (R&D, LBS): Boca Raton, Yorktown, Beijing, Tokyo, Seville, Hursley, Rome, Paris, Cairo, Haifa, Böblingen, Prague
  • 5. Why Voice? Speech is the most natural form of interaction.
      AT&T, Richard Cox, VP of Voice Enabled Services Research
        – Speech is the most natural form of communication. As technology evolves, most people want to communicate with machines as easily as they communicate with each other.
        – http://www.research.att.com/news/2002/January/VES.html
      Microsoft
        – Voice is the most natural way to communicate. Microsoft Research has a group in Beijing and another in Redmond collaborating to improve spoken-language technologies, both for enabling human-computer interaction by voice and for enriching human-human voice communication.
        – http://research.microsoft.com/speech/
      Christen Krogh, VP Engineering, Opera Software ASA
        – "Voice is the most natural and effective way we communicate. In the years to come it will greatly facilitate how we interact with technology," says Christen Krogh, VP Engineering, Opera Software ASA. "By making this technology available today for the wider Web audience, the serious work of voice-enabling the Web can commence."
        – http://www.scienceblog.com/community/article2499.html
  • 6. Enable Internet Applications For Access Through Voice. http://www.accuweather.com/weather.vxml. Human-Machine Interaction: over telephones, with small devices, interaction with PC. Gartner, Steve Cramoysan, Lead Analyst Worldwide on Speech: “IBM is the fastest growing vendor in Speech market” (verbal comment, 4Q 2005)
  • 7. IBM Voice Technologies: Selected Key Events (timeline figure; years shown: 1961, 1968, 1969, 1974, 1986, 1992, 1994, 1997, 1999, 2000, 2001, 2002, 2003, 2004, 2005, 2006)
      Phases: Core Technology Research and Invention; Data Entry (Dictation): Arabic, Chinese, French, German, Italian, Japanese, Spanish, US & UK English; Interaction Applications: standards-based architectures, >15 languages; Horizon: Information & Communication
      Events: “Shoebox”: 10 digits + 6 commands recognizer; IBM Invention: Hidden Markov Modeling for Speech Recognition; Speaker Verification System; Text-to-Speech Synthesis System; Real-time PC-Based Isolated Word Dictation (20k); Real-Time Continuous Dictation (20k); IBM Speech Server Series: Client / Server Solution Discrete Dictation; VoiceType Dictation 1.0: PC-based, Discrete Dictation; ViaVoice 4.0 Gold: Continuous Dictation; ViaVoice Millennium: >200k words; VoiceXML 1.0 specification; WebSphere Voice Server 1.0; Embedded ViaVoice 1.0; X+V: multi-modal standardization; WVS 2.0: VXML 1.0; WVS 3.1: Concatenative TTS; WVS 4.2: VXML 2.0, MRCP 1.0, NLU; WVS 5.1 + VoiceEnabler: SIP stack; eVV Multi-Modal Toolkit; eVV 4.4: free-form commands; eVV: >100k lists (street/city names); Automated Contact Center: Natural Dialog Deployed; Conversational Biometrics Server; Audio-Visual Speech Recognition; Natural-Language Speech-to-Speech Translation; ViaScribe: Real-Time Video Sub-Titling; Real-time Broadcast News Transcription
  • 8. The big picture. Partner Tools Tools, Utilities APIs, Plug-Ins 3rd Party & Open Source Embedded Voice Toolkit Everyplace Toolkit Voice Toolkit WSAD/WSSD or RAD/RWD Monitoring Workbench Domino Toolkit Database Tools XDE Design Tools Tivoli Lotus DB2 Rational Software Platform Multimodal Toolkit WebSphere Micro Environment Toolkit LINUX UNIX/AIX WINDOWS
  • 9. WebSphere Voice Server: IVR, CCXML, ASR, VXML, TTS; WebSphere (WAS) foundation; MRCP standard speech interface
  • 10. What is Voice Server? A WebSphere-based implementation of the industry standard MRCP.
      – Media Resource Control Protocol (MRCP) is an IETF standard.
      – MRCP defines a protocol for accessing speech servers.
      – WVS is based on the WebSphere Application Server architecture.
  • 11. Språkbanken and its associated network. Nodes (WVS at the hub, WVT at the sites): Språkbanken, UiO, UiB/AKSIS, UiT, Medialt, IBM Prague/CTU, IBM Norge, NST++
  • 12. Voice Technologies for Interaction Applications (1. Voice Technologies, 2. Application Scenarios). http://www.accuweather.com/weather.vxml. Diagram labels: Human-Machine Interaction; Telephone Network; Telephone Gateway (PSTN, VoIP); Voice Application Browser; Voice Technologies: Verify Speaker, Prompt, Recognize; flows: Audio, Words & Scores, Dialog
  • 13. WAS-based Voice Architecture, Programming Model (1. WebSphere Voice Server, 2. Programming Model). Diagram labels: Telephone Network; Telephony Gateway: any VoIP GW, WVR, Cisco, Genesys, Avaya, VoiceGenie; Dialog Server: VoiceEnabler (on WAS), VXML, CCXML, Siplet; Speech Server: WVS 5.1 (on WAS), Verify, Prompt, Recognize; Edge Server Load Balancer: workload balancing, failover, serviceability; interfaces: SIP, IP, MRCP, VXML, Proprietary; Cell; Voice team; IBM Partner
  • 14. Some IBM Voice Technology customers
  • 15. Application Development
      User Interface Design (“Presentation”)
        – How the user interfaces with the application
        – How the application delivers information to the user
      Application Framework (“Structure”)
        – Breakdown of application into scalable components
      Business Logic, a.k.a. “back-end” or data access (“Work”)
        – Database queries based on application logic and user choices
        – What the application does inside the “black box”
      Note that Structure and Work are orthogonal to the Presentation (which could be graphical, voice, multimodal, or otherwise rendered to the user).
  • 16. Voice User Interface Development Tasks: the steps that go into making a “good” voice application that will provide customer satisfaction. Diagram labels: Dialogue Design Persona Selection Iterative Usability Evaluative Usability User Experience Research Refine Persona Prompts Transcription Audio Production Grammar Development Dictionary Development Application Development Recognition Test Application Testing Grammar Updates Call Flow Design and Prototype. Annotations: No “coding” necessary until this phase; Post-deployment testing/maintenance
  • 17. Voice Toolkit - Overview: One-Stop-Shop Development Solution for Web-Centric Speech Applications. Speech User Interface Design; Application Execution Analysis; Audio Prompt Production and Management; Audio File Recording and Analysis; VoiceXML Development and Debugging; Grammar Development and Testing; Voice Portal Development Environment; Pronunciation Validation and Customization; CCXML Development and Validation; Application Simulation and Debugging; VoiceXML Code Generation; MRCP Interface for Grammar Evaluation; Integration with Rational Tools (RAD/RWD)
  • 18. A look at the WebSphere Voice Toolkit
      – Eclipse framework provides default views, icons, layouts, XML editor functionality, etc.
      – Editor panel allows for multiple files to be edited simultaneously.
      – Navigator panel shows file view of the project.
      – Tabbed views show words with unknown pronunciations or a hierarchical view of your application file.
      – Pronunciation composer enables creation of customized phonetic pronunciations.
      – Informational panels display tasks, warnings, errors, to-dos, file properties, and application logs.
  • 19. Voice Toolkit 6.0 Strategy
      IBM Products/Technology
        – Rational (RAD, RWD, RSA)
        – WebSphere (WAS, Portal)
        – Voice Middleware (WVS, WVAA, WVR)
        – Voice Technologies (ASR, TTS)
      Open Standards
        – Markup (XML, VoiceXML, CCXML, SRGS, etc.)
        – Protocols and connectivity (MRCP, Web, JSP, SIP, etc.)
        – Tools (Eclipse, Voice Tools Project, Reusable Dialog Components)
      Ease-of-use and Productivity
        – End-to-end integrated development environment
        – Graphical modeling
        – Code assistance and validation
        – Application components
      Partners/Voice Platforms
        – Extensibility/adaptability for other platforms
        – Incorporation of additional features and requirements
        – Componentization of plugins/tools for bundling by partners and other products
  • 20. Graphical Enhancements: Communication Flow Builder, Grammar Builder, SIP-based Call Simulator
  • 21. Multimodal Scenario – Single Field Entry
      – Prompt: Where would you like to travel? Please say your departure city. Spoken answer: Miami, Florida
      – Prompt: Please say your destination city. Spoken answer: Boston, Massachusetts
      – Prompt: Departure date? Spoken answer: September 27th
      – Prompt: Seating class? Spoken answer: Business
      – Resulting form values: Miami, Florida; Boston, Massachusetts; 9/27/2004; Business Class
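The prompts in this scenario map naturally onto a VoiceXML form with one field per answer. The following is a minimal sketch in Python, not taken from the slides: it generates such a form with the standard library, using the prompt texts above. The field names and the form id are hypothetical, and the grammars a real field would need are deliberately left out because the deck does not show them.

```python
import xml.etree.ElementTree as ET

VXML_NS = "http://www.w3.org/2001/vxml"

# Field names are hypothetical; the prompt texts come from the slide.
FIELDS = [
    ("departure_city", "Please say your departure city."),
    ("destination_city", "Please say your destination city."),
    ("departure_date", "Departure date?"),
    ("seating_class", "Seating class?"),
]


def build_travel_form(fields=FIELDS):
    """Return a VoiceXML 2.0 document (as a string) with one <field> per entry."""
    vxml = ET.Element("vxml", {"version": "2.0", "xmlns": VXML_NS})
    form = ET.SubElement(vxml, "form", {"id": "travel"})  # hypothetical id

    # Opening question, spoken once before the first field is visited.
    intro = ET.SubElement(form, "block")
    intro.text = "Where would you like to travel?"

    for name, prompt_text in fields:
        field = ET.SubElement(form, "field", {"name": name})
        prompt = ET.SubElement(field, "prompt")
        prompt.text = prompt_text
        # A deployable field also needs a <grammar> (or a built-in type);
        # none is added here because the slides do not specify one.
    return ET.tostring(vxml, encoding="unicode")


if __name__ == "__main__":
    print(build_travel_form())
```

In a multimodal (X+V) page the same fields would also be bound to visual form inputs; that binding is not sketched here.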
  • 22. WVS Simulator Architecture. Diagram labels: Voice Toolkit; Virtual Phone; Log Analysis; SIP Stack; SIP Proxy; IVR Gateway; VoiceXML Browser; Web Server; HTTP; MRCP Server; MRCP; RTP Audio; ASR; TTS; WAS; DB
  • 23. What is ASR, TTS, SIV, Voice Dialog
      Automatic Speech Recognition (Speech-To-Text) is the transformation of speech signals into text and actions
        – Goal: most people are recognized speaking naturally in any acoustic environment
        – Status: natural sentences, reasonable noise level, wide application range
        – Examples: applications for banks, funds, information queries
      Speech Synthesis (Text-To-Speech) is the automatic generation of speech signals from text
        – Goal: listeners cannot distinguish it from human speech
        – Status: “Concatenative TTS” is almost natural, especially when optimized for particular applications
        – Examples: general-purpose out-of-box voice, application- and/or corporate-specific voices
      Speaker Identity Verification (SIV) verifies the identity of an individual from their speech
        – Goal: identify an individual, or verify the claim to be a certain individual, with highest accuracy
        – Status: speaker identity verification is to be introduced, with support for MRCP V2
        – Examples: access to secure applications (phone, PDA), verification throughout the whole interaction
      Voice Dialog models the application's interaction behavior between human and machine
        – Goal: speak to an application as if it were a human counterpart
        – Status: the system can take the initiative to request missing elements to complete a transaction
        – Examples: form-filling applications (e.g. bank transfers), free-form dialogs (e.g. mutual funds)
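The Voice Dialog status line, where the system takes the initiative to request missing elements, can be illustrated with a small form-filling loop. This is a sketch under stated assumptions rather than how WebSphere Voice Server implements dialogs: the slot names, the prompts, and the bank-transfer scenario are invented for illustration, and the `answers` list stands in for speech recognition results.

```python
# Slot names and prompts are illustrative assumptions (a bank transfer form).
PROMPTS = {
    "from_account": "Which account do you want to transfer from?",
    "to_account": "Which account should receive the money?",
    "amount": "How much do you want to transfer?",
}


def run_dialog(initial_slots, answers):
    """Ask only for the slots the caller has not supplied yet.

    initial_slots: values already known, e.g. from the caller's first utterance.
    answers: stand-in for recognition results, consumed in the order asked.
    Returns the filled slots and the list of prompts the system issued.
    """
    slots = {name: initial_slots.get(name) for name in PROMPTS}
    pending = list(answers)
    asked = []
    for name, prompt in PROMPTS.items():
        if slots[name] is not None:
            continue  # already supplied, so the system does not ask again
        if not pending:
            break  # caller stopped answering; leave the slot empty
        asked.append(prompt)
        slots[name] = pending.pop(0)
    return slots, asked


if __name__ == "__main__":
    # The caller opened with "transfer 500", so only the accounts are requested.
    filled, prompts = run_dialog({"amount": "500"}, ["savings", "checking"])
    print(filled)
    print(prompts)
```

A free-form dialog would let a single utterance fill several slots at once; the loop above only covers the simpler directed case.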
  • 24. Technology platform - Språkbanken: IBM pSeries model 55A. Partitioning; consolidation; performance ... licensing; application support - conversion; etc. 16 GB memory. Upgrade options without conversion to a new server: requires little downtime; requires installing a new CPU card (straightforward)
  • 25. System requirements
      WVS:
        – HW: x86 or p-series (Power) system with 2 GB RAM (3 GB for "fifth generation voices": higher quality speech synthesis, some languages only)
        – OS: one of the following:
            – AIX 5L v 5.3, ML 3
            – RedHat Enterprise Linux v.3 (Update 3 or 4) or 4 (Update 2)
            – SUSE Linux Enterprise Server v.8, SP 2a or 3
            – Windows 2003 Enterprise or Standard
      VE:
        – HW: x86. Caution: VE must run on a different machine from WVS.
        – OS: one of the following:
            – Windows 2003 Enterprise or Standard
            – RedHat Enterprise Linux v.3 (Update 3 or 4) or 4 (Update 2)
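Two of the constraints above lend themselves to a quick pre-installation check: the RAM needed on the WVS host and the rule that VE and WVS run on separate machines. The sketch below encodes only those two rules as stated on the slide. The `Deployment` type, its field names, and the host names are hypothetical, and this is not an IBM-supplied tool.

```python
from dataclasses import dataclass

# Values taken from the requirements slide.
MIN_RAM_GB = 2
MIN_RAM_GB_FIFTH_GEN = 3  # for "fifth generation voices"


@dataclass
class Deployment:
    """Hypothetical description of a planned installation."""
    wvs_host: str
    ve_host: str
    wvs_ram_gb: float
    fifth_generation_voices: bool = False


def check_deployment(d):
    """Return a list of problems; an empty list means both rules are met."""
    problems = []
    required = MIN_RAM_GB_FIFTH_GEN if d.fifth_generation_voices else MIN_RAM_GB
    if d.wvs_ram_gb < required:
        problems.append(f"WVS host needs at least {required} GB RAM")
    if d.wvs_host == d.ve_host:
        problems.append("VE must run on a different machine from WVS")
    return problems


if __name__ == "__main__":
    plan = Deployment("wvs01", "wvs01", 2, fifth_generation_voices=True)
    for problem in check_deployment(plan):
        print(problem)
```

Operating system versions are not checked here; they would need to be compared against the lists above by hand or with platform-specific code.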
  • 26. IBM language technology on mobile devices
  • 27. Current projects using IBM WebSphere Voice technology. Further development of the HENRIK voice: the project aims to develop speech technology software for Norwegian language users, and to evaluate the value of different systems as support for users with communication, language, speech, reading and writing difficulties. The project builds on existing technology developed by the company NST in cooperation with IBM, i.e. the speech synthesis voice called Henrik. A collaboration between Bredtvet Kompetansesenter, Medialt, Aksis and IBM. IBM Business Partner SMUDI
  • 28. Further activities: workshop at the IBM Research Lab in Prague (preliminary agenda)
      – The telephony architecture overview
      – Installation of the Voice Enabler
      – Installation of the WebSphere Voice Server
      – Configuration of the Asterisk PBX
      – Installation and use of the Voice Toolkit
      – Case study: presentation of the development cycle and usage
      Participants: Medialt; Språkrådet; Aksis; UiO, UiB, NTNU; IBM Norge
  • 29. IBM: Worldwide Voice Team (R&D, LBS): Boca Raton, Yorktown, Beijing, Tokyo, Seville, Hursley, Rome, Paris, Cairo, Haifa, Böblingen, Prague, Oslo