SlideShare a Scribd company logo
1 of 24
Download to read offline
Presented by
Soumya Shuchi(14300211047)
Srirupa Das(14300211048)
Subhajit Karmakar(14300211049)
Subhendu Paul(14300211050)
Sumadhura Biswas(14300211051)
Suman Bose(14300211052)
Sumit Kr.Singh(14300211053)
IT Dept. GNIT
1
TABLE OF CONTENTS
 What is voice browser
 Motivation
 Difference between graphical browser and voice
browser
 Possible applications
 W3C
 VoiceXML
 Speech Recognition
 Call control
 TTS
 Voice style sheets
 Conclusion
IT Dept. GNIT
2
WHAT IS A VOICE BROWSER?
 A voice browser is a software application that
presents an interactive voice user interface to
the user in a manner analogous to the
functioning of a web browser.
 Expanding access to the Web.
 Will allow any telephone to be used to access
appropriately designed Web-based services.
IT Dept. GNIT
3
IT Dept. GNIT
4
WHAT IS A VOICE BROWSER?
 Server-based , Voice portals
 Interaction via keypads, spoken
commands, listening to prerecorded
speech, synthetic speech and music.
 An advantage to people with visual
impairment.
 Mobile Web
 Use of the hands during browsing might prove
inconvenient or impossible. Voice input is a
natural solution for such ands-busy situations.
 Even in standard browser applications, using
voice input is simply more fun than the
alternatives.
 Browser replaces the mouse in most instances
to enable hands-free browsing.
IT Dept. GNIT
5
WHY A VOICE BROWSER?
WHY A VOICE BROWSER?
 Voice input provides direct "see and say" access
to links, eliminating the wrist strain associated
with holding the mouse for often hours at a time.
IT Dept. GNIT
6
 Easy to use - for people with no knowledge
or fear of computers.
 Voice Browsers are the next generation of
call centers, which will become Voice Web
portals to the company's services and
related websites, whether accessed via the
telephone network or via the Internet.
IT Dept. GNIT
7
MOTIVATION
 Graphical browsing is more passive due to
the persistence of the visual information .
 Voice browsing is more active since the user
has to issue commands.
 Graphical Browsers can be client-based,
whereas Voice Browsers should be server-
based.
IT Dept. GNIT
8
GRAPHICAL & VOICE BROWSING
POSSIBLE APPLICATIONS
 Accessing business information:
 The corporate "front desk" which asks callers who or
what they want.
 Automated telephone ordering service .
 Airline arrival and departure information.
 Home banking services.
 Accessing public information:
 Community information such as weather, traffic
condition, school closures, directions and events.
IT Dept. GNIT
9
CONTD..
 Local, national and international news.
 National and international stock market
information.
 Business and e-commerce transactions.
 Accessing personal information:
 Voice mail.
 Calendars, address and telephone lists .
 Personal horoscope.
 Personal newsletter.
 To-do lists, shopping lists, and calorie
counters.
IT Dept. GNIT
10
W3C
 The World Wide Web Consortium (W3C) develops
interoperable technologies (specifications,
guidelines, software, and tools) to lead the Web to
its full potential as a forum for information,
commerce, communication, and collective
understanding.
11
IT Dept. GNIT
W3C Speech Interface Framework
 VoiceXML
 Speech Recognition :
1.Speech Grammars 2.Stochastic (N-Gram) Language
Models 3.Semantic Interpretation 4.Pronunciation Lexicon
 Call control
VOICEXML
 VoiceXML is a dialog markup language designed
for telephony applications, where users are
restricted to voice and DTMF (touch tone) input.
 There are other languages: VoXML, omniviewXML
text.html
text.vxml
Web
Server
Internet
Browse
r
IT Dept. GNIT
12
VOICEXML – ARCHITECTURE
SPEECH RECOGNITION
DTMF
Grammars
Speech
Grammars
Stochastic
Language
Models
Semantic
Interpretation
Touch Tone
USER
Speech
IT Dept. GNIT
14
DTMF GRAMMARS
 Touch tone input is often used as an
alternative to speech recognition.
 Especially useful in noisy conditions or
when the social context makes it awkward
to speak.
 The W3C DTMF grammar format allows
authors to specify the expected sequence
of digits, and to bind them to the
appropriate results.
IT Dept. GNIT
15
SPEECH GRAMMARS
 Speech Grammars allow authors to specify
rules covering the sequences of words that
users are expected to say in particular
contexts.
 These contexual clues allow the
recognition engine to focus on likely
utterances, improving the chances of a
correct match.
IT Dept. GNIT
16
STOCHASTIC (N-GRAM) LANGUAGE MODELS
 Speech Grammars are unuseful in case of
open-enden prompt(how can i help u).
 The solution is to use a stochastic
language model. Such models specify the
probability that one word occurs following
certain others. The probabilities are
computed from a collection of utterances
collected from many users.
IT Dept. GNIT
17
SEMANTIC INTERPRETATION
 The recognition process matches an
utterance to a speech grammar, building a
parse tree as a byproduct.
 There are two approaches to harvesting
semantic results from the parse tree:
1. Annotating grammar rules with
semantic interpretation tags.
2. Representing the result in XML.
IT Dept. GNIT
18
PRONUNCIATION LEXICON
o Application developers sometimes need to
ability to tune speech engines, whether for
synthesis or recognition.
o W3C is developing a markup language for
an open portable specification of
pronunciation information using a standard
phonetic alphabet.
o The most commonly needed pronunciations
are for proper nouns such as surnames or
business names.
IT Dept. GNIT
19
CALL CONTROL
 Fine-grained control of speech (signal
processing) resources and telephony
resources in a VoiceXML telephony
platform.
 Will enable application developers to use
markup to perform call screening, whisper
call waiting, call transfer, and more.
 Can be used to transfer a user from one
voice browser to another on a competely
different machine.
IT Dept. GNIT
20
TEXT TO SPEECH SYNTHESIS:
 1. Pre-processing
 2. Text normalization
i) digit normalization
ii) date normalization
iii) abbreviation normalization
 3. Parts of speech annotation
 4. Pronunciation lexicon
 5. Letter to sound rules
 6. Synthesis
IT Dept. GNIT
21
VOICE STYLE SHEETS!
 Volume
 Rate
 Pitch
 Direction
 Spelling out text letter by
letter
 Speech fonts (male/female,
adult/child etc.)
 Inserted text before and after
element content
 Sound effects and music
Authors want
control over how
the document is
rendered. Aural
style sheets
provide basis for
Controlling a
range of features
IT Dept. GNIT
22
CONCLUSION
 If voice browsers are meant to replace
human operator dialog, they must be fast
in response.
 Speech Recognition / Interpretation /
Synthesis depend on implementation
 When a user requests a certain document,
several related documents can be
downloaded for easier access.
IT Dept. GNIT
23
REFERENCES
 www.w3.org/standards/webofdevices/voice
 www.pcworld.com/article/230305/google
 www.hwg.org/opcenter/w3c/voicebrowsers.html
IT Dept. GNIT
24

More Related Content

What's hot

screen less display documentation
screen less display documentationscreen less display documentation
screen less display documentationmani akuthota
 
Gesture Recognition Technology-Seminar PPT
Gesture Recognition Technology-Seminar PPTGesture Recognition Technology-Seminar PPT
Gesture Recognition Technology-Seminar PPTSuraj Rai
 
Computer science seminar topics
Computer science seminar topicsComputer science seminar topics
Computer science seminar topics123seminarsonly
 
Gujarati Text-to-Speech Presentation
Gujarati Text-to-Speech PresentationGujarati Text-to-Speech Presentation
Gujarati Text-to-Speech Presentationsamyakbhuta
 
Screenless display
Screenless displayScreenless display
Screenless displaychnaveed
 
Silverlight
SilverlightSilverlight
SilverlightBiTWiSE
 
Introduction To Mobile Application Development
Introduction To Mobile Application DevelopmentIntroduction To Mobile Application Development
Introduction To Mobile Application DevelopmentSyed Absar
 
A seminar report on speech recognition technology
A seminar report on speech recognition technologyA seminar report on speech recognition technology
A seminar report on speech recognition technologySrijanKumar18
 
bluejacking.ppt
bluejacking.pptbluejacking.ppt
bluejacking.pptAeman Khan
 
Ambient intelligence
Ambient intelligenceAmbient intelligence
Ambient intelligencechandrika95
 
Gesture recognition technology
Gesture recognition technology Gesture recognition technology
Gesture recognition technology Nagamani Gurram
 
Artificial intelligence for speech recognition
Artificial intelligence for speech recognitionArtificial intelligence for speech recognition
Artificial intelligence for speech recognitionsowmith chatlapally
 

What's hot (20)

Touchless touch screen
Touchless touch screenTouchless touch screen
Touchless touch screen
 
GOOGLE GLASS
GOOGLE GLASSGOOGLE GLASS
GOOGLE GLASS
 
screen less display documentation
screen less display documentationscreen less display documentation
screen less display documentation
 
Gesture Recognition Technology-Seminar PPT
Gesture Recognition Technology-Seminar PPTGesture Recognition Technology-Seminar PPT
Gesture Recognition Technology-Seminar PPT
 
Computer science seminar topics
Computer science seminar topicsComputer science seminar topics
Computer science seminar topics
 
Clockless chips
Clockless chipsClockless chips
Clockless chips
 
Google glass ppt
Google glass pptGoogle glass ppt
Google glass ppt
 
Mobile phone-cloning
Mobile phone-cloningMobile phone-cloning
Mobile phone-cloning
 
Gujarati Text-to-Speech Presentation
Gujarati Text-to-Speech PresentationGujarati Text-to-Speech Presentation
Gujarati Text-to-Speech Presentation
 
Screenless display
Screenless displayScreenless display
Screenless display
 
Silverlight
SilverlightSilverlight
Silverlight
 
Introduction To Mobile Application Development
Introduction To Mobile Application DevelopmentIntroduction To Mobile Application Development
Introduction To Mobile Application Development
 
A seminar report on speech recognition technology
A seminar report on speech recognition technologyA seminar report on speech recognition technology
A seminar report on speech recognition technology
 
bluejacking.ppt
bluejacking.pptbluejacking.ppt
bluejacking.ppt
 
Ambient intelligence
Ambient intelligenceAmbient intelligence
Ambient intelligence
 
Gesture recognition technology
Gesture recognition technology Gesture recognition technology
Gesture recognition technology
 
Artificial intelligence for speech recognition
Artificial intelligence for speech recognitionArtificial intelligence for speech recognition
Artificial intelligence for speech recognition
 
Autonomic Computing PPT
Autonomic Computing PPTAutonomic Computing PPT
Autonomic Computing PPT
 
google glass
google glassgoogle glass
google glass
 
Chatbots
ChatbotsChatbots
Chatbots
 

Viewers also liked

Viewers also liked (20)

Voice based web browser
Voice based web browserVoice based web browser
Voice based web browser
 
voice browser
voice browservoice browser
voice browser
 
Voice based email for blinds
Voice based email for blindsVoice based email for blinds
Voice based email for blinds
 
Speech to text conversion
Speech to text conversionSpeech to text conversion
Speech to text conversion
 
PPT on mind reading computer
 PPT on mind reading computer PPT on mind reading computer
PPT on mind reading computer
 
Speech recognition project report
Speech recognition project reportSpeech recognition project report
Speech recognition project report
 
Mind reading computer
Mind reading computerMind reading computer
Mind reading computer
 
Mind reading computer
Mind reading computerMind reading computer
Mind reading computer
 
Face recognition ppt
Face recognition pptFace recognition ppt
Face recognition ppt
 
PIXIE DUST
PIXIE DUSTPIXIE DUST
PIXIE DUST
 
Java Ring
Java RingJava Ring
Java Ring
 
Sensitive skin
Sensitive skinSensitive skin
Sensitive skin
 
GOOGLE BIGTABLE
GOOGLE BIGTABLEGOOGLE BIGTABLE
GOOGLE BIGTABLE
 
Speech recognition system seminar
Speech recognition system seminarSpeech recognition system seminar
Speech recognition system seminar
 
Lamp technology
Lamp technologyLamp technology
Lamp technology
 
Speech Recognition Technology
Speech Recognition TechnologySpeech Recognition Technology
Speech Recognition Technology
 
Biometric Voting System
Biometric Voting SystemBiometric Voting System
Biometric Voting System
 
Honeypots
HoneypotsHoneypots
Honeypots
 
Brain chips
Brain chipsBrain chips
Brain chips
 
E paper
E paperE paper
E paper
 

Similar to Voice Browser Overview: What is it and How Does it Work

Wake-up-word speech recognition using GPS on smart phone
Wake-up-word speech recognition using GPS on smart phoneWake-up-word speech recognition using GPS on smart phone
Wake-up-word speech recognition using GPS on smart phoneIJERA Editor
 
Artificial Intelligence for Speech Recognition
Artificial Intelligence for Speech RecognitionArtificial Intelligence for Speech Recognition
Artificial Intelligence for Speech RecognitionRHIMRJ Journal
 
speech processing and recognition basic in data mining
speech processing and recognition basic in  data miningspeech processing and recognition basic in  data mining
speech processing and recognition basic in data miningJimit Rupani
 
Voice recognition in mobile devices: A Patent Analysis
Voice recognition in mobile devices: A Patent AnalysisVoice recognition in mobile devices: A Patent Analysis
Voice recognition in mobile devices: A Patent AnalysisPrashant Nair
 
Voice/Speech recognition in mobile devices
Voice/Speech recognition in mobile devicesVoice/Speech recognition in mobile devices
Voice/Speech recognition in mobile devicesHarshad Karmarkar
 
“SKYE : Voice Based AI Desktop Assistant”
“SKYE : Voice Based AI Desktop Assistant”“SKYE : Voice Based AI Desktop Assistant”
“SKYE : Voice Based AI Desktop Assistant”IRJET Journal
 
Voice Command Mobile Phone Dialer
Voice Command Mobile Phone DialerVoice Command Mobile Phone Dialer
Voice Command Mobile Phone Dialerijtsrd
 
The Age of Conversational Agents
The Age of Conversational AgentsThe Age of Conversational Agents
The Age of Conversational AgentsFaction XYZ
 
Delivering ivi-speech-applications-white-paper
Delivering  ivi-speech-applications-white-paperDelivering  ivi-speech-applications-white-paper
Delivering ivi-speech-applications-white-papersiavoshani
 
Abstract of speech recognition
Abstract of speech recognitionAbstract of speech recognition
Abstract of speech recognitionVinay Jaisriram
 
Paper on Speech Recognition
Paper on Speech RecognitionPaper on Speech Recognition
Paper on Speech RecognitionThejus Joby
 
Voicexml for farmers portal ppt
Voicexml for farmers portal pptVoicexml for farmers portal ppt
Voicexml for farmers portal pptAshish Mundada
 
Instant speech translation 10BM60080 - VGSOM
Instant speech translation   10BM60080 - VGSOMInstant speech translation   10BM60080 - VGSOM
Instant speech translation 10BM60080 - VGSOMsathiyaseelanm
 

Similar to Voice Browser Overview: What is it and How Does it Work (20)

Hak voice-browser
Hak voice-browserHak voice-browser
Hak voice-browser
 
Phonet
PhonetPhonet
Phonet
 
Wake-up-word speech recognition using GPS on smart phone
Wake-up-word speech recognition using GPS on smart phoneWake-up-word speech recognition using GPS on smart phone
Wake-up-word speech recognition using GPS on smart phone
 
final doc
final docfinal doc
final doc
 
Artificial Intelligence for Speech Recognition
Artificial Intelligence for Speech RecognitionArtificial Intelligence for Speech Recognition
Artificial Intelligence for Speech Recognition
 
speech processing and recognition basic in data mining
speech processing and recognition basic in  data miningspeech processing and recognition basic in  data mining
speech processing and recognition basic in data mining
 
Voice recognition in mobile devices: A Patent Analysis
Voice recognition in mobile devices: A Patent AnalysisVoice recognition in mobile devices: A Patent Analysis
Voice recognition in mobile devices: A Patent Analysis
 
Voice/speech recognition
Voice/speech recognition Voice/speech recognition
Voice/speech recognition
 
Voice/Speech recognition in mobile devices
Voice/Speech recognition in mobile devicesVoice/Speech recognition in mobile devices
Voice/Speech recognition in mobile devices
 
visH (fin).pptx
visH (fin).pptxvisH (fin).pptx
visH (fin).pptx
 
“SKYE : Voice Based AI Desktop Assistant”
“SKYE : Voice Based AI Desktop Assistant”“SKYE : Voice Based AI Desktop Assistant”
“SKYE : Voice Based AI Desktop Assistant”
 
Voice Command Mobile Phone Dialer
Voice Command Mobile Phone DialerVoice Command Mobile Phone Dialer
Voice Command Mobile Phone Dialer
 
Google Voice-to-text
Google Voice-to-textGoogle Voice-to-text
Google Voice-to-text
 
Bt35408413
Bt35408413Bt35408413
Bt35408413
 
The Age of Conversational Agents
The Age of Conversational AgentsThe Age of Conversational Agents
The Age of Conversational Agents
 
Delivering ivi-speech-applications-white-paper
Delivering  ivi-speech-applications-white-paperDelivering  ivi-speech-applications-white-paper
Delivering ivi-speech-applications-white-paper
 
Abstract of speech recognition
Abstract of speech recognitionAbstract of speech recognition
Abstract of speech recognition
 
Paper on Speech Recognition
Paper on Speech RecognitionPaper on Speech Recognition
Paper on Speech Recognition
 
Voicexml for farmers portal ppt
Voicexml for farmers portal pptVoicexml for farmers portal ppt
Voicexml for farmers portal ppt
 
Instant speech translation 10BM60080 - VGSOM
Instant speech translation   10BM60080 - VGSOMInstant speech translation   10BM60080 - VGSOM
Instant speech translation 10BM60080 - VGSOM
 

More from Suman Bose

More from Suman Bose (7)

Online Movie Ticket Booking
Online Movie Ticket BookingOnline Movie Ticket Booking
Online Movie Ticket Booking
 
Online Mobile Phone Recharge
Online Mobile Phone RechargeOnline Mobile Phone Recharge
Online Mobile Phone Recharge
 
Mobile jammer
Mobile jammerMobile jammer
Mobile jammer
 
Captcha
CaptchaCaptcha
Captcha
 
Virusppt
ViruspptVirusppt
Virusppt
 
Blue brain
Blue brainBlue brain
Blue brain
 
IPv6
IPv6IPv6
IPv6
 

Recently uploaded

Complet Documnetation for Smart Assistant Application for Disabled Person
Complet Documnetation   for Smart Assistant Application for Disabled PersonComplet Documnetation   for Smart Assistant Application for Disabled Person
Complet Documnetation for Smart Assistant Application for Disabled Personfurqan222004
 
Call Girls In Mumbai Central Mumbai ❤️ 9920874524 👈 Cash on Delivery
Call Girls In Mumbai Central Mumbai ❤️ 9920874524 👈 Cash on DeliveryCall Girls In Mumbai Central Mumbai ❤️ 9920874524 👈 Cash on Delivery
Call Girls In Mumbai Central Mumbai ❤️ 9920874524 👈 Cash on Deliverybabeytanya
 
Chennai Call Girls Alwarpet Phone 🍆 8250192130 👅 celebrity escorts service
Chennai Call Girls Alwarpet Phone 🍆 8250192130 👅 celebrity escorts serviceChennai Call Girls Alwarpet Phone 🍆 8250192130 👅 celebrity escorts service
Chennai Call Girls Alwarpet Phone 🍆 8250192130 👅 celebrity escorts servicevipmodelshub1
 
定制(Lincoln毕业证书)新西兰林肯大学毕业证成绩单原版一比一
定制(Lincoln毕业证书)新西兰林肯大学毕业证成绩单原版一比一定制(Lincoln毕业证书)新西兰林肯大学毕业证成绩单原版一比一
定制(Lincoln毕业证书)新西兰林肯大学毕业证成绩单原版一比一Fs
 
A Good Girl's Guide to Murder (A Good Girl's Guide to Murder, #1)
A Good Girl's Guide to Murder (A Good Girl's Guide to Murder, #1)A Good Girl's Guide to Murder (A Good Girl's Guide to Murder, #1)
A Good Girl's Guide to Murder (A Good Girl's Guide to Murder, #1)Christopher H Felton
 
定制(UAL学位证)英国伦敦艺术大学毕业证成绩单原版一比一
定制(UAL学位证)英国伦敦艺术大学毕业证成绩单原版一比一定制(UAL学位证)英国伦敦艺术大学毕业证成绩单原版一比一
定制(UAL学位证)英国伦敦艺术大学毕业证成绩单原版一比一Fs
 
Call Girls South Delhi Delhi reach out to us at ☎ 9711199012
Call Girls South Delhi Delhi reach out to us at ☎ 9711199012Call Girls South Delhi Delhi reach out to us at ☎ 9711199012
Call Girls South Delhi Delhi reach out to us at ☎ 9711199012rehmti665
 
定制(CC毕业证书)美国美国社区大学毕业证成绩单原版一比一
定制(CC毕业证书)美国美国社区大学毕业证成绩单原版一比一定制(CC毕业证书)美国美国社区大学毕业证成绩单原版一比一
定制(CC毕业证书)美国美国社区大学毕业证成绩单原版一比一3sw2qly1
 
Font Performance - NYC WebPerf Meetup April '24
Font Performance - NYC WebPerf Meetup April '24Font Performance - NYC WebPerf Meetup April '24
Font Performance - NYC WebPerf Meetup April '24Paul Calvano
 
Chennai Call Girls Porur Phone 🍆 8250192130 👅 celebrity escorts service
Chennai Call Girls Porur Phone 🍆 8250192130 👅 celebrity escorts serviceChennai Call Girls Porur Phone 🍆 8250192130 👅 celebrity escorts service
Chennai Call Girls Porur Phone 🍆 8250192130 👅 celebrity escorts servicesonalikaur4
 
Magic exist by Marta Loveguard - presentation.pptx
Magic exist by Marta Loveguard - presentation.pptxMagic exist by Marta Loveguard - presentation.pptx
Magic exist by Marta Loveguard - presentation.pptxMartaLoveguard
 
VIP Kolkata Call Girl Dum Dum 👉 8250192130 Available With Room
VIP Kolkata Call Girl Dum Dum 👉 8250192130  Available With RoomVIP Kolkata Call Girl Dum Dum 👉 8250192130  Available With Room
VIP Kolkata Call Girl Dum Dum 👉 8250192130 Available With Roomdivyansh0kumar0
 
Potsdam FH学位证,波茨坦应用技术大学毕业证书1:1制作
Potsdam FH学位证,波茨坦应用技术大学毕业证书1:1制作Potsdam FH学位证,波茨坦应用技术大学毕业证书1:1制作
Potsdam FH学位证,波茨坦应用技术大学毕业证书1:1制作ys8omjxb
 
Denver Web Design brochure for public viewing
Denver Web Design brochure for public viewingDenver Web Design brochure for public viewing
Denver Web Design brochure for public viewingbigorange77
 
AlbaniaDreamin24 - How to easily use an API with Flows
AlbaniaDreamin24 - How to easily use an API with FlowsAlbaniaDreamin24 - How to easily use an API with Flows
AlbaniaDreamin24 - How to easily use an API with FlowsThierry TROUIN ☁
 
VIP Call Girls Kolkata Ananya 🤌 8250192130 🚀 Vip Call Girls Kolkata
VIP Call Girls Kolkata Ananya 🤌  8250192130 🚀 Vip Call Girls KolkataVIP Call Girls Kolkata Ananya 🤌  8250192130 🚀 Vip Call Girls Kolkata
VIP Call Girls Kolkata Ananya 🤌 8250192130 🚀 Vip Call Girls Kolkataanamikaraghav4
 
Russian Call Girls in Kolkata Samaira 🤌 8250192130 🚀 Vip Call Girls Kolkata
Russian Call Girls in Kolkata Samaira 🤌  8250192130 🚀 Vip Call Girls KolkataRussian Call Girls in Kolkata Samaira 🤌  8250192130 🚀 Vip Call Girls Kolkata
Russian Call Girls in Kolkata Samaira 🤌 8250192130 🚀 Vip Call Girls Kolkataanamikaraghav4
 
办理(UofR毕业证书)罗切斯特大学毕业证成绩单原版一比一
办理(UofR毕业证书)罗切斯特大学毕业证成绩单原版一比一办理(UofR毕业证书)罗切斯特大学毕业证成绩单原版一比一
办理(UofR毕业证书)罗切斯特大学毕业证成绩单原版一比一z xss
 
VIP Kolkata Call Girl Salt Lake 👉 8250192130 Available With Room
VIP Kolkata Call Girl Salt Lake 👉 8250192130  Available With RoomVIP Kolkata Call Girl Salt Lake 👉 8250192130  Available With Room
VIP Kolkata Call Girl Salt Lake 👉 8250192130 Available With Roomishabajaj13
 

Recently uploaded (20)

Hot Sexy call girls in Rk Puram 🔝 9953056974 🔝 Delhi escort Service
Hot Sexy call girls in  Rk Puram 🔝 9953056974 🔝 Delhi escort ServiceHot Sexy call girls in  Rk Puram 🔝 9953056974 🔝 Delhi escort Service
Hot Sexy call girls in Rk Puram 🔝 9953056974 🔝 Delhi escort Service
 
Complet Documnetation for Smart Assistant Application for Disabled Person
Complet Documnetation   for Smart Assistant Application for Disabled PersonComplet Documnetation   for Smart Assistant Application for Disabled Person
Complet Documnetation for Smart Assistant Application for Disabled Person
 
Call Girls In Mumbai Central Mumbai ❤️ 9920874524 👈 Cash on Delivery
Call Girls In Mumbai Central Mumbai ❤️ 9920874524 👈 Cash on DeliveryCall Girls In Mumbai Central Mumbai ❤️ 9920874524 👈 Cash on Delivery
Call Girls In Mumbai Central Mumbai ❤️ 9920874524 👈 Cash on Delivery
 
Chennai Call Girls Alwarpet Phone 🍆 8250192130 👅 celebrity escorts service
Chennai Call Girls Alwarpet Phone 🍆 8250192130 👅 celebrity escorts serviceChennai Call Girls Alwarpet Phone 🍆 8250192130 👅 celebrity escorts service
Chennai Call Girls Alwarpet Phone 🍆 8250192130 👅 celebrity escorts service
 
定制(Lincoln毕业证书)新西兰林肯大学毕业证成绩单原版一比一
定制(Lincoln毕业证书)新西兰林肯大学毕业证成绩单原版一比一定制(Lincoln毕业证书)新西兰林肯大学毕业证成绩单原版一比一
定制(Lincoln毕业证书)新西兰林肯大学毕业证成绩单原版一比一
 
A Good Girl's Guide to Murder (A Good Girl's Guide to Murder, #1)
A Good Girl's Guide to Murder (A Good Girl's Guide to Murder, #1)A Good Girl's Guide to Murder (A Good Girl's Guide to Murder, #1)
A Good Girl's Guide to Murder (A Good Girl's Guide to Murder, #1)
 
定制(UAL学位证)英国伦敦艺术大学毕业证成绩单原版一比一
定制(UAL学位证)英国伦敦艺术大学毕业证成绩单原版一比一定制(UAL学位证)英国伦敦艺术大学毕业证成绩单原版一比一
定制(UAL学位证)英国伦敦艺术大学毕业证成绩单原版一比一
 
Call Girls South Delhi Delhi reach out to us at ☎ 9711199012
Call Girls South Delhi Delhi reach out to us at ☎ 9711199012Call Girls South Delhi Delhi reach out to us at ☎ 9711199012
Call Girls South Delhi Delhi reach out to us at ☎ 9711199012
 
定制(CC毕业证书)美国美国社区大学毕业证成绩单原版一比一
定制(CC毕业证书)美国美国社区大学毕业证成绩单原版一比一定制(CC毕业证书)美国美国社区大学毕业证成绩单原版一比一
定制(CC毕业证书)美国美国社区大学毕业证成绩单原版一比一
 
Font Performance - NYC WebPerf Meetup April '24
Font Performance - NYC WebPerf Meetup April '24Font Performance - NYC WebPerf Meetup April '24
Font Performance - NYC WebPerf Meetup April '24
 
Chennai Call Girls Porur Phone 🍆 8250192130 👅 celebrity escorts service
Chennai Call Girls Porur Phone 🍆 8250192130 👅 celebrity escorts serviceChennai Call Girls Porur Phone 🍆 8250192130 👅 celebrity escorts service
Chennai Call Girls Porur Phone 🍆 8250192130 👅 celebrity escorts service
 
Magic exist by Marta Loveguard - presentation.pptx
Magic exist by Marta Loveguard - presentation.pptxMagic exist by Marta Loveguard - presentation.pptx
Magic exist by Marta Loveguard - presentation.pptx
 
VIP Kolkata Call Girl Dum Dum 👉 8250192130 Available With Room
VIP Kolkata Call Girl Dum Dum 👉 8250192130  Available With RoomVIP Kolkata Call Girl Dum Dum 👉 8250192130  Available With Room
VIP Kolkata Call Girl Dum Dum 👉 8250192130 Available With Room
 
Potsdam FH学位证,波茨坦应用技术大学毕业证书1:1制作
Potsdam FH学位证,波茨坦应用技术大学毕业证书1:1制作Potsdam FH学位证,波茨坦应用技术大学毕业证书1:1制作
Potsdam FH学位证,波茨坦应用技术大学毕业证书1:1制作
 
Denver Web Design brochure for public viewing
Denver Web Design brochure for public viewingDenver Web Design brochure for public viewing
Denver Web Design brochure for public viewing
 
AlbaniaDreamin24 - How to easily use an API with Flows
AlbaniaDreamin24 - How to easily use an API with FlowsAlbaniaDreamin24 - How to easily use an API with Flows
AlbaniaDreamin24 - How to easily use an API with Flows
 
VIP Call Girls Kolkata Ananya 🤌 8250192130 🚀 Vip Call Girls Kolkata
VIP Call Girls Kolkata Ananya 🤌  8250192130 🚀 Vip Call Girls KolkataVIP Call Girls Kolkata Ananya 🤌  8250192130 🚀 Vip Call Girls Kolkata
VIP Call Girls Kolkata Ananya 🤌 8250192130 🚀 Vip Call Girls Kolkata
 
Russian Call Girls in Kolkata Samaira 🤌 8250192130 🚀 Vip Call Girls Kolkata
Russian Call Girls in Kolkata Samaira 🤌  8250192130 🚀 Vip Call Girls KolkataRussian Call Girls in Kolkata Samaira 🤌  8250192130 🚀 Vip Call Girls Kolkata
Russian Call Girls in Kolkata Samaira 🤌 8250192130 🚀 Vip Call Girls Kolkata
 
办理(UofR毕业证书)罗切斯特大学毕业证成绩单原版一比一
办理(UofR毕业证书)罗切斯特大学毕业证成绩单原版一比一办理(UofR毕业证书)罗切斯特大学毕业证成绩单原版一比一
办理(UofR毕业证书)罗切斯特大学毕业证成绩单原版一比一
 
VIP Kolkata Call Girl Salt Lake 👉 8250192130 Available With Room
VIP Kolkata Call Girl Salt Lake 👉 8250192130  Available With RoomVIP Kolkata Call Girl Salt Lake 👉 8250192130  Available With Room
VIP Kolkata Call Girl Salt Lake 👉 8250192130 Available With Room
 

Voice Browser Overview: What is it and How Does it Work

  • 1. Presented by Soumya Shuchi(14300211047) Srirupa Das(14300211048) Subhajit Karmakar(14300211049) Subhendu Paul(14300211050) Sumadhura Biswas(14300211051) Suman Bose(14300211052) Sumit Kr.Singh(14300211053) IT Dept. GNIT 1
  • 2. TABLE OF CONTENTS  What is voice browser  Motivation  Difference between graphical browser and voice browser  Possible applications  W3C  VoiceXML  Speech Recognition  Call control  TTS  Voice style sheets  Conclusion IT Dept. GNIT 2
  • 3. WHAT IS A VOICE BROWSER?  A voice browser is a software application that presents an interactive voice user interface to the user in a manner analogous to the functioning of a web browser.  Expanding access to the Web.  Will allow any telephone to be used to access appropriately designed Web-based services. IT Dept. GNIT 3
  • 4. IT Dept. GNIT 4 WHAT IS A VOICE BROWSER?  Server-based , Voice portals  Interaction via keypads, spoken commands, listening to prerecorded speech, synthetic speech and music.  An advantage to people with visual impairment.  Mobile Web
  • 5.  Use of the hands during browsing might prove inconvenient or impossible. Voice input is a natural solution for such ands-busy situations.  Even in standard browser applications, using voice input is simply more fun than the alternatives.  Browser replaces the mouse in most instances to enable hands-free browsing. IT Dept. GNIT 5 WHY A VOICE BROWSER?
  • 6. WHY A VOICE BROWSER?  Voice input provides direct "see and say" access to links, eliminating the wrist strain associated with holding the mouse for often hours at a time. IT Dept. GNIT 6
  • 7.  Easy to use - for people with no knowledge or fear of computers.  Voice Browsers are the next generation of call centers, which will become Voice Web portals to the company's services and related websites, whether accessed via the telephone network or via the Internet. IT Dept. GNIT 7 MOTIVATION
  • 8.  Graphical browsing is more passive due to the persistence of the visual information .  Voice browsing is more active since the user has to issue commands.  Graphical Browsers can be client-based, whereas Voice Browsers should be server- based. IT Dept. GNIT 8 GRAPHICAL & VOICE BROWSING
  • 9. POSSIBLE APPLICATIONS  Accessing business information:  The corporate "front desk" which asks callers who or what they want.  Automated telephone ordering service .  Airline arrival and departure information.  Home banking services.  Accessing public information:  Community information such as weather, traffic condition, school closures, directions and events. IT Dept. GNIT 9
  • 10. CONTD..  Local, national and international news.  National and international stock market information.  Business and e-commerce transactions.  Accessing personal information:  Voice mail.  Calendars, address and telephone lists .  Personal horoscope.  Personal newsletter.  To-do lists, shopping lists, and calorie counters. IT Dept. GNIT 10
  • 11. W3C  The World Wide Web Consortium (W3C) develops interoperable technologies (specifications, guidelines, software, and tools) to lead the Web to its full potential as a forum for information, commerce, communication, and collective understanding. 11 IT Dept. GNIT W3C Speech Interface Framework  VoiceXML  Speech Recognition : 1.Speech Grammars 2.Stochastic (N-Gram) Language Models 3.Semantic Interpretation 4.Pronunciation Lexicon  Call control
  • 12. VOICEXML  VoiceXML is a dialog markup language designed for telephony applications, where users are restricted to voice and DTMF (touch tone) input.  There are other languages: VoXML, omniviewXML text.html text.vxml Web Server Internet Browse r IT Dept. GNIT 12
  • 15. DTMF GRAMMARS  Touch tone input is often used as an alternative to speech recognition.  Especially useful in noisy conditions or when the social context makes it awkward to speak.  The W3C DTMF grammar format allows authors to specify the expected sequence of digits, and to bind them to the appropriate results. IT Dept. GNIT 15
  • 16. SPEECH GRAMMARS  Speech Grammars allow authors to specify rules covering the sequences of words that users are expected to say in particular contexts.  These contexual clues allow the recognition engine to focus on likely utterances, improving the chances of a correct match. IT Dept. GNIT 16
  • 17. STOCHASTIC (N-GRAM) LANGUAGE MODELS  Speech Grammars are unuseful in case of open-enden prompt(how can i help u).  The solution is to use a stochastic language model. Such models specify the probability that one word occurs following certain others. The probabilities are computed from a collection of utterances collected from many users. IT Dept. GNIT 17
  • 18. SEMANTIC INTERPRETATION  The recognition process matches an utterance to a speech grammar, building a parse tree as a byproduct.  There are two approaches to harvesting semantic results from the parse tree: 1. Annotating grammar rules with semantic interpretation tags. 2. Representing the result in XML. IT Dept. GNIT 18
  • 19. PRONUNCIATION LEXICON o Application developers sometimes need to ability to tune speech engines, whether for synthesis or recognition. o W3C is developing a markup language for an open portable specification of pronunciation information using a standard phonetic alphabet. o The most commonly needed pronunciations are for proper nouns such as surnames or business names. IT Dept. GNIT 19
  • 20. CALL CONTROL  Fine-grained control of speech (signal processing) resources and telephony resources in a VoiceXML telephony platform.  Will enable application developers to use markup to perform call screening, whisper call waiting, call transfer, and more.  Can be used to transfer a user from one voice browser to another on a competely different machine. IT Dept. GNIT 20
  • 21. TEXT TO SPEECH SYNTHESIS:  1. Pre-processing  2. Text normalization i) digit normalization ii) date normalization iii) abbreviation normalization  3. Parts of speech annotation  4. Pronunciation lexicon  5. Letter to sound rules  6. Synthesis IT Dept. GNIT 21
  • 22. VOICE STYLE SHEETS!  Volume  Rate  Pitch  Direction  Spelling out text letter by letter  Speech fonts (male/female, adult/child etc.)  Inserted text before and after element content  Sound effects and music Authors want control over how the document is rendered. Aural style sheets provide basis for Controlling a range of features IT Dept. GNIT 22
  • 23. CONCLUSION  If voice browsers are meant to replace human operator dialog, they must be fast in response.  Speech Recognition / Interpretation / Synthesis depend on implementation  When a user requests a certain document, several related documents can be downloaded for easier access. IT Dept. GNIT 23