This document discusses the creation of a low-cost VoiceXML gateway to replace traditional IVR systems. It uses open-source components including the OpenVXI VoiceXML interpreter, the Festival text-to-speech system, and the Asterisk PBX. These components are integrated to allow VoiceXML applications to be rendered by the gateway and interact with telephone calls managed by Asterisk. The gateway provides a low-cost alternative to commercial VoiceXML platforms, making voice applications more accessible.
The document provides an overview of the Elastix Call Center Addon module. It describes how the module allows for setting up inbound and outbound call campaigns with agents answering calls through a web-based console. The predictive dialer handles automatic call distribution to agents. Key components include the agent console, call management interface, support for the ECCP protocol, and an open-source licensing model.
This document provides instructions for using the Call Center module in Elastix, which allows for automatic outbound calling campaigns and call queue monitoring. Key features include creating agents, forms for collecting call data, outbound call campaigns with uploaded phone numbers, call queues for incoming calls, and reports on call activity. The module requires starting the Elastixdialer service and provides an agent console for making and receiving calls according to defined campaigns and queues.
The document discusses call center management solutions. It describes how call centers provide important customer service support for technical companies and help companies expand their customer bases. Outsourcing call center services allows companies to save money on training and benefits compared to employing full-time customer support staff. The document then outlines some of the key forms and features of the call center management software, including forms for employee details, call registration, and announcements. It provides examples of code used for validation processes and file input/output functions.
This document provides instructions for installing and configuring TekFax version 1.7 fax server software. It describes system requirements, installing the software, and configuring settings for the TekFax service including SIP accounts and SMTP settings. It also explains how to monitor active calls, define fax recipients, and troubleshoot issues using log files.
This document provides instructions for implementing a basic VOIP infrastructure using Xivo and configuring a Cisco SPA512G phone. The key steps include:
1. Installing and configuring Xivo using the installation wizard.
2. Configuring networking components like the HP switch and adding a voice VLAN.
3. Configuring autoprovisioning for the Cisco phone and installing the appropriate plugin.
4. Creating SIP lines, users, and associating the Cisco phone as an endpoint in Xivo.
This document is a project report submitted for a Bachelor's degree in Technology. It discusses developing a voice-based application for restaurants using Voice XML (VXML).
The introduction provides background on VXML and how it could help improve the customer experience for ordering food and making reservations at restaurants. It notes the current human-based interface in most restaurants and proposes features like placing orders and booking events over the phone using VXML.
The report is organized into sections covering the software requirements, design, implementation, testing and screenshots of the VXML application created for restaurants. It concludes by discussing opportunities for future improvements to the system.
This document provides an overview of Voice XML, a language used for building voice user interfaces and telephone applications. It discusses how Voice XML uses speech recognition, text-to-speech, and DTMF inputs. It also summarizes some of the major elements and goals of Voice XML, including simplifying application development, separating interaction code from service logic, and allowing voice-enabled web applications. Finally, it provides some examples of applications that can be built with Voice XML, such as voice portals and commerce applications.
Customized IVR Implementation Using Voicexml on SIP (Voip) Communication Plat...IJMER
This document discusses implementing customized interactive voice response (IVR) applications using VoiceXML on a VoIP communication platform. It provides an overview of key concepts like VoIP, SIP, IPPBX, and VoiceXML. VoIP allows voice calls over IP networks using protocols like SIP. An IPPBX functions similar to a traditional PBX but routes calls over an IP network. VoiceXML is used to build voice dialog applications with features like speech recognition, prompts, and grammars. The document proposes using a VoiceXML browser integrated with an Asterisk gateway and IPPBX to enable voice applications with speech input/output that can be customized for different organizations.
The document provides an overview of the Elastix Call Center Addon module. It describes how the module allows for setting up inbound and outbound call campaigns with agents answering calls through a web-based console. The predictive dialer handles automatic call distribution to agents. Key components include the agent console, call management interface, support for the ECCP protocol, and an open-source licensing model.
This document provides instructions for using the Call Center module in Elastix, which allows for automatic outbound calling campaigns and call queue monitoring. Key features include creating agents, forms for collecting call data, outbound call campaigns with uploaded phone numbers, call queues for incoming calls, and reports on call activity. The module requires starting the Elastixdialer service and provides an agent console for making and receiving calls according to defined campaigns and queues.
The document discusses call center management solutions. It describes how call centers provide important customer service support for technical companies and help companies expand their customer bases. Outsourcing call center services allows companies to save money on training and benefits compared to employing full-time customer support staff. The document then outlines some of the key forms and features of the call center management software, including forms for employee details, call registration, and announcements. It provides examples of code used for validation processes and file input/output functions.
This document provides instructions for installing and configuring TekFax version 1.7 fax server software. It describes system requirements, installing the software, and configuring settings for the TekFax service including SIP accounts and SMTP settings. It also explains how to monitor active calls, define fax recipients, and troubleshoot issues using log files.
This document provides instructions for implementing a basic VOIP infrastructure using Xivo and configuring a Cisco SPA512G phone. The key steps include:
1. Installing and configuring Xivo using the installation wizard.
2. Configuring networking components like the HP switch and adding a voice VLAN.
3. Configuring autoprovisioning for the Cisco phone and installing the appropriate plugin.
4. Creating SIP lines, users, and associating the Cisco phone as an endpoint in Xivo.
This document is a project report submitted for a Bachelor's degree in Technology. It discusses developing a voice-based application for restaurants using Voice XML (VXML).
The introduction provides background on VXML and how it could help improve the customer experience for ordering food and making reservations at restaurants. It notes the current human-based interface in most restaurants and proposes features like placing orders and booking events over the phone using VXML.
The report is organized into sections covering the software requirements, design, implementation, testing and screenshots of the VXML application created for restaurants. It concludes by discussing opportunities for future improvements to the system.
This document provides an overview of Voice XML, a language used for building voice user interfaces and telephone applications. It discusses how Voice XML uses speech recognition, text-to-speech, and DTMF inputs. It also summarizes some of the major elements and goals of Voice XML, including simplifying application development, separating interaction code from service logic, and allowing voice-enabled web applications. Finally, it provides some examples of applications that can be built with Voice XML, such as voice portals and commerce applications.
Customized IVR Implementation Using Voicexml on SIP (Voip) Communication Plat...IJMER
This document discusses implementing customized interactive voice response (IVR) applications using VoiceXML on a VoIP communication platform. It provides an overview of key concepts like VoIP, SIP, IPPBX, and VoiceXML. VoIP allows voice calls over IP networks using protocols like SIP. An IPPBX functions similar to a traditional PBX but routes calls over an IP network. VoiceXML is used to build voice dialog applications with features like speech recognition, prompts, and grammars. The document proposes using a VoiceXML browser integrated with an Asterisk gateway and IPPBX to enable voice applications with speech input/output that can be customized for different organizations.
This document discusses various XML applications and technologies. It describes 13 different XML technologies including Wireless Markup Language (WML), XML News, Channel Definition Format (CDF), Open Software Distribution (OSD), Open Financial Exchange (OFX), RDF/XML, Mathematical Markup Language (MathML), Platform for Privacy Preferences Project (P3P), Human Resource Management Markup Language (HRMML), Voice Extensible Markup Language (VXML), Vector Markup Language (VML), Scalable Vector Graphics (SVG), and Synchronized Multimedia Integration Language (SMIL). Each technology is briefly described and its advantages are listed.
Phonologies was @ at Cluecon 2011, the annual 3-Day Open Source Telephony User and Developer Conference in Chicago, IL. Phonologies is an active contributor to the Open Source Community, promising quality, reliability, flexibility, lower cost and an end to predatory vendor lock-in. Phonologies has contributed to the Open Source Community by distributing freely portions of source code of Oktopous™ ccXML Engine under a BSD style license.
IMA/Thales EchoVoice (VOIP) for OpenSimulator Presentation at OSCC19Lisa Laxton
IMA/Thales EchoVoice (VOIP) for OpenSimulator
Frank Rulof, Seth Nygard, Lisa Laxton, Natacha Bru
Presentation Abstract: This presentation from Infinite Metaverse Alliance® (IMA) and Thales Group discusses progress towards improving open source code used in a self-hosted secure voice solution called EchoVoice. This is an alternative to ViVox which is commonly used for voice communication between avatars in OpenSimulator regions. The discussion is focused on the development work done to provide a modernized solution for HyperGrid-enabled regions as well as planned enhancements not currently available. The work of IMA and Thales is in general directed toward broadening use of the Metaverse for Public, Education, Industry and Government sectors but the community as a whole benefits from open source.
Target Audience & Outcomes: Participants from the OpenSimulator Community at large will learn about enhancements, features, and improvements IMA and Thales are working on together to deliver an open source solution that meets the needs of a broader OpenSimulator community.
@IMATalks
Open Source Telecom Software Landscape by Alan QuayleAlan Quayle
There are tens of successful Open Source Telecom Software projects, with vibrant communities supporting them. Asterisk, the most successful and longest running is 20 years old. We heard from them earlier in the agenda.
We’ll review the many other projects to help people understand the depth, breadth and rapidly evolving nature of these projects. As we’ll as some of the new projects like Drachtio.org, and used recently at TADHack-mini Orlando Online 2020, for the hack SMB Reschedule.
We’ll also review the results from an anonymous survey of open source telecom software. Comparing the different projects, the preferred application areas of the projects, common issues and solutions, sharing where the industry sees these projects in 5 years’ time.
The document discusses voice browsers, which allow users to interact with computer systems using voice rather than text. It describes how voice browsers use speech recognition to understand spoken input and speech synthesis to provide audible responses. The key technologies that enable voice browsers are speech recognition, speech synthesis, and VoiceXML. Voice browsers have applications in areas like web browsing, information access, and dialog systems. The future of voice browsers is expected to include integration with visual browsers and operating systems.
The document discusses voice browsers, which allow users to interact with computer systems using voice rather than text. It describes how voice browsers use speech recognition to understand spoken input and speech synthesis to provide audible responses. The key technologies that enable voice browsers are speech recognition, speech synthesis, and VoiceXML. Voice browsers have applications in areas like web browsing, information access, and dialog systems. The future of voice browsers includes improved integration with other technologies and operating systems.
VoiceXML is an XML-based markup language that allows the creation of audio dialogs utilizing synthesized speech, recorded audio, speech recognition, and touch-tone input. It was designed to work with technologies like text-to-speech, automatic speech recognition, and enable integration with enterprise software. VoiceXML provides a common language for developers, tool providers, and platform providers to build voice applications and services while shielding them from low-level implementation details. Some examples of applications include order inquiries, package tracking, driving directions, emergency notifications, and voice access to email.
The document provides an overview of VoiceXML, including its history as a standard developed by AT&T, IBM, Lucent, and Motorola, how it allows voice applications to be developed and deployed analogously to HTML, and some of its key features and advantages such as supporting automatic speech recognition, dialog management, and audio playback through the use of tags.
A voice browser interprets voice input and generates voice output by interpreting voice markup languages. It interprets scripts that specify the verbal information to present to users and when. Voice browsers allow hands-free web access and are a pragmatic interface for visually impaired users. Standards for voice browsers and their components are developed by the W3C Voice Browser Working Group.
The document discusses emerging standards for voice and telephony technologies. It outlines components of a voice solution including VoiceXML for dialogs, CCXML for call control, and MRCP for media resources. The benefits of open standards are described for building modular applications that can leverage existing web technologies and be deployed across platforms. Key standards bodies like the W3C Voice Browser Working Group are developing standards like VoiceXML, CCXML, and SSML to grow the voice web industry through interoperability.
Voice browser is a device that interprets voice input and generates voice output. It interprets scripts specifying what verbal information to present to users and when. Voice browsers were motivated by the fact that there are more phones than computers globally, phone usage is growing, and speaking is natural for phones. Key technologies include speech recognition to convert voice to text, speech synthesis to convert text to voice output, and VoiceXML for building voice applications. Voice browsers allow accessing information and services by voice on mobile devices.
CCXML For Advanced Communications ApplicationsVoxeo Corp
CCXML is a W3C standard for call control that complements VoiceXML. It allows building advanced telephony applications through call setup, bridging to VoiceXML dialogs, call transferring, conferencing and more. The document outlines CCXML's capabilities and provides examples of basic applications like call screening and advanced applications like a voice chat system. It also profiles several commercial solutions that utilize CCXML for call routing, conferencing and CRM integration.
Interfaces de Voz avanzadas con VoiceXML - Iván Sixto | VoIP2DAY 2015VOIP2DAY
This document provides an overview of advanced voice interfaces using VoiceXML. It discusses interactive voice response (IVR) systems, how they work using automatic speech recognition and text-to-speech, and their applications in telephony. It also summarizes VoiceXML, the W3C standard language for building voice user interfaces, its history and how it works similarly to web technologies using XML and HTTP. Examples of basic VoiceXML code are provided.
This document discusses the history and development of the Virtual Reality Modeling Language (VRML). It describes how VRML was conceived as a way to add three-dimensional graphical content to the World Wide Web. A mailing list was created where participants agreed on initial requirements for VRML 1.0 and decided to base the specification on the Open Inventor file format from Silicon Graphics. The specification was developed over several months and finalized at a conference in 1994.
This document discusses IBM's WebSphere Voice Server and Toolkit technology platform. It notes that Norway is one of the first countries in the world to have a freely available text-to-speech program. The government wants to encourage suppliers and developers to provide free solutions for other platforms besides Windows. It also wants to make speech synthesis software freely available and distributed via multiple channels including the internet.
WebRTC is said to create the next Internet-based revolution of telecommunications. Here FRAFOS explains what that means in practice and how WebRTC can be employed in real-life situations.
Voice over IP (VOIP) Security Research- A ResearchIJMER
This document summarizes research on Voice over IP (VoIP) security. It begins with an overview of SIP (Session Initiation Protocol), a commonly used VoIP standard, and a taxonomy of VoIP security threats. It then surveys existing research on VoIP security classified according to the threat categories. The research covers threats like eavesdropping, denial of service attacks, toll fraud, and spam over IP telephony (SPIT). Approaches studied include encryption, authentication, reputation systems, audio fingerprinting, and Turing tests to detect automated SPIT callers. The goal is to identify gaps and guide future work on analyzing VoIP attackers and improving the security and resilience of VoIP systems.
Voice over Internet Protocol with Novel Applicationsirjes
This document summarizes Voice over Internet Protocol (VoIP) technology. It discusses three types of VoIP calls: PC to PC, PC to phone, and phone to phone. It also explains some key VoIP protocols like H.323, Session Initiation Protocol (SIP), and VoiceXML. The document provides an overview of how VoIP works, its advantages over traditional phone systems, and examples of VoIP applications.
Voice over Internet Protocol (VoIP) is replacing legacy telephone networks by carrying digitized voice in IP data packets over data networks. This chapter introduces VoIP, comparing it to legacy telephone networks, and discusses VoIP standards and protocols. It also introduces WiMAX networks and discusses supporting QoS for multimedia like VoIP over WiMAX. The objectives are to guarantee QoS for multiple service classes over WiMAX and improve VoIP performance. Simulation using OPNET Modeler will analyze VoIP traffic and QoS parameters over WiMAX.
Abstract
The technology of voice browsing is rapidly evolving these days. It is because the use of cell phones is increasing at a very high rate, as compared to connected PCs. Listening and speaking are the natural modes of communication and information gathering. As a result we are now heading towards a more voice based approach of browsing rather than operating on textual mode. The command input and the delivery of web contents are entirely in voice. A voice browser is a device: that interprets voice input and interprets voice markup languages to generate voice output. That interprets a script which specifies exactly what to verbally present to the user as well as when to present each piece of information. Benefits Voice is a very natural user interface which speeds up browsing.
HD Video Player All Format - 4k & live streamHD Video Player
Discover the best video playback experience with HD Video Player. Our powerful, user-friendly app supports all popular video formats and codecs, ensuring seamless playback of your favorite videos in stunning HD and 4K quality. Whether you're watching movies, TV shows, or personal videos, HD Video Player provides the ultimate viewing experience on your device. 🚀
This document discusses various XML applications and technologies. It describes 13 different XML technologies including Wireless Markup Language (WML), XML News, Channel Definition Format (CDF), Open Software Distribution (OSD), Open Financial Exchange (OFX), RDF/XML, Mathematical Markup Language (MathML), Platform for Privacy Preferences Project (P3P), Human Resource Management Markup Language (HRMML), Voice Extensible Markup Language (VXML), Vector Markup Language (VML), Scalable Vector Graphics (SVG), and Synchronized Multimedia Integration Language (SMIL). Each technology is briefly described and its advantages are listed.
Phonologies was @ at Cluecon 2011, the annual 3-Day Open Source Telephony User and Developer Conference in Chicago, IL. Phonologies is an active contributor to the Open Source Community, promising quality, reliability, flexibility, lower cost and an end to predatory vendor lock-in. Phonologies has contributed to the Open Source Community by distributing freely portions of source code of Oktopous™ ccXML Engine under a BSD style license.
IMA/Thales EchoVoice (VOIP) for OpenSimulator Presentation at OSCC19Lisa Laxton
IMA/Thales EchoVoice (VOIP) for OpenSimulator
Frank Rulof, Seth Nygard, Lisa Laxton, Natacha Bru
Presentation Abstract: This presentation from Infinite Metaverse Alliance® (IMA) and Thales Group discusses progress towards improving open source code used in a self-hosted secure voice solution called EchoVoice. This is an alternative to ViVox which is commonly used for voice communication between avatars in OpenSimulator regions. The discussion is focused on the development work done to provide a modernized solution for HyperGrid-enabled regions as well as planned enhancements not currently available. The work of IMA and Thales is in general directed toward broadening use of the Metaverse for Public, Education, Industry and Government sectors but the community as a whole benefits from open source.
Target Audience & Outcomes: Participants from the OpenSimulator Community at large will learn about enhancements, features, and improvements IMA and Thales are working on together to deliver an open source solution that meets the needs of a broader OpenSimulator community.
@IMATalks
Open Source Telecom Software Landscape by Alan QuayleAlan Quayle
There are tens of successful Open Source Telecom Software projects, with vibrant communities supporting them. Asterisk, the most successful and longest running is 20 years old. We heard from them earlier in the agenda.
We’ll review the many other projects to help people understand the depth, breadth and rapidly evolving nature of these projects. As we’ll as some of the new projects like Drachtio.org, and used recently at TADHack-mini Orlando Online 2020, for the hack SMB Reschedule.
We’ll also review the results from an anonymous survey of open source telecom software. Comparing the different projects, the preferred application areas of the projects, common issues and solutions, sharing where the industry sees these projects in 5 years’ time.
The document discusses voice browsers, which allow users to interact with computer systems using voice rather than text. It describes how voice browsers use speech recognition to understand spoken input and speech synthesis to provide audible responses. The key technologies that enable voice browsers are speech recognition, speech synthesis, and VoiceXML. Voice browsers have applications in areas like web browsing, information access, and dialog systems. The future of voice browsers is expected to include integration with visual browsers and operating systems.
The document discusses voice browsers, which allow users to interact with computer systems using voice rather than text. It describes how voice browsers use speech recognition to understand spoken input and speech synthesis to provide audible responses. The key technologies that enable voice browsers are speech recognition, speech synthesis, and VoiceXML. Voice browsers have applications in areas like web browsing, information access, and dialog systems. The future of voice browsers includes improved integration with other technologies and operating systems.
VoiceXML is an XML-based markup language that allows the creation of audio dialogs utilizing synthesized speech, recorded audio, speech recognition, and touch-tone input. It was designed to work with technologies like text-to-speech, automatic speech recognition, and enable integration with enterprise software. VoiceXML provides a common language for developers, tool providers, and platform providers to build voice applications and services while shielding them from low-level implementation details. Some examples of applications include order inquiries, package tracking, driving directions, emergency notifications, and voice access to email.
The document provides an overview of VoiceXML, including its history as a standard developed by AT&T, IBM, Lucent, and Motorola, how it allows voice applications to be developed and deployed analogously to HTML, and some of its key features and advantages such as supporting automatic speech recognition, dialog management, and audio playback through the use of tags.
A voice browser interprets voice input and generates voice output by interpreting voice markup languages. It interprets scripts that specify the verbal information to present to users and when. Voice browsers allow hands-free web access and are a pragmatic interface for visually impaired users. Standards for voice browsers and their components are developed by the W3C Voice Browser Working Group.
The document discusses emerging standards for voice and telephony technologies. It outlines components of a voice solution including VoiceXML for dialogs, CCXML for call control, and MRCP for media resources. The benefits of open standards are described for building modular applications that can leverage existing web technologies and be deployed across platforms. Key standards bodies like the W3C Voice Browser Working Group are developing standards like VoiceXML, CCXML, and SSML to grow the voice web industry through interoperability.
Voice browser is a device that interprets voice input and generates voice output. It interprets scripts specifying what verbal information to present to users and when. Voice browsers were motivated by the fact that there are more phones than computers globally, phone usage is growing, and speaking is natural for phones. Key technologies include speech recognition to convert voice to text, speech synthesis to convert text to voice output, and VoiceXML for building voice applications. Voice browsers allow accessing information and services by voice on mobile devices.
CCXML For Advanced Communications ApplicationsVoxeo Corp
CCXML is a W3C standard for call control that complements VoiceXML. It allows building advanced telephony applications through call setup, bridging to VoiceXML dialogs, call transferring, conferencing and more. The document outlines CCXML's capabilities and provides examples of basic applications like call screening and advanced applications like a voice chat system. It also profiles several commercial solutions that utilize CCXML for call routing, conferencing and CRM integration.
Interfaces de Voz avanzadas con VoiceXML - Iván Sixto | VoIP2DAY 2015VOIP2DAY
This document provides an overview of advanced voice interfaces using VoiceXML. It discusses interactive voice response (IVR) systems, how they work using automatic speech recognition and text-to-speech, and their applications in telephony. It also summarizes VoiceXML, the W3C standard language for building voice user interfaces, its history and how it works similarly to web technologies using XML and HTTP. Examples of basic VoiceXML code are provided.
This document discusses the history and development of the Virtual Reality Modeling Language (VRML). It describes how VRML was conceived as a way to add three-dimensional graphical content to the World Wide Web. A mailing list was created where participants agreed on initial requirements for VRML 1.0 and decided to base the specification on the Open Inventor file format from Silicon Graphics. The specification was developed over several months and finalized at a conference in 1994.
This document discusses IBM's WebSphere Voice Server and Toolkit technology platform. It notes that Norway is one of the first countries in the world to have a freely available text-to-speech program. The government wants to encourage suppliers and developers to provide free solutions for other platforms besides Windows. It also wants to make speech synthesis software freely available and distributed via multiple channels including the internet.
WebRTC is said to create the next Internet-based revolution of telecommunications. Here FRAFOS explains what that means in practice and how WebRTC can be employed in real-life situations.
Voice over IP (VOIP) Security Research- A ResearchIJMER
This document summarizes research on Voice over IP (VoIP) security. It begins with an overview of SIP (Session Initiation Protocol), a commonly used VoIP standard, and a taxonomy of VoIP security threats. It then surveys existing research on VoIP security classified according to the threat categories. The research covers threats like eavesdropping, denial of service attacks, toll fraud, and spam over IP telephony (SPIT). Approaches studied include encryption, authentication, reputation systems, audio fingerprinting, and Turing tests to detect automated SPIT callers. The goal is to identify gaps and guide future work on analyzing VoIP attackers and improving the security and resilience of VoIP systems.
Voice over Internet Protocol with Novel Applicationsirjes
This document summarizes Voice over Internet Protocol (VoIP) technology. It discusses three types of VoIP calls: PC to PC, PC to phone, and phone to phone. It also explains some key VoIP protocols like H.323, Session Initiation Protocol (SIP), and VoiceXML. The document provides an overview of how VoIP works, its advantages over traditional phone systems, and examples of VoIP applications.
Voice over Internet Protocol (VoIP) is replacing legacy telephone networks by carrying digitized voice in IP data packets over data networks. This chapter introduces VoIP, comparing it to legacy telephone networks, and discusses VoIP standards and protocols. It also introduces WiMAX networks and discusses supporting QoS for multimedia like VoIP over WiMAX. The objectives are to guarantee QoS for multiple service classes over WiMAX and improve VoIP performance. Simulation using OPNET Modeler will analyze VoIP traffic and QoS parameters over WiMAX.
Abstract
The technology of voice browsing is rapidly evolving these days. It is because the use of cell phones is increasing at a very high rate, as compared to connected PCs. Listening and speaking are the natural modes of communication and information gathering. As a result we are now heading towards a more voice based approach of browsing rather than operating on textual mode. The command input and the delivery of web contents are entirely in voice. A voice browser is a device: that interprets voice input and interprets voice markup languages to generate voice output. That interprets a script which specifies exactly what to verbally present to the user as well as when to present each piece of information. Benefits Voice is a very natural user interface which speeds up browsing.
HD Video Player All Format - 4k & live streamHD Video Player
Discover the best video playback experience with HD Video Player. Our powerful, user-friendly app supports all popular video formats and codecs, ensuring seamless playback of your favorite videos in stunning HD and 4K quality. Whether you're watching movies, TV shows, or personal videos, HD Video Player provides the ultimate viewing experience on your device. 🚀
Morgan Freeman is Jimi Hendrix: Unveiling the Intriguing Hypothesisgreendigital
In celebrity mysteries and urban legends. Few narratives capture the imagination as the hypothesis that Morgan Freeman is Jimi Hendrix. This fascinating theory posits that the iconic actor and the legendary guitarist are, in fact, the same person. While this might seem like a far-fetched notion at first glance. a deeper exploration reveals a rich tapestry of coincidences, speculative connections. and a surprising alignment of life events fueling this captivating hypothesis.
Follow us on: Pinterest
Introduction to the Hypothesis: Morgan Freeman is Jimi Hendrix
The idea that Morgan Freeman is Jimi Hendrix stems from a mix of historical anomalies, physical resemblances. and a penchant for myth-making that surrounds celebrities. While Jimi Hendrix's official death in 1970 is well-documented. some theorists suggest that Hendrix did not die but instead reinvented himself as Morgan Freeman. a man who would become one of Hollywood's most revered actors. This article aims to delve into the various aspects of this hypothesis. examining its origins, the supporting arguments. and the cultural impact of such a theory.
The Genesis of the Theory
Early Life Parallels
The hypothesis that Morgan Freeman is Jimi Hendrix begins by comparing their early lives. Jimi Hendrix, born Johnny Allen Hendrix in Seattle, Washington, on November 27, 1942. and Morgan Freeman, born on June 1, 1937, in Memphis, Tennessee, have lived very different lives. But, proponents of the theory suggest that the five-year age difference is negligible and point to Freeman's late start in his acting career as evidence of a life lived before under a different identity.
The Disappearance and Reappearance
Jimi Hendrix's death in 1970 at the age of 27 is a well-documented event. But, theorists argue that Hendrix's death staged. and he reemerged as Morgan Freeman. They highlight Freeman's rise to prominence in the early 1970s. coinciding with Hendrix's supposed death. Freeman's first significant acting role came in 1971 on the children's television show "The Electric Company," a mere year after Hendrix's passing.
Physical Resemblances
Facial Structure and Features
One of the most compelling arguments for the hypothesis that Morgan Freeman is Jimi Hendrix lies in the physical resemblance between the two men. Analyzing photographs, proponents point out similarities in facial structure. particularly the cheekbones and jawline. Both men have a distinctive gap between their front teeth. which is rare and often highlighted as a critical point of similarity.
Voice and Mannerisms
Supporters of the theory also draw attention to the similarities in their voices. Jimi Hendrix known for his smooth, distinctive speaking voice. which, according to some, resembles Morgan Freeman's iconic, deep, and soothing voice. Additionally, both men share certain mannerisms. such as their calm demeanor and eloquent speech patterns.
Artistic Parallels
Musical and Acting Talents
Jimi Hendrix was regarded as one of t
The Evolution and Impact of Tom Cruise Long Hairgreendigital
Tom Cruise is one of Hollywood's most iconic figures, known for his versatility, charisma, and dedication to his craft. Over the decades, his appearance has been almost as dynamic as his filmography, with one aspect often drawing significant attention: his hair. In particular, Tom Cruise long hair has become a defining feature in various phases of his career. symbolizing different roles and adding layers to his on-screen characters. This article delves into the evolution of Tom Cruise long hair, its impact on his roles. and its influence on popular culture.
Follow us on: Pinterest
Introduction
Tom Cruise long hair has often been more than a style choice. it has been a significant element of his persona both on and off the screen. From the tousled locks of the rebellious Maverick in "Top Gun" to the sleek, sophisticated mane in "Mission: Impossible II." Cruise's hair has played a pivotal role in shaping his image and the characters he portrays. This article explores the various stages of Tom Cruise long hair. Examining how this iconic look has evolved and influenced his career and broader fashion trends.
Early Days: The Emergence of a Style Icon
The 1980s: The Birth of a Star
In the early stages of his career during the 1980s, Tom Cruise sported a range of hairstyles. but in "Top Gun" (1986), his hair began to gain significant attention. Though not long by later standards, his hair in this film was longer than the military crew cuts associated with fighter pilots. adding a rebellious edge to his character, Pete "Maverick" Mitchell.
Risky Business: The Transition Begins
In "Risky Business" (1983). Tom Cruise's hair was short but longer than the clean-cut styles dominant at the time. This look complemented his role as a high school student stepping into adulthood. embodying a sense of youthful freedom and experimentation. It was a precursor to the more dramatic hair transformations in his career.
The 1990s: Experimentation and Iconic Roles
Far and Away: Embracing Length
One of the first films in which Tom Cruise embraced long hair was "Far and Away" (1992). Playing the role of Joseph. an Irish immigrant in 1890s America, Cruise's long, hair added authenticity to his character's rugged and determined persona. This look was a stark departure from his earlier. more polished styles and marked the beginning of a more adventurous phase in his hairstyle choices.
Interview with the Vampire: Gothic Elegance
In "Interview with the Vampire" (1994). Tom Cruise long hair reached new lengths of sophistication and elegance. Portraying the vampire Lestat. Cruise's flowing blonde locks were integral to the character's ethereal and timeless allure. This hairstyle not only suited the gothic aesthetic of the film but also showcased Cruise's ability to transform his appearance for a role.
Mission: Impossible II: The Pinnacle of Long Hair
One of the most memorable instances of Tom Cruise long hair came in "Mission: Impossible II" (2000). His character, Ethan
Taylor Swift: Conquering Fame, Feuds, and Unmatched Success | CIO Women MagazineCIOWomenMagazine
From country star to global phenomenon, delve into Taylor Swift's incredible journey. Explore chart-topping hits, feuds, & her rise to billionaire status!
Unlocking the Secrets of IPTV App Development_ A Comprehensive Guide.pdfWHMCS Smarters
With IPTV apps, you can access and stream live TV, on-demand movies, series, and other content you like online. Viewers have more flexibility and customization of content to watch. To develop the best IPTV app that functions, you must combine creative problem-solving skills and technical knowledge. This post will look into the details of IPTV app development, so keep reading to learn more.
Enhance Your Viewing Experience with Gold IPTV- Tips and Tricks for 2024.pdfXtreame HDTV
In the ever-evolving landscape of digital entertainment, IPTV (Internet Protocol Television) has emerged as a popular alternative to traditional cable and satellite TV services. Offering unparalleled flexibility, a vast selection of channels, and affordability, IPTV services like Gold IPTV have revolutionized the way we consume television content. This comprehensive guide will delve into everything you need to know about Gold IPTV, its features, benefits, setup process, and how it can enhance your viewing experience.
Tom Cruise Daughter: An Insight into the Life of Suri Cruisegreendigital
Tom Cruise is a name that resonates with global audiences for his iconic roles in blockbuster films and his dynamic presence in Hollywood. But, beyond his illustrious career, Tom Cruise's personal life. especially his relationship with his daughter has been a subject of public fascination and media scrutiny. This article delves deep into the life of Tom Cruise daughter, Suri Cruise. Exploring her upbringing, the influence of her parents, and her current life.
Follow us on: Pinterest
Introduction: The Fame Surrounding Tom Cruise Daughter
Suri Cruise, the daughter of Tom Cruise and Katie Holmes, has been in the public eye since her birth on April 18, 2006. Thanks to the media's relentless coverage, the world watched her grow up. As the daughter of one of Hollywood's most renowned actors. Suri has had a unique upbringing marked by privilege and scrutiny. This article aims to provide a comprehensive overview of Suri Cruise's life. Her relationship with her parents, and her journey so far.
Early Life of Tom Cruise Daughter
Birth and Immediate Fame
Suri Cruise was born in Santa Monica, California. and from the moment she came into the world, she was thrust into the limelight. Her parents, Tom Cruise and Katie Holmes. Were one of Hollywood's most talked-about couples at the time. The birth of their daughter was a anticipated event. and Suri's first public appearance in Vanity Fair magazine set the tone for her life in the public eye.
The Impact of Celebrity Parents
Having celebrity parents like Tom Cruise and Katie Holmes comes with its own set of challenges and privileges. Suri Cruise's early life marked by a whirlwind of media attention. paparazzi, and public interest. Despite the constant spotlight. Her parents tried to provide her with an upbringing that was as normal as possible.
The Influence of Tom Cruise and Katie Holmes
Tom Cruise's Parenting Style
Tom Cruise known for his dedication and passion in both his professional and personal life. As a father, Cruise has described as loving and protective. His involvement in the Church of Scientology, but, has been a point of contention and has influenced his relationship with Suri. Cruise's commitment to Scientology has reported to be a significant factor in his and Holmes' divorce and his limited public interactions with Suri.
Katie Holmes' Role in Suri's Life
Katie Holmes has been Suri's primary caregiver since her separation from Tom Cruise in 2012. Holmes has provided a stable and grounded environment for her daughter. She moved to New York City with Suri to start a new chapter in their lives away from the intense scrutiny of Hollywood.
Suri Cruise: Growing Up in the Spotlight
Media Attention and Public Interest
From stylish outfits to everyday activities. Suri Cruise has been a favorite subject for tabloids and entertainment news. The constant media attention has shaped her childhood. Despite this, Suri has managed to maintain a level of normalcy, thanks to her mother's efforts.
The Midnight Sculptor.pdf writer by Ali alsiadali345alghlay
The city of Ravens burg was known for its gothic architecture, fog-covered streets, and an eerie silence that seemed to hang over the town like a shroud.
How OTT Players Are Transforming Our TV Viewing Experience.pdfGenny Knight
The advent of Over-The-Top (OTT) players has brought a seismic shift in the television industry, transforming how we consume media. These digital platforms, which deliver content directly over the internet, have outpaced traditional cable and satellite television, offering unparalleled convenience, variety, and personalization. Here’s an in-depth look at how OTT players are revolutionizing the TV viewing experience.
Leonardo DiCaprio Super Bowl: Hollywood Meets America’s Favorite Gamegreendigital
Introduction
Leonardo DiCaprio is synonymous with Hollywood stardom and acclaimed performances. has a unique connection with one of America's most beloved sports events—the Super Bowl. The "Leonardo DiCaprio Super Bowl" phenomenon combines the worlds of cinema and sports. drawing attention from fans of both domains. This article delves into the multifaceted relationship between DiCaprio and the Super Bowl. exploring his appearances at the event, His involvement in Super Bowl advertisements. and his cultural impact that bridges the gap between these two massive entertainment industries.
Follow us on: Pinterest
Leonardo DiCaprio: The Hollywood Icon
Early Life and Career Beginnings
Leonardo Wilhelm DiCaprio was born in Los Angeles, California, on November 11, 1974. His journey to stardom began at a young age with roles in television commercials and educational programs. DiCaprio's breakthrough came with his portrayal of Luke Brower in the sitcom "Growing Pains" and later as Tobias Wolff in "This Boy's Life" (1993). where he starred alongside Robert De Niro.
Rise to Stardom
DiCaprio's career skyrocketed with his performance in "What's Eating Gilbert Grape" (1993). earning him his first Academy Award nomination. He continued to gain acclaim with roles in "Romeo + Juliet" (1996) and "Titanic" (1997). the latter of which cemented his status as a global superstar. Over the years, DiCaprio has showcased his versatility in films like "The Aviator" (2004). "Start" (2010), and "The Revenant" (2015), for which he finally won an Academy Award for Best Actor.
Environmental Activism
Beyond his film career, DiCaprio is also renowned for his environmental activism. He established the Leonardo DiCaprio Foundation in 1998, focusing on global conservation efforts. His commitment to ecological issues often intersects with his public appearances. including those related to the Super Bowl.
The Super Bowl: An American Institution
History and Significance
The Super Bowl is the National Football League (NFL) championship game. is one of the most-watched sporting events in the world. First played in 1967, the Super Bowl has evolved into a cultural phenomenon. featuring high-profile halftime shows, memorable advertisements, and significant media coverage. The event attracts a diverse audience, from avid sports fans to casual viewers. making it a prime platform for celebrities to appear.
Entertainment and Advertisements
The Super Bowl is not only about football but also about entertainment. The halftime show features performances by some of the biggest names in the music industry. while the commercials are often as anticipated as the game itself. Companies invest millions in Super Bowl ads. creating iconic and sometimes controversial commercials that capture public attention.
Leonardo DiCaprio's Super Bowl Appearances
A Celebrity Among the Fans
Leonardo DiCaprio's presence at the Super Bowl has noted several times. As a high-profile celebrity. DiCaprio attracts
From Teacher to OnlyFans: Brianna Coppage's Story at 28get joys
At 28, Brianna Coppage left her teaching career to become an OnlyFans content creator. This bold move into digital entrepreneurship allowed her to harness her creativity and build a new identity. Brianna's experience highlights the intersection of technology and personal branding in today's economy.
From Teacher to OnlyFans: Brianna Coppage's Story at 28
10.1.1.510.6198
1. Creating a low cost VoiceXML Gateway to replace
IVR systems for rapid deployment of voice
applications.
Adam King, Alfredo Terzoli and Peter Clayton
Department of Computer Science
Tel: 046 6038291 Fax: 046 6361915
Rhodes University Grahamstown, 6140
g00k4406@campus.ru.ac.za A.Terzoli@ru.ac.za P.Clayton@ru.ac.za
Abstract— This paper reports on the creation of a low cost
VoiceXML gateway which can be used to replace traditional
Interactive Voice Response (IVR) platforms.
The gateway is created by integrating a VoiceXML interpreter,
OpenVXI and a PBX, Asterisk, producing a Linux based, open
source, system which is both a PBX and a VoiceXML browser.
Reasons for choosing the components for the gateway and then
the integration of these components are discussed.
VoiceXML applications can be used to replace IVR systems,
which are then rendered by the gateway.1
Index Terms— Asterisk, VoiceXML, IVR, VoIP
I. INTRODUCTION
Many organisations make use of IVRs to provide voice
interfaces to their users. These systems can be used to reduce
user interaction with human call center agents, used to interact
with voice services (such as voice mail systems) or used to
bring new and inventive voice services to customers.
Currently there is no uniform way of deploying IVR plat-
forms and implementing IVR systems. IVR applications are
tightly coupled to their underlying platforms. This can lead
to problems such as different IVR systems not being able
to interact with each other and IVR applications not being
portable. Installing and developing IVR systems can also be
expensive.
This paper proposes that VoiceXML be used as a uni-
form means of creating IVRs. VoiceXML is a mature XML
based standard which can be used to describe voice applica-
tions. Like HTML, VoiceXML pages have to be rendered by
browsers.
The use of VoiceXML would allow the decoupling of
voice applications from their underlying platforms, shield-
ing developers from the low level aspects of the specific
platform, and increasing portability of voice applications.
However VoiceXML gateways are costly, which may dissuade
many organisations. There is therefore a need for low cost
VoiceXML gateways.
This paper will firstly investigate the relationship between
VoiceXML and IVRs, detailing the advantages VoiceXML can
1This work was undertaken in the Distributed Multimedia Centre of Excel-
lence at Rhodes University, with financial support from Telkom SA, Business
Connexion, Comverse, Verso Technologies, Tellabs, StorTech THRIP, and the
National Research Foundation.
bring to the process of creating IVRs. It will then discuss
the components need to create a VoiceXML gateway, giving a
brief description of the components selected for this particular
gateway. Once the components have been discussed the paper
will look at the task of integrating the components to create a
gateway. Finally the complete architecture will be summarised,
detailing a typical call.
II. VOICEXML AND IVRS
VoiceXML provides sufficient functionality to completely
replace existing IVR systems, and has added advantages in
the area of voice application development.
VoiceXML applications residing on the corporate web
server have the same access to corporate databases as the web
server itself, and VoiceXML developers can take advantage of
existing web application development tools [1], [2]. Develop-
ers are also able to make use of familiar, and mature, web tech-
nologies, for example extending standard web security meth-
ods to create secure voice applications [2]. VoiceXML also
abstracts away from the low-level, platform specific, details.
Due to this abstraction, VoiceXML applications are portable
across web servers and gateways [1]. These factors help drive
down development costs while increasing the longevity and
scope of the VoiceXML applications. Companies can change
platforms without affecting existing development.
In contrast there seems to be no standard means of in-
teraction between IVR systems from different vendors, and
enterprises may deploy proprietary systems which are not
compatible with their existing web technologies. VoiceXML
can be used as a means of standardising the creation of voice
applications [3].
While creating VoiceXML applications may be an attractive
option, they have to be rendered by a VoiceXML browser.
Browsers can either be purchased, or a hosting solution can
be employed.
According to Ruiz et al VoiceXML gateways are very
expensive, due to the specialised hardware required [4]. Eisen-
zopf claims that the cost of VoiceXML gateways begins at
$10 000, while Thompson et al put this figure significantly
higher, at $100 000 [3], [5]. This figure, in reality, is an
estimation of the total startup costs, including costs such
2. as line rental, purchasing and installing the equipment and
development costs.
Another option is to have the VoiceXML applications hosted
by Voice hosting Service Providers (VSPs), as described by
[5]. In this case there are no startup costs when using a
hosted solution, only the cost of hiring the lines from the VSP.
Maintenance of the gateway now becomes the responsibility
of the VSP. This means that an enterprise using the hosted
solution need only maintain their VoiceXML pages, and not
an IVR platform as well, lowering maintenance costs as well
as sharing ownership costs.
While hosting seems to be a viable option for smaller
enterprises, there are a number disadvantages, especially in the
South African context. Firstly, the cost of the hosting is still
high, because the gateways employed for hosting are expen-
sive. Secondly, VSPs are predominantly found abroad, limiting
access to the application for many South Africans. In addition
to this hosted solutions may not allow easy customisation,
such as synthesising speech in different accents and different
languages or recognising speech in different languages.
These high startup costs and unattractive hosting options
will hinder the adoption of VoiceXML as a standard means
of creating IVRs, particularly amongst the many small to
medium enterprises which can be found in the South African
business landscape. There is therefore a need for a low cost
VoiceXML gateway. Such a gateway will bring the benefits
of using VoiceXML based voice applications to organisations
which would otherwise not be able to to afford it, increasing
the uptake of VoiceXML.
III. GATEWAY COMPONENTS
Creating a VoiceXML gateway from scratch is possible
but impractical. On the other hand following a component-
orientated approach can significantly reduce the scope of the
project, making the construction of a gateway feasible [6].
Gateway components have to be identified and then sourced
from the open source community. Using open source systems
allows systems to be integrated with more flexibility, and of
course ensures lower costs. The VoiceXML 2.0 specification
[7] describes the components needed to construct a fully
compliant VoiceXML 2.0 platform. These are:
a) Document acquisition: The gateway must have a
means of fetching documents from a web server, for the
interpreter to parse. This fetching mechanism must be able
to fetch documents using HTTP.
b) Audio output: VoiceXML 2.0 allows for both speech
synthesis and the play back of audio files. Any implementation
of a gateway must be able to synthesise text, and fetch and
play audio files. The gateway discussed in this paper supports
MP3, GSM and WAV audio formats, amongst others.
c) Audio input: A fully compliant VoiceXML 2.0 gate-
way must be able to simultaneously recognise both character
input, via DTMF, and spoken input. The gateway implemented
in this paper is only intended to be used as a replacement for
IVRs, and so speech recognition is not required. The gateway
must also be able to record user input.
d) Transfer: The gateway must be able to transfer calls.
Also implied here is the need for a telephony platform, to
handle calls to the gateway.
It follows that to implement a gateway a developer will need
the following components:
• A VoiceXML interpreter, which can fetch and interpret
VoiceXML 2.0 documents,
• A Text To Speech (TTS) system, to provide synthesised
speech, and a means of playing back audio files,
• An Automatic Speech Recognition (ASR) system, for
speech recognition, a DTMF recogniser, and a means of
recording audio input, and finally
• A telephony platform to handle calls, including the trans-
fer of existing calls.
A. Components
This gateway uses OpenVXI as the interpreter, Festival
provides synthesised text and the telephony platform is As-
terisk. Asterisk will also be responsible for playing audio files
(section IV-D), DTMF recognition (sections IV-D and IV-E)
and recording audio.
1) OpenVXI: OpenVXI is an open framework where devel-
opers can add their own components to create a VoiceXML
browser [8], [9]. To this purpose OpenVXI exposes three APIs
to the developer, the tel API, the prompt API and the rec
API. Creating the gateway involves fleshing out the APIs, to
allow OpenVXI to interact with the third party components.
The reader should note that an interface represents the reali-
sation of an API.
The tel API is responsible for setting up and tearing down
calls, monitoring the calls to detect hang ups and providing
the functionality to transfer calls. The prompt API represents
a one way interaction between the gateway and the user. It is
responsible for playing back audio files and synthesised text.
The prompt API also handles the fetching and caching of
pre-recorded audio. The rec API is responsible for evalu-
ating user input against any active grammars, and recording
utterances. (It is not responsible for converting DTMF and
voice signals into symbols; this task falls to the third party
components.)
The interpreter is fully compliant with the VoiceXML 2.0
specification, meaning that it can parse VoiceXML 2.0 docu-
ments, its functionality, however, is dependent on the amount
of fleshing out that has been done to the APIs. OpenVXI
runs on a Linux platform and is written in C and C++. It
uses SpiderMonkey as its JavaScript engine and Xerces as the
XML parser, which are open source projects available on the
Linux platform. OpenVXI has been chosen as the VoiceXML
interpreter.
publicVoiceXML was also considered as an open source
VoiceXML browser. It claims to provide built-in DTMF and
telephony support and makes provisions for adding TTS
systems. Initially this project seemed ideal. Unfortunately it
does not seem very active, and was therefore discarded.
2) Festival: The Festival speech synthesis system, devel-
oped at CMU, is a Linux based open source framework,
written in C++, for creating TTS systems [10]. Festival can
3. be used to create different voices in different languages. [11]
investigates creating new voices for the Festival system, con-
cluding that creating new Festival voices is a relatively simple
procedure. To this effect the Speech Technology and Research
(STAR) Group2
are developing new Festival languages in
some of the indigenous South African languages. This is a
clear advantage in the South African context, allowing voice
applications to ’speak’ in more than one language, bringing
these applications to a larger number of South Africans.
The quality of the voices depends on the the training data
used when creating the voice [10]. However the standard, free,
voices which come with Festival produce clear results, and the
majority of the output is understood, although the voices do
sound robotic.
3) Asterisk: Asterisk is an open source telecommunications
environment [12]. It can be used as a TDM and/or packet
switched voice and video PBX and an IVR platform. As a
Voice over IP gateway Asterisk supports IAX, SIP, MGCP
and H.323, at the same time it can be used as an interface to
the PSTN. Asterisk’s aim is “to interface any piece of tele-
phony hardware or software with any telephony application,
seamlessly and consistently” [12]. It runs on a Linux platform
and is written in C.
One goal of this project is to integrate OpenVXI and
Asterisk in a way that does not change Asterisk’s source code,
so that the gateway can be added to any existing instances of
Asterisk simply and seamlessly.
One major benefit of using Asterisk as the
telephony interface is the fact that it supports so many
telecommunications technologies. This means that the
VoiceXML gateway will be exposed to all the technologies
supported by Asterisk, allowing a lot of flexibility with
regards to interacting with the voice applications.
Once all the components have been selected they have to
be integrated to create the gateway. The integration of these
components is discussed in the following section.
IV. CREATING THE GATEWAY
This section will look at the actual construction of the
gateway. The construction involves the integration of the
components described in section III-A into a working system.
Firstly, we need a means of integrating OpenVXI into
Asterisk. typical call is initiated by the client, and is routed
to Asterisk. Asterisk then decides what to do with the call
using dialplan logic. Should the call need to be directed to a
VoiceXML service Asterisk routes the call to the interpreter.
A. Dialplan Applications
Once Asterisk knows that the call has to be passed to the
interpreter it needs a means of interacting with the gateway.
This must be done as seamlessly as possible. There are
two options available here. Using an AGI script to run the
interpreter, or creating a dialplan application.
An Asterisk Gateway Application (AGI) “executes an As-
terisk Gateway Interface compliant program on a channel.
2http://www.star.za.net/
AGI allows Asterisk to launch external programs written in
any language to control a telephony channel, play audio, read
DTMF digits, etc. by communicating with the AGI protocol on
stdin and stdout” [13]. While this may seem to be the easiest
means of integrating the interpreter into Asterisk, it is not the
most elegant.
For this reason it was decided to create a dialplan ap-
plication. Dialplan applications can be called directly from
the dialplan and, like AGI applications, are available to
any channel. This means that the VoiceXML service will
be available to any channel Asterisk can support. Dialplan
applications are relatively simple to create in Asterisk. A C
application is written and put in the asterisk/apps direc-
tory. This file must follow a basic template. It must be named
app_<app name>.c, in this case app_voicexml.c. A
similar naming convention is also used when naming functions
and variables which are then called by Asterisk at runtime. For
instance, execution in an application will begin in the <app
name>_exec function. Finally, a target must be created for
the application in the asterisk/apps makefile to ensure
that the module is created. This will require the Asterisk
source and a re-build, which creates the shared object and
moves it into /usr/libs/asterisk/modules. These
modules are then loaded by the Asterisk module loader at
start up. Creating these applications in this manner provides
a means to add functionality to Asterisk without changing the
source code, an important constraint to this project. Acquiring
the source and re-building Asterisk is relatively simple, and
so creating new dialplan applications is a simple means of
generically extending functionality in many different Asterisk
implementations.
B. Adding the interpreter
Once we have created a basic application we can begin
to add VoiceXML functionality. This means initialising and
activating the interpreter. OpenVXI is linked dynamically at
build time to the voicexml application. Control is then
passed to OpenVXI, which can begin interpretation of the
VoiceXML application. The interpreter can now start parsing a
page. OpenVXI needs to be able to perform semantic actions
on the channel, and so needs to be passed a pointer to the
particular channel. Therefore another dynamic link has to be
created, linking Asterisk to OpenVXI and allowing a pointer
to the channel to be passed to the interpreter.
Once OpenVXI and Asterisk are linked, the two processes
can interact with each other. Before OpenVXI is initialised,
Asterisk has set up the call and, once OpenVXI has exited,
Asterisk is responsible for tearing down the call. This func-
tionality does not have to be implemented again in the tel
API. When control is passed to the interpreter, the channel,
representing a correctly set up call, is passed as well. Once the
interpreter is done with the call (a hang-up has been received
or the VoiceXML application has exited), control is passed
back to the dialplan application, at which point Asterisk tears
down the call.
Once Asterisk and the OpenVXI are able to interact with
each other, the APIs have to be given the functionality to
4. render the VoiceXML. Specifically, the APIs have to be able to
perform actions, described by the VoiceXML, on the channel.
A pointer to the channel is passed from the dialplan application
to the OpenVXI platform, and from there to the prompt API,
along with pointers to the tel and rec APIs, resulting in a
somewhat prompt-centric architecture.
Once OpenVXI has control of the channel, it begins in-
terpreting the page, constructs a prompt, and sends it to the
prompt interface to be played. During the playing of the
prompts Asterisk is responsible for collecting DTMF tones,
translating these tones into characters, and passing the result
to the prompt interface. The prompt interface is then
responsible for setting the value of the received input in
the recognition interface. The rec interface then evaluates
this input against the current grammar when the prompt is
done playing. (Depending on the prompt’s barge-in property
the playing of the prompt can be stopped as soon as input
is received, or only once it is done.) Input received during
a prompt which requires no input is ignored by the rec
interface, but is still set by the prompt interface.
In addition to setting received input, the prompt interface
also has to alert the tel interface when it receives a hang-
up signal from the channel, so that the line status can be
updated. The interpreter frequently checks the status of the
line, and once it detects that the line has been hung up it will
stop queuing prompts, release resources, and exit. This returns
control to the voicexml dialplan application, which will also
exit, and Asterisk can commence tearing down the channel.
C. Queuing Prompts
OpenVXI describes prompts using the Speech Synthesis
Markup Language (SSML) specification [14]. SSML is used
as a standard method to define input for text synthesisers,
ensuring that the input adheres to a pre-defined set of rules,
thus improving performance. It is also a way to provide
additional linguistic information to the synthesisers [15].
The interpreter creates SSML documents from the
VoiceXML page and passes them to the prompt interface,
which is responsible for rendering the prompts. A simple
SSML parser has been written to extract the key/value pairs
which are passed as arguments in the <prompt> tag. The
SSML represents both types of prompts, synthesised speech
and pre-recorded audio. Besides extracting the parameters the
parser has to differentiate between these two types and act
accordingly. A prompt data structure has been created which
is used to represent the results of the parsing. A queue of these
prompt structures has been implemented.
Typically, the SSML document representing the initial
prompt is generated, and the prompt interface will attempt
to play this immediately. The interpreter will only attempt to
queue a prompt once it is sure, barring a hangup, that prompt
will be played. If the next prompt to be queued is dependent
on user input, the interpreter will only queue the prompt after
the user input has been received. Additionally, if a prompt is
to be played twice it will be queued twice.
This behavior simplifies the matter of choosing a data
structure to represent the queue. Prompts are only queued if
they are definitely going to be played, so the prompt at the
head of the queue can always be played. If a prompt is to
be played twice it will be queued twice, so the prompt at the
head of the queue, once played, can be discarded.
D. Playing Prompts
The prompt API defines VXIpromptPlay, which is
responsible for playing the prompts. As said, there are two
types of prompts which can be played, pre-recorded audio and
synthesised text. Playback of pre-recorded audio is made rel-
atively simple by using existing Asterisk functionality, which
can play audio on a channel in a number of formats. This is
provided by the ast_app_getdata function, which plays
a file and returns any DTMF input received during playback.
Synthesising text requires the integration of Festival into
the system. Festival provides a number of APIs to allow
other applications to interact with it. These are, Scheme API,
Shell API, C/C++ API, Java and JSAPI, C only API and the
Client/Server API [10]. The Client/Server API has been used
in this project.
There are a number of reasons for this. Firstly, this API
provides sufficient functionality. While the Scheme API may
provide more fine grained control, such control is unnecessary.
Secondly, once the server is running there are no time penalties
associated with initialisation, and so prompts can be generated
with minimal delay. Some of the APIs, in particular the Shell
API and Scheme API, will incur time penalties while initialis-
ing. Thirdly, the Client/Server API provides the option of load
balancing, and different servers can be configured differently.
Configuration files are used when setting up Festival servers.
This means that different servers can synthesise text using
different voices and different languages. Finally, excellent
examples of using the Client/Server API can be found from
both the Festival documentation [10] and from Asterisk itself.
Once the Festival server is configured and running, a client
application, in this case the prompt API, can connect to
the server, send a string, and the server will send back the
waveform representing the string. This waveform is received as
an array of characters. Playing this waveform on the channel is
done by piping the waveform to the channel. The channel reads
from the pipe and the prompt interface writes the waveform
to the pipe. The Asterisk function send_waveform_to_fd
is used to write to the pipe.
While the prompt is being played back, each frame received
from the channel is monitored. There are ten distinct frame
types, including DTMF frames, voice frames, video frames
and NULL frames. NULL frames are generated when the line is
hung up. When the prompt interface detects a NULL frame it
can stop playing the prompt and notify the telephony interface.
When we receive a DTMF signal we can treat it as input, and
notify the recognition interface.
E. DTMF input
Once DTMF input is received, it has to be sent to the rec
interface, using a VXIMap. The VXIMap data structure is a
container for key/value pairs which are of type VXIchar *.
The VXIchar type represents the locale char type, and is
5. Fig. 1. The complete system, including interactions
A is described in section IV-A
B is described in section IV-B
C is described in section IV-C
D is described by sections IV-D and IV-E
used for portability. The VXIMaps are used by OpenVXI to
pass data across interface boundaries, in this case between
the prompt and rec interfaces. The VXIMap must specify
the type of input, speech or DTMF, as well as the value of
the input. The VoiceXML specification states that input can
either be speech only, or DTMF only, or a combination of
the two. The rec interface is then responsible for matching
the input against the active grammar, or grammars, and acting
accordingly. Traditional IVRs only accept DTMF user input,
so in this case speech input is ignored.
In keeping with the ’prompt centric’ architecture all inputs,
including hangups, are detected during the playback of a
prompt. The input collected by the prompt interface is sent
to the tel and rec interfaces when appropriate.
V. COMPLETE ARCHITECTURE
Figure 1 shows the complete architecture, illustrating the
interaction between the components once they have been
integrated. It describes the process of handling a typical call
which requests a VoiceXML service, rendering the VoiceXML
page(s) while interacting with the user, and finally ending
the call. This process, described in detail in section IV, is
described here.
Once the call is set up the dialplan application,
app_voicexml, is executed, and is passed a pointer to the
channel and the URL of the VoiceXML page.
app_voicexml is then responsible for initialising the
OpenVXI platform. Once that is done it passes control, as well
as pointers to the channel and URL, to OpenVXI. OpenVXI
has a built in internet interface, which is responsible for
fetching the VoiceXML document(s).
When the document(s) has been returned from the web
server the interpreter begins parsing and sending prompts to
the prompt interface to be queued.
The prompt interface is responsible for queuing and then
playing the prompts. Before the prompts are queued they have
to be parsed by the SSML parser. Prompts which need to be
synthesised are sent to the Festival server, and the resulting
waveform is piped to the channel. Pre-recorded audio prompts
are played using ast_app_getdata, which is provided by
the app module.
While prompts are being played the prompt interface is
responsible for collecting input, which is sent to the rec
interface, and detecting and then alerting the tel interface
of hangups. The rec interface attempts to match the received
input against any active grammars, and sets the result for the
interpreter to act upon appropriately.
VI. CONCLUSION
Using VoiceXML to create IVRs is a better solution
than creating dedicated IVR platforms and applications.
VoiceXML 2.0 provides more than enough functionality to
allow VoiceXML applications to completely replace traditional
IVR systems, and should the gateway be completely compli-
ant, VoiceXML has the means to extend and improve IVR
systems as we know them today.
While using VoiceXML as a voice application development
language is attractive, the costs and inconveniences associated
with VoiceXML gateways can prove a hindrance. However
it is possible to create a low cost VoiceXML gateway which
has enough functionality to completely replace traditional IVR
systems. This architecture will be used to replace a mark
reading IVR at Rhodes University.
This paper shows that open source components can be used
to build a VoiceXML gateway which which has sufficient
functionality to replace traditional IVR systems. Integrating
6. OpenVXI and Asterisk produces a hybrid VoiceXML gateway
and PBX. The reader should note that as the VoiceXML 2.0
specification requires speech recognition the gateway being
discussed is not compliant. However, speech recognition will
be added to the system, creating a more natural means of
interacting with the applications, and so improving usability.
This functionality will be provided by integrating the Sphinx4
speech recognition system to the gateway discussed in this
paper.
The main components of this gateway, OpenVXI, Asterisk
and Festival are all mature and active projects, ensuring the
gateway’s longevity.
REFERENCES
[1] C. Bajorek, “VoiceXML - Taking IVR to
the Next Level,” 2000. [Online]. Available:
http://www.cconvergence.com/article/CTM20000927S0003
[2] Kristy Bradnum, “VoiceXML: A Field Evaluation,” Honour’s thesis,
Department of Computer Science, Rhodes University, Grahamstown,
South Africa, November 2004.
[3] Jonathan Eisenzopf, “Is VoiceXML right for your
customer service strategy?” 2002. [Online]. Available:
http://www.ddj.com/dept/architect/184413221
[4] JA Quiane Ruiz and JR Manjarrez Sanchez, “Design of a VoiceXML
Gateway,” In proceedings of the Fourth Mexican International Confer-
ence on Computer Science, vol. 00, pp. 49–53, 2003.
[5] David L. Thomson and John M. Hibel, “The Business of Voice Hosting
with VoiceXML,” Lucent Speech Solutions, Tech. Rep., 2000, Avaliable
at http://members.tripod.com/ David_Thomson/papers/Host4.doc.
[6] O. Nierstrasz, S. Gibbs, and D. Tsichritzis, “Component-
Oriented Software Development,” "Communications of the ACM,
vol. 35, no. 9, pp. 160–165, 1992. [Online]. Available:
citeseer.ist.psu.edu/nierstrasz92componentoriented.html
[7] W3C, “Voice Extensible Markup Language (VoiceXML) Version 2.0,”
2004. [Online]. Available: http://www.w3.org/TR/voicexml20
[8] C.-H. Hsu, M.-R. Hsu, C.-Y. Yang, and S.-C. Chang, “On the construc-
tion of a VoiceXML Voice Browser,” 2002, In ISCSLP 2002, paper 25.
[9] Jerry Carter, Brian Eberman, David Goddeau, Darren Meyer,
“Building VoiceXML Browsers with OpenVXI,” In proceedings of
the International WWW Conference, May 2002. [Online]. Available:
http://www2002.org/CDROM/refereed/260
[10] A. W. Black, P. A. Taylor, and R. Caley, “The Festival Speech Synthe-
sis System: System documentation,” Human Communciation Research
Centre, University of Edinburgh, University of Edinburgh, UK, Tech.
Rep., 2002, avaliable at http://festvox.org/docs/manual-1.4.3.
[11] Matthew Hood, “Creating a Voice for Festival Speech Synthesis Sys-
tem,” Honour’s thesis, Department of Computer Science, Rhodes Uni-
versity, Grahamstown, South Africa, November 2004.
[12] M. Spencer, M. Allison, and C. Rhodes, “The Asterisk Handbook. Vol.
2.” 2003.
[13] Various Authors, “Asterisk Help System for Asterisk version 1.0.6,”
2005.
[14] W3C, “Speech Synthesis Markup Language (SSML) Version 1.0,”
2004. [Online]. Available: http://www.w3.org/TR/speech-synthesis
[15] A. Isard, “SSML: A Markup Language for Speech Synthesis,” 1995.
[Online]. Available: citeseer.ist.psu.edu/isard95ssml.html
A King has completed an Honours degree in Computer Science at Rhodes
University. He is currently reading towards a Masters degree at the same
institute and is a Telkom bursar
P Clayton is the Director of Research, Computer Science Department, Rhodes
University. His research interests include distributed multimedia, distributed
and parallel processing and distance learning technology.
A Terzoli is the Project Director for the Centre of Excellence (CoE) in
Distributed Multimedia at Rhodes University and the Project Director for
the CoE in E-Commerce and E-Learning at the University of Fort Hare. His
research interests include real-time multimedia over packet networks and ICTs
in rural development.