What is Voice?
Why you should consider adding voice?
Voice: Pros vs Cons
Introducing Voice into your SDLC / PDLC
5 Steps to Designing for Voice Before Coding
7 Principles for Designing for Voice
Real Life Conditions
Voice AI Persona, Personality, Tone and Character
Resources, Tools, and More
@igotux
ABSTRACT The Virtual Trial Room (VTR) application software simulates an apparel dressing room by the implementation of a virtual mirror, portraying an augmented view of the user with virtual superimposed clothes. Traditional approach to the design and implementation of virtual dressing rooms have been wildly using either normal webcams with Tag/Marker based tracking or expensive 3D depth & motion sensing cameras like Microsoft Kinect. The main idea of this paper is to methodologically devise a novel VTR solution deploying ubiquitous 2D webcams with tag-less tracking, in a real-time live video mode using open source tools and technologies. The solution model implements a tag-less or marker-less Augmented Reality (AR) technique with face detection technology and provides an intuitive motion-augmented User Interface (UI) to the VTR application, in the form of an interactive human-friendly Virtual Mirror using simple hand gestures. A qualitative performance analysis of the application is evaluated at the end of the paper to determine the fundamental susceptibility of the VTR system against varied illumination conditions.
https://github.com/Project-VTR
Biometrics & AI (Artificial Intelligence) is the future of AuthenticationTony Chew
AI and biometrics will provide a new level of multi-dimension digital security for online and mobile banking which was elusive and unattainable in the past.
The privacy and confidentiality of PII and personal biometric data (original raw data should be discarded after enrolment) is not undermined nor compromised by a robust biometric system. The biometric template, a mathematical code of a few thousand bytes or more, is equivalent to a password ciphertext or hash value, but vastly more secure as it has no decryption key, and which is immune to reverse engineering or brute force attack, is used for authenticating the customer based on his/her claimed identity.
A biometric template of say 2,400 bits will have a permutation variable size greater than the key space of a 256 bit AES cryptographic system. It is not physically or functionally feasible to brute force attack a 256 AES bit key size encryption system. The universe space of a 2,400 bit biometric template system is much bigger than AES 256 bit key space. So, hacking or breaking a biometric template is infeasible.
Biometric systems are not interoperable and should not be "shared" or multi-tenanted in any cloud. Your identity is better protected by a strong biometric system than any password or knowledge-based challenge response system. As always, 2FA (Two Factor Authentication) is the foundation on which a biometric system should operate.
ABSTRACT The Virtual Trial Room (VTR) application software simulates an apparel dressing room by the implementation of a virtual mirror, portraying an augmented view of the user with virtual superimposed clothes. Traditional approach to the design and implementation of virtual dressing rooms have been wildly using either normal webcams with Tag/Marker based tracking or expensive 3D depth & motion sensing cameras like Microsoft Kinect. The main idea of this paper is to methodologically devise a novel VTR solution deploying ubiquitous 2D webcams with tag-less tracking, in a real-time live video mode using open source tools and technologies. The solution model implements a tag-less or marker-less Augmented Reality (AR) technique with face detection technology and provides an intuitive motion-augmented User Interface (UI) to the VTR application, in the form of an interactive human-friendly Virtual Mirror using simple hand gestures. A qualitative performance analysis of the application is evaluated at the end of the paper to determine the fundamental susceptibility of the VTR system against varied illumination conditions.
https://github.com/Project-VTR
Biometrics & AI (Artificial Intelligence) is the future of AuthenticationTony Chew
AI and biometrics will provide a new level of multi-dimension digital security for online and mobile banking which was elusive and unattainable in the past.
The privacy and confidentiality of PII and personal biometric data (original raw data should be discarded after enrolment) is not undermined nor compromised by a robust biometric system. The biometric template, a mathematical code of a few thousand bytes or more, is equivalent to a password ciphertext or hash value, but vastly more secure as it has no decryption key, and which is immune to reverse engineering or brute force attack, is used for authenticating the customer based on his/her claimed identity.
A biometric template of say 2,400 bits will have a permutation variable size greater than the key space of a 256 bit AES cryptographic system. It is not physically or functionally feasible to brute force attack a 256 AES bit key size encryption system. The universe space of a 2,400 bit biometric template system is much bigger than AES 256 bit key space. So, hacking or breaking a biometric template is infeasible.
Biometric systems are not interoperable and should not be "shared" or multi-tenanted in any cloud. Your identity is better protected by a strong biometric system than any password or knowledge-based challenge response system. As always, 2FA (Two Factor Authentication) is the foundation on which a biometric system should operate.
Decision Making and Information SystemsAriful Saimon
Premier University
[B.B.A]
Submitted To : Lecturer MS. Samima Parvez
Subject : Decision Making and Information
Semester: 5th Section: “A” Batch :22nd
Group Name: D’5
E-mail : Saimonchy20@gmail.com
Autonomic Computing: Vision or Reality - PresentationIvo Neskovic
Autonomic computing is a new computing paradigm which combines multiple disciplines of computer science with the sole aim of developing self-managing computer systems. Dating from early 2001, it is one of the most recent paradigm shifts, and as such it is still in a research-only phase, however, attracting a lot of business investors in the process.
The following survey presents in a clear and appropriately detailed manner the problem of computer science which autonomic computing tries to solve, the details of the proposed solution, together with the some of the immediate and long-term benefits it will provide. Moreover, the survey outlines the basic principles which define a system as an autonomic one, and presents a novel method of designing autonomic systems. Closing the survey are two sections which briefly outline the most prominent research projects on autonomic computing, together with a distiled summary of the major challenges which will be faced by businesses in the process of adopting autonomic systems.
This presentation is a summary of our first event, it will give you a walk you through the technical capabilities of the major voice platforms (Amazon Alexa, Google Home, Siri, MS Cortana, Bixby etc), examine how they can be leveraged to build better products, and give an introduction to the voice-specific design process.
Creating intuitive and usable Voice User Interfaces (VUI) involves coordinating a large number of research-based best practices, with the unique needs of each project. This white paper explores various factors that influence VUI design.
Decision Making and Information SystemsAriful Saimon
Premier University
[B.B.A]
Submitted To : Lecturer MS. Samima Parvez
Subject : Decision Making and Information
Semester: 5th Section: “A” Batch :22nd
Group Name: D’5
E-mail : Saimonchy20@gmail.com
Autonomic Computing: Vision or Reality - PresentationIvo Neskovic
Autonomic computing is a new computing paradigm which combines multiple disciplines of computer science with the sole aim of developing self-managing computer systems. Dating from early 2001, it is one of the most recent paradigm shifts, and as such it is still in a research-only phase, however, attracting a lot of business investors in the process.
The following survey presents in a clear and appropriately detailed manner the problem of computer science which autonomic computing tries to solve, the details of the proposed solution, together with the some of the immediate and long-term benefits it will provide. Moreover, the survey outlines the basic principles which define a system as an autonomic one, and presents a novel method of designing autonomic systems. Closing the survey are two sections which briefly outline the most prominent research projects on autonomic computing, together with a distiled summary of the major challenges which will be faced by businesses in the process of adopting autonomic systems.
This presentation is a summary of our first event, it will give you a walk you through the technical capabilities of the major voice platforms (Amazon Alexa, Google Home, Siri, MS Cortana, Bixby etc), examine how they can be leveraged to build better products, and give an introduction to the voice-specific design process.
Creating intuitive and usable Voice User Interfaces (VUI) involves coordinating a large number of research-based best practices, with the unique needs of each project. This white paper explores various factors that influence VUI design.
Accessibility and Inclusive Design at Microsoft Ignite Milano - Feb 1 2019Roberto D'Angelo
Interactive session delivered by Roberto D'Angelo and Jie Yuan at Microsoft Ignite event in Milano on Feb 1st 2019.
The presentation cover the basics of accessibility and inclusive design, why those topics are so relevant for Microsoft and why they should be relevant for every organization, with demos on accessibility features in W10/Office 365 and the role that AI is playing (Seeing AI app).
The session was very interactive, with many realtime demos, still the PPT has references to what has been showed live.
1.1 INTRODUCTION
In today’s era almost all tasks are digitalized. We have Smartphone in hands and it is nothing less than having world at your fingertips. These days we aren’t even using fingers. We just speak of the task and it is done. There exist systems where we can say Text Dad, “I’ll be late today.” And the text is sent. That is the task of a Virtual Assistant. It also supports specialized task such as booking a flight, or finding cheapest book online from various e- commerce sites and then providing an interface to book an order are helping automate search, discovery and online order operations.
Virtual Assistants are software programs that help you ease your day to day tasks, such as showing weather report, creating reminders, making shopping lists etc. They can take commands via text (online chat bots) or by voice. Voice based intelligent assistants need an invoking word or wake word to activate the listener, followed by the command. For my project the wake word is JIA. We have so many virtual assistants, such as Apple’s Siri, Amazon’s Alexa and Microsoft’s Cortana. For this project, wake word was chosen JIA.
This system is designed to be used efficiently on desktops. Personal assistant software improves user productivity by managing routine tasks of the user and by providing information from online sources to the user. JIA is effortless to use. Call the wake word ‘JIA’ followed by the command. And within seconds, it gets executed.
Voice searches have dominated over text search. Web searches conducted via mobile devices have only just overtaken those carried out using a computer and the analysts are already predicting that 50% of searches will be via voice by
2020.Virtual assistants are turning out to be smarter than ever. Allow your intelligent assistant to make email work for you. Detect intent, pick out important information, automate processes, and deliver personalized responses.
This project was started on the premise that there is sufficient amount of openly available data and information on the web that can be utilized to build a virtual assistant that has access to making intelligent decisions for routine user activities.
Running head SOFTWARE ANALYSIS CONTROL1SOFTWARE ANALYSIS CON.docxtoltonkendal
Running head: SOFTWARE ANALYSIS CONTROL 1
SOFTWARE ANALYSIS CONTROL 12
Tourist Experience At Piazza del Campo KIOSK
James Larkin
Institution Affiliation
Elam, Debora
12-5-17
Contents
Project Outline 3
User Interface Technical Requirements 4
User Interface Human Interaction 5
Deliverables 6
User Interface Design Prototypes 6
User Interface Design Mockups 6
Kiosk Main Menu 6
Smart Main Menu 7
Usability Test Questionnaire……………………………………………………………………………………………………………………8
References 12
Project Outline
The project aims at designing a software system in an attractive tourist city. The system will help the visitors to access various kind of information as well as support from the authority of the attraction site. For example, an English-speaking person visiting a tourist attraction site in another country that doesn’t speak English. He or she will use the system to access various services such as a request for a tour guide.
The city that I will concentrate on is Piazza del Campo, which is the Center of History in Siena, Italy. It holds a large public space which dominating the Siena’s historic center. It presents the beautiful and unique medieval squares with a red paving of brick, clam design of shells, and a wide area. The piazza has nine divisions of pie-shaped wedges, all radiating out from the square’s lowest point that is at the face of the Town Hall, referred to as Palazzo Pubblico. It is surrounded by precious public buildings, residences, and cafes’ that are attractive to many visiting tourists and the locals in the area.
Various major categories for the interface of the user will be supported. They will include firstly, graphical user interface accepting the input via a computer mouse and a keyboard. Secondly, touchscreen interface displays accepting inputs through a touch of the stylus or a finger. Thirdly, conversational interfaces enabling the user to direct the computer using plain text and voice commands rather than graphical elements. Fourthly, gesture interface accepting input in the form of mouse or hand gestures.
The system is intended even to support people with disabilities. The gesture interface will take care of those with hearing impairment. Conversational support means will help people with visual impairment. Others with disabilities, for example, missing fingers can use the graphical user or the touchscreen interfaces.User Interface Technical Requirements
There are many technical requirements for the kiosk. Input devises to support the interaction capabilities will include a computer mouse, a keyboard, a voice receptive control device, a stylus for touching purposes, and a touchscreen. For easy application by the users, a standard keyboard will be used. The voice control should be able to capture sounds appropriately to reduce the probability of error. The touchscreen should be having a display length of twenty-four inches for more comfortable use by all the visitors.
The technical requirements ...
Conversational User Interfaces, Past and FutureCrispin Reedy
How can chatbots learn from existing VUI design? What makes these new interfaces different, and how are they similar? Where do the Alexas and Siris come into the mix? We’ll discuss text-based vs. voice-based conversational user interfaces, and the landscape of Conversational User Interfaces, now and into the future.
The promise of conversational UI – your users already know how to talk to another human, now they can do just that with your product. As a designer, you have many different choices to consider in delivering conversational experiences to your customers – whether it’s through virtual assistants, chat UI or chatbots on messaging platforms.
Come join this workshop where we’ll share our learnings and do some hands-on exercises together to design conversational experiences.
In this workshop we’ll cover:
– Fundamentals of CUI & determining what’s right for your product
– Discussion on ingredients of CUI experiences
– Identifying features and prototyping CUI
– Multimodal CUI & emerging design patterns
UXPA2019 Enhancing the User Experience for People with Disabilities: Top 10 ...UXPA International
An estimated 1.3 billion people globally report limitations in their daily activities due to a disability. When it comes to the physical world, businesses have made progress in accommodating customers with disabilities. But in the digital world, websites lack basic accessibility features such as text alternatives describing images, proper heading level structures so individuals who are blind and use screen readers can understand the content on a webpage, or captioning for multimedia content for individuals who are deaf or are hard of hearing – let alone assistive technology for customers who have trouble using mobile devices due to dexterity limitations that arise from a variety of conditions.
In this session, attendees will:
* Understand people with disabilities (PWDs) and how they use the web
* Learn about common barriers, issues and solutions
* Discover the different testing methodologies and their interdependencies
* Uncover ROI
Rediscovering Accessibility for Future Tech - Everyone is affected!Samir Dash
his is a rediscovery of “Accessibility” in the world of touch-screens and other natural interfaces. With new technology innovation the lines between accessibility technology and Technology for Mass are getting blurred. What used to be a special need is becoming a general need for mass use.Situational Disabilities Use-cases are defining the new age devices, wearable & smart interfaces.
High time we need to rediscover on “accessibility” what we think we have already discovered!
Software for Humans: Anticipating User NeedsSarah Auvil
Presented at IA Summit 2017 in Vancouver, BC, Canada.
1. Practice mindfulness in design
2. Think like an anthropologist
3. Create with accessibility in mind
Anyone should be able to use your website; whether they are blind, deaf, or just have a broken arm. Inclusive design accounts for all of these; it’s a method of thinking that works to provide a more meaningful experience for your entire audience, no matter whom or where they are.
We’ll walk through a brief introduction to accessibility, why accessibility matters, and the code changes required to be accessibility compliant.
Participants will also get a chance to interact with leading accessibility tools and gain first-hand experience of what it is like to use a screen reader, magnifier, and other assistive devices that have a significant impact on how users interact with websites.
Quantitative + Qualitative = User Behavior Analytics
Introducing REAL User Behavior Analytics to your product team to better understand what and how users use your online/mobile applications.
7 Key Principles Designing for VUI/CUI
1. Onboarding the user & help them get started
2. Keep conversation exchanges brief to reduce cognitive load.
3. Examples work better than instructions.
4. Delight without interfering with the tasks.
5. Use explicit confirmation for important actions, & implicit for less risky.
6. Design for failure.
7. Respect the user's privacy and security.
Grow your business with the power of Graph Database. We take a look at how a social graph database could help legal professionals Grow their Legal Practice
User Experience Tools for the UX Professional Isidore Gotto
Being asked all the time on what tools do I use or recommend, I put together this list of common user experience tools that any UX professional should have in their toolbox.
Follow me @igotux
Essentials of Automations: The Art of Triggers and Actions in FMESafe Software
In this second installment of our Essentials of Automations webinar series, we’ll explore the landscape of triggers and actions, guiding you through the nuances of authoring and adapting workspaces for seamless automations. Gain an understanding of the full spectrum of triggers and actions available in FME, empowering you to enhance your workspaces for efficient automation.
We’ll kick things off by showcasing the most commonly used event-based triggers, introducing you to various automation workflows like manual triggers, schedules, directory watchers, and more. Plus, see how these elements play out in real scenarios.
Whether you’re tweaking your current setup or building from the ground up, this session will arm you with the tools and insights needed to transform your FME usage into a powerhouse of productivity. Join us to discover effective strategies that simplify complex processes, enhancing your productivity and transforming your data management practices with FME. Let’s turn complexity into clarity and make your workspaces work wonders!
Epistemic Interaction - tuning interfaces to provide information for AI supportAlan Dix
Paper presented at SYNERGY workshop at AVI 2024, Genoa, Italy. 3rd June 2024
https://alandix.com/academic/papers/synergy2024-epistemic/
As machine learning integrates deeper into human-computer interactions, the concept of epistemic interaction emerges, aiming to refine these interactions to enhance system adaptability. This approach encourages minor, intentional adjustments in user behaviour to enrich the data available for system learning. This paper introduces epistemic interaction within the context of human-system communication, illustrating how deliberate interaction design can improve system understanding and adaptation. Through concrete examples, we demonstrate the potential of epistemic interaction to significantly advance human-computer interaction by leveraging intuitive human communication strategies to inform system design and functionality, offering a novel pathway for enriching user-system engagements.
The Art of the Pitch: WordPress Relationships and SalesLaura Byrne
Clients don’t know what they don’t know. What web solutions are right for them? How does WordPress come into the picture? How do you make sure you understand scope and timeline? What do you do if sometime changes?
All these questions and more will be explored as we talk about matching clients’ needs with what your agency offers without pulling teeth or pulling your hair out. Practical tips, and strategies for successful relationship building that leads to closing the deal.
Pushing the limits of ePRTC: 100ns holdover for 100 daysAdtran
At WSTS 2024, Alon Stern explored the topic of parametric holdover and explained how recent research findings can be implemented in real-world PNT networks to achieve 100 nanoseconds of accuracy for up to 100 days.
UiPath Test Automation using UiPath Test Suite series, part 6DianaGray10
Welcome to UiPath Test Automation using UiPath Test Suite series part 6. In this session, we will cover Test Automation with generative AI and Open AI.
UiPath Test Automation with generative AI and Open AI webinar offers an in-depth exploration of leveraging cutting-edge technologies for test automation within the UiPath platform. Attendees will delve into the integration of generative AI, a test automation solution, with Open AI advanced natural language processing capabilities.
Throughout the session, participants will discover how this synergy empowers testers to automate repetitive tasks, enhance testing accuracy, and expedite the software testing life cycle. Topics covered include the seamless integration process, practical use cases, and the benefits of harnessing AI-driven automation for UiPath testing initiatives. By attending this webinar, testers, and automation professionals can gain valuable insights into harnessing the power of AI to optimize their test automation workflows within the UiPath ecosystem, ultimately driving efficiency and quality in software development processes.
What will you get from this session?
1. Insights into integrating generative AI.
2. Understanding how this integration enhances test automation within the UiPath platform
3. Practical demonstrations
4. Exploration of real-world use cases illustrating the benefits of AI-driven test automation for UiPath
Topics covered:
What is generative AI
Test Automation with generative AI and Open AI.
UiPath integration with generative AI
Speaker:
Deepak Rai, Automation Practice Lead, Boundaryless Group and UiPath MVP
Encryption in Microsoft 365 - ExpertsLive Netherlands 2024Albert Hoitingh
In this session I delve into the encryption technology used in Microsoft 365 and Microsoft Purview. Including the concepts of Customer Key and Double Key Encryption.
Maruthi Prithivirajan, Head of ASEAN & IN Solution Architecture, Neo4j
Get an inside look at the latest Neo4j innovations that enable relationship-driven intelligence at scale. Learn more about the newest cloud integrations and product enhancements that make Neo4j an essential choice for developers building apps with interconnected data and generative AI.
In his public lecture, Christian Timmerer provides insights into the fascinating history of video streaming, starting from its humble beginnings before YouTube to the groundbreaking technologies that now dominate platforms like Netflix and ORF ON. Timmerer also presents provocative contributions of his own that have significantly influenced the industry. He concludes by looking at future challenges and invites the audience to join in a discussion.
Threats to mobile devices are more prevalent and increasing in scope and complexity. Users of mobile devices desire to take full advantage of the features
available on those devices, but many of the features provide convenience and capability but sacrifice security. This best practices guide outlines steps the users can take to better protect personal devices and information.
Removing Uninteresting Bytes in Software FuzzingAftab Hussain
Imagine a world where software fuzzing, the process of mutating bytes in test seeds to uncover hidden and erroneous program behaviors, becomes faster and more effective. A lot depends on the initial seeds, which can significantly dictate the trajectory of a fuzzing campaign, particularly in terms of how long it takes to uncover interesting behaviour in your code. We introduce DIAR, a technique designed to speedup fuzzing campaigns by pinpointing and eliminating those uninteresting bytes in the seeds. Picture this: instead of wasting valuable resources on meaningless mutations in large, bloated seeds, DIAR removes the unnecessary bytes, streamlining the entire process.
In this work, we equipped AFL, a popular fuzzer, with DIAR and examined two critical Linux libraries -- Libxml's xmllint, a tool for parsing xml documents, and Binutil's readelf, an essential debugging and security analysis command-line tool used to display detailed information about ELF (Executable and Linkable Format). Our preliminary results show that AFL+DIAR does not only discover new paths more quickly but also achieves higher coverage overall. This work thus showcases how starting with lean and optimized seeds can lead to faster, more comprehensive fuzzing campaigns -- and DIAR helps you find such seeds.
- These are slides of the talk given at IEEE International Conference on Software Testing Verification and Validation Workshop, ICSTW 2022.
Enchancing adoption of Open Source Libraries. A case study on Albumentations.AIVladimir Iglovikov, Ph.D.
Presented by Vladimir Iglovikov:
- https://www.linkedin.com/in/iglovikov/
- https://x.com/viglovikov
- https://www.instagram.com/ternaus/
This presentation delves into the journey of Albumentations.ai, a highly successful open-source library for data augmentation.
Created out of a necessity for superior performance in Kaggle competitions, Albumentations has grown to become a widely used tool among data scientists and machine learning practitioners.
This case study covers various aspects, including:
People: The contributors and community that have supported Albumentations.
Metrics: The success indicators such as downloads, daily active users, GitHub stars, and financial contributions.
Challenges: The hurdles in monetizing open-source projects and measuring user engagement.
Development Practices: Best practices for creating, maintaining, and scaling open-source libraries, including code hygiene, CI/CD, and fast iteration.
Community Building: Strategies for making adoption easy, iterating quickly, and fostering a vibrant, engaged community.
Marketing: Both online and offline marketing tactics, focusing on real, impactful interactions and collaborations.
Mental Health: Maintaining balance and not feeling pressured by user demands.
Key insights include the importance of automation, making the adoption process seamless, and leveraging offline interactions for marketing. The presentation also emphasizes the need for continuous small improvements and building a friendly, inclusive community that contributes to the project's growth.
Vladimir Iglovikov brings his extensive experience as a Kaggle Grandmaster, ex-Staff ML Engineer at Lyft, sharing valuable lessons and practical advice for anyone looking to enhance the adoption of their open-source projects.
Explore more about Albumentations and join the community at:
GitHub: https://github.com/albumentations-team/albumentations
Website: https://albumentations.ai/
LinkedIn: https://www.linkedin.com/company/100504475
Twitter: https://x.com/albumentations
Securing your Kubernetes cluster_ a step-by-step guide to success !KatiaHIMEUR1
Today, after several years of existence, an extremely active community and an ultra-dynamic ecosystem, Kubernetes has established itself as the de facto standard in container orchestration. Thanks to a wide range of managed services, it has never been so easy to set up a ready-to-use Kubernetes cluster.
However, this ease of use means that the subject of security in Kubernetes is often left for later, or even neglected. This exposes companies to significant risks.
In this talk, I'll show you step-by-step how to secure your Kubernetes cluster for greater peace of mind and reliability.
Observability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdfPaige Cruz
Monitoring and observability aren’t traditionally found in software curriculums and many of us cobble this knowledge together from whatever vendor or ecosystem we were first introduced to and whatever is a part of your current company’s observability stack.
While the dev and ops silo continues to crumble….many organizations still relegate monitoring & observability as the purview of ops, infra and SRE teams. This is a mistake - achieving a highly observable system requires collaboration up and down the stack.
I, a former op, would like to extend an invitation to all application developers to join the observability party will share these foundational concepts to build on:
zkStudyClub - Reef: Fast Succinct Non-Interactive Zero-Knowledge Regex ProofsAlex Pruden
This paper presents Reef, a system for generating publicly verifiable succinct non-interactive zero-knowledge proofs that a committed document matches or does not match a regular expression. We describe applications such as proving the strength of passwords, the provenance of email despite redactions, the validity of oblivious DNS queries, and the existence of mutations in DNA. Reef supports the Perl Compatible Regular Expression syntax, including wildcards, alternation, ranges, capture groups, Kleene star, negations, and lookarounds. Reef introduces a new type of automata, Skipping Alternating Finite Automata (SAFA), that skips irrelevant parts of a document when producing proofs without undermining soundness, and instantiates SAFA with a lookup argument. Our experimental evaluation confirms that Reef can generate proofs for documents with 32M characters; the proofs are small and cheap to verify (under a second).
Paper: https://eprint.iacr.org/2023/1886
Communications Mining Series - Zero to Hero - Session 1DianaGray10
This session provides introduction to UiPath Communication Mining, importance and platform overview. You will acquire a good understand of the phases in Communication Mining as we go over the platform with you. Topics covered:
• Communication Mining Overview
• Why is it important?
• How can it help today’s business and the benefits
• Phases in Communication Mining
• Demo on Platform overview
• Q/A
2. TOPICS
Voice Overview Designing for Voice
Resources & Reference Links
2
• What is Voice?
• Why you should consider adding Voice?
• Voice Only: Pro’s & Con’s
• Voice: Things to Consider
• Introducing Voice to SDLC
• Crawl, Walk, Run Approach
• Intro
• 5 Steps to Designing for Voice before Coding
• 7 Principles for Designing Voice
• Real Life User Conditions
• Error Handling
• Identifying the Problem
• Complexity by Data Inputs
• Voice AI Persona, Personality, Tone …
• Designer Tool-Kit Downloads
• UX Research Result
• Platform Comparison
• Industry Best Practices URLs
4
5
6
7
8
9-10
12
13
14
15
16
17
18
19
21
22
23
24
• Prototyping & Development Tools 25
4. What is Voice?
Voice experience has been around since the 1950’s. Today’s enhancements in
technology & demand for innovation has brought us to the next evolution of human
computer interactions.
In today’s market you may come across all types of terms for Voice; i.e. voice
assistant, voice-enabled speakers, Voice UI (VUI), Conversational UI (CUI), Artificial
Intelligence (AI), etc. All you need to understand is that its software that listens out
for grammatical details and attempts to recognize sentence structure to understand
the context and meaning of instructions.
Voice User Interface (VUI) is the next generation of human computer
interaction. VUIs allow people to use the power of their voice to interact with
computers/systems, instead of using their hands with a mouse, keyboard, or touch
screen.
This method of interacting with your product & services has unlimited potential.
* Apple’s Siri, Google’s Assistant, Amazon’s Alexa, and Microsoft’s Cortana are all prime examples of
consumer level AI that can respond to a request, control some level of physical devices, help give
options based on internet searches, and more.
** IBM’s Watson, a business to business solution, takes AI to another level by adding the ability to
make predictions, assumptions, and even some reasoning to computational outcomes.
4
“
Technology has Arrived
as of June 2017 Amazon Alexa
has grown to 15,000 skills &
98% speech recognition
accuracy.
Tech giants like Google, IBM,
Apple, Cisco and even Slack are
all investing into voice
technology.
“Hello, I’m _____.
5. Why should you consider introducing
Voice Assistance to your products &
services?
• Simplification / Ease of Use - “everyone knows how to talk…”
• Speed & Convenience in Hands Free / Screen Free Situations
• Multi-Tasking - working on one file requesting info from another
* Taking it beyond Voice Only an introducing multi-modal Voice experiences with a new Voice GUI,
we now bring contextual navigation, orientation, personalization & additional benefits to users.
** Voice assistants can help with human empathy as humans have a difficult time understanding tone
via the written word alone. Voice, which includes tone, volume, intonation, personality and rate of
speech conveys a great deal of information.
5
“
Introducing Voice Assistance to
your product & services will one
day help improve your overall
client experience.
Key benefits to users & clients:
Problem
Today it takes an avg. new user multiple attempts, endless
amount of training to get familiar with your product & services.
Fact: majority of call volume across all business types revolves
around – “how do I…”
“Voice is being seen as the
future of software & computer
interaction.”
“Hello, I’m _____.
6. 6
Voice Only: Pro’s vs. Con’s
Pro’s
Get a specific question addressed more
easily and faster; Ask/Command & Done!
Great for specific info/data lookup and data
analysis tasks, that are either buried or not
accessible via current navigation
Focused conversation & limiting number of
choices lends to speed & confidence with
decision making
Handy when user situation requires a
hands-free setup
More Natural interactions – “Humanize the
experience”
Con’s
✗ May not be obvious to user that they can
initiate conversation or what/how to ask
✗ User may need to adjust their work
environment
✗ High Risk of exceeding cognitive load to
process voice response
✗ Not suitable for complex tasks that require
visual guidance, user input or involve many
choices
✗ Privacy & security concerns with speaking
out loud
7. 7
Voice: Things to Consider
Benefit of Introducing Voice + GUI Experience “Multi-Modal Interactions“– combining two or more
modes of interaction.
• Multi-Modal allows you to compensate for cognitive memory weaknesses & task complexity, through current
visual interface or by introducing a new Voice GUI overlay.
Examples:
Leverage a voice/chat based experience.
Visual Confirmations (Hound app does a great job with only voice input, responses are voice + visual. )
UX Challenges to Overcome
• User Input - Speech 2 Text Recognition (based on technology selection constraints)
• Type of Data Input – will vary based on complexity of Use Case & Task
• Privacy - Speaking Out Loud Sensitive Information (system needs to be able to identify sensitive information and not respond
with audio)
The challenge is making the experience more natural, tackling the wide variety of ambiguity that may occur.
8. 8
Introducing Voice to your SDLC
A conversational or natural language user interface is a method of interacting with computers through text or voice
commands.
With good speech recognition, accurate instruction detection & quick responses, voice interaction is starting to feel natural.
“
“Hello, I’m _____.
11. 11
Designing for Voice
Voice User Interface (VUI) systems understand voice
commands, and respond either by speaking back, or by showing
a visual response.
The difference between Voice-Only interactions & ‘multi-modal’
means more information can be conveyed to the user than on
voice only devices. Multi-modal interfaces could help drive huge
advances in the workplace.
While Voice-Only interactions benefit the user in hands free
situations and providing quick answers to short commands.
Adding voice to any system will give it the sense of life, personality, &
character. Moving forward with voice, we must think about how verbal
conversations sound, feel, and flow.
““Using a VUI should feel as natural as speaking, and
listening, to any other human.” “Hello, I’m _____.
12. 5 Steps to Designing a Voice Experience before #Coding
1. Discover
What problem can voice solve?
How will voice provide value to your
users? i.e. consider all environments
2. Define
Voice Persona – Tone, Voice,
Personality…
Evaluate Capabilities – Will voice be a
good fit for this use case or task?
i.e. start with introducing 1 to 5 capabilities.
- Download Voice Evaluation Worksheet
3. Detail Conversation Flow
Begin with the “Happy Path” a
conversational flow in which the voice app
can respond to the users request without
any expectations or error. Then move on to
detailing the conversation flow for
exceptions and errors. - Download Design
Kit
4. Describe Alternative
Words & Phrases for NLP
People don’t always use the same
words to say the same thing and voice
apps need to be taught that. Phrase-
mapping is an exercise to teach voice
apps to accommodate variation in the
way users phrase their requests.
5. Refine
Test, learn, measure &
refine with user research.
12
“
Steps to VUI
Discover
Define
DetailsDescribe
Refine
“Hello, I’m _____.
14. interrupted
self correction
cut off to soon
background noise
confused
too many choices
didn’t understand
talked too long
speaks in other termscoughs
hesitation
connection cuts off
REAL LIFE
USER
CONDITIONS
}
language
accents
soft spoken
“It’s hard enough to
speak with another
human.”
culture
jumps from one thought to another
14
Things to Consider when Designing for Voice
privacy
15. I Don’t Understand You
When a so-called “error”
occurs in a conversation, it
should be treated simply as a
new turn in the dialog, only
with different conditions.
15
Error Handling
“
Example:
• I did not understand your request. Did
you say A or B?
• I currently am not able to process your
request, would you prefer A or B?
• I am not able to process your request.
Would you like me to connect you with a
Service Representative?
A
B
?
“Hello, I’m _____.
16. GET STARTED WITH ASKING: What user problem are you looking to solve?
Identifying if Voice UI experience is the right solution
• First, identify your intended user persona & personality.
• Then, layout their typical journey when using your application.
• Next, identify areas where Voice will benefit the user.
• Then, identify what other personas will benefit with the same or similar Voice experience.
• Design, porotype and test – more on this later
1. Difficulty finding or navigating applications. i.e. how do I… Where is… Shortcuts...
2. What’s my status? i.e. Did my package ship?
3. What is __________ phone number?
4. I have a specific question on ___________.
5. Look up _________ information or data.
6. Show me _________ report.
7. Calculate total or difference between _________ & _________.
Examples where Voice can make a BIG difference assisting users today.
Note: where possible try to use data/analytics first to identify areas of applications that are
most frequently used, have the largest amount of call volume. Then use the voice task
evaluation worksheet to evaluate.
16
“
Our GOAL is to build a
complete & seamless
Voice Experience
across all your
products.
Voice UI & Conversational
UI Design Kit - Download
💡
“Hello, I’m _____.
17. Complexity by Data Input Types on Users via Voice/Conversational
UI
TYPES OF DATA INPUT
VOICE ONLY
(standalone)
VOICE + GUI
(Multi-modal Exp.)
CONVERSATIONAL UI
CHAT / TEXT
PRO-ACTIVE
CONVERSATIONAL UI w/ AI
(Multi-modal Exp.)
On/Off
(checkbox, switch)
Easy Easy Easy Easy
Select one or multiple
from options offered
(radio options, dropdown menus,
checkboxes, cards, multi-select)
Difficult
(cognitive load with visual
aid)
Easy,
(Multi-Mode two or more
modes of interaction. GUI used
for data entry, selection,
validation, confirmation)
Difficult
Presentation of choices needs to be
limited; especially multiple choice
Difficult
Presentation of choices needs to be
limited; especially multiple choice)
Structured fields
(dates, currency, etc.)
Difficult
(inconsistent voice
recognition performance)
Easy
(Multi-Mode two or more
modes of interaction. GUI used
for data entry, selection,
validation, confirmation)
Easy, but could be tedious when
multiple fields are involved.
Recommend large input forms to be
designed in traditional UI Format.
Easy, but could be tedious when
multiple fields are involved.
Recommend large input forms to be
designed in traditional UI Format.
Text fields with variable
data
(email address, people names,
addresses)
Difficult
(voice recognition of
variable data)
Easy
(Multi-Mode two or more
modes of interaction. GUI used
for data entry, selection,
validation, confirmation)
Easy, but could be tedious when
multiple fields are involved.
Recommend large input forms to be
designed in traditional UI Format.
Easy, but could be tedious when
multiple fields are involved.
Recommend large input forms to be
designed in traditional UI Format.
17
18. Characteristics of Voice for A.I.
1. Tone of Voice
2. Gender of Voice
3. Personality
4. Character
5. Word & Phrase Choices
6. Functional Design
7. Style & Technique
Creating the Voice of A.I. for your Product
Base your characteristics on:
Your user population
Their needs
The imagery & qualities associated with
your brand
18
“
“Hello, I’m _____.
20. Reference & Resource Links
20
We have created several downloadable tool-kits for you to get started with adopting Voice/Conversation UI
experiences on your products.
• Customer Journey & Scripting for Voice – will assist you with facilitating stakeholder discussions in evaluating
where in your customer journey Voice UI would make an impact from Product Discovery, Initial Setup of new Client, First
Benefit/Use, Re-Use. As well samples on designing conversational UI with scripts and prototype references. – download
• Voice Use Case / Task Evaluation Worksheet – helps you quickly evaluate your product use cases for Voice prior to
designing. – download
• Voice Personality Development – expanding on traditional personas, looking deeper into user personality traits,
character and into your AI Personality.
21. Reference & Resource Links
Industry UX design best practices and heuristics for voice & conversational UI.
Amazon:
https://developer.amazon.com/designing-for-voice/design-process/
Apple Siri:
https://developer.apple.com/sirikit/
Google:
https://developers.google.com/actions/design/checklist
https://developers.google.com/actions/design/principles
Microsoft:
https://docs.microsoft.com/en-us/cortana/skills/design-principles
Samsung Bixby:
http://bixby.samsung.com/
21
22. Platform Comparison
AVAILABLE ON PRO’s CON’s
Amazon Skills Standalone, Mobile
(Nov.2017 announced
Alexa for business)
95-98% accuracy; languages US, Europe,
German, Japanese
…
- To Be Delivered (TBD)
Apple Siri Kit iPhone, iPad, mac,
macbook, iWatch,
HomePod
88% accuracy; multi-language supported
…
…
Google
Assistant
Phone, tablet, laptop,
standalone devices &
web
95-98% accuracy; multi-language supported
…
…
Microsoft
Cortana
Laptop, desktop,
standalone devices
95-98% accuracy; multi-language supported
…
…
Samsung
Bixby
Phone, tablet, TV - To Be Delivered (TBD)
Company
Virtual
Assistant
Company Ecosystem
of products &
services online or
native app.
- To Be Delivered (TBD)
Other
platforms…
As of Oct. 2017 22
23. Google Assistant
https://developers.google.com/assistant/sdk/overview |
https://developers.google.com/assistant/sdk/
Google Speech - https://cloud.google.com/speech/
Apple Siri Kit - https://developer.apple.com/sirikit/
Microsoft Cortana - https://developer.microsoft.com/en-us/cortana
Microsoft Bing Speech API - https://azure.microsoft.com/en-
us/services/cognitive-services/speech/
UMP Speech Recognition - https://docs.microsoft.com/en-
us/windows/uwp/input-and-devices/speech-recognition
Microsoft Cortana Skills Kit - https://developer.microsoft.com/en-us/cortana
Aug 2017 reached 5.1% error rate -
https://techcrunch.com/2017/08/20/microsofts-speech-recognition-system-
hits-a-new-accuracy-milestone/
Finnish IT company Blucup wanted to find a way for its salespeople to input
customer data and generate leads while in the
field. https://customers.microsoft.com/en-us/story/blucup-discrete-
manufacturing-cognitive-services
Samsung Bixby - http://developer.samsung.com/home.do
https://news.samsung.com/global/bixby-a-new-way-to-interact-with-your-
phone
Amazon Alexa -https://developer.amazon.com/alexa
Voice Design Guide - https://developer.amazon.com/designing-for-
voice/
Amazon - https://developer.amazon.com/designing-for-voice/
Google - https://developers.google.com/actions/design/
Facebook - https://developers.facebook.com/docs/messenger-
platform/introduction/general-best-practices
Slack - https://api.slack.com/best-practices
Apple - https://developer.apple.com/ios/human-interface-
guidelines/overview/themes/
Paid Vendors
KeenResearch - http://keenresearch.com/
DialogFlow - Conversational UX Platform for Web, Mobile and IoT -
https://dialogflow.com/
SpeechMatics - https://www.speechmatics.com/
Open Source Vendors
SoundHound “Hound” - https://soundhound.com/hound
CMU Sphinx - https://soundhound.com/hound
OpenEars - https://www.politepix.com/openears/
iSpeech - https://www.ispeech.org/
23
24. Prototyping & Development Tools
None Developer
• Wizard of Oz – set of microphones and speakers
• Sayspring.com (voice only, can be connected to Amazon and Google)
• InvisionApp, Axure, Keynote etc. (used to create GUI part of the experience)
Development Skills Required
• Wit.ai
• Dialogflow.com
• SoundHound.com ‘Houndify’
• Amazon Alexa Skills
• Google Cloud Platform
• Apple Speech Recongnition
• IBM Watson – Speech to Text and Text to Speech
Voice Analytics
• VoiceLabs.com
24
Editor's Notes
Created to help product teams, designers & researchers ramp up on the benefits of Voice, how to design for Voice and how to test voice.
To build a Voice Assistant we need to introduce:
1- Speech Recognition
2- Natural Language/Learning Processing
3- Machine Learning
4- Artificial Intelligence
5- Automation
Discover
How will voice provide value to my users?
Take into consideration why and where people use voice apps.
People use voice interfaces because of the benefits of hands-free interaction, the speed of interaction and the ease of use.
Define
Persona – Tone, Voice, Personality
Capabilities – What would benefit your users on a voice-driven device in a shared space?
Start with introducing 1 to 5 capabilities
Detail Conversation Flow
Begin with the “Happy Path” a conversational flow in which the voice app can respond to the users request without any expectations or error.
Then move on to detailing the conversation flow for exceptions and errors.
Use the Voice UI & Conversational UI Design Spreadsheet
Then with your team play out the conversation out loud and debate.
Describe Alternate Words & Phrases
People don’t always use the same words to say the same thing and voice apps need to be taught that. Phrase-mapping is an exercise to teach voice apps to accommodate variation in the way users phrase their requests.
For each path you detailed in step 3 now think of different ways users could word those requests.
Refine
Test the voice interface with users.
In Conversations, there are no “Errors” ?
We need to handle these conditions
- Error handling
- Decision support
- Ability to start over or go back
We need to handle these conditions
- Error handling
- Decision support
- Ability to start over or go back
Voice Only - We can speak 3 to 4 times faster than we type.
Chat / Text - More than 2.1 billion users today use a social messaging app. – Portio research
Smart Assistant companies claim their speech recognition software is now at 5.5% word error rate. Humans average around 5.1%.
Additional deeper R&D required into:
Limited to how open the platform is & capabilities
How does platform presence intersect with your app & brand experiences?
Discovery & voice command recall
Setting the right user expectations related to domain coverage