This document discusses voice recognition technology and Google Assistant. It begins with an introduction to voice recognition, its history, and how voice recognition systems work by matching voice segments to phonemes and words. It then focuses on Google Assistant, describing it as an AI assistant that can search the internet, schedule events, and control device settings. The document outlines where Google Assistant can be found, what it can do, and how developers can build integrations for it using the Google Assistant Library and GRPC API.
4. INTRODUCTION :-
Voice recognition is a computer software program or hardware device
with the ability to decode the human voice.
Voice recognition is also known as speaker recognition.
Voice recognition evaluates the voice biometrics of an individual, such
as the frequency and flow of their voice and their natural accent.
Voice recognition is commonly used to operate a device, perform
commands, or write without having to use a keyboard, mouse, or press any
buttons.
8. …CONTINUE
When we speak, we create vibrations in the air. The analog-to-digital
converter (ADC) translates this analog wave into digital data that the
computer can understand. To do this, it samples, normalizes the sound by
taking precise measurements of the wave at frequent intervals.
9. …CONTINUE
The signal is divided into small segments as short as a few hundredths of
a second, or even thousandths .
The program then matches these segments to known phonemes in the
appropriate language.
There are roughly 40 phonemes in the English language ,while other
languages have more or fewer phonemes.
The program examines phonemes in the context of the other phonemes
around them and compares them to a large library of known words, phrases
and sentences.
The program then determines what the user was probably saying and
either outputs it as text or issues a computer command.
11. ...CONTINUE
The smallest element of any language is said to be a phoneme.
According to this model, a phoneme is treated as a link in a chain, and
the completed chain represents a word.
To determine the next phoneme, the chain forms branches of different
sounds that can come next, a probability score is given to each branched
off phoneme based on the built in dictionary.
Thus, the complete word is finally determined.
15. WHAT IS GOOGLE ASSISTANT:-
Google Assistant is a virtual personal assistant developed by google
and announced at its developer conference in May 2016.
Google Assistant is an upgrade of Google Now, which can engage in
two-way conversation.
Google
now
16. WHERE YOU WILL FIND GOOGLE
ASSISTANT:-
GOOGLE ASSISTANT INITIALLY DEBUTED AS PART OF:
GOOGLE’S MESSAGING APP “ALLO” FOR ANDROID AND
IPHONE.
ITS VOICE-ACTIVATED SPEAKER “GOOGLE HOME”.
AND INITIALLY IN GOOGLE’S “PIXEL , PIXEL XL” SMART
PHONES.
17. WHAT GOOGLE ASSISTANT CAN DO?
IT IS ABLE TO SEARCH THE INTERNET, SCHEDULE EVENTS
AND ALARMS, ADJUST HARDWARE SETTINGS ON THE
USER'S DEVICE, AND SHOW INFORMATION FROM THE
USER'S GOOGLE ACCOUNT.
GOOGLE HAS ALSO ANNOUNCED THAT THE ASSISTANT
WILL BE ABLE TO IDENTIFY OBJECTS AND GATHER VISUAL
INFORMATION THROUGH THE DEVICE'S CAMERA, AND
SUPPORT PURCHASING PRODUCTS AND SENDING MONEY.
18. …CONTINUE:-
YOU CAN ALSO USE GOOGLE ASSISTANT TO CONTROL
MOST OF THE PHONE SETTINGS FOUND IN THE
NOTIFICATION SHADE. LIKE TO OPEN WI-FI , TORCH , TO
CONTROL BRIGHTNESS ETC.
19. DEVELOPER SUPPORT FOR GOOGLE
ASSISTANT:-
IN DECEMBER 2016, GOOGLE LAUNCHED A DEVELOPER PLATFORM
FOR THE GOOGLE ASSISTANT CALLED "ACTIONS ON GOOGLE“.
IN MARCH 2017, GOOGLE ADDED NEW TOOLS FOR DEVELOPING
ON “ACTIONS ON GOOGLE” TO SUPPORT THE CREATION OF
GAMES FOR THE GOOGLE ASSISTANT.
IN APRIL 2017, A SOFTWARE DEVELOPMENT KIT (SDK) WAS
RELEASED, ALLOWING THIRD-PARTY DEVELOPERS TO BUILD THEIR
OWN HARDWARE THAT CAN RUN THE GOOGLE ASSISTANT. IT HAS
BEEN INTEGRATED INTO ”RASPBERRY PI”.
20. EMBED THE GOOGLE ASSISTANT IN ANY
DEVICE:-
A DEVELOPER PREVIEW OF THE SDK IS AVAILABLE TODAY FOR
EVERYONE INTERESTED IN WORKING WITH PLATFORMS SUCH AS
THE RASPBERRY PI 3.
THE SDK PROVIDES TWO OPTIONS FOR INTEGRATING THE
ASSISTANT INTO YOUR DEVICE:
1. THE GOOGLE ASSISTANT LIBRARY.
2. THE GOOGLE ASSISTANT GRPC API.
21. …CONTINUE:-
THE LIBRARY OF GOOGLE ASSISTANT IS WRITTEN IN PYTHON AND
IT IS SUPPORTED ON THE DEVICES WITH LINUX-ARMV7L AND LINUX-
X86_64 ARCHITECTURES (LIKE THE RASPBERRY PI 3 B AND UBUNTU
DESKTOPS).
THE GRPC API IS FOR OTHER PLATFORMS AND GENERATE
BINDINGS FOR LANGUAGES LIKE GO, JAVA (INCLUDING SUPPORT
FOR ANDROID THINGS), C#, NODE.JS, AND RUBY TO GIVE YOU THE
FLEXIBILITY YOU NEED.
22. GOOGLE ASSISTANT INTEGRATION:-
GOOGLE ASSISTANT CAN BE INTEGRATED IN MANY FORMS:
YOU CAN MAKE YOUR OWN PROTOTYPE DEVICE WITH GOOGLE
ASSISTANT BUILT IN, LIKE “ROBOTIC MOCKTAILS MIXER” ETC. USING
“THE GOOGLE ASSISTANT LIBRARY” AND “RASPBERRY PI EMBED
SYSTEM”.
YOU CAN ALSO MAKE YOUR OWN ASSISTANT APP USING “API.AI”
AND “ACTIONS ON GOOGLE”, WITH WHICH YOU CAN INTERACT VIA
IN-BUILT GOOGLE ASSISTANT.