Artificial Intelligence- An Introduction
• Artificial Intelligence is composed of two words, Artificial and Intelligence, where Artificial
means "man-made" and Intelligence means "thinking power"; hence AI means "a man-made
thinking power."
• Artificial Intelligence exists when a machine has human-like skills such as learning, reasoning,
and problem solving.
• Autonomous planning and scheduling of tasks aboard a spacecraft
• Beating Garry Kasparov in a chess match
• Steering a driverless car
• Understanding language
• Robotic assistants in surgery
• Monitoring trades in the stock market to detect insider trading
Problem solving
• Problem-solving agents:
• In Artificial Intelligence, search techniques are general-purpose problem-solving methods. Rational
agents, or problem-solving agents, mostly use these search strategies or algorithms to
solve a specific problem and return the best result.
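
As a minimal sketch of one such search strategy, the Python example below uses breadth-first search to find a shortest path through a small state space. The graph, its node names, and the function name breadth_first_search are illustrative assumptions and do not come from the slides.

```python
from collections import deque


def breadth_first_search(graph, start, goal):
    """Return a path with the fewest steps from start to goal, or None."""
    frontier = deque([[start]])   # queue of partial paths still to extend
    visited = {start}             # states already reached
    while frontier:
        path = frontier.popleft()
        node = path[-1]
        if node == goal:
            return path
        for neighbor in graph.get(node, []):
            if neighbor not in visited:
                visited.add(neighbor)
                frontier.append(path + [neighbor])
    return None


# Hypothetical state space: each key lists the states reachable in one step.
GRAPH = {
    "A": ["B", "C"],
    "B": ["D"],
    "C": ["D", "E"],
    "D": ["F"],
    "E": ["F"],
    "F": [],
}

if __name__ == "__main__":
    print(breadth_first_search(GRAPH, "A", "F"))
```

Breadth-first search explores states in order of distance from the start, so the first path that reaches the goal uses the fewest steps. Other strategies trade this guarantee for speed or memory.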
An agent is anything that can gather information about its
environment and take action based on that information.
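
The idea of an agent that senses its environment and then acts can be sketched in a few lines of Python. The thermostat scenario, the function name thermostat_agent, and the target and tolerance values are all hypothetical, chosen only to illustrate the percept-to-action mapping.

```python
def thermostat_agent(percept, target=21.0, tolerance=0.5):
    """Map a temperature reading (the percept) to an action."""
    if percept < target - tolerance:
        return "heat"
    if percept > target + tolerance:
        return "cool"
    return "idle"


if __name__ == "__main__":
    # Each reading is information gathered from the environment.
    for reading in [18.0, 21.2, 24.5]:
        print(reading, "->", thermostat_agent(reading))
```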
• A speech-capturing device: It consists of a microphone, which converts sound waves into electrical
signals, and an analog-to-digital converter, which samples and digitizes the analog signal to produce
data that the computer can process.
• A digital signal module or processor: It processes the raw speech signal, for example by converting
it to the frequency domain and retaining only the required information.
• Preprocessed signal storage: The preprocessed speech is stored in memory for the later stages of
speech recognition.
• Reference speech patterns: The system holds predefined speech patterns or templates in memory,
to be used as the reference for matching.
• Pattern-matching algorithm: The unknown speech signal is compared with the reference speech
patterns to determine the actual words or pattern of words.
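
The matching step can be sketched as a nearest-template lookup. This is a simplified illustration, not the method of any particular recognizer: the three-number feature vectors, the word list, and the names REFERENCE_PATTERNS and match_pattern are assumptions, and real systems compare much richer features.

```python
import math

# Hypothetical reference patterns: one short feature vector per stored word.
REFERENCE_PATTERNS = {
    "yes": [0.9, 0.1, 0.3],
    "no": [0.2, 0.8, 0.5],
    "stop": [0.5, 0.5, 0.9],
}


def euclidean_distance(a, b):
    """Straight-line distance between two equal-length feature vectors."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def match_pattern(unknown, references=REFERENCE_PATTERNS):
    """Return the word whose stored pattern is closest to the unknown one."""
    return min(
        references,
        key=lambda word: euclidean_distance(unknown, references[word]),
    )


if __name__ == "__main__":
    print(match_pattern([0.85, 0.2, 0.25]))
```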
Speech can be seen as an acoustic waveform, i.e., a signal carrying message information. This acoustic
waveform is converted to an analog electrical signal by the microphone. The analog-to-digital converter
converts this analog signal to digital samples by taking precise measurements of the wave at discrete
intervals.
The digitized signal is a stream of samples taken 16,000 times per second. In this raw form it is
not suitable for speech recognition, because patterns cannot be easily located. To
extract the actual information, the signal is converted from the time domain to the frequency domain.
This is done by the digital signal processor using the FFT (fast Fourier transform). The digital signal is
analyzed in segments of 1/100th of a second, and the frequency spectrum of each segment is
computed. In other words, the digitized signal is split into short segments, each described by its frequency amplitudes.
Each segment, or frequency graph, represents one of the different sounds made by human beings. The
computer matches the unknown segments against the stored phonetics of the particular
language.
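
The framing and frequency-domain steps described above can be sketched as follows. The sketch uses a naive discrete Fourier transform rather than an FFT; both produce the same spectrum, and the FFT is simply a faster way to compute it. The synthetic 1000 Hz tone stands in for speech, and the function names are illustrative assumptions.

```python
import cmath
import math

SAMPLE_RATE = 16000               # samples per second, as stated in the text
FRAME_SIZE = SAMPLE_RATE // 100   # 160 samples = 1/100th of a second


def split_into_frames(samples, frame_size=FRAME_SIZE):
    """Cut the sample stream into consecutive, non-overlapping frames."""
    last_start = len(samples) - frame_size
    return [samples[i:i + frame_size]
            for i in range(0, last_start + 1, frame_size)]


def magnitude_spectrum(frame):
    """Naive DFT: magnitude of each frequency bin up to half the sample rate."""
    n = len(frame)
    spectrum = []
    for k in range(n // 2 + 1):
        total = sum(frame[t] * cmath.exp(-2j * math.pi * k * t / n)
                    for t in range(n))
        spectrum.append(abs(total))
    return spectrum


def dominant_frequency(frame, sample_rate=SAMPLE_RATE):
    """Frequency in Hz of the strongest bin in the frame."""
    spectrum = magnitude_spectrum(frame)
    peak_bin = max(range(len(spectrum)), key=spectrum.__getitem__)
    return peak_bin * sample_rate / len(frame)


if __name__ == "__main__":
    # Synthetic stand-in for speech: 0.05 s of a 1000 Hz sine wave.
    tone = [math.sin(2 * math.pi * 1000 * t / SAMPLE_RATE) for t in range(800)]
    frames = split_into_frames(tone)
    print(len(frames), "frames; strongest frequency:",
          dominant_frequency(frames[0]), "Hz")
```

With 160 samples per frame at 16,000 samples per second, each frequency bin is 100 Hz wide, so this frame length resolves the spectrum only coarsely.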
The performance of a speech recognition system depends on the following factors:
• Isolated words: There needs to be a pause between consecutive words, because continuous
words can overlap, making it difficult for the system to tell where a word starts or
ends.
• Single speaker: Several speakers giving speech input at the same time cause overlapping
signals and interruptions. Most speech recognition systems in use are speaker-dependent.
• Vocabulary size: Large vocabularies are harder to pattern-match than small ones,
because a small vocabulary is less likely to contain ambiguous words.
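
To illustrate why pauses help, the sketch below finds word boundaries by looking for runs of loud frames separated by silence. The per-frame energy values, the threshold, and the function name find_word_segments are hypothetical; the slides do not specify how a system detects the pauses.

```python
def find_word_segments(frame_energies, threshold=0.1):
    """Return (start, end) frame index pairs for runs louder than threshold.

    The end index is exclusive. Frames at or below the threshold count
    as silence.
    """
    segments = []
    start = None
    for index, energy in enumerate(frame_energies):
        if energy > threshold and start is None:
            start = index                      # a word begins
        elif energy <= threshold and start is not None:
            segments.append((start, index))    # the word ends at the pause
            start = None
    if start is not None:                      # word runs to the end of input
        segments.append((start, len(frame_energies)))
    return segments


if __name__ == "__main__":
    energies = [0.0, 0.5, 0.7, 0.02, 0.01, 0.6, 0.4, 0.3, 0.0]
    print(find_word_segments(energies))
```

Without the low-energy frames between the two bursts, the function would report a single segment, which is the overlap problem that continuous speech causes.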
LEXICON MODEL, ACOUSTIC MODEL, & LANGUAGE MODEL
Lexicon
The lexicon is the first step in decoding speech. Creating a comprehensive lexicon for an
ASR system involves including the fundamental elements of both spoken language (the audio input the
system receives) and written vocabulary (the text the system sends out).
Acoustic Model
Acoustic modeling involves separating an audio signal into small time frames. Acoustic models analyze
each frame and provide the probability of each phoneme occurring in that section of audio. Simply
put, acoustic models aim to predict which sound is spoken in each frame.
Language Model
Today’s ASR systems employ natural language processing (NLP) to help computers understand the
context of what a speaker says. Language models recognize the intent of spoken phrases and use that
knowledge to compose word sequences. They operate in a similar way to acoustic models by using
neural networks trained on text data to estimate the probability of which word comes next in a phrase.
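
The notion of estimating which word comes next can be sketched with simple word-pair counts. This sketch deliberately swaps the neural network for a count-based bigram model, which is far simpler than what current ASR systems use; the tiny training corpus and the function names are assumptions made for illustration.

```python
from collections import Counter, defaultdict


def train_bigram_model(sentences):
    """Count how often each word follows each other word."""
    counts = defaultdict(Counter)
    for sentence in sentences:
        words = sentence.lower().split()
        for current_word, next_word in zip(words, words[1:]):
            counts[current_word][next_word] += 1
    return counts


def next_word_probabilities(counts, word):
    """Relative frequency of each word seen after `word` in training."""
    followers = counts.get(word, Counter())
    total = sum(followers.values())
    if total == 0:
        return {}
    return {w: c / total for w, c in followers.items()}


if __name__ == "__main__":
    # Hypothetical training text.
    corpus = ["recognize speech", "recognize speech today", "recognize faces"]
    model = train_bigram_model(corpus)
    print(next_word_probabilities(model, "recognize"))
```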
Together, the lexicon, acoustic model, and language model enable ASR systems to make
close-to-accurate predictions about the words and sentences in an audio input.
In the simplest terms, speech recognition occurs when a computer receives audio input
from a person speaking, processes that input by breaking down the various components of
speech, and then transcribes that speech to text.
Some ASR systems are speaker-dependent and must be trained to recognize particular
words and speech patterns. These are essentially the voice-recognition systems used in
your smart devices. You need to say specific words and phrases into your phone before the
ASR-powered voice assistant starts working in order for it to learn to identify your voice.
Other ASR systems are speaker-independent. These systems do not require any training by the
individual user. Speaker-independent systems can recognize spoken words regardless of the
speaker, which makes them practical solutions for business applications like
interactive voice response (IVR).
From speech recognition’s mid-twentieth-century origins to its multi-industry applications today, the use cases for ASR
technology are far-reaching. ASR has made it out of computer science laboratories and is now integrated into our everyday
lives.
• Voice assistants: According to a 2020 survey conducted by NPR and Edison Research, 63% of respondents said they use a
voice assistant. The ability to use voice commands to help complete tasks like opening mobile apps, sending a text message,
or searching the web affords users a greater level of convenience.
• Language learning: For people engaged in self-guided language study, apps using speech-recognition tools put them a
step closer to a comprehensive learning experience. Apps like Busuu and Babbel use ASR
technology to help students practice their pronunciation and accents in their target languages. Using these apps, a student
speaks into their phone or computer in their target language. The ASR software listens to that voice input, analyzes it, and
tells the learner whether it matches what the system identifies as the correct pronunciation. If the student’s voice input doesn’t
match, the app points out the mispronunciation.
• Transcription services: One of the first widespread use cases of ASR was the simple transcription of speech. Speech-to-text
services offer a level of convenience in many contexts and open the door to improved audio and video accessibility.
Health care practitioners use dictation products like Dragon NaturallySpeaking to help them take hands-free notes while
attending to patients. ASR captioning also allows for real-time transcription of live video, which lets a broader audience
access the media.
• Call centers: ASR is crucial for automating processes in businesses with extensive customer support demands. With
an influx of callers, companies need a way to efficiently handle a vast amount of customer communication. ASR technology is
one of the main mechanisms involved in smart IVR, a system that automates routine inbound communications as well as
• Imprecision and false interpretations
• Time and lack of efficiency
• Accents and local differences
• Background noise and loud environments
• Privacy and data security
Aravali College of Engineering And Management
Jasana, Tigoan Road, Neharpar, Faridabad, Delhi NCR
Toll Free Number : 91- 8527538785
Website : www.acem.edu.in

Artificial Intelligence- An Introduction

  • 2. 2 oArtificial Intelligence is composed of two words Artificial and Intelligence, where Artificial defines "man-made," and intelligence defines "thinking power", hence AI means "a man-made thinking power.“ oArtificial Intelligence exists when a machine can have human based skills such as learning, reasoning, and solving problems
  • 3. 3
  • 4. 4 •Autonomous planning and scheduling of tasks aboard a spacecraft •Beating Gary Kasparov in a chess match •Steering a driver-less car •Understanding language •Robotic assistants in surgery •Monitoring trade in the stock market to see if insider trading is going on
  • 5.
  • 6. Problem solving. Problem-solving agents: In Artificial Intelligence, search techniques are universal problem-solving methods. Rational agents, or problem-solving agents, mostly use these search strategies or algorithms to solve a specific problem and provide the best result.
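As a minimal sketch of search used as a problem-solving method, the example below runs breadth-first search over a small graph. The graph `ROUTES` and the function name `breadth_first_search` are hypothetical and chosen only for illustration; "best result" here is taken to mean the path with the fewest steps, which is one assumption among several possible ones.

```python
from collections import deque


def breadth_first_search(graph, start, goal):
    """Return a path with the fewest steps from start to goal, or None."""
    frontier = deque([[start]])   # queue of partial paths, oldest first
    visited = {start}             # states already placed on the frontier
    while frontier:
        path = frontier.popleft()
        state = path[-1]
        if state == goal:
            return path
        for nxt in graph.get(state, []):
            if nxt not in visited:
                visited.add(nxt)
                frontier.append(path + [nxt])
    return None


# Hypothetical state graph: each key lists the states reachable in one step.
ROUTES = {
    "A": ["B", "C"],
    "B": ["D"],
    "C": ["D", "E"],
    "D": ["F"],
    "E": ["F"],
    "F": [],
}

print(breadth_first_search(ROUTES, "A", "F"))
```

Because the frontier is a first-in, first-out queue, shorter paths are always expanded before longer ones, so the first path that reaches the goal has the fewest steps.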
  • 7. An agent is 'anything' that can gather information about its environment and take action based on that information.
  • 8. A speech capturing device: It consists of a microphone, which converts sound waves to electrical signals, and an analog-to-digital converter, which samples and digitizes the analog signal to obtain data that the computer can understand. A digital signal module or processor: It processes the raw speech signal, for example by converting it to the frequency domain and retaining only the required information. Preprocessed signal storage: The preprocessed speech is stored in memory for the subsequent speech recognition steps. Reference speech patterns: The system contains predefined speech patterns or templates, already stored in memory, to be used as the reference for matching. Pattern matching algorithm: The unknown speech signal is compared with the reference speech patterns to determine the actual words or the pattern of words.
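The pattern matching step can be sketched as a nearest-template lookup. This is only an illustration under stated assumptions: the three-number feature vectors in `REFERENCE_PATTERNS`, the words chosen, and the use of plain Euclidean distance are all hypothetical. Practical recognizers compare much longer feature sequences and use more robust matching methods.

```python
import math

# Hypothetical reference templates: one short feature vector per known word.
REFERENCE_PATTERNS = {
    "yes": [0.9, 0.1, 0.3],
    "no": [0.2, 0.8, 0.5],
    "stop": [0.4, 0.4, 0.9],
}


def distance(a, b):
    """Euclidean distance between two equal-length feature vectors."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def match_word(features, references=REFERENCE_PATTERNS):
    """Return the reference word whose template is closest to the input."""
    return min(references, key=lambda word: distance(features, references[word]))


print(match_word([0.85, 0.2, 0.25]))
print(match_word([0.3, 0.5, 0.85]))
```

The unknown input is never expected to equal a template exactly; the system simply reports whichever stored pattern is the least different.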
  • 9.
  • 10. Speech can be seen as an acoustic waveform, i.e., a signal carrying message information. The microphone converts this acoustic waveform to an analog electrical signal. The analog-to-digital converter turns this analog signal into digital samples by taking precise measurements of the wave at discrete intervals. The digitized signal is a stream of samples taken 16,000 times per second; in this raw form it is not suitable for speech recognition because patterns cannot easily be located in it. To extract the actual information, the signal is converted from the time domain to the frequency domain. The digital signal processor does this using the Fast Fourier Transform (FFT). The digital signal is analyzed in segments of 1/100th of a second, and a frequency spectrum is computed for each segment. In other words, the digitized signal is split into short segments, each described by its frequency amplitudes. Each segment, or frequency graph, represents a different sound made by the speaker. The computer then matches the unknown segments against the stored phonemes of the particular language.
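The framing and frequency-domain conversion described above can be sketched in plain Python. The sampling rate (16,000 samples per second) and the 1/100 s segment length come from the slide; the zero-padding of each 160-sample segment to 256 samples, the function names, and the 1,000 Hz test tone are assumptions made so that a simple radix-2 FFT can be used.

```python
import cmath
import math

SAMPLE_RATE = 16000                # samples per second, as stated in the slide
FRAME_LEN = SAMPLE_RATE // 100     # 1/100 s per segment -> 160 samples
FFT_SIZE = 256                     # assumption: zero-pad to a power of two


def fft(x):
    """Recursive radix-2 Cooley-Tukey FFT; len(x) must be a power of two."""
    n = len(x)
    if n == 1:
        return [complex(x[0])]
    even = fft(x[0::2])
    odd = fft(x[1::2])
    out = [0j] * n
    for k in range(n // 2):
        t = cmath.exp(-2j * math.pi * k / n) * odd[k]
        out[k] = even[k] + t
        out[k + n // 2] = even[k] - t
    return out


def frame_spectra(samples):
    """Split samples into 10 ms segments; return one magnitude spectrum each."""
    spectra = []
    for start in range(0, len(samples) - FRAME_LEN + 1, FRAME_LEN):
        frame = samples[start:start + FRAME_LEN]
        padded = frame + [0.0] * (FFT_SIZE - FRAME_LEN)
        bins = fft(padded)
        # Keep only the first half: for real input the second half mirrors it.
        spectra.append([abs(b) for b in bins[:FFT_SIZE // 2]])
    return spectra


def peak_frequency(spectrum):
    """Frequency in Hz of the strongest bin in one magnitude spectrum."""
    k = max(range(len(spectrum)), key=spectrum.__getitem__)
    return k * SAMPLE_RATE / FFT_SIZE


# Hypothetical input: 0.1 s of a pure 1,000 Hz tone instead of real speech.
tone = [math.sin(2 * math.pi * 1000 * n / SAMPLE_RATE)
        for n in range(SAMPLE_RATE // 10)]
spectra = frame_spectra(tone)
print(len(spectra), peak_frequency(spectra[0]))
```

Each entry of `spectra` plays the role of one "frequency graph": a list of amplitudes per frequency bin for a single 10 ms segment, which a later stage would compare against stored sound patterns.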
  • 11. The speech recognition system depends on the following factors: Isolated words: There needs to be a pause between consecutive words, because continuously spoken words can overlap, making it difficult for the system to tell where a word starts or ends. Single speaker: Several speakers giving speech input at the same time can cause overlapping signals and interruptions. Most speech recognition systems in use are speaker-dependent systems. Vocabulary size: Languages with a large vocabulary are more difficult to handle in pattern matching than those with a small vocabulary, because the chance of ambiguous words is lower in the latter.
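The role of pauses between isolated words can be illustrated with a simple energy-based splitter. This is a sketch under assumptions: the frame length of 4 samples, the energy threshold of 0.1, the function name `split_on_silence`, and the hand-made `signal` are all hypothetical and far smaller than anything used on real audio.

```python
def split_on_silence(samples, frame_len=4, threshold=0.1):
    """Return (start, end) sample index pairs for runs of non-silent frames."""
    segments = []
    start = None  # index where the current word began, if we are inside one
    for i in range(0, len(samples), frame_len):
        frame = samples[i:i + frame_len]
        energy = sum(s * s for s in frame) / len(frame)  # mean squared amplitude
        if energy >= threshold and start is None:
            start = i                      # silence -> sound: a word begins
        elif energy < threshold and start is not None:
            segments.append((start, i))    # sound -> silence: the word ends
            start = None
    if start is not None:
        segments.append((start, len(samples)))
    return segments


# Hypothetical signal: silence, a "word", a pause, a shorter "word", silence.
signal = ([0.0] * 4
          + [0.8, -0.7, 0.9, -0.8] * 2
          + [0.0] * 8
          + [0.6, -0.6, 0.7, -0.5]
          + [0.0] * 4)

print(split_on_silence(signal))
```

When the pause between two words disappears, the two runs of loud frames merge into one segment, which is the boundary problem the slide describes for continuous speech.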
  • 12. LEXICON MODEL, ACOUSTIC MODEL, & LANGUAGE MODEL
  • 13. Lexicon: The lexicon is the primary step in decoding speech. Creating a comprehensive lexical design for an automatic speech recognition (ASR) system involves including the fundamental elements of both spoken language (the audio input the system receives) and written vocabulary (the text the system sends out). Acoustic model: Acoustic modeling involves separating an audio signal into small time frames. Acoustic models analyze each frame and provide the probability of different phonemes occurring in that section of audio. Simply put, acoustic models aim to predict which sound is spoken in each frame. Language model: Today's ASR systems employ natural language processing (NLP) to help computers understand the context of what a speaker says. Language models recognize the intent of spoken phrases and use that knowledge to compose word sequences. They operate in a similar way to acoustic models, using neural networks trained on text data to estimate the probability of which word comes next in a phrase. Together, the lexicon, acoustic model, and language model enable ASR systems to make close-to-accurate predictions about the words and sentences in an audio input.
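The idea of estimating which word comes next can be sketched with a bigram count model. Note that this deliberately swaps the neural network mentioned in the slide for simple word-pair counting, which is easier to follow; the tiny `CORPUS` and the function names are hypothetical.

```python
from collections import Counter, defaultdict


def train_bigram(sentences):
    """Count, for every word, which words follow it in the training text."""
    counts = defaultdict(Counter)
    for sentence in sentences:
        words = sentence.lower().split()
        for prev, nxt in zip(words, words[1:]):
            counts[prev][nxt] += 1
    return counts


def next_word_probs(counts, prev):
    """Return {next_word: probability} for the word following prev."""
    followers = counts.get(prev, Counter())
    total = sum(followers.values())
    if total == 0:
        return {}
    return {word: n / total for word, n in followers.items()}


# Hypothetical training text, far smaller than any real corpus.
CORPUS = [
    "recognize speech",
    "recognize speech today",
    "recognize speakers",
    "wreck a nice beach",
]

model = train_bigram(CORPUS)
print(next_word_probs(model, "recognize"))
```

In a recognizer, probabilities like these would be combined with the acoustic model's per-frame sound probabilities, so that a word sequence that is both acoustically plausible and likely as text is preferred.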
  • 14. In the simplest terms, speech recognition occurs when a computer receives audio input from a person speaking, processes that input by breaking down the various components of speech, and then transcribes that speech to text. Some ASR systems are speaker-dependent and must be trained to recognize particular words and speech patterns. These are essentially the voice-recognition systems used in your smart devices: before the ASR-powered voice assistant starts working, you need to say specific words and phrases into your phone so that it can learn to identify your voice. Other ASR systems are speaker-independent. These systems do not require any training by the user and can recognize spoken words regardless of the speaker. Speaker-independent systems are practical solutions for business applications like interactive voice response (IVR).
  • 15. From speech recognition's mid-twentieth-century origins to its multi-industry applications today, the use cases for ASR technology are far-reaching. ASR has made it out of the computer science laboratories and is now integrated into our everyday lives. • Voice assistants: According to a 2020 survey conducted by NPR and Edison Research, 63% of respondents said they use a voice assistant. The ability to use voice commands for tasks like opening mobile apps, sending a text message, or searching the web affords users a greater level of convenience. • Language learning: For people engaged in self-guided language study, apps using speech-recognition tools bring them a step closer to a comprehensive learning experience. Apps like Busuu and Babbel use ASR technology to help students practice their pronunciation and accents in their target languages. Using these apps, a student speaks into their phone or computer in their target language. The ASR software listens to that voice input and analyzes it; if it matches what the system identifies as the correct pronunciation, it informs the learner. If the student's voice input does not match what the ASR system knows to be correct, it informs the student of the mispronunciation as well. • Transcription services: One of the first widespread use cases of ASR was the simple transcription of speech. Speech-to-text services offer a level of convenience in many contexts and open the door to improved audio and video accessibility. Health care practitioners use dictation products like Dragon NaturallySpeaking to take hands-free notes while attending to patients. ASR captioning also allows real-time transcription of live video, which lets a broader audience access the media. • Call centers: ASR is crucial for automating processes in businesses with extensive customer support demands. With an influx of callers, companies need a way to efficiently handle a vast amount of customer communication. ASR technology is one of the main mechanisms involved in smart IVR, a system that automates routine inbound communications as well as
  • 16. • Imprecision and false interpretations • Time and lack of efficiency • Accents and local differences • Background noise and loud environments • Privacy and data security
  • 17. Aravali College of Engineering And Management, Jasana, Tigoan Road, Neharpar, Faridabad, Delhi NCR. Toll Free Number: 91-8527538785. Website: www.acem.edu.in