My Project

1,625 views

Published on

My University Project.

Published in: Technology, Business
0 Comments
1 Like
Statistics
Notes
  • Be the first to comment

No Downloads
Views
Total views
1,625
On SlideShare
0
From Embeds
0
Number of Embeds
4
Actions
Shares
0
Downloads
65
Comments
0
Likes
1
Embeds 0
No embeds

No notes for slide
  • DEPARTMENT OF ECE ROBOTIC CONTROL THROUGH SPEECH
  • ROBOTIC CONTROL THROUGH SPEECH DEPARTMENT OF ECE
  • ROBOTIC CONTROL THROUGH SPEECH DEPARTMENT OF ECE
  • My Project

    1. 1. ROBOTIC CONTROL THROUGH SPEECH
    2. 2. INTRODUCTION <ul><li>This voice recognition project consists of two major components, a speech recognition module and a motorized robot. </li></ul><ul><li>Programmable module allows us to write the programming in Visual DSP++ (Programming applications for the ADSP 2181 Architecture). </li></ul><ul><li>The motorized robot will consist of two DC motors and will make the robot forward and backward directions. </li></ul>DEPARTMENT OF ECE
    3. 3. PROJECT DESCRIPTION <ul><li>The Speaker Recognition can be classified into two phases. </li></ul><ul><li>1 Training Phase. </li></ul><ul><li>2 Testing Phase. </li></ul>DEPARTMENT OF ECE
    4. 4. Training Phase. <ul><li>In Training Phase ,the frequency components of the given speech signal is extracted. </li></ul><ul><li>Each registered speaker has to provide samples of their speech (given words). </li></ul><ul><li>so that the system an build or train a reference model for that speaker. </li></ul>DEPARTMENT OF ECE
    5. 5. Testing phase <ul><li>In testing phase ,the input speech is matched with stored references models (s) </li></ul><ul><li>Recognition decision is made on the basis of Mel Frequency Cepstrum Coefficients (MFCC) </li></ul><ul><li>The command recognition is observed by the operation of stepper motor & DC motor and the control signals to the DC motor </li></ul>DEPARTMENT OF ECE
    6. 6. ARCHITECTURE OF ADSP 2181 DEPARTMENT OF ECE
    7. 7. FEATURES OF ADSP 2181 PROCESSOR <ul><li>25 ns Instruction Cycle Time from 20 MHz Crystal at 5.0 Volts </li></ul><ul><li>Single-Cycle Instruction Execution </li></ul><ul><li>Multifunction Instructions </li></ul><ul><li>Low Power Dissipation in Idle Mode </li></ul><ul><li>16K Words On-Chip Program Memory RAM </li></ul><ul><li>16K Words On-Chip Data Memory RAM </li></ul><ul><li>Independent ALU, Multiplier/Accumulator, and Barrel Shifter Units </li></ul><ul><li>3-Bus Architecture Allows Dual Operand Fetches in every Instruction Cycle </li></ul>DEPARTMENT OF ECE
    8. 8. ALU and MAC <ul><li>The ALU performs a standard set of arithmetic and logic operations in addition to division primitives. </li></ul><ul><li>  </li></ul><ul><li>The MAC performs single-cycle multiply, multiply/add and multiply/subtract operations. </li></ul>DEPARTMENT OF ECE
    9. 9. SHIFTER <ul><li>The shifter performs logical and arithmetic shifts, normalization, de-normalization, and derive exponent operations. </li></ul><ul><li>The shifter implements numeric format control including multiword floating-point representations. </li></ul>DEPARTMENT OF ECE
    10. 10. SPEECH <ul><li>The input speech is given in the form of nos. like1, 2,3.. </li></ul><ul><li>The frequency range of human voice is 4kHz hence sampling frequency is taken as 8kHz </li></ul><ul><li>In coding only 2000 samples are considered because only 0.25 sec will be taken for one character </li></ul>DEPARTMENT OF ECE
    11. 11. REPRESENTATION OF SPEECH SIGNAL DEPARTMENT OF ECE
    12. 12. Block Diagram <ul><li>Input speech </li></ul><ul><li>via mic ADSP 2181 </li></ul>DEPARTMENT OF ECE DC MOTOR MEL SPECTRUM WINDOWING FFT MEL FREQ WRAP MEL CEPSTRUM CODEC FRAMMING
    13. 13. FRAMING <ul><li>Speech signal is blocked into frames of N samples (n=256) </li></ul><ul><li>Adjacent Frames are separated by M samples (M=100) </li></ul><ul><li>Frame1= 0-256 </li></ul><ul><li>Frame2=100-356 </li></ul><ul><li>Such kind of 18 frames are required for 2000 samples/sec character. </li></ul>DEPARTMENT OF ECE
    14. 14. FRAMING DEPARTMENT OF ECE
    15. 15. Windowing <ul><li>Minimizes signal discontinuity in each frame </li></ul><ul><li>Reduced spectral distortion </li></ul><ul><li>Window signal is obtained by </li></ul><ul><li>Y 1 (n)=x 1 (n)*w(n) ; 0<=n<N-1 </li></ul><ul><li>Where w(n) is Hamming Window and is given by </li></ul><ul><li>w(n)=0.54-0.46Cos(2∏ n/N-1); 0<=n<N-1 </li></ul>DEPARTMENT OF ECE
    16. 16. Windowing DEPARTMENT OF ECE
    17. 17. Result of Windowing <ul><li>256 values are o/p of this process </li></ul><ul><li>These values are given as an </li></ul><ul><li>input for FFT. </li></ul><ul><li>Some values of windowing </li></ul><ul><li>for 1 kHz is shown </li></ul>0x0000 0x0826 0x0BE6 0x08B7 0x000F 0xF6C7 0xF26C 0xF5FC 0xFFE8 0x0AA9 0x0FC7 DEPARTMENT OF ECE
    18. 18. Fast Fourier Transform <ul><li>Converts time domain signal into frequency domain signal </li></ul><ul><li>Power spectrum is obtained with real and imaginary part of the frequency domain of the speech signal. </li></ul>DEPARTMENT OF ECE
    19. 19. Wrapping <ul><li>A subjective pitch for each frequency is computed using Mel Scale </li></ul><ul><li>Mel frequency scale is given by mel(f)=2595*log 10 (1+f/700) </li></ul>DEPARTMENT OF ECE
    20. 20. Mel Frequency Coefficients DEPARTMENT OF ECE
    21. 21. MFCC <ul><li>It is Mel Frequency Cepstrum Coefficient </li></ul><ul><li>It consists of various frequency coefficient components. </li></ul><ul><li>It contains: </li></ul><ul><li>Mel Spectrum (frequency domain) </li></ul><ul><li>Mel Cepstrum (time domain) </li></ul>DEPARTMENT OF ECE
    22. 22. SPECTRUM <ul><li>Samples are convoluted with mel filter bank to obtain mel frequency spectrum. </li></ul><ul><li>Mel frequency spectrum is given by </li></ul><ul><li>s(n)=y(n)*f(n) </li></ul><ul><li>s(n)------>mel frequency spectrum </li></ul><ul><li>y(n)------>samples </li></ul><ul><li>f(n)------->filter coefficients </li></ul>DEPARTMENT OF ECE
    23. 23. Inverse Discrete Cosine Transformation <ul><li>Mel frequency power spectrum is in frequency domain function </li></ul><ul><li>In order to obtain a time domain function the signal undergoes IDCT </li></ul><ul><li>Now mel frequency spectrum is converted into mel frequency cepstrum. </li></ul>DEPARTMENT OF ECE
    24. 24. CEPSTRUM <ul><li>MFCC real numbers and are convoluted to time domain using IDCT </li></ul><ul><li>The time domain coefficients are called mel frequency cepstrum coefficients.. </li></ul><ul><li>MFCC is given by </li></ul><ul><li>c(n)=sum of log (Sk * cos (n(k-.5)*pi/k) </li></ul>DEPARTMENT OF ECE
    25. 25. LEAST MEAN SQUARE ALGORITHM (LMS) <ul><li>This algorithm is used to find out the the minimum deviation between certain values. </li></ul><ul><li>During testing phase the input speech is compared with the stored 4 values. </li></ul><ul><li>The least deviated value is sent. </li></ul>DEPARTMENT OF ECE
    26. 26. INTERFACING PC WITH KIT <ul><ul><ul><ul><ul><li> RS-232 SERIAL CABLE </li></ul></ul></ul></ul></ul>DEPARTMENT OF ECE PC DSP PROCESSOR
    27. 27. DSP TO DC MOTOR DEPARTMENT OF ECE
    28. 28. CIRCUIT DIAGRAM DEPARTMENT OF ECE
    29. 29. HARDWARE DETAILS <ul><li>The latched output from the latch IC is given to the relays via resistor and transistor. </li></ul><ul><li>According to the predefined input, the coil gets energized and relay is switched to ON position. </li></ul><ul><li>Here we use SPDT relay </li></ul><ul><li>It causes a current flow in the DC Motor. </li></ul>DEPARTMENT OF ECE
    30. 30. Details of dc motor <ul><li>Speed of the motor - 300 rpm </li></ul><ul><li>Current – 750mA </li></ul><ul><li>Voltage – 7.5V </li></ul>DEPARTMENT OF ECE
    31. 31. Advantages <ul><li>It is SPEECH recognizable </li></ul><ul><li>Processing time is less </li></ul><ul><li>Easy and efficient </li></ul><ul><li>Useful for physically disable people </li></ul><ul><li>Less cost </li></ul><ul><li>Maintenance is easy </li></ul>DEPARTMENT OF ECE
    32. 32. Limitations <ul><li>Mismatching of frequency may affect the compatibility with the hardware. </li></ul><ul><li>Each and everyone voice should be trained before testing it. </li></ul>DEPARTMENT OF ECE
    33. 33. APPLICATIONS <ul><li>Physically and visually impaired friendly device where only the speech signals of the user is required. </li></ul><ul><li>In cases of acute problems like system crashes and all, this method can be utilized for emergency. </li></ul>DEPARTMENT OF ECE
    34. 34. CONCLUSION and FUTURE MODIFICATIONS <ul><li>Speech recognition is still an active research area. </li></ul><ul><li>Speech Recognition brings in the communication between human and machine. </li></ul><ul><li>This project recognizes the given speech signal and the word is displayed on the PC. </li></ul>DEPARTMENT OF ECE
    35. 35. <ul><li>THANK YOU </li></ul>DEPARTMENT OF ECE

    ×