A Speech Recognizer Application


Zina Saadi
Anila Chowdhury
College of Computer
Science Tufts University
   Introduction & Inspiration
   System Flow & Design
   Implementation
   Challenges
   Branding
   Usability Testi...
 A Speech recognizer application that can help callers to
  find a restaurant in New York City.
 Functionalities:
     ...
   Regular and non-regular diner people can use it
   Access it anytime by a phone call
   Select top Zagat rated resta...
System Flow

             Narrow
             Options




             Search
Main Menu                 Top List
         ...
User Scenario
• Scenario 1
   System: What is the name your restaurant?
   Caller: Heidelberg
   System: Heidelberg is a G...
Recording Prompts
• Voice Talents:
   –   Katarina (Italian)
   –   Maria (Spanish)
   –   Jennifer (US English)
   –   Ju...
Cuisines Distribution
                                  Italian

                                  American
              ...
Steakhouses     Mexican and Tex Mex Asian              Kosher       Noodle Shops Swiss
                                   ...
"The artist is nothing without
 the gift, but the gift is
 nothing without work."



                                 10
Data Extraction                 Flow-of-Control
(XML Parsing)                   Management
                               ...
State
            • Grouped them by functionality
 Tables
            Result: CGI Files
            • Mappings: Cuisines  ...
• Unicode Conversion
  Xml      • Used Python
        Result: Normalized .xml File

         • Extraction => Normalization...
Restaurants Names
• <PVAL>Cafe Español</PVAL>
      =>     <PVAL>Cafe Espanol</PVAL>
     string = re.sub(u"u00F1", u"n", ...
Description
• "ok, not olé." or "Provençal country home"
• Celebrating its 40th anniversary, André and Rita
  Jammet's Mid...
• <PVAL>14 Wall Street Restaurant, The</PVAL>
The 14 Wall Street Restaurant



• <PVAL>Cloister Cafe, The</PVAL>
=> The C...
Punctuation Removal
• <PVAL>Salute!</PVAL>
• <PVAL>Pao!</PVAL>

Number Conversion (using ‘lingusitics’ ruby gem)
• <PVAL>1...
Usability Testing
• Target Users
   – all kinds of web & phone users (laggard to advanced)
• What we tested
   – Basic fun...
Test Plan
•   Wizard of Oz
     – Subject pretended to be the caller
     – I pretended to be the System
•   Pilot Testing...
Usability Testing Results

•   This application has great potential for real life use -- I often find myself
    trying to...
Marketing & Future Work
• Suggest new restaurants
• Encourage people to use zagat.com and rate more
  restaurants
• Larger...
Teachers of the course:
    Blade Kotelly & Chris Roby




                                 22
teşekkürler
  дякую   ཐུགས་རྗེ་ཆེ་།
                köszi
                       grazie ‫شكرا بزاف‬
спасибо
          多謝  ...
Upcoming SlideShare
Loading in...5
×

Zagat Restaurant Finder

2,080

Published on

Zagat Restaurant Finder is a speech recognizer application that allows callers to find the right quality dining place in New York City regions. The application browses, parses, retrieves and lists restaurant information from “Zagat Survey”1 at http://www.zagat.com/, which is the world’s leading provider of consumer survey-based information on where to Eat, Drink, Stay and Play worldwide. Zagat Restaurant Finder contains restaurants from over 86 different cuisines, with Italian, and American as the lead cuisines. The figure below illustrates the cuisines distribution provided by the application.

0 Comments
0 Likes
Statistics
Notes
  • Be the first to comment

  • Be the first to like this

No Downloads
Views
Total Views
2,080
On Slideshare
0
From Embeds
0
Number of Embeds
1
Actions
Shares
0
Downloads
18
Comments
0
Likes
0
Embeds 0
No embeds

No notes for slide

Zagat Restaurant Finder

  1. 1. A Speech Recognizer Application Zina Saadi Anila Chowdhury College of Computer Science Tufts University
  2. 2.  Introduction & Inspiration  System Flow & Design  Implementation  Challenges  Branding  Usability Testing  Marketing & Future Work  Questions/ Comments 2
  3. 3.  A Speech recognizer application that can help callers to find a restaurant in New York City.  Functionalities:  Find a restaurant by Name  Search for restaurants by Cuisine  Search for restaurants by Neighborhood  Top Zagat Rated List  Narrow down a list by  Food Ratings  Décor Ratings  Service Ratings  Price Ratings 3
  4. 4.  Regular and non-regular diner people can use it  Access it anytime by a phone call  Select top Zagat rated restaurants  Find a restaurant by name- get address, phone number  Find by Cuisine or Neighborhood  Helps you to narrow down your results based on rating  Get connected to the restaurant  Anila and Zina conducted a survey and about 87% people said “yes” to using a system to find restaurants. 4
  5. 5. System Flow Narrow Options Search Main Menu Top List Results Restaurant Information 5
  6. 6. User Scenario • Scenario 1 System: What is the name your restaurant? Caller: Heidelberg System: Heidelberg is a German restaurant located at East 80s neighborhood. • Scenario 2 Caller: French System: I have found 203 restaurants that matched your search criteria. Caller: Narrow Down (selected); Price Rating (selected) System: In the Price rating, I've got restaurants that have price range between 0 (on the low end) and 86 (on the high end.) What's the maximum Price you want to pay? Caller: 16 System: I have found 16 restaurants that matched your search criteria. 6
  7. 7. Recording Prompts • Voice Talents: – Katarina (Italian) – Maria (Spanish) – Jennifer (US English) – Julie (French) – Melissa (US English) – Anila (US English) • Music (selected by Zina) – Mozart 7
  8. 8. Cuisines Distribution Italian American 17% French Bistro 46% 16% Japanese French Chinese 6% 4% Eclectic and 3% 4% 4% International Others 8
  9. 9. Steakhouses Mexican and Tex Mex Asian Kosher Noodle Shops Swiss Coffeehouses Mediterranean Brasserie Dim Sum and Desserts Tapas Tunisian South Pizza Spanish Vietnamese American Bakeries Ukrainian Coffee Shops and Delis and Sandwich Indian Diners Shops Brazilian Caribbean Armenian Thai Greek Vegetarian Jewish Health Food Australian Middle Hamburgers Southern and Soul Continental Korean Eastern Burmese Bar B Q German Scandinavian Polish English Cambodian Cuban Argentinean Soups Portuguese Ethiopian Colombian Turkish Moroccan Californian Afghan Indonesian Dominican Nuevo Latino Belgian Hot Dogs Jamaican Israeli Eritrean Southwestern Russian Malaysian Austrian Persian Filipino Irish Peruvian Egyptian Puerto Rican Lebanese 9
  10. 10. "The artist is nothing without the gift, but the gift is nothing without work." 10
  11. 11. Data Extraction Flow-of-Control (XML Parsing) Management (cgi, vxml forms) Data Storage Merging Ruby Functions User Interface Data Extraction (Phone-Call <=> Interface Bevocal) Back-End Front-End 11
  12. 12. State • Grouped them by functionality Tables Result: CGI Files • Mappings: Cuisines Background sounds VoiceXML Language Voices Result: generated VXML via CGI • Updated/Added Functions Interface Result: <restaurantDataInterface.rb> 12
  13. 13. • Unicode Conversion Xml • Used Python Result: Normalized .xml File • Extraction => Normalization Data =>Segmentation Result: Created .pstore File • Data Encapsulation & Normalization Interface • Methods/Functions Definitions Result: <restaurantDataInterface.rb> 13
  14. 14. Restaurants Names • <PVAL>Cafe Español</PVAL> => <PVAL>Cafe Espanol</PVAL> string = re.sub(u"u00F1", u"n", string) • <PVAL>Mamá Mexico</PVAL> <PVAL>Mama Mexico</PVAL> re.sub(u"(u00E0|u00E1|u00E4)", u"a", string) 14
  15. 15. Description • "ok, not olé." or "Provençal country home" • Celebrating its 40th anniversary, André and Rita Jammet's Midtown French classic” • re.sub(u"(u00E8|u00E9|u00EA|u00EB)", u"e", string) • re.sub(u"u00E7", u"c", string) 15
  16. 16. • <PVAL>14 Wall Street Restaurant, The</PVAL> The 14 Wall Street Restaurant • <PVAL>Cloister Cafe, The</PVAL> => The Cloister Cafe 16
  17. 17. Punctuation Removal • <PVAL>Salute!</PVAL> • <PVAL>Pao!</PVAL> Number Conversion (using ‘lingusitics’ ruby gem) • <PVAL>101</PVAL> 17
  18. 18. Usability Testing • Target Users – all kinds of web & phone users (laggard to advanced) • What we tested – Basic functionalities – How well they understood the product – How easy was it to learn • Test Subjects – Friends and families • Challenges – Ask minimum numbers of questions and collect the right information – How clear and intuitive each tasks are. 18
  19. 19. Test Plan • Wizard of Oz – Subject pretended to be the caller – I pretended to be the System • Pilot Testing – Small prototype – Smaller set of Restaurant Data – Less functionalities • Final Testing – Full implementation of all proposed design – Full dataset – All functionalities included (narrow, top list) – Check values and boundaries for input and prompts – Recorded Prompts + Background music 19
  20. 20. Usability Testing Results • This application has great potential for real life use -- I often find myself trying to locate restaurants when I'm outside, and lack of access to a computer makes things difficult. – Farial Anam, Senior Associate, LECG, LLC • It's really neat (and it works great too). Since I am French, I focused on the 'French' cuisine, and it properly told me 10 restaurants with the proper pronunciation ("native quality" French)- I picked one ("Bistro les Amis") by saying the name (with my native, South-West, French accent) and it recognized it.... Very neat. – Eric Montagut, Senior Principal Engineer, Basis Tech 20
  21. 21. Marketing & Future Work • Suggest new restaurants • Encourage people to use zagat.com and rate more restaurants • Larger database • Add new cities & restaurants • Other languages • Add features (buffet, romantic, music etc) 21
  22. 22. Teachers of the course: Blade Kotelly & Chris Roby 22
  23. 23. teşekkürler дякую ཐུགས་རྗེ་ཆེ་། köszi grazie ‫شكرا بزاف‬ спасибо 多謝 ‫متشكرم‬ どうも Merci आभारी हँु ‫תודה‬ obrigadinho terima kasih Cảm ơn rất nhiều ‫تشكر‬ ‫متش ّرين‬ ‫ك‬ danke ‫مرسي‬ 非常感謝 धन्यवाद 唔該 ευχαριστώ πολυ Dzięki 23
  1. A particular slide catching your eye?

    Clipping is a handy way to collect important slides you want to go back to later.

×