SlideShare a Scribd company logo
Testing Intelligent Personal Assistants Joe Buzzanga
Page 1
Intelligent Personal Assistants: Testing Part 1
This post is the first in a series that will evaluate the performance of the Intelligent Personal
Assistants (IPA) from Apple (Siri), Google (Google Now) and Microsoft (Cortana). All tests were
conducted on an iPhone 6s running iOS 9.2.1. Testing was done on January 23, 2016.
This initial test looks at how these systems handle a conversational task. We are looking for the
ability to go beyond answering a factual question to actually engaging in a simple two level
dialog with a user. The test consists of two questions spoken into the phone. First we ask: “Who
wrote For Whom the Bell Tolls”?. If the IPA answers “Ernest Hemingway” we follow up with
another question “When did he die”?
This is, of course, a simple Q/A dialog that any person would be able to handle, provided he or
she knew the answers. The tricky part is knowing that the pronoun “he” refers to “Ernest
Hemingway”. This is simple for a human but difficult for a computer. The ability to make this
cognitive linguistic connection is technically called “anaphora resolution” . Our particular
example is known as intersentential pronominal anaphora resolution. It involves connecting a
pronoun (“he”) to an antecedent (“Ernest Hemingway”) occurring in a different sentence.
We’ll refer to this exercise as a two level dialog.
Level 1: Who wrote “For Whom the Bell Tolls”
Answer: Ernest Hemingway
Level 2: When did he die?
Answer: July 22, 1961
Summary
Siri, Google Now, and Cortana all passed the Level 1 test easily. They “understood” the question
and answered correctly. Siri and Google Now responded with complete sentences. Cortana
simply responded with the name. Siri had the most “personality” in its response, while Google
Now and Cortana were devoid of any attempt to seem human.
Siri and Cortana both failed on the Level 2 test. They were completely unable to understand
how to handle a pronoun. In technical terms, they could not perform a successful anaphora
resolution.
Who wrote For Whom the Bell TollsErnest Hemingway When did he die
Testing Intelligent Personal Assistants Joe Buzzanga
Page 2
Google Now, on the other hand, was completely adept at answering not just the Level 2
question but a series of follow up questions, all referring to Hemingway via pronouns. On this
type of conversational task Google appears to be far ahead of Apple and Microsoft.
Test Results—Siri
Level 1—Who wrote “For Whom the Bell Tolls?”
Grade: Passed
Siri’s response was “Hmm let me have a look. It looks like the author of “For Whom the Bell
Tolls” was Ernest Hemingway”. The screen displayed a rich set of facts about Ernest Hemingway
(Figure 1)
Figure 1: Siri Level 1 “Who Wrote For Whom the Bell Tolls?”
The input interpretation is presented as well and shows Siri is quite accurate in identifying “For
Whom the Bell Tolls” as a book. Similarly, it recognizes that Ernest Hemingway is an author.
Level 2—“When did he die?”
Grade: Failed
Siri was utterly lost in trying to answer the follow up question. It responded with the
nonsensical statement “Here’s what I found on the web for When did For Whom the Bell Tolls
die” (Figure 2).
Testing Intelligent Personal Assistants Joe Buzzanga
Page 3
Figure 2: Siri Level 2 “When Did He Die?”
The answer here shows that Siri cannot connect “he” to “Ernest Hemingway”, instead resolving
it to the book title. Perhaps more disappointing is that Siri doesn’t recognize that death is a
property of humans and other living organisms and cannot logically apply to book titles.
Test Results—Google Now
Level 1—Who wrote “For Whom the Bell Tolls?”
Grade: Passed
Google Now responded directly: “Ernest Hemingway wrote For Whom the Bell Tolls”. Unlike
Siri, Google Now is notably lacking in playfulness or personality. But that is a matter of taste and
preference. Its answer was correct.
Testing Intelligent Personal Assistants Joe Buzzanga
Page 4
Figure 3: Google Now Level 1 "Who Wrote For Whom the Bell Tolls?"
Level 2—“When did he die?”
Grade: Passed
Google Now answered correctly: “He died on July 22, 1961”
Testing Intelligent Personal Assistants Joe Buzzanga
Page 5
Figure 4: Google Now Level 2 "When Did He Die"?
We posed follow up questions to see how deep Google could go. The answer is, surprisingly
deep. Here are our follow up questions:
Level 3—“How did he die”?
Google Now: “The cause of death of Ernest Hemingway was suicide”
We went even further and in each case, Google Now responded correctly. We omit the answers
here, but they were correct and were conveyed in complete English sentences.
Level 4—Where did he die?
Level 5—Where was he born?
Level 6—What was his first book?
It is striking to fire these questions at Google Now and receive correct spoken responses. It
almost feels like you are successfully interrogating a human. Once you pose the initial question
and name Ernest Hemingway it seems that you can follow up with an indefinite number of
questions just using a pronoun. Google Now’s deep learning technology “remembers” that the
pronouns continue to refer to Ernest Hemingway.
Testing Intelligent Personal Assistants Joe Buzzanga
Page 6
Test Results—Cortana
Level 1—Who wrote “For Whom the Bell Tolls?”
Grade: Passed
Cortana answered simply “Ernest Hemingway”. It did not respond with a complete sentence
and felt much more unpolished that both Siri and Google Now. Its screen display was notably
lacking in supplementary material.
Figure 5: Cortana Level1 "Who Wrote For Whom the Bell Tolls?"
Level 2—“When did he die?”
Grade: Failed
Cortana was unable to grasp the question and didn’t even attempt a verbal response. It
displayed a web page, apparently selected by literally matching the query phrase “did he die”
to a corresponding text snippet.
Testing Intelligent Personal Assistants Joe Buzzanga
Page 7
Figure 6: Cortana Level2 "When Did He Die?"

More Related Content

More from Joe Buzzanga

U.S. Consumer Search Preferences Q1 2017
U.S. Consumer Search Preferences Q1 2017U.S. Consumer Search Preferences Q1 2017
U.S. Consumer Search Preferences Q1 2017
Joe Buzzanga
 
Google decode q3 toc
Google decode q3 tocGoogle decode q3 toc
Google decode q3 toc
Joe Buzzanga
 
Is Google Evil 3.0
Is Google Evil 3.0Is Google Evil 3.0
Is Google Evil 3.0Joe Buzzanga
 
Technology Intelligence for R&D
Technology Intelligence for R&DTechnology Intelligence for R&D
Technology Intelligence for R&D
Joe Buzzanga
 
Building Network Elements Using Intel Network Processors and ATCA
Building Network Elements Using Intel Network Processors and ATCABuilding Network Elements Using Intel Network Processors and ATCA
Building Network Elements Using Intel Network Processors and ATCA
Joe Buzzanga
 
London Online 2008
London Online 2008London Online 2008
London Online 2008
Joe Buzzanga
 

More from Joe Buzzanga (6)

U.S. Consumer Search Preferences Q1 2017
U.S. Consumer Search Preferences Q1 2017U.S. Consumer Search Preferences Q1 2017
U.S. Consumer Search Preferences Q1 2017
 
Google decode q3 toc
Google decode q3 tocGoogle decode q3 toc
Google decode q3 toc
 
Is Google Evil 3.0
Is Google Evil 3.0Is Google Evil 3.0
Is Google Evil 3.0
 
Technology Intelligence for R&D
Technology Intelligence for R&DTechnology Intelligence for R&D
Technology Intelligence for R&D
 
Building Network Elements Using Intel Network Processors and ATCA
Building Network Elements Using Intel Network Processors and ATCABuilding Network Elements Using Intel Network Processors and ATCA
Building Network Elements Using Intel Network Processors and ATCA
 
London Online 2008
London Online 2008London Online 2008
London Online 2008
 

Recently uploaded

The+Prospects+of+E-Commerce+in+China.pptx
The+Prospects+of+E-Commerce+in+China.pptxThe+Prospects+of+E-Commerce+in+China.pptx
The+Prospects+of+E-Commerce+in+China.pptx
laozhuseo02
 
Comptia N+ Standard Networking lesson guide
Comptia N+ Standard Networking lesson guideComptia N+ Standard Networking lesson guide
Comptia N+ Standard Networking lesson guide
GTProductions1
 
1.Wireless Communication System_Wireless communication is a broad term that i...
1.Wireless Communication System_Wireless communication is a broad term that i...1.Wireless Communication System_Wireless communication is a broad term that i...
1.Wireless Communication System_Wireless communication is a broad term that i...
JeyaPerumal1
 
JAVIER LASA-EXPERIENCIA digital 1986-2024.pdf
JAVIER LASA-EXPERIENCIA digital 1986-2024.pdfJAVIER LASA-EXPERIENCIA digital 1986-2024.pdf
JAVIER LASA-EXPERIENCIA digital 1986-2024.pdf
Javier Lasa
 
This 7-second Brain Wave Ritual Attracts Money To You.!
This 7-second Brain Wave Ritual Attracts Money To You.!This 7-second Brain Wave Ritual Attracts Money To You.!
This 7-second Brain Wave Ritual Attracts Money To You.!
nirahealhty
 
BASIC C++ lecture NOTE C++ lecture 3.pptx
BASIC C++ lecture NOTE C++ lecture 3.pptxBASIC C++ lecture NOTE C++ lecture 3.pptx
BASIC C++ lecture NOTE C++ lecture 3.pptx
natyesu
 
一比一原版(SLU毕业证)圣路易斯大学毕业证成绩单专业办理
一比一原版(SLU毕业证)圣路易斯大学毕业证成绩单专业办理一比一原版(SLU毕业证)圣路易斯大学毕业证成绩单专业办理
一比一原版(SLU毕业证)圣路易斯大学毕业证成绩单专业办理
keoku
 
急速办(bedfordhire毕业证书)英国贝德福特大学毕业证成绩单原版一模一样
急速办(bedfordhire毕业证书)英国贝德福特大学毕业证成绩单原版一模一样急速办(bedfordhire毕业证书)英国贝德福特大学毕业证成绩单原版一模一样
急速办(bedfordhire毕业证书)英国贝德福特大学毕业证成绩单原版一模一样
3ipehhoa
 
How to Use Contact Form 7 Like a Pro.pptx
How to Use Contact Form 7 Like a Pro.pptxHow to Use Contact Form 7 Like a Pro.pptx
How to Use Contact Form 7 Like a Pro.pptx
Gal Baras
 
Multi-cluster Kubernetes Networking- Patterns, Projects and Guidelines
Multi-cluster Kubernetes Networking- Patterns, Projects and GuidelinesMulti-cluster Kubernetes Networking- Patterns, Projects and Guidelines
Multi-cluster Kubernetes Networking- Patterns, Projects and Guidelines
Sanjeev Rampal
 
Internet-Security-Safeguarding-Your-Digital-World (1).pptx
Internet-Security-Safeguarding-Your-Digital-World (1).pptxInternet-Security-Safeguarding-Your-Digital-World (1).pptx
Internet-Security-Safeguarding-Your-Digital-World (1).pptx
VivekSinghShekhawat2
 
原版仿制(uob毕业证书)英国伯明翰大学毕业证本科学历证书原版一模一样
原版仿制(uob毕业证书)英国伯明翰大学毕业证本科学历证书原版一模一样原版仿制(uob毕业证书)英国伯明翰大学毕业证本科学历证书原版一模一样
原版仿制(uob毕业证书)英国伯明翰大学毕业证本科学历证书原版一模一样
3ipehhoa
 
1比1复刻(bath毕业证书)英国巴斯大学毕业证学位证原版一模一样
1比1复刻(bath毕业证书)英国巴斯大学毕业证学位证原版一模一样1比1复刻(bath毕业证书)英国巴斯大学毕业证学位证原版一模一样
1比1复刻(bath毕业证书)英国巴斯大学毕业证学位证原版一模一样
3ipehhoa
 
APNIC Foundation, presented by Ellisha Heppner at the PNG DNS Forum 2024
APNIC Foundation, presented by Ellisha Heppner at the PNG DNS Forum 2024APNIC Foundation, presented by Ellisha Heppner at the PNG DNS Forum 2024
APNIC Foundation, presented by Ellisha Heppner at the PNG DNS Forum 2024
APNIC
 
Bridging the Digital Gap Brad Spiegel Macon, GA Initiative.pptx
Bridging the Digital Gap Brad Spiegel Macon, GA Initiative.pptxBridging the Digital Gap Brad Spiegel Macon, GA Initiative.pptx
Bridging the Digital Gap Brad Spiegel Macon, GA Initiative.pptx
Brad Spiegel Macon GA
 
Latest trends in computer networking.pptx
Latest trends in computer networking.pptxLatest trends in computer networking.pptx
Latest trends in computer networking.pptx
JungkooksNonexistent
 
一比一原版(LBS毕业证)伦敦商学院毕业证成绩单专业办理
一比一原版(LBS毕业证)伦敦商学院毕业证成绩单专业办理一比一原版(LBS毕业证)伦敦商学院毕业证成绩单专业办理
一比一原版(LBS毕业证)伦敦商学院毕业证成绩单专业办理
eutxy
 
History+of+E-commerce+Development+in+China-www.cfye-commerce.shop
History+of+E-commerce+Development+in+China-www.cfye-commerce.shopHistory+of+E-commerce+Development+in+China-www.cfye-commerce.shop
History+of+E-commerce+Development+in+China-www.cfye-commerce.shop
laozhuseo02
 
test test test test testtest test testtest test testtest test testtest test ...
test test  test test testtest test testtest test testtest test testtest test ...test test  test test testtest test testtest test testtest test testtest test ...
test test test test testtest test testtest test testtest test testtest test ...
Arif0071
 
一比一原版(CSU毕业证)加利福尼亚州立大学毕业证成绩单专业办理
一比一原版(CSU毕业证)加利福尼亚州立大学毕业证成绩单专业办理一比一原版(CSU毕业证)加利福尼亚州立大学毕业证成绩单专业办理
一比一原版(CSU毕业证)加利福尼亚州立大学毕业证成绩单专业办理
ufdana
 

Recently uploaded (20)

The+Prospects+of+E-Commerce+in+China.pptx
The+Prospects+of+E-Commerce+in+China.pptxThe+Prospects+of+E-Commerce+in+China.pptx
The+Prospects+of+E-Commerce+in+China.pptx
 
Comptia N+ Standard Networking lesson guide
Comptia N+ Standard Networking lesson guideComptia N+ Standard Networking lesson guide
Comptia N+ Standard Networking lesson guide
 
1.Wireless Communication System_Wireless communication is a broad term that i...
1.Wireless Communication System_Wireless communication is a broad term that i...1.Wireless Communication System_Wireless communication is a broad term that i...
1.Wireless Communication System_Wireless communication is a broad term that i...
 
JAVIER LASA-EXPERIENCIA digital 1986-2024.pdf
JAVIER LASA-EXPERIENCIA digital 1986-2024.pdfJAVIER LASA-EXPERIENCIA digital 1986-2024.pdf
JAVIER LASA-EXPERIENCIA digital 1986-2024.pdf
 
This 7-second Brain Wave Ritual Attracts Money To You.!
This 7-second Brain Wave Ritual Attracts Money To You.!This 7-second Brain Wave Ritual Attracts Money To You.!
This 7-second Brain Wave Ritual Attracts Money To You.!
 
BASIC C++ lecture NOTE C++ lecture 3.pptx
BASIC C++ lecture NOTE C++ lecture 3.pptxBASIC C++ lecture NOTE C++ lecture 3.pptx
BASIC C++ lecture NOTE C++ lecture 3.pptx
 
一比一原版(SLU毕业证)圣路易斯大学毕业证成绩单专业办理
一比一原版(SLU毕业证)圣路易斯大学毕业证成绩单专业办理一比一原版(SLU毕业证)圣路易斯大学毕业证成绩单专业办理
一比一原版(SLU毕业证)圣路易斯大学毕业证成绩单专业办理
 
急速办(bedfordhire毕业证书)英国贝德福特大学毕业证成绩单原版一模一样
急速办(bedfordhire毕业证书)英国贝德福特大学毕业证成绩单原版一模一样急速办(bedfordhire毕业证书)英国贝德福特大学毕业证成绩单原版一模一样
急速办(bedfordhire毕业证书)英国贝德福特大学毕业证成绩单原版一模一样
 
How to Use Contact Form 7 Like a Pro.pptx
How to Use Contact Form 7 Like a Pro.pptxHow to Use Contact Form 7 Like a Pro.pptx
How to Use Contact Form 7 Like a Pro.pptx
 
Multi-cluster Kubernetes Networking- Patterns, Projects and Guidelines
Multi-cluster Kubernetes Networking- Patterns, Projects and GuidelinesMulti-cluster Kubernetes Networking- Patterns, Projects and Guidelines
Multi-cluster Kubernetes Networking- Patterns, Projects and Guidelines
 
Internet-Security-Safeguarding-Your-Digital-World (1).pptx
Internet-Security-Safeguarding-Your-Digital-World (1).pptxInternet-Security-Safeguarding-Your-Digital-World (1).pptx
Internet-Security-Safeguarding-Your-Digital-World (1).pptx
 
原版仿制(uob毕业证书)英国伯明翰大学毕业证本科学历证书原版一模一样
原版仿制(uob毕业证书)英国伯明翰大学毕业证本科学历证书原版一模一样原版仿制(uob毕业证书)英国伯明翰大学毕业证本科学历证书原版一模一样
原版仿制(uob毕业证书)英国伯明翰大学毕业证本科学历证书原版一模一样
 
1比1复刻(bath毕业证书)英国巴斯大学毕业证学位证原版一模一样
1比1复刻(bath毕业证书)英国巴斯大学毕业证学位证原版一模一样1比1复刻(bath毕业证书)英国巴斯大学毕业证学位证原版一模一样
1比1复刻(bath毕业证书)英国巴斯大学毕业证学位证原版一模一样
 
APNIC Foundation, presented by Ellisha Heppner at the PNG DNS Forum 2024
APNIC Foundation, presented by Ellisha Heppner at the PNG DNS Forum 2024APNIC Foundation, presented by Ellisha Heppner at the PNG DNS Forum 2024
APNIC Foundation, presented by Ellisha Heppner at the PNG DNS Forum 2024
 
Bridging the Digital Gap Brad Spiegel Macon, GA Initiative.pptx
Bridging the Digital Gap Brad Spiegel Macon, GA Initiative.pptxBridging the Digital Gap Brad Spiegel Macon, GA Initiative.pptx
Bridging the Digital Gap Brad Spiegel Macon, GA Initiative.pptx
 
Latest trends in computer networking.pptx
Latest trends in computer networking.pptxLatest trends in computer networking.pptx
Latest trends in computer networking.pptx
 
一比一原版(LBS毕业证)伦敦商学院毕业证成绩单专业办理
一比一原版(LBS毕业证)伦敦商学院毕业证成绩单专业办理一比一原版(LBS毕业证)伦敦商学院毕业证成绩单专业办理
一比一原版(LBS毕业证)伦敦商学院毕业证成绩单专业办理
 
History+of+E-commerce+Development+in+China-www.cfye-commerce.shop
History+of+E-commerce+Development+in+China-www.cfye-commerce.shopHistory+of+E-commerce+Development+in+China-www.cfye-commerce.shop
History+of+E-commerce+Development+in+China-www.cfye-commerce.shop
 
test test test test testtest test testtest test testtest test testtest test ...
test test  test test testtest test testtest test testtest test testtest test ...test test  test test testtest test testtest test testtest test testtest test ...
test test test test testtest test testtest test testtest test testtest test ...
 
一比一原版(CSU毕业证)加利福尼亚州立大学毕业证成绩单专业办理
一比一原版(CSU毕业证)加利福尼亚州立大学毕业证成绩单专业办理一比一原版(CSU毕业证)加利福尼亚州立大学毕业证成绩单专业办理
一比一原版(CSU毕业证)加利福尼亚州立大学毕业证成绩单专业办理
 

Intelligent personal assistant testing 1

  • 1. Testing Intelligent Personal Assistants Joe Buzzanga Page 1 Intelligent Personal Assistants: Testing Part 1 This post is the first in a series that will evaluate the performance of the Intelligent Personal Assistants (IPA) from Apple (Siri), Google (Google Now) and Microsoft (Cortana). All tests were conducted on an iPhone 6s running iOS 9.2.1. Testing was done on January 23, 2016. This initial test looks at how these systems handle a conversational task. We are looking for the ability to go beyond answering a factual question to actually engaging in a simple two level dialog with a user. The test consists of two questions spoken into the phone. First we ask: “Who wrote For Whom the Bell Tolls”?. If the IPA answers “Ernest Hemingway” we follow up with another question “When did he die”? This is, of course, a simple Q/A dialog that any person would be able to handle, provided he or she knew the answers. The tricky part is knowing that the pronoun “he” refers to “Ernest Hemingway”. This is simple for a human but difficult for a computer. The ability to make this cognitive linguistic connection is technically called “anaphora resolution” . Our particular example is known as intersentential pronominal anaphora resolution. It involves connecting a pronoun (“he”) to an antecedent (“Ernest Hemingway”) occurring in a different sentence. We’ll refer to this exercise as a two level dialog. Level 1: Who wrote “For Whom the Bell Tolls” Answer: Ernest Hemingway Level 2: When did he die? Answer: July 22, 1961 Summary Siri, Google Now, and Cortana all passed the Level 1 test easily. They “understood” the question and answered correctly. Siri and Google Now responded with complete sentences. Cortana simply responded with the name. Siri had the most “personality” in its response, while Google Now and Cortana were devoid of any attempt to seem human. Siri and Cortana both failed on the Level 2 test. They were completely unable to understand how to handle a pronoun. In technical terms, they could not perform a successful anaphora resolution. Who wrote For Whom the Bell TollsErnest Hemingway When did he die
  • 2. Testing Intelligent Personal Assistants Joe Buzzanga Page 2 Google Now, on the other hand, was completely adept at answering not just the Level 2 question but a series of follow up questions, all referring to Hemingway via pronouns. On this type of conversational task Google appears to be far ahead of Apple and Microsoft. Test Results—Siri Level 1—Who wrote “For Whom the Bell Tolls?” Grade: Passed Siri’s response was “Hmm let me have a look. It looks like the author of “For Whom the Bell Tolls” was Ernest Hemingway”. The screen displayed a rich set of facts about Ernest Hemingway (Figure 1) Figure 1: Siri Level 1 “Who Wrote For Whom the Bell Tolls?” The input interpretation is presented as well and shows Siri is quite accurate in identifying “For Whom the Bell Tolls” as a book. Similarly, it recognizes that Ernest Hemingway is an author. Level 2—“When did he die?” Grade: Failed Siri was utterly lost in trying to answer the follow up question. It responded with the nonsensical statement “Here’s what I found on the web for When did For Whom the Bell Tolls die” (Figure 2).
  • 3. Testing Intelligent Personal Assistants Joe Buzzanga Page 3 Figure 2: Siri Level 2 “When Did He Die?” The answer here shows that Siri cannot connect “he” to “Ernest Hemingway”, instead resolving it to the book title. Perhaps more disappointing is that Siri doesn’t recognize that death is a property of humans and other living organisms and cannot logically apply to book titles. Test Results—Google Now Level 1—Who wrote “For Whom the Bell Tolls?” Grade: Passed Google Now responded directly: “Ernest Hemingway wrote For Whom the Bell Tolls”. Unlike Siri, Google Now is notably lacking in playfulness or personality. But that is a matter of taste and preference. Its answer was correct.
  • 4. Testing Intelligent Personal Assistants Joe Buzzanga Page 4 Figure 3: Google Now Level 1 "Who Wrote For Whom the Bell Tolls?" Level 2—“When did he die?” Grade: Passed Google Now answered correctly: “He died on July 22, 1961”
  • 5. Testing Intelligent Personal Assistants Joe Buzzanga Page 5 Figure 4: Google Now Level 2 "When Did He Die"? We posed follow up questions to see how deep Google could go. The answer is, surprisingly deep. Here are our follow up questions: Level 3—“How did he die”? Google Now: “The cause of death of Ernest Hemingway was suicide” We went even further and in each case, Google Now responded correctly. We omit the answers here, but they were correct and were conveyed in complete English sentences. Level 4—Where did he die? Level 5—Where was he born? Level 6—What was his first book? It is striking to fire these questions at Google Now and receive correct spoken responses. It almost feels like you are successfully interrogating a human. Once you pose the initial question and name Ernest Hemingway it seems that you can follow up with an indefinite number of questions just using a pronoun. Google Now’s deep learning technology “remembers” that the pronouns continue to refer to Ernest Hemingway.
  • 6. Testing Intelligent Personal Assistants Joe Buzzanga Page 6 Test Results—Cortana Level 1—Who wrote “For Whom the Bell Tolls?” Grade: Passed Cortana answered simply “Ernest Hemingway”. It did not respond with a complete sentence and felt much more unpolished that both Siri and Google Now. Its screen display was notably lacking in supplementary material. Figure 5: Cortana Level1 "Who Wrote For Whom the Bell Tolls?" Level 2—“When did he die?” Grade: Failed Cortana was unable to grasp the question and didn’t even attempt a verbal response. It displayed a web page, apparently selected by literally matching the query phrase “did he die” to a corresponding text snippet.
  • 7. Testing Intelligent Personal Assistants Joe Buzzanga Page 7 Figure 6: Cortana Level2 "When Did He Die?"