Jamie and Debbie
Background
Jamie and Debbie got married. A couple of weeks after the wedding,
they left England and started travelling around the world. Jamie and
Debbie stayed in touch with their family and friends via email. However,
after about 2 years, Debbie's mother started to worry about her.
She contacted the police and said:
Question: What do you think is the problem?
Their opinion
"We think that the emails we receive from Debbie are not written by
Debbie." The family thought that the wording of the emails was not the
kind of wording that their daughter would use.
Question:
Would your family notice if you did not write your own emails?
Why do you think so?
Police investigation
Datasets
The detectives at Nottingham police station collected the emails that the
mother had received from Debbie’s email address.
The emails were divided into two datasets:
• Dataset 1: Emails received from Debbie before marriage
• Dataset 2: Emails received from Debbie after marriage
The detectives also felt that the style of writing was not the same in Datasets 1
and 2. They then collected emails sent by Jamie to his family. This is the third
dataset.
• Dataset 3: Emails received from Jamie
Dataset names
The police now have three datasets.
• Dataset 1: Emails received from Debbie before marriage
• Dataset 2: Emails received from Debbie after marriage
• Dataset 3: Emails received from Jamie
The investigators also used another dataset containing thousands of emails
written by different people.
• Dataset 4: Large collection of emails from many different senders
Question: Decide which datasets are questioned, known or reference.
Datasets
• Questioned (3,000 words)
• Known 1: Debbie (28,000 words)
• Known 2: Jamie (6,000 words)
• Reference (1 million words)
Datasets
If the language features in two datasets are similar, the same person may have
written them.
If the language features in two datasets are NOT similar, the same person may
NOT have written them.
So, we need to discover whether:
• the features in the questioned dataset are similar to those in a known corpus, or
• the features in the questioned dataset are different from those in a known corpus.
Question: How can we do this?
Answers
Deep learning
• This will probably work, but it is difficult to explain in court
Statistical analysis
• This may also work, but it is also difficult to explain in court
Habits of language (idiosyncratic language; in Japanese, 口癖, a person's verbal habits)
• This works and is easy to explain.
Question: How can we systematically identify idiosyncratic language?
Is your language idiosyncratic?
• THIN: Thin, Slim, Slender, Skeletal, Skinny, Emaciated
• FAT: Fat, Plump, Tubby, Chubby, Podgy, Overweight, Obese
• DELICIOUS: Delicious, Tasty, Yummy, Flavorsome, Delectable, Scrumptious
• I (Japanese first-person pronouns): 俺, 僕, 私, 儂, 自分, うち, あたし
Datasets
[Diagram: the four datasets: Questioned, Known 1 (Debbie), Known 2 (Jamie), Reference]
Three common language features are used:
1. word frequency,
2. word frequency of words that are used more than expected, and
3. patterns following such words
Question: How do we identify word frequency?
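One possible answer, sketched in Python: split each dataset into lowercase word tokens and count them. Because the datasets differ greatly in size (from 3,000 words to 1 million words), the sketch also reports each count per 1,000 words so that datasets can be compared. The function names, the simple regular-expression tokenizer and the sample sentence are assumptions made for illustration; they are not the tools or data used in the case.

```python
import re
from collections import Counter

def tokenize(text):
    """Lowercase the text and return its word tokens (letters and apostrophes)."""
    return re.findall(r"[a-z']+", text.lower())

def word_frequencies(text):
    """Return raw counts and counts per 1,000 words for every word in text."""
    tokens = tokenize(text)
    counts = Counter(tokens)
    total = len(tokens)
    if total == 0:
        return counts, {}
    per_thousand = {word: 1000 * n / total for word, n in counts.items()}
    return counts, per_thousand

# Made-up sample text, not taken from the case emails.
sample = "We stayed awhile in the city. The city was busy, so we left after awhile."
counts, per_thousand = word_frequencies(sample)
print(counts.most_common(4))
print(round(per_thousand["awhile"], 1))
```

The same function would be run once per dataset (Questioned, Known 1, Known 2, Reference), giving four frequency lists that can then be compared.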
Question: How do we identify frequency of words that are used more
than expected (keywords)?
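One common way to do this in corpus linguistics is to compare a word's frequency in the dataset being studied with its frequency in the reference dataset, using the log-likelihood statistic. The slides do not say which statistic was used in the case, so the choice of log-likelihood, the threshold of 3.84 and all the counts below are assumptions for illustration.

```python
import math
from collections import Counter

def log_likelihood(a, b, c, d):
    """Keyness of a word seen a times in a study corpus of c words
    and b times in a reference corpus of d words."""
    expected_study = c * (a + b) / (c + d)
    expected_ref = d * (a + b) / (c + d)
    score = 0.0
    if a > 0:
        score += a * math.log(a / expected_study)
    if b > 0:
        score += b * math.log(b / expected_ref)
    return 2 * score

def keywords(study_counts, ref_counts, threshold=3.84):
    """Return (word, score) pairs for words used more than expected, highest score first.
    3.84 is the conventional chi-square cutoff for p < 0.05 with 1 degree of freedom."""
    c = sum(study_counts.values())
    d = sum(ref_counts.values())
    if c == 0 or d == 0:
        return []
    found = []
    for word, a in study_counts.items():
        b = ref_counts.get(word, 0)
        used_more_than_expected = a / c > b / d
        score = log_likelihood(a, b, c, d)
        if used_more_than_expected and score >= threshold:
            found.append((word, score))
    return sorted(found, key=lambda pair: pair[1], reverse=True)

# Made-up counts: a 3,000-word study corpus and a 1,000,000-word reference corpus.
# "other" is a filler entry standing in for all remaining words.
study = Counter({"awhile": 3, "the": 180, "other": 2817})
reference = Counter({"awhile": 5, "the": 60000, "other": 939995})
for word, score in keywords(study, reference):
    print(word, round(score, 1))
```

In this made-up example "the" is not a keyword because it is equally frequent in both corpora, while "awhile" is rare in the reference corpus and therefore stands out.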
Question: How do we identify patterns following keywords?
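A minimal Python sketch of one approach: find every occurrence of a keyword and count the short word sequences that come directly after it. The function names, the two-word window and the sample text are assumptions for illustration, and the simple tokenizer ignores sentence boundaries.

```python
import re
from collections import Counter

def tokenize(text):
    """Lowercase the text and return its word tokens (letters and apostrophes)."""
    return re.findall(r"[a-z']+", text.lower())

def patterns_after(text, keyword, length=2):
    """Count the sequences of up to `length` words that directly follow the keyword.
    The keyword may itself be more than one word, for example "was sat"."""
    tokens = tokenize(text)
    key = tokenize(keyword)
    following = Counter()
    if not key:
        return following
    n = len(key)
    for i in range(len(tokens) - n + 1):
        if tokens[i:i + n] == key:
            tail = tokens[i + n:i + n + length]
            if tail:
                following[" ".join(tail)] += 1
    return following

# Made-up sample text, not taken from the case emails.
sample = "I was sat in the cafe. She was sat in the garden while he was sat at home."
print(patterns_after(sample, "was sat"))
```

Running the same function on each dataset shows whether two datasets continue a keyword in the same way.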
Datasets: Questioned, Known 1 (Debbie), Known 2 (Jamie), Reference
Keywords (with counts):
• Awhile (3), Buisiness (2), was sat (1)
• Awhile (2), Buisiness (4), was sat (1)
Question: If these keywords were discovered, what would you conclude?
Guilty or Innocent?
Jamie Starbuck
Check the result online by searching for “Jamie Starbuck”
System Needed
Expert system
When preparing evidence for this case, the linguist had to:
• Analyze keywords in the Questioned dataset
• Analyze keywords in each Known dataset separately
• Create a table in Excel to compare the keywords
• Identify keywords that occur in both Questioned and Known datasets
An expert system is needed to streamline this process, and remove the chance
of human error.
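The comparison steps listed above could be automated along these lines. This is a sketch only: the function name, the dataset labels "Known A" and "Known B", and all keywords and counts are made up for illustration and do not come from the case.

```python
def shared_keywords(questioned, known_sets):
    """For each known dataset, list the keywords it shares with the questioned dataset.

    questioned: dict mapping keyword -> count in the Questioned dataset
    known_sets: dict mapping dataset name -> dict of keyword -> count
    Returns a dict mapping dataset name -> sorted list of
    (keyword, count in Questioned, count in that known dataset).
    """
    table = {}
    for name, known in known_sets.items():
        common = sorted(set(questioned) & set(known))
        table[name] = [(word, questioned[word], known[word]) for word in common]
    return table

# Made-up keyword counts for illustration only.
questioned = {"alot": 4, "definately": 2, "whilst": 1}
known_sets = {
    "Known A": {"whilst": 3, "lovely": 5},
    "Known B": {"alot": 6, "definately": 1, "mate": 2},
}
comparison = shared_keywords(questioned, known_sets)
for name, rows in comparison.items():
    print(name, rows)
```

The output plays the role of the Excel comparison table: one row per keyword that occurs in both the Questioned dataset and a Known dataset.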
