SlideShare a Scribd company logo
1 of 3
Download to read offline
Reading Kitty: Automating Prediction of Children’s Oral Reading Skill
Julie Medero, Alfredo Gomez, Alicia Ngo
Harvey Mudd College
Abstract
While affective outcomes are generally positive for the use of
eBooks and computer-based reading tutors in teaching chil-
dren to read, learning outcomes are often poorer (Korat and
Shamir 2004). We describe the first iteration of Reading Kitty,
an iOS application that uses NLP and speech processing to fo-
cus children’s time on close reading and prosody in oral read-
ing, while maintaining an emphasis on creativity and artifact
creation. We also share preliminary results demonstrating that
semitone range can be used to automatically predict readers’
skill level.
Introduction
Early elementary-aged children’s reading achievement is
aided by reading practice in general, especially when that
practice focuses on reading with appropriate prosody (Kim
and Petscher 2016). Because individualized instruction is
time-intensive for teachers, many eBook readers and com-
puterized reading programs are available for emerging read-
ers but, while those tools are popular, they result in poorer
reading interactions than paper books, which negatively im-
pacts learning outcomes.
By focusing students’ time on practicing reading with ap-
propriate prosody, with an end goal of an artifact creation,
the Reading with KIneTic TYpography (Reading Kitty) app
aims to to keep the positive affective elements of eBooks
without sacrificing quality of interaction with texts. In the re-
maining sections, we outline first steps toward development
of Reading Kitty. We also share preliminary analysis demon-
strating that the prosodic analysis used in Reading Kitty can
provide accurate and useful feedback to children.
The Reading Kitty app
Kinetic typography is a category of animations that uses
moving text and images to emphasize the meaning and struc-
ture of text. Figure 1 shows a frame from a Kinetic Typog-
raphy animation of a scene from the film Toy Story. In the
animation, words are emphasized through text size, weight,
and motion to reflect the way that they’re emphasized by the
character that delivers the line.
Copyright c 2019, Association for the Advancement of Artificial
Intelligence (www.aaai.org). All rights reserved.
Figure 1: Kinetic typography frame (RebelX-Productions
2016).
By interacting with Reading Kitty, a child reader is able
to create their own kinetic typography animations that reflect
their interactions with a story. First, Reading Kitty guides a
child reader through a close reading by presenting a series
of questions customized to the story a child is reading. Those
questions correspond to specific oral reading fluency bench-
marks in the Common Core Standards Initiative (National
Governors Association Center for Best Practices, Council of
Chief State School Officers 2010). The child uses the answer
to each close reading question as the basis for adjusting the
appearance of words in the story.
Next, Reading Kitty has the child read aloud. The
prosody of the child’s reading is used with the text charac-
teristics selected by the child to generate a kinetic typogra-
phy artifact, with better prosody resulting in more dynamic
videos. Finally, feedback about the child’s oral reading is
shared with the child’s teacher.
Pitch Analysis
Predicting high pitch
We want Reading Kitty to reward children who read with
appropriate prosody with more interesting, dynamic anima-
tions of their readings. With this goal in mind, we first exam-
ine the feasibility of predicting which words a skilled reader
would read with a high pitch. Classification models have
been used previously to predict the locations of prosodic
boundaries (Medero and Ostendorf 2013). We adopt this
idea by building a model to predict which words should be
read with a high pitch. We train the model on the Boston
Directions Corpus because this corpus is hand-labeled with
ToBI pitch tones (Silverman et al. 1992). We define a word
to be high-pitched when the word’s labeled tone has an H*
Figure 2: Average semitone range per utterance for skilled
and struggling readers.
in it. We use features that have proven successful for predict-
ing ToBI tones in Greek: part-of-speech (POS) tags, POS bi-
grams, syntactic chunk tags, the number of syllables in each
word, and each word’s depth in an automatically-generated
syntactic constituency tree (Zervas, Fakotakis, and Kokki-
nakis 2004). For our data set, though, resulting error rate is
too high to give reliable feedback to children. We look in-
stead at pitch range across an utterance.
Semitone range
We analyze the CMU Kids Corpus (Consortium. and Uni-
versity. 1997), which contains 5,180 utterances (spoken sen-
tences) from 76 children. The children range in age from
6 to 11 and are in first through third grade. These 76 chil-
dren are split into 2 groups, skilled readers (n = 44) and
struggling readers (n = 32) Each utterance in the corpus is
accompanied by a transcript, along with word- and phone-
level forced alignments.
For each utterance, we extract the F0 values in Hz every
.01s using PRAAT’s pitch tracker (Boersma 2001). We cal-
culate the F0 maximum and F0 minimum for each utterance
in the corpus, and then calculate the semitone range as de-
scribed by Aoyama et al. (Aoyama, Akbari, and Flege 2016).
We average the semitone ranges for each utterance across all
readers in the skilled group and all readers in the struggling
group.
Figure 2 compares the semitone range for the two groups.
Each point corresponds to an utterance. Points below the line
correspond to utterances that were read by struggling read-
ers with a smaller semitone range than by skilled readers.
Likewise, points above the line correspond to utterances that
were read by struggling readers with a larger semitone range
than by skilled readers.
Skilled readers show larger average semitone range than
struggling readers across utterances (Fig. 2), which is
promising for our goal of using semitone range as a metric
for feedback to young readers. Because the two datasets are
not matched in terms of which utterances are read by which
reader, however, we
Paired Comparison Finally, we perform a paired t-test.
We pair a skilled reader with a struggling reader who read
the same utterances. Since the participants in the two groups
do not align exactly, we use the Hungarian method to find
the pairings that maximize the number of shared utterances
(Kuhn 1955), resulting in 13 pairs with 41-91 shared utter-
ances. We ignore utterances that are not shared by each pair.
The skilled readers have an average semitone range of 17.03
(σ = 1.85) and the struggling readers have an average semi-
tone range of 14.83 (σ = 2.15), which is statistically signif-
icant with p < .05.
Conclusion
Reading Kitty represents a new way to bring evidence-based
reading instruction in an electronic form. Our initial anal-
ysis provides promising evidence that semitone-based fea-
tures can be used to give feedback on prosodic quality of
oral readings.
References
Aoyama, K.; Akbari, C.; and Flege, J. 2016. Prosodic char-
acteristics of American English in school-age children. In
Proc. Speech Prosody, 572–576.
Boersma, P. 2001. Praat, a system for doing phonetics by
computer. Glot International.
Consortium., L. D., and University., C.-M. 1997. The CMU
kids speech corpus TT -.
Kim, Y. S. G., and Petscher, Y. 2016. Prosodic Sensitiv-
ity and Reading: An Investigation of Pathways of Relations
Using a Latent Variable Approach. Journal of Educational
Psychology 108(5):630–645.
Korat, O., and Shamir, A. 2004. Do Hebrew electronic
books differ from Dutch electronic books? A replication of
a Dutch content analysis. Journal of Computer Assisted
Learning 20(4):257–268.
Kuhn, H. W. 1955. The Hungarian method for the assign-
ment problem. Naval Research Logistics Quarterly 83–97.
Medero, J., and Ostendorf, M. 2013. Atypical Prosodic
Structure as an Indicator of Reading Level and Text Diffi-
culty. In Proc. NAACL-HLT, 715–720.
National Governors Association Center for Best Practices,
Council of Chief State School Officers. 2010. Common
Core State Standards for English Language Arts. Techni-
cal report, National Governors Association Center for Best
Practices, Council of Chief State School Officers, Washing-
ton, D.C.
2016. Toy Story - ”You Are A Toy” Kinetic Typography.
https://youtu.be/_PU49-_qQoI.
Silverman, K.; Beckman, M.; Pitrelli, J.; Ostendorf,
M.; Wightman, C. C.; Price, P.; Pierrehumbert, J.; and
Hirschberg, J. 1992. TOBI: A Standard for Labeling En-
glish Prosody. In Second International Conference on Spo-
ken Language Processing.
Zervas, P.; Fakotakis, N.; and Kokkinakis, G. 2004. Pitch
accent prediction from ToBI annotated corpora based on
Bayesian learning. Lecture Notes in Artificial Intelli-
gence (Subseries of Lecture Notes in Computer Science)
3206:545–552.

More Related Content

What's hot

The literate environment
The literate environmentThe literate environment
The literate environmentkteacher11
 
Fluency & Comprehension Professional Development Workshop
Fluency & Comprehension Professional Development Workshop Fluency & Comprehension Professional Development Workshop
Fluency & Comprehension Professional Development Workshop Liz Sims
 
Principles and methodologies questions and answers
Principles and methodologies questions and answersPrinciples and methodologies questions and answers
Principles and methodologies questions and answersMahima Zaman
 
Literate environment presentation
Literate environment presentationLiterate environment presentation
Literate environment presentationbrklyn09
 
Mentoring Reading 4.11
Mentoring Reading 4.11Mentoring Reading 4.11
Mentoring Reading 4.11englishonecfl
 
Supporting Low Level Readers in the Common Core Classroom
Supporting Low Level Readers in the Common Core ClassroomSupporting Low Level Readers in the Common Core Classroom
Supporting Low Level Readers in the Common Core ClassroomKristin Guest MS, CCC-SLP
 
3-Teaching-Literacy-in-Elementary.pdf
3-Teaching-Literacy-in-Elementary.pdf3-Teaching-Literacy-in-Elementary.pdf
3-Teaching-Literacy-in-Elementary.pdfAngelikaJoyAdvincula1
 
Reading Diagnosis & Remediation for Elementary Student
Reading Diagnosis & Remediation for Elementary StudentReading Diagnosis & Remediation for Elementary Student
Reading Diagnosis & Remediation for Elementary StudentB. J. Zagorac
 
Literate environment analysis
Literate environment analysisLiterate environment analysis
Literate environment analysiskd4rl4ever
 
Improving Sentiment Analysis of Short Informal Indonesian Product Reviews usi...
Improving Sentiment Analysis of Short Informal Indonesian Product Reviews usi...Improving Sentiment Analysis of Short Informal Indonesian Product Reviews usi...
Improving Sentiment Analysis of Short Informal Indonesian Product Reviews usi...TELKOMNIKA JOURNAL
 
SPED420ch8PP
SPED420ch8PPSPED420ch8PP
SPED420ch8PPfiegent
 
Philosophy Based Language Teaching
Philosophy Based Language TeachingPhilosophy Based Language Teaching
Philosophy Based Language TeachingSheng Nuesca
 

What's hot (16)

The literate environment
The literate environmentThe literate environment
The literate environment
 
Fluency & Comprehension Professional Development Workshop
Fluency & Comprehension Professional Development Workshop Fluency & Comprehension Professional Development Workshop
Fluency & Comprehension Professional Development Workshop
 
Principles and methodologies questions and answers
Principles and methodologies questions and answersPrinciples and methodologies questions and answers
Principles and methodologies questions and answers
 
Reading In Elt
Reading In EltReading In Elt
Reading In Elt
 
Literate environment presentation
Literate environment presentationLiterate environment presentation
Literate environment presentation
 
Mentoring Reading 4.11
Mentoring Reading 4.11Mentoring Reading 4.11
Mentoring Reading 4.11
 
Supporting Low Level Readers in the Common Core Classroom
Supporting Low Level Readers in the Common Core ClassroomSupporting Low Level Readers in the Common Core Classroom
Supporting Low Level Readers in the Common Core Classroom
 
3-Teaching-Literacy-in-Elementary.pdf
3-Teaching-Literacy-in-Elementary.pdf3-Teaching-Literacy-in-Elementary.pdf
3-Teaching-Literacy-in-Elementary.pdf
 
Reading Workshop
Reading WorkshopReading Workshop
Reading Workshop
 
Ehri word learning
Ehri word learningEhri word learning
Ehri word learning
 
Reading Diagnosis & Remediation for Elementary Student
Reading Diagnosis & Remediation for Elementary StudentReading Diagnosis & Remediation for Elementary Student
Reading Diagnosis & Remediation for Elementary Student
 
Literate environment analysis
Literate environment analysisLiterate environment analysis
Literate environment analysis
 
Improving Sentiment Analysis of Short Informal Indonesian Product Reviews usi...
Improving Sentiment Analysis of Short Informal Indonesian Product Reviews usi...Improving Sentiment Analysis of Short Informal Indonesian Product Reviews usi...
Improving Sentiment Analysis of Short Informal Indonesian Product Reviews usi...
 
N01741100102
N01741100102N01741100102
N01741100102
 
SPED420ch8PP
SPED420ch8PPSPED420ch8PP
SPED420ch8PP
 
Philosophy Based Language Teaching
Philosophy Based Language TeachingPhilosophy Based Language Teaching
Philosophy Based Language Teaching
 

Similar to Reading Kitty Paper 2018

Literate environment assignment
Literate environment assignmentLiterate environment assignment
Literate environment assignmentlschipper
 
Literate environment analysis
Literate environment analysisLiterate environment analysis
Literate environment analysisrittere
 
Literate environment analysis presentation
Literate environment analysis presentationLiterate environment analysis presentation
Literate environment analysis presentationKim Sexton
 
Literate environment analysis
Literate environment analysisLiterate environment analysis
Literate environment analysisshancam08
 
Literate Environment Presentation
Literate Environment PresentationLiterate Environment Presentation
Literate Environment Presentationlarrycucumber
 
Literate Environment Analysis Presentation
Literate Environment Analysis PresentationLiterate Environment Analysis Presentation
Literate Environment Analysis Presentationavrilcogle
 
לימוד קריאה לתלמידים לקויי למידה
לימוד קריאה לתלמידים לקויי למידהלימוד קריאה לתלמידים לקויי למידה
לימוד קריאה לתלמידים לקויי למידהBoaz Bash March
 
Literacy learners digital story
Literacy learners digital storyLiteracy learners digital story
Literacy learners digital storyMaria Stamp
 
Literate environment analysis
Literate environment analysis Literate environment analysis
Literate environment analysis AngieBailey
 
Designing and Researching E-books to Support Language & Literacy
Designing and Researching E-books to Support Language & LiteracyDesigning and Researching E-books to Support Language & Literacy
Designing and Researching E-books to Support Language & LiteracyCenter for Literacy
 
Webb, mary students’ engagement of reading beyond the word nftej v25 n3 2015
Webb, mary students’ engagement of reading beyond the word nftej v25 n3 2015Webb, mary students’ engagement of reading beyond the word nftej v25 n3 2015
Webb, mary students’ engagement of reading beyond the word nftej v25 n3 2015William Kritsonis
 
Knifley Literate Environment Analysis
Knifley Literate Environment AnalysisKnifley Literate Environment Analysis
Knifley Literate Environment Analysislknifley
 
Literate Environment Analysis
Literate Environment AnalysisLiterate Environment Analysis
Literate Environment Analysislknifley
 
Literate Environment Analysis Presentation The Beginning Reader, Pre K-3 (ED...
Literate Environment Analysis Presentation The Beginning Reader, Pre K-3  (ED...Literate Environment Analysis Presentation The Beginning Reader, Pre K-3  (ED...
Literate Environment Analysis Presentation The Beginning Reader, Pre K-3 (ED...shannonleu
 
Literate Environment Analysis Presentation
Literate Environment Analysis PresentationLiterate Environment Analysis Presentation
Literate Environment Analysis PresentationTaylorLynn
 
Literate environment analysis
Literate environment analysisLiterate environment analysis
Literate environment analysisIberia Randle
 
Literate Environment Analysis Presentation
Literate Environment Analysis PresentationLiterate Environment Analysis Presentation
Literate Environment Analysis Presentationsmythboys3
 
Read 6706 course project overview- literate environment analysis presentation
Read 6706  course project overview- literate environment analysis presentationRead 6706  course project overview- literate environment analysis presentation
Read 6706 course project overview- literate environment analysis presentationstokkecs
 
ELSA's Speech Recognition Overview
ELSA's Speech Recognition OverviewELSA's Speech Recognition Overview
ELSA's Speech Recognition OverviewLinhVu946763
 

Similar to Reading Kitty Paper 2018 (20)

Literate environment assignment
Literate environment assignmentLiterate environment assignment
Literate environment assignment
 
Literate environment analysis
Literate environment analysisLiterate environment analysis
Literate environment analysis
 
Literate environment analysis presentation
Literate environment analysis presentationLiterate environment analysis presentation
Literate environment analysis presentation
 
Literate environment analysis
Literate environment analysisLiterate environment analysis
Literate environment analysis
 
Literate Environment Presentation
Literate Environment PresentationLiterate Environment Presentation
Literate Environment Presentation
 
Literate Environment Analysis Presentation
Literate Environment Analysis PresentationLiterate Environment Analysis Presentation
Literate Environment Analysis Presentation
 
לימוד קריאה לתלמידים לקויי למידה
לימוד קריאה לתלמידים לקויי למידהלימוד קריאה לתלמידים לקויי למידה
לימוד קריאה לתלמידים לקויי למידה
 
Literacy learners digital story
Literacy learners digital storyLiteracy learners digital story
Literacy learners digital story
 
Literate environment analysis
Literate environment analysis Literate environment analysis
Literate environment analysis
 
Designing and Researching E-books to Support Language & Literacy
Designing and Researching E-books to Support Language & LiteracyDesigning and Researching E-books to Support Language & Literacy
Designing and Researching E-books to Support Language & Literacy
 
Webb, mary students’ engagement of reading beyond the word nftej v25 n3 2015
Webb, mary students’ engagement of reading beyond the word nftej v25 n3 2015Webb, mary students’ engagement of reading beyond the word nftej v25 n3 2015
Webb, mary students’ engagement of reading beyond the word nftej v25 n3 2015
 
Knifley Literate Environment Analysis
Knifley Literate Environment AnalysisKnifley Literate Environment Analysis
Knifley Literate Environment Analysis
 
Literate Environment Analysis
Literate Environment AnalysisLiterate Environment Analysis
Literate Environment Analysis
 
Literate Environment Analysis Presentation The Beginning Reader, Pre K-3 (ED...
Literate Environment Analysis Presentation The Beginning Reader, Pre K-3  (ED...Literate Environment Analysis Presentation The Beginning Reader, Pre K-3  (ED...
Literate Environment Analysis Presentation The Beginning Reader, Pre K-3 (ED...
 
Literate Environment Analysis Presentation
Literate Environment Analysis PresentationLiterate Environment Analysis Presentation
Literate Environment Analysis Presentation
 
Literate environment analysis
Literate environment analysisLiterate environment analysis
Literate environment analysis
 
Prof Prax 2007
Prof Prax 2007Prof Prax 2007
Prof Prax 2007
 
Literate Environment Analysis Presentation
Literate Environment Analysis PresentationLiterate Environment Analysis Presentation
Literate Environment Analysis Presentation
 
Read 6706 course project overview- literate environment analysis presentation
Read 6706  course project overview- literate environment analysis presentationRead 6706  course project overview- literate environment analysis presentation
Read 6706 course project overview- literate environment analysis presentation
 
ELSA's Speech Recognition Overview
ELSA's Speech Recognition OverviewELSA's Speech Recognition Overview
ELSA's Speech Recognition Overview
 

Recently uploaded

Evaluating the top large language models.pdf
Evaluating the top large language models.pdfEvaluating the top large language models.pdf
Evaluating the top large language models.pdfChristopherTHyatt
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Drew Madelung
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)Gabriella Davis
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUK Journal
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoffsammart93
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationMichael W. Hawkins
 
Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024The Digital Insurer
 
Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Enterprise Knowledge
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking MenDelhi Call girls
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsJoaquim Jorge
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024Rafal Los
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Miguel Araújo
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slidevu2urc
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Servicegiselly40
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonetsnaman860154
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfsudhanshuwaghmare1
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfEnterprise Knowledge
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreternaman860154
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking MenDelhi Call girls
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024The Digital Insurer
 

Recently uploaded (20)

Evaluating the top large language models.pdf
Evaluating the top large language models.pdfEvaluating the top large language models.pdf
Evaluating the top large language models.pdf
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
 
Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024
 
Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Service
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonets
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreter
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024
 

Reading Kitty Paper 2018

  • 1. Reading Kitty: Automating Prediction of Children’s Oral Reading Skill Julie Medero, Alfredo Gomez, Alicia Ngo Harvey Mudd College Abstract While affective outcomes are generally positive for the use of eBooks and computer-based reading tutors in teaching chil- dren to read, learning outcomes are often poorer (Korat and Shamir 2004). We describe the first iteration of Reading Kitty, an iOS application that uses NLP and speech processing to fo- cus children’s time on close reading and prosody in oral read- ing, while maintaining an emphasis on creativity and artifact creation. We also share preliminary results demonstrating that semitone range can be used to automatically predict readers’ skill level. Introduction Early elementary-aged children’s reading achievement is aided by reading practice in general, especially when that practice focuses on reading with appropriate prosody (Kim and Petscher 2016). Because individualized instruction is time-intensive for teachers, many eBook readers and com- puterized reading programs are available for emerging read- ers but, while those tools are popular, they result in poorer reading interactions than paper books, which negatively im- pacts learning outcomes. By focusing students’ time on practicing reading with ap- propriate prosody, with an end goal of an artifact creation, the Reading with KIneTic TYpography (Reading Kitty) app aims to to keep the positive affective elements of eBooks without sacrificing quality of interaction with texts. In the re- maining sections, we outline first steps toward development of Reading Kitty. We also share preliminary analysis demon- strating that the prosodic analysis used in Reading Kitty can provide accurate and useful feedback to children. The Reading Kitty app Kinetic typography is a category of animations that uses moving text and images to emphasize the meaning and struc- ture of text. Figure 1 shows a frame from a Kinetic Typog- raphy animation of a scene from the film Toy Story. In the animation, words are emphasized through text size, weight, and motion to reflect the way that they’re emphasized by the character that delivers the line. Copyright c 2019, Association for the Advancement of Artificial Intelligence (www.aaai.org). All rights reserved. Figure 1: Kinetic typography frame (RebelX-Productions 2016). By interacting with Reading Kitty, a child reader is able to create their own kinetic typography animations that reflect their interactions with a story. First, Reading Kitty guides a child reader through a close reading by presenting a series of questions customized to the story a child is reading. Those questions correspond to specific oral reading fluency bench- marks in the Common Core Standards Initiative (National Governors Association Center for Best Practices, Council of Chief State School Officers 2010). The child uses the answer to each close reading question as the basis for adjusting the appearance of words in the story. Next, Reading Kitty has the child read aloud. The prosody of the child’s reading is used with the text charac- teristics selected by the child to generate a kinetic typogra- phy artifact, with better prosody resulting in more dynamic videos. Finally, feedback about the child’s oral reading is shared with the child’s teacher. Pitch Analysis Predicting high pitch We want Reading Kitty to reward children who read with appropriate prosody with more interesting, dynamic anima- tions of their readings. With this goal in mind, we first exam- ine the feasibility of predicting which words a skilled reader would read with a high pitch. Classification models have been used previously to predict the locations of prosodic boundaries (Medero and Ostendorf 2013). We adopt this idea by building a model to predict which words should be read with a high pitch. We train the model on the Boston Directions Corpus because this corpus is hand-labeled with ToBI pitch tones (Silverman et al. 1992). We define a word to be high-pitched when the word’s labeled tone has an H*
  • 2. Figure 2: Average semitone range per utterance for skilled and struggling readers. in it. We use features that have proven successful for predict- ing ToBI tones in Greek: part-of-speech (POS) tags, POS bi- grams, syntactic chunk tags, the number of syllables in each word, and each word’s depth in an automatically-generated syntactic constituency tree (Zervas, Fakotakis, and Kokki- nakis 2004). For our data set, though, resulting error rate is too high to give reliable feedback to children. We look in- stead at pitch range across an utterance. Semitone range We analyze the CMU Kids Corpus (Consortium. and Uni- versity. 1997), which contains 5,180 utterances (spoken sen- tences) from 76 children. The children range in age from 6 to 11 and are in first through third grade. These 76 chil- dren are split into 2 groups, skilled readers (n = 44) and struggling readers (n = 32) Each utterance in the corpus is accompanied by a transcript, along with word- and phone- level forced alignments. For each utterance, we extract the F0 values in Hz every .01s using PRAAT’s pitch tracker (Boersma 2001). We cal- culate the F0 maximum and F0 minimum for each utterance in the corpus, and then calculate the semitone range as de- scribed by Aoyama et al. (Aoyama, Akbari, and Flege 2016). We average the semitone ranges for each utterance across all readers in the skilled group and all readers in the struggling group. Figure 2 compares the semitone range for the two groups. Each point corresponds to an utterance. Points below the line correspond to utterances that were read by struggling read- ers with a smaller semitone range than by skilled readers. Likewise, points above the line correspond to utterances that were read by struggling readers with a larger semitone range than by skilled readers. Skilled readers show larger average semitone range than struggling readers across utterances (Fig. 2), which is promising for our goal of using semitone range as a metric for feedback to young readers. Because the two datasets are not matched in terms of which utterances are read by which reader, however, we Paired Comparison Finally, we perform a paired t-test. We pair a skilled reader with a struggling reader who read the same utterances. Since the participants in the two groups do not align exactly, we use the Hungarian method to find the pairings that maximize the number of shared utterances (Kuhn 1955), resulting in 13 pairs with 41-91 shared utter- ances. We ignore utterances that are not shared by each pair. The skilled readers have an average semitone range of 17.03 (σ = 1.85) and the struggling readers have an average semi- tone range of 14.83 (σ = 2.15), which is statistically signif- icant with p < .05. Conclusion Reading Kitty represents a new way to bring evidence-based reading instruction in an electronic form. Our initial anal- ysis provides promising evidence that semitone-based fea- tures can be used to give feedback on prosodic quality of oral readings. References Aoyama, K.; Akbari, C.; and Flege, J. 2016. Prosodic char- acteristics of American English in school-age children. In Proc. Speech Prosody, 572–576. Boersma, P. 2001. Praat, a system for doing phonetics by computer. Glot International. Consortium., L. D., and University., C.-M. 1997. The CMU kids speech corpus TT -. Kim, Y. S. G., and Petscher, Y. 2016. Prosodic Sensitiv- ity and Reading: An Investigation of Pathways of Relations Using a Latent Variable Approach. Journal of Educational Psychology 108(5):630–645. Korat, O., and Shamir, A. 2004. Do Hebrew electronic books differ from Dutch electronic books? A replication of a Dutch content analysis. Journal of Computer Assisted Learning 20(4):257–268. Kuhn, H. W. 1955. The Hungarian method for the assign- ment problem. Naval Research Logistics Quarterly 83–97. Medero, J., and Ostendorf, M. 2013. Atypical Prosodic Structure as an Indicator of Reading Level and Text Diffi- culty. In Proc. NAACL-HLT, 715–720. National Governors Association Center for Best Practices, Council of Chief State School Officers. 2010. Common Core State Standards for English Language Arts. Techni- cal report, National Governors Association Center for Best Practices, Council of Chief State School Officers, Washing- ton, D.C. 2016. Toy Story - ”You Are A Toy” Kinetic Typography. https://youtu.be/_PU49-_qQoI. Silverman, K.; Beckman, M.; Pitrelli, J.; Ostendorf, M.; Wightman, C. C.; Price, P.; Pierrehumbert, J.; and Hirschberg, J. 1992. TOBI: A Standard for Labeling En- glish Prosody. In Second International Conference on Spo- ken Language Processing. Zervas, P.; Fakotakis, N.; and Kokkinakis, G. 2004. Pitch accent prediction from ToBI annotated corpora based on
  • 3. Bayesian learning. Lecture Notes in Artificial Intelli- gence (Subseries of Lecture Notes in Computer Science) 3206:545–552.