SlideShare a Scribd company logo
Reading Kitty: Automating Prediction of Children’s Oral Reading Skill
Julie Medero, Alfredo Gomez, Alicia Ngo
Harvey Mudd College
Abstract
While affective outcomes are generally positive for the use of
eBooks and computer-based reading tutors in teaching chil-
dren to read, learning outcomes are often poorer (Korat and
Shamir 2004). We describe the first iteration of Reading Kitty,
an iOS application that uses NLP and speech processing to fo-
cus children’s time on close reading and prosody in oral read-
ing, while maintaining an emphasis on creativity and artifact
creation. We also share preliminary results demonstrating that
semitone range can be used to automatically predict readers’
skill level.
Introduction
Early elementary-aged children’s reading achievement is
aided by reading practice in general, especially when that
practice focuses on reading with appropriate prosody (Kim
and Petscher 2016). Because individualized instruction is
time-intensive for teachers, many eBook readers and com-
puterized reading programs are available for emerging read-
ers but, while those tools are popular, they result in poorer
reading interactions than paper books, which negatively im-
pacts learning outcomes.
By focusing students’ time on practicing reading with ap-
propriate prosody, with an end goal of an artifact creation,
the Reading with KIneTic TYpography (Reading Kitty) app
aims to to keep the positive affective elements of eBooks
without sacrificing quality of interaction with texts. In the re-
maining sections, we outline first steps toward development
of Reading Kitty. We also share preliminary analysis demon-
strating that the prosodic analysis used in Reading Kitty can
provide accurate and useful feedback to children.
The Reading Kitty app
Kinetic typography is a category of animations that uses
moving text and images to emphasize the meaning and struc-
ture of text. Figure 1 shows a frame from a Kinetic Typog-
raphy animation of a scene from the film Toy Story. In the
animation, words are emphasized through text size, weight,
and motion to reflect the way that they’re emphasized by the
character that delivers the line.
Copyright c 2019, Association for the Advancement of Artificial
Intelligence (www.aaai.org). All rights reserved.
Figure 1: Kinetic typography frame (RebelX-Productions
2016).
By interacting with Reading Kitty, a child reader is able
to create their own kinetic typography animations that reflect
their interactions with a story. First, Reading Kitty guides a
child reader through a close reading by presenting a series
of questions customized to the story a child is reading. Those
questions correspond to specific oral reading fluency bench-
marks in the Common Core Standards Initiative (National
Governors Association Center for Best Practices, Council of
Chief State School Officers 2010). The child uses the answer
to each close reading question as the basis for adjusting the
appearance of words in the story.
Next, Reading Kitty has the child read aloud. The
prosody of the child’s reading is used with the text charac-
teristics selected by the child to generate a kinetic typogra-
phy artifact, with better prosody resulting in more dynamic
videos. Finally, feedback about the child’s oral reading is
shared with the child’s teacher.
Pitch Analysis
Predicting high pitch
We want Reading Kitty to reward children who read with
appropriate prosody with more interesting, dynamic anima-
tions of their readings. With this goal in mind, we first exam-
ine the feasibility of predicting which words a skilled reader
would read with a high pitch. Classification models have
been used previously to predict the locations of prosodic
boundaries (Medero and Ostendorf 2013). We adopt this
idea by building a model to predict which words should be
read with a high pitch. We train the model on the Boston
Directions Corpus because this corpus is hand-labeled with
ToBI pitch tones (Silverman et al. 1992). We define a word
to be high-pitched when the word’s labeled tone has an H*
Figure 2: Average semitone range per utterance for skilled
and struggling readers.
in it. We use features that have proven successful for predict-
ing ToBI tones in Greek: part-of-speech (POS) tags, POS bi-
grams, syntactic chunk tags, the number of syllables in each
word, and each word’s depth in an automatically-generated
syntactic constituency tree (Zervas, Fakotakis, and Kokki-
nakis 2004). For our data set, though, resulting error rate is
too high to give reliable feedback to children. We look in-
stead at pitch range across an utterance.
Semitone range
We analyze the CMU Kids Corpus (Consortium. and Uni-
versity. 1997), which contains 5,180 utterances (spoken sen-
tences) from 76 children. The children range in age from
6 to 11 and are in first through third grade. These 76 chil-
dren are split into 2 groups, skilled readers (n = 44) and
struggling readers (n = 32) Each utterance in the corpus is
accompanied by a transcript, along with word- and phone-
level forced alignments.
For each utterance, we extract the F0 values in Hz every
.01s using PRAAT’s pitch tracker (Boersma 2001). We cal-
culate the F0 maximum and F0 minimum for each utterance
in the corpus, and then calculate the semitone range as de-
scribed by Aoyama et al. (Aoyama, Akbari, and Flege 2016).
We average the semitone ranges for each utterance across all
readers in the skilled group and all readers in the struggling
group.
Figure 2 compares the semitone range for the two groups.
Each point corresponds to an utterance. Points below the line
correspond to utterances that were read by struggling read-
ers with a smaller semitone range than by skilled readers.
Likewise, points above the line correspond to utterances that
were read by struggling readers with a larger semitone range
than by skilled readers.
Skilled readers show larger average semitone range than
struggling readers across utterances (Fig. 2), which is
promising for our goal of using semitone range as a metric
for feedback to young readers. Because the two datasets are
not matched in terms of which utterances are read by which
reader, however, we
Paired Comparison Finally, we perform a paired t-test.
We pair a skilled reader with a struggling reader who read
the same utterances. Since the participants in the two groups
do not align exactly, we use the Hungarian method to find
the pairings that maximize the number of shared utterances
(Kuhn 1955), resulting in 13 pairs with 41-91 shared utter-
ances. We ignore utterances that are not shared by each pair.
The skilled readers have an average semitone range of 17.03
(σ = 1.85) and the struggling readers have an average semi-
tone range of 14.83 (σ = 2.15), which is statistically signif-
icant with p < .05.
Conclusion
Reading Kitty represents a new way to bring evidence-based
reading instruction in an electronic form. Our initial anal-
ysis provides promising evidence that semitone-based fea-
tures can be used to give feedback on prosodic quality of
oral readings.
References
Aoyama, K.; Akbari, C.; and Flege, J. 2016. Prosodic char-
acteristics of American English in school-age children. In
Proc. Speech Prosody, 572–576.
Boersma, P. 2001. Praat, a system for doing phonetics by
computer. Glot International.
Consortium., L. D., and University., C.-M. 1997. The CMU
kids speech corpus TT -.
Kim, Y. S. G., and Petscher, Y. 2016. Prosodic Sensitiv-
ity and Reading: An Investigation of Pathways of Relations
Using a Latent Variable Approach. Journal of Educational
Psychology 108(5):630–645.
Korat, O., and Shamir, A. 2004. Do Hebrew electronic
books differ from Dutch electronic books? A replication of
a Dutch content analysis. Journal of Computer Assisted
Learning 20(4):257–268.
Kuhn, H. W. 1955. The Hungarian method for the assign-
ment problem. Naval Research Logistics Quarterly 83–97.
Medero, J., and Ostendorf, M. 2013. Atypical Prosodic
Structure as an Indicator of Reading Level and Text Diffi-
culty. In Proc. NAACL-HLT, 715–720.
National Governors Association Center for Best Practices,
Council of Chief State School Officers. 2010. Common
Core State Standards for English Language Arts. Techni-
cal report, National Governors Association Center for Best
Practices, Council of Chief State School Officers, Washing-
ton, D.C.
2016. Toy Story - ”You Are A Toy” Kinetic Typography.
https://youtu.be/_PU49-_qQoI.
Silverman, K.; Beckman, M.; Pitrelli, J.; Ostendorf,
M.; Wightman, C. C.; Price, P.; Pierrehumbert, J.; and
Hirschberg, J. 1992. TOBI: A Standard for Labeling En-
glish Prosody. In Second International Conference on Spo-
ken Language Processing.
Zervas, P.; Fakotakis, N.; and Kokkinakis, G. 2004. Pitch
accent prediction from ToBI annotated corpora based on
Bayesian learning. Lecture Notes in Artificial Intelli-
gence (Subseries of Lecture Notes in Computer Science)
3206:545–552.

More Related Content

What's hot

The literate environment
The literate environmentThe literate environment
The literate environment
kteacher11
 
Fluency & Comprehension Professional Development Workshop
Fluency & Comprehension Professional Development Workshop Fluency & Comprehension Professional Development Workshop
Fluency & Comprehension Professional Development Workshop
Liz Sims
 
Principles and methodologies questions and answers
Principles and methodologies questions and answersPrinciples and methodologies questions and answers
Principles and methodologies questions and answers
Mahima Zaman
 
Reading In Elt
Reading In EltReading In Elt
Reading In Elt
englishonecfl
 
Literate environment presentation
Literate environment presentationLiterate environment presentation
Literate environment presentation
brklyn09
 
Mentoring Reading 4.11
Mentoring Reading 4.11Mentoring Reading 4.11
Mentoring Reading 4.11
englishonecfl
 
Supporting Low Level Readers in the Common Core Classroom
Supporting Low Level Readers in the Common Core ClassroomSupporting Low Level Readers in the Common Core Classroom
Supporting Low Level Readers in the Common Core Classroom
Kristin Guest MS, CCC-SLP
 
3-Teaching-Literacy-in-Elementary.pdf
3-Teaching-Literacy-in-Elementary.pdf3-Teaching-Literacy-in-Elementary.pdf
3-Teaching-Literacy-in-Elementary.pdf
AngelikaJoyAdvincula1
 
Reading Workshop
Reading WorkshopReading Workshop
Reading Workshop
englishonecfl
 
Ehri word learning
Ehri word learningEhri word learning
Ehri word learning
Saravuth kong
 
Reading Diagnosis & Remediation for Elementary Student
Reading Diagnosis & Remediation for Elementary StudentReading Diagnosis & Remediation for Elementary Student
Reading Diagnosis & Remediation for Elementary Student
B. J. Zagorac
 
Literate environment analysis
Literate environment analysisLiterate environment analysis
Literate environment analysis
kd4rl4ever
 
Improving Sentiment Analysis of Short Informal Indonesian Product Reviews usi...
Improving Sentiment Analysis of Short Informal Indonesian Product Reviews usi...Improving Sentiment Analysis of Short Informal Indonesian Product Reviews usi...
Improving Sentiment Analysis of Short Informal Indonesian Product Reviews usi...
TELKOMNIKA JOURNAL
 
N01741100102
N01741100102N01741100102
N01741100102
IOSR Journals
 
SPED420ch8PP
SPED420ch8PPSPED420ch8PP
SPED420ch8PP
fiegent
 
Philosophy Based Language Teaching
Philosophy Based Language TeachingPhilosophy Based Language Teaching
Philosophy Based Language Teaching
Sheng Nuesca
 

What's hot (16)

The literate environment
The literate environmentThe literate environment
The literate environment
 
Fluency & Comprehension Professional Development Workshop
Fluency & Comprehension Professional Development Workshop Fluency & Comprehension Professional Development Workshop
Fluency & Comprehension Professional Development Workshop
 
Principles and methodologies questions and answers
Principles and methodologies questions and answersPrinciples and methodologies questions and answers
Principles and methodologies questions and answers
 
Reading In Elt
Reading In EltReading In Elt
Reading In Elt
 
Literate environment presentation
Literate environment presentationLiterate environment presentation
Literate environment presentation
 
Mentoring Reading 4.11
Mentoring Reading 4.11Mentoring Reading 4.11
Mentoring Reading 4.11
 
Supporting Low Level Readers in the Common Core Classroom
Supporting Low Level Readers in the Common Core ClassroomSupporting Low Level Readers in the Common Core Classroom
Supporting Low Level Readers in the Common Core Classroom
 
3-Teaching-Literacy-in-Elementary.pdf
3-Teaching-Literacy-in-Elementary.pdf3-Teaching-Literacy-in-Elementary.pdf
3-Teaching-Literacy-in-Elementary.pdf
 
Reading Workshop
Reading WorkshopReading Workshop
Reading Workshop
 
Ehri word learning
Ehri word learningEhri word learning
Ehri word learning
 
Reading Diagnosis & Remediation for Elementary Student
Reading Diagnosis & Remediation for Elementary StudentReading Diagnosis & Remediation for Elementary Student
Reading Diagnosis & Remediation for Elementary Student
 
Literate environment analysis
Literate environment analysisLiterate environment analysis
Literate environment analysis
 
Improving Sentiment Analysis of Short Informal Indonesian Product Reviews usi...
Improving Sentiment Analysis of Short Informal Indonesian Product Reviews usi...Improving Sentiment Analysis of Short Informal Indonesian Product Reviews usi...
Improving Sentiment Analysis of Short Informal Indonesian Product Reviews usi...
 
N01741100102
N01741100102N01741100102
N01741100102
 
SPED420ch8PP
SPED420ch8PPSPED420ch8PP
SPED420ch8PP
 
Philosophy Based Language Teaching
Philosophy Based Language TeachingPhilosophy Based Language Teaching
Philosophy Based Language Teaching
 

Similar to Reading Kitty Paper 2018

Literate environment assignment
Literate environment assignmentLiterate environment assignment
Literate environment assignment
lschipper
 
Literate environment analysis
Literate environment analysisLiterate environment analysis
Literate environment analysis
rittere
 
Literate environment analysis presentation
Literate environment analysis presentationLiterate environment analysis presentation
Literate environment analysis presentation
Kim Sexton
 
Literate environment analysis
Literate environment analysisLiterate environment analysis
Literate environment analysis
shancam08
 
Literate Environment Presentation
Literate Environment PresentationLiterate Environment Presentation
Literate Environment Presentation
larrycucumber
 
Literate Environment Analysis Presentation
Literate Environment Analysis PresentationLiterate Environment Analysis Presentation
Literate Environment Analysis Presentation
avrilcogle
 
לימוד קריאה לתלמידים לקויי למידה
לימוד קריאה לתלמידים לקויי למידהלימוד קריאה לתלמידים לקויי למידה
לימוד קריאה לתלמידים לקויי למידה
Boaz Bash March
 
Literacy learners digital story
Literacy learners digital storyLiteracy learners digital story
Literacy learners digital story
Maria Stamp
 
Literate environment analysis
Literate environment analysis Literate environment analysis
Literate environment analysis
AngieBailey
 
Designing and Researching E-books to Support Language & Literacy
Designing and Researching E-books to Support Language & LiteracyDesigning and Researching E-books to Support Language & Literacy
Designing and Researching E-books to Support Language & Literacy
Center for Literacy
 
Webb, mary students’ engagement of reading beyond the word nftej v25 n3 2015
Webb, mary students’ engagement of reading beyond the word nftej v25 n3 2015Webb, mary students’ engagement of reading beyond the word nftej v25 n3 2015
Webb, mary students’ engagement of reading beyond the word nftej v25 n3 2015
William Kritsonis
 
Literate Environment Analysis
Literate Environment AnalysisLiterate Environment Analysis
Literate Environment Analysis
lknifley
 
Knifley Literate Environment Analysis
Knifley Literate Environment AnalysisKnifley Literate Environment Analysis
Knifley Literate Environment Analysis
lknifley
 
Literate Environment Analysis Presentation The Beginning Reader, Pre K-3 (ED...
Literate Environment Analysis Presentation The Beginning Reader, Pre K-3  (ED...Literate Environment Analysis Presentation The Beginning Reader, Pre K-3  (ED...
Literate Environment Analysis Presentation The Beginning Reader, Pre K-3 (ED...
shannonleu
 
Literate Environment Analysis Presentation
Literate Environment Analysis PresentationLiterate Environment Analysis Presentation
Literate Environment Analysis Presentation
TaylorLynn
 
Literate environment analysis
Literate environment analysisLiterate environment analysis
Literate environment analysis
Iberia Randle
 
Prof Prax 2007
Prof Prax 2007Prof Prax 2007
Prof Prax 2007
Brock Dubbels
 
Literate Environment Analysis Presentation
Literate Environment Analysis PresentationLiterate Environment Analysis Presentation
Literate Environment Analysis Presentation
smythboys3
 
Read 6706 course project overview- literate environment analysis presentation
Read 6706  course project overview- literate environment analysis presentationRead 6706  course project overview- literate environment analysis presentation
Read 6706 course project overview- literate environment analysis presentation
stokkecs
 
ELSA's Speech Recognition Overview
ELSA's Speech Recognition OverviewELSA's Speech Recognition Overview
ELSA's Speech Recognition Overview
LinhVu946763
 

Similar to Reading Kitty Paper 2018 (20)

Literate environment assignment
Literate environment assignmentLiterate environment assignment
Literate environment assignment
 
Literate environment analysis
Literate environment analysisLiterate environment analysis
Literate environment analysis
 
Literate environment analysis presentation
Literate environment analysis presentationLiterate environment analysis presentation
Literate environment analysis presentation
 
Literate environment analysis
Literate environment analysisLiterate environment analysis
Literate environment analysis
 
Literate Environment Presentation
Literate Environment PresentationLiterate Environment Presentation
Literate Environment Presentation
 
Literate Environment Analysis Presentation
Literate Environment Analysis PresentationLiterate Environment Analysis Presentation
Literate Environment Analysis Presentation
 
לימוד קריאה לתלמידים לקויי למידה
לימוד קריאה לתלמידים לקויי למידהלימוד קריאה לתלמידים לקויי למידה
לימוד קריאה לתלמידים לקויי למידה
 
Literacy learners digital story
Literacy learners digital storyLiteracy learners digital story
Literacy learners digital story
 
Literate environment analysis
Literate environment analysis Literate environment analysis
Literate environment analysis
 
Designing and Researching E-books to Support Language & Literacy
Designing and Researching E-books to Support Language & LiteracyDesigning and Researching E-books to Support Language & Literacy
Designing and Researching E-books to Support Language & Literacy
 
Webb, mary students’ engagement of reading beyond the word nftej v25 n3 2015
Webb, mary students’ engagement of reading beyond the word nftej v25 n3 2015Webb, mary students’ engagement of reading beyond the word nftej v25 n3 2015
Webb, mary students’ engagement of reading beyond the word nftej v25 n3 2015
 
Literate Environment Analysis
Literate Environment AnalysisLiterate Environment Analysis
Literate Environment Analysis
 
Knifley Literate Environment Analysis
Knifley Literate Environment AnalysisKnifley Literate Environment Analysis
Knifley Literate Environment Analysis
 
Literate Environment Analysis Presentation The Beginning Reader, Pre K-3 (ED...
Literate Environment Analysis Presentation The Beginning Reader, Pre K-3  (ED...Literate Environment Analysis Presentation The Beginning Reader, Pre K-3  (ED...
Literate Environment Analysis Presentation The Beginning Reader, Pre K-3 (ED...
 
Literate Environment Analysis Presentation
Literate Environment Analysis PresentationLiterate Environment Analysis Presentation
Literate Environment Analysis Presentation
 
Literate environment analysis
Literate environment analysisLiterate environment analysis
Literate environment analysis
 
Prof Prax 2007
Prof Prax 2007Prof Prax 2007
Prof Prax 2007
 
Literate Environment Analysis Presentation
Literate Environment Analysis PresentationLiterate Environment Analysis Presentation
Literate Environment Analysis Presentation
 
Read 6706 course project overview- literate environment analysis presentation
Read 6706  course project overview- literate environment analysis presentationRead 6706  course project overview- literate environment analysis presentation
Read 6706 course project overview- literate environment analysis presentation
 
ELSA's Speech Recognition Overview
ELSA's Speech Recognition OverviewELSA's Speech Recognition Overview
ELSA's Speech Recognition Overview
 

Recently uploaded

Overcoming the PLG Trap: Lessons from Canva's Head of Sales & Head of EMEA Da...
Overcoming the PLG Trap: Lessons from Canva's Head of Sales & Head of EMEA Da...Overcoming the PLG Trap: Lessons from Canva's Head of Sales & Head of EMEA Da...
Overcoming the PLG Trap: Lessons from Canva's Head of Sales & Head of EMEA Da...
saastr
 
PRODUCT LISTING OPTIMIZATION PRESENTATION.pptx
PRODUCT LISTING OPTIMIZATION PRESENTATION.pptxPRODUCT LISTING OPTIMIZATION PRESENTATION.pptx
PRODUCT LISTING OPTIMIZATION PRESENTATION.pptx
christinelarrosa
 
zkStudyClub - LatticeFold: A Lattice-based Folding Scheme and its Application...
zkStudyClub - LatticeFold: A Lattice-based Folding Scheme and its Application...zkStudyClub - LatticeFold: A Lattice-based Folding Scheme and its Application...
zkStudyClub - LatticeFold: A Lattice-based Folding Scheme and its Application...
Alex Pruden
 
Dandelion Hashtable: beyond billion requests per second on a commodity server
Dandelion Hashtable: beyond billion requests per second on a commodity serverDandelion Hashtable: beyond billion requests per second on a commodity server
Dandelion Hashtable: beyond billion requests per second on a commodity server
Antonios Katsarakis
 
GNSS spoofing via SDR (Criptored Talks 2024)
GNSS spoofing via SDR (Criptored Talks 2024)GNSS spoofing via SDR (Criptored Talks 2024)
GNSS spoofing via SDR (Criptored Talks 2024)
Javier Junquera
 
Apps Break Data
Apps Break DataApps Break Data
Apps Break Data
Ivo Velitchkov
 
Y-Combinator seed pitch deck template PP
Y-Combinator seed pitch deck template PPY-Combinator seed pitch deck template PP
Y-Combinator seed pitch deck template PP
c5vrf27qcz
 
Your One-Stop Shop for Python Success: Top 10 US Python Development Providers
Your One-Stop Shop for Python Success: Top 10 US Python Development ProvidersYour One-Stop Shop for Python Success: Top 10 US Python Development Providers
Your One-Stop Shop for Python Success: Top 10 US Python Development Providers
akankshawande
 
Astute Business Solutions | Oracle Cloud Partner |
Astute Business Solutions | Oracle Cloud Partner |Astute Business Solutions | Oracle Cloud Partner |
Astute Business Solutions | Oracle Cloud Partner |
AstuteBusiness
 
Freshworks Rethinks NoSQL for Rapid Scaling & Cost-Efficiency
Freshworks Rethinks NoSQL for Rapid Scaling & Cost-EfficiencyFreshworks Rethinks NoSQL for Rapid Scaling & Cost-Efficiency
Freshworks Rethinks NoSQL for Rapid Scaling & Cost-Efficiency
ScyllaDB
 
Leveraging the Graph for Clinical Trials and Standards
Leveraging the Graph for Clinical Trials and StandardsLeveraging the Graph for Clinical Trials and Standards
Leveraging the Graph for Clinical Trials and Standards
Neo4j
 
“Temporal Event Neural Networks: A More Efficient Alternative to the Transfor...
“Temporal Event Neural Networks: A More Efficient Alternative to the Transfor...“Temporal Event Neural Networks: A More Efficient Alternative to the Transfor...
“Temporal Event Neural Networks: A More Efficient Alternative to the Transfor...
Edge AI and Vision Alliance
 
"Frontline Battles with DDoS: Best practices and Lessons Learned", Igor Ivaniuk
"Frontline Battles with DDoS: Best practices and Lessons Learned",  Igor Ivaniuk"Frontline Battles with DDoS: Best practices and Lessons Learned",  Igor Ivaniuk
"Frontline Battles with DDoS: Best practices and Lessons Learned", Igor Ivaniuk
Fwdays
 
Essentials of Automations: Exploring Attributes & Automation Parameters
Essentials of Automations: Exploring Attributes & Automation ParametersEssentials of Automations: Exploring Attributes & Automation Parameters
Essentials of Automations: Exploring Attributes & Automation Parameters
Safe Software
 
Biomedical Knowledge Graphs for Data Scientists and Bioinformaticians
Biomedical Knowledge Graphs for Data Scientists and BioinformaticiansBiomedical Knowledge Graphs for Data Scientists and Bioinformaticians
Biomedical Knowledge Graphs for Data Scientists and Bioinformaticians
Neo4j
 
Taking AI to the Next Level in Manufacturing.pdf
Taking AI to the Next Level in Manufacturing.pdfTaking AI to the Next Level in Manufacturing.pdf
Taking AI to the Next Level in Manufacturing.pdf
ssuserfac0301
 
Columbus Data & Analytics Wednesdays - June 2024
Columbus Data & Analytics Wednesdays - June 2024Columbus Data & Analytics Wednesdays - June 2024
Columbus Data & Analytics Wednesdays - June 2024
Jason Packer
 
Christine's Product Research Presentation.pptx
Christine's Product Research Presentation.pptxChristine's Product Research Presentation.pptx
Christine's Product Research Presentation.pptx
christinelarrosa
 
Northern Engraving | Modern Metal Trim, Nameplates and Appliance Panels
Northern Engraving | Modern Metal Trim, Nameplates and Appliance PanelsNorthern Engraving | Modern Metal Trim, Nameplates and Appliance Panels
Northern Engraving | Modern Metal Trim, Nameplates and Appliance Panels
Northern Engraving
 
"Scaling RAG Applications to serve millions of users", Kevin Goedecke
"Scaling RAG Applications to serve millions of users",  Kevin Goedecke"Scaling RAG Applications to serve millions of users",  Kevin Goedecke
"Scaling RAG Applications to serve millions of users", Kevin Goedecke
Fwdays
 

Recently uploaded (20)

Overcoming the PLG Trap: Lessons from Canva's Head of Sales & Head of EMEA Da...
Overcoming the PLG Trap: Lessons from Canva's Head of Sales & Head of EMEA Da...Overcoming the PLG Trap: Lessons from Canva's Head of Sales & Head of EMEA Da...
Overcoming the PLG Trap: Lessons from Canva's Head of Sales & Head of EMEA Da...
 
PRODUCT LISTING OPTIMIZATION PRESENTATION.pptx
PRODUCT LISTING OPTIMIZATION PRESENTATION.pptxPRODUCT LISTING OPTIMIZATION PRESENTATION.pptx
PRODUCT LISTING OPTIMIZATION PRESENTATION.pptx
 
zkStudyClub - LatticeFold: A Lattice-based Folding Scheme and its Application...
zkStudyClub - LatticeFold: A Lattice-based Folding Scheme and its Application...zkStudyClub - LatticeFold: A Lattice-based Folding Scheme and its Application...
zkStudyClub - LatticeFold: A Lattice-based Folding Scheme and its Application...
 
Dandelion Hashtable: beyond billion requests per second on a commodity server
Dandelion Hashtable: beyond billion requests per second on a commodity serverDandelion Hashtable: beyond billion requests per second on a commodity server
Dandelion Hashtable: beyond billion requests per second on a commodity server
 
GNSS spoofing via SDR (Criptored Talks 2024)
GNSS spoofing via SDR (Criptored Talks 2024)GNSS spoofing via SDR (Criptored Talks 2024)
GNSS spoofing via SDR (Criptored Talks 2024)
 
Apps Break Data
Apps Break DataApps Break Data
Apps Break Data
 
Y-Combinator seed pitch deck template PP
Y-Combinator seed pitch deck template PPY-Combinator seed pitch deck template PP
Y-Combinator seed pitch deck template PP
 
Your One-Stop Shop for Python Success: Top 10 US Python Development Providers
Your One-Stop Shop for Python Success: Top 10 US Python Development ProvidersYour One-Stop Shop for Python Success: Top 10 US Python Development Providers
Your One-Stop Shop for Python Success: Top 10 US Python Development Providers
 
Astute Business Solutions | Oracle Cloud Partner |
Astute Business Solutions | Oracle Cloud Partner |Astute Business Solutions | Oracle Cloud Partner |
Astute Business Solutions | Oracle Cloud Partner |
 
Freshworks Rethinks NoSQL for Rapid Scaling & Cost-Efficiency
Freshworks Rethinks NoSQL for Rapid Scaling & Cost-EfficiencyFreshworks Rethinks NoSQL for Rapid Scaling & Cost-Efficiency
Freshworks Rethinks NoSQL for Rapid Scaling & Cost-Efficiency
 
Leveraging the Graph for Clinical Trials and Standards
Leveraging the Graph for Clinical Trials and StandardsLeveraging the Graph for Clinical Trials and Standards
Leveraging the Graph for Clinical Trials and Standards
 
“Temporal Event Neural Networks: A More Efficient Alternative to the Transfor...
“Temporal Event Neural Networks: A More Efficient Alternative to the Transfor...“Temporal Event Neural Networks: A More Efficient Alternative to the Transfor...
“Temporal Event Neural Networks: A More Efficient Alternative to the Transfor...
 
"Frontline Battles with DDoS: Best practices and Lessons Learned", Igor Ivaniuk
"Frontline Battles with DDoS: Best practices and Lessons Learned",  Igor Ivaniuk"Frontline Battles with DDoS: Best practices and Lessons Learned",  Igor Ivaniuk
"Frontline Battles with DDoS: Best practices and Lessons Learned", Igor Ivaniuk
 
Essentials of Automations: Exploring Attributes & Automation Parameters
Essentials of Automations: Exploring Attributes & Automation ParametersEssentials of Automations: Exploring Attributes & Automation Parameters
Essentials of Automations: Exploring Attributes & Automation Parameters
 
Biomedical Knowledge Graphs for Data Scientists and Bioinformaticians
Biomedical Knowledge Graphs for Data Scientists and BioinformaticiansBiomedical Knowledge Graphs for Data Scientists and Bioinformaticians
Biomedical Knowledge Graphs for Data Scientists and Bioinformaticians
 
Taking AI to the Next Level in Manufacturing.pdf
Taking AI to the Next Level in Manufacturing.pdfTaking AI to the Next Level in Manufacturing.pdf
Taking AI to the Next Level in Manufacturing.pdf
 
Columbus Data & Analytics Wednesdays - June 2024
Columbus Data & Analytics Wednesdays - June 2024Columbus Data & Analytics Wednesdays - June 2024
Columbus Data & Analytics Wednesdays - June 2024
 
Christine's Product Research Presentation.pptx
Christine's Product Research Presentation.pptxChristine's Product Research Presentation.pptx
Christine's Product Research Presentation.pptx
 
Northern Engraving | Modern Metal Trim, Nameplates and Appliance Panels
Northern Engraving | Modern Metal Trim, Nameplates and Appliance PanelsNorthern Engraving | Modern Metal Trim, Nameplates and Appliance Panels
Northern Engraving | Modern Metal Trim, Nameplates and Appliance Panels
 
"Scaling RAG Applications to serve millions of users", Kevin Goedecke
"Scaling RAG Applications to serve millions of users",  Kevin Goedecke"Scaling RAG Applications to serve millions of users",  Kevin Goedecke
"Scaling RAG Applications to serve millions of users", Kevin Goedecke
 

Reading Kitty Paper 2018

  • 1. Reading Kitty: Automating Prediction of Children’s Oral Reading Skill Julie Medero, Alfredo Gomez, Alicia Ngo Harvey Mudd College Abstract While affective outcomes are generally positive for the use of eBooks and computer-based reading tutors in teaching chil- dren to read, learning outcomes are often poorer (Korat and Shamir 2004). We describe the first iteration of Reading Kitty, an iOS application that uses NLP and speech processing to fo- cus children’s time on close reading and prosody in oral read- ing, while maintaining an emphasis on creativity and artifact creation. We also share preliminary results demonstrating that semitone range can be used to automatically predict readers’ skill level. Introduction Early elementary-aged children’s reading achievement is aided by reading practice in general, especially when that practice focuses on reading with appropriate prosody (Kim and Petscher 2016). Because individualized instruction is time-intensive for teachers, many eBook readers and com- puterized reading programs are available for emerging read- ers but, while those tools are popular, they result in poorer reading interactions than paper books, which negatively im- pacts learning outcomes. By focusing students’ time on practicing reading with ap- propriate prosody, with an end goal of an artifact creation, the Reading with KIneTic TYpography (Reading Kitty) app aims to to keep the positive affective elements of eBooks without sacrificing quality of interaction with texts. In the re- maining sections, we outline first steps toward development of Reading Kitty. We also share preliminary analysis demon- strating that the prosodic analysis used in Reading Kitty can provide accurate and useful feedback to children. The Reading Kitty app Kinetic typography is a category of animations that uses moving text and images to emphasize the meaning and struc- ture of text. Figure 1 shows a frame from a Kinetic Typog- raphy animation of a scene from the film Toy Story. In the animation, words are emphasized through text size, weight, and motion to reflect the way that they’re emphasized by the character that delivers the line. Copyright c 2019, Association for the Advancement of Artificial Intelligence (www.aaai.org). All rights reserved. Figure 1: Kinetic typography frame (RebelX-Productions 2016). By interacting with Reading Kitty, a child reader is able to create their own kinetic typography animations that reflect their interactions with a story. First, Reading Kitty guides a child reader through a close reading by presenting a series of questions customized to the story a child is reading. Those questions correspond to specific oral reading fluency bench- marks in the Common Core Standards Initiative (National Governors Association Center for Best Practices, Council of Chief State School Officers 2010). The child uses the answer to each close reading question as the basis for adjusting the appearance of words in the story. Next, Reading Kitty has the child read aloud. The prosody of the child’s reading is used with the text charac- teristics selected by the child to generate a kinetic typogra- phy artifact, with better prosody resulting in more dynamic videos. Finally, feedback about the child’s oral reading is shared with the child’s teacher. Pitch Analysis Predicting high pitch We want Reading Kitty to reward children who read with appropriate prosody with more interesting, dynamic anima- tions of their readings. With this goal in mind, we first exam- ine the feasibility of predicting which words a skilled reader would read with a high pitch. Classification models have been used previously to predict the locations of prosodic boundaries (Medero and Ostendorf 2013). We adopt this idea by building a model to predict which words should be read with a high pitch. We train the model on the Boston Directions Corpus because this corpus is hand-labeled with ToBI pitch tones (Silverman et al. 1992). We define a word to be high-pitched when the word’s labeled tone has an H*
  • 2. Figure 2: Average semitone range per utterance for skilled and struggling readers. in it. We use features that have proven successful for predict- ing ToBI tones in Greek: part-of-speech (POS) tags, POS bi- grams, syntactic chunk tags, the number of syllables in each word, and each word’s depth in an automatically-generated syntactic constituency tree (Zervas, Fakotakis, and Kokki- nakis 2004). For our data set, though, resulting error rate is too high to give reliable feedback to children. We look in- stead at pitch range across an utterance. Semitone range We analyze the CMU Kids Corpus (Consortium. and Uni- versity. 1997), which contains 5,180 utterances (spoken sen- tences) from 76 children. The children range in age from 6 to 11 and are in first through third grade. These 76 chil- dren are split into 2 groups, skilled readers (n = 44) and struggling readers (n = 32) Each utterance in the corpus is accompanied by a transcript, along with word- and phone- level forced alignments. For each utterance, we extract the F0 values in Hz every .01s using PRAAT’s pitch tracker (Boersma 2001). We cal- culate the F0 maximum and F0 minimum for each utterance in the corpus, and then calculate the semitone range as de- scribed by Aoyama et al. (Aoyama, Akbari, and Flege 2016). We average the semitone ranges for each utterance across all readers in the skilled group and all readers in the struggling group. Figure 2 compares the semitone range for the two groups. Each point corresponds to an utterance. Points below the line correspond to utterances that were read by struggling read- ers with a smaller semitone range than by skilled readers. Likewise, points above the line correspond to utterances that were read by struggling readers with a larger semitone range than by skilled readers. Skilled readers show larger average semitone range than struggling readers across utterances (Fig. 2), which is promising for our goal of using semitone range as a metric for feedback to young readers. Because the two datasets are not matched in terms of which utterances are read by which reader, however, we Paired Comparison Finally, we perform a paired t-test. We pair a skilled reader with a struggling reader who read the same utterances. Since the participants in the two groups do not align exactly, we use the Hungarian method to find the pairings that maximize the number of shared utterances (Kuhn 1955), resulting in 13 pairs with 41-91 shared utter- ances. We ignore utterances that are not shared by each pair. The skilled readers have an average semitone range of 17.03 (σ = 1.85) and the struggling readers have an average semi- tone range of 14.83 (σ = 2.15), which is statistically signif- icant with p < .05. Conclusion Reading Kitty represents a new way to bring evidence-based reading instruction in an electronic form. Our initial anal- ysis provides promising evidence that semitone-based fea- tures can be used to give feedback on prosodic quality of oral readings. References Aoyama, K.; Akbari, C.; and Flege, J. 2016. Prosodic char- acteristics of American English in school-age children. In Proc. Speech Prosody, 572–576. Boersma, P. 2001. Praat, a system for doing phonetics by computer. Glot International. Consortium., L. D., and University., C.-M. 1997. The CMU kids speech corpus TT -. Kim, Y. S. G., and Petscher, Y. 2016. Prosodic Sensitiv- ity and Reading: An Investigation of Pathways of Relations Using a Latent Variable Approach. Journal of Educational Psychology 108(5):630–645. Korat, O., and Shamir, A. 2004. Do Hebrew electronic books differ from Dutch electronic books? A replication of a Dutch content analysis. Journal of Computer Assisted Learning 20(4):257–268. Kuhn, H. W. 1955. The Hungarian method for the assign- ment problem. Naval Research Logistics Quarterly 83–97. Medero, J., and Ostendorf, M. 2013. Atypical Prosodic Structure as an Indicator of Reading Level and Text Diffi- culty. In Proc. NAACL-HLT, 715–720. National Governors Association Center for Best Practices, Council of Chief State School Officers. 2010. Common Core State Standards for English Language Arts. Techni- cal report, National Governors Association Center for Best Practices, Council of Chief State School Officers, Washing- ton, D.C. 2016. Toy Story - ”You Are A Toy” Kinetic Typography. https://youtu.be/_PU49-_qQoI. Silverman, K.; Beckman, M.; Pitrelli, J.; Ostendorf, M.; Wightman, C. C.; Price, P.; Pierrehumbert, J.; and Hirschberg, J. 1992. TOBI: A Standard for Labeling En- glish Prosody. In Second International Conference on Spo- ken Language Processing. Zervas, P.; Fakotakis, N.; and Kokkinakis, G. 2004. Pitch accent prediction from ToBI annotated corpora based on
  • 3. Bayesian learning. Lecture Notes in Artificial Intelli- gence (Subseries of Lecture Notes in Computer Science) 3206:545–552.