SlideShare a Scribd company logo
1 of 3
Download to read offline
COMMENTARY
Artificial intelligence to support human instruction
Michael C. Mozera,b,c,1
, Melody Wiseheartd
, and Timothy P. Novikoffc
The popular media’s recent interest in artificial intelli-
gence (AI) has focused on autonomous systems that
might ultimately replace people in fields as diverse as
medicine, customer service, and transportation and
logistics. Often neglected is a subfield of AI that fo-
cuses on empowering people by improving how we
learn, remember, perceive, and make decisions. This
human-centered focus relies on interdisciplinary re-
search from cognitive neuroscience, psychology, and
theoretical computer science.
The synergy among these fields promises to
improve the microorganization of human instruction:
picking the next exercise for a student to attempt,
choosing what sort of hints and feedback to provide,
determining when material should be reviewed, and
selecting among teaching activities. These are deci-
sions which occur on a granularity that human instruc-
tors are typically unable to monitor and individualize,
and for which students have poor metacognitive
strategies (1). Such microinstruction complements the
strengths of human teachers and can yield significant
benefits. For example, in a semester-long experiment
integrated into a middle-school foreign language
course, setting aside roughly 30 min per week for AI-
guided personalized review of previously introduced
material led to a 16.5% improvement in overall course
retention on a cumulative examination administered a
month after the end of the semester, relative to a time-
matched control condition that reflects current educa-
tional practice (2).
In PNAS, Tabibian et al. (3) address the learning
and retention of factual material such as foreign lan-
guage vocabulary. They present an adaptive, data-
driven method with theoretical guarantees for sched-
uling retrieval practice. Their work contributes to a
growing body of results in algorithmic-education the-
ory that proves properties of idealized mathematical
models of educational scenarios (4–6).
Like every proposal for AI-based instruction, the
approach of Tabibian et al. (3) is formalized by two
models: student and teacher. The student model
quantifies an individual’s current knowledge state
and how it evolves based on instructional actions
and the passage of time. The teacher model spec-
ifies a policy—in the control-theoretic sense—for
1 7 14 21 35 70 105
Inter-Study Interval (Days)
0
20
40
60
80
100
%Recall
7 days
Retention
Interval
35 days
Retention
Interval
70 days
Retention
Interval
350 days
Retention
Interval
A
Time
0
20
40
60
80
100
%Recall
t
0
best inter-study interval
B
Fig. 1. (A) Recall accuracy at test for a given RI (7, 35, 70, or 350 d), after the
learner has studied material over two sessions separated by a given ISI (< to
105 d). Data points are from a human behavioral experiment (18). Solid lines are
predictions of a cognitive model fit with one free parameter to the data (10). (B)
Hypothetical memory strength curves consistent with experimental results in A.
Each curve shows a sequence of two study sessions, aligned such that the second
session of each sequence occurs at time t0 (dashed line). For various t > t0 (i.e., the
RI t − t0) the bars at the top of the graph indicate which ISI achieves the highest
recall accuracy. The optimal spacing depends on the RI.
a
Department of Computer Science, University of Colorado Boulder, Boulder, CO 80309; b
Institute of Cognitive Science, University of Colorado
Boulder, Boulder, CO 80309; c
Research and Machine Intelligence, Google, Inc., Mountain View, CA 94043; and d
Department of Psychology, York
University, Toronto, ON, Canada M3J 1P3
Author contributions: M.C.M., M.W., and T.P.N. wrote the paper.
The authors declare no conflict of interest.
Published under the PNAS license.
See companion article on page 3988.
1
To whom correspondence should be addressed. Email: mozer@colorado.edu.
Published online February 19, 2019.
www.pnas.org/cgi/doi/10.1073/pnas.1900370116 PNAS | March 5, 2019 | vol. 116 | no. 10 | 3953–3955
COMMENTARY
administering instructional actions conditioned on the student
model. Student models range from those motivated heavily by
psychological theory (7–10) to those motivated primarily by mathe-
matical elegance and tractability (4–6). Somewhere in the middle of
this continuum are methods that mine corpora of educational data
(11) to predict the influence of various teaching actions on the student’s
subsequent performance (2, 12). Teacher models also lie along a con-
tinuum, from hand-constructed heuristic or expert-based approaches
(2, 9, 13, 14) to those that provably optimize according to some
criterion, such as the student model’s score on a postinstruction
quiz (4, 15).
The ultimate aim in AI is to develop automated instruction
methods based on the combination of a psychologically valid
student model and a theoretically grounded teacher model.
However, this goal has yet to be achieved. Research concerned
with solving a tractable optimization problem has assumed a
simplified student model; nonetheless, even with a naive or
unrealistic student model, the formal nature of the work can
provide intuitions and a deeper understanding of the computational
issues. Conversely, the research most concerned with psychological
fidelity of the student model has relied on heuristic teacher models;
nonetheless, heuristic strategies often appear to be effective [e.g.,
Bjork’s (16) notion of desirable difficulty].
Tabibian et al. (3) identify a class of student models that admit
a formal analysis yet are informed by human memory data. They
characterize the student’s memory state in terms of stochastic
differential equations with jumps: When the student reviews a
particular item (e.g., a vocabulary word), the memory strength of
the item is bumped up and then decays over time. Reviewing
times for an item are specified via a temporal point process.
The stochastic optimal control problem is to determine the
point-process reviewing intensity, that is, the rate of retrieval prac-
tice, to maintain memory without excessive practice. Formally, the
penalty function they optimize is quadratic in both recall failure
rate and reviewing intensity. Their key result is a proof that the
optimal reviewing intensity is proportional to the student model’s
predicted probability of recall failure. Predicted recall probability
has long been used as a heuristic measure for prioritizing material
in flashcard-type schedulers, but Tabibian et al. (3) provide a com-
pelling theoretical justification for the use of this heuristic.
Tabibian et al.’s (3) claim that the benefit of review increases as
memory decays is consonant with the spacing effect (17), the be-
havioral finding that temporally distributed practice leads to more
robust and durable learning than massed practice. The spacing
effect is often cast as “don’t cram for an examination.” Cramming
can help students remember information on the examination;
however, cramming also leads to rapid forgetting. The rate of
forgetting is slowed by spaced practice. Fig. 1A presents results
from an empirical study in which participants learned trivia facts in
two sessions and were subsequently quizzed on the facts (18). In
the first session, each item is practiced to criterion; in the second
session, each item is practiced a fixed number of times. The lag
between the first and second study sessions (the interstudy inter-
val, ISI) is varied from minutes to 105 d, and the lag between the
second study session and the quiz (the retention interval, RI) is
varied from 7 to 350 d. The figure reveals an optimal ISI, and this
optimum increases with the RI. This interaction between optimal
ISI and RI is also found in big data corpora (19). The smooth curves
in the figure are predictions from a psychological model (10). Fig.
1B shows hypothetical forgetting curves for various ISIs, consistent
with the empirical results in Fig. 1A and psychological theories of
the spacing effect (e.g., refs. 10 and 17).
Whereas psychologists have focused on optimizing time of
study given a fixed number of practice opportunities (Fig. 1A), or
on prioritizing items for study given finite study time (2, 13, 14),
Tabibian et al. (3) frame the challenge as determining the optimal
(time-varying) reviewing rate per item. It may be unrealistic in an
educational context to forego hard constraints on practice trials or
time, but the cost trade-off formulation that Tabibian et al. (3)
propose—standard in machine learning—is a fresh perspective
for psychology. Another novelty of Tabibian et al. (3) concerns
the objective of spaced practice. Experimentalists typically con-
sider optimizing spacing for a particular RI, that is, the duration
following final study over which the information must be main-
tained (Fig. 1A). In contrast, Tabibian et al. (3) suggest a perhaps
more practical situation in which study and test phases are not
distinguished and knowledge must be periodically refreshed to
ensure its continuous availability. This practical, lifelong learning
perspective is relatively understudied in psychology (20).
Tabibian et al. identify a class of student models
that admit a formal analysis yet are informed
by human memory data.
Just as cognitive psychology may find new avenues of
investigation from algorithmic-instruction theory, algorithmic-
instruction theory may have more practical impact as it takes
into consideration the complexity of human behavioral data.
For example, consider the inverted-U shape of the curves in
Fig. 1A. Various psychological accounts have been proposed
to explain these curves (17), most positing dual opposing
mechanisms at play, one favoring short ISIs and one favoring
long ISIs, and the trade-off between these opposing mecha-
nisms gives rise to the inverted U.
Formal student models in Tabibian et al. (3) and in other the-
oretical work (15) presently do not capture this key dynamic of
spaced practice. As Tabibian et al. (3) acknowledge, under their
student model the forgetting rate is reduced at least as much by
massed practice as spaced practice, inconsistent with the behav-
ioral data. They suggest possible extensions but the theory does
not at present incorporate these extensions. One might therefore
expect an actual student following the schedule prescribed by
their algorithm to fail to achieve optimal performance. Remark-
ably, Tabibian et al. (3) present evidence ameliorating this con-
cern. They report on a natural experiment involving the reanalysis
of a big-data corpus from an online-learning platform. They find
that forgetting is relatively slowed following study sequences that
more closely adhere to their theory’s optimum spacing. Their
natural experiment is as well-conducted as one could hope for:
In comparing alternative schedules, they control for the lag be-
tween first and final study, the total number of study trials, and
the memory strength after the first study trial. Our explanation
for the theory’s success in practice is their choice of penalty
function, which imposes a cost for high review intensities (i.e.,
massed practice). This cost encourages long ISIs, and the inevi-
tability of forgetting encourages short ISIs, resulting in dual op-
posing factors that yield an intermediate spacing. Whether the
Tabibian et al. (3) theory can predict the precise peak of the
spacing function (Fig. 1A) may not be terribly critical, given that
for long retention periods the peak appears to be broad and flat.
This fact gives us some reason to be optimistic about the ease
with which the spacing effect can be leveraged to mitigate forget-
ting. Indeed, online platforms that perform adaptive scheduling
3954 | www.pnas.org/cgi/doi/10.1073/pnas.1900370116 Mozer et al.
using variants of the Leitner method (21) and SuperMemo (22)
have thrived despite their heuristic nature.
Although “rote memorization” is not considered a particularly
glamorous area of research in the education community, facts
underpin most types of learning, and one cannot engage in crit-
ical thinking without sufficient field-specific knowledge. The no-
tion of an automated instructor which refreshes our knowledge
base at appropriate times is quite appealing. Such an instructor
even has a role to play in the classroom that is complementary to
that of the teacher: In the study mentioned earlier using person-
alized review software (2), the teacher is freed from mechanical
duties to interact with smaller groups of students.
AI-guided microinstruction has other potential applications,
such as selecting and ordering examples to learn complex visual
concepts (e.g., bird species or skin-lesion diagnosis), determining
the hints a learner requires to solve a problem and when to
provide them, and recognizing when a student’s attention has
wandered while reading. In each of these cases, principled ap-
proaches require a psychologically plausible student model, a
quantitative measure of performance to be optimized, and a
teacher model with theoretical guarantees of achieving optimal
performance under the student model. The field of AI has made
advances through development of its theoretical foundations and
explicit formulations of system objectives. We expect this lesson to
apply to human-centered AI as well, but it will occur only through
collaborations between cognitive scientists who understand
the intricacy of human behavior and AI researchers who can bring
a new degree of clarity and elegance to cognitive models.
1 Cohen MS, Yan VX, Halamish V, Bjork RA (2013) Do students think that difficult or valuable materials should be restudied sooner rather than later? J Exp Psychol
Learn Mem Cogn 39:1682–1696.
2 Lindsey RV, Shroyer JD, Pashler H, Mozer MC (2014) Improving students’ long-term knowledge retention through personalized review. Psychol Sci 25:639–647.
3 Tabibian B, et al. (2019) Enhancing human learning via spaced repetition optimization. Proc Natl Acad Sci USA 116:3988–3993.
4 Novikoff TP, Kleinberg JM, Strogatz SH (2012) Education of a model student. Proc Natl Acad Sci USA 109:1868–1873.
5 Lewis JB, Li N (2014) Combinatorial aspects of flashcard games. Ann Combinatorics 18:459–472.
6 Nishimura J (2018) Critically slow learning in flashcard learning models. Chaos Interdiscip J Nonlinear Sci 28:083115.
7 Pavlik PI, Anderson JR (2005) Practice and forgetting effects on vocabulary memory: An activation-based model of the spacing effect. Cogn Sci 29:559–586.
8 Atkinson RC (1972) Optimizing the learning of a second-language vocabulary. J Exp Psychol 96:124–129.
9 Khajah MM, Lindsey RV, Mozer MC (2014) Maximizing students’ retention via spaced review: Practical guidance from computational models of memory. Top Cogn
Sci 6:157–169.
10 Mozer MC, Pashler H, Cepeda N, Lindsey RV, Vul E (2009) Predicting the optimal spacing of study: A multiscale context model of memory. Advances in Neural
Information Processing Systems, eds Bengio Y, Schuurmans D, Lafferty J, Williams C, Culotta A (Curran Associates, Red Hook, NY), Vol 22, pp 1321–1329.
11 Koedinger KR, et al. (2010) A data repository for the EDM community: The PSLC datashop. Handbook of Educational Data Mining, eds Romero C, Ventura S,
Pechenizkiy M, Baker RSJd (CRC, Boca Raton, FL).
12 Settles B, Meeder B (2016) A trainable spaced repetition model for language learning. Proceedings of the 54th Annual Meeting of the Association for
Computational Linguistics (Association for Computational Linguistics, Stroundsburg, PA), Vol. 1, pp 1848–1858.
13 Mettler E, Massey C, Kellman PJ (2011) Improving adaptive learning technology through the use of response times in Proceedings of the 33rd Annual Conference
of the Cognitive Science Society, eds. CarlsonL, Holscher C, Shipley T. (Cognitive Science Society, Austin, TX), pp. 2532–2537.
14 Pavlik PI, Anderson JR (2008) Using a model to compute the optimal schedule of practice. J Exp Psychol Appl 14:101–117.
15 Reddy S, Labutov I, Banerjee S, Joachims T (2016) Unbounded human learning: Optimal scheduling for spaced repetition. Proceedings of the 22nd ACM SIGKDD
International Conference on Knowledge Discovery and Data Mining (ACM, New York), pp 1815–1824.
16 Bjork R (1994) Memory and metamemory considerations in the training of human beings. Metacognition: Knowing About Knowing, eds Metcalfe J, Shimamura A
(MIT Press, Cambridge, MA), pp 185–205.
17 Wiseheart M, et al. (2019) Enhancing the quality of student learning using distributed practice. Cambridge Handbook of Cognition and Education, eds
Dunlosky J, Rawson K (Cambridge Univ Press, New York), pp 550–584.
18 Cepeda NJ, Vul E, Rohrer D, Wixted JT, Pashler H (2008) Spacing effects in learning: A temporal ridgeline of optimal retention. Psychol Sci 19:1095–1102.
19 Kim A, Wong-Kee-You A, Wiseheart M, Rosenbaum RS (2019) The spacing effect stands up to big data. Behav Res Methods, 10.3758/s13428-018-1184-7.
20 Kang SHK, Lindsey RV, Mozer MC, Pashler H (2014) Retrieval practice over the long term: Expanding or equal-interval spacing? Psychol Bull Rev 21:1544–1550.
21 Leitner S (1972) So Lernt Man Lernen: Angewandte Lernpsychologie – ein Weg zum Erfolg (Weltbild, Augsburg, Germany).
22 Wozniak P, Gorzelanczyk E (1994) Optimization of repetition spacing in the practice of learning. Acta Neurobiologiae Experimentalis 54:59–62.
Mozer et al. PNAS | March 5, 2019 | vol. 116 | no. 10 | 3955

More Related Content

What's hot

ContentUsability_TechnologySupportedLearning
ContentUsability_TechnologySupportedLearningContentUsability_TechnologySupportedLearning
ContentUsability_TechnologySupportedLearning
Nivedita Singh
 
CLASSIFICATION OF QUESTIONS AND LEARNING OUTCOME STATEMENTS (LOS) INTO BLOOM’...
CLASSIFICATION OF QUESTIONS AND LEARNING OUTCOME STATEMENTS (LOS) INTO BLOOM’...CLASSIFICATION OF QUESTIONS AND LEARNING OUTCOME STATEMENTS (LOS) INTO BLOOM’...
CLASSIFICATION OF QUESTIONS AND LEARNING OUTCOME STATEMENTS (LOS) INTO BLOOM’...
IJMIT JOURNAL
 
CLASSIFICATION OF QUESTIONS AND LEARNING OUTCOME STATEMENTS (LOS) INTO BLOOM’...
CLASSIFICATION OF QUESTIONS AND LEARNING OUTCOME STATEMENTS (LOS) INTO BLOOM’...CLASSIFICATION OF QUESTIONS AND LEARNING OUTCOME STATEMENTS (LOS) INTO BLOOM’...
CLASSIFICATION OF QUESTIONS AND LEARNING OUTCOME STATEMENTS (LOS) INTO BLOOM’...
IJMIT JOURNAL
 
Cw3210561060
Cw3210561060Cw3210561060
Cw3210561060
IJMER
 
1 s2.0-s1877042812000092-main
1 s2.0-s1877042812000092-main1 s2.0-s1877042812000092-main
1 s2.0-s1877042812000092-main
Anne de Hulster
 
KatKennedy REU D.C. Poster
KatKennedy REU D.C. PosterKatKennedy REU D.C. Poster
KatKennedy REU D.C. Poster
Katlynn Kennedy
 
Evaluating instructional designs with mental workload assessments in universi...
Evaluating instructional designs with mental workload assessments in universi...Evaluating instructional designs with mental workload assessments in universi...
Evaluating instructional designs with mental workload assessments in universi...
Peter Manifold
 

What's hot (14)

ContentUsability_TechnologySupportedLearning
ContentUsability_TechnologySupportedLearningContentUsability_TechnologySupportedLearning
ContentUsability_TechnologySupportedLearning
 
Constructivist Approach Vs Expository Teaching: Exponential Functions
Constructivist Approach Vs Expository Teaching: Exponential FunctionsConstructivist Approach Vs Expository Teaching: Exponential Functions
Constructivist Approach Vs Expository Teaching: Exponential Functions
 
CLASSIFICATION OF QUESTIONS AND LEARNING OUTCOME STATEMENTS (LOS) INTO BLOOM’...
CLASSIFICATION OF QUESTIONS AND LEARNING OUTCOME STATEMENTS (LOS) INTO BLOOM’...CLASSIFICATION OF QUESTIONS AND LEARNING OUTCOME STATEMENTS (LOS) INTO BLOOM’...
CLASSIFICATION OF QUESTIONS AND LEARNING OUTCOME STATEMENTS (LOS) INTO BLOOM’...
 
CLASSIFICATION OF QUESTIONS AND LEARNING OUTCOME STATEMENTS (LOS) INTO BLOOM’...
CLASSIFICATION OF QUESTIONS AND LEARNING OUTCOME STATEMENTS (LOS) INTO BLOOM’...CLASSIFICATION OF QUESTIONS AND LEARNING OUTCOME STATEMENTS (LOS) INTO BLOOM’...
CLASSIFICATION OF QUESTIONS AND LEARNING OUTCOME STATEMENTS (LOS) INTO BLOOM’...
 
Ujer.2013.010304
Ujer.2013.010304Ujer.2013.010304
Ujer.2013.010304
 
Cw3210561060
Cw3210561060Cw3210561060
Cw3210561060
 
A Study on Learning Factor Analysis – An Educational Data Mining Technique fo...
A Study on Learning Factor Analysis – An Educational Data Mining Technique fo...A Study on Learning Factor Analysis – An Educational Data Mining Technique fo...
A Study on Learning Factor Analysis – An Educational Data Mining Technique fo...
 
1 s2.0-s1877042812000092-main
1 s2.0-s1877042812000092-main1 s2.0-s1877042812000092-main
1 s2.0-s1877042812000092-main
 
Whether simulation models that fall under the information systems category ad...
Whether simulation models that fall under the information systems category ad...Whether simulation models that fall under the information systems category ad...
Whether simulation models that fall under the information systems category ad...
 
KatKennedy REU D.C. Poster
KatKennedy REU D.C. PosterKatKennedy REU D.C. Poster
KatKennedy REU D.C. Poster
 
Data Analysis
Data Analysis Data Analysis
Data Analysis
 
583 h139-rufi'i
583 h139-rufi'i583 h139-rufi'i
583 h139-rufi'i
 
A Term Paper for the Course of Theories and Approaches in Language Teaching(...
A Term Paper for  the Course of Theories and Approaches in Language Teaching(...A Term Paper for  the Course of Theories and Approaches in Language Teaching(...
A Term Paper for the Course of Theories and Approaches in Language Teaching(...
 
Evaluating instructional designs with mental workload assessments in universi...
Evaluating instructional designs with mental workload assessments in universi...Evaluating instructional designs with mental workload assessments in universi...
Evaluating instructional designs with mental workload assessments in universi...
 

Similar to Artificial intelligence to support human instruction Michael C. Mozera,b,c,, Melody Wiseheartd and Timothy P. Novikoff.

kellman_et_al_2008
kellman_et_al_2008kellman_et_al_2008
kellman_et_al_2008
Joel Zucker
 
Jurnal ijssh rufi'i
Jurnal ijssh rufi'iJurnal ijssh rufi'i
Jurnal ijssh rufi'i
Rufi'i Rufii
 
5) You are performing an audit of purchases of desktop compute.docx
5) You are performing an audit of purchases of desktop compute.docx5) You are performing an audit of purchases of desktop compute.docx
5) You are performing an audit of purchases of desktop compute.docx
alinainglis
 
The World Testifies Of Data And Our Understanding Of It Essay
The World Testifies Of Data And Our Understanding Of It EssayThe World Testifies Of Data And Our Understanding Of It Essay
The World Testifies Of Data And Our Understanding Of It Essay
Sandy Harwell
 
Assessing Knowledge, Process, Understanding, Performance/Product
Assessing Knowledge, Process, Understanding, Performance/ProductAssessing Knowledge, Process, Understanding, Performance/Product
Assessing Knowledge, Process, Understanding, Performance/Product
Carlo Magno
 
IntroductionIt is the purpose of this manual to teach instruction.docx
IntroductionIt is the purpose of this manual to teach instruction.docxIntroductionIt is the purpose of this manual to teach instruction.docx
IntroductionIt is the purpose of this manual to teach instruction.docx
mariuse18nolet
 
Task Assessment of Fourth and Fifth Grade Teachers
Task Assessment of Fourth and Fifth Grade TeachersTask Assessment of Fourth and Fifth Grade Teachers
Task Assessment of Fourth and Fifth Grade Teachers
Christopher Peter Makris
 

Similar to Artificial intelligence to support human instruction Michael C. Mozera,b,c,, Melody Wiseheartd and Timothy P. Novikoff. (20)

kellman_et_al_2008
kellman_et_al_2008kellman_et_al_2008
kellman_et_al_2008
 
Improving Communication about Limit Concept in Mathematics through Inquisitio...
Improving Communication about Limit Concept in Mathematics through Inquisitio...Improving Communication about Limit Concept in Mathematics through Inquisitio...
Improving Communication about Limit Concept in Mathematics through Inquisitio...
 
B0610506
B0610506B0610506
B0610506
 
Online Assignment
Online AssignmentOnline Assignment
Online Assignment
 
Assessment Of The Effectiveness Of An Online Learning System In Improving Stu...
Assessment Of The Effectiveness Of An Online Learning System In Improving Stu...Assessment Of The Effectiveness Of An Online Learning System In Improving Stu...
Assessment Of The Effectiveness Of An Online Learning System In Improving Stu...
 
Intelligent tutoring systems
Intelligent  tutoring  systemsIntelligent  tutoring  systems
Intelligent tutoring systems
 
Cognitive process dimension in rbt explanatory notepages
Cognitive process dimension in rbt  explanatory notepagesCognitive process dimension in rbt  explanatory notepages
Cognitive process dimension in rbt explanatory notepages
 
K0176495101
K0176495101K0176495101
K0176495101
 
Jurnal ijssh rufi'i
Jurnal ijssh rufi'iJurnal ijssh rufi'i
Jurnal ijssh rufi'i
 
5) You are performing an audit of purchases of desktop compute.docx
5) You are performing an audit of purchases of desktop compute.docx5) You are performing an audit of purchases of desktop compute.docx
5) You are performing an audit of purchases of desktop compute.docx
 
The World Testifies Of Data And Our Understanding Of It Essay
The World Testifies Of Data And Our Understanding Of It EssayThe World Testifies Of Data And Our Understanding Of It Essay
The World Testifies Of Data And Our Understanding Of It Essay
 
Data Mining for Education. Ryan S.J.d. Baker, Carnegie Mellon University
Data Mining for Education.  Ryan S.J.d. Baker, Carnegie Mellon UniversityData Mining for Education.  Ryan S.J.d. Baker, Carnegie Mellon University
Data Mining for Education. Ryan S.J.d. Baker, Carnegie Mellon University
 
Assessing Knowledge, Process, Understanding, Performance/Product
Assessing Knowledge, Process, Understanding, Performance/ProductAssessing Knowledge, Process, Understanding, Performance/Product
Assessing Knowledge, Process, Understanding, Performance/Product
 
IntroductionIt is the purpose of this manual to teach instruction.docx
IntroductionIt is the purpose of this manual to teach instruction.docxIntroductionIt is the purpose of this manual to teach instruction.docx
IntroductionIt is the purpose of this manual to teach instruction.docx
 
Advice For Action With Automatic Feedback Systems
Advice For Action With Automatic Feedback SystemsAdvice For Action With Automatic Feedback Systems
Advice For Action With Automatic Feedback Systems
 
A Meta-Analysis Of The Effectiveness Of Intelligent Tutoring Systems On K 12 ...
A Meta-Analysis Of The Effectiveness Of Intelligent Tutoring Systems On K 12 ...A Meta-Analysis Of The Effectiveness Of Intelligent Tutoring Systems On K 12 ...
A Meta-Analysis Of The Effectiveness Of Intelligent Tutoring Systems On K 12 ...
 
1_Q2-PRACTICAL-RESEARCH.pptx
1_Q2-PRACTICAL-RESEARCH.pptx1_Q2-PRACTICAL-RESEARCH.pptx
1_Q2-PRACTICAL-RESEARCH.pptx
 
Assessment Of Teachers Beliefs About Classroom Use Of Critical-Thinking Acti...
Assessment Of Teachers  Beliefs About Classroom Use Of Critical-Thinking Acti...Assessment Of Teachers  Beliefs About Classroom Use Of Critical-Thinking Acti...
Assessment Of Teachers Beliefs About Classroom Use Of Critical-Thinking Acti...
 
Task Assessment of Fourth and Fifth Grade Teachers
Task Assessment of Fourth and Fifth Grade TeachersTask Assessment of Fourth and Fifth Grade Teachers
Task Assessment of Fourth and Fifth Grade Teachers
 
Sample Methodology Essay
Sample Methodology EssaySample Methodology Essay
Sample Methodology Essay
 

More from eraser Juan José Calderón

Ética y Revolución Digital . revista Diecisiete nº 4. 2021
Ética y Revolución Digital . revista Diecisiete nº 4. 2021Ética y Revolución Digital . revista Diecisiete nº 4. 2021
Ética y Revolución Digital . revista Diecisiete nº 4. 2021
eraser Juan José Calderón
 
Desigualdades educativas derivadas del COVID-19 desde una perspectiva feminis...
Desigualdades educativas derivadas del COVID-19 desde una perspectiva feminis...Desigualdades educativas derivadas del COVID-19 desde una perspectiva feminis...
Desigualdades educativas derivadas del COVID-19 desde una perspectiva feminis...
eraser Juan José Calderón
 

More from eraser Juan José Calderón (20)

Evaluación de t-MOOC universitario sobre competencias digitales docentes medi...
Evaluación de t-MOOC universitario sobre competencias digitales docentes medi...Evaluación de t-MOOC universitario sobre competencias digitales docentes medi...
Evaluación de t-MOOC universitario sobre competencias digitales docentes medi...
 
Call for paper 71. Revista Comunicar
Call for paper 71. Revista ComunicarCall for paper 71. Revista Comunicar
Call for paper 71. Revista Comunicar
 
Editorial of the JBBA Vol 4, Issue 1, May 2021. Naseem Naqvi,
Editorial of the JBBA Vol 4, Issue 1, May 2021. Naseem Naqvi, Editorial of the JBBA Vol 4, Issue 1, May 2021. Naseem Naqvi,
Editorial of the JBBA Vol 4, Issue 1, May 2021. Naseem Naqvi,
 
REGULATION OF THE EUROPEAN PARLIAMENT AND OF THE COUNCIL LAYING DOWN HARMONIS...
REGULATION OF THE EUROPEAN PARLIAMENT AND OF THE COUNCIL LAYING DOWN HARMONIS...REGULATION OF THE EUROPEAN PARLIAMENT AND OF THE COUNCIL LAYING DOWN HARMONIS...
REGULATION OF THE EUROPEAN PARLIAMENT AND OF THE COUNCIL LAYING DOWN HARMONIS...
 
Predicting Big Data Adoption in Companies With an Explanatory and Predictive ...
Predicting Big Data Adoption in Companies With an Explanatory and Predictive ...Predicting Big Data Adoption in Companies With an Explanatory and Predictive ...
Predicting Big Data Adoption in Companies With an Explanatory and Predictive ...
 
Innovar con blockchain en las ciudades: Ideas para lograrlo, casos de uso y a...
Innovar con blockchain en las ciudades: Ideas para lograrlo, casos de uso y a...Innovar con blockchain en las ciudades: Ideas para lograrlo, casos de uso y a...
Innovar con blockchain en las ciudades: Ideas para lograrlo, casos de uso y a...
 
Innovar con blockchain en las ciudades: Ideas para lograrlo, casos de uso y a...
Innovar con blockchain en las ciudades: Ideas para lograrlo, casos de uso y a...Innovar con blockchain en las ciudades: Ideas para lograrlo, casos de uso y a...
Innovar con blockchain en las ciudades: Ideas para lograrlo, casos de uso y a...
 
Ética y Revolución Digital . revista Diecisiete nº 4. 2021
Ética y Revolución Digital . revista Diecisiete nº 4. 2021Ética y Revolución Digital . revista Diecisiete nº 4. 2021
Ética y Revolución Digital . revista Diecisiete nº 4. 2021
 
#StopBigTechGoverningBigTech . More than 170 Civil Society Groups Worldwide O...
#StopBigTechGoverningBigTech . More than 170 Civil Society Groups Worldwide O...#StopBigTechGoverningBigTech . More than 170 Civil Society Groups Worldwide O...
#StopBigTechGoverningBigTech . More than 170 Civil Society Groups Worldwide O...
 
PACTO POR LA CIENCIA Y LA INNOVACIÓN 8 de febrero de 2021
PACTO POR LA CIENCIA Y LA INNOVACIÓN 8 de febrero de 2021PACTO POR LA CIENCIA Y LA INNOVACIÓN 8 de febrero de 2021
PACTO POR LA CIENCIA Y LA INNOVACIÓN 8 de febrero de 2021
 
Expert Panel of the European Blockchain Observatory and Forum
Expert Panel of the European Blockchain Observatory and ForumExpert Panel of the European Blockchain Observatory and Forum
Expert Panel of the European Blockchain Observatory and Forum
 
Desigualdades educativas derivadas del COVID-19 desde una perspectiva feminis...
Desigualdades educativas derivadas del COVID-19 desde una perspectiva feminis...Desigualdades educativas derivadas del COVID-19 desde una perspectiva feminis...
Desigualdades educativas derivadas del COVID-19 desde una perspectiva feminis...
 
"Experiencias booktuber: Más allá del libro y de la pantalla"
"Experiencias booktuber: Más allá del libro y de la pantalla""Experiencias booktuber: Más allá del libro y de la pantalla"
"Experiencias booktuber: Más allá del libro y de la pantalla"
 
The impact of digital influencers on adolescent identity building.
The impact of digital influencers on adolescent identity building.The impact of digital influencers on adolescent identity building.
The impact of digital influencers on adolescent identity building.
 
Open educational resources (OER) in the Spanish universities
Open educational resources (OER) in the Spanish universitiesOpen educational resources (OER) in the Spanish universities
Open educational resources (OER) in the Spanish universities
 
El modelo flipped classroom: un reto para una enseñanza centrada en el alumno
El modelo flipped classroom: un reto para una enseñanza centrada en el alumnoEl modelo flipped classroom: un reto para una enseñanza centrada en el alumno
El modelo flipped classroom: un reto para una enseñanza centrada en el alumno
 
Pensamiento propio e integración transdisciplinaria en la epistémica social. ...
Pensamiento propio e integración transdisciplinaria en la epistémica social. ...Pensamiento propio e integración transdisciplinaria en la epistémica social. ...
Pensamiento propio e integración transdisciplinaria en la epistémica social. ...
 
Escuela de Robótica de Misiones. Un modelo de educación disruptiva.
Escuela de Robótica de Misiones. Un modelo de educación disruptiva.Escuela de Robótica de Misiones. Un modelo de educación disruptiva.
Escuela de Robótica de Misiones. Un modelo de educación disruptiva.
 
La Universidad española Frente a la pandemia. Actuaciones de Crue Universidad...
La Universidad española Frente a la pandemia. Actuaciones de Crue Universidad...La Universidad española Frente a la pandemia. Actuaciones de Crue Universidad...
La Universidad española Frente a la pandemia. Actuaciones de Crue Universidad...
 
Covid-19 and IoT: Some Perspectives on the Use of IoT Technologies in Prevent...
Covid-19 and IoT: Some Perspectives on the Use of IoT Technologies in Prevent...Covid-19 and IoT: Some Perspectives on the Use of IoT Technologies in Prevent...
Covid-19 and IoT: Some Perspectives on the Use of IoT Technologies in Prevent...
 

Recently uploaded

Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
panagenda
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
WSO2
 

Recently uploaded (20)

EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWEREMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
 
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
 
Manulife - Insurer Transformation Award 2024
Manulife - Insurer Transformation Award 2024Manulife - Insurer Transformation Award 2024
Manulife - Insurer Transformation Award 2024
 
DBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor Presentation
 
AXA XL - Insurer Innovation Award Americas 2024
AXA XL - Insurer Innovation Award Americas 2024AXA XL - Insurer Innovation Award Americas 2024
AXA XL - Insurer Innovation Award Americas 2024
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
 
Artificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyArtificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : Uncertainty
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
 
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodPolkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)
 
Apidays Singapore 2024 - Scalable LLM APIs for AI and Generative AI Applicati...
Apidays Singapore 2024 - Scalable LLM APIs for AI and Generative AI Applicati...Apidays Singapore 2024 - Scalable LLM APIs for AI and Generative AI Applicati...
Apidays Singapore 2024 - Scalable LLM APIs for AI and Generative AI Applicati...
 
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CV
 
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ..."I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
 
FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024
 

Artificial intelligence to support human instruction Michael C. Mozera,b,c,, Melody Wiseheartd and Timothy P. Novikoff.

  • 1. COMMENTARY Artificial intelligence to support human instruction Michael C. Mozera,b,c,1 , Melody Wiseheartd , and Timothy P. Novikoffc The popular media’s recent interest in artificial intelli- gence (AI) has focused on autonomous systems that might ultimately replace people in fields as diverse as medicine, customer service, and transportation and logistics. Often neglected is a subfield of AI that fo- cuses on empowering people by improving how we learn, remember, perceive, and make decisions. This human-centered focus relies on interdisciplinary re- search from cognitive neuroscience, psychology, and theoretical computer science. The synergy among these fields promises to improve the microorganization of human instruction: picking the next exercise for a student to attempt, choosing what sort of hints and feedback to provide, determining when material should be reviewed, and selecting among teaching activities. These are deci- sions which occur on a granularity that human instruc- tors are typically unable to monitor and individualize, and for which students have poor metacognitive strategies (1). Such microinstruction complements the strengths of human teachers and can yield significant benefits. For example, in a semester-long experiment integrated into a middle-school foreign language course, setting aside roughly 30 min per week for AI- guided personalized review of previously introduced material led to a 16.5% improvement in overall course retention on a cumulative examination administered a month after the end of the semester, relative to a time- matched control condition that reflects current educa- tional practice (2). In PNAS, Tabibian et al. (3) address the learning and retention of factual material such as foreign lan- guage vocabulary. They present an adaptive, data- driven method with theoretical guarantees for sched- uling retrieval practice. Their work contributes to a growing body of results in algorithmic-education the- ory that proves properties of idealized mathematical models of educational scenarios (4–6). Like every proposal for AI-based instruction, the approach of Tabibian et al. (3) is formalized by two models: student and teacher. The student model quantifies an individual’s current knowledge state and how it evolves based on instructional actions and the passage of time. The teacher model spec- ifies a policy—in the control-theoretic sense—for 1 7 14 21 35 70 105 Inter-Study Interval (Days) 0 20 40 60 80 100 %Recall 7 days Retention Interval 35 days Retention Interval 70 days Retention Interval 350 days Retention Interval A Time 0 20 40 60 80 100 %Recall t 0 best inter-study interval B Fig. 1. (A) Recall accuracy at test for a given RI (7, 35, 70, or 350 d), after the learner has studied material over two sessions separated by a given ISI (< to 105 d). Data points are from a human behavioral experiment (18). Solid lines are predictions of a cognitive model fit with one free parameter to the data (10). (B) Hypothetical memory strength curves consistent with experimental results in A. Each curve shows a sequence of two study sessions, aligned such that the second session of each sequence occurs at time t0 (dashed line). For various t > t0 (i.e., the RI t − t0) the bars at the top of the graph indicate which ISI achieves the highest recall accuracy. The optimal spacing depends on the RI. a Department of Computer Science, University of Colorado Boulder, Boulder, CO 80309; b Institute of Cognitive Science, University of Colorado Boulder, Boulder, CO 80309; c Research and Machine Intelligence, Google, Inc., Mountain View, CA 94043; and d Department of Psychology, York University, Toronto, ON, Canada M3J 1P3 Author contributions: M.C.M., M.W., and T.P.N. wrote the paper. The authors declare no conflict of interest. Published under the PNAS license. See companion article on page 3988. 1 To whom correspondence should be addressed. Email: mozer@colorado.edu. Published online February 19, 2019. www.pnas.org/cgi/doi/10.1073/pnas.1900370116 PNAS | March 5, 2019 | vol. 116 | no. 10 | 3953–3955 COMMENTARY
  • 2. administering instructional actions conditioned on the student model. Student models range from those motivated heavily by psychological theory (7–10) to those motivated primarily by mathe- matical elegance and tractability (4–6). Somewhere in the middle of this continuum are methods that mine corpora of educational data (11) to predict the influence of various teaching actions on the student’s subsequent performance (2, 12). Teacher models also lie along a con- tinuum, from hand-constructed heuristic or expert-based approaches (2, 9, 13, 14) to those that provably optimize according to some criterion, such as the student model’s score on a postinstruction quiz (4, 15). The ultimate aim in AI is to develop automated instruction methods based on the combination of a psychologically valid student model and a theoretically grounded teacher model. However, this goal has yet to be achieved. Research concerned with solving a tractable optimization problem has assumed a simplified student model; nonetheless, even with a naive or unrealistic student model, the formal nature of the work can provide intuitions and a deeper understanding of the computational issues. Conversely, the research most concerned with psychological fidelity of the student model has relied on heuristic teacher models; nonetheless, heuristic strategies often appear to be effective [e.g., Bjork’s (16) notion of desirable difficulty]. Tabibian et al. (3) identify a class of student models that admit a formal analysis yet are informed by human memory data. They characterize the student’s memory state in terms of stochastic differential equations with jumps: When the student reviews a particular item (e.g., a vocabulary word), the memory strength of the item is bumped up and then decays over time. Reviewing times for an item are specified via a temporal point process. The stochastic optimal control problem is to determine the point-process reviewing intensity, that is, the rate of retrieval prac- tice, to maintain memory without excessive practice. Formally, the penalty function they optimize is quadratic in both recall failure rate and reviewing intensity. Their key result is a proof that the optimal reviewing intensity is proportional to the student model’s predicted probability of recall failure. Predicted recall probability has long been used as a heuristic measure for prioritizing material in flashcard-type schedulers, but Tabibian et al. (3) provide a com- pelling theoretical justification for the use of this heuristic. Tabibian et al.’s (3) claim that the benefit of review increases as memory decays is consonant with the spacing effect (17), the be- havioral finding that temporally distributed practice leads to more robust and durable learning than massed practice. The spacing effect is often cast as “don’t cram for an examination.” Cramming can help students remember information on the examination; however, cramming also leads to rapid forgetting. The rate of forgetting is slowed by spaced practice. Fig. 1A presents results from an empirical study in which participants learned trivia facts in two sessions and were subsequently quizzed on the facts (18). In the first session, each item is practiced to criterion; in the second session, each item is practiced a fixed number of times. The lag between the first and second study sessions (the interstudy inter- val, ISI) is varied from minutes to 105 d, and the lag between the second study session and the quiz (the retention interval, RI) is varied from 7 to 350 d. The figure reveals an optimal ISI, and this optimum increases with the RI. This interaction between optimal ISI and RI is also found in big data corpora (19). The smooth curves in the figure are predictions from a psychological model (10). Fig. 1B shows hypothetical forgetting curves for various ISIs, consistent with the empirical results in Fig. 1A and psychological theories of the spacing effect (e.g., refs. 10 and 17). Whereas psychologists have focused on optimizing time of study given a fixed number of practice opportunities (Fig. 1A), or on prioritizing items for study given finite study time (2, 13, 14), Tabibian et al. (3) frame the challenge as determining the optimal (time-varying) reviewing rate per item. It may be unrealistic in an educational context to forego hard constraints on practice trials or time, but the cost trade-off formulation that Tabibian et al. (3) propose—standard in machine learning—is a fresh perspective for psychology. Another novelty of Tabibian et al. (3) concerns the objective of spaced practice. Experimentalists typically con- sider optimizing spacing for a particular RI, that is, the duration following final study over which the information must be main- tained (Fig. 1A). In contrast, Tabibian et al. (3) suggest a perhaps more practical situation in which study and test phases are not distinguished and knowledge must be periodically refreshed to ensure its continuous availability. This practical, lifelong learning perspective is relatively understudied in psychology (20). Tabibian et al. identify a class of student models that admit a formal analysis yet are informed by human memory data. Just as cognitive psychology may find new avenues of investigation from algorithmic-instruction theory, algorithmic- instruction theory may have more practical impact as it takes into consideration the complexity of human behavioral data. For example, consider the inverted-U shape of the curves in Fig. 1A. Various psychological accounts have been proposed to explain these curves (17), most positing dual opposing mechanisms at play, one favoring short ISIs and one favoring long ISIs, and the trade-off between these opposing mecha- nisms gives rise to the inverted U. Formal student models in Tabibian et al. (3) and in other the- oretical work (15) presently do not capture this key dynamic of spaced practice. As Tabibian et al. (3) acknowledge, under their student model the forgetting rate is reduced at least as much by massed practice as spaced practice, inconsistent with the behav- ioral data. They suggest possible extensions but the theory does not at present incorporate these extensions. One might therefore expect an actual student following the schedule prescribed by their algorithm to fail to achieve optimal performance. Remark- ably, Tabibian et al. (3) present evidence ameliorating this con- cern. They report on a natural experiment involving the reanalysis of a big-data corpus from an online-learning platform. They find that forgetting is relatively slowed following study sequences that more closely adhere to their theory’s optimum spacing. Their natural experiment is as well-conducted as one could hope for: In comparing alternative schedules, they control for the lag be- tween first and final study, the total number of study trials, and the memory strength after the first study trial. Our explanation for the theory’s success in practice is their choice of penalty function, which imposes a cost for high review intensities (i.e., massed practice). This cost encourages long ISIs, and the inevi- tability of forgetting encourages short ISIs, resulting in dual op- posing factors that yield an intermediate spacing. Whether the Tabibian et al. (3) theory can predict the precise peak of the spacing function (Fig. 1A) may not be terribly critical, given that for long retention periods the peak appears to be broad and flat. This fact gives us some reason to be optimistic about the ease with which the spacing effect can be leveraged to mitigate forget- ting. Indeed, online platforms that perform adaptive scheduling 3954 | www.pnas.org/cgi/doi/10.1073/pnas.1900370116 Mozer et al.
  • 3. using variants of the Leitner method (21) and SuperMemo (22) have thrived despite their heuristic nature. Although “rote memorization” is not considered a particularly glamorous area of research in the education community, facts underpin most types of learning, and one cannot engage in crit- ical thinking without sufficient field-specific knowledge. The no- tion of an automated instructor which refreshes our knowledge base at appropriate times is quite appealing. Such an instructor even has a role to play in the classroom that is complementary to that of the teacher: In the study mentioned earlier using person- alized review software (2), the teacher is freed from mechanical duties to interact with smaller groups of students. AI-guided microinstruction has other potential applications, such as selecting and ordering examples to learn complex visual concepts (e.g., bird species or skin-lesion diagnosis), determining the hints a learner requires to solve a problem and when to provide them, and recognizing when a student’s attention has wandered while reading. In each of these cases, principled ap- proaches require a psychologically plausible student model, a quantitative measure of performance to be optimized, and a teacher model with theoretical guarantees of achieving optimal performance under the student model. The field of AI has made advances through development of its theoretical foundations and explicit formulations of system objectives. We expect this lesson to apply to human-centered AI as well, but it will occur only through collaborations between cognitive scientists who understand the intricacy of human behavior and AI researchers who can bring a new degree of clarity and elegance to cognitive models. 1 Cohen MS, Yan VX, Halamish V, Bjork RA (2013) Do students think that difficult or valuable materials should be restudied sooner rather than later? J Exp Psychol Learn Mem Cogn 39:1682–1696. 2 Lindsey RV, Shroyer JD, Pashler H, Mozer MC (2014) Improving students’ long-term knowledge retention through personalized review. Psychol Sci 25:639–647. 3 Tabibian B, et al. (2019) Enhancing human learning via spaced repetition optimization. Proc Natl Acad Sci USA 116:3988–3993. 4 Novikoff TP, Kleinberg JM, Strogatz SH (2012) Education of a model student. Proc Natl Acad Sci USA 109:1868–1873. 5 Lewis JB, Li N (2014) Combinatorial aspects of flashcard games. Ann Combinatorics 18:459–472. 6 Nishimura J (2018) Critically slow learning in flashcard learning models. Chaos Interdiscip J Nonlinear Sci 28:083115. 7 Pavlik PI, Anderson JR (2005) Practice and forgetting effects on vocabulary memory: An activation-based model of the spacing effect. Cogn Sci 29:559–586. 8 Atkinson RC (1972) Optimizing the learning of a second-language vocabulary. J Exp Psychol 96:124–129. 9 Khajah MM, Lindsey RV, Mozer MC (2014) Maximizing students’ retention via spaced review: Practical guidance from computational models of memory. Top Cogn Sci 6:157–169. 10 Mozer MC, Pashler H, Cepeda N, Lindsey RV, Vul E (2009) Predicting the optimal spacing of study: A multiscale context model of memory. Advances in Neural Information Processing Systems, eds Bengio Y, Schuurmans D, Lafferty J, Williams C, Culotta A (Curran Associates, Red Hook, NY), Vol 22, pp 1321–1329. 11 Koedinger KR, et al. (2010) A data repository for the EDM community: The PSLC datashop. Handbook of Educational Data Mining, eds Romero C, Ventura S, Pechenizkiy M, Baker RSJd (CRC, Boca Raton, FL). 12 Settles B, Meeder B (2016) A trainable spaced repetition model for language learning. Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics (Association for Computational Linguistics, Stroundsburg, PA), Vol. 1, pp 1848–1858. 13 Mettler E, Massey C, Kellman PJ (2011) Improving adaptive learning technology through the use of response times in Proceedings of the 33rd Annual Conference of the Cognitive Science Society, eds. CarlsonL, Holscher C, Shipley T. (Cognitive Science Society, Austin, TX), pp. 2532–2537. 14 Pavlik PI, Anderson JR (2008) Using a model to compute the optimal schedule of practice. J Exp Psychol Appl 14:101–117. 15 Reddy S, Labutov I, Banerjee S, Joachims T (2016) Unbounded human learning: Optimal scheduling for spaced repetition. Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (ACM, New York), pp 1815–1824. 16 Bjork R (1994) Memory and metamemory considerations in the training of human beings. Metacognition: Knowing About Knowing, eds Metcalfe J, Shimamura A (MIT Press, Cambridge, MA), pp 185–205. 17 Wiseheart M, et al. (2019) Enhancing the quality of student learning using distributed practice. Cambridge Handbook of Cognition and Education, eds Dunlosky J, Rawson K (Cambridge Univ Press, New York), pp 550–584. 18 Cepeda NJ, Vul E, Rohrer D, Wixted JT, Pashler H (2008) Spacing effects in learning: A temporal ridgeline of optimal retention. Psychol Sci 19:1095–1102. 19 Kim A, Wong-Kee-You A, Wiseheart M, Rosenbaum RS (2019) The spacing effect stands up to big data. Behav Res Methods, 10.3758/s13428-018-1184-7. 20 Kang SHK, Lindsey RV, Mozer MC, Pashler H (2014) Retrieval practice over the long term: Expanding or equal-interval spacing? Psychol Bull Rev 21:1544–1550. 21 Leitner S (1972) So Lernt Man Lernen: Angewandte Lernpsychologie – ein Weg zum Erfolg (Weltbild, Augsburg, Germany). 22 Wozniak P, Gorzelanczyk E (1994) Optimization of repetition spacing in the practice of learning. Acta Neurobiologiae Experimentalis 54:59–62. Mozer et al. PNAS | March 5, 2019 | vol. 116 | no. 10 | 3955