This document proposes using reinforcement learning to automate keystroke-level modeling (RL-KLM). Keystroke-level modeling is traditionally used to predict task completion times by modeling user behavior as a sequence of independent operators such as pointing and pressing. The authors represent the user interface as a Markov decision process that can be solved with reinforcement learning to find an optimal operator sequence. They demonstrate RL-KLM on cases such as controlling a remote, selecting modalities on an alarm, and filling out a form. The approach could enable automated user interface evaluation and optimization by finding designs that are simple, fast, and consistent.
3. Related Work
Model-based evaluation
• GOMS and KLM models used as evaluation functions
• E.g. CogTool, STEM
• Demonstrations required
[https://cogtool.wordpress.com]
[http://stem.lille.inria.fr]
Reinforcement learning in cognitive models
• Model learns a policy to use a UI
• Case specific (e.g. text entry)
[Jokinen et al. 2017] [Chen et al. 2015]
Inverse Reinforcement Learning
• Learns reward functions from observation
• Requires data
[Brochu et al. 2010]
5. Keystroke Level Model
Predicts task completion time by modelling behaviour as a sequence of independent operators.
• Mental operator: 1.35 s
• Pointing: 1.1 s
• Pressing: 1.5 s
• System response: 1.5 s
[Card et al. 1980]
Traditionally, the operator sequence is handcrafted.
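The prediction on this slide can be sketched in a few lines: total task time is simply the sum of the durations of the operators in a handcrafted sequence. The dictionary keys and the example sequence below are illustrative; the durations are the ones listed on the slide.

```python
# KLM sketch: task time is the sum of independent operator durations.
# Durations are taken from the slide; the one-letter keys are illustrative.
KLM_DURATIONS = {
    "M": 1.35,  # mental operator
    "P": 1.1,   # pointing
    "K": 1.5,   # pressing
    "R": 1.5,   # system response
}

def klm_time(sequence):
    """Predicted completion time for a handcrafted operator sequence."""
    return sum(KLM_DURATIONS[op] for op in sequence)

# Think, point at a button, press it, wait for the system to respond:
print(round(klm_time(["M", "P", "K", "R"]), 2))  # 5.45
```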
6. KLM as MDP
A Markov decision process (MDP) provides a mathematical framework for decision making.
The MDP's policy, which corresponds to the KLM operator sequence, can be solved with reinforcement learning.
The UI can be represented by a state-action simulator.
[Diagram: agent and interface exchanging state, action, and reward]
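The "state-action simulator" idea can be made concrete with a small environment class exposing a gym-style step() interface. The class name, its fields, and the toy dynamics below are assumptions for illustration, not the authors' code.

```python
# Sketch: the UI wrapped as a state-action simulator. The agent sends an
# action (a KLM operator with a duration); the simulator returns the next
# UI state, a reward, and a done flag.
class UISimulator:
    def __init__(self, n_fields=3):
        self.n_fields = n_fields  # e.g. form fields left to fill
        self.state = 0

    def reset(self):
        self.state = 0
        return self.state

    def step(self, operator_duration):
        """Advance the UI by one KLM operator; reward is -duration."""
        self.state += 1
        done = self.state >= self.n_fields
        return self.state, -operator_duration, done

sim = UISimulator(n_fields=2)
sim.reset()
print(sim.step(1.1))  # (1, -1.1, False)
```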
7. Reinforcement Learning
Learner
• Learns a policy by trial and error, interacting with the environment to maximise the cumulative reward.
• The policy defines which action the agent performs in the current state.
Reward (maze example)
• Finish the maze
• Penalty for each used action
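The maze example above can be sketched with minimal tabular Q-learning. The maze here is a 1-D corridor of five cells, the per-action penalty is -1 and the goal reward +10; all sizes, rewards, and hyperparameters are illustrative assumptions.

```python
import random

# Tabular Q-learning on a tiny corridor maze: start at cell 0, reach cell 4.
# Each action costs -1 (penalty per used action); the goal pays +10.
N, GOAL = 5, 4
ACTIONS = [-1, +1]  # step left / step right
Q = {(s, a): 0.0 for s in range(N) for a in ACTIONS}
alpha, gamma, eps = 0.5, 0.9, 0.1

random.seed(0)
for _ in range(500):  # training episodes
    s = 0
    while s != GOAL:
        # Epsilon-greedy action selection.
        if random.random() < eps:
            a = random.choice(ACTIONS)
        else:
            a = max(ACTIONS, key=lambda act: Q[(s, act)])
        s2 = min(max(s + a, 0), N - 1)
        r = 10.0 if s2 == GOAL else -1.0
        best_next = 0.0 if s2 == GOAL else max(Q[(s2, b)] for b in ACTIONS)
        Q[(s, a)] += alpha * (r + gamma * best_next - Q[(s, a)])
        s = s2

# Greedy policy after training: always step right toward the goal.
policy = [max(ACTIONS, key=lambda act: Q[(s, act)]) for s in range(GOAL)]
print(policy)  # [1, 1, 1, 1]
```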
8. RL-KLM
Finds a KLM operator sequence which
minimises task completion time.
Q-learning, where the action is a KLM operator, the reward is the (negative) duration of the operator, and the state is the state of the UI.
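The mapping on this slide can be written out directly: actions are KLM operators, the reward is the negative operator duration (so maximising cumulative reward minimises total task time), and states are UI states. A minimal sketch of one Q-learning update under these assumptions; the function names are illustrative.

```python
# Operator durations as listed earlier in the deck.
KLM_DURATIONS = {"M": 1.35, "P": 1.1, "K": 1.5, "R": 1.5}

def reward(operator):
    # Penalise the agent by the operator's duration, so maximising
    # cumulative reward minimises total task completion time.
    return -KLM_DURATIONS[operator]

def q_update(Q, s, op, s2, next_ops, alpha=0.1, gamma=1.0):
    """One Q-learning step; pass next_ops=[] when s2 is terminal."""
    best_next = max((Q.get((s2, o), 0.0) for o in next_ops), default=0.0)
    q = Q.get((s, op), 0.0)
    Q[(s, op)] = q + alpha * (reward(op) + gamma * best_next - q)

Q = {}
q_update(Q, "start", "P", "done", [])  # point once, then the task ends
print(round(Q[("start", "P")], 3))  # -0.11
```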
11. Case 1: Remote Controller
Task:
• Switch to a channel
• Select from two button types (blue
or green)
Proof of concept
• Time-optimal policy
• Selected button depends on the
distance between channels
• If distance > 3: blue
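The learned policy reported for this case is simple enough to write down directly; the threshold comes from the slide, while the function name and return values are illustrative.

```python
def pick_button(distance):
    # Time-optimal choice reported on the slide: the blue button type
    # pays off only when the channel distance is large enough.
    return "blue" if distance > 3 else "green"

print([pick_button(d) for d in range(1, 6)])
# ['green', 'green', 'green', 'blue', 'blue']
```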
12. Case 2: Multimodal Alarm
Problem
• Select modality to go to the goal state.
• Some modalities are inaccurate.
The policy accounts for recognition errors.
Gestures are the fastest to use, speech the second fastest, and tactile the slowest.
13. Case 3: GUI - Form filling
Task:
• Visit all states
Suited for spatial tasks: finds
the fastest path to visit the
items.
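The "fastest path to visit the items" can be sketched as a shortest-tour search over the form fields. The field coordinates and the distance-as-time assumption below are illustrative, and brute force stands in for the learned policy on this tiny instance.

```python
from itertools import permutations

# Illustrative field positions on a 2-D form; movement time is assumed
# proportional to Euclidean distance.
FIELDS = {"name": (0, 0), "email": (0, 1), "age": (2, 0), "submit": (2, 1)}

def tour_cost(order, start=(0, 0)):
    """Total travel distance when visiting the fields in the given order."""
    pos, total = start, 0.0
    for field in order:
        x, y = FIELDS[field]
        total += ((x - pos[0]) ** 2 + (y - pos[1]) ** 2) ** 0.5
        pos = (x, y)
    return total

best = min(permutations(FIELDS), key=tour_cost)
print(best, round(tour_cost(best), 2))
```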
16. Optimization
Objectives:
Simple, Fast, Consistent
• Trade-off between simple and fast
• Consistency: logical structure
UI modeled with a finite state machine
Design space and the tasks are
automatically generated.
[Figures: the simplest, balanced, and fastest generated designs]
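The simple-versus-fast trade-off on this slide can be illustrated with a scalarised objective. The weights, the scoring function, and the three candidate designs below are invented for illustration, not the authors' objective or data; consistency is omitted for brevity.

```python
# Hypothetical candidates: (predicted task time in s, number of FSM states).
DESIGNS = {"simplest": (9.0, 3), "balanced": (6.5, 5), "fastest": (5.0, 9)}

def score(task_time, n_states, w_fast=1.0, w_simple=0.5):
    # Lower is better: a weighted sum of the speed and simplicity
    # objectives trades one off against the other.
    return w_fast * task_time + w_simple * n_states

best = min(DESIGNS, key=lambda d: score(*DESIGNS[d]))
print(best)  # balanced
```

With these (invented) weights the balanced design wins; shifting weight toward w_fast or w_simple recovers the fastest or simplest design instead.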
17. Conclusion
• KLM is a general model that can be automated with
Reinforcement Learning
• RL-KLM: Finds a policy that minimizes the task completion time.
• Initial results for simple cases.
• Possible applications: Evaluation, Optimization
Demo and code for all experiments:
https://github.com/aalto-speech/rl-klm
19. References
- Jokinen, Jussi PP, et al. "Modelling learning of new keyboard layouts."
Proceedings of the 2017 CHI Conference on Human Factors in Computing Systems.
ACM, 2017.
- Chen, Xiuli, et al. "The emergence of interactive behavior: A model of rational
menu search." Proceedings of the 33rd Annual ACM Conference on Human Factors in
Computing Systems. ACM, 2015.
- Brochu, Eric, Vlad M. Cora, and Nando De Freitas. "A tutorial on Bayesian
optimization of expensive cost functions, with application to active user modeling
and hierarchical reinforcement learning." arXiv preprint arXiv:1012.2599 (2010).
- Card, Stuart K., Thomas P. Moran, and Allen Newell. "The keystroke-level model
for user performance time with interactive systems." Communications of the ACM
23.7 (1980): 396-410.