Intelligent Verification/Validation for XR Based Systems

The future of XR: Current ecosystem and upcoming opportunities
Intelligent Verification/Validation for XR
Based Systems
Rui Prada
Instituto Superior Técnico, Universidade de Lisboa
INESC-ID
The future of XR: Current ecosystem and upcoming opportunities - May 27, 2021
https://iv4xr-project.eu
Project Ref. EU H2020-ICT-2018-3 - 856716

Testing XR Systems
Highly interactive
Rich and complex environments
Diverse multi-modal input and output
Testing demands high human labour (user
testing)

Use Artificial Intelligence (AI) to
support automation of XR testing
(Autonomous Testing Agents)

Team
University and Research
- INESC-ID, Portugal (Coordinator)
- Utrecht University, The Netherlands
- Fondazione Bruno Kessler, Italy
- Universitat Politecnica Valencia, Spain
- Umeå University, Sweden
Industry
- Gameware Europe, UK
- Good AI, Czech Republic
- Thales (SIX and AVS), France

Autonomous Testing Agents
Actively pursue testing goals
Intelligent coverage of the interaction
space
Identify potential interaction paths
Represent users with different profiles

Testing goals
 Testing Functional Properties
 The behaviour of the system
 Testing User Experience (UX)
 The impact on the user

iv4XR Agent-based Testing Framework

https://github.com/iv4xr-project
 aplib + iv4XR EDSL
 aplib:
 basic agents and runtime system
 action and tactic
 goal and goal-combinators
 iv4XR:
 test agents and testing related basic support
 extensions: pathplanning, world object model, emotion,
learning, ...
iv4XR
framework,
from a different
perspective

Interface-pattern (instead of interface) to
cater for diverging XR technologies

Two types of FTA
 Two types of FTA’s are being
developed:
 The first type of agent makes
deliberations to choose the
appropriate strategies that will
allow him to do goal-solving.
 The second type of agent is
intended to test the
functionality of the XR system
using exploration.

Goal Solving Functional Test Agent

Fully Automated Exploring/Testing
Space Engineers
 TESTAR as iv4XR agent
 While exploring:
 We test for robustness
 To infer a model that can be used
for Model-Based Testing (MBT)
 Find the best representation of
the World Object Model (WOM)

b_0
b_1
b_2 b_3
d_1
d_2
d_T
agent
"b_0 -- EXPLORE --> d_1_m",
"d_1_m -- EXPLORE --> b_1",
"b_1 -- EXPLORE --> b_0",
"b_0 -- EXPLORE --> d_T_m",
"d_T_m -- EXPLORE --> d_1_m",
"d_1_m -- EXPLORE --> b_0"

Different approaches
focusing on User eXperience

Automated assessment of UX
Agent
Cognitive
model
Emotional
model
Difficulty
Interaction
Policies
Personas

Emotional Prediction Through Machine
Learning
Predictive Model
Feed
Predicts
Expected
Emotional
Changes
Emotional
Changes
(Continuous Self
Reporting)
Training
SUT
(Observations)

Learning
 The Case Study:
 A simple top-down 2D
game
 Continuous self-
reporting after the fact
of the 3 dimensions of
the PAD emotional
model (Pleasure,
Arousal and Dominance)

Learning
 The emotional
reporting was cut into
slices, which were then
categorized as
decreasing, steady or
increasing.
 A machine learning model
based on Random Forests
was then trained to
predict these 3 classes.

Learning
Current Results (3 seconds slices):
 Arousal
 Accuracy:
 86.69 %
 Average Confusion Matrix:
 Pleasure
 Accuracy:
 90.95 %
 Dominance
 Accuracy:
 82.07 %
48.39 0.46 0.10
8.32 119.33 10.71
5.81 9.21 110.67
70.84 0.59 0.45
7.35 147.54 8.58
2.56 4.04 102.65
47.49 1. 0.56
12.71 153.24 13.69
6.2 5.42 89.96

Automated Assessment of Cognitive
Emotions (OCC Model)
Intelligent agents are deployed to generate tests based on UX test
specifications. This is achieved by deploying a Computational model of
emotion designed to provide affective processing in our intelligent agents

Three Pilot Studies
 We are making use of three pilots to test the framework:
 Space Engineers - a 3D game
 Players make use of several tools and fabrications (blocks) to obtain
resources and explore a solar system with player constructed vehicles and
machines.
 Maev - A simulation environment
 A defend/attack scenario in a nuclear powerplant. Defenders test security
measures to stop external intrusions.
 LiveSite - A monitoring system for infrastructure and construction sites
 Test the reading of sensors.
 Sensors at these sites can produce up to thousands of readings per
second. There are many virtual sensors.

How are we using the testing agents?
 Space Engineers requires ~20000 tests for each major
release.
 One arduous test that we are working on automating with
iv4XR is checking textures of blocks depending on their build
state.
 The Maev powerplant scenario currently requires two teams
of humans:
 Iv4XR will take control of the incursion forces, automatically
testing the efficacy of the proposed security measures.
 LiveSite buildings and critical infrastructures have a large
number of sensors that produce many readings per second.
 Iv4XR constructs tests based on their predefined relationships
and thresholds, but also investigates found errors to
determine if the readings are correct, or if there is some
anomaly (faulty/miscalibrated sensors).

Intelligent Verification/Validation for XR
Based Systems
@iv4xr
rui.prada@gaips.inesc-id.pt
Project Ref. EU H2020-ICT-2018-3 - 856716

Intelligent Verification/Validation for XR Based Systems

More Related Content

Similar to Intelligent Verification/Validation for XR Based Systems

More from Rui Prada

Recently uploaded

Intelligent Verification/Validation for XR Based Systems