SlideShare a Scribd company logo
1 of 25
© SSE, Prof. Dr. Andreas Metzger
Explainable Online Reinforcement Learning for
Adaptive Systems
Felix Feit, Andreas Metzger, Klaus Pohl
(with contributions from Jan Laufer)
SE 2023, Paderborn
F. Feit, A. Metzger, K. Pohl, “Explainable Online Reinforcement Learning for
Adaptive Systems”, in Software Engineering 2023, Fachtagung des GI-
Fachbereichs Softwaretechnik, 21.-24. Februar 2023, Paderborn, ser. LNI, G.
Engels, R. Hebig, M. Tichy, Eds., vol. P-332. Gesellschaft für Informatik e.V.,
2023, pp. 53–54.
©
SSE,
Prof.
Dr.
Andreas
Metzger
SOFTWARE SYSTEMS ENGINEERING
Prof. Dr. K. Pohl
Agenda
1. Fundamentals
2. Problem
3. Solution
4. Discussion
SE2023 2
©
SSE,
Prof.
Dr.
Andreas
Metzger
SOFTWARE SYSTEMS ENGINEERING
Prof. Dr. K. Pohl
Fundamentals
(Self-)adaptive Software Systems [Salehie & Tahvildari, 2009; Weyns, 2021]
• Observe changes in environment, requirements and themselves
• Modify their structure, parameters and behavior
Adaptive Software Life-Cycle Model
SE2023 3
DEV OPS
self-observe
ADAPT
self-modify
manual human-in-the-loop autonomous
©
SSE,
Prof.
Dr.
Andreas
Metzger
SOFTWARE SYSTEMS ENGINEERING
Prof. Dr. K. Pohl
Fundamentals
Defining when to adapt
• Requires anticipating all relevant
environment states
• Infeasible in most cases due to incomplete
information at design time
Example:
• Concrete services dynamically bound during
execution not known at design time
Defining how to adapt
• Requires knowing concrete effect of adaptation
• Concrete effect typically not precisely known at
design time
Example:
• Deactivating recommendation engine has positive
impact on performance
• But: How much exactly?
SE2023 4
Engineering Challenge: „Design Time Uncertainty“ [Weyns et al. 2013; Weyns, 2020; Weyns et al., 2022]
©
SSE,
Prof.
Dr.
Andreas
Metzger
SOFTWARE SYSTEMS ENGINEERING
Prof. Dr. K. Pohl
Online Reinforcement Learning
Fundamental RL Model
[Sutton & Barto, 2018]
• Reward function R defines learning goal
• RL agent aims to optimize cumulative rewards
Classical RL (e.g., Q-Learning)
• In each learning step:
either exploitation or exploration
• Exploitation: Reuse of existing knowledge
• Exploration: Collection of new knowledge
Deep RL [Palm et al., 2020]
• Knowledge represented as
Deep Artificial Neuronal Network
• Natural handling of non-stationarity due to
stochastic action selection (sampling)
• Handling of continuous states and actions
• Generalization over unseen, neighbouring states
SE2023 5
Action A
State S
Reward R
Action
Selection
Next state S’
RL Agent
Policy
Policy
Update
Environ-
ment
Textbook example “Cliff Walk“:
Actions =
{UP, DOWN,
LEFT, RIGHT}
Reward
[Sutton & Barto, 2018]
States:
©
SSE,
Prof.
Dr.
Andreas
Metzger
SOFTWARE SYSTEMS ENGINEERING
Prof. Dr. K. Pohl
Online Reinforcement Learning for Adaptive Systems
6
MAPE-K Model Integrated Model
RL Model
Self-Adaptation Logic
Analyze
Monitor Execute
Plan
Knowledge
Online RL for SAS
Execute
Policy
(K)
Monitor
Action
Selection
(A + P)
Policy
Update
Action
A
State S
Reward R
Next state S’
Action A
State S
Reward R
Action
Selection
Next state S’
RL Agent
Policy
Policy
Update
Environ-
ment
Integrating MAPE-K and Reinforcement Learning [Metzger et al., 2022]
SEA4DQ@ESEC
©
SSE,
Prof.
Dr.
Andreas
Metzger
SOFTWARE SYSTEMS ENGINEERING
Prof. Dr. K. Pohl
Agenda
1. Fundamentals
2. Problem
3. Solution
4. Discussion
SE2023 7
©
SSE,
Prof.
Dr.
Andreas
Metzger
SOFTWARE SYSTEMS ENGINEERING
Prof. Dr. K. Pohl
Decision-making of RL agent not transparent
Reward function often too complex to deduce dynamic RL behavior
SE2023 8
Example: Webshop (SEAMS SWIM Exemplar)
• Goal: adapt system to balance different concerns
Example: Prescriptive Business Process Monitoring System
• Goal: find best time to raise alarm for adapting process
Alarm No Alarm
©
SSE,
Prof.
Dr.
Andreas
Metzger
SOFTWARE SYSTEMS ENGINEERING
Prof. Dr. K. Pohl
Decision-making of RL agent not transparent
Practically impossible to deduce decision-making by only observing
“interface”, i.e., R, S, A
SE2023 9
R
S, A
Example:
Webshop
©
SSE,
Prof.
Dr.
Andreas
Metzger
SOFTWARE SYSTEMS ENGINEERING
Prof. Dr. K. Pohl
Decision-making of RL agent not transparent
Knowledge in Deep RL not explicitly represented
SE2023 10
State
S
Action
A
Classical RL (Q-Learning)
Actions A = {UP, DOWN,
LEFT, RIGHT}
Reward
States S = {0, …, 47}
0 11
24 35
24 25 26 27 28 29 30 31 32 33 34 35
UP -13,36 -12,57 -11,73 -10,74 -9,95 -8,91 -7,99 -6,98 -5,95 -4,92 -3,93 -2,98
RIGHT -12,00 -11,00 -10,00 -9,00 -8,00 -7,00 -6,00 -5,00 -4,00 -3,00 -2,00 -1,93
LEFT -12,99 -13,00 -11,98 -10,95 -9,99 -8,89 -7,97 -6,98 -5,94 -4,81 -3,94 -2,98
DOWN -13,95 -112,18 -112,80 -111,49 -112,13 -112,68 -112,91 -112,42 -111,81 -110,62 -112,86 -1,00
Action
A
State S:
Deep RL
Falling into the cliff Reaching the Goal (G)
Example:
Cliff Walk
©
SSE,
Prof.
Dr.
Andreas
Metzger
SOFTWARE SYSTEMS ENGINEERING
Prof. Dr. K. Pohl
Agenda
1. Fundamentals
2. Problem
3. Solution
4. Discussion
SE2023 11
©
SSE,
Prof.
Dr.
Andreas
Metzger
SOFTWARE SYSTEMS ENGINEERING
Prof. Dr. K. Pohl
XRL-DINE
Reward Decomposition
[Juozapaitis et al., 2019]
Decompose reward function to explain short-
term goal orientation of RL (train sub-RL agents)
Pro
• Helpful in the presence of multiple, “competing”
quality goals for learning
• Provides contrastive (counterfactual) explanations
Con
• No indication of explanation’s relevance
• Requires manually selecting relevant explanations
 cognitive overhead
Interestingness Elements
[Sequeira et al., 2020]
Identify relevant moments of interaction
between agent and environment at runtime
Pro
• Facilitates automatically selecting relevant
interactions to be explained
Con
• Does not explain whether RL behaves as expected
and for the right reasons
12
Provide Insights into Decision Making of Online RL
Decomposed Interestingness Elements (DINEs)
SE2023
+
©
SSE,
Prof.
Dr.
Andreas
Metzger
SOFTWARE SYSTEMS ENGINEERING
Prof. Dr. K. Pohl
XRL-DINE
Uncertain Action
Whether RL in given state is
uncertain (wide range of actions)
or certain
Reward Channel
Extremum
Points after local minimum /
maximum of state-value indicate
decisions in potentially
critical states
Reward Channel
Dominance
Influence that each sub-agent has
on each possible action
13
Three Types of DINEs
SE2023
Visualization in Dashboard
Certain Actions Uncertain Actions
Action 2 Action 2
Action 4
Action 4
Action 4
Action 2 Action 4
Action 1
Action 2
Action 1 Action 1
time step
Minimum Maximum
time step
Action 1 Action 2 Action 3 Action 4
Reward Channel 1 Reward Channel 2 Reward Channel 3
Action 1 Action 2 Action 3 Action 4
Reward Channel 1 Reward Channel 2 Reward Channel 3
Absolute Reward Channel Dominance Relative Reward Channel Dominance
Action 1 Action 2 Action 3 Action 4 Action 5 Action 1 Action 2 Action 3 Action 4 Action 5
©
SSE,
Prof.
Dr.
Andreas
Metzger
SOFTWARE SYSTEMS ENGINEERING
Prof. Dr. K. Pohl
XRL-DINE
Application to
SWIM Exemplar
SE2023 14
©
SSE,
Prof.
Dr.
Andreas
Metzger
SOFTWARE SYSTEMS ENGINEERING
Prof. Dr. K. Pohl
XRL-DINE
SE2023 15
User Study
RQ1: Task performance?
 8 different tasks carried out in the role of a software engineer
RQ2: Technology Acceptance?
 12 questions along Technology Acceptance Model – TAM (5-point scale)
Note, we decided against comparative assessment; would be unfair due to “black-box” nature of Deep RL
E.g., “why was adaptation X
chosen instead of adaptation Y in
state S?”
E.g., “using XRL-DINE in my job
would increase my job
productivity.”
©
SSE,
Prof.
Dr.
Andreas
Metzger
SOFTWARE SYSTEMS ENGINEERING
Prof. Dr. K. Pohl
XRL-DINE
SE2023 16
User Study – Initial Results
RQ1: Task Performance
 Effectiveness:  0.875
 Efficiency:  0.67
o cf. inspections:  0.07 [Petersen et al., 2008]
o cf. debugging:  0.03 [Ceccato et al., 2015]
17%
31%
31%
21%
Bachelor Student
Master Student
Academia
Industry
0.0 0.5 1.0 1.5
0.0 0.2 0.4 0.6 0.8 1.0
48 Participants
©
SSE,
Prof.
Dr.
Andreas
Metzger
SOFTWARE SYSTEMS ENGINEERING
Prof. Dr. K. Pohl
XRL-DINE
SE2023 17
User Study – Initial Results
RQ2: Technology Acceptance
 Useful for 61% of participants
 Easy to Use for 55% of participants
4% 4% 6% 2% 6% 4%
17% 19% 15%
10%
10%
6%
21% 23% 23%
27% 13% 21%
46%
48%
44% 48%
48% 48%
13% 6% 13% 13%
23% 21%
0%
10%
20%
30%
40%
50%
60%
70%
80%
90%
100%
Faster Better Increased
Productivity
Increased
Effectiveness
Easier to do
job
useful
Perceived Usefulness
Extremely Unlikely Quite Unlikely Neither Quite Likely Extremely Likely
4% 6% 2% 6% 4%
19% 15%
10%
10%
6%
23% 23%
27% 13% 21%
48%
44% 48%
48% 48%
6% 13% 13%
23% 21%
Better Increased
Productivity
Increased
Effectiveness
Easier to do
job
useful
Perceived Usefulness
ikely Quite Unlikely Neither Quite Likely Extremely Likely
% 6% 10% 6%
15%
6%
% 21% 17% 21%
13% 25%
% 19% 15%
27%
15%
17%
%
40%
33%
35%
48% 40%
% 15%
25%
10% 10% 13%
operate Easy to get the
dashboard to do
what I want
Interaction clear
and
understandable
Visualization
clear and
understandable
Easy to become
skillfil
Easy to Use
Perceived Ease of Use
tremely Unlikely Quite Unlikely Neither Quite Likely Extremely Likely
Useful Easy to Use
©
SSE,
Prof.
Dr.
Andreas
Metzger
SOFTWARE SYSTEMS ENGINEERING
Prof. Dr. K. Pohl
Agenda
1. Fundamentals
2. Problem
3. Solution
4. Discussion
SE2023 18
©
SSE,
Prof.
Dr.
Andreas
Metzger
SOFTWARE SYSTEMS ENGINEERING
Prof. Dr. K. Pohl
Discussion
A. Metzger, ACSOS 2022 19
Explanation Requirements from Social Sciences [Miller @ Artif. Intell. 2019]
Contrastive: “why P happened instead of Q?”
 “Reward Channel Dominance” DINE
Selective: “no need for complete course of events”
 “Reward Channel Extrema” DINE &
“Uncertain Actions” DINE
Causal: “most likely explanation not
necessarily the best”
 Future work: e.g., check whether agent relies on
spurious correlations (and not causality)
[Gajcin et al. @ AAAMAS Wkshp 2022]
Social: “transfer of knowledge as part of a
conversation”
 Future work: e.g., Chatbots for XAI (à la ChatGPT ;-)


?
? Prediction
Explanation
Train will be
delayed
Train passed
last light 5
min later
than typical
This also
happened
yesterday, but
why today a
problem?
Because of
attribute
“number of
trains behind
current one” > 5
What would
have to change
such that not
delay
(counterfactual)
“number of
trains behind
current one” < 2
Human
Chat4XAI
©
SSE,
Prof.
Dr.
Andreas
Metzger
SOFTWARE SYSTEMS ENGINEERING
Prof. Dr. K. Pohl
Thanks!
SE2023 20
Research leading to these results has received funding from the EU’s
Horizon 2020 research and innovation programme under grant
agreement no. 871493 – www.dataports-project.eu
Track on
Data and AI Driven Engineering
(DAIDE)
As part of SEAA 2023 (49th Euromicro Conference
on Software Engineering and Advanced
Applications)
https://dsd-seaa2023.com/daide/
Deadline: April, 3
Special Issue on
Software Engineering and AI for Data
Quality
ACM Journal of Data and
Information Quality (JDIQ)
https://bit.ly/JDIQ-SI
Deadline: April, 30
Blatant advertising ;-)
Call for Papers
©
SSE,
Prof.
Dr.
Andreas
Metzger
SOFTWARE SYSTEMS ENGINEERING
Prof. Dr. K. Pohl
References
[Arabnejad et al., 2017] H. Arabnejad, C. Pahl, P. Jamshidi, and G. Estrada, “A comparison of reinforcement learning techniques for
fuzzy cloud autoscaling,” in 17th Intl Symposium on Cluster, Cloud and Grid Computing, CCGRID 2017
[De Lemos et al. 2010] R. de Lemos et al., “Software Engineering for Self-Adaptive Systems: A Second Research Roadmap,” in Softw.
Eng. for Self-Adaptive Systems II, ser. LNCS. Springer, 2013, vol. 7475, pp. 1–32
[Di Francescomarino et al. 2018] Chiara Di Francescomarino, Chiara Ghidini, Fabrizio Maria Maggi, Fredrik Milani: Predictive Process
Monitoring Methods: Which One Suits Me Best? BPM 2018: 462-479
[Dulac-Arnold et al. 2015] Gabriel Dulac-Arnold, Richard Evans, Peter Sunehag, Ben Coppin: Reinforcement Learning in Large Discrete
Action Spaces. CoRR abs/1512.07679 (2015)
[Evermann et al. 2017] Evermann, J., Rehse, J., Fettke, P.: Predicting process behaviour using deep learning. Decision Support Systems
100, 2017
[Filho & Porter, 2017] Filho, R.V.R., Porter, B.: Defining emergent software using continuous self-assembly, perception, and learning.
TAAS 12(3), 16:1–16:25 (2017)
[Jamshidi et al., 2015] P. Jamshidi, A. Molzam Sharifloo, C. Pahl, A. Metzger, and G. Estrada, “Self-learning cloud controllers: Fuzzy Q-
learning for knowledge evolution (short paper),” in Int’l Conference on Cloud and Autonomic Computing (IC- CAC 2015) Cambridge,
USA, September 21-24, 2015,
[Kephart & Chess, 2003] J. O. Kephart and D. M. Chess, “The vision of autonomic computing,” IEEE Computer, vol. 36, no. 1, pp. 41–50,
2003.
[Klein et al. 2014] C. Klein, M. Maggio, K. Arzen, F. Hernandez-Rodriguez, “Brownout: building more robust cloud applications”. In:
36th Intl Conf. on Software Engineering (ICSE 2014), pp. 700–711. ACM, 2014
[Mann, 2016] Z. Mann, “Interplay of virtual machine selection and virtual machine placement”, in: 5th European Conf. on Service-
Oriented and Cloud Computing, ESOCC’16, LNCS vol. 9846, pp. 137–151 (2016)
[Metzger & Pohl, 2014] A. Metzger, K. Pohl, “Software product line engineering and variability management: Achievements and
challenges,” in ICSE Future of Software Engineering Track (FOSE 2014), ACM, 2014, pp. 70–84.
21
SE2023
©
SSE,
Prof.
Dr.
Andreas
Metzger
SOFTWARE SYSTEMS ENGINEERING
Prof. Dr. K. Pohl
References
[Metzger et al. 2019] A. Metzger, A. Neubauer, P. Bohn, and K. Pohl, “Proactive process adaptation using deep learning ensembles,” in
31st Int’l Conf. on Advanced Information Systems Engineering (CAiSE 2019), LNCS, vol. 11483. Springer, 2019, pp. 547–562
[Metzger et al. 2020] A. Metzger, C. Quinton, Z. Á. Mann, L. Baresi, K. Pohl, “Realizing Self-Adaptive Systems via Online Reinforcement
Learning and Feature-Model-guided Exploration”, Computing, Springer, March, 2022
[Metzger et al. 2020a] A. Metzger, C. Quinton, Z. Mann, L. Baresi, and K. Pohl, “Feature model-guided online reinforcement learning
for self-adaptive services,” in 18th Int’l Conf. on Service-Oriented Computing (ICSOC 2020), LNCS 12571, Springer, 2020
[Metzger et al. 2020b] A. Metzger, T. Kley, and A. Palm, “Triggering proactive business process adaptations via online reinforcement
learning,” in 18th Int’l Conf. on Business Process Management (BPM 2020), LNCS 12168. Springer, 2020e
[Palm et al. 2020] A. Palm, A. Metzger, and K. Pohl, “Online reinforcement learning for self-adaptive information systems,” in 32nd Int’l
Conf. on Advanced Information Systems Engineering (CAiSE 2020), LNCS 12127. Springer, 2020
[Porter et al., 2020] B. Porter, R. R. Filho, and P. Dean, “A survey of methodology in self-adaptive systems research,” in ACSOS. IEEE,
2020
[Salehie & Tahvildari, 2009] M. Salehie and L. Tahvildari, “Self-adaptive software: Landscape and research challenges,” TAAS, vol. 4, no.
2, 2009.
22
SE2023
©
SSE,
Prof.
Dr.
Andreas
Metzger
SOFTWARE SYSTEMS ENGINEERING
Prof. Dr. K. Pohl
References
[Siegmund et al. 2012] N. Siegmund, S. Kolesnikov, C. Kästner, S. Apel, D. Batory, M. Rosenmüller, G. Saake, G.: Predicting Performance
via Automated Feature-interaction Detection. In: 34th Intl Conf. on Software Engineering (ICSE 2012), pp. 167–177, ACM, 2012
[Sutton & Barto, 2018] R. S. Sutton and A. G. Barto, Reinforcement Learning: An Introduction, 2nd ed. Cambridge, MA, USA: MIT Press,
2018
[Taylor & Stone, 2009] M. Taylor, P. Stone: Transfer learning for reinforcement learning domains: A survey. J. Mach. Learn. Res. 10,
1633–1685 (2009)
[Wang et al., 2020] Hongbing Wang, Jiajie Li, Qi Yu, Tianjing Hong, Jia Yan, Wei Zhao: Integrating recurrent neural networks and
reinforcement learning for dynamic service composition. Future Gener. Comput. Syst. 107: 551-563 (2020)
[Weyns et al. 2013] Danny Weyns, Nelly Bencomo, Radu Calinescu, Javier Cámara, Carlo Ghezzi, Vincenzo Grassi, Lars Grunske, Paola
Inverardi, Jean-Marc Jézéquel, Sam Malek, Raffaela Mirandola, Marco Mori, Giordano Tamburrelli: Perpetual Assurances for Self-
Adaptive Systems. Software Engineering for Self-Adaptive Systems 2013: 31-63
[Weyns, 2021] Danny Weyns, Introduction to Self-Adaptive Systems: A Contemporary Software Engineering Perspective, Wiley, 2021.
[Weyns et al., 2022] D. Weyns, I. Gerostathopoulos, N. Abbas, J. Andersson, S. Biffl, P. Brada, T. Bures, A. D. Salle, P. Lago, A. Musil, J.
Musil, and P. Pelliccione, “Preliminary results of a survey on the use of self-adaptation in industry,” in 17th Intl Symp. on Software
Engineering for Adaptive and Self-Managing Systems, SEAMS@ICSE 2022, 2022.
[Xu et al., 2012] C. Xu, J. Rao, and X. Bu, “URL: A unified reinforcement learning approach for autonomic cloud management,” J.
Parallel Distrib. Comput., vol. 72, no. 2, pp. 95–105, 2012
23
SE2023
©
SSE,
Prof.
Dr.
Andreas
Metzger
SOFTWARE SYSTEMS ENGINEERING
Prof. Dr. K. Pohl
Fundamentals
MAPE-K Reference Model [Kephart & Chess, 2003; Salehie & Tahvildari, 2009]
• Architecture as defining characteristic of adaptive system [Weyns, 2020]
SE2023 24
Example: Adaptive Web Shop
• Monitor: Sudden increase in number of concurrent users (workload peak)
• Analyze: Slow response of web shop if no adaptation
• Plan: Deactivating optional recommendation feature
• Execute: Replace dynamic recommendations with static banner
Self-Adaptation Logic
Analyze
Monitor Execute
Plan
Knowledge
Identify concrete
adaptations
Enact concrete
adaptation
Determine need for
adaptation
Collect and aggregate
observations
System Logic
Sensors Effectors
0
e
+
0
0
1
e
+
0
5
2
e
+
0
5
3
e
+
0
5
4
e
+
0
5
5
e
+
0
5
6
e
+
0
5
1
0
0
1
5
0
2
0
0
2
5
0
d
$
w
o
r
k
l
o
a
d
Workload
t
©
SSE,
Prof.
Dr.
Andreas
Metzger
SOFTWARE SYSTEMS ENGINEERING
Prof. Dr. K. Pohl
Online Reinforcement Learning
25
MAPE-K Model Integrated Model
RL Model
Self-Adaptation Logic
Analyze
Monitor Execute
Plan
Knowledge
Online RL for SAS
Execute
Policy
(K)
Monitor
Action
Selection
(A + P)
Policy
Update
Action
A
State S
Reward R
Next state S’
Action A
State S
Reward R
Action
Selection
Next state S’
RL Agent
Policy
Policy
Update
Environ-
ment
Integrating MAPE-K and Reinforcement Learning [Metzger et al., 2022]
SE2023

More Related Content

Similar to Explainable Online Reinforcement Learning for Adaptive Systems

Intro to Agile Methods for Execs, Leaders, and Managers
Intro to Agile Methods for Execs, Leaders, and ManagersIntro to Agile Methods for Execs, Leaders, and Managers
Intro to Agile Methods for Execs, Leaders, and ManagersDavid Rico
 
IA3_presentation.pptx
IA3_presentation.pptxIA3_presentation.pptx
IA3_presentation.pptxKtonNguyn2
 
Reducing_Learning_Curve_in_LB_GB_Sujith
Reducing_Learning_Curve_in_LB_GB_SujithReducing_Learning_Curve_in_LB_GB_Sujith
Reducing_Learning_Curve_in_LB_GB_SujithSujith Kolath
 
Attention Approximation: From the web to multi-screen television
Attention Approximation: From the web to multi-screen televisionAttention Approximation: From the web to multi-screen television
Attention Approximation: From the web to multi-screen televisionCaroline Jay
 
Learning Software Performance Models for Dynamic and Uncertain Environments
Learning Software Performance Models for Dynamic and Uncertain EnvironmentsLearning Software Performance Models for Dynamic and Uncertain Environments
Learning Software Performance Models for Dynamic and Uncertain EnvironmentsPooyan Jamshidi
 
ATI's Total Systems Engineering Development & Management technical training c...
ATI's Total Systems Engineering Development & Management technical training c...ATI's Total Systems Engineering Development & Management technical training c...
ATI's Total Systems Engineering Development & Management technical training c...Jim Jenkins
 
Total systems engineering_development_management_course_sampler
Total systems engineering_development_management_course_samplerTotal systems engineering_development_management_course_sampler
Total systems engineering_development_management_course_samplerJim Jenkins
 
SafeguardAI and Surprise Based Learning -- Protect your AI solutions from Uni...
SafeguardAI and Surprise Based Learning -- Protect your AI solutions from Uni...SafeguardAI and Surprise Based Learning -- Protect your AI solutions from Uni...
SafeguardAI and Surprise Based Learning -- Protect your AI solutions from Uni...NAVER Engineering
 
Semantic Analysis to Compute Personality Traits from Social Media Posts
Semantic Analysis to Compute Personality Traits from Social Media PostsSemantic Analysis to Compute Personality Traits from Social Media Posts
Semantic Analysis to Compute Personality Traits from Social Media PostsGiulio Carducci
 
QM-007-Design for 6 sigma
QM-007-Design for 6 sigmaQM-007-Design for 6 sigma
QM-007-Design for 6 sigmahandbook
 
Automated attendance system using Face recognition
Automated attendance system using Face recognitionAutomated attendance system using Face recognition
Automated attendance system using Face recognitionIRJET Journal
 
F-Secure Cloud Software icgse2013
F-Secure Cloud Software icgse2013F-Secure Cloud Software icgse2013
F-Secure Cloud Software icgse2013Janne Järvinen
 
Fuzzy Control meets Software Engineering
Fuzzy Control meets Software EngineeringFuzzy Control meets Software Engineering
Fuzzy Control meets Software EngineeringPooyan Jamshidi
 
ANALYSIS OF SOFTWARE SECURITY TESTING TECHNIQUES IN CLOUD COMPUTING
ANALYSIS OF SOFTWARE SECURITY TESTING TECHNIQUES IN CLOUD COMPUTINGANALYSIS OF SOFTWARE SECURITY TESTING TECHNIQUES IN CLOUD COMPUTING
ANALYSIS OF SOFTWARE SECURITY TESTING TECHNIQUES IN CLOUD COMPUTINGEditor IJMTER
 
Comments for Simulations Part II.pdf
Comments for Simulations Part II.pdfComments for Simulations Part II.pdf
Comments for Simulations Part II.pdfJeanMarshall8
 
Digicrome Data Science & AI 11 Month Course PDF.pdf
Digicrome Data Science & AI 11 Month Course PDF.pdfDigicrome Data Science & AI 11 Month Course PDF.pdf
Digicrome Data Science & AI 11 Month Course PDF.pdfitsmeankitkhan
 
A Multi-Objective Refactoring Approach to Introduce Design Patterns and Fix A...
A Multi-Objective Refactoring Approach to Introduce Design Patterns and Fix A...A Multi-Objective Refactoring Approach to Introduce Design Patterns and Fix A...
A Multi-Objective Refactoring Approach to Introduce Design Patterns and Fix A...Ali Ouni
 
Sensors, threats, responses and challenges - Dr Emil Lupu (Imperial College L...
Sensors, threats, responses and challenges - Dr Emil Lupu (Imperial College L...Sensors, threats, responses and challenges - Dr Emil Lupu (Imperial College L...
Sensors, threats, responses and challenges - Dr Emil Lupu (Imperial College L...Comit Projects Ltd
 
A Study of Wearable Accelerometers Layout for Human Activity Recognition(Asia...
A Study of Wearable Accelerometers Layout for Human Activity Recognition(Asia...A Study of Wearable Accelerometers Layout for Human Activity Recognition(Asia...
A Study of Wearable Accelerometers Layout for Human Activity Recognition(Asia...sugiuralab
 
A science-gateway for workflow executions: online and non-clairvoyant self-h...
A science-gateway for workflow executions: online and non-clairvoyant self-h...A science-gateway for workflow executions: online and non-clairvoyant self-h...
A science-gateway for workflow executions: online and non-clairvoyant self-h...Rafael Ferreira da Silva
 

Similar to Explainable Online Reinforcement Learning for Adaptive Systems (20)

Intro to Agile Methods for Execs, Leaders, and Managers
Intro to Agile Methods for Execs, Leaders, and ManagersIntro to Agile Methods for Execs, Leaders, and Managers
Intro to Agile Methods for Execs, Leaders, and Managers
 
IA3_presentation.pptx
IA3_presentation.pptxIA3_presentation.pptx
IA3_presentation.pptx
 
Reducing_Learning_Curve_in_LB_GB_Sujith
Reducing_Learning_Curve_in_LB_GB_SujithReducing_Learning_Curve_in_LB_GB_Sujith
Reducing_Learning_Curve_in_LB_GB_Sujith
 
Attention Approximation: From the web to multi-screen television
Attention Approximation: From the web to multi-screen televisionAttention Approximation: From the web to multi-screen television
Attention Approximation: From the web to multi-screen television
 
Learning Software Performance Models for Dynamic and Uncertain Environments
Learning Software Performance Models for Dynamic and Uncertain EnvironmentsLearning Software Performance Models for Dynamic and Uncertain Environments
Learning Software Performance Models for Dynamic and Uncertain Environments
 
ATI's Total Systems Engineering Development & Management technical training c...
ATI's Total Systems Engineering Development & Management technical training c...ATI's Total Systems Engineering Development & Management technical training c...
ATI's Total Systems Engineering Development & Management technical training c...
 
Total systems engineering_development_management_course_sampler
Total systems engineering_development_management_course_samplerTotal systems engineering_development_management_course_sampler
Total systems engineering_development_management_course_sampler
 
SafeguardAI and Surprise Based Learning -- Protect your AI solutions from Uni...
SafeguardAI and Surprise Based Learning -- Protect your AI solutions from Uni...SafeguardAI and Surprise Based Learning -- Protect your AI solutions from Uni...
SafeguardAI and Surprise Based Learning -- Protect your AI solutions from Uni...
 
Semantic Analysis to Compute Personality Traits from Social Media Posts
Semantic Analysis to Compute Personality Traits from Social Media PostsSemantic Analysis to Compute Personality Traits from Social Media Posts
Semantic Analysis to Compute Personality Traits from Social Media Posts
 
QM-007-Design for 6 sigma
QM-007-Design for 6 sigmaQM-007-Design for 6 sigma
QM-007-Design for 6 sigma
 
Automated attendance system using Face recognition
Automated attendance system using Face recognitionAutomated attendance system using Face recognition
Automated attendance system using Face recognition
 
F-Secure Cloud Software icgse2013
F-Secure Cloud Software icgse2013F-Secure Cloud Software icgse2013
F-Secure Cloud Software icgse2013
 
Fuzzy Control meets Software Engineering
Fuzzy Control meets Software EngineeringFuzzy Control meets Software Engineering
Fuzzy Control meets Software Engineering
 
ANALYSIS OF SOFTWARE SECURITY TESTING TECHNIQUES IN CLOUD COMPUTING
ANALYSIS OF SOFTWARE SECURITY TESTING TECHNIQUES IN CLOUD COMPUTINGANALYSIS OF SOFTWARE SECURITY TESTING TECHNIQUES IN CLOUD COMPUTING
ANALYSIS OF SOFTWARE SECURITY TESTING TECHNIQUES IN CLOUD COMPUTING
 
Comments for Simulations Part II.pdf
Comments for Simulations Part II.pdfComments for Simulations Part II.pdf
Comments for Simulations Part II.pdf
 
Digicrome Data Science & AI 11 Month Course PDF.pdf
Digicrome Data Science & AI 11 Month Course PDF.pdfDigicrome Data Science & AI 11 Month Course PDF.pdf
Digicrome Data Science & AI 11 Month Course PDF.pdf
 
A Multi-Objective Refactoring Approach to Introduce Design Patterns and Fix A...
A Multi-Objective Refactoring Approach to Introduce Design Patterns and Fix A...A Multi-Objective Refactoring Approach to Introduce Design Patterns and Fix A...
A Multi-Objective Refactoring Approach to Introduce Design Patterns and Fix A...
 
Sensors, threats, responses and challenges - Dr Emil Lupu (Imperial College L...
Sensors, threats, responses and challenges - Dr Emil Lupu (Imperial College L...Sensors, threats, responses and challenges - Dr Emil Lupu (Imperial College L...
Sensors, threats, responses and challenges - Dr Emil Lupu (Imperial College L...
 
A Study of Wearable Accelerometers Layout for Human Activity Recognition(Asia...
A Study of Wearable Accelerometers Layout for Human Activity Recognition(Asia...A Study of Wearable Accelerometers Layout for Human Activity Recognition(Asia...
A Study of Wearable Accelerometers Layout for Human Activity Recognition(Asia...
 
A science-gateway for workflow executions: online and non-clairvoyant self-h...
A science-gateway for workflow executions: online and non-clairvoyant self-h...A science-gateway for workflow executions: online and non-clairvoyant self-h...
A science-gateway for workflow executions: online and non-clairvoyant self-h...
 

More from Andreas Metzger

Antrittsvorlesung - APL.pptx
Antrittsvorlesung - APL.pptxAntrittsvorlesung - APL.pptx
Antrittsvorlesung - APL.pptxAndreas Metzger
 
Feature Model-Guided Online Reinforcement Learning for Self-Adaptive Services
Feature Model-Guided Online Reinforcement Learning for Self-Adaptive ServicesFeature Model-Guided Online Reinforcement Learning for Self-Adaptive Services
Feature Model-Guided Online Reinforcement Learning for Self-Adaptive ServicesAndreas Metzger
 
Triggering Proactive Business Process Adaptations via Online Reinforcement Le...
Triggering Proactive Business Process Adaptations via Online Reinforcement Le...Triggering Proactive Business Process Adaptations via Online Reinforcement Le...
Triggering Proactive Business Process Adaptations via Online Reinforcement Le...Andreas Metzger
 
Data-driven AI for Self-Adaptive Software Systems
Data-driven AI for Self-Adaptive Software SystemsData-driven AI for Self-Adaptive Software Systems
Data-driven AI for Self-Adaptive Software SystemsAndreas Metzger
 
Data-driven Deep Learning for Proactive Terminal Process Management
Data-driven Deep Learning for Proactive Terminal Process ManagementData-driven Deep Learning for Proactive Terminal Process Management
Data-driven Deep Learning for Proactive Terminal Process Management Andreas Metzger
 
Big Data Technology Insights
Big Data Technology InsightsBig Data Technology Insights
Big Data Technology InsightsAndreas Metzger
 
Proactive Process Adaptation using Deep Learning Ensembles
Proactive Process Adaptation using Deep Learning Ensembles Proactive Process Adaptation using Deep Learning Ensembles
Proactive Process Adaptation using Deep Learning Ensembles Andreas Metzger
 
Data-driven AI for Self-adaptive Information Systems
Data-driven AI for Self-adaptive Information SystemsData-driven AI for Self-adaptive Information Systems
Data-driven AI for Self-adaptive Information SystemsAndreas Metzger
 
Towards an End-to-End Architecture for Run-time Data Protection in the Cloud
Towards an End-to-End Architecture for Run-time Data Protection in the Cloud Towards an End-to-End Architecture for Run-time Data Protection in the Cloud
Towards an End-to-End Architecture for Run-time Data Protection in the Cloud Andreas Metzger
 
Considering Non-sequential Control Flows for Process Prediction with Recurren...
Considering Non-sequential Control Flows for Process Prediction with Recurren...Considering Non-sequential Control Flows for Process Prediction with Recurren...
Considering Non-sequential Control Flows for Process Prediction with Recurren...Andreas Metzger
 
Big Data Value in Mobility and Logistics
Big Data Value in Mobility and Logistics Big Data Value in Mobility and Logistics
Big Data Value in Mobility and Logistics Andreas Metzger
 
Predictive Business Process Monitoring considering Reliability and Risk
Predictive Business Process Monitoring considering Reliability and RiskPredictive Business Process Monitoring considering Reliability and Risk
Predictive Business Process Monitoring considering Reliability and RiskAndreas Metzger
 
Risk-based Proactive Process Adaptation
Risk-based Proactive Process AdaptationRisk-based Proactive Process Adaptation
Risk-based Proactive Process AdaptationAndreas Metzger
 
Predictive Process Monitoring Considering Reliability Estimates
Predictive Process Monitoring Considering Reliability EstimatesPredictive Process Monitoring Considering Reliability Estimates
Predictive Process Monitoring Considering Reliability EstimatesAndreas Metzger
 

More from Andreas Metzger (14)

Antrittsvorlesung - APL.pptx
Antrittsvorlesung - APL.pptxAntrittsvorlesung - APL.pptx
Antrittsvorlesung - APL.pptx
 
Feature Model-Guided Online Reinforcement Learning for Self-Adaptive Services
Feature Model-Guided Online Reinforcement Learning for Self-Adaptive ServicesFeature Model-Guided Online Reinforcement Learning for Self-Adaptive Services
Feature Model-Guided Online Reinforcement Learning for Self-Adaptive Services
 
Triggering Proactive Business Process Adaptations via Online Reinforcement Le...
Triggering Proactive Business Process Adaptations via Online Reinforcement Le...Triggering Proactive Business Process Adaptations via Online Reinforcement Le...
Triggering Proactive Business Process Adaptations via Online Reinforcement Le...
 
Data-driven AI for Self-Adaptive Software Systems
Data-driven AI for Self-Adaptive Software SystemsData-driven AI for Self-Adaptive Software Systems
Data-driven AI for Self-Adaptive Software Systems
 
Data-driven Deep Learning for Proactive Terminal Process Management
Data-driven Deep Learning for Proactive Terminal Process ManagementData-driven Deep Learning for Proactive Terminal Process Management
Data-driven Deep Learning for Proactive Terminal Process Management
 
Big Data Technology Insights
Big Data Technology InsightsBig Data Technology Insights
Big Data Technology Insights
 
Proactive Process Adaptation using Deep Learning Ensembles
Proactive Process Adaptation using Deep Learning Ensembles Proactive Process Adaptation using Deep Learning Ensembles
Proactive Process Adaptation using Deep Learning Ensembles
 
Data-driven AI for Self-adaptive Information Systems
Data-driven AI for Self-adaptive Information SystemsData-driven AI for Self-adaptive Information Systems
Data-driven AI for Self-adaptive Information Systems
 
Towards an End-to-End Architecture for Run-time Data Protection in the Cloud
Towards an End-to-End Architecture for Run-time Data Protection in the Cloud Towards an End-to-End Architecture for Run-time Data Protection in the Cloud
Towards an End-to-End Architecture for Run-time Data Protection in the Cloud
 
Considering Non-sequential Control Flows for Process Prediction with Recurren...
Considering Non-sequential Control Flows for Process Prediction with Recurren...Considering Non-sequential Control Flows for Process Prediction with Recurren...
Considering Non-sequential Control Flows for Process Prediction with Recurren...
 
Big Data Value in Mobility and Logistics
Big Data Value in Mobility and Logistics Big Data Value in Mobility and Logistics
Big Data Value in Mobility and Logistics
 
Predictive Business Process Monitoring considering Reliability and Risk
Predictive Business Process Monitoring considering Reliability and RiskPredictive Business Process Monitoring considering Reliability and Risk
Predictive Business Process Monitoring considering Reliability and Risk
 
Risk-based Proactive Process Adaptation
Risk-based Proactive Process AdaptationRisk-based Proactive Process Adaptation
Risk-based Proactive Process Adaptation
 
Predictive Process Monitoring Considering Reliability Estimates
Predictive Process Monitoring Considering Reliability EstimatesPredictive Process Monitoring Considering Reliability Estimates
Predictive Process Monitoring Considering Reliability Estimates
 

Recently uploaded

Alluxio Monthly Webinar | Cloud-Native Model Training on Distributed Data
Alluxio Monthly Webinar | Cloud-Native Model Training on Distributed DataAlluxio Monthly Webinar | Cloud-Native Model Training on Distributed Data
Alluxio Monthly Webinar | Cloud-Native Model Training on Distributed DataAlluxio, Inc.
 
5 Signs You Need a Fashion PLM Software.pdf
5 Signs You Need a Fashion PLM Software.pdf5 Signs You Need a Fashion PLM Software.pdf
5 Signs You Need a Fashion PLM Software.pdfWave PLM
 
Advancing Engineering with AI through the Next Generation of Strategic Projec...
Advancing Engineering with AI through the Next Generation of Strategic Projec...Advancing Engineering with AI through the Next Generation of Strategic Projec...
Advancing Engineering with AI through the Next Generation of Strategic Projec...OnePlan Solutions
 
cybersecurity notes for mca students for learning
cybersecurity notes for mca students for learningcybersecurity notes for mca students for learning
cybersecurity notes for mca students for learningVitsRangannavar
 
The Evolution of Karaoke From Analog to App.pdf
The Evolution of Karaoke From Analog to App.pdfThe Evolution of Karaoke From Analog to App.pdf
The Evolution of Karaoke From Analog to App.pdfPower Karaoke
 
Cloud Management Software Platforms: OpenStack
Cloud Management Software Platforms: OpenStackCloud Management Software Platforms: OpenStack
Cloud Management Software Platforms: OpenStackVICTOR MAESTRE RAMIREZ
 
Building a General PDE Solving Framework with Symbolic-Numeric Scientific Mac...
Building a General PDE Solving Framework with Symbolic-Numeric Scientific Mac...Building a General PDE Solving Framework with Symbolic-Numeric Scientific Mac...
Building a General PDE Solving Framework with Symbolic-Numeric Scientific Mac...stazi3110
 
Engage Usergroup 2024 - The Good The Bad_The Ugly
Engage Usergroup 2024 - The Good The Bad_The UglyEngage Usergroup 2024 - The Good The Bad_The Ugly
Engage Usergroup 2024 - The Good The Bad_The UglyFrank van der Linden
 
What is Binary Language? Computer Number Systems
What is Binary Language?  Computer Number SystemsWhat is Binary Language?  Computer Number Systems
What is Binary Language? Computer Number SystemsJheuzeDellosa
 
Der Spagat zwischen BIAS und FAIRNESS (2024)
Der Spagat zwischen BIAS und FAIRNESS (2024)Der Spagat zwischen BIAS und FAIRNESS (2024)
Der Spagat zwischen BIAS und FAIRNESS (2024)OPEN KNOWLEDGE GmbH
 
Project Based Learning (A.I).pptx detail explanation
Project Based Learning (A.I).pptx detail explanationProject Based Learning (A.I).pptx detail explanation
Project Based Learning (A.I).pptx detail explanationkaushalgiri8080
 
Try MyIntelliAccount Cloud Accounting Software As A Service Solution Risk Fre...
Try MyIntelliAccount Cloud Accounting Software As A Service Solution Risk Fre...Try MyIntelliAccount Cloud Accounting Software As A Service Solution Risk Fre...
Try MyIntelliAccount Cloud Accounting Software As A Service Solution Risk Fre...MyIntelliSource, Inc.
 
The Essentials of Digital Experience Monitoring_ A Comprehensive Guide.pdf
The Essentials of Digital Experience Monitoring_ A Comprehensive Guide.pdfThe Essentials of Digital Experience Monitoring_ A Comprehensive Guide.pdf
The Essentials of Digital Experience Monitoring_ A Comprehensive Guide.pdfkalichargn70th171
 
Professional Resume Template for Software Developers
Professional Resume Template for Software DevelopersProfessional Resume Template for Software Developers
Professional Resume Template for Software DevelopersVinodh Ram
 
BATTLEFIELD ORM: TIPS, TACTICS AND STRATEGIES FOR CONQUERING YOUR DATABASE
BATTLEFIELD ORM: TIPS, TACTICS AND STRATEGIES FOR CONQUERING YOUR DATABASEBATTLEFIELD ORM: TIPS, TACTICS AND STRATEGIES FOR CONQUERING YOUR DATABASE
BATTLEFIELD ORM: TIPS, TACTICS AND STRATEGIES FOR CONQUERING YOUR DATABASEOrtus Solutions, Corp
 
Building Real-Time Data Pipelines: Stream & Batch Processing workshop Slide
Building Real-Time Data Pipelines: Stream & Batch Processing workshop SlideBuilding Real-Time Data Pipelines: Stream & Batch Processing workshop Slide
Building Real-Time Data Pipelines: Stream & Batch Processing workshop SlideChristina Lin
 
Unit 1.1 Excite Part 1, class 9, cbse...
Unit 1.1 Excite Part 1, class 9, cbse...Unit 1.1 Excite Part 1, class 9, cbse...
Unit 1.1 Excite Part 1, class 9, cbse...aditisharan08
 
EY_Graph Database Powered Sustainability
EY_Graph Database Powered SustainabilityEY_Graph Database Powered Sustainability
EY_Graph Database Powered SustainabilityNeo4j
 
XpertSolvers: Your Partner in Building Innovative Software Solutions
XpertSolvers: Your Partner in Building Innovative Software SolutionsXpertSolvers: Your Partner in Building Innovative Software Solutions
XpertSolvers: Your Partner in Building Innovative Software SolutionsMehedi Hasan Shohan
 
Asset Management Software - Infographic
Asset Management Software - InfographicAsset Management Software - Infographic
Asset Management Software - InfographicHr365.us smith
 

Recently uploaded (20)

Alluxio Monthly Webinar | Cloud-Native Model Training on Distributed Data
Alluxio Monthly Webinar | Cloud-Native Model Training on Distributed DataAlluxio Monthly Webinar | Cloud-Native Model Training on Distributed Data
Alluxio Monthly Webinar | Cloud-Native Model Training on Distributed Data
 
5 Signs You Need a Fashion PLM Software.pdf
5 Signs You Need a Fashion PLM Software.pdf5 Signs You Need a Fashion PLM Software.pdf
5 Signs You Need a Fashion PLM Software.pdf
 
Advancing Engineering with AI through the Next Generation of Strategic Projec...
Advancing Engineering with AI through the Next Generation of Strategic Projec...Advancing Engineering with AI through the Next Generation of Strategic Projec...
Advancing Engineering with AI through the Next Generation of Strategic Projec...
 
cybersecurity notes for mca students for learning
cybersecurity notes for mca students for learningcybersecurity notes for mca students for learning
cybersecurity notes for mca students for learning
 
The Evolution of Karaoke From Analog to App.pdf
The Evolution of Karaoke From Analog to App.pdfThe Evolution of Karaoke From Analog to App.pdf
The Evolution of Karaoke From Analog to App.pdf
 
Cloud Management Software Platforms: OpenStack
Cloud Management Software Platforms: OpenStackCloud Management Software Platforms: OpenStack
Cloud Management Software Platforms: OpenStack
 
Building a General PDE Solving Framework with Symbolic-Numeric Scientific Mac...
Building a General PDE Solving Framework with Symbolic-Numeric Scientific Mac...Building a General PDE Solving Framework with Symbolic-Numeric Scientific Mac...
Building a General PDE Solving Framework with Symbolic-Numeric Scientific Mac...
 
Engage Usergroup 2024 - The Good The Bad_The Ugly
Engage Usergroup 2024 - The Good The Bad_The UglyEngage Usergroup 2024 - The Good The Bad_The Ugly
Engage Usergroup 2024 - The Good The Bad_The Ugly
 
What is Binary Language? Computer Number Systems
What is Binary Language?  Computer Number SystemsWhat is Binary Language?  Computer Number Systems
What is Binary Language? Computer Number Systems
 
Der Spagat zwischen BIAS und FAIRNESS (2024)
Der Spagat zwischen BIAS und FAIRNESS (2024)Der Spagat zwischen BIAS und FAIRNESS (2024)
Der Spagat zwischen BIAS und FAIRNESS (2024)
 
Project Based Learning (A.I).pptx detail explanation
Project Based Learning (A.I).pptx detail explanationProject Based Learning (A.I).pptx detail explanation
Project Based Learning (A.I).pptx detail explanation
 
Try MyIntelliAccount Cloud Accounting Software As A Service Solution Risk Fre...
Try MyIntelliAccount Cloud Accounting Software As A Service Solution Risk Fre...Try MyIntelliAccount Cloud Accounting Software As A Service Solution Risk Fre...
Try MyIntelliAccount Cloud Accounting Software As A Service Solution Risk Fre...
 
The Essentials of Digital Experience Monitoring_ A Comprehensive Guide.pdf
The Essentials of Digital Experience Monitoring_ A Comprehensive Guide.pdfThe Essentials of Digital Experience Monitoring_ A Comprehensive Guide.pdf
The Essentials of Digital Experience Monitoring_ A Comprehensive Guide.pdf
 
Professional Resume Template for Software Developers
Professional Resume Template for Software DevelopersProfessional Resume Template for Software Developers
Professional Resume Template for Software Developers
 
BATTLEFIELD ORM: TIPS, TACTICS AND STRATEGIES FOR CONQUERING YOUR DATABASE
BATTLEFIELD ORM: TIPS, TACTICS AND STRATEGIES FOR CONQUERING YOUR DATABASEBATTLEFIELD ORM: TIPS, TACTICS AND STRATEGIES FOR CONQUERING YOUR DATABASE
BATTLEFIELD ORM: TIPS, TACTICS AND STRATEGIES FOR CONQUERING YOUR DATABASE
 
Building Real-Time Data Pipelines: Stream & Batch Processing workshop Slide
Building Real-Time Data Pipelines: Stream & Batch Processing workshop SlideBuilding Real-Time Data Pipelines: Stream & Batch Processing workshop Slide
Building Real-Time Data Pipelines: Stream & Batch Processing workshop Slide
 
Unit 1.1 Excite Part 1, class 9, cbse...
Unit 1.1 Excite Part 1, class 9, cbse...Unit 1.1 Excite Part 1, class 9, cbse...
Unit 1.1 Excite Part 1, class 9, cbse...
 
EY_Graph Database Powered Sustainability
EY_Graph Database Powered SustainabilityEY_Graph Database Powered Sustainability
EY_Graph Database Powered Sustainability
 
XpertSolvers: Your Partner in Building Innovative Software Solutions
XpertSolvers: Your Partner in Building Innovative Software SolutionsXpertSolvers: Your Partner in Building Innovative Software Solutions
XpertSolvers: Your Partner in Building Innovative Software Solutions
 
Asset Management Software - Infographic
Asset Management Software - InfographicAsset Management Software - Infographic
Asset Management Software - Infographic
 

Explainable Online Reinforcement Learning for Adaptive Systems

  • 1. © SSE, Prof. Dr. Andreas Metzger Explainable Online Reinforcement Learning for Adaptive Systems Felix Feit, Andreas Metzger, Klaus Pohl (with contributions from Jan Laufer) SE 2023, Paderborn F. Feit, A. Metzger, K. Pohl, “Explainable Online Reinforcement Learning for Adaptive Systems”, in Software Engineering 2023, Fachtagung des GI- Fachbereichs Softwaretechnik, 21.-24. Februar 2023, Paderborn, ser. LNI, G. Engels, R. Hebig, M. Tichy, Eds., vol. P-332. Gesellschaft für Informatik e.V., 2023, pp. 53–54.
  • 2. © SSE, Prof. Dr. Andreas Metzger SOFTWARE SYSTEMS ENGINEERING Prof. Dr. K. Pohl Agenda 1. Fundamentals 2. Problem 3. Solution 4. Discussion SE2023 2
  • 3. © SSE, Prof. Dr. Andreas Metzger SOFTWARE SYSTEMS ENGINEERING Prof. Dr. K. Pohl Fundamentals (Self-)adaptive Software Systems [Salehie & Tahvildari, 2009; Weyns, 2021] • Observe changes in environment, requirements and themselves • Modify their structure, parameters and behavior Adaptive Software Life-Cycle Model SE2023 3 DEV OPS self-observe ADAPT self-modify manual human-in-the-loop autonomous
  • 4. © SSE, Prof. Dr. Andreas Metzger SOFTWARE SYSTEMS ENGINEERING Prof. Dr. K. Pohl Fundamentals Defining when to adapt • Requires anticipating all relevant environment states • Infeasible in most cases due to incomplete information at design time Example: • Concrete services dynamically bound during execution not known at design time Defining how to adapt • Requires knowing concrete effect of adaptation • Concrete effect typically not precisely known at design time Example: • Deactivating recommendation engine has positive impact on performance • But: How much exactly? SE2023 4 Engineering Challenge: „Design Time Uncertainty“ [Weyns et al. 2013; Weyns, 2020; Weyns et al., 2022]
  • 5. © SSE, Prof. Dr. Andreas Metzger SOFTWARE SYSTEMS ENGINEERING Prof. Dr. K. Pohl Online Reinforcement Learning Fundamental RL Model [Sutton & Barto, 2018] • Reward function R defines learning goal • RL agent aims to optimize cumulative rewards Classical RL (e.g., Q-Learning) • In each learning step: either exploitation or exploration • Exploitation: Reuse of existing knowledge • Exploration: Collection of new knowledge Deep RL [Palm et al., 2020] • Knowledge represented as Deep Artificial Neuronal Network • Natural handling of non-stationarity due to stochastic action selection (sampling) • Handling of continuous states and actions • Generalization over unseen, neighbouring states SE2023 5 Action A State S Reward R Action Selection Next state S’ RL Agent Policy Policy Update Environ- ment Textbook example “Cliff Walk“: Actions = {UP, DOWN, LEFT, RIGHT} Reward [Sutton & Barto, 2018] States:
  • 6. © SSE, Prof. Dr. Andreas Metzger SOFTWARE SYSTEMS ENGINEERING Prof. Dr. K. Pohl Online Reinforcement Learning for Adaptive Systems 6 MAPE-K Model Integrated Model RL Model Self-Adaptation Logic Analyze Monitor Execute Plan Knowledge Online RL for SAS Execute Policy (K) Monitor Action Selection (A + P) Policy Update Action A State S Reward R Next state S’ Action A State S Reward R Action Selection Next state S’ RL Agent Policy Policy Update Environ- ment Integrating MAPE-K and Reinforcement Learning [Metzger et al., 2022] SEA4DQ@ESEC
  • 7. © SSE, Prof. Dr. Andreas Metzger SOFTWARE SYSTEMS ENGINEERING Prof. Dr. K. Pohl Agenda 1. Fundamentals 2. Problem 3. Solution 4. Discussion SE2023 7
  • 8. © SSE, Prof. Dr. Andreas Metzger SOFTWARE SYSTEMS ENGINEERING Prof. Dr. K. Pohl Decision-making of RL agent not transparent Reward function often too complex to deduce dynamic RL behavior SE2023 8 Example: Webshop (SEAMS SWIM Exemplar) • Goal: adapt system to balance different concerns Example: Prescriptive Business Process Monitoring System • Goal: find best time to raise alarm for adapting process Alarm No Alarm
  • 9. © SSE, Prof. Dr. Andreas Metzger SOFTWARE SYSTEMS ENGINEERING Prof. Dr. K. Pohl Decision-making of RL agent not transparent Practically impossible to deduce decision-making by only observing “interface”, i.e., R, S, A SE2023 9 R S, A Example: Webshop
  • 10. © SSE, Prof. Dr. Andreas Metzger SOFTWARE SYSTEMS ENGINEERING Prof. Dr. K. Pohl Decision-making of RL agent not transparent Knowledge in Deep RL not explicitly represented SE2023 10 State S Action A Classical RL (Q-Learning) Actions A = {UP, DOWN, LEFT, RIGHT} Reward States S = {0, …, 47} 0 11 24 35 24 25 26 27 28 29 30 31 32 33 34 35 UP -13,36 -12,57 -11,73 -10,74 -9,95 -8,91 -7,99 -6,98 -5,95 -4,92 -3,93 -2,98 RIGHT -12,00 -11,00 -10,00 -9,00 -8,00 -7,00 -6,00 -5,00 -4,00 -3,00 -2,00 -1,93 LEFT -12,99 -13,00 -11,98 -10,95 -9,99 -8,89 -7,97 -6,98 -5,94 -4,81 -3,94 -2,98 DOWN -13,95 -112,18 -112,80 -111,49 -112,13 -112,68 -112,91 -112,42 -111,81 -110,62 -112,86 -1,00 Action A State S: Deep RL Falling into the cliff Reaching the Goal (G) Example: Cliff Walk
  • 11. © SSE, Prof. Dr. Andreas Metzger SOFTWARE SYSTEMS ENGINEERING Prof. Dr. K. Pohl Agenda 1. Fundamentals 2. Problem 3. Solution 4. Discussion SE2023 11
  • 12. © SSE, Prof. Dr. Andreas Metzger SOFTWARE SYSTEMS ENGINEERING Prof. Dr. K. Pohl XRL-DINE Reward Decomposition [Juozapaitis et al., 2019] Decompose reward function to explain short- term goal orientation of RL (train sub-RL agents) Pro • Helpful in the presence of multiple, “competing” quality goals for learning • Provides contrastive (counterfactual) explanations Con • No indication of explanation’s relevance • Requires manually selecting relevant explanations  cognitive overhead Interestingness Elements [Sequeira et al., 2020] Identify relevant moments of interaction between agent and environment at runtime Pro • Facilitates automatically selecting relevant interactions to be explained Con • Does not explain whether RL behaves as expected and for the right reasons 12 Provide Insights into Decision Making of Online RL Decomposed Interestingness Elements (DINEs) SE2023 +
  • 13. © SSE, Prof. Dr. Andreas Metzger SOFTWARE SYSTEMS ENGINEERING Prof. Dr. K. Pohl XRL-DINE Uncertain Action Whether RL in given state is uncertain (wide range of actions) or certain Reward Channel Extremum Points after local minimum / maximum of state-value indicate decisions in potentially critical states Reward Channel Dominance Influence that each sub-agent has on each possible action 13 Three Types of DINEs SE2023 Visualization in Dashboard Certain Actions Uncertain Actions Action 2 Action 2 Action 4 Action 4 Action 4 Action 2 Action 4 Action 1 Action 2 Action 1 Action 1 time step Minimum Maximum time step Action 1 Action 2 Action 3 Action 4 Reward Channel 1 Reward Channel 2 Reward Channel 3 Action 1 Action 2 Action 3 Action 4 Reward Channel 1 Reward Channel 2 Reward Channel 3 Absolute Reward Channel Dominance Relative Reward Channel Dominance Action 1 Action 2 Action 3 Action 4 Action 5 Action 1 Action 2 Action 3 Action 4 Action 5
  • 14. © SSE, Prof. Dr. Andreas Metzger SOFTWARE SYSTEMS ENGINEERING Prof. Dr. K. Pohl XRL-DINE Application to SWIM Exemplar SE2023 14
  • 15. © SSE, Prof. Dr. Andreas Metzger SOFTWARE SYSTEMS ENGINEERING Prof. Dr. K. Pohl XRL-DINE SE2023 15 User Study RQ1: Task performance?  8 different tasks carried out in the role of a software engineer RQ2: Technology Acceptance?  12 questions along Technology Acceptance Model – TAM (5-point scale) Note, we decided against comparative assessment; would be unfair due to “black-box” nature of Deep RL E.g., “why was adaptation X chosen instead of adaptation Y in state S?” E.g., “using XRL-DINE in my job would increase my job productivity.”
  • 16. © SSE, Prof. Dr. Andreas Metzger SOFTWARE SYSTEMS ENGINEERING Prof. Dr. K. Pohl XRL-DINE SE2023 16 User Study – Initial Results RQ1: Task Performance  Effectiveness:  0.875  Efficiency:  0.67 o cf. inspections:  0.07 [Petersen et al., 2008] o cf. debugging:  0.03 [Ceccato et al., 2015] 17% 31% 31% 21% Bachelor Student Master Student Academia Industry 0.0 0.5 1.0 1.5 0.0 0.2 0.4 0.6 0.8 1.0 48 Participants
  • 17. © SSE, Prof. Dr. Andreas Metzger SOFTWARE SYSTEMS ENGINEERING Prof. Dr. K. Pohl XRL-DINE SE2023 17 User Study – Initial Results RQ2: Technology Acceptance  Useful for 61% of participants  Easy to Use for 55% of participants 4% 4% 6% 2% 6% 4% 17% 19% 15% 10% 10% 6% 21% 23% 23% 27% 13% 21% 46% 48% 44% 48% 48% 48% 13% 6% 13% 13% 23% 21% 0% 10% 20% 30% 40% 50% 60% 70% 80% 90% 100% Faster Better Increased Productivity Increased Effectiveness Easier to do job useful Perceived Usefulness Extremely Unlikely Quite Unlikely Neither Quite Likely Extremely Likely 4% 6% 2% 6% 4% 19% 15% 10% 10% 6% 23% 23% 27% 13% 21% 48% 44% 48% 48% 48% 6% 13% 13% 23% 21% Better Increased Productivity Increased Effectiveness Easier to do job useful Perceived Usefulness ikely Quite Unlikely Neither Quite Likely Extremely Likely % 6% 10% 6% 15% 6% % 21% 17% 21% 13% 25% % 19% 15% 27% 15% 17% % 40% 33% 35% 48% 40% % 15% 25% 10% 10% 13% operate Easy to get the dashboard to do what I want Interaction clear and understandable Visualization clear and understandable Easy to become skillfil Easy to Use Perceived Ease of Use tremely Unlikely Quite Unlikely Neither Quite Likely Extremely Likely Useful Easy to Use
  • 18. © SSE, Prof. Dr. Andreas Metzger SOFTWARE SYSTEMS ENGINEERING Prof. Dr. K. Pohl Agenda 1. Fundamentals 2. Problem 3. Solution 4. Discussion SE2023 18
  • 19. © SSE, Prof. Dr. Andreas Metzger SOFTWARE SYSTEMS ENGINEERING Prof. Dr. K. Pohl Discussion A. Metzger, ACSOS 2022 19 Explanation Requirements from Social Sciences [Miller @ Artif. Intell. 2019] Contrastive: “why P happened instead of Q?”  “Reward Channel Dominance” DINE Selective: “no need for complete course of events”  “Reward Channel Extrema” DINE & “Uncertain Actions” DINE Causal: “most likely explanation not necessarily the best”  Future work: e.g., check whether agent relies on spurious correlations (and not causality) [Gajcin et al. @ AAAMAS Wkshp 2022] Social: “transfer of knowledge as part of a conversation”  Future work: e.g., Chatbots for XAI (à la ChatGPT ;-)   ? ? Prediction Explanation Train will be delayed Train passed last light 5 min later than typical This also happened yesterday, but why today a problem? Because of attribute “number of trains behind current one” > 5 What would have to change such that not delay (counterfactual) “number of trains behind current one” < 2 Human Chat4XAI
  • 20. © SSE, Prof. Dr. Andreas Metzger SOFTWARE SYSTEMS ENGINEERING Prof. Dr. K. Pohl Thanks! SE2023 20 Research leading to these results has received funding from the EU’s Horizon 2020 research and innovation programme under grant agreement no. 871493 – www.dataports-project.eu Track on Data and AI Driven Engineering (DAIDE) As part of SEAA 2023 (49th Euromicro Conference on Software Engineering and Advanced Applications) https://dsd-seaa2023.com/daide/ Deadline: April, 3 Special Issue on Software Engineering and AI for Data Quality ACM Journal of Data and Information Quality (JDIQ) https://bit.ly/JDIQ-SI Deadline: April, 30 Blatant advertising ;-) Call for Papers
  • 21. © SSE, Prof. Dr. Andreas Metzger SOFTWARE SYSTEMS ENGINEERING Prof. Dr. K. Pohl References [Arabnejad et al., 2017] H. Arabnejad, C. Pahl, P. Jamshidi, and G. Estrada, “A comparison of reinforcement learning techniques for fuzzy cloud autoscaling,” in 17th Intl Symposium on Cluster, Cloud and Grid Computing, CCGRID 2017 [De Lemos et al. 2010] R. de Lemos et al., “Software Engineering for Self-Adaptive Systems: A Second Research Roadmap,” in Softw. Eng. for Self-Adaptive Systems II, ser. LNCS. Springer, 2013, vol. 7475, pp. 1–32 [Di Francescomarino et al. 2018] Chiara Di Francescomarino, Chiara Ghidini, Fabrizio Maria Maggi, Fredrik Milani: Predictive Process Monitoring Methods: Which One Suits Me Best? BPM 2018: 462-479 [Dulac-Arnold et al. 2015] Gabriel Dulac-Arnold, Richard Evans, Peter Sunehag, Ben Coppin: Reinforcement Learning in Large Discrete Action Spaces. CoRR abs/1512.07679 (2015) [Evermann et al. 2017] Evermann, J., Rehse, J., Fettke, P.: Predicting process behaviour using deep learning. Decision Support Systems 100, 2017 [Filho & Porter, 2017] Filho, R.V.R., Porter, B.: Defining emergent software using continuous self-assembly, perception, and learning. TAAS 12(3), 16:1–16:25 (2017) [Jamshidi et al., 2015] P. Jamshidi, A. Molzam Sharifloo, C. Pahl, A. Metzger, and G. Estrada, “Self-learning cloud controllers: Fuzzy Q- learning for knowledge evolution (short paper),” in Int’l Conference on Cloud and Autonomic Computing (IC- CAC 2015) Cambridge, USA, September 21-24, 2015, [Kephart & Chess, 2003] J. O. Kephart and D. M. Chess, “The vision of autonomic computing,” IEEE Computer, vol. 36, no. 1, pp. 41–50, 2003. [Klein et al. 2014] C. Klein, M. Maggio, K. Arzen, F. Hernandez-Rodriguez, “Brownout: building more robust cloud applications”. In: 36th Intl Conf. on Software Engineering (ICSE 2014), pp. 700–711. ACM, 2014 [Mann, 2016] Z. Mann, “Interplay of virtual machine selection and virtual machine placement”, in: 5th European Conf. on Service- Oriented and Cloud Computing, ESOCC’16, LNCS vol. 9846, pp. 137–151 (2016) [Metzger & Pohl, 2014] A. Metzger, K. Pohl, “Software product line engineering and variability management: Achievements and challenges,” in ICSE Future of Software Engineering Track (FOSE 2014), ACM, 2014, pp. 70–84. 21 SE2023
  • 22. © SSE, Prof. Dr. Andreas Metzger SOFTWARE SYSTEMS ENGINEERING Prof. Dr. K. Pohl References [Metzger et al. 2019] A. Metzger, A. Neubauer, P. Bohn, and K. Pohl, “Proactive process adaptation using deep learning ensembles,” in 31st Int’l Conf. on Advanced Information Systems Engineering (CAiSE 2019), LNCS, vol. 11483. Springer, 2019, pp. 547–562 [Metzger et al. 2020] A. Metzger, C. Quinton, Z. Á. Mann, L. Baresi, K. Pohl, “Realizing Self-Adaptive Systems via Online Reinforcement Learning and Feature-Model-guided Exploration”, Computing, Springer, March, 2022 [Metzger et al. 2020a] A. Metzger, C. Quinton, Z. Mann, L. Baresi, and K. Pohl, “Feature model-guided online reinforcement learning for self-adaptive services,” in 18th Int’l Conf. on Service-Oriented Computing (ICSOC 2020), LNCS 12571, Springer, 2020 [Metzger et al. 2020b] A. Metzger, T. Kley, and A. Palm, “Triggering proactive business process adaptations via online reinforcement learning,” in 18th Int’l Conf. on Business Process Management (BPM 2020), LNCS 12168. Springer, 2020e [Palm et al. 2020] A. Palm, A. Metzger, and K. Pohl, “Online reinforcement learning for self-adaptive information systems,” in 32nd Int’l Conf. on Advanced Information Systems Engineering (CAiSE 2020), LNCS 12127. Springer, 2020 [Porter et al., 2020] B. Porter, R. R. Filho, and P. Dean, “A survey of methodology in self-adaptive systems research,” in ACSOS. IEEE, 2020 [Salehie & Tahvildari, 2009] M. Salehie and L. Tahvildari, “Self-adaptive software: Landscape and research challenges,” TAAS, vol. 4, no. 2, 2009. 22 SE2023
  • 23. © SSE, Prof. Dr. Andreas Metzger SOFTWARE SYSTEMS ENGINEERING Prof. Dr. K. Pohl References [Siegmund et al. 2012] N. Siegmund, S. Kolesnikov, C. Kästner, S. Apel, D. Batory, M. Rosenmüller, G. Saake, G.: Predicting Performance via Automated Feature-interaction Detection. In: 34th Intl Conf. on Software Engineering (ICSE 2012), pp. 167–177, ACM, 2012 [Sutton & Barto, 2018] R. S. Sutton and A. G. Barto, Reinforcement Learning: An Introduction, 2nd ed. Cambridge, MA, USA: MIT Press, 2018 [Taylor & Stone, 2009] M. Taylor, P. Stone: Transfer learning for reinforcement learning domains: A survey. J. Mach. Learn. Res. 10, 1633–1685 (2009) [Wang et al., 2020] Hongbing Wang, Jiajie Li, Qi Yu, Tianjing Hong, Jia Yan, Wei Zhao: Integrating recurrent neural networks and reinforcement learning for dynamic service composition. Future Gener. Comput. Syst. 107: 551-563 (2020) [Weyns et al. 2013] Danny Weyns, Nelly Bencomo, Radu Calinescu, Javier Cámara, Carlo Ghezzi, Vincenzo Grassi, Lars Grunske, Paola Inverardi, Jean-Marc Jézéquel, Sam Malek, Raffaela Mirandola, Marco Mori, Giordano Tamburrelli: Perpetual Assurances for Self- Adaptive Systems. Software Engineering for Self-Adaptive Systems 2013: 31-63 [Weyns, 2021] Danny Weyns, Introduction to Self-Adaptive Systems: A Contemporary Software Engineering Perspective, Wiley, 2021. [Weyns et al., 2022] D. Weyns, I. Gerostathopoulos, N. Abbas, J. Andersson, S. Biffl, P. Brada, T. Bures, A. D. Salle, P. Lago, A. Musil, J. Musil, and P. Pelliccione, “Preliminary results of a survey on the use of self-adaptation in industry,” in 17th Intl Symp. on Software Engineering for Adaptive and Self-Managing Systems, SEAMS@ICSE 2022, 2022. [Xu et al., 2012] C. Xu, J. Rao, and X. Bu, “URL: A unified reinforcement learning approach for autonomic cloud management,” J. Parallel Distrib. Comput., vol. 72, no. 2, pp. 95–105, 2012 23 SE2023
  • 24. © SSE, Prof. Dr. Andreas Metzger SOFTWARE SYSTEMS ENGINEERING Prof. Dr. K. Pohl Fundamentals MAPE-K Reference Model [Kephart & Chess, 2003; Salehie & Tahvildari, 2009] • Architecture as defining characteristic of adaptive system [Weyns, 2020] SE2023 24 Example: Adaptive Web Shop • Monitor: Sudden increase in number of concurrent users (workload peak) • Analyze: Slow response of web shop if no adaptation • Plan: Deactivating optional recommendation feature • Execute: Replace dynamic recommendations with static banner Self-Adaptation Logic Analyze Monitor Execute Plan Knowledge Identify concrete adaptations Enact concrete adaptation Determine need for adaptation Collect and aggregate observations System Logic Sensors Effectors 0 e + 0 0 1 e + 0 5 2 e + 0 5 3 e + 0 5 4 e + 0 5 5 e + 0 5 6 e + 0 5 1 0 0 1 5 0 2 0 0 2 5 0 d $ w o r k l o a d Workload t
  • 25. © SSE, Prof. Dr. Andreas Metzger SOFTWARE SYSTEMS ENGINEERING Prof. Dr. K. Pohl Online Reinforcement Learning 25 MAPE-K Model Integrated Model RL Model Self-Adaptation Logic Analyze Monitor Execute Plan Knowledge Online RL for SAS Execute Policy (K) Monitor Action Selection (A + P) Policy Update Action A State S Reward R Next state S’ Action A State S Reward R Action Selection Next state S’ RL Agent Policy Policy Update Environ- ment Integrating MAPE-K and Reinforcement Learning [Metzger et al., 2022] SE2023

Editor's Notes

  1. Beobachtung von Veränderungen in Umgebung, Anforderungen und sich selbst Anpassungen von Struktur, Parametern und Verhalten -- M. Papazoglou, K. Pohl, M. Parkin, and A. Metzger, Eds., Service Research Challenges and Solutions for the Future Internet: S-Cube – Towards Mechanisms and Methods for Engineering, Managing, and Adapting Service-Based Systems, ser. LNCS. Heidelberg, Germany: Springer, 2010, vol. 6500.
  2. Trotz dieser Möglichkeiten, zeigen sich beim Einsatz von ML für AS konkrete Probleme, von denen ich auf zwei im weiteren verlauf genauer eingehen werde… -- Bradley Schmerl, David Garlan, Christian Kästner - CMU Danny Weyns – U Leuven Pooyan Jamshidi – U South Carolina Javier Camara – U York Hongbing Wang – U Nanjing Sven Tomforde – U Kiel --- N. Esfahani, E. Kouroshfar, and S. Malek, “Taming Uncertainty in Self-adaptive Software,” in Proceedings of the 19th ACM SIGSOFT Symposium and the 13th European Conference on Foundations of Software Engineering, ser. ESEC/FSE ’11, 2011, pp. 234–244. -- A. J. Ramirez, A. C. Jensen, and B. H. C. Cheng, “A taxonomy of uncertainty for dynamically adaptive systems,” in 7th International Symposium on Software Engineering for Adaptive and Self-Managing Systems, SEAMS, 2012, pp. 99–108. --
  3. Ceccato: 0,03 / minute = 0,0005 Petersen: Efficiency: All Faults: ca. 4 faults/hour = 0,066 / minute = 0,001 / sekunde
  4. Cf. Mean efficiency of tool-supported debugging: 0.03; mean efficiency of requirement/design Inspections 0.06
  5. Beobachtung von Veränderungen in Umgebung, Anforderungen und sich selbst Anpassungen von Struktur, Parametern und Verhalten -- M. Papazoglou, K. Pohl, M. Parkin, and A. Metzger, Eds., Service Research Challenges and Solutions for the Future Internet: S-Cube – Towards Mechanisms and Methods for Engineering, Managing, and Adapting Service-Based Systems, ser. LNCS. Heidelberg, Germany: Springer, 2010, vol. 6500.