Fabrizio Pastore TORACLE-2021 @ESEC/FSE 2021

Metamorphic Security Testing
for Web Systems
TORACLE @FSE (23/8/2021)
based on paper presented at ICST’20
(+ additional results)
Phu X. Mai*, Fabrizio Pastore*, Arda Goknil*, Lionel Briand*#
* SnT Centre, University of Luxembourg
#School of EECS, University of Ottawa

The Oracle Problem in Web Systems Security Testing
Security vulnerabilities are subtle
Discovered when testing with many inputs
Specifying expected results is infeasible

Metamorphic Testing
Alleviates the Oracle Problem
• Metamorphic Relations: necessary properties of the program under test in relation to multiple inputs and their expected outputs
• Example (shortest path in a graph G with nodes a to f):
x1=(G,a,f) ∧ x2=(G,f,a) ⇒ len(shortPath(x1))=len(shortPath(x2))
• x1 is the source input, x2 is the follow-up input
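The shortest-path relation can be checked mechanically. The following is a minimal Python sketch, assuming an undirected graph; the edges of G and the helper names `short_path` and `relation_holds` are made up for illustration and do not come from the slides.

```python
from collections import deque

def short_path(graph, start, goal):
    """Return one shortest path (list of nodes) from start to goal via BFS, or None."""
    parents = {start: None}
    queue = deque([start])
    while queue:
        node = queue.popleft()
        if node == goal:
            path = []
            while node is not None:
                path.append(node)
                node = parents[node]
            return path[::-1]
        for neighbour in graph[node]:
            if neighbour not in parents:
                parents[neighbour] = node
                queue.append(neighbour)
    return None

# Hypothetical undirected graph G over the nodes a..f (edges are invented).
G = {
    "a": ["b", "c"],
    "b": ["a", "d"],
    "c": ["a", "e"],
    "d": ["b", "f"],
    "e": ["c", "f"],
    "f": ["d", "e"],
}

def relation_holds(graph, src, dst):
    source_output = short_path(graph, src, dst)     # x1 = (G, a, f)
    follow_up_output = short_path(graph, dst, src)  # x2 = (G, f, a)
    return len(source_output) == len(follow_up_output)

print(relation_holds(G, "a", "f"))
```

No expected path is specified anywhere: the check only compares the two outputs with each other, which is how the relation sidesteps the oracle problem.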

Combined with an automated strategy to select source inputs, metamorphic relations enable automated testing

Metamorphic Testing of Web Systems
• Source input: a sequence of valid interactions with the system
• Follow-up input: generated by altering valid interactions as an attacker would do
• Relations: capture properties that hold when the system is not vulnerable
• Example source input: {login(Admin), RequestURL(settings_page)}
• Example follow-up input: {login(User1), RequestURL(settings_page)}

• Relation example: if the follow-up user cannot access the URL through their GUI, then the outputs of the source and follow-up inputs should be different
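To make the authorization relation concrete, here is a small Python sketch run against a toy in-memory system. Everything in it is hypothetical (the permission tables, the `execute` function, and the `enforce_access_control` switch used to simulate a vulnerable build); it only illustrates the shape of the check, not the authors' implementation.

```python
# Toy stand-in for a Web system: pages each user is allowed to see (hypothetical).
PERMISSIONS = {"Admin": {"home_page", "settings_page"}, "User1": {"home_page"}}
# Pages each user can reach by navigating the GUI (would come from crawled data).
GUI_REACHABLE = {"Admin": {"home_page", "settings_page"}, "User1": {"home_page"}}

def execute(input_sequence, enforce_access_control=True):
    """Run a sequence of (action, argument) pairs and return one output per action."""
    user, outputs = None, []
    for action, arg in input_sequence:
        if action == "login":
            user = arg
            outputs.append("logged in")
        elif action == "RequestURL":
            allowed = arg in PERMISSIONS[user] or not enforce_access_control
            outputs.append(f"content of {arg}" if allowed else "error: access denied")
    return outputs

def relation_holds(source, follow_up, follow_up_user, **system_options):
    url = source[-1][1]
    if url in GUI_REACHABLE[follow_up_user]:
        return True  # premise is false, so the relation holds trivially
    source_output = execute(source, **system_options)[-1]
    follow_up_output = execute(follow_up, **system_options)[-1]
    return source_output != follow_up_output

source = [("login", "Admin"), ("RequestURL", "settings_page")]
follow_up = [("login", "User1"), ("RequestURL", "settings_page")]
print(relation_holds(source, follow_up, "User1"))
print(relation_holds(source, follow_up, "User1", enforce_access_control=False))
```

With access control enforced the two outputs differ and the relation holds; with the check disabled User1 receives the same settings page as Admin and the relation is violated.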

Automated Metamorphic Security Testing Process
1. Specify/Select Metamorphic Relations: from the Catalog of Metamorphic Relations, producing the Selected Metamorphic Relations
2. Translate Metamorphic Relations to Java: producing Executable Metamorphic Relations in Java
3. Execute a Web-crawler: producing the Source Inputs Data
4. Execute the Metamorphic Testing Algorithm: producing the Test Result (e.g., MR1: PASS, MR2: FAIL, MR3: PASS, MR4: PASS)

Security Metamorphic Relation Language (SMRL)

SMRL Basic Features
• Extension of Xbase:
  • Java-like
  • Automated generation of Java executables from specifications
• SMRL provides seven boolean operators
  • IMPLIES, AND, OR, TRUE, FALSE, and NOT
  • EQUAL
    • Evaluates the equality of two arguments
    • Defines follow-up inputs
    • Enables writing metamorphic relations in declarative form
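As a rough illustration of how such operators read when a relation is written declaratively, the sketch below models them as plain Python functions. This is an assumption-laden analogy, not SMRL itself: in particular, EQUAL is modelled here as a pure equality check, and its role in defining follow-up inputs is not reproduced.

```python
# Plain-Python analogues of the operator names listed on the slide (illustrative only).
TRUE, FALSE = True, False

def IMPLIES(premise, conclusion):
    return (not premise) or conclusion

def AND(a, b):
    return a and b

def OR(a, b):
    return a or b

def NOT(a):
    return not a

def EQUAL(a, b):
    # Simplification: only compares; does not bind a follow-up input.
    return a == b

# A relation of the form "premise IMPLIES expected property".
outputs_differ = NOT(EQUAL("content of settings_page", "error: access denied"))
print(IMPLIES(TRUE, outputs_differ))
```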

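Data Types
[Class diagram]
• Input types: InputSequence, composed of 1..* Action; Session and User, each associated with multiplicity 0..1
• Action kinds: Request URL, Anchor click, Button click, Text fill
• Output types: OutputSequence, composed of 1..* HtmlOutput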

Data Representation Functions
Data function: Description
• Input(int n): Identifies the nth input sequence referred to in an MR.
• User(int n): Identifies the nth user referred to in an MR.
• User(): Identifies the 1st user referred to in an MR.
• Action(int n): Identifies the nth input action referred to in an MR.
• Session(int n): Identifies the nth Web session referred to in an MR.
• Output(Input n): Identifies the sequence of outputs generated by the nth input.
• Output(Input n, int i): Identifies the output generated by the ith action of the nth input.
• …

Web-specific Functions
• changeCredentials( Input i, User u)
  • Modifies an input sequence so that the credentials of the specified user are used in the login action
  • Example: changeCredentials( Input(1), User() )
    • Input(1): {login(Admin), RequestURL(settings_page) }
    • User(): User1
    • Result: {login(User1), RequestURL(settings_page) }
• cannotReachThroughGUI( User u, String URL)
  • Returns true if, based on the crawled data, a URL cannot be reached by the given user by exploring the user interface of the system
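The behaviour described for the two Web-specific functions can be approximated in a few lines of Python. This is a sketch under assumptions: input sequences are modelled as lists of (action, argument) pairs, the crawled data is a hypothetical dictionary from user to reachable URLs, and the snake_case names are stand-ins for the SMRL functions rather than their real signatures.

```python
def change_credentials(input_sequence, user):
    """Return a copy of the sequence whose login action uses the given user."""
    return [("login", user) if action == "login" else (action, arg)
            for action, arg in input_sequence]

def cannot_reach_through_gui(user, url, crawled_urls):
    """True if the crawled data shows no GUI path from this user to the URL."""
    return url not in crawled_urls.get(user, set())

# Values taken from the slide's example.
input_1 = [("login", "Admin"), ("RequestURL", "settings_page")]
result = change_credentials(input_1, "User1")
print(result)

# Hypothetical crawled data: URLs each user reached while the crawler explored the GUI.
crawled = {"Admin": {"home_page", "settings_page"}, "User1": {"home_page"}}
print(cannot_reach_through_gui("User1", "settings_page", crawled))
```

The original sequence is left untouched, so the same source input can be reused to derive follow-up inputs for several users.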

Metamorphic Relation Example: Bypass Authorization Schema

Example:
[Figure: SMRL source of the relation, annotated to highlight its data functions, boolean operators, and Web-specific functions]

Our metamorphic testing algorithm executes each MR multiple times, to ensure that every possible combination of …

Source Inputs of Type InputSequence
• {login(Admin), RequestURL(home_page)}
• {login(User1), RequestURL(home_page)}
• {login(User1), RequestURL(project1_page)}
• {login(User2), RequestURL(project2_page)}
Source Inputs of Type User
• User1
• User2
• Admin
Tested Combinations (source inputs and resulting follow-up inputs)
[Figure: source InputSequences paired with source Users (+ User1, + User2, …)]
• {login(User1), RequestURL(project_page)} + User2 gives {login(User2), RequestURL(project_page)}
• …
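One simple way to enumerate such combinations is a Cartesian product of the two source-input pools. The Python sketch below uses the inputs listed on the slide; the `with_user` helper is hypothetical, and the sketch pairs every sequence with every user without filtering, which may differ from what the actual algorithm does.

```python
from itertools import product

input_sequences = [
    [("login", "Admin"), ("RequestURL", "home_page")],
    [("login", "User1"), ("RequestURL", "home_page")],
    [("login", "User1"), ("RequestURL", "project1_page")],
    [("login", "User2"), ("RequestURL", "project2_page")],
]
users = ["User1", "User2", "Admin"]

def with_user(sequence, user):
    """Follow-up input: the same sequence with the login performed by another user."""
    return [("login", user) if action == "login" else (action, arg)
            for action, arg in sequence]

# Each entry is (source sequence, source user, derived follow-up sequence).
combinations = [(seq, user, with_user(seq, user))
                for seq, user in product(input_sequences, users)]
print(len(combinations))
```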

Java Integration
• Metamorphic relations are automatically translated into executable Java code
• They can be selected in JUnit test cases

Deriving Source Inputs from
Data Crawled with Crawljax

Deriving Source Inputs from Test Scripts
• We can derive source inputs from test scripts manually implemented for functional testing
  • We support Selenium-based scripts
  • A test script represents an InputSequence

Metamorphic Testing Execution
[Figure: the Metamorphic Testing Algorithm (step 4) takes the Executable Metamorphic Relation in Java and the Source Inputs data. For a source input (Action1, Action2, Action3) it derives a follow-up input (Action1, Action2’, Action3). The Executor runs both, producing the source output (Output1, Output2, Output3) and the follow-up output (Output1, Output2’’, Output3). The relation is then verified, giving the test result (MR PASS/FAIL). The steps repeat.]
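The execution loop in the figure can be outlined as follows. This is a Python sketch of the general idea only: the function names, the way the follow-up input is derived, and the toy relation are assumptions chosen to mirror the Action/Output labels in the figure.

```python
def run_metamorphic_test(source_inputs, make_follow_up, execute, relation):
    """Return "PASS" if the relation holds for every source input, else "FAIL"."""
    for source in source_inputs:
        follow_up = make_follow_up(source)
        source_output = [execute(action) for action in source]
        follow_up_output = [execute(action) for action in follow_up]
        if not relation(source, follow_up, source_output, follow_up_output):
            return "FAIL"
    return "PASS"

# Hypothetical pieces used only to exercise the loop.
def alter_second_action(source):
    return [source[0], source[1] + "'"] + source[2:]

def toy_execute(action):
    return action.replace("Action", "Output")

def untouched_actions_agree(source, follow_up, source_output, follow_up_output):
    # Toy relation: actions that were not altered must produce the same output.
    return all(so == fo
               for s, f, so, fo in zip(source, follow_up, source_output, follow_up_output)
               if s == f)

result = run_metamorphic_test(
    [["Action1", "Action2", "Action3"]],
    alter_second_action, toy_execute, untouched_actions_agree)
print(result)
```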

Metamorphic Relations Catalog
• Catalog 1. Includes 23 Security Metamorphic Relations derived from OWASP testing guidelines

Vulnerability type          # Metamorphic relations
Broken Authentication       6
Sensitive Data Exposure     5
Broken Access Control       8
Security Misconfiguration   3
Vulnerable Components       1

• Catalog 2. Ongoing. Shall cover 45% of vulnerability types listed in the CWE Design Principles view. Implemented 30%.

Broken Authentication
• Objective: verify that a session always changes after a logout
• Source input: sequence of actions performed after log-in
• Follow-up input: add a logout action after an action of the source InputSequence
• Relation: session IDs before and after the logout action should be different
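A toy version of this relation is sketched below in Python. The `ToySystem` class, its `renew_session_on_logout` switch (used to simulate a vulnerable system), and the action names are all invented for illustration.

```python
import itertools

class ToySystem:
    """Hypothetical system that reports the current session ID after each action."""
    def __init__(self, renew_session_on_logout=True):
        self._ids = itertools.count(1)
        self._renew = renew_session_on_logout
        self.session_id = None

    def perform(self, action):
        if action == "login":
            self.session_id = f"S{next(self._ids)}"
        elif action == "logout" and self._renew:
            self.session_id = f"S{next(self._ids)}"
        return self.session_id

def session_changes_after_logout(source, position, system):
    # Follow-up input: the source sequence with a logout inserted after `position`.
    follow_up = source[:position + 1] + ["logout"] + source[position + 1:]
    session_ids = [system.perform(action) for action in follow_up]
    return session_ids[position] != session_ids[position + 1]

source = ["login", "view_profile", "edit_profile"]
print(session_changes_after_logout(source, 1, ToySystem()))
print(session_changes_after_logout(source, 1, ToySystem(renew_session_on_logout=False)))
```

The relation holds for the system that issues a fresh session on logout and is violated by the one that keeps the old session ID.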

Sensitive Data Exposure
• Objective: verify that a user cannot retrieve sensitive data by simply altering a URL parameter value
• Source input: sequence of actions including at least one action containing a URL parameter (action X)
• Follow-up input: change the URL parameter value for the action X (action X’) using a value observed only with other users
• Relation: the output of the action X’ either contains an error message or contains data that is not sensitive
  • e.g., it should contain data that has already been retrieved by the same user in the recorded source inputs
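The relation can be illustrated with a toy record store in Python. The records, the `request` function, the `check_owner` switch that simulates a vulnerable system, and the `observed` parameter values are hypothetical; "not sensitive" is approximated, as in the slide's example, by "already retrieved by the same user".

```python
# Hypothetical records: parameter value -> (owner, data).
RECORDS = {"1": ("User1", "invoice of User1"), "2": ("User2", "invoice of User2")}

def request(user, record_id, check_owner=True):
    """Simulates requesting a URL with parameter id=record_id as the given user."""
    owner, data = RECORDS[record_id]
    if check_owner and owner != user:
        return "error: not authorized"
    return data

# URL parameter values observed per user in the recorded source inputs.
observed = {"User1": {"1"}, "User2": {"2"}}

def relation_holds(user, check_owner=True):
    # Data the user already retrieved in the source inputs.
    already_seen = {request(user, value, check_owner) for value in observed[user]}
    # Follow-up values: those observed only with other users.
    others = set().union(*(vals for u, vals in observed.items() if u != user))
    for value in others - observed[user]:
        output = request(user, value, check_owner)  # action X'
        if not (output.startswith("error") or output in already_seen):
            return False
    return True

print(relation_holds("User1"))
print(relation_holds("User1", check_owner=False))
```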

Evaluation
• RQ1. (Complementarity) What types of security testing activities not automated by other approaches can be automated by MST?
• RQ2. (Completeness) What security vulnerability types can be discovered by MST?
• RQ3. (Effectiveness) Is MST effective?

RQ1: What types of security testing activities not automated by other approaches can be automated by our solution?
• Analyzed 90 security testing activities described in the OWASP Web security testing book to
  • identify which existing approach can be adopted to automate them
  • determine if MT can compensate for the remaining testing activities

RQ1: Results

Test automation strategy                    # OWASP activities automated
Relying on implicit oracle                  2
Catalog-based                               6
Not testing (e.g., architecture analysis)   19
Vulnerability-specific                      22
Not addressed yet                           41
Metamorphic testing                         16 of 41


RQ2: What security vulnerability types can be discovered by MST?
• Methodology
  • analyzed each vulnerability type (weakness) reported in the Design Principles view of the CWE database (CWE-DP)
    • 223 weaknesses grouped by 12 design principles
  • manually inspected the description, a few concrete vulnerabilities (CVE), and common attack patterns (CAPEC) associated with each weakness
  • evaluated the possibility of writing a metamorphic relation to discover the weakness
  • reported the number of weaknesses that can be automatically tested for each of the categories listed in CWE-DP

CWE-DP Weaknesses addressed by MST

Security Design Principle   # Weaknesses   Addressed by MST
Audit                       6              0 (0%)
Authenticate Actors         28             15 (54%)
Authorize Actors            60             32 (53%)
Cross Cutting               9              5 (56%)
Encrypt Data                38             5 (13%)
Identify Actors             12             9 (75%)
Limit Access                8              3 (38%)
Limit Exposure              6              0 (0%)
Lock Computer               1              0 (0%)
Manage User Sessions        6              4 (67%)
Validate Inputs             39             26 (67%)
Verify Message Integrity    10             2 (20%)
Total                       223            101 (45%)

RQ3 – Is MST effective?
• Applied our approach to software systems affected by known vulnerabilities
• Considered all our 23 MRs

Vulnerability type          # MRs   Targeted vulnerabilities (EDLAH2, Jenkins 1.121.1, Joomla 3.8.7)
Broken Authentication       6       2 2
Sensitive Data Exposure     5       1
Broken Access Control       8       5 2
Security Misconfiguration   3       1 1
Vulnerable Components       1

RQ3 – Data Collection
• Data collection:
  • Executed Crawljax against the three systems for a maximum of five hours or until exploration was exhausted
  • Implemented test scripts that exercise features not exercised by Crawljax

RQ3 – Measurements
• Sensitivity
  • ratio of vulnerabilities being discovered
• Specificity
  • ratio of follow-up inputs that do not trigger any vulnerability and do not lead to any failure
  • it measures the time saved from inspecting unwarranted failures
  • it is the complement of the false positive rate
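Both measures are simple ratios, sketched here in Python. The sensitivity call uses counts reported in the deck's results (12 of 14 vulnerabilities discovered with Crawljax and manual scripts); the specificity counts are made-up placeholders, and reading specificity as true negatives over true negatives plus false positives is an interpretation of the definition above.

```python
def sensitivity(discovered, total_vulnerabilities):
    """Ratio of vulnerabilities that were discovered."""
    return discovered / total_vulnerabilities

def specificity(true_negatives, false_positives):
    """Among follow-up inputs that trigger no vulnerability, the ratio that
    also lead to no failure (the complement of the false positive rate)."""
    return true_negatives / (true_negatives + false_positives)

# Counts from the deck's sensitivity results (Crawljax & Manual).
print(f"{sensitivity(12, 14):.2%}")

# Placeholder counts, not taken from the evaluation.
print(f"{specificity(true_negatives=990, false_positives=10):.2%}")
```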

RQ3 – Sensitivity
(Ratio of vulnerabilities being discovered)

Case study   Total vulnerabilities   Discovered (Crawljax)   Discovered (Crawljax & Manual)   Sensitivity (Crawljax)   Sensitivity (Crawljax & Manual)
EDLAH2       4                       3                       4                                75%                      100%
Jenkins      8                       4                       6                                50%                      75%
Total        12                      7                       10                               58.33%                   83.33%

• We do not discover vulnerabilities that require specific environment settings
  • Jenkins configuration modified during test execution
  • Jenkins overloaded

RQ3 – Sensitivity
(Ratio of vulnerabilities being discovered)

Case study   Total vulnerabilities   Discovered (Crawljax)   Discovered (Crawljax & Manual)   Sensitivity (Crawljax)   Sensitivity (Crawljax & Manual)
EDLAH2       4                       3                       4                                75%                      100%
Jenkins      8                       4                       6                                50%                      75%
Joomla       2                       1                       2                                50%                      100%
Total        14                      8                       12                               57.14%                   85.71%

• We do not discover vulnerabilities that require specific environment settings
  • Jenkins configuration modified during test execution
  • Jenkins overloaded

RQ3 – Sensitivity
(Ratio of vulnerabilities being discovered)
• Discovered a new vulnerability in Jenkins
  • CVE-2020-2162: Stored XSS vulnerability in file parameters

RQ3 – Specificity
(Ratio of follow-up inputs that neither trigger any vulnerability nor lead to failures)

Case study   Follow-up inputs exercised by MST   Follow-up inputs inspected      Specificity
             Crawljax    Crawljax & Manual       Crawljax    Crawljax & Manual   Crawljax    Crawljax & Manual
EDLAH2       832         833                     3           4                   100%        100%
Jenkins      31303       33495                   74          176                 99.889%     99.893%
Joomla       22185       9395                    39          43                  99.525%     99.573%
Total        54320       43723                   136         223                 99.817%     99.826%

• The approach automatically tests the systems with many inputs, and the number of inputs to be erroneously inspected is minimal

Automated Metamorphic Security Testing Process
[Figure: the four-step process: (1) specify/select metamorphic relations, (2) transform metamorphic relations to Java, (3) execute a Web-crawler, (4) execute the metamorphic testing algorithm]
Experiments show
• High sensitivity and specificity
• Discovered one unknown vulnerability in Jenkins (CVE-2020-2162)
https://sntsvv.github.io/SMRL/
