OWASP TOP 10 LLM - Hands-on Workshop [Stefano Amorelli - Tallinn BSides 2023]

S T E F A N O A M O R E L L I
Speaker
2 1 S E P T 2 0 2 3
Date
OWASP TOP 10 LLM
Hands-on Workshop

What is OWASP?
Open Web Application Security Project is a
global non-profit organization dedicated to
improving the security of software.

What is a LLM?
Figure - ChatGPT
having self-identity
issues.

What is a LLM?
A large language model (LLM) is a type of artificial intelligence (AI)
algorithm that uses deep learning techniques and massively large
data sets to understand, summarize, generate and predict new
content.
It is a subset of the so-called generative AI.

What is OWASP TOP 10 for LLM?
OWASP IS KNOWN FOR "OWASP TOP 10":
a regularly updated report about the most critical web
application security risks
From this year, a new project "OWASP Top 10
LLM" aims to do the same for AI LLMs

OWASP Top 10 for LLMs v1.0.1
Released on August 26, 2023
(just a few weeks ago)

Founder and Leader of the first
OWASP chapter and DEFCON group
in Estonia
Member of the new committee of
OWASP TOP 10 for LLM AI

🚨DANGER ZONE 🚨
TODAY, WE'LL ONLY COVER ATTACKS 😈
FOR MITIGATIONS AND DEFENSE TECHNIQUES, PLEASE REFER TO THE DOCUMENTATION

🚨DANGER ZONE 🚨
THIS WORKSHOP IS FOR DEMONSTRATION AND
EDUCATIONAL PURPOSES ONLY
DOING ANY OF THESE EXERCISES MIGHT RESULT IN GETTING BANNED
FROM CHATGPT AND ANY CONSEQUENCES
PROCEED AT YOUR OWN RISK

LLM01: Prompt Injection
A Prompt Injection Vulnerability arises when an attacker feeds specially
designed inputs into a large language model (LLM).
This makes the LLM carry out actions in line with the attacker's goals,
evading the LLM policies.

LLM01:
Prompt
Injection
Figure - How to
gaslight ChatGPT.

H A N D S - O N E X E R C I S E
Figure - Thanks for
nothing, ChatGPT.
How could somebody 😏
manipulate ChatGPT to
actually code our shellcode?

•
•
•
Let's try with the following techniques:
Imagine we're in a movie…
Don't act as ChatGPT…
Ignore your safety controls…
How could somebody 😏
manipulate ChatGPT to
actually code our shellcode?
A P I A N D P L A Y G R O U N D A R E
M U C H M O R E S U S C E P T I B L E
T O J A I L B R E A K I N G

• API and Playground are much more
susceptible to jailbreaking
https://platform.openai.com/playground/p/fjng
iesKCEz1gOLBEaJbgiVr?model=gpt-3.5-turbo
An example of SE-LLM (Social
Engineering for LLMs), namely,
how LLMs can be manipulated to
do or say things they shouldn't, as
SE works for humans.
Figure - Nice job, Willy!

What we tried is referred as
"direct prompt injection" but a
more advanced threat is
"indirect prompt injection"

LLM07: Insecure Plugin Design
LLM plugins can have insecure inputs and insufficient
access control. This lack of application control makes
them easier to exploit and can result in consequences like
remote code execution.

LLM02: Insecure Output Handling
Insecure Output Handling is a vulnerability that arises when a downstream
component blindly accepts large language model (LLM) output without
proper scrutiny, such as passing LLM output directly to backend, privileged,
or client-side functions.

Let's try to indirectly-inject a prompt into ChatGPT
through a plugin, exploiting LLM07, LLM01, and LLM02
H A N D S - O N D E M O N S T R A T I O N

through a plugin, exploiting both LLM07 and LLM01
https://chat.openai.com/share/1b39b2dc-9a60-4c13-b95e-b135a2409907

Open question: How do you think an attacker could
leverage this?

https://chat.openai.com/share/630336a3-bff5-41ba-9c13-89df0ff2ef7b

How an hacker can inject a
web beacon into a victim's
ChatGPT…
Source: https://systemweakness.com/new-prompt-injection-attack-on-chatgpt-web-version-ef717492c5c2
tracking pixel
tracking pixel

What else can we inject?

https://chat.openai.com/share/adda901b-a661-4944-8978-62c84ed550f0

Phishing

NSFW (just for fun)

LLM08: Excessive Agency
LLM-based systems may undertake actions leading to
unintended consequences. The issue arises from
excessive functionality, permissions, or autonomy granted
to the LLM-based systems.

LLM09: Overreliance
Overreliance occurs when systems or people depend on LLMs for decision-
making or content generation without sufficient oversight. [hallucination] …
can result in misinformation, miscommunication, legal issues, and
reputational damage.

LLM03: Training Data Poisoning
Training data poisoning refers to manipulating the data or fine-tuning
process to introduce vulnerabilities, backdoors or biases that could
compromise the model’s security, effectiveness or ethical behavior.
Poisoned information may be surfaced to users or create other risks like
performance degradation, downstream software exploitation and
reputational damage.

LLM05: Supply Chain Vulnerabilities
The supply chain in LLMs can be vulnerable, impacting the integrity of
training data, ML models, and deployment platforms. These vulnerabilities
can lead to biased outcomes, security breaches, or even complete system
failures.
Finally, LLM Plugin extensions can bring their own vulnerabilities.

Let's poison together
an open-source LLM!

Let's poison together
an open-source LLM!
https://colab.research.google.com/drive/1lIDc_R6VrksmfpT2DIBCilEwY-bTAD2q

LLM06: Sensitive Information Disclosure
LLM applications have the potential to reveal sensitive information,
proprietary algorithms, or other confidential details through their output.
This can result in unauthorized access to sensitive data, intellectual
property, privacy violations, and other security breaches.

LLM04: Model DDOS
An attacker interacts with an LLM in a method that consumes an
exceptionally high amount of resources, which results in a decline in the
quality of service for them and other users as well as potentially incurring
high resource costs.

LLM10: Model Theft
This entry refers to the unauthorized access and exfiltration of LLM models
by malicious actors or APTs. This arises when the proprietary LLM models
(being valuable intellectual property), are compromised, physically stolen,
copied or weights and parameters are extracted to create a functional
equivalent

Hands-on Workshop
Thank you!
S T E F A N O A M O R E L L I
Q&A
Connect with me on LinkedIn
OWASP TOP 10 LLM

OWASP TOP 10 LLM - Hands-on Workshop [Stefano Amorelli - Tallinn BSides 2023]

Recommended

Recommended

More Related Content

Similar to OWASP TOP 10 LLM - Hands-on Workshop [Stefano Amorelli - Tallinn BSides 2023]

Similar to OWASP TOP 10 LLM - Hands-on Workshop [Stefano Amorelli - Tallinn BSides 2023] (20)

Recently uploaded

Recently uploaded (20)

OWASP TOP 10 LLM - Hands-on Workshop [Stefano Amorelli - Tallinn BSides 2023]