Exploring billion states of a program like a pro. How to cook your own fast and scalable DBI-based security tool. A case study.

© 2015 IBM Corporation
How to cook your own fast and scalable DBI-based security tool. A case study
Exploring billion states of a
program like a pro
2017

IBM Research - Haifa
About me
 PhD (Tomsk State University of Control Systems and Radioelectronics)
– Vulnerabilities detection in machine code (x86)
 Cyber Security Researcher at IBM Research Lab in Haifa, Israel
– R&D in technologies for highly-evasive malware analysis and detection
 A main contributor to DynamoRIO/DrMemory DBI frameworks
2017

Outline
 Dynamic binary instrumentation technique
– General idea and technique implementation
– DynamoRIO and Intel PIN frameworks comparison
– Possible application fields
 Example I. Heap-based bug detection using DBI
– General idea & motivation
– Implementation & issues
– Solutions for described issues
 Example II. Dynamic malware analysis
– General idea & motivation
– Implementation
– Tool demo
 Conclusion
2017

Dynamic binary instrumentation (DBI)
DBI is a technique of analyzing the behavior of a binary application at runtime through the
injection of instrumentation code
2017

How does it work ?
Application
launcher.exe
core.dll
DBI Engine
CreateProcess (suspended)
(1)
Inject core.dll
(2)
Application in memory
Windows kernel
Takebasicblock
(4)(3)
Hook entry point
core.dll + user dll/dlls
shared system dlls
ins1
ins2
ins3
insN
.
.
Code cache
basic block
transformation
(5)
ins1
inst_ins1
inst_ins2
ins2
inst_ins3
inst_ins4
ins3
inst_ins4
inst_ins5
insN
inst_insM
inst_insM+1
Execute &
calculate
addr of next
basic block
(6)
Takenextbasicblock
(7)
.
.

Frameworks Comparison
DynamoRIO Intel PIN
Redistribution
model
Open-source, BSD – license Proprietary, no-source
code available
Supported
architectures
x86, x86-64, ARM, AArch64 x86, x86-64
Supported
Platforms
Linux, Windows, MacOS,
Android
Linux, Windows, MacOS,
Android
Average
runtime overhead
108% (no tool)
139% (BBs counter)
130% (no tool)
162% (BBs counter)
Language C/C++ C/C++ (some Python
wrappers available)
Technology Binary code transformation callout/trampolines
2017

Instructions Counting. Example
DynamoRIOIntel PIN
2017

Instrumentation Granularity
Instruction level
(instrument all executed instructions)
Basic block level
(instrument all executed basic blocks)
Function call level
(instrument all executed calls)
Module level
(instrument all load/unload module events)
Events
• exceptions/signals
• syscalls
• thread/process creation/deletion
2017

Application
 Software security analysis & testing
– Support fuzzing (code coverage assessment, tainted input tracking)
– Bugs detection (overflows, use-after-free, uninitialized access and etc.)
– Symbolic execution
– Software bugs exploitability assessment
 Malware analysis
– Execution tracing
– Automatic unpacking
 Reverse-engineering
– Control-flow graph visualization
– Debugging
– Taint-tracking
 Other non-security fields
– Performance evaluation
– Memory leak detection
– Optimization
2017

Example I. Bugs detection
2017

WinHeap Explorer Tool
 WinHeap Explorer is a system for heap-based bug detection with the lowest runtime
overhead build on top of Intel PIN framework.
 Advantages:
– Light-weight instrumentation support (shown further)
– Lowest runtime-overhead
– Open-source (BSD license)
2017

Motivation Example
2017

Motivation Example
Heap memory block for a
Heap Layout
pMethod 1 pMethod 2 pMethod 3 pMethod 4
Virtual table for pFileOpen
2017

Motivation Example
Heap memory block for a pEvilMet 1 pMethod 2 pMethod 3 pMethod 4
Virtual table for pFileOpen
Heap Layout
2017

Detection of Heap-based Bugs. General Idea
Code Heap
instruction #1
instruction #2
instruction #3
instruction #4
instruction #5
0x0 – 0x3
0x4 – 0x7
0x8 – 0xB
0xC – 0xF
0x10 – 0x13
0x14 – 0x17
0x80 – 0x83
0x84 – 0x87
0x88 – 0x8B
0x8C – 0x8F
0x90 – 0x93
0x94 – 0x98
…
2017

Code Heap
instruction #1
instruction #2
instruction #3
instruction #4
instruction #5
0x0 – 0x3
0x4 – 0x7
0x8 – 0xB
0xC – 0xF
0x10 – 0x13
0x14 – 0x17
0x80 – 0x83
0x84 – 0x87
0x88 – 0x8B
0x8C – 0x8F
0x90 – 0x93
0x94 – 0x98
…
Allocated
memory
block #1
Freed
memory
block #2
2017

Code
Correct instruction #1
Incorrect instruction #3
[overflow]
[access block #1]
[access block #1]
[underflow]
[use after free]
Heap
0x0 – 0x3
0x4 – 0x7
0x8 – 0xB
0xC – 0xF
0x10 – 0x13
0x14 – 0x17
0x80 – 0x83
0x84 – 0x87
0x88 – 0x8B
0x8C – 0x8F
0x90 – 0x93
0x94 – 0x98
…
Allocated
memory
block #1
Freed
memory
block #2
2017

Code
[overflow]
[access block #1]
[access block #1]
[underflow]
[use after free]
Heap
0x0 – 0x3
0x4 – 0x7
0x8 – 0xB
0xC – 0xF
0x10 – 0x13
0x14 – 0x17
0x80 – 0x83
0x84 – 0x87
0x88 – 0x8B
0x8C – 0x8F
0x90 – 0x93
0x94 – 0x98
…
Allocated
memory
block #1
Freed
memory
block #2
redzone
redzone
marked as freed
2017

Code
[overflow]
[access block #1]
[access block #1]
[underflow]
[use after free]
Heap
0x0 – 0x3
0x4 – 0x7
0x8 – 0xB
0xC – 0xF
0x10 – 0x13
0x14 – 0x17
0x80 – 0x83
0x84 – 0x87
0x88 – 0x8B
0x8C – 0x8F
0x90 – 0x93
0x94 – 0x98
…
Allocated
memory
block #1
Freed
memory
block #2
redzone
redzone
marked as freed
𝑖𝑓 𝑝𝑜𝑖𝑛𝑡𝑒𝑟 ∈ 𝑟𝑒𝑑𝑧𝑜𝑛𝑒 𝑜𝑟 𝑝𝑜𝑖𝑛𝑡𝑒𝑟 ∈ 𝑓𝑟𝑒𝑒𝑑 𝑡ℎ𝑒𝑛 𝒃𝒖𝒈
2017

WinHeap Explorer. Implementation #1
Step 1. Instrument all heap management API calls (allocation/reallocation/freeing)
Step 2. Save redzones and heap block statuses (freed or in use) in a hashtable
Step 3. Instrument all executed instructions, dynamically check that an instruction doesn’t
access redzone or previously freed memory block
2017

access redzone or previously freed memory block.
Results:
 Runtime overhead: x310-x850
– Launching Mozilla Firefox = ~498M instructions (or 8 minutes to show first window)
– Launching Acrobat Reader = ~84M instructions (or 3 minutes to show first window)
– Launching PowerPoint = ~860M instructions (or 12 minutes to show first window)
 Memory overhead: x90-x120
2017

access redzone or previously freed memory block.
Results:
 Runtime overhead: x310-x850
– Launching Mozilla Firefox = ~498M instructions (or 8 minutes to show first window)
– Launching Acrobat Reader = ~84M instructions (or 3 minutes to show first window)
– Launching PowerPoint = ~860M instructions (or 12 minutes to show first window)
 Memory overhead: x90-x120
Absolutely unacceptable
2017

Shadow Memory Approach
2017

Windows Heap Management APIs Architecture
kernel32.dll
HeapAlloc
GlobalAlloc
LocalAlloc
Memory allocation
msvcr*.dll
malloc
calloc
new []
ole32.dll
CoTaskMemAlloc
kernel32.dll
HeapReAlloc
GlobalReAlloc
LocalReAlloc
Memory reallocation
msvcr*.dll
realloc
ole32.dll
CoTaskMemrealloc
kernel32.dll
HeapFree
GlobalFree
LocalFree
Memory freeing
msvcr*.dll
free
delete []
ole32.dll
CoTaskMemFree
ntdll.dll
RtlAllocateHeap RtlReAllocateHeap RtlFreeHeap
Windows Kernel
2017

Light-weight instrumentation. System DLLs instrumentation
2017

Light-weight instrumentation. Whole System Architecture
2017

WinHeap Explorer. Runtime overhead #2
2017

Example 2. Malware analysis
2017

Dynamic Malware Analysis. Motivation
2017

Dynamic Malware Analysis. Idea & Solution
 Goal – transparently and efficiently trace malware’s library calls
 Solution
– Instrument calls to exported library functions
– Print a trace of each executed function along with some arguments information
– Print a return address of each executed function (to be able to recognize calls from
unpacked code)
2017

DrLtrace Tool
 DrLtrace is a standalone application for transparent API calls tracing build on top of
DynamoRIO framework.
 Benefits:
– Transparent (no API-hooking, no debugging)
– High visibility (each API call, all arguments)
– Open-source (BSD license)
– Supported Windows, Linux, Android
– Easy to use, just specify:
drltrace.exe –logdir <log_name> -- <app_name>
2017

Output Examples
2017

DEMO
2017

Conclusion
 DBI is a powerful technique for transparent and efficient machine code introspection
 Numerous possible ways to use, especially for software security analysis, dynamic malware
introspection, reverse-engineering and etc.
 Traditional fast and scalable data structures may introduce significant overhead in case of
DBI
 Two open-source tools were introduced:
– WinHeap Explorer – a tool for heap-based bug detection in Windows applications
– Dr.Ltrace – a tool for transparent dynamic library calls tracing
2017

Links
 WinHeapExplorer – https://github.com/WinHeapExplorer/WinHeap-Explorer
 DrLtrace 1 - https://github.com/DynamoRIO/drmemory/tree/master/drltrace
 Whitepaper - https://github.com/WinHeapExplorer/WinHeap-Explorer/tree/master/PHD
1 – please build from source code to have the latest functionality
2017

Thank you for you attention!
PhD, Research Staff Member
(Cyber Security)
IBM Research Israel
Maksim Shudrak
maksims@il.ibm.com
mxmssh@gmail.com
2017

Exploring billion states of a program like a pro. How to cook your own fast and scalable DBI-based security tool. A case study.

Recommended

Recommended

More Related Content

Similar to Exploring billion states of a program like a pro. How to cook your own fast and scalable DBI-based security tool. A case study.

Similar to Exploring billion states of a program like a pro. How to cook your own fast and scalable DBI-based security tool. A case study. (20)

Recently uploaded

Recently uploaded (13)

Exploring billion states of a program like a pro. How to cook your own fast and scalable DBI-based security tool. A case study.

Editor's Notes