DomainTools Fingerprinting Threat Actors with Web Assets

Fingerprinting Threat Actors
Using Web Assets
MAY 2018

Often when seeing a new threat on our network, we want to do more
than just block one IP or type of traffic at our firewalls
Using an IP address or attack traffic signature to locate other attacker
infrastructure can prevent further incursions from that attacker…
Pivoting across pieces of information like:
•  Nameserver
•  Whois information
•  Email domains
•  Shared hosting
Exploratory Threat Intelligence

1.  When hunting down a specific adversary, particularly when
attribution is a goal, operational security is king!
2.  It is becoming more common for malicious actors to disguise
things like domain registrations, to use throwaway contact info, etc
3.  When web content is involved, it’s more difficult for an attacker to
create custom content for every site they use
4.  Even when content is different, coding style and shared resources
represent an investment in time that is difficult to re-work for each
malicious site
Exploratory Threat Intelligence

In general, when we refer to web assets, we mean files that are loaded
into the main HTML of a site via html tags.
Examples include:
•  Javascript files (via <script> tags)
•  CSS files (via <link> tags)
•  Images (via <img> tabs)
What do we mean by web assets?

•  Programmers are lazy… threat actors are no exception
•  Reusing css and js files is easier than writing new ones from scratch
•  The set of third party files loaded into an HTML document and the
order in which they are loaded is highly variable, and therefore a
good potential fingerprint
•  Added javascript and css in stock files and/or inline to the html are
easy ways for “lazy” programmers to get script code into pages,
and generally this code will not be unique across their infrastructure
How can we fingerprint infrastructure
using web assets?

What do these assets look like?
A website has a set of tags that
indicate resources to load. In
particular:
•  CSS
•  Javascript
These tags allow for
modularization so they make
great targets for infrastructure
linking.

Simple Connection Searching
THIS IS A TRIVIAL, BUT REAL EXAMPLE, USING A DOMAIN SURFACED FROM ALIENVAULT’S OTX

IF WE LOOK AT THIS DOMAIN IN IRIS, WE SEE SOME SIMILAR INFORMATION ABOUT ITS PROFILE

We saw some js and css
resources on this page already…
Let’s pick one that looks
interesting and see what google
might be able to find for it.
Let’s look at:
aurblue/components.css
Picking something to search on

There’s nothing specific in this CSS file
that looks interesting, but reuse is very
very common.
The hardest part of this process is
searching based on a component that is
common across a threat actor’s
infrastructure, but NOT generally
common.
Maybe we’ll get lucky!
Picking something to search on

Most of these sites are websites that
track metadata on other sites that reveal
connected domains.
These sites DO provide information
about the sites we are looking at, which
gives us some indications of where to
look next.
SEARCHING OUR UNIQUE FILENAME ON GOOGLE DOES, IN FACT,
YIELD INTERESTING RESULTS:

We got lucky this time, but often searching in this simplistic way, we
would find javascript and CSS resources that are common to many
sites.
Examples:
•  JQuery
•  Bootstrap
•  Foundation
•  Etc

Related Domains
IF WE LOOK AT SOME OF THE DOMAINS WE GUESSED WOULD BE
RELATED, WE FIND A PREVIOUSLY UNSEEN CORRELATION:

The two domains we investigated, though not directly related, both
have a related domain (surfaced by Iris) that shares a common email
address:
This shows us that even with a very straightforward approach, we can
begin to associate malicious actors infrastructure via web assets.
Related Domains

As we’ve said a few times, we got lucky on this case, but the same
technique can be used in a more general fashion to explore related
domains.
What about a less trivial case?
<link…
<script…
<image…
<meta…
File level hashing and analysis
Function level hashing and analysis
Coding style and comment analysis
Increasinglevelofsophistication

These techniques have been used in malware analysis for some time,
but we need to extend them to the web space.
•  Signatures
•  External network calls
•  Code reuse
•  Code style
•  Runtime analysis
Malware analysis, but for the web

Advertisers have been using similar techniques for tracking users across the
web for some time… why shouldn’t we use it to track attackers?
OR… reverse browser fingerprinting?
Browser attributes:
•  the User agent header
•  the Accept header
•  the Connection header
•  the Encoding header
•  the Language header
•  the list of plugins
•  the platform
•  the cookies preferences (allowed or not)
•  the Do Not Track preferences (yes, no or not
communicated)
•  the timezone
•  the screen resolution and its color depth
•  the use of local storage
•  the use of session storage
•  a picture rendered with the HTML Canvas
element
•  a picture rendered with WebGL
•  the presence of AdBlock
•  the list of fonts
Page attributes:
•  Meta tags
•  Link tags
•  Script tags
•  Inline scripts and styles
•  External object references
•  iframes
•  Web frameworks
•  What else?

DomainTools Fingerprinting Threat Actors with Web Assets

Recommended

Recommended

More Related Content

What's hot

What's hot (20)

Similar to DomainTools Fingerprinting Threat Actors with Web Assets

Similar to DomainTools Fingerprinting Threat Actors with Web Assets (20)

Recently uploaded

Recently uploaded (20)

DomainTools Fingerprinting Threat Actors with Web Assets