Copyright © 2015 Splunk Inc.
Search Optimization
Splunk Live! – New York
Agenda
● Splunk Architecture Overview
● How Are Events Stored?
● How Search Works
● Types of Searches
● Search Tips
● If we have time…
● Command Abuse
● If we have even more time…
● Bloom Filters
Am I in the right place?
Some familiarity with…
● Splunk roles
– Search Head, Indexer, Forwarder
● Splunk Search Interface
● Search Processing Language
(SPL)
Who’s This Dude?
Jeff Champagne
Client Architect
● Started with Splunk in Fall 2014
● Former Splunk customer in the Financial Services
Industry
● Lived previous lives as a Systems Administrator,
Engineer, and Architect
Splunk Enterprise Architecture
Send data from thousands of servers using any combination of Splunk forwarders
Auto load-balanced forwarding to Splunk Indexers
Offload search load to Splunk Search Heads
How Are Events Stored?
Buckets, Indexes, and Indexers
[Diagram: Events are stored in Buckets, Buckets belong to Indexes (Indices), and Indexes live on Indexers]
How Are Events Stored?
Bucket Aging Process
How Are Events Stored?
What’s in a Bucket?
.tsidx
Sources.data
SourceTypes.data
Hosts.data
journal.gz
Bloom filter
How Search Works
Where's Waldo?

> index=world waldo

[Diagram: the same search traced through three bucket structures – the Bloom filter (hashed terms), the .tsidx file (indexed terms such as "find", "Waldo", "looking"), and journal.gz (the raw events, e.g. "Oh yeah, Waldo comes in this joint all the time…")]

1. Hash the search terms
2. Start searching buckets on indexers by time
3. Bloom filter: "Is Waldo in this bucket?"
4. .tsidx: "Where is Waldo in the raw data?"
5. journal.gz: "Go get him!"

*The internal structure of Bloom filters, TSIDX, and Journal files has been simplified for illustrative purposes
How Search Works
Types of Search Commands
● Streaming Commands
– Apply a transformation to search results as they travel through the processing pipeline
– Run on the indexers (and on the Search Head if you have indexed data there)
– Examples: eval, rex, where, rename, fields…
● Reporting/Transforming Commands
– Process search results and generate a reporting data structure
– Run on the search head
– Examples: stats, top, timechart…
How Search Works
Distributed Search

[Diagram: one Search Head distributing the search to multiple Indexers]

1. Search Head parses search into map (remote) and reduce parts
2. Map parts of search are sent to indexers
3. Indexers fetch events from disk
4. Schema is applied to events (Field Extractions)
5. Events are filtered based on KV pairs
6. Streaming commands are applied
7. Search Head collects results and runs reporting/transforming commands
8. Search Head summarizes and displays results
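The eight steps above follow a classic map/reduce pattern. Here is a toy Python sketch of that flow; the event data, field names, and functions are all hypothetical illustrations, not Splunk internals or its actual protocol:

```python
# Toy sketch of Splunk-style distributed search (illustrative only).
# Each "indexer" extracts fields, filters, and streams its own events;
# the "search head" reduces the partial results.

EVENTS_BY_INDEXER = {                     # step 3: events on disk, per indexer
    "idx1": ["status=404 host=web1", "status=200 host=web1"],
    "idx2": ["status=404 host=web2", "status=404 host=web2"],
}

def extract_fields(raw):                  # step 4: schema applied at search time
    return dict(kv.split("=", 1) for kv in raw.split())

def map_phase(events, wanted):            # steps 5-6: filter on KV pairs, stream
    return [e for e in map(extract_fields, events) if e.get("status") == wanted]

def reduce_phase(partials):               # steps 7-8: search head merges & reports
    merged = [e for part in partials for e in part]
    return {"count": len(merged)}

# Steps 1-2: the "search head" ships the map part to every indexer.
partials = [map_phase(evts, "404") for evts in EVENTS_BY_INDEXER.values()]
print(reduce_phase(partials))             # counts the 404 events across indexers
```

The point of the split is that the expensive work (field extraction, filtering, streaming commands) happens in parallel on the indexers, and only the reduced partial results cross the network to the search head.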
Distributed Search Detail
Types of Searches
• Dense
– Low cardinality (fewer unique values)
– Example: sourcetype=access method=GET
• Sparse
– High cardinality (lots of unique values)
– Example: sourcetype=access method=GET action=purchase
• Super Sparse (or Needle in a Haystack)
– Very high cardinality
– Example: sourcetype=cisco:asa action=denied src=10.2.3.11
• Rare
– Extremely high cardinality
– Benefit from Bloom Filters because events appear in very few buckets
[Diagram: density spectrum running from Dense through Sparse and Super Sparse to Rare]
Dense Searches (>10% matching results)
(scanCount vs eventCount in Job Inspector)
Challenge:
• CPU bound
– Dominant cost is uncompressing *.gz raw data files
– Retrieval rate: 50K events per second per server
Solution:
• Divide and conquer
– Distribute search to an indexing cluster
– Ensure your events are well distributed across indexers
– Parallel compute and merge results
• Report/Data Model Acceleration or use of Summary Indexes
– Report on summarized data vs. raw data
> sourcetype=access_combined method=GET
Sparse Searches
Challenge:
• CPU bound
– Dominant cost is uncompressing *.gz raw data files
– Sometimes need to read far into a file to retrieve a few events
Solution:
• Avoid cherry picking
– Be selective about exclusions (avoid “NOT foo” or “field!=value”)
– Leverage indexed fields (source, host, sourcetype)
• Filter using whole terms
– Instead of > sourcetype=access_combined clientip=192.168.11.2
– Use > sourcetype=access_combined clientip=TERM(192.168.11.2)
> sourcetype=access_combined status=404
Super Sparse Searches
• “Needle in Haystack”
• Disk I/O Bound
– Must look through a lot of tsidx files to
find a small amount of data
• May take up to 2 Seconds to
search each bucket
> sourcetype=access_combined status=404 ip=10.2.1.3
Rare Term Searches
• Disk I/O Bound
• Bloom Filters Improve Performance
– Process up to 50 buckets per second
– I/Os reduced as buckets are excluded
– 20-100x faster than Super Sparse searches on conventional storage,
>1000x faster on SSD (Due to random reads)
> sourcetype=access_combined sessionID=1234
How can I determine if my search is Dense or Sparse?
Use Job Inspector…

scanCount – the number of events that are scanned (read off disk)
eventCount – the number of events that are returned to the base search

• For dense searches, scanCount ~= eventCount.
• For sparse searches, scanCount >> eventCount.
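The rule of thumb above can be expressed directly as a ratio. A minimal Python sketch; the 10% threshold comes from the "Dense Searches (>10% matching results)" slide, and the function name is illustrative, not part of any Splunk API:

```python
def classify_search(scan_count, event_count):
    """Rough density classification from Job Inspector numbers.

    Uses the deck's rule of thumb: a search is "dense" when more than
    10% of the events scanned off disk actually match.
    """
    if scan_count == 0:
        return "no events scanned"
    ratio = event_count / scan_count
    return "dense" if ratio > 0.10 else "sparse"

print(classify_search(100_000, 85_000))  # scanCount ~= eventCount -> dense
print(classify_search(100_000, 12))      # scanCount >> eventCount -> sparse
```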
Search Tips

Avoid: All Time
• Events are stored in time-series order; reduce the buckets searched by being specific
• Instead: use a specific time range and narrow it as much as possible

Avoid: index=*
• Events are grouped into indexes; reduce the buckets searched by specifying an index
• Instead: always specify an index in your search

Avoid: Wildcards
• Wildcards are not compatible with Bloom Filters, and wildcard matching of terms in the index takes time
• Varying levels of suck-itude:
  > myterm* → Not great
  > *myterm → Bad
  > *myterm* → Death
• Instead: use the OR operator, e.g. MyTerm1 OR MyTerm2
Search Tips

Avoid: NOT and !=
• Bloom filters & indexes are designed to quickly locate terms that exist; searching for terms that don't exist takes longer
• Instead: use the OR/AND operators
  (host=c OR host=d)
  (host=f AND host=h)
  vs.
  (host!=a host!=b)
  NOT host=a host=b

Avoid: Verbose Search Mode
• Verbose search mode causes full event data to be sent to the search head, even when it isn't needed
• Instead: use Smart Mode or Fast Mode

Avoid: Real-time Searches
• RT searches put an increased load on the search head and indexers; the same effect can typically be achieved with a scheduled search
• Instead: use a scheduled search that runs more frequently (e.g. every 1 or 5 minutes)
Search Tips

Avoid: Joins/Sub-searches
• join can link events by a common field value, but it is an intensive search command
• Instead: use the stats (preferred) or transaction command to link events

Avoid: Filtering after the first |
• Filtering results with a second | search command in your query is inefficient
• Instead: put all filtering criteria before the first |, e.g.:
  > index=main foo bar
  vs.
  > index=main foo | search bar
Search Tips
Indexed Extractions
• Key-value pairs are stored in the tsidx file, which allows faster searching on KV pairs
• Use indexed extractions in your search criteria as much as possible
• Default fields: source, host, sourcetype
• Custom extractions: defined in props.conf
• Storage considerations: cardinality of the data; increased tsidx file size
Search Tips
Using TERM
• Forces Splunk to do an exact match on an entire term
  – Example: "10.0.0.6" vs. "10 and 0 and 0 and 6"
• Most useful when your term contains minor segmenters
  – Default minor segmenters: / : = @ . - $ # % _
• The term MUST be bounded by major segmenters (spaces, tabs, carriage returns)
• Example:
  Search: > ip=TERM(10.0.0.6)
  Raw Data:
  MATCH: 10.0.0.6 - admin
  NO MATCH: ip=10.0.0.6 - admin
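The effect of segmentation can be sketched in Python. This is a deliberate simplification of Splunk's segmenters.conf behavior, not its actual tokenizer:

```python
import re

# Default minor segmenters listed on the slide: / : = @ . - $ # % _
MINOR = r"[/:=@.\-$#%_]"

def index_terms(raw):
    """Simplified view of indexing: major segmenters (whitespace) split
    an event into tokens; minor segmenters split those tokens further."""
    major_tokens = raw.split()
    minor_tokens = [t for tok in major_tokens
                    for t in re.split(MINOR, tok) if t]
    return major_tokens, minor_tokens

major, minor = index_terms("ip=10.0.0.6 - admin")
# Without TERM(), a search for 10.0.0.6 matches the minor pieces
# "10", "0", "0", "6" individually. TERM(10.0.0.6) requires the whole
# token to be bounded by major segmenters, so the token "ip=10.0.0.6"
# would not match it.
print(minor)
```

This is why TERM(10.0.0.6) matches the raw text `10.0.0.6 - admin` but not `ip=10.0.0.6 - admin`: in the second event the IP is glued to `ip=` inside one major token.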
If we have time…
Command Abuse
Fields vs. Table
Goal: Remove fields I don’t need from results
● Table is a formatting command NOT a filtering command
– If used improperly, it will cause unnecessary data to be transferred to the search head from search peers
● Fields tells Splunk to explicitly drop or retain fields from your results
index=myIndex field1=value1 | fields field1, field2, field4 | head 10000
| table mySum, myTotal
index=myIndex field1=value1 | table field1, field2, field4 | head 10000
| table mySum, myTotal
Command Abuse
Fields vs. Table Example

Search Term | Status       | Artifact Size | # of Events | Run Time
| table     | Running (1%) | 624.93MB      | 2,037,500   | 00:02:44
| fields    | Done         | 9.95MB        | 10,000      | 00:00:13
Command Abuse
Stats vs. Transaction
Goal: Group multiple events by a common field value
● If you’re not using any of the Transaction command parameters, the same
results can usually be accomplished using Stats
– startswith, endswith, maxspan, maxpause, etc…
index=mail from=joe@schmoe.com | stats latest(_time) as mTime values(to)
as to values(from) as from values(subject) as subject by message_id
index=mail from=joe@schmoe.com| transaction message_id | table _time, to,
from, subject, message_id
Command Abuse
Latest vs. Dedup
Goal: Return the latest login for each user
index=auth sourcetype=logon | stats latest(clientip) by username
index=auth sourcetype=logon | dedup username sortby - _time | table
username, clientip
Command Abuse
Joins & Sub-searches
Goal: Return the latest JSESSIONID across two sourcetypes
sourcetype=access_combined OR sourcetype=applogs | stats latest(*) as *
by JSESSIONID
sourcetype=access_combined | join type=inner JSESSIONID [search
sourcetype=applogs | dedup JSESSIONID | table JSESSIONID,
clientip, othervalue]
If we have even
more time…
Bloom Filters
How do they work again?
● Created when buckets roll from hot to warm
● Deleted when buckets roll to frozen
● Stored with other bucket files by default, but can be moved
● Binary file
● Employs a constant number of I/O calls per query
– Speed does not decrease as the # or size of tsidx files grow
● Bit array
– Written to disk in consecutive chunks of 8 bits each
Bloom Filters
How do they work again?
1. A bit array is created with a set number of positions
2. Keywords in the tsidx file are fed through a set of hash functions
3. The results of the functions are mapped to positions in the bit array,
setting the value to 1 (the positions may coincide)
4. The keywords in your search are fed through the same set of hash
functions
5. The bit array positions are compared and if any of the values are 0, the
keyword does not exist and the bucket is skipped
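The five steps above can be sketched in a few lines of Python. This is a toy filter with illustrative hash functions and sizes, far simpler than Splunk's real implementation:

```python
import hashlib

SIZE = 64                      # step 1: bit array with a set number of positions

def positions(keyword, num_hashes=3):
    """Feed a keyword through a set of hash functions (steps 2 and 4)."""
    return [int(hashlib.sha256(f"{i}:{keyword}".encode()).hexdigest(), 16) % SIZE
            for i in range(num_hashes)]

def build_filter(keywords):
    bits = [0] * SIZE
    for kw in keywords:        # step 3: set mapped positions to 1 (may coincide)
        for p in positions(kw):
            bits[p] = 1
    return bits

def might_contain(bits, keyword):
    # Step 5: any 0 position means the keyword is definitely absent,
    # so the bucket can be skipped. All 1s means only "maybe present".
    return all(bits[p] for p in positions(keyword))

bits = build_filter(["find", "waldo", "looking"])
print(might_contain(bits, "waldo"))    # True: possibly in this bucket
print(might_contain(bits, "unicorn"))  # very likely False: skip the bucket
```

Note the asymmetry: a Bloom filter can return false positives (the bucket is opened needlessly) but never false negatives, which is why it is safe to skip any bucket that answers "no".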
Bloom Filters
How do they work again?
Interactive Demo
https://www.jasondavies.com/bloomfilter/
Resources
● Splunk Docs
– Write Better Searches
http://docs.splunk.com/Documentation/Splunk/latest/Search/Writebettersearches
– Wiki: How Distributed Search Works
http://wiki.splunk.com/Community:HowDistSearchWorks
– Splunk Search Types
http://docs.splunk.com/Documentation/Splunk/6.2.3/Capacity/HowsearchtypesaffectSplunkEnterpriseperformance
– Blog: When to use Transaction and when to use Stats
http://blogs.splunk.com/2012/11/29/book-excerpt-when-to-use-transaction-and-when-to-use-stats/
– Segmenters.conf Spec
http://docs.splunk.com/Documentation/Splunk/latest/Admin/Segmentersconf
– Splunk Book: Exploring Splunk
http://www.splunk.com/goto/book
Resources
Training
● eLearning
– What is Splunk (Intro to Splunk)
‣ http://www.splunk.com/view/SP-CAAAH9U
● Instructor Led Courses with Labs
– Using Splunk
‣ http://www.splunk.com/view/SP-CAAAH9A
– Searching & Reporting with Splunk
‣ http://www.splunk.com/view/SP-CAAAH9C
– Advanced Searching & Reporting
‣ http://www.splunk.com/view/SP-CAAAH9D
Questions?

Splunk Search Optimization

  • 1.
    Copyright © 2015Splunk Inc. Search Optimization Splunk Live! – New York
  • 2.
    Agenda ● Splunk ArchitectureOverview ● How Are Events Stored? ● How Search Works ● Types of Searches ● Search Tips 2 ● If we have time… ● Command Abuse ● If we have even more time… ● Bloom Filters
  • 3.
    Am I inthe right place? Some familiarity with… ● Splunk roles – Search Head, Indexer, Forwarder ● Splunk Search Interface ● Search Processing Language (SPL) 3
  • 4.
    Who’s This Dude? 4 JeffChampagne Client Architect ● Started with Splunk in Fall 2014 ● Former Splunk customer in the Financial Services Industry ● Lived previous lives as a Systems Administrator, Engineer, and Architect
  • 5.
    Splunk Enterprise Architecture 5 Senddata from thousands of servers using any combination of Splunk forwarders Auto load-balanced forwarding to Splunk Indexers Offload search load to Splunk Search Heads
  • 6.
    How Are EventsStored? Buckets, Indexes, and Indexers 6 IndexersIndices (Indexes) BucketsEvents
  • 7.
    How Are EventsStored? Bucket Aging Process 7
  • 8.
    How Are EventsStored? What’s in a Bucket? 8 .tsidx Sources.data SourceTypes.data Hosts.data journal.gz Bloom filter
  • 9.
    How Search Works Where’sWaldo? 9 > index=world waldo
  • 10.
    How Search Works Where’sWaldo? 10 journal.gzBloom filter .tsidx > index=world waldo I have been trying to find Waldo looking all over these books. I’m not sure I’ll ever find him because my vision is terrible. The individual you are looking for does not exist in this dataset. We banished him. He isn’t welcome. Oh yeah, Waldo comes in this joint all the time. The last time I saw him was probably 6 months ago. He was wearing a fur coat from a bear that killed his brother. find Waldo looking The individual you are Yeah Waldo comes in Bafc2467d6f7a6855d58279 61aa5b6c78fa4e363606934 2b80a20039f52112ba97370 a4704fd35f0308287f2937ba 61aa5b6c78fa4e363606934 2b80a20039f52112ba97370 Bafc2467d6f7a6855d58279 61aa5b6c78fa4e363606934 2b80a20039f52112ba97370 Bafc2467d6f7a6855d58279 1 Hash search terms *The internal structure of Bloom filters, TSIDX, and Journal files has been simplified for illustrative purposes
  • 11.
    How Search Works Where’sWaldo? 11 journal.gzBloom filter .tsidx > index=world waldo I have been trying to find Waldo looking all over these books. I’m not sure I’ll ever find him because my vision is terrible. The individual you are looking for does not exist in this dataset. We banished him. He isn’t welcome. Oh yeah, Waldo comes in this joint all the time. The last time I saw him was probably 6 months ago. He was wearing a fur coat from a bear that killed his brother. find Waldo looking The individual you are Yeah Waldo comes in Bafc2467d6f7a6855d58279 61aa5b6c78fa4e363606934 2b80a20039f52112ba97370 a4704fd35f0308287f2937ba 61aa5b6c78fa4e363606934 2b80a20039f52112ba97370 Bafc2467d6f7a6855d58279 61aa5b6c78fa4e363606934 2b80a20039f52112ba97370 Bafc2467d6f7a6855d58279 1 Hash search terms 2 Start searching buckets on indexers by time *The internal structure of Bloom filters, TSIDX, and Journal files has been simplified for illustrative purposes
  • 12.
    How Search Works Where’sWaldo? 12 journal.gzBloom filter .tsidx > index=world waldo I have been trying to find Waldo looking all over these books. I’m not sure I’ll ever find him because my vision is terrible. The individual you are looking for does not exist in this dataset. We banished him. He isn’t welcome. Oh yeah, Waldo comes in this joint all the time. The last time I saw him was probably 6 months ago. He was wearing a fur coat from a bear that killed his brother. find Waldo looking The individual you are Yeah Waldo comes in Is Waldo in this bucket? Bafc2467d6f7a6855d58279 61aa5b6c78fa4e363606934 2b80a20039f52112ba97370 a4704fd35f0308287f2937ba 61aa5b6c78fa4e363606934 2b80a20039f52112ba97370 Bafc2467d6f7a6855d58279 61aa5b6c78fa4e363606934 2b80a20039f52112ba97370 Bafc2467d6f7a6855d58279 1 Hash search terms 2 Start searching buckets on indexers by time 3 *The internal structure of Bloom filters, TSIDX, and Journal files has been simplified for illustrative purposes
  • 13.
    How Search Works Where’sWaldo? 13 journal.gzBloom filter .tsidx > index=world waldo I have been trying to find Waldo looking all over these books. I’m not sure I’ll ever find him because my vision is terrible. The individual you are looking for does not exist in this dataset. We banished him. He isn’t welcome. Oh yeah, Waldo comes in this joint all the time. The last time I saw him was probably 6 months ago. He was wearing a fur coat from a bear that killed his brother. find Waldo looking The individual you are Yeah Waldo comes in Is Waldo in this bucket? Where is Waldo in the raw data? Bafc2467d6f7a6855d58279 61aa5b6c78fa4e363606934 2b80a20039f52112ba97370 a4704fd35f0308287f2937ba 61aa5b6c78fa4e363606934 2b80a20039f52112ba97370 Bafc2467d6f7a6855d58279 61aa5b6c78fa4e363606934 2b80a20039f52112ba97370 Bafc2467d6f7a6855d58279 1 Hash search terms 2 Start searching buckets on indexers by time 3 4 *The internal structure of Bloom filters, TSIDX, and Journal files has been simplified for illustrative purposes
  • 14.
    How Search Works Where’sWaldo? 14 journal.gzBloom filter .tsidx > index=world waldo I have been trying to find Waldo looking all over these books. I’m not sure I’ll ever find him because my vision is terrible. The individual you are looking for does not exist in this dataset. We banished him. He isn’t welcome. Oh yeah, Waldo comes in this joint all the time. The last time I saw him was probably 6 months ago. He was wearing a fur coat from a bear that killed his brother. find Waldo looking The individual you are Yeah Waldo comes in Is Waldo in this bucket? Where is Waldo in the raw data? Bafc2467d6f7a6855d58279 61aa5b6c78fa4e363606934 2b80a20039f52112ba97370 a4704fd35f0308287f2937ba 61aa5b6c78fa4e363606934 2b80a20039f52112ba97370 Bafc2467d6f7a6855d58279 61aa5b6c78fa4e363606934 2b80a20039f52112ba97370 Go Get Him! Bafc2467d6f7a6855d58279 1 Hash search terms 2 Start searching buckets on indexers by time 3 4 5 *The internal structure of Bloom filters, TSIDX, and Journal files has been simplified for illustrative purposes
  • 15.
    How Search Works Typesof Search Commands 15 ● Streaming Command ● Applies a transformation to search results as they travel through the processing pipeline ● Run on the indexers (and Search Head if you have indexed data there) ● Examples: eval, rex, where, rename, fields… ● Reporting/Transforming Command ● Processes search results and generates a reporting data structure ● Run on the search head ● Examples: stats, top, timechart…
  • 16.
    How Search Works DistributedSearch 16 Search Head Indexer Indexer
  • 17.
    How Search Works DistributedSearch 17 1 Search Head parses search into map (remote) and reduce parts
  • 18.
    How Search Works DistributedSearch 18 1 Search Head parses search into map (remote) and reduce parts 2 Map parts of search are sent to indexers
  • 19.
    How Search Works DistributedSearch 19 1 Search Head parses search into map (remote) and reduce parts 2 Map parts of search are sent to indexers 3 Indexers fetch events from disk
  • 20.
    How Search Works DistributedSearch 20 1 Search Head parses search into map (remote) and reduce parts 2 Map parts of search are sent to indexers 3 Indexers fetch events from disk 4 Schema is applied to events (Field Extractions)
  • 21.
    How Search Works DistributedSearch 21 1 Search Head parses search into map (remote) and reduce parts 2 Map parts of search are sent to indexers 3 Indexers fetch events from disk 4 Schema is applied to events (Field Extractions) 5 Events are filtered based on KV pairs
  • 22.
    How Search Works DistributedSearch 22 1 Search Head parses search into map (remote) and reduce parts 2 Map parts of search are sent to indexers 3 Indexers fetch events from disk 4 Schema is applied to events (Field Extractions) 5 Events are filtered based on KV pairs 6 Streaming commands are applied
  • 23.
    How Search Works DistributedSearch 23 1 Search Head parses search into map (remote) and reduce parts 2 Map parts of search are sent to indexers 3 Indexers fetch events from disk 4 Schema is applied to events (Field Extractions) 5 Events are filtered based on KV pairs 6 Streaming commands are applied 7Search Head collects results and runs reporting/transforming commands
  • 24.
    How Search Works DistributedSearch 24 1 Search Head parses search into map (remote) and reduce parts 2 Map parts of search are sent to indexers 3 Indexers fetch events from disk 4 Schema is applied to events (Field Extractions) 5 Events are filtered based on KV pairs 6 Streaming commands are applied 7Search Head collects results and runs reporting/transforming commands 8Search Head summarizes and displays results
  • 25.
  • 26.
    Types of Searches 26 •Dense – Low cardinality (fewer unique values) – Example: sourcetype=access method=GET • Sparse – High cardinality (lots of unique values) – Example: sourcetype=access method=GET action=purchase • Super Sparse (or Needle in a Haystack) – Very high cardinality – Example: sourcetype=cisco:asa action=denied src=10.2.3.11 • Rare – Extremely high cardinality – Benefit from Bloom Filters because events appear in very few buckets Dense Super Sparse Sparse Rare
  • 27.
    Dense Searches (>10%matching results) (scanCount vs eventCount in Job Inspector) 27 Challenge: • CPU bound – Dominant cost is uncompressing *.gz raw data files – Retrieval rate: 50K events per second per server Solution: • Divide and conquer – Distribute search to an indexing cluster – Ensure your events are well distributed across indexers – Parallel compute and merge results • Report/Data Model Acceleration or use of Summary Indexes – Report on summarized data vs. raw data > sourcetype=access_combined method=GET
  • 28.
    Sparse Searches 28 Challenge: • CPUbound – Dominant cost is uncompressing *.gz raw data files – Sometimes need to read far into a file to retrieve a few events Solution: • Avoid cherry picking – Be selective about exclusions (avoid “NOT foo” or “field!=value”) – Leverage indexed fields (source, host, soutcetype) • Filter using whole terms – Instead of > sourcetype=access_combined clientip=192.168.11.2 – Use > sourcetype=access_combined clientip=TERM(192.168.11.2) > sourcetype=access_combined status=404
  • 29.
    Super Sparse Searches 29 •“Needle in Haystack” • Disk I/O Bound – Must look through a lot of tsidx files to find a small amount of data • May take up to 2 Seconds to search each bucket > sourcetype=access_combined status 404 ip=10.2.1.3
  • 30.
    Rare Term Searches 30 •Disk I/O Bound • Bloom Filters Improve Performance – Process up to 50 buckets per second – I/Os reduced as buckets are excluded – 20-100x faster than Super Sparse searches on conventional storage, >1000x faster on SSD (Due to random reads) > sourcetype=access_combined sessionID=1234
  • 31.
    How can Idetermine if my search is Dense or Sparse? Use Job Inspector… 31 Component Description scanCount The number of events that are scanned or read off disk. eventCount Number of events that are returned to base search • For dense searches scanCount ~= eventCount. • For sparse searches, scanCount >> eventCount.
  • 32.
    Search Tips 32 Avoid ExplanationSuggested Alternative All Time • Events are stored in time-series order • Reduce searched buckets by being specific • Use a specific time range • Narrow the time range as much as possible index=* • Events are grouped into indexes • Reduce searched buckets by specifying an index • Always specify an index in your search Wildcards • Wildcards are not compatible with Bloom Filters • Wildcard matching of terms in the index takes time • Varying levels of suck-itude > myterm*  Not great > *myterm  Bad > *myterm*  Death • Use the OR operator i.e.: MyTerm1 OR MyTerm2
  • 33.
    Search Tips 33 Avoid ExplanationSuggested Alternative NOT != • Bloom filters & indexes are designed to quickly locate terms that exist • Searching for terms that don’t exist takes longer • Use the OR/AND operators (host=c OR host=d) (host=f AND host=h) vs. (host!=a host!=b) NOT host=a host=b Verbose Search Mode • Verbose search mode causes full event data to be sent to the search head, even if it isn’t needed • Use Smart Mode or Fast Mode Real-time Searches • RT Searches put an increased load on search head and indexers • The same effect can typically be accomplished with a 1 min. or 5 min. scheduled search • Use a scheduled search that occurs more frequently
  • 34.
    Search Tips 34 Avoid ExplanationSuggested Alternative Joins/Sub- searches • Joins can be used to link events by a common field value, but this is an intensive search command • Use the stats (preferred) or transaction command to link events Search after first | • Filtering search results using a second | search command in your query is inefficient • As much as possible, add all filtering criteria before the first | i.e.: >index=main foo bar vs. >index=main foo | search bar
  • 35.
    Search Tips Indexed Extractions •Key-value pair is stored in tsidx file • Allows for faster searching when using KV pairs • Use indexed extractions in your search criteria as much as possible 35 • Default Fields • source, host, sourcetype • Custom Extractions • Defined in props.conf • Storage considerations • Cardinality of data • Increased tisdx file size
  • 36.
    Search Tips Using TERM •Forces Splunk to do an exact match for an entire term • Example: “10.0.0.6” vs. “10 and 0 and 0 and 6” • Most useful when your term has minor segmenters • Default minor segmenters: / : = @ . - $ # % _ 36 • Term MUST be bounded by major segmenters Example: Spaces, tabs, carriage returns • Example: Search: > ip=TERM(10.0.0.6) Raw Data: MATCH: 10.0.0.6 - admin NO: ip=10.0.0.6 - admin
  • 37.
    If we havetime…
  • 38.
    Command Abuse Fields vs.Table Goal: Remove fields I don’t need from results ● Table is a formatting command NOT a filtering command – If used improperly, it will cause unnecessary data to be transferred to the search head from search peers ● Fields tells Splunk to explicitly drop or retain fields from your results 38 index=myIndex field1=value1 | fields field1, field2, field4 | head 10000 | table mySum, myTotal index=myIndex field1=value1 | table field1, field2, field4 | head 10000 | table mySum, myTotal
  • 39.
    Command Abuse Fields vs.Table Example 39 Search Term Status Artifact Size # of Events Run Time | table Running (1%) 624.93MB 2,037,500 00:02:44 | fields Done 9.95MB 10,000 00:00:13
  • 40.
    Command Abuse Stats vs.Transaction Goal: Group multiple events by a common field value ● If you’re not using any of the Transaction command parameters, the same results can usually be accomplished using Stats – startswith, endswith, maxspan, maxpause, etc… 40 index=mail from=joe@schmoe.com | stats latest(_time) as mTime values(to) as to values(from) as from values(subject) as subject by message_id index=mail from=joe@schmoe.com| transaction message_id | table _time, to, from, subject, message_id
    Command Abuse
    Latest vs. Dedup
    Goal: Return the latest login for each user
    Do: index=auth sourcetype=logon | stats latest(clientip) by username
    Don’t: index=auth sourcetype=logon | dedup username sortby - _time | table username, clientip
    41
    Command Abuse
    Joins & Sub-searches
    Goal: Return the latest JSESSIONID across two sourcetypes
    Do: sourcetype=access_combined OR sourcetype=applogs | stats latest(*) as * by JSESSIONID
    Don’t: sourcetype=access_combined | join type=inner JSESSIONID [search sourcetype=applogs | dedup JSESSIONID | table JSESSIONID, clientip, othervalue]
    42
    If we have even more time…
    Bloom Filters
    How do they work again?
    ● Created when buckets roll from hot to warm
    ● Deleted when buckets roll to frozen
    ● Stored with the other bucket files by default, but can be moved
    ● Binary file
    ● Employs a constant number of I/O calls per query
    – Speed does not decrease as the number or size of tsidx files grows
    ● Bit array
    – Written to disk in consecutive chunks of 8 bits each
    44
    Bloom Filters
    How do they work again?
    1. A bit array is created with a set number of positions
    2. Keywords in the tsidx file are fed through a set of hash functions
    3. The results of the functions are mapped to positions in the bit array, setting those values to 1 (positions may coincide)
    4. The keywords in your search are fed through the same set of hash functions
    5. The bit array positions are compared; if any of the values are 0, the keyword does not exist and the bucket is skipped
    45
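The five steps above can be sketched in a few lines of Python. This is an illustrative toy, not Splunk's actual implementation — the array size, the number of hashes, and the salted-SHA-256 hashing scheme are all made up for the example:

```python
import hashlib

class BloomFilter:
    """Toy Bloom filter: a fixed-size bit array plus k hash functions."""

    def __init__(self, size=1024, num_hashes=3):
        self.size = size
        self.num_hashes = num_hashes
        self.bits = [0] * size  # step 1: bit array with a set number of positions

    def _positions(self, keyword):
        # Derive k positions by salting one hash function with an index
        for i in range(self.num_hashes):
            digest = hashlib.sha256(f"{i}:{keyword}".encode()).hexdigest()
            yield int(digest, 16) % self.size

    def add(self, keyword):
        # Steps 2-3: hash each indexed keyword, set the mapped bits to 1
        for pos in self._positions(keyword):
            self.bits[pos] = 1

    def might_contain(self, keyword):
        # Steps 4-5: hash the search term; any 0 bit means "definitely absent"
        return all(self.bits[pos] for pos in self._positions(keyword))

# Index time: feed the bucket's keywords into the filter
bf = BloomFilter()
for kw in ["find", "Waldo", "looking"]:
    bf.add(kw)

# Search time: a False answer lets the bucket be skipped entirely
print(bf.might_contain("Waldo"))   # True
print(bf.might_contain("Carmen"))  # almost certainly False (false positives
                                   # are possible; false negatives are not)
```

This also shows why lookup cost stays constant: answering a query touches exactly num_hashes bit positions, regardless of how many keywords were added.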
    Bloom Filters
    How do they work again?
    Interactive Demo: https://www.jasondavies.com/bloomfilter/
    46
    Resources
    ● Splunk Docs
    – Write Better Searches
    http://docs.splunk.com/Documentation/Splunk/latest/Search/Writebettersearches
    – Wiki: How Distributed Search Works
    http://wiki.splunk.com/Community:HowDistSearchWorks
    – Splunk Search Types
    http://docs.splunk.com/Documentation/Splunk/6.2.3/Capacity/HowsearchtypesaffectSplunkEnterpriseperformance
    – Blog: When to use Transaction and when to use Stats
    http://blogs.splunk.com/2012/11/29/book-excerpt-when-to-use-transaction-and-when-to-use-stats/
    – Segmenters.conf Spec
    http://docs.splunk.com/Documentation/Splunk/latest/Admin/Segmentersconf
    – Splunk Book: Exploring Splunk
    http://www.splunk.com/goto/book
    47
    Resources
    Training
    ● eLearning
    – What is Splunk (Intro to Splunk)
    ‣ http://www.splunk.com/view/SP-CAAAH9U
    ● Instructor-Led Courses with Labs
    – Using Splunk
    ‣ http://www.splunk.com/view/SP-CAAAH9A
    – Searching & Reporting with Splunk
    ‣ http://www.splunk.com/view/SP-CAAAH9C
    – Advanced Searching & Reporting
    ‣ http://www.splunk.com/view/SP-CAAAH9D
    48

Editor's Notes

  • #6 Forwarders collect data and load balance it across your indexers. Indexers store the data, build tsidx files, and search for events. Search Heads are where you interact with Splunk, and they coordinate your search jobs
  • #7 Events are written to buckets in time-series order. Indexes are comprised of buckets. Indexes live on one or more indexers
  • #8 Events are written to hot buckets in time-series order. Hot buckets are rolled to a warm state when they reach a pre-defined size. Hot and warm buckets share the same (fast) storage location. Warm buckets are rolled to cold when the number of warm buckets reaches a pre-defined threshold. Cold buckets are typically stored on cheaper/bulk storage. Cold buckets are rolled to a frozen path or deleted after a pre-defined amount of time, or when a total index size threshold is met. Frozen buckets are no longer searchable in Splunk. Frozen buckets can be thawed if you want to make them searchable again
  • #9 journal.gz is a set of compressed data slices; these slices contain your raw data. Tsidx files are time-series index files; they contain a list of all keywords that exist in your raw data with their respective offsets. The Bloom filter is essentially a hash table of the unique keywords that exist in the tsidx file; it helps Splunk determine if the keywords you are searching for exist in this bucket. The .data files are a set of metadata files used by the | metadata SPL command
  • #10 - Searching the world index for the keyword Waldo
  • #11 Step 1 – Run the keyword(s) through our hashing functions
  • #12 Step 2 – Begin searching the buckets on your indexers
  • #13 Step 3 – Compare the output of our hashing functions to the values in the bloom filter
  • #14 Step 4 – If the Bloom Filter indicates that our keyword exists in the bucket, begin searching the tsidx file(s) for our keyword
  • #15 Step 5 – Locate the keyword in the raw data based on the offsets in the tsidx files
  • #16 Streaming: run in parallel on the indexers; don’t need to take other events into account. Reporting/Transforming: run in sequence on the Search Head; need to take other events into account
  • #17 Parse search into map (remote) and reduce parts
  • #18 Parse search into map (remote) and reduce parts
  • #19 Send map parts to indexers
  • #20 Indexers fetch/retrieve events from disk
  • #21 Indexers apply schema (extract fields)
  • #22 Indexers filter results based on KV pairs
  • #23 Indexers run streaming commands
  • #24 Search results are streamed to Search Heads Search Head runs Reporting/Transforming commands
  • #25 SH summarizes and displays results
  • #26 This is included for the nerds out there 
  • #27 The search type is determined by the frequency of data in a set
  • #28 CPU bound. Caused by having to unzip many files to retrieve events. Add indexers. Ensure events are well distributed
  • #29 CPU bound. Unzips a lot of files to retrieve few events. We will talk more about these later… Avoid exclusions. Leverage indexed fields. Filter using whole terms
  • #30 I/O bound. Spend less time unzipping files and more time reading indexes
  • #31 I/O bound. Bloom filters drastically improve performance; they help eliminate buckets that need to be searched
  • #32 Open the job inspector. Compare Scan Count (events read off disk) to Event Count (events returned after filtering)
  • #33 Don’t freak out - these are suggestions, not rules. All Time - narrow your time range. index=* - reduce the buckets searched by specifying index(es). Wildcards - not compatible with bloom filters - require more time to match keywords in the index
  • #34 NOT/!= - Bloom filters and indexes are tuned to look for things that exist, not things that don’t exist - use OR/AND instead of NOT. Verbose search mode - only use this when testing - returns unnecessary event data to the search head. Realtime searches - spawn additional processes on the indexers - scheduled searches can typically accomplish the same thing
  • #35 Joins/Subsearches - intensive commands - stats (preferred) or transaction can typically do the same thing. Search after the first | - filter as much as possible before the first pipe - less data is retrieved from the indexers
  • #36 KV pairs are stored in the tsidx file, so search-time extraction is unnecessary. Default fields: source, host, sourcetype. Before defining custom indexed extractions, consult the docs. Storage considerations: you lose the flexibility of schema on the fly
  • #37 Whole term searching Must be bounded by major segmenters Works best when your data has minor segmenters
  • #38 Time Check
  • #39 - You can speed searches up by telling Splunk which fields you want. Don’t use table to do this - it is a formatting command, not a filtering command, and it causes unnecessary data to be transferred to the search head
  • #40 The table command forces all events to be transferred to the search head before the head command is run. Far more data is transferred to the search head
  • #41 - If you find yourself using TRANSACTION without any of the special parameters, consider giving STATS values() a try
  • #42 Dedup must look through all of the events in a set. Stats latest() can just pull the most recent
  • #43 As mentioned in the Search Tips section, you can search across multiple sourcetypes in the same base search. Stats can then join the events based on a common field
  • #49 - Splunk offers more courses than this, these just specifically apply to this material