The Time to Defer is Now

Michael Diamant
Co-author, Scala High Performance Programming

Source: http://www.ew.com/sites/default/files/i/2015/04/15/wall-street-douglas.jpg

Focus areas
functional programming, performance, financial trading

Topics we'll dig into
deferred evaluation, scala.collection.immutable.Queue, JMH, design tradeoffs, order book

Your takeaways
strategies/tools for designing more robust and performant software

Getting to know the order book

Bid          Offer
27.01 x 4    27.03 x 1
27.00 x 1    27.05 x 3
26.97 x 2    27.10 x 2

Resting on the book
Buy @ 27.01, ID = 1932

Before:
Bid          Offer
27.01 x 4    27.03 x 1
27.00 x 1    27.05 x 3
26.97 x 2    27.10 x 2

After:
Bid          Offer
27.01 x 5    27.03 x 1
27.00 x 1    27.05 x 3
26.97 x 2    27.10 x 2

Crossing the book
Buy @ 27.05, ID = 7389

After (starting from the initial book):
Bid          Offer
27.01 x 4    27.05 x 3
27.00 x 1    27.10 x 2
26.97 x 2

Canceling an order request
Cancel, ID = 5502

Before:
Bid          Offer
27.01 x 4    27.03 x 1
27.00 x 1    27.05 x 3
26.97 x 2    27.10 x 2

After:
Bid          Offer
27.01 x 3    27.03 x 1
27.00 x 1    27.05 x 3
26.97 x 2    27.10 x 1

Let's model the order book

// scala.collection.immutable.TreeMap (standard library)
class TreeMap[A, +B] private (tree: RB.Tree[A, B])
    (implicit val ordering: Ordering[A])

object Price {
  implicit val ordering: Ordering[Price] =
    new Ordering[Price] {
      def compare(x: Price, y: Price): Int =
        Ordering.BigDecimal.compare(x.value, y.value)
    }
}

case class QueueOrderBook(
    bids: TreeMap[Price, Queue[BuyLimitOrder]],
    offers: TreeMap[Price, Queue[SellLimitOrder]]) {
  def bestBid: Option[BuyLimitOrder] = // highest price
    bids.lastOption.flatMap(_._2.headOption)
  def bestOffer: Option[SellLimitOrder] = // lowest price
    offers.headOption.flatMap(_._2.headOption)
}
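To make the model concrete, here is a minimal self-contained sketch of the same shape. The domain types (Price, OrderId, BuyLimitOrder, SellLimitOrder) are simplified stand-ins assumed for illustration, not the talk's actual definitions, and each price level holds a single order to keep the example short.

```scala
import scala.collection.immutable.{Queue, TreeMap}

object OrderBookSketch {
  // Simplified stand-ins for the talk's domain types (assumptions).
  final case class Price(value: BigDecimal)
  object Price {
    implicit val ordering: Ordering[Price] =
      Ordering.by[Price, BigDecimal](_.value)
  }
  final case class OrderId(value: Long)
  final case class BuyLimitOrder(id: OrderId, price: Price)
  final case class SellLimitOrder(id: OrderId, price: Price)

  final case class QueueOrderBook(
      bids: TreeMap[Price, Queue[BuyLimitOrder]],
      offers: TreeMap[Price, Queue[SellLimitOrder]]) {
    // Keys are sorted ascending, so the best bid is at the last key...
    def bestBid: Option[BuyLimitOrder] =
      bids.lastOption.flatMap(_._2.headOption)
    // ...and the best offer is at the first key.
    def bestOffer: Option[SellLimitOrder] =
      offers.headOption.flatMap(_._2.headOption)
  }

  private def p(s: String): Price = Price(BigDecimal(s))

  // Price levels taken from the example book; one order per level.
  val book: QueueOrderBook = QueueOrderBook(
    bids = TreeMap(
      p("27.01") -> Queue(BuyLimitOrder(OrderId(1), p("27.01"))),
      p("27.00") -> Queue(BuyLimitOrder(OrderId(2), p("27.00"))),
      p("26.97") -> Queue(BuyLimitOrder(OrderId(3), p("26.97")))),
    offers = TreeMap(
      p("27.03") -> Queue(SellLimitOrder(OrderId(4), p("27.03"))),
      p("27.05") -> Queue(SellLimitOrder(OrderId(5), p("27.05"))),
      p("27.10") -> Queue(SellLimitOrder(OrderId(6), p("27.10")))))
}
```

With this book, bestBid resolves to the order at 27.01 and bestOffer to the order at 27.03, matching the top row of the example table.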

Better know a queue (1/3)

package scala.collection
package immutable

class Queue[+A] protected(???) {
  def enqueue[B >: A](elem: B) = ...   // O(1)
  def dequeue: (A, Queue[A]) = ...     // amortized O(1)
}

Better know a queue (2/3)

package scala.collection
package immutable

class Queue[+A] protected(
    protected val in: List[A],
    protected val out: List[A]) {
  def enqueue[B >: A](elem: B) = new Queue(elem :: in, out)
  def dequeue: (A, Queue[A]) = out match {
    case Nil if !in.isEmpty =>
      // Deferred evaluation: in is reversed only when out runs dry.
      val rev = in.reverse // List.reverse: O(N)
      (rev.head, new Queue(Nil, rev.tail))
    case x :: xs => (x, new Queue(in, xs))
    case _ => throw new
      NoSuchElementException("dequeue on empty queue")
  }
}

Better know a queue (3/3)

Operation    In               Out
enqueue(1)   List(1)          Nil
enqueue(2)   List(2, 1)       Nil
enqueue(3)   List(3, 2, 1)    Nil
dequeue      Nil              List(2, 3)
dequeue      Nil              List(3)
enqueue(4)   List(4)          List(3)
dequeue      List(4)          Nil
dequeue      Nil              Nil
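The trace above can be reproduced with a small stand-alone queue. TwoListQueue is a hypothetical name for this sketch; it mirrors the in/out lists of the standard library Queue but is not the library class itself.

```scala
object TwoListQueueSketch {
  // Hypothetical minimal queue with the same in/out structure as shown above.
  final case class TwoListQueue[A](in: List[A], out: List[A]) {
    // O(1): prepend to the in list.
    def enqueue(elem: A): TwoListQueue[A] = copy(in = elem :: in)

    // Amortized O(1): the O(N) reversal is deferred until out is empty.
    def dequeue: (A, TwoListQueue[A]) = out match {
      case x :: xs => (x, copy(out = xs))
      case Nil =>
        in.reverse match {
          case x :: xs => (x, TwoListQueue(Nil, xs))
          case Nil =>
            throw new NoSuchElementException("dequeue on empty queue")
        }
    }
  }

  // Replay the operations from the table.
  val q3: TwoListQueue[Int] =
    TwoListQueue[Int](Nil, Nil).enqueue(1).enqueue(2).enqueue(3)
  val (first, afterFirst) = q3.dequeue
  val (second, afterSecond) = afterFirst.dequeue
  val withFour: TwoListQueue[Int] = afterSecond.enqueue(4)
  val (third, afterThird) = withFour.dequeue
  val (fourth, afterFourth) = afterThird.dequeue
}
```

Only the first and last dequeue pay for a reversal; the dequeues in between pop the head of out directly.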

Understanding performance: buy limit order arrives

1. Is there a resting sell order priced <= the buy order?
     TreeMap.headOption: O(log N)
2. No: add resting buy order
     TreeMap.get: O(log N)
     Queue.enqueue: O(1)
     TreeMap.+: O(log N)
   Yields LimitOrderAdded
3. Yes: cross to execute offer
     Queue.dequeue: amortized O(1)
     TreeMap.+: O(log N)
   Yields OrderExecuted

Understanding performance: order cancel request arrives

1. Is there a bid with matching ID?
     TreeMap.find: O(N)
     Queue.exists: O(N)
   Yes: go to step 3. No: go to step 2.
2. Is there an offer with matching ID?
     TreeMap.find: O(N)
     Queue.exists: O(N)
   Yes: go to step 3. No: yields OrderCancelRejected.
3. Remove order within price level queue
     Queue.filter: O(N)
   Yields OrderCanceled.
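The eager cancel path can be sketched for one side of the book as follows. Order, Side, and cancel are hypothetical names chosen for this illustration, and prices are plain BigDecimal values rather than the talk's Price type.

```scala
import scala.collection.immutable.{Queue, TreeMap}

object EagerCancelSketch {
  // Simplified stand-ins (assumptions): one order type, BigDecimal prices.
  final case class Order(id: Long, price: BigDecimal)
  type Side = TreeMap[BigDecimal, Queue[Order]]

  // Returns the updated side when the ID is found, None otherwise.
  // Every step is a linear scan: find over price levels, exists over
  // each queue, then filter over the matching queue.
  def cancel(side: Side, id: Long): Option[Side] =
    side.find { case (_, q) => q.exists(_.id == id) }.map {
      case (price, q) =>
        val remaining = q.filter(_.id != id)
        if (remaining.isEmpty) side - price
        else side.updated(price, remaining)
    }

  val offers: Side = TreeMap(
    BigDecimal("27.03") -> Queue(
      Order(1L, BigDecimal("27.03")), Order(2L, BigDecimal("27.03"))),
    BigDecimal("27.05") -> Queue(Order(3L, BigDecimal("27.05"))))
}
```

A None result corresponds to the OrderCancelRejected branch; reaching it requires scanning every price level and every queue on that side.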

Quiz!
Which operation is QueueOrderBook most
optimized for?
A. Adding a resting order
B. Crossing the book
C. Canceling a resting order
D. Rejecting an order cancel request

Answer!
Which operation is QueueOrderBook most
optimized for?
A. Adding a resting order
B. Crossing the book
C. Canceling a resting order
D. Rejecting an order cancel request

Quiz!
What is the distribution of operation frequency seen in production?
A. B. C. D.
[Four pie charts, one per option, each showing a different split among Resting, Crossing, Canceling, and Rejecting]

Answer!
E. None of the above - I haven't given you
enough information
(Forgive me for providing a trick question)

Understanding our operating environment
6 months of historical data show the distribution below:
[Pie chart: Resting, Crossing, Canceling, Rejecting]
Does the QueueOrderBook implementation strike an optimum performance balance?

Motivating Design Question #1
What operations in my system are most
performant?

Isolating the problem
When canceling an order, there are two expensive operations:
1. Identifying the price level containing the order-to-be-canceled
2. Traversing a Queue to remove the canceled order

// TreeMap lookup is expensive operation #1; Queue traversal is #2
case class QueueOrderBook(
    offers: TreeMap[Price, Queue[SellLimitOrder]])

Applying deferred evaluation
How can the order book defer the cost of
linear traversal to modify internal state?
Queue up your ideas!

Why is all this work being performed now?

How can I decompose the problem into
smaller discrete chunks?

New idea

case class LazyCancelOrderBook(
    pendingCancelIds: Set[OrderId],
    bids: TreeMap[Price, Queue[BuyLimitOrder]],
    offers: TreeMap[Price, Queue[SellLimitOrder]])

Like Queue, let's add state to optimize the slowest and most frequently seen operation: canceling

A tale of two order cancel requests

QueueOrderBook:
1. Is there a bid with matching ID?
     TreeMap.find: O(N)
     Queue.exists: O(N)
2. Yes: remove order within price level queue
     Queue.filter: O(N)
3. Yields OrderCanceled

LazyCancelOrderBook:
1. Add order ID to pending cancels
     Set.+: effectively O(1)
2. Yields OrderCanceled

// Similar to Queue.enqueue
def handleCancelOrder(
    currentTime: () => EventInstant,
    ob: LazyCancelOrderBook,
    id: OrderId): (LazyCancelOrderBook, Event) =
  ob.copy(pendingCancelIds = ob.pendingCancelIds + id) ->
    OrderCanceled(currentTime(), id)

Source: http://previews.123rf.com/images/kaarsten/kaarsten1102/kaarsten110200033/8723062-Stylized-red-stamp-showing-the-term-mission-accomplished-All-on-white-background--Stock-Photo.jpg

Will this unit test pass?

"""Given empty book
  |When cancel order arrives
  |Then OrderCancelRejected
""".stripMargin ! Prop.forAll(
  OrderId.genOrderId,
  CommandInstant.genCommandInstant,
  EventInstant.genEventInstant) { (id, ci, ei) =>
  LazyCancelOrderBook.handle(
    () => ei, LazyCancelOrderBook.empty,
    CancelOrder(ci, id))._2 ====
    OrderCancelRejected(ei, id)
}

Public API that supports all order book operations:
def handle(
    c: Command): (LazyCancelOrderBook, Event)

It does not: handleCancelOrder as written always yields OrderCanceled, so a cancel against an empty book is never rejected.

Can I change any constraints to allow me to
model the problem differently?

Let's try again

case class LazyCancelOrderBook(
    activeIds: Set[OrderId],
    pendingCancelIds: Set[OrderId],
    offers: TreeMap[Price, Queue[SellLimitOrder]])

Since order cancel reject support is a hard requirement, the new implementation needs additional state

Rejecting invalid cancel requests

QueueOrderBook:
1. Is there a bid with matching ID?
     TreeMap.find: O(N)
     Queue.exists: O(N)
2. No: is there an offer with matching ID?
     TreeMap.find: O(N)
     Queue.exists: O(N)
3. No: yields OrderCancelRejected

LazyCancelOrderBook:
1. Is there an active order with matching ID?
     Set.contains: effectively O(1)
2. No: yields OrderCancelRejected

def handleCancelOrder(
    id: OrderId): (LazyCancelOrderBook, Event) =
  ob.activeIds.contains(id) match {
    case true => ob.copy(
      activeIds = ob.activeIds - id,
      pendingCancelIds = ob.pendingCancelIds + id) ->
        OrderCanceled(currentTime(), id)
    case false => ob ->
      OrderCancelRejected(currentTime(), id)
  }
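A self-contained sketch of this cancel handler follows. Book, the Long IDs, and the timestamp-free events are simplifications assumed for illustration; the price-level queues are left out because canceling never touches them.

```scala
object LazyCancelSketch {
  // Simplified events without timestamps (assumption).
  sealed trait Event
  final case class OrderCanceled(id: Long) extends Event
  final case class OrderCancelRejected(id: Long) extends Event

  // Only the two ID sets take part in canceling.
  final case class Book(activeIds: Set[Long], pendingCancelIds: Set[Long])

  def handleCancelOrder(ob: Book, id: Long): (Book, Event) =
    if (ob.activeIds.contains(id))
      // Known order: move the ID from active to pending cancel.
      (ob.copy(
         activeIds = ob.activeIds - id,
         pendingCancelIds = ob.pendingCancelIds + id),
       OrderCanceled(id))
    else
      // Unknown (or already canceled) order: leave the book unchanged.
      (ob, OrderCancelRejected(id))

  val book: Book = Book(activeIds = Set(1L, 2L), pendingCancelIds = Set.empty)
  val (afterCancel, firstEvent) = handleCancelOrder(book, 1L)
  val (afterRepeat, secondEvent) = handleCancelOrder(afterCancel, 1L)
}
```

Canceling the same ID twice is rejected the second time because the first cancel already removed it from activeIds.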

Resting buy order requests

1. Is there a resting sell order priced <= the buy order?
2. No: add resting buy order
     TreeMap.get: O(log N)
     Queue.enqueue: O(1)
     Set.+: effectively O(1)
     TreeMap.+: O(log N)
3. Yields LimitOrderAdded

def handleAddLimitOrder(
    lo: LimitOrder): (LazyCancelOrderBook, Event) = lo match {
  case b: BuyLimitOrder =>
    ob.bestOffer.exists(_.price.value <= b.price.value) match {
      case true => ??? // Omitted
      // Continuing to defer evaluation of canceled order requests
      case false =>
        val orders = ob.bids.getOrElse(b.price, Queue.empty)
        ob.copy(
          bids = ob.bids + (b.price -> orders.enqueue(b)),
          activeIds = ob.activeIds + b.id) ->
          LimitOrderAdded(currentTime())
    }
  case s: SellLimitOrder =>
    ??? // Omitted
}

Source: http://images.askmen.com/1080x540/2016/03/14-032037-how_to_correctly_roll_up_your_sleeves_basics.jpg
We can't defer anymore

def handleAddLimitOrder(
    lo: LimitOrder): (LazyCancelOrderBook, Event) = lo match {
  case b: BuyLimitOrder =>
    ob.bestOffer.exists(_.price.value <= b.price.value) match {
      case true => ob.offers.headOption.fold(restLimitOrder) {
        case (p, q) => ??? // We need to fill this in
      }
      case false => restLimitOrder
    }
  // SellLimitOrder case omitted
}
What are our goals?

Given:
21.07 -> Canceled (ID = 1), Canceled (ID = 2), Canceled (ID = 3), Active (ID = 4), Canceled (ID = 5), Active (ID = 6)

We want the following final state:
21.07 -> Canceled (ID = 5), Active (ID = 6)

Goals:
● Find active resting sell order to generate OrderExecuted event
● Remove canceled resting orders found in front of active resting sell

Translating our goals to code (1/3)

@tailrec
def findActiveOrder(
    q: Queue[SellLimitOrder],
    idsToRemove: Set[OrderId]):
    (Option[SellLimitOrder],        // Optionally, find an active order to generate execution
     Option[Queue[SellLimitOrder]], // Optionally, have a non-empty queue remaining after
                                    // removing matching active order and canceled orders
     Set[OrderId]) = ???            // Set of canceled order IDs to discard

Translating our goals to code (2/3)

@tailrec
def findActiveOrder(
    q: Queue[SellLimitOrder],
    idsToRemove: Set[OrderId]):
    (Option[SellLimitOrder], Option[Queue[SellLimitOrder]],
     Set[OrderId]) =
  q.dequeueOption match {
    case Some((o, qq)) =>
      ob.pendingCancelIds.contains(o.id) match {
        // Found canceled order; keep recursing
        case true => findActiveOrder(qq, idsToRemove + o.id)
        // Found active order; stop recursing
        case false => (
          Some(o),
          if (qq.nonEmpty) Some(qq) else None,
          idsToRemove + o.id)
      }
    // Queue emptied without finding active order; stop recursing
    case None => (None, None, idsToRemove)
  }
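Running the same recursion on the six-order price level from the goals slide shows the intended outcome. This is a sketch under simplified assumptions: Int IDs, BigDecimal prices, and pendingCancelIds passed as a parameter instead of being read from an enclosing order book.

```scala
import scala.annotation.tailrec
import scala.collection.immutable.Queue

object FindActiveOrderSketch {
  // Simplified stand-in for the talk's SellLimitOrder (assumption).
  final case class SellLimitOrder(id: Int, price: BigDecimal)

  @tailrec
  def findActiveOrder(
      q: Queue[SellLimitOrder],
      pendingCancelIds: Set[Int],
      idsToRemove: Set[Int] = Set.empty)
      : (Option[SellLimitOrder], Option[Queue[SellLimitOrder]], Set[Int]) =
    q.dequeueOption match {
      // Canceled order at the front: discard it and keep looking.
      case Some((o, rest)) if pendingCancelIds.contains(o.id) =>
        findActiveOrder(rest, pendingCancelIds, idsToRemove + o.id)
      // Active order at the front: stop and report what is left.
      case Some((o, rest)) =>
        (Some(o), if (rest.nonEmpty) Some(rest) else None, idsToRemove + o.id)
      // Queue drained without finding an active order.
      case None => (None, None, idsToRemove)
    }

  // Price level 21.07 holding orders 1..6; orders 1, 2, 3 and 5 are canceled.
  val price: BigDecimal = BigDecimal("21.07")
  val level: Queue[SellLimitOrder] =
    Queue.empty[SellLimitOrder] ++ (1 to 6).map(i => SellLimitOrder(i, price))
  val (found, remaining, removed) = findActiveOrder(level, Set(1, 2, 3, 5))
}
```

Order 4 is returned for execution, orders 5 and 6 stay queued, and IDs 1 through 4 are reported for removal. The canceled order 5 is left in place because it sits behind the first active order, which is exactly the deferred work the design accepts.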

Translating our goals to code (3/3)

// Earlier segments omitted
findActiveOrder(q, Set.empty) match {
  // Found an active order and queue is non-empty
  case (Some(o), Some(qq), rms) => (ob.copy(
    offers = ob.offers + (o.price -> qq),
    pendingCancelIds = ob.pendingCancelIds -- rms,
    activeIds = ob.activeIds -- rms),
    OrderExecuted(currentTime(),
      Execution(b.id, o.price), Execution(o.id, o.price)))
  // Found an active order and queue is empty
  case (Some(o), None, rms) => (ob.copy(
    offers = ob.offers - o.price,
    activeIds = ob.activeIds -- rms),
    OrderExecuted(currentTime(),
      Execution(b.id, o.price), Execution(o.id, o.price)))
  // Since no active order was found, the price level must be empty
  case (None, _, rms) =>
    val bs = ob.bids.getOrElse(b.price, Queue.empty).enqueue(b)
    (ob.copy(bids = ob.bids + (b.price -> bs),
      offers = ob.offers - p,
      activeIds = ob.activeIds -- rms + b.id),
      LimitOrderAdded(currentTime()))
}

Source: http://www.sssupersports.com/wp-content/uploads/2014/11/12-rowen-f430-dashboard.jpg
But, how fast is it?

How to measure?
The 3 most important rules about microbenchmarking:
1. Use JMH
2. Use JMH
3. Use JMH

The shape of a JMH test
Test state
Test configuration
Benchmarks

JMH: Test state (1 of 2)

// Which groups of threads share the state defined below?
@State(Scope.Benchmark)
class BookWithLargeQueue {
  // What test state do we want to control when running the test?
  @Param(Array("1", "10"))
  var enqueuedOrderCount: Int = 0
  // Note var usage to manage state
  var eagerBook: QueueOrderBook = QueueOrderBook.empty
  var lazyBook: LazyCancelOrderBook =
    LazyCancelOrderBook.empty
  var cancelLast: CancelOrder =
    CancelOrder(CommandInstant.now(), OrderId(-1))
  // More state to come
}

JMH: Test state (2 of 2)

class BookWithLargeQueue {
  // Defined vars above

  // How often will the state be re-initialized?
  @Setup(Level.Trial)
  def setup(): Unit = {
    // Mutable state is initialized here
    cancelLast = CancelOrder(
      CommandInstant.now(), OrderId(enqueuedOrderCount))
    eagerBook = {
      (1 to enqueuedOrderCount).foldLeft(
        QueueOrderBook.empty) { case (ob, i) =>
        QueueOrderBook.handle(
          () => EventInstant.now(),
          ob,
          AddLimitOrder(
            CommandInstant.now(),
            BuyLimitOrder(OrderId(i),
              Price(BigDecimal(1.00)))))._1
      }
    }
    lazyBook = ??? // Same as eagerBook
  }
}

JMH: Test configuration

@BenchmarkMode(Array(Throughput))
@OutputTimeUnit(TimeUnit.SECONDS)
@Warmup(iterations = 3, time = 5, timeUnit = TimeUnit.SECONDS)
@Measurement(iterations = 30, time = 10, timeUnit = TimeUnit.SECONDS)
@Fork(value = 1, warmups = 1, jvmArgs = Array("-Xms1G", "-Xmx1G"))
class CancelBenchmarks { ... }

JMH: Benchmarks

class CancelBenchmarks {
  import CancelBenchmarks._

  @Benchmark
  def eagerCancelLastOrderInLine(b: BookWithLargeQueue):
      (QueueOrderBook, Event) =
    QueueOrderBook.handle(systemEventTime, b.eagerBook,
      b.cancelLast)

  @Benchmark
  def eagerCancelFirstOrderInLine(b: BookWithLargeQueue):
      (QueueOrderBook, Event) =
    QueueOrderBook.handle(systemEventTime, b.eagerBook,
      b.cancelFirst)

  @Benchmark
  def eagerCancelNonexistentOrder(b: BookWithLargeQueue):
      (QueueOrderBook, Event) =
    QueueOrderBook.handle(systemEventTime, b.eagerBook,
      b.cancelNonexistent)

  // Same for LazyCancelOrderBook
}

Source: http://s.newsweek.com/sites/www.newsweek.com/files/2016/06/06/ai-google-red-button-artificial-intelligence.jpg
sbt 'project chapter4' 'jmh:run CancelBenchmarks -foe true'

JMH results (1 of 2)

Benchmark                     Enqueued order count   Throughput (ops per second)   Error as percentage of throughput
eagerCancelFirstOrderInLine   1                       6,912,696.09                 ± 0.44
lazyCancelFirstOrderInLine    1                      25,676,031.50                 ± 0.22
eagerCancelFirstOrderInLine   10                      2,332,046.09                 ± 0.96
lazyCancelFirstOrderInLine    10                     12,656,750.43                 ± 0.31
eagerCancelLastOrderInLine    1                       5,641,784.63                 ± 0.49
lazyCancelLastOrderInLine     1                      25,619,665.34                 ± 0.48
eagerCancelLastOrderInLine    10                      1,788,885.62                 ± 0.39
lazyCancelLastOrderInLine     10                     13,269,215.32                 ± 0.30

JMH results (2 of 2)

Benchmark                     Enqueued order count   Throughput (ops per second)   Error as percentage of throughput
eagerCancelNonexistentOrder   1                       9,351,630.96                 ± 0.19
lazyCancelNonexistentOrder    1                      31,742,147.67                 ± 0.65
eagerCancelNonexistentOrder   10                      6,897,164.11                 ± 0.25
lazyCancelNonexistentOrder    10                     24,102,925.78                 ± 0.24
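To compare the two implementations at a glance, the throughput figures can be turned into lazy/eager ratios. The snippet below is only arithmetic over the numbers reported in the two results tables; the object and field names are made up for this sketch.

```scala
object CancelSpeedupSketch {
  // (benchmark, enqueued order count, eager ops/s, lazy ops/s),
  // copied from the JMH results tables.
  val results: List[(String, Int, Double, Double)] = List(
    ("CancelFirstOrderInLine", 1, 6912696.09, 25676031.50),
    ("CancelFirstOrderInLine", 10, 2332046.09, 12656750.43),
    ("CancelLastOrderInLine", 1, 5641784.63, 25619665.34),
    ("CancelLastOrderInLine", 10, 1788885.62, 13269215.32),
    ("CancelNonexistentOrder", 1, 9351630.96, 31742147.67),
    ("CancelNonexistentOrder", 10, 6897164.11, 24102925.78))

  // Ratio of lazy throughput to eager throughput for each row.
  val speedups: List[(String, Int, Double)] =
    results.map { case (name, n, eagerOps, lazyOps) =>
      (name, n, lazyOps / eagerOps)
    }

  def main(args: Array[String]): Unit =
    speedups.foreach { case (name, n, s) =>
      println(f"$name%-24s n=$n%2d lazy/eager = $s%.2fx")
    }
}
```

For these six rows the ratio stays between roughly 3x and 7.5x in favor of LazyCancelOrderBook, with the largest gap when canceling the last order in a queue of 10.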

Poll
Would you release
LazyCancelOrderBook into production?
How about on a Friday?

Motivating design questions

Question: What operations in my system are most performant?
Application to the order book example: Executing an order and resting an order on the book are the most performant operations. We leveraged fast execution time to perform removals of canceled orders from the book.

Question: Why am I performing all of these steps now?
Application to the order book example: Originally, order removal happened eagerly because it was the most logical way to model the process.

Question: How can I decompose the problem into smaller discrete chunks?
Application to the order book example: The act of canceling was decomposed into identifying the event sent to the requester and removing the canceled order from the book state.

Question: Can I change any constraints to allow me to model the problem differently?
Application to the order book example: Ideally, we would have liked to remove the constraint requiring rejection of non-existent orders. Unfortunately, this was out of our control.

Thank you!
Book repo: https://github.com/PacktPublishing/Scala-High-Performance-Programming
