This document discusses approaches for designing concurrent applications. It compares task-based and actor-based concurrency, traditional locking approaches versus software transactional memory (STM), and data replication versus decentralized data stores. The key points are that actor models may be better for event-driven systems, STM enables composable operations, and decentralized data can improve performance of complex queries over large datasets. It emphasizes testing approaches before assuming performance impacts and having use cases in mind when choosing patterns.
4. Traditional Approaches
Task-based thread pools
(e.g. database connections, server sockets)
Hand-coded locks
(to access shared memory "safely")
Data: sharding or replication
(to increase throughput on data access)
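The traditional approach above can be sketched in a few lines: a task-based thread pool plus a hand-coded lock guarding shared memory. This is a minimal illustration; the names (`task`, `counter`) are ours, not from any particular framework.

```python
# Minimal sketch: task-based thread pool + a hand-coded lock
# protecting shared state. Names are illustrative.
from concurrent.futures import ThreadPoolExecutor
from threading import Lock

counter = 0
counter_lock = Lock()  # hand-coded lock guarding the shared counter

def task(n):
    global counter
    with counter_lock:  # task threads compete for this lock
        counter += n

with ThreadPoolExecutor(max_workers=4) as pool:  # task-based thread pool
    for _ in range(100):
        pool.submit(task, 1)

print(counter)  # 100
```

Without `counter_lock`, the `counter += n` read-modify-write would race; the "safety" here is exactly as good as the hand-coded locking discipline.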
5. Less Traditional Approaches
Actor-based processes
(e.g. message passing like in Erlang)
Software Transactional Memory (STM)
(consistent and safe way to access shared state)
Data: decentralized datastores
(run map/reduce queries on many nodes at once)
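An actor-based process can be approximated with a thread that owns its state privately and a mailbox queue for incoming messages, in the spirit of Erlang-style message passing. This is a hedged sketch, not a real actor runtime; `CounterActor` is an illustrative name.

```python
# Illustrative actor: private state + a mailbox of messages.
# No shared state, no locks; communication is send-only.
import queue
import threading

class CounterActor:
    def __init__(self):
        self.mailbox = queue.Queue()  # buffers incoming messages
        self.count = 0                # private state, never shared
        self._thread = threading.Thread(target=self._run)
        self._thread.start()

    def _run(self):
        while True:
            msg = self.mailbox.get()  # react to one message at a time
            if msg is None:           # None = stop sentinel
                break
            self.count += msg

    def send(self, msg):              # asynchronous: returns immediately
        self.mailbox.put(msg)

    def stop(self):
        self.mailbox.put(None)
        self._thread.join()

actor = CounterActor()
for _ in range(100):
    actor.send(1)
actor.stop()
print(actor.count)  # 100
```

Note there is no lock around `self.count`: only the actor's own thread ever touches it.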
6. Task-based vs Actor-based
Task-based:
task threads access shared state in objects
task threads compete for locks on objects
synchronous operations within a task thread
limited task scheduling (e.g. wait, notify)
Actor-based:
mailboxes buffer incoming messages
actors do not share state, thus not competing for locks
messages are sent asynchronously
actors react to messages sent to them
8. When might actors be better?
the complexity of the task-based model becomes the bottleneck
(debugging race conditions, deadlocks, livelocks,
starvation); depends on your use case
the system is conceptually event-driven, which is easier to
translate into a high-level abstraction in an actor-based model
9. Locks vs STM
Locks:
flexibility: fine- vs coarse-grained choice
pessimistic locking
locking semantics need to be hand-coded
composable operations are not well supported
STM:
analogous to a database transaction log, recording each txn as a log entry
optimistic reading
atomic transactions
supports composable operations
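The optimistic, log-and-retry style of STM can be sketched as a toy compare-and-commit loop. Real STMs (e.g. in Clojure or Haskell) do far more per transaction; `Ref` and `atomically` here are illustrative names, not a real STM API.

```python
# Toy sketch of optimistic, retry-based transactions (STM-style).
# Reads are optimistic; the commit checks that nobody else committed
# in the meantime, and retries the whole function if someone did.
import threading

class Ref:
    _commit_lock = threading.Lock()  # single tiny commit point
    def __init__(self, value):
        self.value, self.version = value, 0

def atomically(ref, fn):
    while True:                                        # retry loop
        seen_version, seen_value = ref.version, ref.value  # optimistic read
        new_value = fn(seen_value)                     # compute outside locks
        with Ref._commit_lock:                         # atomic commit check
            if ref.version == seen_version:
                ref.value, ref.version = new_value, ref.version + 1
                return new_value
        # another txn committed first: discard our work and retry

balance = Ref(100)
threads = [threading.Thread(target=atomically, args=(balance, lambda v: v + 1))
           for _ in range(50)]
for t in threads: t.start()
for t in threads: t.join()
print(balance.value)  # 150
```

Note that `fn` may run several times before a commit succeeds: this is why STM requires operations on shared state to be undoable and side-effect-free (see slide 11).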
11. When to use STM?
STM's performance gains show up on larger numbers of
cores/processors (roughly 4 or more)
Hand-coding and debugging locking semantics to prevent
deadlocks and livelocks becomes your application's
bottleneck
Priority inversion often hinders performance
BUT you can't use STM when an operation on shared
state cannot be undone: transactions may retry or roll
back, so operations must be undoable!
12. Replication vs Decentralized
Replication:
can improve throughput
some flexibility: replication strategies for a few use cases
requires full replica(s) of the data set on each node
Decentralized:
improves throughput and the performance of complex queries using map/reduce
flexibility to optimize two of three: Consistency, Availability, Partition tolerance (CAP Theorem)
does not require full replica(s) of the data set
13. When to use decentralized data?
Large data set you want to distribute without
creating/managing your own sharding scheme
Want to optimize two of CAP
Run distributed map/reduce for complex queries
BUT the datastore should satisfy your other needs first.
Usually key-value/bucket lookup, not an RDBMS!
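The map/reduce benefit above comes from running the map step on each node against its local partition, so only small partial results travel to the reducer. A hedged single-process sketch, with "nodes" simulated as plain dicts (all data and names invented for illustration):

```python
# Sketch of a distributed map/reduce query: count values across
# partitions held by different "nodes" (here, just local dicts).
from collections import Counter
from functools import reduce

nodes = [                              # three nodes, one partition each
    {"u1": "apple", "u2": "banana"},
    {"u3": "apple", "u4": "apple"},
    {"u5": "banana"},
]

def map_phase(partition):              # runs locally on each node
    return Counter(partition.values())

def reduce_phase(a, b):                # merges small partial results
    return a + b

totals = reduce(reduce_phase, (map_phase(p) for p in nodes))
print(totals["apple"], totals["banana"])  # 3 2
```

In a real decentralized datastore the map phase executes on many machines at once; only the `Counter`-sized partials cross the network.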
14. Other Approaches...(not in production)
Compiler parallel optimizations
e.g. Haskell sparks
Persistent data structures
to aid concurrency throughput by better API design
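The persistent-data-structure idea can be shown with a minimal immutable linked list: "updates" return a new version that shares structure with the old one, so concurrent readers never need locks. A sketch with invented names (`Node`, `push`):

```python
# Illustrative persistent (immutable) singly linked list.
# push() never mutates: it returns a new version sharing the old tail.
from typing import NamedTuple, Optional

class Node(NamedTuple):
    head: int
    tail: Optional["Node"]

def push(lst, value):              # O(1); the old list is untouched
    return Node(value, lst)

v1 = push(push(None, 1), 2)        # version 1: [2, 1]
v2 = push(v1, 3)                   # version 2: [3, 2, 1]
assert v2.tail is v1               # structural sharing, not a copy
assert v1.head == 2                # old version still valid for readers
```

Because no version is ever mutated, a reader holding `v1` is unaffected by writers producing `v2`, which is the concurrency-throughput win the API design buys.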
15. General Tips
Use SLA metrics/measures to optimize relevant parts of
your concurrent system judiciously
Ensure your applications fit use case(s) for approach
Test your hypothesis by benchmarking
NEVER assume your changes have made the impact you expect.
There is no silver bullet: think, implement and test!