My talk about Tarantool and Lua at Percona Live 2016

In my talk I will focus on a practical use case: a task queue application that uses Tarantool as both an application server and a database. The idea of a task queue is that producers put tasks (objects) into a queue, and consumers take tasks, perform them, and mark them as completed. The queue must guarantee certain properties: if a consumer fails, its task returns to the queue automatically; a task can't be taken by more than one consumer; and task priorities are honored.

With Tarantool, a task queue is a distributed networked application: there are multiple consumer/producer endpoints (hosts) through which a user can interact with the queue. The queue itself is a fault-tolerant distributed database: every task is stored in a Tarantool database and replicated in multiple copies. If a machine goes down, the state of a task is tracked on a replica, and the user can continue working with the queue through that replica. Total power failure is also not an issue, since tasks are stored persistently on disk with transactional semantics. The performance of such an application is in the hundreds of thousands of transactions per second. At the same time, the queue is highly customizable: it is written entirely in Lua and packaged as a Lua rock, yet the code runs inside the database. This is the strength of Lua: one size doesn't have to fit all, and you don't have to sacrifice performance if you need customization.

The second part of the talk will cover implementation details and performance numbers, in particular a performance comparison with other queue products (beanstalkd, RabbitMQ), and an overview of the implementation from the language-bindings point of view: how we make the database API available in Lua, and what the challenges and performance hurdles of such a binding are.
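The guarantees listed in the abstract can be illustrated with a minimal single-process sketch in Python. It is not the Tarantool queue implementation: the class `TaskQueue`, the `ttr` (time-to-run) parameter, and the explicit `now` argument are assumptions made for illustration, and replication and persistence are left out entirely.

```python
import heapq
import itertools


class TaskQueue:
    """Toy in-memory queue: lower `pri` is served first, and a task that is
    not acknowledged within `ttr` seconds goes back to the ready set."""

    def __init__(self):
        self._ready = []               # heap of (pri, task_id)
        self._taken = {}               # task_id -> (deadline, pri)
        self._data = {}                # task_id -> payload
        self._ids = itertools.count()  # id source; ids also break priority ties

    def put(self, payload, pri=0):
        task_id = next(self._ids)
        self._data[task_id] = payload
        heapq.heappush(self._ready, (pri, task_id))
        return task_id

    def _requeue_expired(self, now):
        # A consumer that died never acks, so its deadline eventually passes.
        for task_id, (deadline, pri) in list(self._taken.items()):
            if deadline <= now:
                del self._taken[task_id]
                heapq.heappush(self._ready, (pri, task_id))

    def take(self, now, ttr=60.0):
        self._requeue_expired(now)
        if not self._ready:
            return None
        pri, task_id = heapq.heappop(self._ready)
        # Popping removes the task from the ready set, so no second
        # consumer can take it until it is released or it expires.
        self._taken[task_id] = (now + ttr, pri)
        return task_id, self._data[task_id]

    def ack(self, task_id):
        if task_id not in self._taken:
            raise KeyError(f"task {task_id} is not taken")
        del self._taken[task_id]
        del self._data[task_id]
```

Time is passed in explicitly so the behaviour is deterministic: a task taken at `now=0` with `ttr=10` and never acknowledged becomes available again to a `take(now=11)` call.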

Tarantool - a Lua-based database engine and in-memory execution grid
http://try.tarantool.org
kostja@tarantool.org

Spoiler
● Tarantool is an open source in-memory database
● Try it at http://try.tarantool.org
● Read more at http://bit.ly/1ShfmZD and http://bit.ly/1QiGvcf

Mail.Ru Group
● 5th biggest email service in the world
● 6th biggest web portal in the world
● 149M+ monthly active users

Maintaining ACID: isolation
● Isolation: concurrent execution of transactions results in a state that would be obtained if transactions were executed serially
● A schedule: a possible history of transaction execution, establishing the order in which data change operations occurred
Let X, Y, Z be data items
E = r1[x] w1[x] w2[y] r2[z]
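
The schedule notation above can be checked mechanically. The following Python sketch uses the textbook precedence-graph test for conflict serializability; the slide itself does not name this test, and the function names are made up for illustration. Two operations conflict when they come from different transactions, touch the same item, and at least one of them is a write.

```python
import re


def parse_schedule(text):
    """Turn 'r1[x] w2[y]' into [('r', 1, 'x'), ('w', 2, 'y')]."""
    return [(op, int(txn), item)
            for op, txn, item in re.findall(r"([rw])(\d+)\[(\w+)\]", text)]


def conflict_edges(schedule):
    """Edge (a, b) means an operation of a conflicts with a later one of b."""
    edges = set()
    for i, (op_a, txn_a, item_a) in enumerate(schedule):
        for op_b, txn_b, item_b in schedule[i + 1:]:
            if txn_a != txn_b and item_a == item_b and "w" in (op_a, op_b):
                edges.add((txn_a, txn_b))
    return edges


def is_conflict_serializable(schedule):
    """True when the precedence graph has no cycle."""
    edges = conflict_edges(schedule)
    nodes = {txn for _, txn, _ in schedule}
    while nodes:
        # Transactions with no incoming edge from a remaining node can go first.
        free = {n for n in nodes
                if not any(dst == n and src in nodes for src, dst in edges)}
        if not free:
            return False  # every remaining node waits on another: a cycle
        nodes -= free
    return True
```

For the schedule E above, t1 and t2 never touch the same item, so the graph has no edges and the schedule is serializable. A schedule such as `r1[x] w2[x] w1[x]` produces edges in both directions between t1 and t2 and is rejected.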

Isolation: a classic answer
If t1 uses X, ensure X doesn't change until t1 ends
● Concurrent transactions work with disjoint sets of data
● The order in which a data item is concurrently modified is restricted by locking
Two-Phase Locking Theorem: If all transactions in an execution are two-phase locked, then the execution is serializable.
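
A transaction is two-phase locked when it acquires no lock after it has released one. The sketch below checks that property for a single transaction's lock trace; the trace format (a list of `("lock" | "unlock", item)` pairs) is an assumption made for this example.

```python
def is_two_phase(ops):
    """Return True if no 'lock' follows an 'unlock' in the trace."""
    held = set()
    shrinking = False  # becomes True at the first unlock
    for action, item in ops:
        if action == "lock":
            if shrinking:
                return False  # growing again after shrinking breaks 2PL
            held.add(item)
        elif action == "unlock":
            if item not in held:
                raise ValueError(f"unlock of {item!r} which is not held")
            held.remove(item)
            shrinking = True
        else:
            raise ValueError(f"unknown action {action!r}")
    return True
```

A trace that locks x and y and then unlocks both passes. A trace that unlocks x before locking y fails, because another transaction could modify x in between.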

Fallacy of caching
[Figure: two page layouts. First: page header, modification log, page directory, compressed data, BLOB pointers, empty space, page trailer. Second: page header, row offset array, rows, page trailer. Row detail: trx id, roll pointer, field pointers, field 1, field 2, ..., field n.]

Parallel computing is difficult
Classical databases use threads and locking for concurrency:
● limited scalability
Classical clients use synchronous network protocols
● but we need HTTP/2.0 for databases

Solution
● make the database 100% RAM resident
● static transactions run serially in a dedicated thread
● no need for locking, latching!
● 1024 cores-ready: begin sharding even on a single host
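
The idea of running transactions serially in a dedicated thread can be sketched in Python as follows. This is an illustration only, not Tarantool's architecture: `SerialExecutor` and `demo` are hypothetical names, and Python threads stand in for client connections.

```python
import queue
import threading


class SerialExecutor:
    """Runs submitted callables one at a time on a single dedicated thread."""

    def __init__(self):
        self._inbox = queue.Queue()
        self._thread = threading.Thread(target=self._run, daemon=True)
        self._thread.start()

    def _run(self):
        while True:
            item = self._inbox.get()
            if item is None:  # shutdown marker
                break
            txn, done, result = item
            try:
                result["value"] = txn()
            except Exception as exc:
                result["error"] = exc
            finally:
                done.set()

    def submit(self, txn):
        """Block the caller until the transaction has run; return its result."""
        done, result = threading.Event(), {}
        self._inbox.put((txn, done, result))
        done.wait()
        if "error" in result:
            raise result["error"]
        return result["value"]

    def close(self):
        self._inbox.put(None)
        self._thread.join()


def demo(clients=4, per_client=500):
    db = {"counter": 0}  # only the executor thread ever touches this dict
    executor = SerialExecutor()

    def increment():
        db["counter"] += 1  # no lock needed: transactions never interleave
        return db["counter"]

    def client():
        for _ in range(per_client):
            executor.submit(increment)

    threads = [threading.Thread(target=client) for _ in range(clients)]
    for t in threads:
        t.start()
    for t in threads:
        t.join()
    executor.close()
    return db["counter"]
```

Because only one thread reads and writes the data, the read-modify-write in `increment` cannot lose updates even though several clients submit concurrently.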

Maintaining the write ahead log
● t1 wrote X and began commit I/O
● t2 starts, reads X and begins a commit
● t1 commit fails (I/O error)
→ we need to roll back t2 on roll back of t1 (cascading rollback)
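
The scenario above can be sketched with an in-memory store that applies changes before the log write is confirmed and keeps an undo record per transaction. The `Store` class and its method names are assumptions made for this example. The sketch is deliberately conservative: when a log write fails it rolls back every later pending transaction, not only those that read the failed transaction's data.

```python
_MISSING = object()  # marks "key did not exist before the write"


class Store:
    def __init__(self):
        self.data = {}
        self.pending = []  # [(txn_id, undo records)], oldest first

    def apply(self, txn_id, writes):
        """Apply writes in memory; the log write is still in flight."""
        undo = [(key, self.data.get(key, _MISSING)) for key in writes]
        self.data.update(writes)
        self.pending.append((txn_id, undo))

    def wal_written(self, txn_id):
        """The log write succeeded: the oldest pending transaction is durable."""
        if not self.pending or self.pending[0][0] != txn_id:
            raise ValueError("log confirmations must arrive in commit order")
        self.pending.pop(0)

    def wal_failed(self, txn_id):
        """Undo txn_id and everything applied after it, newest first."""
        if all(tid != txn_id for tid, _ in self.pending):
            raise ValueError(f"{txn_id!r} is not pending")
        rolled_back = []
        while self.pending:
            tid, undo = self.pending.pop()
            for key, old in undo:
                if old is _MISSING:
                    self.data.pop(key, None)
                else:
                    self.data[key] = old
            rolled_back.append(tid)
            if tid == txn_id:
                break
        return rolled_back
```

If t1 writes X, t2 then reads X and writes Y, and the log write of t1 fails, `wal_failed("t1")` undoes t2 first and then t1, which restores the state from before t1 started.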

Approaching concurrency
With shared state:
● locking (hello deadlocks, hotspots, convoying, starvation, priority inversion) ← not composable
● wait-free algorithms ← limited parallelism
Without shared state:
● hardware transactional memory ← still not here
● functional programming ← not for databases
● actor model ← yes!

Actor model in Tarantool
● green threads
● CPU efficient memory management
● memory efficient data structures
● complex indexing (B-tree, R-tree)
● the actor runtime is available to developers
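
The actor idea, state owned by one green thread and reached only through messages, can be sketched with Python's asyncio. Here asyncio tasks stand in for green threads and `asyncio.Queue` stands in for a mailbox; this is an analogy chosen for the example, not Tarantool's API, and `counter_actor`, `ask`, and `main` are hypothetical names.

```python
import asyncio


async def counter_actor(mailbox):
    """Owns `total`; no other task can read or write it directly."""
    total = 0
    while True:
        message, reply = await mailbox.get()
        if message == "stop":
            reply.set_result(total)
            return
        total += message
        reply.set_result(total)


async def ask(mailbox, message):
    """Send a message and wait for the actor's reply."""
    reply = asyncio.get_running_loop().create_future()
    await mailbox.put((message, reply))
    return await reply


async def main():
    mailbox = asyncio.Queue()
    actor = asyncio.create_task(counter_actor(mailbox))
    # Ten concurrent senders; the actor handles their messages one by one.
    await asyncio.gather(*(ask(mailbox, n) for n in range(1, 11)))
    final = await ask(mailbox, "stop")
    await actor
    return final
```

Running `asyncio.run(main())` returns 55, the sum of 1 through 10. No lock appears anywhere because the counter is never shared.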

Not-in-your-database features
● server side scripting in Lua, C
● rich standard library: I/O, JSON, crypto, http, ...
● fibers & channels
● triggers
→ freedom to ship code to data

Application: queues
● A copy-cat of the beanstalkd API
● queue.put(), take(), ack(), delete(), release(), bury(), kick(), peek()
● Important problems of queue management are taken care of:
– task priorities, timeouts, time to live
– poisoned tasks
– nested queues
Read the full docs at https://github.com/tarantool/queue
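
One way to read the list of calls is as a task life cycle. The Python sketch below models it as a state machine, assuming beanstalkd-like semantics; the state names (`ready`, `taken`, `buried`, `done`) and the transition table are assumptions for illustration and should be checked against the queue documentation. `put()` creates a task in the `ready` state and `peek()` does not change state, so neither appears in the table.

```python
# (current state, call) -> next state; assumed, beanstalkd-like semantics
TRANSITIONS = {
    ("ready", "take"): "taken",
    ("taken", "ack"): "done",        # work finished, task removed
    ("taken", "release"): "ready",   # consumer gives the task back
    ("taken", "bury"): "buried",     # park a poisoned task for inspection
    ("buried", "kick"): "ready",     # give a buried task another chance
    ("ready", "delete"): "done",
    ("taken", "delete"): "done",
    ("buried", "delete"): "done",
}


def step(state, call):
    """Return the state after `call`, or raise if the call is not allowed."""
    try:
        return TRANSITIONS[(state, call)]
    except KeyError:
        raise ValueError(f"{call}() is not allowed in state {state!r}") from None


def replay(calls, state="ready"):
    """Apply a sequence of calls to a freshly put task."""
    for call in calls:
        state = step(state, call)
    return state
```

For example, `replay(["take", "bury", "kick", "take", "ack"])` ends in `done`: the task failed once, was buried, then kicked back to the ready set and completed on the second attempt.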

Summary
● an in-memory database is its own species
● it takes numerous insights and years of R&D to create
● in the end we have a fair 10x performance speedup in certain types of workloads
● all of the above is spiced up in Tarantool with rich application development functions
● the result is available at http://download.tarantool.org

@kostja_osipov
Thank you!
fb.com/TarantoolDatabase
www.tarantool.org
kostja@tarantool.org

12. The engineering challenge

13. Latency vs. throughput

15. Intel Xeon E5 architecture

19. The mandatory slide
