Оцениваем решения NoSQL: какая база данных подходит для вашей системы

"Оцениваем решения NoSQL: какая база данных подходит для вашей системы”, Сергей Сверчков, менеджер-проектов в Altoros.

Sergey Sverchkov
Project Manager
sergey.sverchkov@altoros.com

© ALTOROS Systems | CONFIDENTIAL

ORDER
Order
ID: 1001
Order Date: 15.9.2012
Customer






Billing Address
Street: Somestreet 10
City: Somewhere
Postal Code: 55901



ADDRESS

Line Items
Quantity

Price

Ipod Touch

1

220.95

Monster Beat

2

190.00

Apple Mouse

1

69.90

Name



CUSTOMER

First Name: Peter
Last Name: Sample

ORDER_LINES






© ALTOROS Systems | CONFIDENTIAL

2

•
•
•
•
•
•
•
•
•

© ALTOROS Systems | CONFIDENTIAL

3

•
•

•
•
•
•

© ALTOROS Systems | CONFIDENTIAL

4

•



• Workload is defined by different distributions



•

Operations of the following types:





© ALTOROS Systems | CONFIDENTIAL

5

•





•





© ALTOROS Systems | CONFIDENTIAL

6

•
 Single availability zone eu-west-1b, Ireland region
 Single security group with all required port opened
 4 m1.xlarge 64bit instances for cluster nodes: 16GB RAM, 4 vCPU, 8 ECU, highperformance network
 1 c1.xlarge 64bit instance for YSCB client: 7GB RAM, 8 vCPU, 20 ECU, highperformance network
 2 additional c1.medium 64bit instances for mongo routers: 1.7GB RAM, 2 vCPU, 5
ECU, moderate network

•
 4 EBS volumes by 25 GB each in RAID0
 EBS optimized volumes, no Provisioned IOPS
© ALTOROS Systems | CONFIDENTIAL

9

•
 partitioner: org.apache.cassandra.dht.Murmur3Partitioner
 key_cache_size_in_mb: 1024
 row_cache_size_in_mb: 6096
 JVM heap size: 6GB
 Snappy compressor
 Replica factor 1

•
 2 c1.medium nodes with mongo router process - mongos
 Replica factor 1
 Sharding by internal key “_id”

© ALTOROS Systems | CONFIDENTIAL

10

•
 Replica factor 1
 Memory + disk mode

•
 JVM heap size 12GB
 Replica factor 1

 Snappy compressor

© ALTOROS Systems | CONFIDENTIAL

11

Performance of the systems was evaluated under different workloads:







© ALTOROS Systems | CONFIDENTIAL

12

Load phase, 100.000.000 records * 1 KB, [INSERT]
9

Average latency, ms

8
7
6
5

hbase

4

cassandra

3

couchbase
mongodb

2
1
0
0

10000

20000

30000

40000

Throughput, ops/sec

© ALTOROS Systems | CONFIDENTIAL

13

Workload A: Update (Update 50%, Read 50%)
120
100

cassandra

80

couchbase
hbase

60

mongodb
40
20
0
0

500

1000

1500

2000

© ALTOROS Systems | CONFIDENTIAL

2500

3000
14

Workload A: Read (Update 50%, Read 50%)

80
70
60

50

cassandra
couch

40

hbase
mongo

30
20
10
0
0

500

1000

1500

2000

© ALTOROS Systems | CONFIDENTIAL

2500

3000
15

Workload B: Update (update 5% , read 95%)
120
100
80
cassandra
60

couch
hbase

40

mongo

20
0
0

500

1000

1500

© ALTOROS Systems | CONFIDENTIAL

2000

2500

16

Workload B: Read (update 5% , read 95%)
90

80
70
60
cassandra

50

couch

40

hbase

30

mongo

20
10
0
0

500

1000

1500

© ALTOROS Systems | CONFIDENTIAL

2000

2500

17

Workload C: 100% Read
80
70
60
50

cassandra

40

couch
hbase

30

mongo
20
10
0
0

500

1000

1500

2000

© ALTOROS Systems | CONFIDENTIAL

2500

3000

18

Workload D: Insert (insert 5% , read 95%)
60
50
40
cassandra
30

couch
hbase

20

mongo

10
0
0

500

1000

1500

2000

© ALTOROS Systems | CONFIDENTIAL

2500

3000

19

Workload D: Read (insert 5% , read 95%)
90
80
70
60
cassandra

50

couch

40

hbase

30

mongo

20
10
0
0

500

1000

1500

2000

© ALTOROS Systems | CONFIDENTIAL

2500

3000

20

400

Workload E: Insert (Insert 5%, Scan 95%)

350
300
250
200

cassandra

150

hbase

100
50
0

0

50

100

150

© ALTOROS Systems | CONFIDENTIAL

200

250
21

Workload F: read (Read-Modify-Write 50%, Read 50%)
80
70

60
50

cassandra

40

couch
hbase

30

mongo
20
10
0
0

500

1000

1500

© ALTOROS Systems | CONFIDENTIAL

2000

2500

22

Workload F: Update (Read-Modify-Write 50%, Read 50%)
140
120

100
cassandra

80

couch
60

hbase
mongo

40
20
0
0

500

1000

1500

© ALTOROS Systems | CONFIDENTIAL

2000

2500

23

Workload F: Read-Modify-Write (Read-Modify-Write 50%, Read 50%)
200
180
160
140
120

cassandra

100

couch

80

hbase

60

mongo

40
20
0
0

500

1000

1500

© ALTOROS Systems | CONFIDENTIAL

2000

2500

24

Workload G: Insert (Insert 90%, Read 10%)
35

30
25
cassandra

20

couch
15

hbase
mongo

10
5
0
0

1000

2000

3000

4000

5000

© ALTOROS Systems | CONFIDENTIAL

6000

7000

25

Workload G: Read (Insert 90%, Read 10%)
60
50
40
cassandra
30

couch
hbase

20

mongo

10
0
0

1000

2000

3000

4000

5000

© ALTOROS Systems | CONFIDENTIAL

6000

7000

26

•
•

•
•
•
•
•
•

© ALTOROS Systems | CONFIDENTIAL

27

Presented at JAX London 2013 The need to operate terabyte-size databases becomes very common these days. Unless you have implemented architectures that use NoSQL databases and frameworks that support data-intensive distributed applications, then many technology options available are probably a slight enigma. This session focuses on real-world successful attempts to benchmark four of the most popular NoSQL databases side by side. The base tool selected for the purpose of this research is Yahoo Cloud Serving Benchmark and benchmarking is performed on Amazon Elastic Compute Cloud instances.

82599 sriov vm configuration notes

Ryan Aydelott

Wireless Management Solution - from i3 Network SystemsNguyen Trung Tuyen

Virtualization

Mahesh Bitla

SR-IOV, KVM and Emulex OneConnect 10Gbps cards on Debian/Stable

juet-y

Tugas presentase knkSarminipuren01

SR-IOV, KVM and Intel X520 10Gbps cards on Debian/Stable

juet-y

Fortinet Ansible Solution Part 2

Salim Haniff

NCompass Live - May 29, 2019 http://nlc.nebraska.gov/ncompasslive/ Interested in exploring virtual reality but have no idea where to begin? Is it a passing fad or something that would benefit your patrons? What challenges can you expect to encounter when bringing VR (or XR, AR, etc.) to your library? This presentation will outline some of the different platforms, including Oculus Go, Oculus Rift, HTC Vive, and Playstation VR, that we have considered for adoption and the rationale behind our choices. We will discuss such concerns as space requirements, costs, pedagogical strengths and limitations, as well as patron access policies and necessary infrastructure. Come get real with us! Presenters: Christine Fullerton and Nate Doherty, Chadron (NE) State College and Carl Spicher, Chadron Public Library.

Vps hostingAlbertSiddiqui

Сергей Сверчков - Оцениваем решения NoSQL: какая база данных подходит для ваш...

IT Share

The Real World - Plugging the Enterprise Into It (nodejs)

EC2 NoSQL Benchmarking

Altoros

This presentation contains information on the test environment, settings, major criteria for evaluation, and component diagrams that can help you to test a NoSQL data store for your project. It also provides a matrix that compares a number of NoSQL products based on our test results. We also list the issues we encountered and some approaches we used to overcome them. For more independent research into Hadoop, NoSQL, and other big data technologies, please visit www.altoros.com/research-papers or follow @altoros.

Being HAPI! Reverse Proxying on Purpose

eMagic-Data Center Management SystemSandesh Sonar

VSPEX Blue, une infrastructure hyper-convergée simple et sûre pour votre SDDC

RSD

Solarwinds NPM 10.5 webcast

Michal Hrncirik

INDUSTRY-LEADING TECHNOLOGY FOR LONG TERM RETENTION OF BACKUPS IN THE CLOUD

EMC

Commscope-Andrew AVA5-50FX

Mobility switch security architecture scott calzia madani adjaliAruba, a Hewlett Packard Enterprise company

Cisco Cloud Networking Workshop

Cisco Canada

Commscope-Andrew LDF12-50

3 Ways to Connect to the Oracle Cloud

Simon Haslam

One of the key challenges for all public cloud providers, not just Oracle, is how to securely and reliably connect cloud services to their customers’ existing systems. This presentation provides an impartial view of Oracle Network Cloud’s three offerings, with a more detailed drill down into the VPN available for shared compute cloud. First delivered by Simon Haslam on 6 December 2016 at the UKOUG Tech16 conference

huawei-s1730s-s24t4x-a-brochure-datasheet.pdf

Hi-Network.com

Commscope-Andrew LDF5-50A

Mediehuset Ingeniøren Live

Datacenter 2014: Commscope - Arne Keller

Infrastrukturen til fremtidens løsninger Kravene til din kablede infrastruktur øges hele tiden på grund af nye hastigheder og nye applikationer. Hastigheder på 10/40 Gbit Ethernet på kobber og 40/100/400 Gbit på fiber stiller krav til din installationen på samme måde som nye applikationer gør det. Det er bl.a. applikationer som Distribueret sensor netværk og LED belysning, indendørs mobildækning og nye PoE over WLAN standarder samt bygningsstyring.

What's hot

Building a Converged Infrastructure based on FCoE, Dell Blades and Force10 sw...juet-y

Introduction to AegisSAN Q500 Series

qsantechnology

GuideIT Storage Requirements Template

Vision Concepts Infrastructure Services Solution

Nexus 1000 v access guide

networkershome

NCompass Live: Let's Get Real About Virtual Reality

Nebraska Library Commission

Vps hostingAlbertSiddiqui

What's hot (6)

Building a Converged Infrastructure based on FCoE, Dell Blades and Force10 sw...

Introduction to AegisSAN Q500 Series

GuideIT Storage Requirements Template

Nexus 1000 v access guide

NCompass Live: Let's Get Real About Virtual Reality

Vps hosting

Similar to Оцениваем решения NoSQL: какая база данных подходит для вашей системы

Сергей Сверчков - Оцениваем решения NoSQL: какая база данных подходит для ваш...

IT Share

The Real World - Plugging the Enterprise Into It (nodejs)

EC2 NoSQL Benchmarking

Altoros

Being HAPI! Reverse Proxying on Purpose

eMagic-Data Center Management SystemSandesh Sonar

VSPEX Blue, une infrastructure hyper-convergée simple et sûre pour votre SDDC

RSD

Solarwinds NPM 10.5 webcast

Michal Hrncirik

INDUSTRY-LEADING TECHNOLOGY FOR LONG TERM RETENTION OF BACKUPS IN THE CLOUD

EMC

Commscope-Andrew AVA5-50FX

Mobility switch security architecture scott calzia madani adjaliAruba, a Hewlett Packard Enterprise company

Cisco Cloud Networking Workshop

Cisco Canada

Commscope-Andrew LDF12-50

3 Ways to Connect to the Oracle Cloud

Simon Haslam

huawei-s1730s-s24t4x-a-brochure-datasheet.pdf

Hi-Network.com

Commscope-Andrew LDF5-50A

Mediehuset Ingeniøren Live

Datacenter 2014: Commscope - Arne Keller

Cooking security sans@nightjtimberman

Emc vnx2 technical deep dive workshop

solarisyougood

2. Seamless Surveillance with Juniper networks.pdf

PawachMetharattanara

JomaSoft VDCF - Solaris Private Cloud

JomaSoft

Similar to Оцениваем решения NoSQL: какая база данных подходит для вашей системы (20)

Сергей Сверчков - Оцениваем решения NoSQL: какая база данных подходит для ваш...

The Real World - Plugging the Enterprise Into It (nodejs)

EC2 NoSQL Benchmarking

Being HAPI! Reverse Proxying on Purpose

eMagic-Data Center Management System

VSPEX Blue, une infrastructure hyper-convergée simple et sûre pour votre SDDC

Solarwinds NPM 10.5 webcast

INDUSTRY-LEADING TECHNOLOGY FOR LONG TERM RETENTION OF BACKUPS IN THE CLOUD

Commscope-Andrew AVA5-50FX

Mobility switch security architecture scott calzia madani adjali

Cisco Cloud Networking Workshop

Commscope-Andrew LDF12-50

3 Ways to Connect to the Oracle Cloud

huawei-s1730s-s24t4x-a-brochure-datasheet.pdf

Commscope-Andrew LDF5-50A

Datacenter 2014: Commscope - Arne Keller

Cooking security sans@night

Emc vnx2 technical deep dive workshop

2. Seamless Surveillance with Juniper networks.pdf

JomaSoft VDCF - Solaris Private Cloud

More from Olga Lavrentieva

15 10-22 altoros-fact_sheet_st_v4

Сергей Ковалёв (Altoros): Practical Steps to Improve Apache Hive Performance

Андрей Козлов (Altoros): Оптимизация производительности Cassandra

Владимир Иванов (Oracle): Java: прошлое и будущее

Владимир Иванов: Software Engineer / Principal Member of Technical Staff в Oracle; г.Санкт-Петербург Ведущий инженер Oracle, работает в группе разработки виртуальной Java-машиныHotSpot. Специализируется на JIT-компиляции и поддержке альтернативных языков на платформе Java. Доклад: «Java: прошлое и будущее».

Brug - Web push notification

Александр Ломов: "Reactjs + Haskell + Cloud Foundry = Love"

Максим Жилинский: "Контейнеры: под капотом"

Александр Протасеня: "PayPal. Различные способы интеграции"

Сергей Черничков: "Интеграция платежных систем в .Net приложения"

Сергей Черничков (.Net Developer в Altoros): "Интеграция платежных систем в .Net приложения" - Выбор платежной системы (Payment Gateway) - Обзор типовых решений интеграции платежных систем - Рекомендации по разработке, тестированию интеграции платежной системы.

Антон Шемерей «Single responsibility principle в руби или почему instanceclas...

Антон Шемерей (Senior Developer в Sphere Consulting, г.Минск) Доклад: «Single Responsibility Principle в Руби или почему instance/class variables это ОЧЕНЬ плохо» Всем приходится работать с унаследованным кодом и часами тратить время на поиск устранения ошибок, которых в большинстве случаев можно было бы легко избежать. Одним из краеугольных камней является нарушение принципа единственной ответственности. В докладе пойдет речь о том, как провести анализ кода, как его можно исправить и как избегать таких ошибок в будущем.

Егор Воробьёв: «Ruby internals»

Егор Воробьёв (Web Developer в Datarockets) Доклад: «Ruby internals» Юкихиро Мацумото и его команда потратили уйму времени, чтобы реализовать те вещи, которыми мы пользуемся каждый день. В своем докладе Егор расскажет, что скрывается за обычными строчками, которые каждый из нас использует, и объяснит, почему важно знать то, что находятся по ту сторону экрана.

Андрей Колешко «Что не так с Rails»

Андрей Колешко (Team Lead проекта Mezuka) Доклад: «Что не так с Rails?» Андрей расскажет, как и почему он и его команда решили отказаться от многих возможностей Rails и чем их заменили на своем проекте. В целом рассказ Андрея - это рассуждение о том, к чему приводит неправильное использование Rails, почему Rails не годится для всех Web-проектов в том виде, в котором представляет его сообщество разработчиков, авторы книг и best practices.

Дмитрий Савицкий «Ruby Anti Magic Shield»

Дмитрий Савицкий (Senior Software Engineer в Altoros) Доклад: «Ruby Anti-Magic Shield» Не упустите шанс попасть на сеанс практической магии с разоблачением от Дмитрия Савицкого. Способов помешать кому-то, кто пытается повлиять на ваш код со злым умыслом или по незнанию, не так уж и много. Дмитрий расскажет о тех немногочисленных возможностях, которые позволяют избежать запутанной и опасной "метамагии" в приложениях. Будет магически интересно.

Сергей Алексеев «Парное программирование. Удаленно»

Сергей Алексеев (Ruby Developer в Pinshape) Доклад: «Парное программирование. Удаленно» «Устали объяснять как это работает? Парное программирование – вместо тысячи слов. Потратили полдня на решение задачи и безрезультатно? Не тормозите – программируйте с напарником. Следуете трендам, следите за тенденциями – новое поколение выбирает парное программирование. Когда программировать одному уже не ice... Просто добавьте напарника. Несколько полезных инструментов и техник – мы отбираем только самое лучшее. Вы еще программируете в одиночку? Тогда мы идем к вам!»

«Почему Spark отнюдь не так хорош»

Алексей Дёмин (Java Developer в InData Labs) Доклад: «Почему Spark отнюдь не так хорош» О чём: Сейчас по всем каналам идёт обсуждение новой революционной технологии обработки данных Spark. Алексей предлагает взглянуть чуть глубже и узнать, действительно ли Spark так хорош, как нам рассказывает об этом маркетинг.

«Cassandra data modeling – моделирование данных для NoSQL СУБД Cassandra»

«Практика построения высокодоступного решения на базе Cloud Foundry Paas»

Сергей Сверчков (Solution Architect в Altoros) Доклад: «Практика построения высокодоступного решения на базе Cloud Foundry PaaS ». О чём: В докладе Сергей продемонстрирует архитектуру решения, базирующуюся на OpenStack, Cassandra и Cloud Foundry (PaaS), расскажет об интересных особенностях Cloud Foundry. Он также опишет опыт в области обработки данных с медицинских приборов, опыт разработки решения с высокими требованиями по доступности, безопасности в этой области. В своей презентации Сергей раскроет нюансы работы над различными уровнями решения и их интеграцией.

«Дизайн продвинутых нереляционных схем для Big Data»

Виктор Смирнов (Java Tech Lead в Klika Technologies) Доклад: «Дизайн продвинутых нереляционных схем для Big Data» О чём: Виктор познакомит всех с примерами продвинутых нереляционных схем данных и тем, как они могут использоваться для решения задач, связанных с хранением и обработкой больших данных.

«Обзор возможностей Open cv»Olga Lavrentieva

«Нужно больше шин! Eventbus based framework vertx.io»