SlideShare a Scribd company logo
1 of 35
Download to read offline
NoSQL Tel Aviv Meetup #1:
Polyglot Persistance
Arthur Gimpel
arthurgi@datazone.io
Wifi: zx
Password: n0tWireless
Welcome
`
Arthur Gimpell
‹ ›3
SaaS and more…
Arthur Gimpell
‹ ›4
SaaS and more…
Arthur Gimpell
‹ ›5
SaaS and more…
BaaS
Arthur Gimpell
‹ ›6
NoSQL Tel Aviv: Meetup Agenda
E
OBJECTIVE COMPARISONS
q
NETWORKING
p
KNOWLEGE SHARING
Arthur Gimpell
‹ ›7
About Me
•Working with databases for 8 years
•5 years, SQL Server & .NET
•3 years with NoSQL & Python & Node.js
•2015 - Founded DataZone
Arthur Gimpell
‹ ›8
DataZone | Data is our business! What’s yours?
•Consultancy & projects
•Private & public training
•Multi vendor, multi tier support with SLA
•Child unit of CloudZone, public cloud leaders
Use case
o
Arthur Gimpell
‹ ›10
uBar: Toolbar Company
•uBar’s toolbar provides a search engine and various utilities on the toolbar itself
•uBar’s revenue streams:
•Ads, provided on uBar’s search engine
•Bundled downloads with partners
•Selling user data & statistics, gathered by the toolbar user’s usage analysis
Arthur Gimpell
‹ ›11
uBar: Architecture
MSSQL
Sessions Toolbar Usage Analytics
•uBar’s solution is built on SOA:
•Sessions: Session & users mgmt. service
•Toolbar Usage: user statistics gathering
•Analytics: Near realtime BI
Arthur Gimpell
‹ ›12
uBar: Sessions Service - Features
MSSQL
Sessions
•Sessions are created when a client opens a
browser
•Sessions are ended when client closes browser,
or no activity is made during some specific time
•Users are mainly marketing, campaign
managers, media buyers and more. Those users
consume data from the Analytics service
Arthur Gimpell
‹ ›13
uBar: Sessions Service - Main Objects
MSSQL
Sessions
•Session: SessionId, ToolbarClientId,
UserId, UserAgent, StartTime
•User: UserId, UserPermissions,
Username, PasswordHash
•UserPermissions: UserId, PermissionId
•Permissions: PermissionId, Name
Arthur Gimpell
‹ ›14
uBar: Toolbar Usage Service - Features
MSSQL
•Every time an event occurs, like opening
a browser by a client, or browsing the
internet, the usage service saves data
about this event in the relevant table.
•ToolbarUsage writes ± 50M events per
day
Toolbar Usage
Arthur Gimpell
‹ ›15
uBar: Toolbar Usage Service - Main Objects
MSSQL
•ToolbarStart: ToolbarClientId, StartTime, [User data columns]
•NewTab: ToolbarClientId, NewTabUrl, [User data columns]
•ToolbarClicks: ToolbarClientId, ToolbarFeatureId, [User data
columns]
•WebsiteVisit: ToolbarClientId, WebsiteUrl, [User data
columns]
•ToolbarClients: ToolbarClientId, ToolbarVersion,
BundledVersion, BundleId
Toolbar Usage
Arthur Gimpell
‹ ›16
uBar: Analytics Service - Features
MSSQL
•Analytics service is providing Users with dashboards filled
with data.
•The data is pre aggregated every 1 hour in the database,
and saved to different tables
•The analytics service provides the most important KPI when
releasing campaigns to millions of users, and according to
its data operative decisions are made(stopping bad
campaigns, detecting bugs, ab testing etc..)
Analytics
Arthur Gimpell
‹ ›17
uBar: Challenges
•Velocity: 10k writes/sec on Usage service, 1k writes/
sec on Sessions service
•Volume: 1TB of operational data(1 month retention)
•New clients increase the velocity, and IO subsystem is
a bottleneck
•Campaign managers want more and more insights in
realtime, which require writing complex aggregation
jobs on the database and use CPU intensively.
RDBMS
Sessions Toolbar Usage Analytics
Arthur Gimpell
‹ ›18
Issues with Relational Database Management Systems in the IoT Age
•Everything is persisted, synchronously. Limited by IO
performance.
•All data is bound to a tabular schema, hard to make
changes in big databases.
•All data relies on a single data store, making it hard to
scale horizontally.
•Complex schema slows down aggregations and
queries drastically.
RDBMS
Sessions Toolbar Usage Analytics
Arthur Gimpell
‹ ›19
Polyglot Persistance: Overview
Key Value
Suitable for key value access patterns.
Main benefits are concurrency on key level
(Optimistic & Pessimistic), and extremely
easy scaling.
Document Store
Data which is more suitable
for OOP languages, storing
complex data (JSON) while
allowing scaling and
distribution.
Search / Index stores
Every data store serves a
different component of
the application,
according to its access
patterns and needs.
Consept
Suitable for cases where the main data
store cannot handle complex querying,
Allows scaling the querying layers
separately from operational data access
(CUD in CRUD).
Arthur Gimpell
‹ ›20
uBar: New Data Solution’s Targets
New Data
Solution
Handle the traffic, Velocity and Volume should not limit the product
Allow more realtime analytics, and more
complex slice & dice for the product
Use open source where possible,
Reduce costs.
Evaluation
E
Arthur Gimpell
‹ ›22
uBar: Analysing Sessions schema analysis & access patterns
Sessions
•Sessions are written with a UUID(SessionId),
and not sorted in any way in the table (Heap).
• Values:
•ToolbarClientId (Foreign key to ToolbarClient)
•UserId (Foreign key to User)
•UserAgent (Unstructured string)
•StartTime (DateTime)
?
Arthur Gimpell
‹ ›23
uBar: Analysing Sessions schema analysis & access patterns
Sessions
•Users and Permission tables are quite
simple and its own values with Many to
Many relation table (UsersPermissions) ?
Arthur Gimpell
‹ ›24
uBar: Analysing Sessions schema analysis & access patterns
Sessions
•Sessions writing Velocity is 1k/sec. IO is a
bottleneck.
•Sessions are written in Key Value pattern
•Users and Permissions are not
problematic, since those are cached in
the application and rarely change.
?
Arthur Gimpell
‹ ›25
uBar: Possible data stores for Sessions service
Sessions
•Candidate technologies with needed
throughput, complex data support, and
needed velocity: Redis, Couchbase,
Marklogic
?
Arthur Gimpell
‹ ›26
uBar: Analysing Toolbar Usage schema analysis & access patterns
•Toolbar Usage tables are not normalized in SQL
Server, and written as raw data.
•Usage write pattern is key value, where value is large
(30kb) and unstructured(User agent).
•Velocity in writes is 10k/sec,
•Toolbar Usage data is also time series data. The tables
have a clustered TimeStamp column(and partitioned
by it), for easier Analytics and aggregation.
?
Toolbar UsageSessions
Redis?
Couchbase?
Marklogic?
Arthur Gimpell
‹ ›27
uBar: Possible data stores for Toolbar Usage service
•Again, needed write pattern is Key Value.
•Data sizing, and needed throughput fits
Redis, Couchbase, Marklogic the same
way.
•Sessions and ToolbarUsage both can rely
(potentially) on the same data store.
Toolbar UsageSessions
Redis?
Couchbase?
Marklogic?
Redis?
Couchbase?
Marklogic?
Arthur Gimpell
‹ ›28
uBar: Analysing Toolbar Usage schema analysis & access patterns
•Analytics service’s schema is based on aggregated
data of ToolbarUsage & Sessions services.
•Development should be simple, in order to allow
maximal elasticity for product and analysts.
•Analysts should be able to query the data / ad hoc
•Data refresh should be less than 15 minutes
Toolbar UsageSessions
Redis?
Couchbase?
Marklogic?
Redis?
Couchbase?
Marklogic?
Analytics
?
Arthur Gimpell
‹ ›29
uBar: Possible data stores for Sessions service
•Possible services for analytics divide to
various groups:
•Classic BI solutions: Tableu, Qlikview,
Pantahoo
•Column Store DBMS: Redshift, Vertica..
•Pure search engine: Elasticsearch, Solr..
Toolbar UsageSessions
Redis?
Couchbase?
Marklogic?
Redis?
Couchbase?
Marklogic?
Analytics
BI Tools
ColumnStore
Search Engine
Arthur Gimpell
‹ ›30
uBar: Putting it all together - Operational Needs
Toolbar UsageSessions
Redis?
Couchbase?
Marklogic?
Redis?
Couchbase?
Marklogic?
AnalyticsVelocity Volume Price
Couchbase V V Low - Mid
Redis V V Low - Mid
Marklogic V V High
BI Tools
ColumnStore
Search Engine
Arthur Gimpell
‹ ›31
uBar: Putting it all together - Operational Needs
Toolbar UsageSessions
Redis?
Couchbase?
AnalyticsSupport Integration Final Notes
Couchbase
Vendor
Support - SLA
Elasticsearch -
XDCR
SQL Compatible -
JDBC ODBC
Rich
integrations,
High quality
Support
Redis
Redis Labs -
Managed
Plugin for Solr
Managed - no
maintenance
BI Tools
ColumnStore
Search Engine
Redis?
Couchbase?
Arthur Gimpell
‹ ›32
uBar: Putting it all together - Analytical Needs
Toolbar UsageSessions
Redis?
Couchbase?
AnalyticsPossibilities Pros Cons
BI Solutions
Tableu
Pentahoo
Qlikview
Simple for
business users,
Integrates with
Couchbase
Might get
expensive
Search Engines
Elasticsearch
Solr
Highly
customizable
Querying is
not straight
forward
BI Tools
Search Engine
Redis?
Couchbase?
Arthur Gimpell
‹ ›33
uBar: Final Architecture #1
Toolbar UsageSessions
Managed
Redis
Analytics
Elasticsearch
Managed
Redis
•Redis is managed. No maintenance at all
for operational and scalable cluster.
•Using Elasticsearch with Kibana is great
for time series data
•Data transformation will be made through
ETL.
Arthur Gimpell
‹ ›34
uBar: Final Architecture #2
Toolbar UsageSessions
Couchbase
Analytics
BI Tools +
Elasticsearch
Couchbase
•Couchbase is easy to use.
•With Couchbase’s SQL on JSONs (N1QL) It is 0
configuration to make it a data source for every
possible BI solution
•Couchbase’s Filtered replication to
Elasticsearch allows it to function only where
SQL is not enough.
So,
What do
you choose?

More Related Content

What's hot

Migration and Coexistence between Relational and NoSQL Databases by Manuel H...
 Migration and Coexistence between Relational and NoSQL Databases by Manuel H... Migration and Coexistence between Relational and NoSQL Databases by Manuel H...
Migration and Coexistence between Relational and NoSQL Databases by Manuel H...Big Data Spain
 
MongoDB for Time Series Data: Schema Design
MongoDB for Time Series Data: Schema DesignMongoDB for Time Series Data: Schema Design
MongoDB for Time Series Data: Schema DesignMongoDB
 
Real-time analytics with Druid at Appsflyer
Real-time analytics with Druid at AppsflyerReal-time analytics with Druid at Appsflyer
Real-time analytics with Druid at AppsflyerMichael Spector
 
Programmatic Bidding Data Streams & Druid
Programmatic Bidding Data Streams & DruidProgrammatic Bidding Data Streams & Druid
Programmatic Bidding Data Streams & DruidCharles Allen
 
Implementing a canonical IoT backend in Azure with Azure Stream Analytics
Implementing a canonical IoT backend in Azure with Azure Stream AnalyticsImplementing a canonical IoT backend in Azure with Azure Stream Analytics
Implementing a canonical IoT backend in Azure with Azure Stream AnalyticsMarco Parenzan
 
Cassandra Day SV 2014: Scaling Hulu’s Video Progress Tracking Service with Ap...
Cassandra Day SV 2014: Scaling Hulu’s Video Progress Tracking Service with Ap...Cassandra Day SV 2014: Scaling Hulu’s Video Progress Tracking Service with Ap...
Cassandra Day SV 2014: Scaling Hulu’s Video Progress Tracking Service with Ap...DataStax Academy
 
A Zen Journey to Database Management
A Zen Journey to Database ManagementA Zen Journey to Database Management
A Zen Journey to Database ManagementBasho Technologies
 
Can My Inventory Survive Eventual Consistency?
Can My Inventory Survive Eventual Consistency?Can My Inventory Survive Eventual Consistency?
Can My Inventory Survive Eventual Consistency?DataStax
 
Sarine's Big Data Journey by Rostislav Aaronov
Sarine's Big Data Journey by Rostislav AaronovSarine's Big Data Journey by Rostislav Aaronov
Sarine's Big Data Journey by Rostislav AaronovIdan Tohami
 
AWS Big Data Demystified #1.2 | Big Data architecture lessons learned
AWS Big Data Demystified #1.2 | Big Data architecture lessons learned AWS Big Data Demystified #1.2 | Big Data architecture lessons learned
AWS Big Data Demystified #1.2 | Big Data architecture lessons learned Omid Vahdaty
 
James Corcoran, Head of Engineering EMEA, First Derivatives, "Simplifying Bi...
James Corcoran, Head of Engineering EMEA, First Derivatives,  "Simplifying Bi...James Corcoran, Head of Engineering EMEA, First Derivatives,  "Simplifying Bi...
James Corcoran, Head of Engineering EMEA, First Derivatives, "Simplifying Bi...Dataconomy Media
 
Elastic Stack roadmap deep dive
Elastic Stack roadmap deep diveElastic Stack roadmap deep dive
Elastic Stack roadmap deep diveElasticsearch
 
Azure Industrial Iot Edge
Azure Industrial Iot EdgeAzure Industrial Iot Edge
Azure Industrial Iot EdgeRiccardo Zamana
 
MongoDB and the Internet of Things
MongoDB and the Internet of ThingsMongoDB and the Internet of Things
MongoDB and the Internet of ThingsMongoDB
 
WSO2 Product Release Webinar: WSO2 Data Analytics Server 3.0
WSO2 Product Release Webinar: WSO2 Data Analytics Server 3.0WSO2 Product Release Webinar: WSO2 Data Analytics Server 3.0
WSO2 Product Release Webinar: WSO2 Data Analytics Server 3.0WSO2
 
Managing your Black Friday Logs - Antonio Bonuccelli - Codemotion Rome 2018
Managing your Black Friday Logs - Antonio Bonuccelli - Codemotion Rome 2018Managing your Black Friday Logs - Antonio Bonuccelli - Codemotion Rome 2018
Managing your Black Friday Logs - Antonio Bonuccelli - Codemotion Rome 2018Codemotion
 
Azure Big Data Story
Azure Big Data StoryAzure Big Data Story
Azure Big Data StoryLynn Langit
 

What's hot (20)

Migration and Coexistence between Relational and NoSQL Databases by Manuel H...
 Migration and Coexistence between Relational and NoSQL Databases by Manuel H... Migration and Coexistence between Relational and NoSQL Databases by Manuel H...
Migration and Coexistence between Relational and NoSQL Databases by Manuel H...
 
MongoDB for Time Series Data: Schema Design
MongoDB for Time Series Data: Schema DesignMongoDB for Time Series Data: Schema Design
MongoDB for Time Series Data: Schema Design
 
Real-time analytics with Druid at Appsflyer
Real-time analytics with Druid at AppsflyerReal-time analytics with Druid at Appsflyer
Real-time analytics with Druid at Appsflyer
 
Programmatic Bidding Data Streams & Druid
Programmatic Bidding Data Streams & DruidProgrammatic Bidding Data Streams & Druid
Programmatic Bidding Data Streams & Druid
 
Implementing a canonical IoT backend in Azure with Azure Stream Analytics
Implementing a canonical IoT backend in Azure with Azure Stream AnalyticsImplementing a canonical IoT backend in Azure with Azure Stream Analytics
Implementing a canonical IoT backend in Azure with Azure Stream Analytics
 
Cassandra Day SV 2014: Scaling Hulu’s Video Progress Tracking Service with Ap...
Cassandra Day SV 2014: Scaling Hulu’s Video Progress Tracking Service with Ap...Cassandra Day SV 2014: Scaling Hulu’s Video Progress Tracking Service with Ap...
Cassandra Day SV 2014: Scaling Hulu’s Video Progress Tracking Service with Ap...
 
hotdog a TD tool for DD
hotdog a TD tool for DDhotdog a TD tool for DD
hotdog a TD tool for DD
 
A Zen Journey to Database Management
A Zen Journey to Database ManagementA Zen Journey to Database Management
A Zen Journey to Database Management
 
Can My Inventory Survive Eventual Consistency?
Can My Inventory Survive Eventual Consistency?Can My Inventory Survive Eventual Consistency?
Can My Inventory Survive Eventual Consistency?
 
Sarine's Big Data Journey by Rostislav Aaronov
Sarine's Big Data Journey by Rostislav AaronovSarine's Big Data Journey by Rostislav Aaronov
Sarine's Big Data Journey by Rostislav Aaronov
 
Google Cloud Spanner Preview
Google Cloud Spanner PreviewGoogle Cloud Spanner Preview
Google Cloud Spanner Preview
 
AWS Big Data Demystified #1.2 | Big Data architecture lessons learned
AWS Big Data Demystified #1.2 | Big Data architecture lessons learned AWS Big Data Demystified #1.2 | Big Data architecture lessons learned
AWS Big Data Demystified #1.2 | Big Data architecture lessons learned
 
James Corcoran, Head of Engineering EMEA, First Derivatives, "Simplifying Bi...
James Corcoran, Head of Engineering EMEA, First Derivatives,  "Simplifying Bi...James Corcoran, Head of Engineering EMEA, First Derivatives,  "Simplifying Bi...
James Corcoran, Head of Engineering EMEA, First Derivatives, "Simplifying Bi...
 
Elastic Stack roadmap deep dive
Elastic Stack roadmap deep diveElastic Stack roadmap deep dive
Elastic Stack roadmap deep dive
 
Big data in Azure
Big data in AzureBig data in Azure
Big data in Azure
 
Azure Industrial Iot Edge
Azure Industrial Iot EdgeAzure Industrial Iot Edge
Azure Industrial Iot Edge
 
MongoDB and the Internet of Things
MongoDB and the Internet of ThingsMongoDB and the Internet of Things
MongoDB and the Internet of Things
 
WSO2 Product Release Webinar: WSO2 Data Analytics Server 3.0
WSO2 Product Release Webinar: WSO2 Data Analytics Server 3.0WSO2 Product Release Webinar: WSO2 Data Analytics Server 3.0
WSO2 Product Release Webinar: WSO2 Data Analytics Server 3.0
 
Managing your Black Friday Logs - Antonio Bonuccelli - Codemotion Rome 2018
Managing your Black Friday Logs - Antonio Bonuccelli - Codemotion Rome 2018Managing your Black Friday Logs - Antonio Bonuccelli - Codemotion Rome 2018
Managing your Black Friday Logs - Antonio Bonuccelli - Codemotion Rome 2018
 
Azure Big Data Story
Azure Big Data StoryAzure Big Data Story
Azure Big Data Story
 

Similar to NoSQL Tel Aviv Meetup#1: Introduction to Polyglot Persistance

Benchmark Showdown: Which Relational Database is the Fastest on AWS?
Benchmark Showdown: Which Relational Database is the Fastest on AWS?Benchmark Showdown: Which Relational Database is the Fastest on AWS?
Benchmark Showdown: Which Relational Database is the Fastest on AWS?Clustrix
 
Azure Stream Analytics : Analyse Data in Motion
Azure Stream Analytics  : Analyse Data in MotionAzure Stream Analytics  : Analyse Data in Motion
Azure Stream Analytics : Analyse Data in MotionRuhani Arora
 
Getting to 1.5M Ads/sec: How DataXu manages Big Data
Getting to 1.5M Ads/sec: How DataXu manages Big DataGetting to 1.5M Ads/sec: How DataXu manages Big Data
Getting to 1.5M Ads/sec: How DataXu manages Big DataQubole
 
Extracting Insights from Data at Twitter
Extracting Insights from Data at TwitterExtracting Insights from Data at Twitter
Extracting Insights from Data at TwitterPrasad Wagle
 
Webinar: Best Practices for Getting Started with MongoDB
Webinar: Best Practices for Getting Started with MongoDBWebinar: Best Practices for Getting Started with MongoDB
Webinar: Best Practices for Getting Started with MongoDBMongoDB
 
MongoDB Best Practices
MongoDB Best PracticesMongoDB Best Practices
MongoDB Best PracticesLewis Lin 🦊
 
"Introduction to Kx Technology", James Corcoran, Head of Engineering EMEA at ...
"Introduction to Kx Technology", James Corcoran, Head of Engineering EMEA at ..."Introduction to Kx Technology", James Corcoran, Head of Engineering EMEA at ...
"Introduction to Kx Technology", James Corcoran, Head of Engineering EMEA at ...Dataconomy Media
 
Using ClickHouse for Experimentation
Using ClickHouse for ExperimentationUsing ClickHouse for Experimentation
Using ClickHouse for ExperimentationGleb Kanterov
 
Building Scalable Aggregation Systems
Building Scalable Aggregation SystemsBuilding Scalable Aggregation Systems
Building Scalable Aggregation SystemsJared Winick
 
"An introduction to Kx Technology - a Big Data solution", Kyra Coyne, Data Sc...
"An introduction to Kx Technology - a Big Data solution", Kyra Coyne, Data Sc..."An introduction to Kx Technology - a Big Data solution", Kyra Coyne, Data Sc...
"An introduction to Kx Technology - a Big Data solution", Kyra Coyne, Data Sc...Maya Lumbroso
 
"An introduction to Kx Technology - a Big Data solution", Kyra Coyne, Data Sc...
"An introduction to Kx Technology - a Big Data solution", Kyra Coyne, Data Sc..."An introduction to Kx Technology - a Big Data solution", Kyra Coyne, Data Sc...
"An introduction to Kx Technology - a Big Data solution", Kyra Coyne, Data Sc...Dataconomy Media
 
Solr Under the Hood at S&P Global- Sumit Vadhera, S&P Global
Solr Under the Hood at S&P Global- Sumit Vadhera, S&P Global Solr Under the Hood at S&P Global- Sumit Vadhera, S&P Global
Solr Under the Hood at S&P Global- Sumit Vadhera, S&P Global Lucidworks
 
Microsoft Azure Big Data Analytics
Microsoft Azure Big Data AnalyticsMicrosoft Azure Big Data Analytics
Microsoft Azure Big Data AnalyticsMark Kromer
 
Tiago Fonseca & Rui Velho - Syone & Leroy Merlin - OSL19
Tiago Fonseca & Rui Velho - Syone & Leroy Merlin - OSL19Tiago Fonseca & Rui Velho - Syone & Leroy Merlin - OSL19
Tiago Fonseca & Rui Velho - Syone & Leroy Merlin - OSL19marketingsyone
 
Deploy Microsoft Azure Data Solutions
Deploy Microsoft Azure Data SolutionsDeploy Microsoft Azure Data Solutions
Deploy Microsoft Azure Data SolutionsMarco Parenzan
 
Apache IOTDB: a Time Series Database for Industrial IoT
Apache IOTDB: a Time Series Database for Industrial IoTApache IOTDB: a Time Series Database for Industrial IoT
Apache IOTDB: a Time Series Database for Industrial IoTjixuan1989
 
Building an Enterprise-Scale Dashboarding/Analytics Platform Powered by the C...
Building an Enterprise-Scale Dashboarding/Analytics Platform Powered by the C...Building an Enterprise-Scale Dashboarding/Analytics Platform Powered by the C...
Building an Enterprise-Scale Dashboarding/Analytics Platform Powered by the C...Imply
 
PCM18 (Big Data Analytics)
PCM18 (Big Data Analytics)PCM18 (Big Data Analytics)
PCM18 (Big Data Analytics)Stratebi
 
Webinar: SQL for Machine Data?
Webinar: SQL for Machine Data?Webinar: SQL for Machine Data?
Webinar: SQL for Machine Data?Crate.io
 

Similar to NoSQL Tel Aviv Meetup#1: Introduction to Polyglot Persistance (20)

Benchmark Showdown: Which Relational Database is the Fastest on AWS?
Benchmark Showdown: Which Relational Database is the Fastest on AWS?Benchmark Showdown: Which Relational Database is the Fastest on AWS?
Benchmark Showdown: Which Relational Database is the Fastest on AWS?
 
Azure Stream Analytics : Analyse Data in Motion
Azure Stream Analytics  : Analyse Data in MotionAzure Stream Analytics  : Analyse Data in Motion
Azure Stream Analytics : Analyse Data in Motion
 
Getting to 1.5M Ads/sec: How DataXu manages Big Data
Getting to 1.5M Ads/sec: How DataXu manages Big DataGetting to 1.5M Ads/sec: How DataXu manages Big Data
Getting to 1.5M Ads/sec: How DataXu manages Big Data
 
Extracting Insights from Data at Twitter
Extracting Insights from Data at TwitterExtracting Insights from Data at Twitter
Extracting Insights from Data at Twitter
 
Webinar: Best Practices for Getting Started with MongoDB
Webinar: Best Practices for Getting Started with MongoDBWebinar: Best Practices for Getting Started with MongoDB
Webinar: Best Practices for Getting Started with MongoDB
 
MongoDB Best Practices
MongoDB Best PracticesMongoDB Best Practices
MongoDB Best Practices
 
"Introduction to Kx Technology", James Corcoran, Head of Engineering EMEA at ...
"Introduction to Kx Technology", James Corcoran, Head of Engineering EMEA at ..."Introduction to Kx Technology", James Corcoran, Head of Engineering EMEA at ...
"Introduction to Kx Technology", James Corcoran, Head of Engineering EMEA at ...
 
Using ClickHouse for Experimentation
Using ClickHouse for ExperimentationUsing ClickHouse for Experimentation
Using ClickHouse for Experimentation
 
Building Scalable Aggregation Systems
Building Scalable Aggregation SystemsBuilding Scalable Aggregation Systems
Building Scalable Aggregation Systems
 
"An introduction to Kx Technology - a Big Data solution", Kyra Coyne, Data Sc...
"An introduction to Kx Technology - a Big Data solution", Kyra Coyne, Data Sc..."An introduction to Kx Technology - a Big Data solution", Kyra Coyne, Data Sc...
"An introduction to Kx Technology - a Big Data solution", Kyra Coyne, Data Sc...
 
"An introduction to Kx Technology - a Big Data solution", Kyra Coyne, Data Sc...
"An introduction to Kx Technology - a Big Data solution", Kyra Coyne, Data Sc..."An introduction to Kx Technology - a Big Data solution", Kyra Coyne, Data Sc...
"An introduction to Kx Technology - a Big Data solution", Kyra Coyne, Data Sc...
 
Solr Under the Hood at S&P Global- Sumit Vadhera, S&P Global
Solr Under the Hood at S&P Global- Sumit Vadhera, S&P Global Solr Under the Hood at S&P Global- Sumit Vadhera, S&P Global
Solr Under the Hood at S&P Global- Sumit Vadhera, S&P Global
 
Microsoft Azure Big Data Analytics
Microsoft Azure Big Data AnalyticsMicrosoft Azure Big Data Analytics
Microsoft Azure Big Data Analytics
 
Tiago Fonseca & Rui Velho - Syone & Leroy Merlin - OSL19
Tiago Fonseca & Rui Velho - Syone & Leroy Merlin - OSL19Tiago Fonseca & Rui Velho - Syone & Leroy Merlin - OSL19
Tiago Fonseca & Rui Velho - Syone & Leroy Merlin - OSL19
 
Deploy Microsoft Azure Data Solutions
Deploy Microsoft Azure Data SolutionsDeploy Microsoft Azure Data Solutions
Deploy Microsoft Azure Data Solutions
 
Apache IOTDB: a Time Series Database for Industrial IoT
Apache IOTDB: a Time Series Database for Industrial IoTApache IOTDB: a Time Series Database for Industrial IoT
Apache IOTDB: a Time Series Database for Industrial IoT
 
Building an Enterprise-Scale Dashboarding/Analytics Platform Powered by the C...
Building an Enterprise-Scale Dashboarding/Analytics Platform Powered by the C...Building an Enterprise-Scale Dashboarding/Analytics Platform Powered by the C...
Building an Enterprise-Scale Dashboarding/Analytics Platform Powered by the C...
 
PCM18 (Big Data Analytics)
PCM18 (Big Data Analytics)PCM18 (Big Data Analytics)
PCM18 (Big Data Analytics)
 
Analytics&IoT
Analytics&IoTAnalytics&IoT
Analytics&IoT
 
Webinar: SQL for Machine Data?
Webinar: SQL for Machine Data?Webinar: SQL for Machine Data?
Webinar: SQL for Machine Data?
 

Recently uploaded

哪里办理美国迈阿密大学毕业证(本硕)umiami在读证明存档可查
哪里办理美国迈阿密大学毕业证(本硕)umiami在读证明存档可查哪里办理美国迈阿密大学毕业证(本硕)umiami在读证明存档可查
哪里办理美国迈阿密大学毕业证(本硕)umiami在读证明存档可查ydyuyu
 
20240509 QFM015 Engineering Leadership Reading List April 2024.pdf
20240509 QFM015 Engineering Leadership Reading List April 2024.pdf20240509 QFM015 Engineering Leadership Reading List April 2024.pdf
20240509 QFM015 Engineering Leadership Reading List April 2024.pdfMatthew Sinclair
 
Research Assignment - NIST SP800 [172 A] - Presentation.pptx
Research Assignment - NIST SP800 [172 A] - Presentation.pptxResearch Assignment - NIST SP800 [172 A] - Presentation.pptx
Research Assignment - NIST SP800 [172 A] - Presentation.pptxi191686
 
Meaning of On page SEO & its process in detail.
Meaning of On page SEO & its process in detail.Meaning of On page SEO & its process in detail.
Meaning of On page SEO & its process in detail.krishnachandrapal52
 
Independent Escorts & Call Girls In Aerocity Delhi - 9758998899 - Escortgram ...
Independent Escorts & Call Girls In Aerocity Delhi - 9758998899 - Escortgram ...Independent Escorts & Call Girls In Aerocity Delhi - 9758998899 - Escortgram ...
Independent Escorts & Call Girls In Aerocity Delhi - 9758998899 - Escortgram ...Escortgram India
 
Nagercoil Escorts Service Girl ^ 9332606886, WhatsApp Anytime Nagercoil
Nagercoil Escorts Service Girl ^ 9332606886, WhatsApp Anytime NagercoilNagercoil Escorts Service Girl ^ 9332606886, WhatsApp Anytime Nagercoil
Nagercoil Escorts Service Girl ^ 9332606886, WhatsApp Anytime Nagercoilmeghakumariji156
 
APNIC Updates presented by Paul Wilson at ARIN 53
APNIC Updates presented by Paul Wilson at ARIN 53APNIC Updates presented by Paul Wilson at ARIN 53
APNIC Updates presented by Paul Wilson at ARIN 53APNIC
 
20240507 QFM013 Machine Intelligence Reading List April 2024.pdf
20240507 QFM013 Machine Intelligence Reading List April 2024.pdf20240507 QFM013 Machine Intelligence Reading List April 2024.pdf
20240507 QFM013 Machine Intelligence Reading List April 2024.pdfMatthew Sinclair
 
一比一原版(Curtin毕业证书)科廷大学毕业证原件一模一样
一比一原版(Curtin毕业证书)科廷大学毕业证原件一模一样一比一原版(Curtin毕业证书)科廷大学毕业证原件一模一样
一比一原版(Curtin毕业证书)科廷大学毕业证原件一模一样ayvbos
 
20240508 QFM014 Elixir Reading List April 2024.pdf
20240508 QFM014 Elixir Reading List April 2024.pdf20240508 QFM014 Elixir Reading List April 2024.pdf
20240508 QFM014 Elixir Reading List April 2024.pdfMatthew Sinclair
 
Indian Escort in Abu DHabi 0508644382 Abu Dhabi Escorts
Indian Escort in Abu DHabi 0508644382 Abu Dhabi EscortsIndian Escort in Abu DHabi 0508644382 Abu Dhabi Escorts
Indian Escort in Abu DHabi 0508644382 Abu Dhabi EscortsMonica Sydney
 
一比一原版奥兹学院毕业证如何办理
一比一原版奥兹学院毕业证如何办理一比一原版奥兹学院毕业证如何办理
一比一原版奥兹学院毕业证如何办理F
 
[Hackersuli] Élő szövet a fémvázon: Python és gépi tanulás a Zeek platformon
[Hackersuli] Élő szövet a fémvázon: Python és gépi tanulás a Zeek platformon[Hackersuli] Élő szövet a fémvázon: Python és gépi tanulás a Zeek platformon
[Hackersuli] Élő szövet a fémvázon: Python és gépi tanulás a Zeek platformonhackersuli
 
South Bopal [ (Call Girls) in Ahmedabad ₹7.5k Pick Up & Drop With Cash Paymen...
South Bopal [ (Call Girls) in Ahmedabad ₹7.5k Pick Up & Drop With Cash Paymen...South Bopal [ (Call Girls) in Ahmedabad ₹7.5k Pick Up & Drop With Cash Paymen...
South Bopal [ (Call Girls) in Ahmedabad ₹7.5k Pick Up & Drop With Cash Paymen...gragchanchal546
 
💚 Call Girls Bahraich 9332606886 High Profile Call Girls You Can Get The S...
💚 Call Girls Bahraich   9332606886  High Profile Call Girls You Can Get The S...💚 Call Girls Bahraich   9332606886  High Profile Call Girls You Can Get The S...
💚 Call Girls Bahraich 9332606886 High Profile Call Girls You Can Get The S...Sareena Khatun
 
一比一原版贝德福特大学毕业证学位证书
一比一原版贝德福特大学毕业证学位证书一比一原版贝德福特大学毕业证学位证书
一比一原版贝德福特大学毕业证学位证书F
 
Delivery in 20 Mins Call Girls Cuttack 9332606886 HOT & SEXY Models beautifu...
Delivery in 20 Mins Call Girls Cuttack  9332606886 HOT & SEXY Models beautifu...Delivery in 20 Mins Call Girls Cuttack  9332606886 HOT & SEXY Models beautifu...
Delivery in 20 Mins Call Girls Cuttack 9332606886 HOT & SEXY Models beautifu...Sareena Khatun
 
pdfcoffee.com_business-ethics-q3m7-pdf-free.pdf
pdfcoffee.com_business-ethics-q3m7-pdf-free.pdfpdfcoffee.com_business-ethics-q3m7-pdf-free.pdf
pdfcoffee.com_business-ethics-q3m7-pdf-free.pdfJOHNBEBONYAP1
 
20240510 QFM016 Irresponsible AI Reading List April 2024.pdf
20240510 QFM016 Irresponsible AI Reading List April 2024.pdf20240510 QFM016 Irresponsible AI Reading List April 2024.pdf
20240510 QFM016 Irresponsible AI Reading List April 2024.pdfMatthew Sinclair
 

Recently uploaded (20)

哪里办理美国迈阿密大学毕业证(本硕)umiami在读证明存档可查
哪里办理美国迈阿密大学毕业证(本硕)umiami在读证明存档可查哪里办理美国迈阿密大学毕业证(本硕)umiami在读证明存档可查
哪里办理美国迈阿密大学毕业证(本硕)umiami在读证明存档可查
 
20240509 QFM015 Engineering Leadership Reading List April 2024.pdf
20240509 QFM015 Engineering Leadership Reading List April 2024.pdf20240509 QFM015 Engineering Leadership Reading List April 2024.pdf
20240509 QFM015 Engineering Leadership Reading List April 2024.pdf
 
Research Assignment - NIST SP800 [172 A] - Presentation.pptx
Research Assignment - NIST SP800 [172 A] - Presentation.pptxResearch Assignment - NIST SP800 [172 A] - Presentation.pptx
Research Assignment - NIST SP800 [172 A] - Presentation.pptx
 
Meaning of On page SEO & its process in detail.
Meaning of On page SEO & its process in detail.Meaning of On page SEO & its process in detail.
Meaning of On page SEO & its process in detail.
 
Independent Escorts & Call Girls In Aerocity Delhi - 9758998899 - Escortgram ...
Independent Escorts & Call Girls In Aerocity Delhi - 9758998899 - Escortgram ...Independent Escorts & Call Girls In Aerocity Delhi - 9758998899 - Escortgram ...
Independent Escorts & Call Girls In Aerocity Delhi - 9758998899 - Escortgram ...
 
Nagercoil Escorts Service Girl ^ 9332606886, WhatsApp Anytime Nagercoil
Nagercoil Escorts Service Girl ^ 9332606886, WhatsApp Anytime NagercoilNagercoil Escorts Service Girl ^ 9332606886, WhatsApp Anytime Nagercoil
Nagercoil Escorts Service Girl ^ 9332606886, WhatsApp Anytime Nagercoil
 
APNIC Updates presented by Paul Wilson at ARIN 53
APNIC Updates presented by Paul Wilson at ARIN 53APNIC Updates presented by Paul Wilson at ARIN 53
APNIC Updates presented by Paul Wilson at ARIN 53
 
20240507 QFM013 Machine Intelligence Reading List April 2024.pdf
20240507 QFM013 Machine Intelligence Reading List April 2024.pdf20240507 QFM013 Machine Intelligence Reading List April 2024.pdf
20240507 QFM013 Machine Intelligence Reading List April 2024.pdf
 
一比一原版(Curtin毕业证书)科廷大学毕业证原件一模一样
一比一原版(Curtin毕业证书)科廷大学毕业证原件一模一样一比一原版(Curtin毕业证书)科廷大学毕业证原件一模一样
一比一原版(Curtin毕业证书)科廷大学毕业证原件一模一样
 
20240508 QFM014 Elixir Reading List April 2024.pdf
20240508 QFM014 Elixir Reading List April 2024.pdf20240508 QFM014 Elixir Reading List April 2024.pdf
20240508 QFM014 Elixir Reading List April 2024.pdf
 
Indian Escort in Abu DHabi 0508644382 Abu Dhabi Escorts
Indian Escort in Abu DHabi 0508644382 Abu Dhabi EscortsIndian Escort in Abu DHabi 0508644382 Abu Dhabi Escorts
Indian Escort in Abu DHabi 0508644382 Abu Dhabi Escorts
 
一比一原版奥兹学院毕业证如何办理
一比一原版奥兹学院毕业证如何办理一比一原版奥兹学院毕业证如何办理
一比一原版奥兹学院毕业证如何办理
 
[Hackersuli] Élő szövet a fémvázon: Python és gépi tanulás a Zeek platformon
[Hackersuli] Élő szövet a fémvázon: Python és gépi tanulás a Zeek platformon[Hackersuli] Élő szövet a fémvázon: Python és gépi tanulás a Zeek platformon
[Hackersuli] Élő szövet a fémvázon: Python és gépi tanulás a Zeek platformon
 
South Bopal [ (Call Girls) in Ahmedabad ₹7.5k Pick Up & Drop With Cash Paymen...
South Bopal [ (Call Girls) in Ahmedabad ₹7.5k Pick Up & Drop With Cash Paymen...South Bopal [ (Call Girls) in Ahmedabad ₹7.5k Pick Up & Drop With Cash Paymen...
South Bopal [ (Call Girls) in Ahmedabad ₹7.5k Pick Up & Drop With Cash Paymen...
 
💚 Call Girls Bahraich 9332606886 High Profile Call Girls You Can Get The S...
💚 Call Girls Bahraich   9332606886  High Profile Call Girls You Can Get The S...💚 Call Girls Bahraich   9332606886  High Profile Call Girls You Can Get The S...
💚 Call Girls Bahraich 9332606886 High Profile Call Girls You Can Get The S...
 
一比一原版贝德福特大学毕业证学位证书
一比一原版贝德福特大学毕业证学位证书一比一原版贝德福特大学毕业证学位证书
一比一原版贝德福特大学毕业证学位证书
 
Delivery in 20 Mins Call Girls Cuttack 9332606886 HOT & SEXY Models beautifu...
Delivery in 20 Mins Call Girls Cuttack  9332606886 HOT & SEXY Models beautifu...Delivery in 20 Mins Call Girls Cuttack  9332606886 HOT & SEXY Models beautifu...
Delivery in 20 Mins Call Girls Cuttack 9332606886 HOT & SEXY Models beautifu...
 
call girls in Anand Vihar (delhi) call me [🔝9953056974🔝] escort service 24X7
call girls in Anand Vihar (delhi) call me [🔝9953056974🔝] escort service 24X7call girls in Anand Vihar (delhi) call me [🔝9953056974🔝] escort service 24X7
call girls in Anand Vihar (delhi) call me [🔝9953056974🔝] escort service 24X7
 
pdfcoffee.com_business-ethics-q3m7-pdf-free.pdf
pdfcoffee.com_business-ethics-q3m7-pdf-free.pdfpdfcoffee.com_business-ethics-q3m7-pdf-free.pdf
pdfcoffee.com_business-ethics-q3m7-pdf-free.pdf
 
20240510 QFM016 Irresponsible AI Reading List April 2024.pdf
20240510 QFM016 Irresponsible AI Reading List April 2024.pdf20240510 QFM016 Irresponsible AI Reading List April 2024.pdf
20240510 QFM016 Irresponsible AI Reading List April 2024.pdf
 

NoSQL Tel Aviv Meetup#1: Introduction to Polyglot Persistance

  • 1. NoSQL Tel Aviv Meetup #1: Polyglot Persistance Arthur Gimpel arthurgi@datazone.io Wifi: zx Password: n0tWireless
  • 5. Arthur Gimpell ‹ ›5 SaaS and more… BaaS
  • 6. Arthur Gimpell ‹ ›6 NoSQL Tel Aviv: Meetup Agenda E OBJECTIVE COMPARISONS q NETWORKING p KNOWLEGE SHARING
  • 7. Arthur Gimpell ‹ ›7 About Me •Working with databases for 8 years •5 years, SQL Server & .NET •3 years with NoSQL & Python & Node.js •2015 - Founded DataZone
  • 8. Arthur Gimpell ‹ ›8 DataZone | Data is our business! What’s yours? •Consultancy & projects •Private & public training •Multi vendor, multi tier support with SLA •Child unit of CloudZone, public cloud leaders
  • 10. Arthur Gimpell ‹ ›10 uBar: Toolbar Company •uBar’s toolbar provides a search engine and various utilities on the toolbar itself •uBar’s revenue streams: •Ads, provided on uBar’s search engine •Bundled downloads with partners •Selling user data & statistics, gathered by the toolbar user’s usage analysis
  • 11. Arthur Gimpell ‹ ›11 uBar: Architecture MSSQL Sessions Toolbar Usage Analytics •uBar’s solution is built on SOA: •Sessions: Session & users mgmt. service •Toolbar Usage: user statistics gathering •Analytics: Near realtime BI
  • 12. Arthur Gimpell ‹ ›12 uBar: Sessions Service - Features MSSQL Sessions •Sessions are created when a client opens a browser •Sessions are ended when client closes browser, or no activity is made during some specific time •Users are mainly marketing, campaign managers, media buyers and more. Those users consume data from the Analytics service
  • 13. Arthur Gimpell ‹ ›13 uBar: Sessions Service - Main Objects MSSQL Sessions •Session: SessionId, ToolbarClientId, UserId, UserAgent, StartTime •User: UserId, UserPermissions, Username, PasswordHash •UserPermissions: UserId, PermissionId •Permissions: PermissionId, Name
  • 14. Arthur Gimpell ‹ ›14 uBar: Toolbar Usage Service - Features MSSQL •Every time an event occurs, like opening a browser by a client, or browsing the internet, the usage service saves data about this event in the relevant table. •ToolbarUsage writes ± 50M events per day Toolbar Usage
  • 15. Arthur Gimpell ‹ ›15 uBar: Toolbar Usage Service - Main Objects MSSQL •ToolbarStart: ToolbarClientId, StartTime, [User data columns] •NewTab: ToolbarClientId, NewTabUrl, [User data columns] •ToolbarClicks: ToolbarClientId, ToolbarFeatureId, [User data columns] •WebsiteVisit: ToolbarClientId, WebsiteUrl, [User data columns] •ToolbarClients: ToolbarClientId, ToolbarVersion, BundledVersion, BundleId Toolbar Usage
  • 16. Arthur Gimpell ‹ ›16 uBar: Analytics Service - Features MSSQL •Analytics service is providing Users with dashboards filled with data. •The data is pre aggregated every 1 hour in the database, and saved to different tables •The analytics service provides the most important KPI when releasing campaigns to millions of users, and according to its data operative decisions are made(stopping bad campaigns, detecting bugs, ab testing etc..) Analytics
  • 17. Arthur Gimpell ‹ ›17 uBar: Challenges •Velocity: 10k writes/sec on Usage service, 1k writes/ sec on Sessions service •Volume: 1TB of operational data(1 month retention) •New clients increase the velocity, and IO subsystem is a bottleneck •Campaign managers want more and more insights in realtime, which require writing complex aggregation jobs on the database and use CPU intensively. RDBMS Sessions Toolbar Usage Analytics
  • 18. Arthur Gimpell ‹ ›18 Issues with Relational Database Management Systems in the IoT Age •Everything is persisted, synchronously. Limited by IO performance. •All data is bound to a tabular schema, hard to make changes in big databases. •All data relies on a single data store, making it hard to scale horizontally. •Complex schema slows down aggregations and queries drastically. RDBMS Sessions Toolbar Usage Analytics
  • 19. Arthur Gimpell ‹ ›19 Polyglot Persistance: Overview Key Value Suitable for key value access patterns. Main benefits are concurrency on key level (Optimistic & Pessimistic), and extremely easy scaling. Document Store Data which is more suitable for OOP languages, storing complex data (JSON) while allowing scaling and distribution. Search / Index stores Every data store serves a different component of the application, according to its access patterns and needs. Consept Suitable for cases where the main data store cannot handle complex querying, Allows scaling the querying layers separately from operational data access (CUD in CRUD).
  • 20. Arthur Gimpell ‹ ›20 uBar: New Data Solution’s Targets New Data Solution Handle the traffic, Velocity and Volume should not limit the product Allow more realtime analytics, and more complex slice & dice for the product Use open source where possible, Reduce costs.
  • 22. Arthur Gimpell ‹ ›22 uBar: Analysing Sessions schema analysis & access patterns Sessions •Sessions are written with a UUID(SessionId), and not sorted in any way in the table (Heap). • Values: •ToolbarClientId (Foreign key to ToolbarClient) •UserId (Foreign key to User) •UserAgent (Unstructured string) •StartTime (DateTime) ?
  • 23. Arthur Gimpell ‹ ›23 uBar: Analysing Sessions schema analysis & access patterns Sessions •Users and Permission tables are quite simple and its own values with Many to Many relation table (UsersPermissions) ?
  • 24. Arthur Gimpell ‹ ›24 uBar: Analysing Sessions schema analysis & access patterns Sessions •Sessions writing Velocity is 1k/sec. IO is a bottleneck. •Sessions are written in Key Value pattern •Users and Permissions are not problematic, since those are cached in the application and rarely change. ?
  • 25. Arthur Gimpell ‹ ›25 uBar: Possible data stores for Sessions service Sessions •Candidate technologies with needed throughput, complex data support, and needed velocity: Redis, Couchbase, Marklogic ?
  • 26. Arthur Gimpell ‹ ›26 uBar: Analysing Toolbar Usage schema analysis & access patterns •Toolbar Usage tables are not normalized in SQL Server, and written as raw data. •Usage write pattern is key value, where value is large (30kb) and unstructured(User agent). •Velocity in writes is 10k/sec, •Toolbar Usage data is also time series data. The tables have a clustered TimeStamp column(and partitioned by it), for easier Analytics and aggregation. ? Toolbar UsageSessions Redis? Couchbase? Marklogic?
  • 27. Arthur Gimpell ‹ ›27 uBar: Possible data stores for Toolbar Usage service •Again, needed write pattern is Key Value. •Data sizing, and needed throughput fits Redis, Couchbase, Marklogic the same way. •Sessions and ToolbarUsage both can rely (potentially) on the same data store. Toolbar UsageSessions Redis? Couchbase? Marklogic? Redis? Couchbase? Marklogic?
  • 28. Arthur Gimpell ‹ ›28 uBar: Analysing Toolbar Usage schema analysis & access patterns •Analytics service’s schema is based on aggregated data of ToolbarUsage & Sessions services. •Development should be simple, in order to allow maximal elasticity for product and analysts. •Analysts should be able to query the data / ad hoc •Data refresh should be less than 15 minutes Toolbar UsageSessions Redis? Couchbase? Marklogic? Redis? Couchbase? Marklogic? Analytics ?
  • 29. Arthur Gimpell ‹ ›29 uBar: Possible data stores for Sessions service •Possible services for analytics divide to various groups: •Classic BI solutions: Tableu, Qlikview, Pantahoo •Column Store DBMS: Redshift, Vertica.. •Pure search engine: Elasticsearch, Solr.. Toolbar UsageSessions Redis? Couchbase? Marklogic? Redis? Couchbase? Marklogic? Analytics BI Tools ColumnStore Search Engine
  • 30. Arthur Gimpell ‹ ›30 uBar: Putting it all together - Operational Needs Toolbar UsageSessions Redis? Couchbase? Marklogic? Redis? Couchbase? Marklogic? AnalyticsVelocity Volume Price Couchbase V V Low - Mid Redis V V Low - Mid Marklogic V V High BI Tools ColumnStore Search Engine
  • 31. Arthur Gimpell ‹ ›31 uBar: Putting it all together - Operational Needs Toolbar UsageSessions Redis? Couchbase? AnalyticsSupport Integration Final Notes Couchbase Vendor Support - SLA Elasticsearch - XDCR SQL Compatible - JDBC ODBC Rich integrations, High quality Support Redis Redis Labs - Managed Plugin for Solr Managed - no maintenance BI Tools ColumnStore Search Engine Redis? Couchbase?
  • 32. Arthur Gimpell ‹ ›32 uBar: Putting it all together - Analytical Needs Toolbar UsageSessions Redis? Couchbase? AnalyticsPossibilities Pros Cons BI Solutions Tableu Pentahoo Qlikview Simple for business users, Integrates with Couchbase Might get expensive Search Engines Elasticsearch Solr Highly customizable Querying is not straight forward BI Tools Search Engine Redis? Couchbase?
  • 33. Arthur Gimpell ‹ ›33 uBar: Final Architecture #1 Toolbar UsageSessions Managed Redis Analytics Elasticsearch Managed Redis •Redis is managed. No maintenance at all for operational and scalable cluster. •Using Elasticsearch with Kibana is great for time series data •Data transformation will be made through ETL.
  • 34. Arthur Gimpell ‹ ›34 uBar: Final Architecture #2 Toolbar UsageSessions Couchbase Analytics BI Tools + Elasticsearch Couchbase •Couchbase is easy to use. •With Couchbase’s SQL on JSONs (N1QL) It is 0 configuration to make it a data source for every possible BI solution •Couchbase’s Filtered replication to Elasticsearch allows it to function only where SQL is not enough.