SlideShare a Scribd company logo
1 of 30
MySQL Visual Analysis
and Scale Out Options
February 26, 2015
2
Agenda
• MySQL visual analysis
• Design considerations
• Web scale challenges
• Characteristics of a
distributed database
• ScaleBase Analysis Genie
• Demo
• Q & A
– Please enter your questions on the GTW side panel
3
Vladi Vexler
Vice President,
Technology and Product Marketing
• Over 15 years experience in software development
and product management
• Experienced in cloud, web and enterprise
• Author of patents in field of databases innovation,
dynamic data caching and machine learning analytics
4
Who Are We?
Distributed Database Management System
Architected for the Cloud
Simple. Reliable. Powerful.
Scale Out Design Considerations
6
What Is Your Goal?
• Scale (mostly) reads
• Scale (mostly) writes
• Performance of reads
– Affected by joins and big tables scans of big tables
• Performance of writes
– Affected by IO r/wr, CPU and table indexes
(a growing overhead)
• Locks
• CPU/IO/ RAM issues
• Load peaks
• Data growth
• Geo-distribution, special data distribution needs
7
Database And Tables Metrics to Review
• Size
– Physical size on disk, Logical size (number of rows)
• Multiple/large indices
– Physical impacts (write time) and Logical impact (RAM)
• Reads vs. Writes
– Number of queries per table?
– % of total MySQL traffic
– % of table’s traffic
• Logical data relations – identify and analyze
– Joins – complexity of data distribution and data access
– Logical Data Chunks – related data in multiple tables
8
Example Visual Analysis: Tables
9
Scale Out Platform Considerations
DIY <> NewSQL <> NoSQL <> ScaleBase
• Short-term cost vs long-term cost
– Do-it-yourself - open source is not truly free
– Time to market
– Pareto principle – 20% of complications will take 80% of time
– High overhead cost in maintenance and future developments
• Reliability (ACID) vs. simplicity (BASE)
• Maturity and availability/reliability
• Features and limitations
• How to define a good data distribution policy?
– How to evaluate efficiency of a policy for data distribution and access?
– How to simulate different distribution policies and compare?
10
Scale Out
Methodologies
Comparison
Characteristics of a
Distributed Database
12
Distributed Table Types
• MASTER: Data on one shard only
– Example: general settings
• GLOBAL: Data copied to all shards
– Example: lookups
• DISTRIBUTED (root):
Data on a single shard, based on a key
– Example: Users table.
• CASCADED (distributed child table): Data on a single shard
however, distribution and access depend on the parent table
– Example: User_Photos, User_Photos_Likes – depend on Users
Note: Not all sharding platforms support Cascaded and Master table types
13
Distributed Queries Types
• ONE_DB - Single-shard execution. Global or Master tables, Distributed
& Cascaded tables, joins of a Distributed and Global tables
• ALL_DB – All-shards execution, one DB-node in a shard cluster:
– SELECT and Aggregate data from many shards – Parallel execution
(“map reduce” style) on all shards, Aggregate, Order, Group-By, Limit
– DDL statements
– DML on Global tables
• FULL_DB – Session statements (USE, SET) to be sent to all database
nodes in all shard clusters
• CROSS_DB – Sharding conflict resolution, such as cross-shard joins.
Note: Not all sharding platforms support ALL_DB, FULL_DB and CROSS_DB queries.
14
Importance of Logical Data Chunks
• Example: A Logical Data Chunk in a Facebook app:
– All rows in tables containing information related to George, from:
Users, Photos, Comments, Likes, Posts, Friends etc…
• Goals:
1. Optimal Data Distribution: Store maximum logical data chunks in
same shards
2. Maximize ONE_DB and ALL_DB queries
3. Handle all complex cases: related data is in multiple shards
– ALL_DB, CROSS_DB, FULL_DB queries
15
Data Relationships can be Extremely Complex
Usually, scale out is applied to growing-mature apps.
How do you define an optimal data distribution policy?
Analysis Genie:
MySQL Visual Analysis &
Optimal Distribution Policy Configuration
17
ScaleBase Analysis Genie
• A tool enabling MySQL visual analysis and building an optimal data
distribution policy; Designed for DBAs, Architects & Dev. Managers
• Two step-process:
– Analysis Assistant
– An agent captures app/DB information, including SQL traffic and
database metrics
– Obfuscates, summarizes and packages the App-DB data
– Analysis Genie
– a SaaS application, receives the AA package and presents the
visual analysis and details the policy configuration
Analysis Assistant Analysis Genie
18
ScaleBase Analysis Genie
• Advanced analytics
– Your schemas, data &
queries
• Identification of best
data distribution policy
– Customized for even the
most complex apps
• Complete policy control
• Quality assurance
– Review before production
• Policy simulation
– “What-if” analysis
https://www.scalebase.com/software/
19
MySQL Visual Analysis: Data and Data Access
20
Relationship Identification
Mapping includes:
• Schemas structures
• Tables & columns names
matching
• Queries parsing and
identification of joined
tables and columns
• Statistics on every object
size and access
21
Analyzing Relationships: From Chaos to Order
Understanding
and mapping
complex
relationships
22
Complete Control to Refine, Change and Simulate
23
Complete Control to Refine, Change and Simulate
Demo
25
ScaleBase Genie and ScaleBase Enterprise
Demo Environment
• Visual analysis
• Distribution policy identification and configuration
• Scale out load via data sharding (massive scale out)
ScaleBase
Enterprise
Analysis
Genie
Summary
27
Customer: Million+ User Online Gaming Company
Who:
• Mobile gaming company expanding globally
• Hosted on SoftLayer cloud in Hong Kong
Problem:
• Over a million downloads - peak period overload
• Needed scaling in place for expansion
Alternatives considered:
• Manually sharding/open source tools
• Other commercial solutions were too costly
Solution:
• Used visual analysis to determine optimized policy
• Up and running within a few weeks of initial download and now supports hundreds of
thousands of daily users
• Fully operational using data distribution and anticipating additional scale out within
next quarter
28
Scale out to unlimited users
Continuous availability
Dynamic workload optimization
Fast and simple deployment
Easily scale out a single
MySQL instance
Optimized for the Cloud
Reduces time-to-market
No changes needed to app or database
Database usage analytics
Intelligent load balancing
Centralized data management
ScaleBase
Distributed Database Management System
29
Get Instant Application/Database Insight!
Use visual analysis to plan your scale out strategy
Download the Analysis Genie here:
https://www.scalebase.com/software
Questions?
Contact Info:
Paul Campaniello
paul.campaniello@scalebase.com
Vladi Vexler
vladi.vexler@scalebase.com
Resources:
www.scalebase.com
www.scalebase.com/resources
www.scalebase.com/blog
info@scalebase.com
(617) 630.2800

More Related Content

What's hot

Platform for Data Scientists
Platform for Data ScientistsPlatform for Data Scientists
Platform for Data Scientistsdatamantra
 
BigDataBx #1 - Atelier 1 Cloudera Datawarehouse Optimisation
BigDataBx #1 - Atelier 1 Cloudera Datawarehouse OptimisationBigDataBx #1 - Atelier 1 Cloudera Datawarehouse Optimisation
BigDataBx #1 - Atelier 1 Cloudera Datawarehouse OptimisationExcelerate Systems
 
Seamless, Real-Time Data Integration with Connect
Seamless, Real-Time Data Integration with ConnectSeamless, Real-Time Data Integration with Connect
Seamless, Real-Time Data Integration with ConnectPrecisely
 
Differentiate Big Data vs Data Warehouse use cases for a cloud solution
Differentiate Big Data vs Data Warehouse use cases for a cloud solutionDifferentiate Big Data vs Data Warehouse use cases for a cloud solution
Differentiate Big Data vs Data Warehouse use cases for a cloud solutionJames Serra
 
Machine Learning and AI
Machine Learning and AIMachine Learning and AI
Machine Learning and AIJames Serra
 
TechEvent Building a Data Lake
TechEvent Building a Data LakeTechEvent Building a Data Lake
TechEvent Building a Data LakeTrivadis
 
New Performance Benchmarks: Apache Impala (incubating) Leads Traditional Anal...
New Performance Benchmarks: Apache Impala (incubating) Leads Traditional Anal...New Performance Benchmarks: Apache Impala (incubating) Leads Traditional Anal...
New Performance Benchmarks: Apache Impala (incubating) Leads Traditional Anal...Cloudera, Inc.
 
AWS Cloud Kata 2013 | Singapore - Getting to Scale on AWS
AWS Cloud Kata 2013 | Singapore - Getting to Scale on AWSAWS Cloud Kata 2013 | Singapore - Getting to Scale on AWS
AWS Cloud Kata 2013 | Singapore - Getting to Scale on AWSAmazon Web Services
 
The Rise of Microservices
The Rise of MicroservicesThe Rise of Microservices
The Rise of MicroservicesMongoDB
 
How Data Drives Business at Choice Hotels
How Data Drives Business at Choice HotelsHow Data Drives Business at Choice Hotels
How Data Drives Business at Choice HotelsCloudera, Inc.
 
Introducing Azure SQL Data Warehouse
Introducing Azure SQL Data WarehouseIntroducing Azure SQL Data Warehouse
Introducing Azure SQL Data WarehouseJames Serra
 
Microsoft Power BI: AI Powered Analytics
Microsoft Power BI: AI Powered AnalyticsMicrosoft Power BI: AI Powered Analytics
Microsoft Power BI: AI Powered AnalyticsJuan Alvarado
 
SQL Server 2008 Migration
SQL Server 2008 MigrationSQL Server 2008 Migration
SQL Server 2008 MigrationMark Ginnebaugh
 
Reducing the Risks of Migrating Off Oracle
Reducing the Risks of Migrating Off OracleReducing the Risks of Migrating Off Oracle
Reducing the Risks of Migrating Off OracleEDB
 
C*ollege Credit: Keep the DB, Lose the A
C*ollege Credit: Keep the DB, Lose the AC*ollege Credit: Keep the DB, Lose the A
C*ollege Credit: Keep the DB, Lose the ADataStax
 
Big Data at a Gaming Company: Spil Games
Big Data at a Gaming Company: Spil GamesBig Data at a Gaming Company: Spil Games
Big Data at a Gaming Company: Spil GamesRob Winters
 
Data Integration through Data Virtualization (SQL Server Konferenz 2019)
Data Integration through Data Virtualization (SQL Server Konferenz 2019)Data Integration through Data Virtualization (SQL Server Konferenz 2019)
Data Integration through Data Virtualization (SQL Server Konferenz 2019)Cathrine Wilhelmsen
 

What's hot (20)

Platform for Data Scientists
Platform for Data ScientistsPlatform for Data Scientists
Platform for Data Scientists
 
BigDataBx #1 - Atelier 1 Cloudera Datawarehouse Optimisation
BigDataBx #1 - Atelier 1 Cloudera Datawarehouse OptimisationBigDataBx #1 - Atelier 1 Cloudera Datawarehouse Optimisation
BigDataBx #1 - Atelier 1 Cloudera Datawarehouse Optimisation
 
Seamless, Real-Time Data Integration with Connect
Seamless, Real-Time Data Integration with ConnectSeamless, Real-Time Data Integration with Connect
Seamless, Real-Time Data Integration with Connect
 
Differentiate Big Data vs Data Warehouse use cases for a cloud solution
Differentiate Big Data vs Data Warehouse use cases for a cloud solutionDifferentiate Big Data vs Data Warehouse use cases for a cloud solution
Differentiate Big Data vs Data Warehouse use cases for a cloud solution
 
Machine Learning and AI
Machine Learning and AIMachine Learning and AI
Machine Learning and AI
 
TechEvent Building a Data Lake
TechEvent Building a Data LakeTechEvent Building a Data Lake
TechEvent Building a Data Lake
 
New Performance Benchmarks: Apache Impala (incubating) Leads Traditional Anal...
New Performance Benchmarks: Apache Impala (incubating) Leads Traditional Anal...New Performance Benchmarks: Apache Impala (incubating) Leads Traditional Anal...
New Performance Benchmarks: Apache Impala (incubating) Leads Traditional Anal...
 
Synapse for mere mortals
Synapse for mere mortalsSynapse for mere mortals
Synapse for mere mortals
 
AWS Cloud Kata 2013 | Singapore - Getting to Scale on AWS
AWS Cloud Kata 2013 | Singapore - Getting to Scale on AWSAWS Cloud Kata 2013 | Singapore - Getting to Scale on AWS
AWS Cloud Kata 2013 | Singapore - Getting to Scale on AWS
 
The Rise of Microservices
The Rise of MicroservicesThe Rise of Microservices
The Rise of Microservices
 
How Data Drives Business at Choice Hotels
How Data Drives Business at Choice HotelsHow Data Drives Business at Choice Hotels
How Data Drives Business at Choice Hotels
 
SQL vs NoSQL
SQL vs NoSQLSQL vs NoSQL
SQL vs NoSQL
 
Azure data platform overview
Azure data platform overviewAzure data platform overview
Azure data platform overview
 
Introducing Azure SQL Data Warehouse
Introducing Azure SQL Data WarehouseIntroducing Azure SQL Data Warehouse
Introducing Azure SQL Data Warehouse
 
Microsoft Power BI: AI Powered Analytics
Microsoft Power BI: AI Powered AnalyticsMicrosoft Power BI: AI Powered Analytics
Microsoft Power BI: AI Powered Analytics
 
SQL Server 2008 Migration
SQL Server 2008 MigrationSQL Server 2008 Migration
SQL Server 2008 Migration
 
Reducing the Risks of Migrating Off Oracle
Reducing the Risks of Migrating Off OracleReducing the Risks of Migrating Off Oracle
Reducing the Risks of Migrating Off Oracle
 
C*ollege Credit: Keep the DB, Lose the A
C*ollege Credit: Keep the DB, Lose the AC*ollege Credit: Keep the DB, Lose the A
C*ollege Credit: Keep the DB, Lose the A
 
Big Data at a Gaming Company: Spil Games
Big Data at a Gaming Company: Spil GamesBig Data at a Gaming Company: Spil Games
Big Data at a Gaming Company: Spil Games
 
Data Integration through Data Virtualization (SQL Server Konferenz 2019)
Data Integration through Data Virtualization (SQL Server Konferenz 2019)Data Integration through Data Virtualization (SQL Server Konferenz 2019)
Data Integration through Data Virtualization (SQL Server Konferenz 2019)
 

Viewers also liked

Handling Data in Mega Scale Web Systems
Handling Data in Mega Scale Web SystemsHandling Data in Mega Scale Web Systems
Handling Data in Mega Scale Web SystemsVineet Gupta
 
Reduce Side Joins
Reduce Side Joins Reduce Side Joins
Reduce Side Joins Edureka!
 
Introduction to Tokenization
Introduction to TokenizationIntroduction to Tokenization
Introduction to TokenizationNabeel Yoosuf
 
Efficient Duplicate Detection Over Massive Data Sets
Efficient Duplicate Detection Over Massive Data SetsEfficient Duplicate Detection Over Massive Data Sets
Efficient Duplicate Detection Over Massive Data SetsPradeeban Kathiravelu, Ph.D.
 
What is Payment Tokenization?
What is Payment Tokenization?What is Payment Tokenization?
What is Payment Tokenization?Rambus Inc
 
Cassandra @ Sony: The good, the bad, and the ugly part 2
Cassandra @ Sony: The good, the bad, and the ugly part 2Cassandra @ Sony: The good, the bad, and the ugly part 2
Cassandra @ Sony: The good, the bad, and the ugly part 2DataStax Academy
 
Tuple map reduce: beyond classic mapreduce
Tuple map reduce: beyond classic mapreduceTuple map reduce: beyond classic mapreduce
Tuple map reduce: beyond classic mapreducedatasalt
 
Nationale EuroCloud Monitor 2015 "Tussen Trotski en Troelstra"
Nationale EuroCloud Monitor 2015 "Tussen Trotski en Troelstra"Nationale EuroCloud Monitor 2015 "Tussen Trotski en Troelstra"
Nationale EuroCloud Monitor 2015 "Tussen Trotski en Troelstra"Peter Vermeulen
 
9/11 Lore Nata Maria Ale
9/11 Lore Nata Maria Ale9/11 Lore Nata Maria Ale
9/11 Lore Nata Maria AleLorena Pga
 
365 days: Croatian Government's Decisions after the 1st year [Infographic]
365 days: Croatian Government's Decisions after the 1st year [Infographic]365 days: Croatian Government's Decisions after the 1st year [Infographic]
365 days: Croatian Government's Decisions after the 1st year [Infographic]Tomislav Korman
 
10 species of dinosaur from Romania
10 species of dinosaur from Romania10 species of dinosaur from Romania
10 species of dinosaur from Romaniabalada65
 
Scarlett Falling Down
Scarlett Falling DownScarlett Falling Down
Scarlett Falling DownLes Davy
 
Kuronen: Oppilas- ja opiskelijahuolto osaksi lasten ja nuorten hyvinvointisuu...
Kuronen: Oppilas- ja opiskelijahuolto osaksi lasten ja nuorten hyvinvointisuu...Kuronen: Oppilas- ja opiskelijahuolto osaksi lasten ja nuorten hyvinvointisuu...
Kuronen: Oppilas- ja opiskelijahuolto osaksi lasten ja nuorten hyvinvointisuu...Kouluterveyskysely
 
5 Steps to Successful Mobile Expense Management
5 Steps to Successful Mobile Expense Management5 Steps to Successful Mobile Expense Management
5 Steps to Successful Mobile Expense ManagementXigo
 
20 Famous Sopts
20 Famous Sopts20 Famous Sopts
20 Famous Soptshome based
 
Innovation & value creation in the document space
Innovation & value creation in the document spaceInnovation & value creation in the document space
Innovation & value creation in the document spaceDon Harbison
 

Viewers also liked (20)

Handling Data in Mega Scale Web Systems
Handling Data in Mega Scale Web SystemsHandling Data in Mega Scale Web Systems
Handling Data in Mega Scale Web Systems
 
Hpts 2011 flexible_oltp
Hpts 2011 flexible_oltpHpts 2011 flexible_oltp
Hpts 2011 flexible_oltp
 
Reduce Side Joins
Reduce Side Joins Reduce Side Joins
Reduce Side Joins
 
Introduction to Tokenization
Introduction to TokenizationIntroduction to Tokenization
Introduction to Tokenization
 
Denormalization
DenormalizationDenormalization
Denormalization
 
Efficient Duplicate Detection Over Massive Data Sets
Efficient Duplicate Detection Over Massive Data SetsEfficient Duplicate Detection Over Massive Data Sets
Efficient Duplicate Detection Over Massive Data Sets
 
What is Payment Tokenization?
What is Payment Tokenization?What is Payment Tokenization?
What is Payment Tokenization?
 
Cassandra @ Sony: The good, the bad, and the ugly part 2
Cassandra @ Sony: The good, the bad, and the ugly part 2Cassandra @ Sony: The good, the bad, and the ugly part 2
Cassandra @ Sony: The good, the bad, and the ugly part 2
 
Tuple map reduce: beyond classic mapreduce
Tuple map reduce: beyond classic mapreduceTuple map reduce: beyond classic mapreduce
Tuple map reduce: beyond classic mapreduce
 
Nationale EuroCloud Monitor 2015 "Tussen Trotski en Troelstra"
Nationale EuroCloud Monitor 2015 "Tussen Trotski en Troelstra"Nationale EuroCloud Monitor 2015 "Tussen Trotski en Troelstra"
Nationale EuroCloud Monitor 2015 "Tussen Trotski en Troelstra"
 
9/11 Lore Nata Maria Ale
9/11 Lore Nata Maria Ale9/11 Lore Nata Maria Ale
9/11 Lore Nata Maria Ale
 
365 days: Croatian Government's Decisions after the 1st year [Infographic]
365 days: Croatian Government's Decisions after the 1st year [Infographic]365 days: Croatian Government's Decisions after the 1st year [Infographic]
365 days: Croatian Government's Decisions after the 1st year [Infographic]
 
10 species of dinosaur from Romania
10 species of dinosaur from Romania10 species of dinosaur from Romania
10 species of dinosaur from Romania
 
Developing Modular Systems using OSGi
Developing Modular Systems using OSGiDeveloping Modular Systems using OSGi
Developing Modular Systems using OSGi
 
Scarlett Falling Down
Scarlett Falling DownScarlett Falling Down
Scarlett Falling Down
 
Kuronen: Oppilas- ja opiskelijahuolto osaksi lasten ja nuorten hyvinvointisuu...
Kuronen: Oppilas- ja opiskelijahuolto osaksi lasten ja nuorten hyvinvointisuu...Kuronen: Oppilas- ja opiskelijahuolto osaksi lasten ja nuorten hyvinvointisuu...
Kuronen: Oppilas- ja opiskelijahuolto osaksi lasten ja nuorten hyvinvointisuu...
 
Results
ResultsResults
Results
 
5 Steps to Successful Mobile Expense Management
5 Steps to Successful Mobile Expense Management5 Steps to Successful Mobile Expense Management
5 Steps to Successful Mobile Expense Management
 
20 Famous Sopts
20 Famous Sopts20 Famous Sopts
20 Famous Sopts
 
Innovation & value creation in the document space
Innovation & value creation in the document spaceInnovation & value creation in the document space
Innovation & value creation in the document space
 

Similar to MySQL Visual Analysis and Scale-out Strategy definition - Webinar deck

Data Modeling and Scale Out - ScaleBase + 451-Group webinar 30.4.2015
Data Modeling and Scale Out - ScaleBase + 451-Group webinar 30.4.2015 Data Modeling and Scale Out - ScaleBase + 451-Group webinar 30.4.2015
Data Modeling and Scale Out - ScaleBase + 451-Group webinar 30.4.2015 Vladi Vexler
 
Distributed RDBMS: Data Distribution Policy: Part 2 - Creating a Data Distrib...
Distributed RDBMS: Data Distribution Policy: Part 2 - Creating a Data Distrib...Distributed RDBMS: Data Distribution Policy: Part 2 - Creating a Data Distrib...
Distributed RDBMS: Data Distribution Policy: Part 2 - Creating a Data Distrib...ScaleBase
 
Data Lake Acceleration vs. Data Virtualization - What’s the difference?
Data Lake Acceleration vs. Data Virtualization - What’s the difference?Data Lake Acceleration vs. Data Virtualization - What’s the difference?
Data Lake Acceleration vs. Data Virtualization - What’s the difference?Denodo
 
NoSQLDatabases
NoSQLDatabasesNoSQLDatabases
NoSQLDatabasesAdi Challa
 
Big SQL 3.0 - Fast and easy SQL on Hadoop
Big SQL 3.0 - Fast and easy SQL on HadoopBig SQL 3.0 - Fast and easy SQL on Hadoop
Big SQL 3.0 - Fast and easy SQL on HadoopWilfried Hoge
 
Hadoop and SQL: Delivery Analytics Across the Organization
Hadoop and SQL:  Delivery Analytics Across the OrganizationHadoop and SQL:  Delivery Analytics Across the Organization
Hadoop and SQL: Delivery Analytics Across the OrganizationSeeling Cheung
 
Evolution of Distributed Database Technologies in the Digital era
Evolution of Distributed Database Technologies in the Digital eraEvolution of Distributed Database Technologies in the Digital era
Evolution of Distributed Database Technologies in the Digital eraVishal Puri
 
Distributed RDBMS: Data Distribution Policy: Part 1 - What is a Data Distribu...
Distributed RDBMS: Data Distribution Policy: Part 1 - What is a Data Distribu...Distributed RDBMS: Data Distribution Policy: Part 1 - What is a Data Distribu...
Distributed RDBMS: Data Distribution Policy: Part 1 - What is a Data Distribu...ScaleBase
 
Data Warehouse Optimization
Data Warehouse OptimizationData Warehouse Optimization
Data Warehouse OptimizationCloudera, Inc.
 
ADV Slides: Building and Growing Organizational Analytics with Data Lakes
ADV Slides: Building and Growing Organizational Analytics with Data LakesADV Slides: Building and Growing Organizational Analytics with Data Lakes
ADV Slides: Building and Growing Organizational Analytics with Data LakesDATAVERSITY
 
Architect’s Open-Source Guide for a Data Mesh Architecture
Architect’s Open-Source Guide for a Data Mesh ArchitectureArchitect’s Open-Source Guide for a Data Mesh Architecture
Architect’s Open-Source Guide for a Data Mesh ArchitectureDatabricks
 
An overview of modern scalable web development
An overview of modern scalable web developmentAn overview of modern scalable web development
An overview of modern scalable web developmentTung Nguyen
 
Pr dc 2015 sql server is cheaper than open source
Pr dc 2015 sql server is cheaper than open sourcePr dc 2015 sql server is cheaper than open source
Pr dc 2015 sql server is cheaper than open sourceTerry Bunio
 
OPEN'17_4_Postgres: The Centerpiece for Modernising IT Infrastructures
OPEN'17_4_Postgres: The Centerpiece for Modernising IT InfrastructuresOPEN'17_4_Postgres: The Centerpiece for Modernising IT Infrastructures
OPEN'17_4_Postgres: The Centerpiece for Modernising IT InfrastructuresKangaroot
 
Skillwise Big Data part 2
Skillwise Big Data part 2Skillwise Big Data part 2
Skillwise Big Data part 2Skillwise Group
 
Relational databases vs Non-relational databases
Relational databases vs Non-relational databasesRelational databases vs Non-relational databases
Relational databases vs Non-relational databasesJames Serra
 

Similar to MySQL Visual Analysis and Scale-out Strategy definition - Webinar deck (20)

Data Modeling and Scale Out - ScaleBase + 451-Group webinar 30.4.2015
Data Modeling and Scale Out - ScaleBase + 451-Group webinar 30.4.2015 Data Modeling and Scale Out - ScaleBase + 451-Group webinar 30.4.2015
Data Modeling and Scale Out - ScaleBase + 451-Group webinar 30.4.2015
 
Distributed RDBMS: Data Distribution Policy: Part 2 - Creating a Data Distrib...
Distributed RDBMS: Data Distribution Policy: Part 2 - Creating a Data Distrib...Distributed RDBMS: Data Distribution Policy: Part 2 - Creating a Data Distrib...
Distributed RDBMS: Data Distribution Policy: Part 2 - Creating a Data Distrib...
 
NoSql Brownbag
NoSql BrownbagNoSql Brownbag
NoSql Brownbag
 
Data Lake Acceleration vs. Data Virtualization - What’s the difference?
Data Lake Acceleration vs. Data Virtualization - What’s the difference?Data Lake Acceleration vs. Data Virtualization - What’s the difference?
Data Lake Acceleration vs. Data Virtualization - What’s the difference?
 
NoSQLDatabases
NoSQLDatabasesNoSQLDatabases
NoSQLDatabases
 
Big SQL 3.0 - Fast and easy SQL on Hadoop
Big SQL 3.0 - Fast and easy SQL on HadoopBig SQL 3.0 - Fast and easy SQL on Hadoop
Big SQL 3.0 - Fast and easy SQL on Hadoop
 
Hadoop and SQL: Delivery Analytics Across the Organization
Hadoop and SQL:  Delivery Analytics Across the OrganizationHadoop and SQL:  Delivery Analytics Across the Organization
Hadoop and SQL: Delivery Analytics Across the Organization
 
Evolution of Distributed Database Technologies in the Digital era
Evolution of Distributed Database Technologies in the Digital eraEvolution of Distributed Database Technologies in the Digital era
Evolution of Distributed Database Technologies in the Digital era
 
Distributed RDBMS: Data Distribution Policy: Part 1 - What is a Data Distribu...
Distributed RDBMS: Data Distribution Policy: Part 1 - What is a Data Distribu...Distributed RDBMS: Data Distribution Policy: Part 1 - What is a Data Distribu...
Distributed RDBMS: Data Distribution Policy: Part 1 - What is a Data Distribu...
 
Data Warehouse Optimization
Data Warehouse OptimizationData Warehouse Optimization
Data Warehouse Optimization
 
SoftServe BI/BigData Workshop in Utah
SoftServe BI/BigData Workshop in UtahSoftServe BI/BigData Workshop in Utah
SoftServe BI/BigData Workshop in Utah
 
ADV Slides: Building and Growing Organizational Analytics with Data Lakes
ADV Slides: Building and Growing Organizational Analytics with Data LakesADV Slides: Building and Growing Organizational Analytics with Data Lakes
ADV Slides: Building and Growing Organizational Analytics with Data Lakes
 
Architect’s Open-Source Guide for a Data Mesh Architecture
Architect’s Open-Source Guide for a Data Mesh ArchitectureArchitect’s Open-Source Guide for a Data Mesh Architecture
Architect’s Open-Source Guide for a Data Mesh Architecture
 
An overview of modern scalable web development
An overview of modern scalable web developmentAn overview of modern scalable web development
An overview of modern scalable web development
 
Pr dc 2015 sql server is cheaper than open source
Pr dc 2015 sql server is cheaper than open sourcePr dc 2015 sql server is cheaper than open source
Pr dc 2015 sql server is cheaper than open source
 
Skilwise Big data
Skilwise Big dataSkilwise Big data
Skilwise Big data
 
OPEN'17_4_Postgres: The Centerpiece for Modernising IT Infrastructures
OPEN'17_4_Postgres: The Centerpiece for Modernising IT InfrastructuresOPEN'17_4_Postgres: The Centerpiece for Modernising IT Infrastructures
OPEN'17_4_Postgres: The Centerpiece for Modernising IT Infrastructures
 
Skillwise Big Data part 2
Skillwise Big Data part 2Skillwise Big Data part 2
Skillwise Big Data part 2
 
Couchbase 3.0.2 d1
Couchbase 3.0.2  d1Couchbase 3.0.2  d1
Couchbase 3.0.2 d1
 
Relational databases vs Non-relational databases
Relational databases vs Non-relational databasesRelational databases vs Non-relational databases
Relational databases vs Non-relational databases
 

More from Vladi Vexler

SafePeak - IT particle accelerator (2012)
SafePeak - IT particle accelerator (2012)SafePeak - IT particle accelerator (2012)
SafePeak - IT particle accelerator (2012)Vladi Vexler
 
SafePeak - In-Memory Dynamic Caching
SafePeak - In-Memory Dynamic CachingSafePeak - In-Memory Dynamic Caching
SafePeak - In-Memory Dynamic CachingVladi Vexler
 
SafePeak Configuration Guide
SafePeak Configuration GuideSafePeak Configuration Guide
SafePeak Configuration GuideVladi Vexler
 
SafePeak - How to manually configure SafePeak Cluster
SafePeak - How to manually configure SafePeak ClusterSafePeak - How to manually configure SafePeak Cluster
SafePeak - How to manually configure SafePeak ClusterVladi Vexler
 
Safe peak installation guide version 2.1
Safe peak installation guide version 2.1Safe peak installation guide version 2.1
Safe peak installation guide version 2.1Vladi Vexler
 
SafePeak - How to configure SQL Server agent in a safepeak deployment
SafePeak - How to configure SQL Server agent in a safepeak deploymentSafePeak - How to configure SQL Server agent in a safepeak deployment
SafePeak - How to configure SQL Server agent in a safepeak deploymentVladi Vexler
 
SafePeak cloud case study:EEDAR
SafePeak cloud case study:EEDAR SafePeak cloud case study:EEDAR
SafePeak cloud case study:EEDAR Vladi Vexler
 
SafePeak whitepaper for Cloud Apps
SafePeak whitepaper for Cloud AppsSafePeak whitepaper for Cloud Apps
SafePeak whitepaper for Cloud AppsVladi Vexler
 
SafePeak Installation guide
SafePeak Installation guideSafePeak Installation guide
SafePeak Installation guideVladi Vexler
 
SafePeak Globes testimonial
SafePeak Globes testimonialSafePeak Globes testimonial
SafePeak Globes testimonialVladi Vexler
 
SafePeak - Poria hospital case study
SafePeak - Poria hospital case studySafePeak - Poria hospital case study
SafePeak - Poria hospital case studyVladi Vexler
 
SafePeak @ large telco - Sharepoint benchmark
SafePeak @ large telco - Sharepoint benchmarkSafePeak @ large telco - Sharepoint benchmark
SafePeak @ large telco - Sharepoint benchmarkVladi Vexler
 
SafePeak datasheet 2010
SafePeak datasheet 2010SafePeak datasheet 2010
SafePeak datasheet 2010Vladi Vexler
 
SafePeak whitepaper
SafePeak whitepaperSafePeak whitepaper
SafePeak whitepaperVladi Vexler
 

More from Vladi Vexler (14)

SafePeak - IT particle accelerator (2012)
SafePeak - IT particle accelerator (2012)SafePeak - IT particle accelerator (2012)
SafePeak - IT particle accelerator (2012)
 
SafePeak - In-Memory Dynamic Caching
SafePeak - In-Memory Dynamic CachingSafePeak - In-Memory Dynamic Caching
SafePeak - In-Memory Dynamic Caching
 
SafePeak Configuration Guide
SafePeak Configuration GuideSafePeak Configuration Guide
SafePeak Configuration Guide
 
SafePeak - How to manually configure SafePeak Cluster
SafePeak - How to manually configure SafePeak ClusterSafePeak - How to manually configure SafePeak Cluster
SafePeak - How to manually configure SafePeak Cluster
 
Safe peak installation guide version 2.1
Safe peak installation guide version 2.1Safe peak installation guide version 2.1
Safe peak installation guide version 2.1
 
SafePeak - How to configure SQL Server agent in a safepeak deployment
SafePeak - How to configure SQL Server agent in a safepeak deploymentSafePeak - How to configure SQL Server agent in a safepeak deployment
SafePeak - How to configure SQL Server agent in a safepeak deployment
 
SafePeak cloud case study:EEDAR
SafePeak cloud case study:EEDAR SafePeak cloud case study:EEDAR
SafePeak cloud case study:EEDAR
 
SafePeak whitepaper for Cloud Apps
SafePeak whitepaper for Cloud AppsSafePeak whitepaper for Cloud Apps
SafePeak whitepaper for Cloud Apps
 
SafePeak Installation guide
SafePeak Installation guideSafePeak Installation guide
SafePeak Installation guide
 
SafePeak Globes testimonial
SafePeak Globes testimonialSafePeak Globes testimonial
SafePeak Globes testimonial
 
SafePeak - Poria hospital case study
SafePeak - Poria hospital case studySafePeak - Poria hospital case study
SafePeak - Poria hospital case study
 
SafePeak @ large telco - Sharepoint benchmark
SafePeak @ large telco - Sharepoint benchmarkSafePeak @ large telco - Sharepoint benchmark
SafePeak @ large telco - Sharepoint benchmark
 
SafePeak datasheet 2010
SafePeak datasheet 2010SafePeak datasheet 2010
SafePeak datasheet 2010
 
SafePeak whitepaper
SafePeak whitepaperSafePeak whitepaper
SafePeak whitepaper
 

Recently uploaded

From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationSafe Software
 
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024BookNet Canada
 
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure serviceWhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure servicePooja Nehwal
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsMaria Levchenko
 
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxFactors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxKatpro Technologies
 
Pigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food ManufacturingPigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food ManufacturingPigging Solutions
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdfhans926745
 
Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Paola De la Torre
 
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024BookNet Canada
 
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...HostedbyConfluent
 
Azure Monitor & Application Insight to monitor Infrastructure & Application
Azure Monitor & Application Insight to monitor Infrastructure & ApplicationAzure Monitor & Application Insight to monitor Infrastructure & Application
Azure Monitor & Application Insight to monitor Infrastructure & ApplicationAndikSusilo4
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking MenDelhi Call girls
 
SIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge GraphSIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge GraphNeo4j
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking MenDelhi Call girls
 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsMemoori
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsMark Billinghurst
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slidespraypatel2
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonetsnaman860154
 
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationBeyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationSafe Software
 

Recently uploaded (20)

From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
 
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure serviceWhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed texts
 
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxFactors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
 
Pigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food ManufacturingPigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food Manufacturing
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
 
Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101
 
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
 
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
 
Azure Monitor & Application Insight to monitor Infrastructure & Application
Azure Monitor & Application Insight to monitor Infrastructure & ApplicationAzure Monitor & Application Insight to monitor Infrastructure & Application
Azure Monitor & Application Insight to monitor Infrastructure & Application
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
 
SIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge GraphSIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge Graph
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial Buildings
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR Systems
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slides
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonets
 
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationBeyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
 

MySQL Visual Analysis and Scale-out Strategy definition - Webinar deck

  • 1. MySQL Visual Analysis and Scale Out Options February 26, 2015
  • 2. 2 Agenda • MySQL visual analysis • Design considerations • Web scale challenges • Characteristics of a distributed database • ScaleBase Analysis Genie • Demo • Q & A – Please enter your questions on the GTW side panel
  • 3. 3 Vladi Vexler Vice President, Technology and Product Marketing • Over 15 years experience in software development and product management • Experienced in cloud, web and enterprise • Author of patents in field of databases innovation, dynamic data caching and machine learning analytics
  • 4. 4 Who Are We? Distributed Database Management System Architected for the Cloud Simple. Reliable. Powerful.
  • 5. Scale Out Design Considerations
  • 6. 6 What Is Your Goal? • Scale (mostly) reads • Scale (mostly) writes • Performance of reads – Affected by joins and big tables scans of big tables • Performance of writes – Affected by IO r/wr, CPU and table indexes (a growing overhead) • Locks • CPU/IO/ RAM issues • Load peaks • Data growth • Geo-distribution, special data distribution needs
  • 7. 7 Database And Tables Metrics to Review • Size – Physical size on disk, Logical size (number of rows) • Multiple/large indices – Physical impacts (write time) and Logical impact (RAM) • Reads vs. Writes – Number of queries per table? – % of total MySQL traffic – % of table’s traffic • Logical data relations – identify and analyze – Joins – complexity of data distribution and data access – Logical Data Chunks – related data in multiple tables
  • 9. 9 Scale Out Platform Considerations DIY <> NewSQL <> NoSQL <> ScaleBase • Short-term cost vs long-term cost – Do-it-yourself - open source is not truly free – Time to market – Pareto principle – 20% of complications will take 80% of time – High overhead cost in maintenance and future developments • Reliability (ACID) vs. simplicity (BASE) • Maturity and availability/reliability • Features and limitations • How to define a good data distribution policy? – How to evaluate efficiency of a policy for data distribution and access? – How to simulate different distribution policies and compare?
  • 12. 12 Distributed Table Types • MASTER: Data on one shard only – Example: general settings • GLOBAL: Data copied to all shards – Example: lookups • DISTRIBUTED (root): Data on a single shard, based on a key – Example: Users table. • CASCADED (distributed child table): Data on a single shard however, distribution and access depend on the parent table – Example: User_Photos, User_Photos_Likes – depend on Users Note: Not all sharding platforms support Cascaded and Master table types
  • 13. 13 Distributed Queries Types • ONE_DB - Single-shard execution. Global or Master tables, Distributed & Cascaded tables, joins of a Distributed and Global tables • ALL_DB – All-shards execution, one DB-node in a shard cluster: – SELECT and Aggregate data from many shards – Parallel execution (“map reduce” style) on all shards, Aggregate, Order, Group-By, Limit – DDL statements – DML on Global tables • FULL_DB – Session statements (USE, SET) to be sent to all database nodes in all shard clusters • CROSS_DB – Sharding conflict resolution, such as cross-shard joins. Note: Not all sharding platforms support ALL_DB, FULL_DB and CROSS_DB queries.
  • 14. 14 Importance of Logical Data Chunks • Example: A Logical Data Chunk in a Facebook app: – All rows in tables containing information related to George, from: Users, Photos, Comments, Likes, Posts, Friends etc… • Goals: 1. Optimal Data Distribution: Store maximum logical data chunks in same shards 2. Maximize ONE_DB and ALL_DB queries 3. Handle all complex cases: related data is in multiple shards – ALL_DB, CROSS_DB, FULL_DB queries
  • 15. 15 Data Relationships can be Extremely Complex Usually, scale out is applied to growing-mature apps. How do you define an optimal data distribution policy?
  • 16. Analysis Genie: MySQL Visual Analysis & Optimal Distribution Policy Configuration
  • 17. 17 ScaleBase Analysis Genie • A tool enabling MySQL visual analysis and building an optimal data distribution policy; Designed for DBAs, Architects & Dev. Managers • Two step-process: – Analysis Assistant – An agent captures app/DB information, including SQL traffic and database metrics – Obfuscates, summarizes and packages the App-DB data – Analysis Genie – a SaaS application, receives the AA package and presents the visual analysis and details the policy configuration Analysis Assistant Analysis Genie
  • 18. 18 ScaleBase Analysis Genie • Advanced analytics – Your schemas, data & queries • Identification of best data distribution policy – Customized for even the most complex apps • Complete policy control • Quality assurance – Review before production • Policy simulation – “What-if” analysis https://www.scalebase.com/software/
  • 19. 19 MySQL Visual Analysis: Data and Data Access
  • 20. 20 Relationship Identification Mapping includes: • Schemas structures • Tables & columns names matching • Queries parsing and identification of joined tables and columns • Statistics on every object size and access
  • 21. 21 Analyzing Relationships: From Chaos to Order Understanding and mapping complex relationships
  • 22. 22 Complete Control to Refine, Change and Simulate
  • 23. 23 Complete Control to Refine, Change and Simulate
  • 24. Demo
  • 25. 25 ScaleBase Genie and ScaleBase Enterprise Demo Environment • Visual analysis • Distribution policy identification and configuration • Scale out load via data sharding (massive scale out) ScaleBase Enterprise Analysis Genie
  • 27. 27 Customer: Million+ User Online Gaming Company Who: • Mobile gaming company expanding globally • Hosted on SoftLayer cloud in Hong Kong Problem: • Over a million downloads - peak period overload • Needed scaling in place for expansion Alternatives considered: • Manually sharding/open source tools • Other commercial solutions were too costly Solution: • Used visual analysis to determine optimized policy • Up and running within a few weeks of initial download and now supports hundreds of thousands of daily users • Fully operational using data distribution and anticipating additional scale out within next quarter
  • 28. 28 Scale out to unlimited users Continuous availability Dynamic workload optimization Fast and simple deployment Easily scale out a single MySQL instance Optimized for the Cloud Reduces time-to-market No changes needed to app or database Database usage analytics Intelligent load balancing Centralized data management ScaleBase Distributed Database Management System
  • 29. 29 Get Instant Application/Database Insight! Use visual analysis to plan your scale out strategy Download the Analysis Genie here: https://www.scalebase.com/software
  • 30. Questions? Contact Info: Paul Campaniello paul.campaniello@scalebase.com Vladi Vexler vladi.vexler@scalebase.com Resources: www.scalebase.com www.scalebase.com/resources www.scalebase.com/blog info@scalebase.com (617) 630.2800

Editor's Notes

  1. Next questions to discuss and consider are about the type of platform applicable for me. BASE = Basically Available, Soft state, Eventual consistency Basically Available: This constraint states that the system does guarantee the availability of the data as regards CAP Theorem; there will be a response to any request. But, that response could still be ‘failure’ to obtain the requested data or the data may be in an inconsistent or changing state, much like waiting for a check to clear in your bank account. Soft state: The state of the system could change over time, so even during times without input there may be changes going on due to ‘eventual consistency,’ thus the state of the system is always ‘soft.’ Eventual consistency: The system will eventually become consistent once it stops receiving input. The data will propagate to everywhere it should sooner or later, but the system will continue to receive input and is not checking the consistency of every transaction before it moves onto the next one.
  2. Here is a summary of different approaches. More detailed description can be found on our website, under Resources -> Competitive Comparison Explain the circles, We are the only one for example that provide Advanced Analytics, which is the foundation for defining optimal distribution policy. ScaleBase solution is the most simple to deploy, enabling shortest go-to-market and lowest maintenance
  3. One of first steps is to Visually Analyze complete summary about state of your MySQL tables: - Physical and Logical Sizes, Writes, Reads, Joins
  4. Determine optimal distribution policy for your specific application and database Analyze your existing schema and queries What is the current structure of your data How is your data accessed by the applications What is the size and rate of writes to individual tables
  5. Determine optimal distribution policy for your specific application and database Analyze your existing schema and queries What is the current structure of your data How is your data accessed by the applications What is the size and rate of writes to individual tables
  6. Determine optimal distribution policy for your specific application and database Analyze your existing schema and queries What is the current structure of your data How is your data accessed by the applications What is the size and rate of writes to individual tables
  7. Determine optimal distribution policy for your specific application and database Analyze your existing schema and queries What is the current structure of your data How is your data accessed by the applications What is the size and rate of writes to individual tables
  8. Risk Cost savings (ROI) Time to market Building solution takes years Open source is limited Not comprehensive Lack of technical support and services Custom built Inefficient and hard to maintain