Why use a columnar database for analytical workloads by Shane Johnson, Senior Director, MariaDB
In this session, we’re going to discuss how columnar databases the improve performance and efficiency of analytical workloads. We’ll begin by explaining why transactional queries (e.g., return every column in a single row) benefit from row-based storage whereas analytical queries (e.g., return the aggregate of a single column in every row) benefit from column-based storage. We will walk through the storage and query processing architecture of MariaDB AX, an open source columnar database, to show how columnar databases work. In addition, we will show how massively parallel processing, combined with column-based storage, not only improves the performance and efficiency of analytical workloads, but scales to support interactive, ad hoc analytical queries on terabytes of data and billions of rows in real time.
4. MariaDB AX and columnar database use cases
– agenda –
MariaDB AX
Introduction
Architecture
Use cases
Financial services
Healthcare
Telecommunications
Digital advertising
5. A database platform for
modern analytics and data
warehousing.
MariaDB AX and columnar database use cases
– introduction –
Distributed data
Columnar storage
Parallel processing
Data adapters
Connectors (Spark & Kafka)
Open source
Standard SQL
27. MariaDB AX and columnar database use cases
– financial services –
Drivers
Become customer-centric
Facilitate regulatory compliance
Create competitive advantages
Goals
Improve customer satisfaction
Mitigate financial risks
Predict market changes
Use cases
Fraud detection: identify patterns + detect anomalies in financial transactions
Compliance archiving: store financial trade history for long-term retention
Investment forecasting: analyze financial markets + securities to predict ROI
28. MariaDB AX and columnar database use cases
– OTC Markets Group –
Data
10TB of rolling data (5 years)
10,000 U.S. and global securities
100,000 trades
24 million quotes
Use cases
Subscribers analyze quote and trading
data
Regulatory agencies build compliance
reports on demand
29. MariaDB AX and columnar database use cases
– healthcare –
Drivers
Digital transformation
Electronic health records (EHRs)
Value-base care (VBC)
Goals
Improved population health
Better patient experiences
Reduced cost of care
Use cases
Population health mgt: analyze claims/surveys to recommend interventins
Evidence-based medicine: improve diagnostic accuracy by analyzing EHRs
Precision medicine: identify targeted treatments by analyzing genomes
30. MariaDB AX and columnar database use cases
– Institute for Health Metrics and Evaluation –
Data
30TB of data
100 billion data points
Multi-billion row tables
Use cases
Enable the public to analyze global
health population data via online data
visualization tools
31.
32. MariaDB AX and columnar database use cases
– telecommunications –
Drivers
Improve sales, marketing and
operational efficiency
Goals
High customer retention
Better network optimizations
New services and revenue
Use cases
Churn prevention: analyze customer plans/usage to create retention programs
Cross-selling: identify opportunities by analyzing call detail records (CDRs)
Network optimization: analyze traffic/cell tower data to optimize capacity
33. MariaDB AX and columnar database use cases
– Pinger, Inc. –
About
30 million texts
3 million phone calls
1.5 billions logs a day
24 months’ worth of data
Use cases
To support customer behavioural
analysis based on historical data and
usage
34. MariaDB AX and columnar database use cases
– digital advertising –
Drivers
Granularity of data
Demographic and behavioral
Social and location
Goals
Deliver the right ad to the right person at
the right time, in the right location and
through the right medium
Use cases
Audience segmentation: improve ad relevance via fine-grained visitor profiles
Ad placement: choose where to show ads based on click and conversion data
Real-time bidding: analyze big request/response history to optimize prices
35. MariaDB AX and columnar database use cases
– digital advertising vendor –
About
300 million impressions a month
70 million rows a day
60TB of uncompressed data
Use cases
Enable customers to create a custom
report on up to 30 columns on
demand
36. Why use a columnar database
for analytical workloads
Questions?