4. The demands for data are growing rapidly
Increasing demands
Reporting
New products
Forecasting
Threat detection
BI
Machine Learning
Segmenting
Fraud prevention
6. Data is a massive engineering project today
Data Staging
• Custom ETL
• Fragile transforms
• Slow moving
7. Data is a massive engineering project today
Data Staging
Data Warehouse
• High overhead
• DBA experts
8. Data is a massive engineering project today
Data Staging
Data Warehouse
Cubes, BI Extracts & Aggregation Tables
• Data sprawl
• Governance issues
• Slow to update
9. The modern stack puts the burden on IT
BI Acceleration
Data Catalog
Data Prep
Data Virtualization
Ad-hoc Acceleration
11. ✓ Works with any data source
✓ Works with any BI tool
✓ No ETL, no data warehouse, no cubes
✓ Makes data self-service, collaborative
✓ Makes Big Data feel small
✓ Open source
There’s a better way.
12. A New Tier In Data Analytics: Data Fabric
Data Virtualization
RDBMS, MongoDB, Elasticsearch, Hadoop, NAS, Excel, JSON
Data Acceleration
OLAP and ad-hoc queries at interactive speed, without cubes or BI extracts
Data Curation
Wrangle, prepare, enrich any source without making copies of your data.
Data Catalog
Interactive Data Discovery, Enterprise and Personal Data Assets
14. Dremio optimizes your data and your queries automatically for 10x-1000x acceleration
Native Push-Downs
Optimized query semantics for each data source: relational, NoSQL, HDFS, and more.
Universal Relational Algebra
Query Planner automatically substitutes plans to make optimal use of cache fragments.
Apache Arrow Execution
From 1 to 1000+ nodes, run on dedicated infrastructure or in your Hadoop cluster, via YARN.
Dremio Reflections™
Optimized physical data structures for row and aggregation operations.
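To make the substitution idea concrete, here is a minimal sketch; the schema and names are hypothetical, not from the deck. The key point is that the analyst’s query never changes: when a matching reflection exists, the planner rewrites the plan to use it.

-- A typical BI aggregation query (hypothetical schema):
SELECT region, product_line, SUM(amount) AS total_sales
FROM sales.transactions
GROUP BY region, product_line;
-- If an aggregation reflection is defined on sales.transactions
-- (dimensions: region, product_line; measure: amount), the query
-- planner substitutes the precomputed structure for the raw scan.
-- The SQL above stays exactly the same either way.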
15. Dremio security architecture
Client access: ODBC | JDBC | REST, over SSL/TLS*
Authentication: LDAP, Kerberos*
Virtual Dataset Access Control
Data Source Access Control: Impersonation | Trusted Context* | Passthru*
16. Dremio powers analyst collaboration
Discover
● Self-service access to all sources
● First-class SQL support
● Extends your LDAP and Kerberos
Curate (see the SQL sketch below)
● Rename columns, filter results
● Extract and transform values
● Join with other data sets
Accelerate
● Make queries 1000x faster
● Works with any data source
● Automatically adapts to you
Share
● Collaborate with your team
● Extends your permissions
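As a rough sketch of what the Curate step looks like in practice: Dremio models a curated dataset as a SQL view over the sources, so renames, filters, and joins never copy the underlying data. All dataset and column names below are hypothetical.

-- Hypothetical curated virtual dataset: rename columns,
-- filter rows, and join two sources without copying data.
CREATE VIEW marketing.active_customers AS
SELECT c._id       AS customer_id,
       c.full_name AS customer_name,
       o.total     AS order_total
FROM mongo.crm.customers c
JOIN hdfs.warehouse.orders o
  ON o.customer_id = c._id
WHERE c.status = 'active';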
Clarke’s Third Law: Any sufficiently advanced technology is indistinguishable from magic.
BI assumes a single relational database, but…
Data in non-relational technologies
Data fragmented across many systems
Massive scale and velocity
Data is the business, and…
Era of impatient smartphone natives
Rise of self-service BI
Accelerating time to market
Because of the complexity of modern data and increasing demands for data, IT gets crushed in the middle:
Slow or non-responsive IT
“Shadow Analytics”
Data governance risk
Elusive data engineers
Immature software
Competing strategic initiatives
Here’s the problem everyone is trying to solve today.
You have consumers of data with their favorite tools. BI products like Tableau, PowerBI, Qlik, as well as data science tools like Python, R, Spark, and SQL.
Then you have all your data, in a mix of relational, NoSQL, Hadoop, and cloud systems like PCC and PCI.
So how are you going to get the data to the people asking for it?
Staging area
Here’s how everyone tries to solve it:
First you move the data out of the operational systems into a staging area, that might be Hadoop, or one of the cloud file systems like PCC.
You write a bunch of ETL scripts to move the data. These are expensive to write and maintain, and they’re fragile – when the sources change, the scripts have to change too.
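To illustrate why these scripts are fragile, here is a minimal sketch of a typical hand-written staging load; the table and column names are hypothetical.

-- Hypothetical nightly staging load. Every column is enumerated
-- by hand, so an upstream rename or a new column breaks the
-- script or silently drops data.
INSERT INTO staging.orders (order_id, customer_id, amount, created_at)
SELECT o.id,
       o.cust_id,
       o.total_usd,   -- breaks if the source renames this column
       o.created
FROM source_db.orders o
WHERE o.created >= CURRENT_DATE - INTERVAL '1' DAY;  -- nightly batch window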
Here’s how everyone tries to solve it:
Then you move the data into a data warehouse. This could be Teradata, Vertica, or other products.
These are all proprietary, and they take DBA experts to make them work. And to move the data here you write another set of scripts.
But what we see with many customers is that the performance here isn’t sufficient for their needs, and so …
You build cubes and aggregation tables to get the performance your users are asking for. And to do this you build another set of scripts.
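As a rough illustration (names hypothetical), each aggregation table is one more scheduled script that has to be kept in sync with every layer beneath it:

-- Hypothetical nightly rebuild of an aggregation table. It must
-- be re-run whenever the warehouse tables change, and every BI
-- dashboard now points at this extra copy of the data.
CREATE TABLE agg.sales_by_region AS
SELECT region,
       product_line,
       SUM(amount) AS total_sales,
       COUNT(*)    AS order_count
FROM warehouse.sales
GROUP BY region, product_line;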
In the end you’re left with something like this picture. You may have more layers, the technologies may be different, but you’re probably living with something like this. And nobody likes this – it’s expensive, the data movement is slow, it’s hard to change.
But worst of all, you’re left with a dynamic where every time a consumer of the data wants a new piece of data:
They open a ticket with IT
IT begins an engineering project to build another set of pipelines, over several weeks or months
And when we got started, we asked ourselves what we would need to do to make this better, and we came up with these requirements.
Works with any source. Relational, non-relational, third-party apps. Five years ago nobody was using Hadoop, S3, or MongoDB, and five years from now there will be new products. You need a solution that is future-proof.
Works with any BI tool. In every company, multiple tools are in use. Each department has its favorite. We need to work with all of them.
No ETL, no data warehouse, no cubes. It would need to give you a really good alternative to these options.
Makes data self-service and collaborative. Probably most important of all, we need to change the dynamic between the business and IT. We need to make it so business users can get the data they want, in the shape they want it, without waiting on IT.
Makes Big Data feel small. It needs to make billions of rows feel like a spreadsheet on your desktop.
Open source. It’s 2017, so we think this has to be open source.
And that’s Dremio. It sits between all the places you’re creating or capturing data, and all the tools you use to access data. At a high level, that’s how Dremio works. We’ll get into how it works a little later.