2. Disclaimer
I am the exclusive recipient of complaints
Email me at: stavros@tiledb.com
All the credit for our amazing work goes to our powerful team
Check it out at https://tiledb.com/about
3. Deep roots at the intersection of HPC, databases and data science
Traction with telecoms, pharmas, hospitals and other scientific organizations
40 members with expertise across all applications and domains
Who we are
TileDB got spun out from MIT and Intel Labs in 2017
WHERE IT ALL STARTED
Raised over $20M, we are very well capitalized
INVESTORS
5. Visit tiledb.com for a lot of resources
What you need to know
TileDB is a universal database
All data types (tables, images, video, genomics, LiDAR, etc)
Based on multi-dimensional arrays
TileDB offerings
TileDB Embedded (open-source storage engine)
TileDB Cloud (SaaS / on-prem database)
Numerous APIs and integrations
Numerous backends and cloud-optimized
6. TileDB Cloud
❏ Access control and logging
❏ Serverless SQL, UDFs, task graphs
❏ Jupyter notebooks and dashboards
Unified data management
and easy serverless compute
at global scale
The TileDB Universal Database
Pluggable Compute: Efficient APIs & Tool Integrations
TileDB Embedded
Open-source interoperable
storage with a universal
open-spec array format
❏ Parallel IO, rapid reads & writes
❏ Columnar, cloud-optimized
❏ Data versioning & time traveling
7. What is TileDB Embedded?
An embeddable C library that stores and accesses multi-dimensional arrays
Dense array Sparse array
It implements very fast array slicing across dimensions
8. Superior
performance
Built in C
Fully-parallelized
Columnar format
Multiple compressors
R-trees for sparse arrays
TileDB Embedded at a Glance
https://github.com/TileDBInc/TileDB
Open source:
Rapid updates
& data versioning
Immutable writes
Lock-free
Parallel reader / writer model
Time traveling
Schema evolution
9. TileDB Embedded at a Glance
https://github.com/TileDBInc/TileDB
Open source:
Extreme
interoperability
Numerous APIs
Numerous integrations
All backends
Optimized
for the cloud
Immutable writes
Parallel IO
Minimization of requests
11. Unified Data Management
Everything in TileDB Cloud is an array
All data, notebooks, UDFs, dashboards, ML models
A single platform for data management
Catalogs, descriptions, metadata and exploration
Access control
Logging
A single UI, everything accessible via REST
12. Notebooks
Embedded JupyterHub instances in the TileDB Cloud UI
Notebook management (similar to arrays)
Catalogs, descriptions, metadata and exploration
Access control
Logging
Super easy onboarding and testing
Launch different types
13. Sharing & Logging
Share your work, learn from others, promote science
A massive catalog of analysis-ready datasets
A massive catalog of runnable code
Collaboration and reproducibility
Organizations
Serverless, global-scale infrastructure
14. Serverless Scalable Compute
Serverless slicing and SQL
Serverless UDFs and task graphs
Geo-aware compute dispatch
Zero-infra data and code sharing
Automation, scalability, cost savings
15. Machine Learning
Store and version all ML models along with your data
Catalog, descriptions, metadata, versions, etc.
Sharing and logging
Scalable training and servicing of the models
ML is a data management problem
16. Dashboards
Diversify your visualization options
Create any dashboard via Python widgets, R shiny or other
Dashboards are notebooks, and notebooks are arrays
Launch a dashboard like a notebook in the TileDB UI
Share it, log it, monetize it
17. Monetization
A game-changer for marketplaces
A full marketplace, integrating with Stripe
Monetize everything (data and code)
Zero-infra requirement from the data/code vendor
No more wrangling data and deploying code
19. TileDB Cloud Value Proposition
A single solution for data storage and analysis
Unified data management
Security (authentication, access control, logging)
Better performance at a lower cost
Faster storage and access because of the array engine
Serverless, pay-as-you-go, geo-aware compute
Versatile, scalable compute
Zero-infra data/code sharing and monetization
Create and share any dataset
Unlimited creativity and collaboration
Build and share any code, notebook, ML model or dashboard