8. Apache Druid (incubating)
● Started 2018-02-28
● Migration logistics on dev list
● Join our new mailing lists!
● Still figuring out source repos, website, etc.
● Druid 0.12.0 is not an Apache release
● Maybe 0.13.0 will be?
8
14. What is NoSQL?
“There's no strong definition of the concept out there, no
trademarks, no standard group, not even a manifesto.”
14
Source: https://martinfowler.com/bliki/NosqlDefinition.html
15. What is NoSQL?
● Not using the relational model (nor the SQL language)
● Open source
● Designed to run on large clusters
● Based on the needs of 21st century web properties
● No schema, allowing fields to be added to any record without
controls
15
Source: https://martinfowler.com/bliki/NosqlDefinition.html
16. Druid and the Relational Model
Is avoiding the SQL language and
relational model really a good thing?
16
17. Druid and the Relational Model
● Datasources are like tables
○ Druid “lookups” apply to a common join use case
○ Big, flat tables are common in SQL databases anyway,
when analytical performance is critical
● Benefits of offering SQL
○ Developers and analysts know it
○ Integration with 3rd party apps
17
25. Future work
● Druid features not supported in Druid SQL (as of 0.12)
○ Multi-value dimensions
○ Spatial filters
○ Theta sketches (approx. set intersection, differences)
● JOIN related
○ Allow users to write lookups as a SQL JOIN
○ Allow JOINs between two Druid datasources
● Others: SQL window functions, SQL UNION, GROUPING SETS
25