7. We Lose: Joe Hellerstein (Berkeley) 2001
“Databases are commoditised and cornered
to slow-moving, evolving, structure intensive,
applications that require schema evolution.“ …
“The internet companies are lost and we will
remain in the doldrums of the enterprise
space.” …
“As databases are black boxes which require a
lot of coaxing to get maximum performance”
13. New Approach to Data Access
Simple
Pragmatic
Solved an insoluble problem
Unencumbered by tradition (good & bad)
14. With this came a Different Focus
Tradition No SQL
• Global consistency • Local consistency
• Schema driven • Schemaless
• Reliable Network • Unreliable Network
• Highly Structured • Semi-structured/
Unstructured
NoSQL / Big Data technologies really focus on load
and volume problems by avoiding the complexities
associated with traditional transactional storage
15. The ‘Relational Camp’ had
been busy too
Realisation that the traditional
architecture was insufficient for
various modern workloads
16. End of an Era Paper - 2007
“Because RDBMSs can be beaten by more
than an order of magnitude on the standard
OLTP benchmark, then there is no market
where they are competitive. As such, they
should be considered as legacy technology
more than a quarter of a century in age, for
which a complete redesign and re-architecting
is the appropriate next step.” – Michael
Stonebraker
18. There is a new and impressive breed
• Products ~ 5 years old
• Shared nothing (sharded)
• Designed for SSD’s & 10GE
• Large address spaces (256GB+)
• No indexes (column oriented)
• Dropping traditional tenets (referential integrity
etc)
• Surprisingly quick for big queries when
compared with incumbent technologies.
38. Combining structured & unstructured approaches in a
layered fashion makes the process more nimble
Structured Late Bound
Standardisation Schema
Layer
Raw Data
39. We take this kind of approach
• Grid of machines
• Late bound schema
• Sharded, immutable data
• Low latency (real time) and high throughput
(grid) use cases
• All data is observable (an event)
• Interfaces: Standardised (safe) or Raw
(uayor)
40. Both Raw & Standardised data is available
Operational Relational
(real time / MR) Analytics
Object/SQL
Standardisation
Raw Data
41. This helps to loosen the grip of the
single schema, whilst also
providing a more iterative
approach to standardisation
42. Support for both one standardised and many
bespoke models in the same technology
Raw Facts from
different systems
Standardised
Model
43. Next step: to centralise common
processing tasks
Standardised Risk
Model Calculation