A basic overview of how to evaluate workloads and match them to the database technologies available, from NoSQL to relational. Workloads have different characteristics. If you don't understand them, you can end up implementing the wrong solution for the problem you have.
The video from this presentation is available at https://bloorgroup.webex.com/bloorgroup/lsr.php?AT=pb&SP=EC&rID=4953842&rKey=d03b10ecd9163770
13. Types of workloads
Write-biased:
▪ OLTP
▪ OLTP, batch
▪ OLTP, lite
▪ Object persistence
▪ Data ingest, batch
▪ Data ingest, real-time
Read-biased:
▪ Query
▪ Query, simple retrieval
▪ Query, complex
▪ Query-hierarchical / object / network
▪ Analytic
Mixed?
Inline analytic execution, operational BI
14. Matching to parameters, at assumption of data scale
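[Chart: database technologies rated against workload parameters. The per-cell ratings were graphical and are not preserved in this text.]
Workload parameters (columns):
▪ Write-biased
▪ Read-biased
▪ Updateable data
▪ Eventual consistency ok
▪ Unpredictable query path
▪ Compute intensive
Technologies (rows):
▪ Standard RDBMS
▪ Parallel RDBMS
▪ NoSQL (kv, dht, obj)
▪ Hadoop*
▪ Streaming database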
You see the problem: it’s an intersection of multiple parameters, and
this chart only includes the first tier of parameters. Plus, workload
factors can completely invert these general rules of thumb.
15. Matching to parameters, at assumption of data scale
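[Chart: database technologies rated against workload parameters. The per-cell ratings were graphical and are not preserved in this text.]
Workload parameters (columns):
▪ Complex queries
▪ Selective queries
▪ Low latency queries
▪ High concurrency
▪ High ingest rate
Technologies (rows):
▪ Standard RDBMS
▪ Parallel RDBMS
▪ NoSQL (kv, dht, obj)
▪ Hadoop
▪ Streaming database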
You have to look at the combination of workload factors: data scale,
concurrency, latency & response time, then chart the parameters.
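One way to picture "charting the parameters" is as a small ratings table that is queried with the combination of factors a workload needs. The following is only a sketch under stated assumptions: the function name `rank_technologies`, the 0 to 2 rating scale, the "weakest required parameter" ranking rule, and the placeholder entries `tech_a` and `tech_b` are all hypothetical and do not come from the slides. The ratings are made-up numbers for illustration, not the values shown on the charts.

```python
def rank_technologies(ratings, required):
    """Rank technologies for a workload described by its required parameters.

    ratings:  dict mapping technology name -> dict of parameter -> score
              (hypothetical scale: 0 = poor fit, 1 = workable, 2 = good fit)
    required: parameters the workload needs, e.g. "high ingest rate"

    Returns a list of (technology, worst_score, total_score), best first.
    """
    required = list(required)
    if not required:
        raise ValueError("at least one workload parameter is required")

    results = []
    for tech, scores in ratings.items():
        missing = [p for p in required if p not in scores]
        if missing:
            raise KeyError(f"{tech} has no rating for: {missing}")
        # Assumed rule: a technology is judged by its weakest required
        # parameter first, so one bad fit cannot be hidden by good ones.
        worst = min(scores[p] for p in required)
        total = sum(scores[p] for p in required)
        results.append((tech, worst, total))

    # Best weakest-link score first, total score as the tie-breaker.
    results.sort(key=lambda r: (r[1], r[2]), reverse=True)
    return results


# Placeholder ratings for two unnamed technologies (illustrative numbers only).
example_ratings = {
    "tech_a": {"low latency queries": 2, "high ingest rate": 0},
    "tech_b": {"low latency queries": 1, "high ingest rate": 1},
}

# Looking at a single parameter favors tech_a ...
single = rank_technologies(example_ratings, ["low latency queries"])
# ... while the combination of parameters favors tech_b.
combined = rank_technologies(
    example_ratings, ["low latency queries", "high ingest rate"]
)
```

With these placeholder numbers the ranking flips once the second parameter is added, which is the point of the slide: a choice made on one parameter can be reversed by the combination.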
18. About the Presenter
Mark Madsen is president of Third
Nature, a technology research and
consulting firm focused on business
intelligence, analytics and
information management. Mark is an
award-winning author, architect and
former CTO whose work has been
featured in numerous industry
publications. During his career Mark
received awards from the American
Productivity & Quality Center, TDWI,
Computerworld and the Smithsonian
Institute. He is an international
speaker, contributing editor at
Intelligent Enterprise, and manages
the open source channel at the
Business Intelligence Network. For
more information or to contact Mark,
visit http://ThirdNature.net.
19. About Third Nature
Third Nature is a research and consulting firm focused on new and
emerging technology and practices in analytics, business intelligence, and
performance management. If your question is related to data, analytics,
information strategy and technology infrastructure then you're at the right
place.
Our goal is to help companies take advantage of information-driven
management practices and applications. We offer education, consulting
and research services to support business and IT organizations as well as
technology vendors.
We fill the gap between what the industry analyst firms cover and what IT
needs. We specialize in product and technology analysis, so we look at
emerging technologies and markets, evaluating technology and how it is
applied rather than vendor market positions.