Mind the gap: understanding today’s 
data platforms landscape 
Matthew Aslett, research director
451 Research is an information 
technology research & advisory company 
Founded in 2000 
210+ employees, including over 100 analysts 
1,000+ clients: Technology & Service providers, corporate 
advisory, finance, professional services, and IT decision makers 
10,000+ senior IT professionals in our research community 
Over 52 million data points each quarter 
4,500+ reports published each year covering 2,000+ 
innovative technology & service providers 
Headquartered in New York City with offices in London, 
Boston, San Francisco, and Washington D.C. 
451 Research and its sister company Uptime Institute 
comprise the two divisions of The 451 Group 
Research & Data 
Advisory Services 
Events 
2
1 
6 
AWS 
EMR 
Non-­‐rela%onal 
zone 
CitusDB 
IBM 
Big 
SQL 
Rela%onal 
zone 
MySQL 
114 
SQLStream 
So^ware 
AG 
Sqrrl 
Enterprise 
RethinkDB 
Elas/csearch 
Cassandra 
Hypertable 
HBase 
Compose 
CloudBird 
Lotus 
Notes 
Ipedo 
XML 
Database 
Tamino 
XML 
Server 
UniData 
UniVerse 
McObject 
Objec/vity 
MarkLogic 
InterSystems 
Caché 
Starcounter 
ArangoDB 
Aerospike 
Founda/onDB 
Neo4J 
AffinityDB 
Trinity 
Giraph 
InfiniteGraph 
Oracle 
NoSQL 
CouchDB 
Redis 
Handlersocket 
RavenDB 
Google 
App 
Engine 
Datastore 
LevelDB 
Accumulo 
Riak 
Couchbase 
Splice 
Machine 
Ac/an 
Ingres 
SAP 
Sybase 
ASE 
EnterpriseDB 
SQL 
Server 
MySQL 
MariaDB 
SAP 
HANA 
Informix 
IBM 
DB2 
Google 
Cloud 
SQL 
HP 
Cloud 
RDB 
for 
MySQL 
ClearDB 
Database.com 
Rackspace 
Cloud 
Databases 
AWS 
RDS 
Azure 
SQL 
Database 
FathomDB 
StormDB 
HPCC 
Teradata 
Aster 
Cloudera 
MapR 
IBM 
Hortonworks 
BigInsights 
Google 
Compute 
Engine 
ZeWaset 
NGDATA 
Infochimps 
Metascale 
Mortar 
Data 
Rackspace 
Qubole 
Voldemort 
IBM 
PureData 
for 
Analy/cs 
Teradata 
XtremeData 
Metamarkets 
Druid 
Ac/an 
Vector 
Kx 
Systems 
IBM 
InfoSphere 
SAP 
Sybase 
IQ 
HP 
Ver/ca 
Pivotal 
Greenplum 
Exasol 
Ac/an 
Matrix 
ParStream 
TokuDB 
ScaleDB 
ScaleBase 
ScaleArc 
Infobright 
Con/nuent 
vFabric 
Postgres 
TransLa[ce 
NuoDB 
Drizzle 
JustOneDB 
Pivotal 
GemFire 
XD 
Zimory 
Scale 
Galera 
Clustrix 
Tesora 
MemSQL 
GenieDB 
YarcData 
Datomic 
FlockDB 
Allegrograph 
HypergraphDB 
MemCachier 
Redis 
Labs 
Memcached 
Cloud 
FairCom 
BitYota 
IronCache 
Grid/cache 
zone 
Memcached 
Ehcache 
ScaleOut 
So^ware 
IBM 
eXtreme 
Scale 
Oracle 
Coherence 
GridGain 
GigaSpaces 
XAP 
Pivotal 
GemFire 
CloudTran 
InfiniSpan 
Hazelcast 
Oracle 
Exaly/cs 
Oracle 
Database 
MySQL 
Cluster 
Oracle 
Endeca 
Server 
A[vio 
LucidWorks 
Big 
Data 
Lucene/Solr 
IBM 
InfoSphere 
Data 
Explorer 
Towards 
E-­‐discovery 
Towards 
enterprise 
search 
Documentum 
xDB 
ObjectStore 
LucidDB 
MonetDB 
Databricks/Spark 
AWS 
Elas/Cache 
Firebird 
SciDB 
SQLite 
Oracle 
TimesTen 
solidDB 
Adabas 
IBM 
IMS 
WakandaDB 
Al/scale 
Oracle 
Big 
Data 
Appliance 
RainStor 
OrientDB 
Sparksee 
Metamarkets 
Treasure 
Data 
PostgreSQL 
Percona 
HyperDex 
TIBCO 
Ac/veSpaces 
Titan 
SAP 
Sybase 
SQL 
Anywhere 
JethroData 
Pivotal 
HD/HAWQ 
BigMemory 
Ac/an 
Versant 
DataStax 
Enterprise 
DeepDB 
FatDB 
Google 
Cloud 
Datastore 
Heroku 
Postgres 
GrapheneDB 
Cassandra.io 
BerkeleyDB 
Microso^ 
HDInsight 
HP 
Autonomy 
Oracle 
Exadata 
IBM 
PureData 
Impala 
Apache 
Drill 
Presto 
Microso^ 
SQL 
Server 
PDW 
Apache 
Tajo 
Apache 
Hive 
SPARQLBASE 
MammothDB 
Al/base 
HDB 
LogicBlox 
SRCH2 
TIBCO 
LogLogic 
Splunk 
Towards 
SIEM 
Loggly 
Sumo 
Logentries 
Logic 
InfiniSQL 
JumboDB 
Ac/an 
PSQL 
Progress 
OpenEdge 
Kogni/o 
Al/base 
XDB 
Savvis 
So^layer 
Verizon 
xPlenty 
Stardog 
MariaDB 
Enterprise 
Apache 
Storm 
Apache 
S4 
IBM 
InfoSphere 
Streams 
TIBCO 
StreamBase 
DataTorrent 
AWS 
Kinesis 
Feedzai 
Guavus 
Lokad 
Key: 
General 
purpose 
Specialist 
analy/c 
-­‐as-­‐a-­‐Service 
BigTables 
Graph 
Document 
Key 
value 
stores 
Key 
value 
direct 
access 
Hadoop 
MySQL 
ecosystem 
Advanced 
clustering/sharding 
New 
SQL 
databases 
Data 
caching 
Data 
grid 
Search 
Appliances 
In-­‐memory 
Stream 
processing 
OpenStack 
Trove 
1010data 
Google 
BigQuery 
AWS 
Redshi^ 
TempoIQ 
InfluxDB 
WebScaleSQL 
Spider 
Fabric 
2 
A 
B 
C 
D 
E 
T-­‐Systems 
A 
B 
C 
D 
E 
2 
3 
4 
5 
SQream 
SpaceCurve 
Postgres-­‐XL 
Google 
Cloud 
Dataflow 
Trafodion 
Hadapt 
Azure 
Search 
Red 
Hat 
JBoss 
Data 
Grid 
4 
5 
6 
MongoDB 
Cloudant 
Iris 
Couch 
MongoLab 
RedisGreen 
Redis-­‐to-­‐go 
Redis 
Labs 
Redis 
Cloud 
ObjectRocket 
Azure 
DocumentDB 
ObjectRocket 
Redis 
AWS 
Elas/Cache 
with 
Redis 
MagnetoDB 
AWS 
SimpleDB 
1 
3 
Data 
Platforms 
Map 
October 2014 
CockroachDB 
AWS 
DynamoDB 
TokuMX 
VoltDB 
CortexDB 
CodeFutures 
Oracle 
Big 
Data 
Cloud 
3 
Stra/o
Data 
Platforms 
Map
The industry analyst’s insatiable desire to categorize 
5 
Darrenkw 
on 
Flickr: 
hWps://www.flickr.com/photos/darrenkw/3327185668
Segmenting the market 
§ NoSQL, NewSQL and Beyond 
• Assessing the drivers behind the development 
and adoption of NoSQL and NewSQL databases, 
as well as data grid/caching technologies 
• Published April 2011 
6
Rela/onal 
Non-­‐rela/onal 
Analy/c 
Opera/onal 
Infobright 
VectorWise 
Oracle 
IBM 
DB2 
SQL 
Server 
MySQL 
Ingres 
PostgreSQL 
SAP 
Sybase 
ASE 
Hadoop 
Netezza 
JustOne 
EMC 
Greenplum 
Aster 
Data 
ParAccel 
HP 
Ver/ca 
-­‐as-­‐a-­‐Service 
App 
Engine 
Datastore 
SimpleDB 
NewSQL 
Amazon 
RDS 
Xeround 
Calpont 
EnterpriseDB 
GenieDB 
ScalArc 
VoltDB 
InterSystems 
NoSQL 
Document 
Lotus 
Notes 
CouchDB 
MongoDB 
Graph 
Key 
value 
Big 
tables 
Objec/vity 
MarkLogic 
Versant 
Progress 
McObject 
Hypertable 
HBase 
Riak 
Redis 
Membrain 
Voldemort 
BerkeleyDB 
InfiniteGraph 
Neo4J 
GraphDB 
Data 
Grid/Cache 
Clustrix 
SQL 
Azure 
FathomDB 
Schooner 
MySQL 
Tokutek 
Akiban 
CodeFutures 
Con/nuent 
ScaleBase 
Transla[ce 
Database.com 
SAP 
Sybase 
IQ 
IBM 
InfoSphere 
NimbusDB 
HandlerSocket 
Cassandra 
Cloudant 
Memcached 
IBM 
eXtreme 
Scale 
TerracoWa 
GigaSpaces 
Oracle 
Coherence 
GridGain 
ScaleOut 
Vmware 
GemFire 
InfiniSpan 
CloudTran 
Couchbase 
RavenDB 
Drizzle 
Piccolo 
Dryad 
Hadapt 
Mapr 
Brisk 
MySQL 
Cluster 
Teradata 
7 
The 
“Drawn 
to 
Scale” 
problem: 
Company 
developing 
a 
Hadoop-­‐based 
NewSQL 
database.
8
Inspiration 
§ Harry Beck’s London Underground Tube Map 
hWp://briWon.disted.camosun.bc.ca/beck_map.jpg 
9
Inspiration 
Kai 
Chan 
Vong 
on 
Flickr: 
hWps://www.flickr.com/photos/kaichanvong/3723850455 
10 
§ Harry Beck’s London Underground Tube Map
Inspiration 
hWp://www.artrepublic.com/biographies/200-­‐simon-­‐paWerson.html 
11 
§ Simon Patterson’s “The Great Bear”
Inspiration 
12 
§ The Real Story Group’s Content Technology Vendor Map 
hWp://www.realstorygroup.com/vendormap/
Left to my own devices 
13
14
15
16
17
18
19
1 
6 
Non-­‐rela%onal 
zone 
CitusDB 
IBM 
Big 
SQL 
Rela%onal 
zone 
MySQL 
114 
SQLStream 
So^ware 
AG 
Sqrrl 
Enterprise 
RethinkDB 
Elas/csearch 
Cassandra 
Hypertable 
HBase 
Compose 
CloudBird 
Lotus 
Notes 
Ipedo 
XML 
Database 
Tamino 
XML 
Server 
UniData 
UniVerse 
McObject 
Objec/vity 
MarkLogic 
InterSystems 
Caché 
Starcounter 
ArangoDB 
Aerospike 
Founda/onDB 
Neo4J 
AffinityDB 
Trinity 
Giraph 
InfiniteGraph 
Oracle 
NoSQL 
CouchDB 
Redis 
Handlersocket 
RavenDB 
Google 
App 
Engine 
Datastore 
LevelDB 
Accumulo 
Riak 
Couchbase 
Splice 
Machine 
Ac/an 
Ingres 
SAP 
Sybase 
ASE 
EnterpriseDB 
SQL 
Server 
MySQL 
MariaDB 
SAP 
HANA 
Informix 
IBM 
DB2 
Google 
Cloud 
SQL 
HP 
Cloud 
RDB 
for 
MySQL 
ClearDB 
Database.com 
Rackspace 
Cloud 
Databases 
AWS 
RDS 
Azure 
SQL 
Database 
FathomDB 
StormDB 
HPCC 
Teradata 
Aster 
Cloudera 
MapR 
IBM 
Hortonworks 
BigInsights 
ZeWaset 
NGDATA 
Infochimps 
Metascale 
Mortar 
Data 
Rackspace 
Qubole 
Voldemort 
IBM 
PureData 
for 
Analy/cs 
Teradata 
XtremeData 
Metamarkets 
Druid 
Ac/an 
Vector 
Kx 
Systems 
IBM 
InfoSphere 
SAP 
Sybase 
IQ 
HP 
Ver/ca 
Pivotal 
Greenplum 
Exasol 
Ac/an 
Matrix 
ParStream 
TokuDB 
ScaleDB 
ScaleBase 
ScaleArc 
Infobright 
Con/nuent 
vFabric 
Postgres 
TransLa[ce 
NuoDB 
Drizzle 
JustOneDB 
Pivotal 
GemFire 
XD 
Zimory 
Scale 
Galera 
Clustrix 
Tesora 
MemSQL 
GenieDB 
YarcData 
Datomic 
FlockDB 
Allegrograph 
HypergraphDB 
MemCachier 
Redis 
Labs 
Memcached 
Cloud 
FairCom 
BitYota 
IronCache 
Grid/cache 
zone 
Memcached 
Ehcache 
ScaleOut 
So^ware 
IBM 
eXtreme 
Scale 
Oracle 
Coherence 
GridGain 
GigaSpaces 
XAP 
Pivotal 
GemFire 
CloudTran 
InfiniSpan 
Hazelcast 
Oracle 
Exaly/cs 
Oracle 
Database 
MySQL 
Cluster 
Oracle 
Endeca 
Server 
A[vio 
LucidWorks 
Big 
Data 
Lucene/Solr 
IBM 
InfoSphere 
Data 
Explorer 
Towards 
E-­‐discovery 
Towards 
enterprise 
search 
Documentum 
xDB 
ObjectStore 
LucidDB 
MonetDB 
Databricks/Spark 
AWS 
Elas/Cache 
Firebird 
SciDB 
SQLite 
Oracle 
TimesTen 
solidDB 
Adabas 
IBM 
IMS 
WakandaDB 
Al/scale 
Oracle 
Big 
Data 
Appliance 
RainStor 
OrientDB 
Sparksee 
Metamarkets 
Treasure 
Data 
PostgreSQL 
Percona 
HyperDex 
TIBCO 
Ac/veSpaces 
Titan 
SAP 
Sybase 
SQL 
Anywhere 
JethroData 
Pivotal 
HD/HAWQ 
BigMemory 
Ac/an 
Versant 
DataStax 
Enterprise 
DeepDB 
FatDB 
Google 
Cloud 
Datastore 
Heroku 
Postgres 
GrapheneDB 
Cassandra.io 
BerkeleyDB 
Microso^ 
HDInsight 
HP 
Autonomy 
Oracle 
Exadata 
IBM 
PureData 
Impala 
Apache 
Drill 
Presto 
Microso^ 
SQL 
Server 
PDW 
Apache 
Tajo 
Apache 
Hive 
SPARQLBASE 
MammothDB 
Al/base 
HDB 
LogicBlox 
SRCH2 
TIBCO 
LogLogic 
Splunk 
Towards 
SIEM 
Loggly 
Sumo 
Logentries 
Logic 
InfiniSQL 
JumboDB 
Ac/an 
PSQL 
Progress 
OpenEdge 
Kogni/o 
Al/base 
XDB 
Savvis 
So^layer 
Verizon 
xPlenty 
Stardog 
MariaDB 
Enterprise 
Apache 
Storm 
Apache 
S4 
IBM 
InfoSphere 
Streams 
TIBCO 
StreamBase 
DataTorrent 
AWS 
Kinesis 
Feedzai 
Guavus 
Lokad 
Key: 
General 
purpose 
Specialist 
analy/c 
-­‐as-­‐a-­‐Service 
BigTables 
Graph 
Document 
Key 
value 
stores 
Key 
value 
direct 
access 
Hadoop 
MySQL 
ecosystem 
Advanced 
clustering/sharding 
New 
SQL 
databases 
Data 
caching 
Data 
grid 
Search 
Appliances 
In-­‐memory 
Stream 
processing 
OpenStack 
Trove 
1010data 
Google 
BigQuery 
AWS 
Redshi^ 
TempoIQ 
InfluxDB 
WebScaleSQL 
Spider 
Fabric 
2 
A 
B 
C 
D 
E 
T-­‐Systems 
A 
B 
C 
D 
E 
2 
3 
4 
5 
SQream 
SpaceCurve 
Postgres-­‐XL 
Google 
Cloud 
Dataflow 
Trafodion 
Hadapt 
Azure 
Search 
Red 
Hat 
JBoss 
Data 
Grid 
4 
5 
6 
MongoDB 
Cloudant 
Iris 
Couch 
MongoLab 
RedisGreen 
Redis-­‐to-­‐go 
Redis 
Labs 
Redis 
Cloud 
ObjectRocket 
Azure 
DocumentDB 
ObjectRocket 
Redis 
AWS 
Elas/Cache 
with 
Redis 
MagnetoDB 
AWS 
SimpleDB 
1 
3 
Data 
Platforms 
Map 
October 2014 
CockroachDB 
AWS 
DynamoDB 
TokuMX 
VoltDB 
CortexDB 
CodeFutures 
Oracle 
Big 
Data 
Cloud 
20 
AWS 
EMR 
Google 
Compute 
Engine 
Stra/o
Data 
Platforms 
Landscape
1 
6 
Non-­‐rela%onal 
zone 
Rela%onal 
zone 
114 
SQLStream 
Coral8 
Lotus 
Notes 
Ipedo 
XML 
Database 
Tamino 
XML 
Server 
UniData 
UniVerse 
McObject 
Objec/vity 
MarkLogic 
InterSystems 
Caché 
SQLite 
Ingres 
Sybase 
ASE 
EnterpriseDB 
SQL 
Server 
MySQL 
Informix 
Teradata 
VectorWise 
Kx 
Systems 
ParAccel 
IBM 
InfoSphere 
Calpont 
Sybase 
IQ 
Ver/ca 
Greenplum 
Grid/cache 
zone 
ScaleOut 
So^ware 
IBM 
eXtreme 
Scale 
Tangosol 
Coherence 
GigaSpaces 
GemStone 
Memcached 
TerracoWa 
Oracle 
Database 
Endeca 
A[vio 
Lucid 
Imagina/on 
Lucene/Solr 
Vivisimo 
Towards 
E-­‐discovery 
Towards 
enterprise 
search 
Documentum 
xDB 
Progress 
ObjectStore 
MonetDB 
Firebird 
Oracle 
TimesTen 
IBM 
solidDB 
Adabas 
IBM 
IMS 
PostgreSQL 
TIBCO 
Ac/veSpaces 
Sybase 
SQL 
Anywhere 
Versant 
BerkeleyDB 
Autonomy 
IBM 
DB2 
Aster 
Data 
LogLogic 
Splunk 
Towards 
SIEM 
Pervasive 
PSQL 
Progress 
OpenEdge 
Kogni/o 
Progress 
Apama 
StreamBase 
TIBCO 
Aleri 
BEA 
1010data 
2 
A 
B 
C 
D 
E 
A 
B 
C 
D 
E 
2 
3 
4 
5 
Infobright 
1 
3 
4 
5 
6 
Data 
Platforms 
Map 
Circa 2008/9 
Netezza 
Key: 
General 
purpose 
Specialist 
analy/c 
Data 
caching 
Search 
In-­‐memory 
Stream 
processing 
22
1 
6 
Non-­‐rela%onal 
zone 
Rela%onal 
zone 
114 
Ipedo 
XML 
Database 
Tamino 
XML 
Server 
UniData 
UniVerse 
McObject 
InterSystems 
Caché 
Ac/an 
Ingres 
SAP 
Sybase 
ASE 
EnterpriseDB 
SQL 
Server 
SAP 
HANA 
Informix 
IBM 
DB2 
HPCC 
Teradata 
Aster 
IBM 
PureData 
for 
Analy/cs 
XtremeData 
Kx 
Systems 
IBM 
InfoSphere 
SAP 
Sybase 
IQ 
HP 
Ver/ca 
Pivotal 
Greenplum 
Exasol 
Ac/an 
Matrix 
ParStream 
Grid/cache 
zone 
Memcached 
Ehcache 
vFabric 
Postgres 
ScaleOut 
So^ware 
IBM 
eXtreme 
Scale 
Oracle 
Coherence 
GridGain 
GigaSpaces 
XAP 
Pivotal 
GemFire 
CloudTran 
InfiniSpan 
Hazelcast 
Oracle 
Database 
Oracle 
Endeca 
Server 
Elas/csearch 
A[vio 
So^ware 
AG 
LucidWorks 
Big 
Data 
Lucene/Solr 
IBM 
InfoSphere 
Data 
Explorer 
Towards 
E-­‐discovery 
Towards 
enterprise 
search 
Documentum 
xDB 
ObjectStore 
LucidDB 
MonetDB 
Firebird 
SciDB 
SQLite 
Oracle 
TimesTen 
solidDB 
Adabas 
IBM 
IMS 
WakandaDB 
RainStor 
PostgreSQL 
TIBCO 
Ac/veSpaces 
SAP 
Sybase 
SQL 
Anywhere 
BigMemory 
Ac/an 
Versant 
HP 
Autonomy 
Oracle 
Exadata 
IBM 
PureData 
Microso^ 
SQL 
Server 
PDW 
LogicBlox 
SRCH2 
TIBCO 
LogLogic 
Splunk 
Towards 
SIEM 
Loggly 
Sumo 
Logentries 
Logic 
Ac/an 
PSQL 
Progress 
OpenEdge 
IBM 
InfoSphere 
Streams 
TIBCO 
StreamBase 
Feedzai 
Guavus 
Lokad 
Key: 
General 
purpose 
Specialist 
analy/c 
-­‐as-­‐a-­‐Service 
BigTables 
Graph 
Document 
Key 
value 
stores 
Key 
value 
direct 
access 
Hadoop 
MySQL 
ecosystem 
Advanced 
clustering/sharding 
New 
SQL 
databases 
Data 
caching 
Data 
grid 
Search 
Appliances 
In-­‐memory 
Stream 
processing 
2 
A 
B 
C 
D 
E 
A 
B 
C 
D 
E 
2 
3 
4 
5 
SQream 
Postgres-­‐XL 
Red 
Hat 
JBoss 
Data 
Grid 
1 
3 
4 
5 
6 
Data 
Platforms 
Map 
Expansion 
VectorWise 
Kogni/o 
1010data 
SQLStream 
Infobright 
MySQL 
Lotus 
Notes 
Objec/vity 
BerkeleyDB 
MarkLogic 
Oracle 
Teradata 
Exaly/cs 
23
1 
6 
Non-­‐rela%onal 
zone 
Rela%onal 
zone 
114 
So^ware 
AG 
Sqrrl 
Enterprise 
RethinkDB 
Elas/csearch 
Cassandra 
HBase 
Lotus 
Notes 
Ipedo 
XML 
Database 
Tamino 
XML 
Server 
UniData 
UniVerse 
McObject 
Objec/vity 
MarkLogic 
InterSystems 
Caché 
ArangoDB 
Aerospike 
Founda/onDB 
Neo4J 
AffinityDB 
Trinity 
Giraph 
InfiniteGraph 
Oracle 
NoSQL 
CouchDB 
Redis 
RavenDB 
LevelDB 
Accumulo 
Riak 
Couchbase 
Ac/an 
Ingres 
SAP 
Sybase 
ASE 
EnterpriseDB 
SQL 
Server 
SAP 
HANA 
Informix 
IBM 
DB2 
HPCC 
Teradata 
Aster 
Voldemort 
IBM 
PureData 
for 
Analy/cs 
XtremeData 
Kx 
Systems 
IBM 
InfoSphere 
SAP 
Sybase 
IQ 
HP 
Ver/ca 
Pivotal 
Greenplum 
Exasol 
Ac/an 
Matrix 
YarcData 
ParStream 
FlockDB 
Allegrograph 
HypergraphDB 
Grid/cache 
zone 
Memcached 
Ehcache 
vFabric 
Postgres 
ScaleOut 
So^ware 
IBM 
eXtreme 
Scale 
Oracle 
Coherence 
GridGain 
GigaSpaces 
XAP 
Pivotal 
GemFire 
CloudTran 
InfiniSpan 
Hazelcast 
Oracle 
Database 
Oracle 
Endeca 
Server 
A[vio 
LucidWorks 
Big 
Data 
Lucene/Solr 
IBM 
InfoSphere 
Data 
Explorer 
Towards 
E-­‐discovery 
Towards 
enterprise 
search 
Documentum 
xDB 
ObjectStore 
LucidDB 
MonetDB 
Firebird 
SciDB 
SQLite 
Oracle 
TimesTen 
solidDB 
Adabas 
IBM 
IMS 
WakandaDB 
RainStor 
OrientDB 
Sparksee 
PostgreSQL 
HyperDex 
TIBCO 
Ac/veSpaces 
Titan 
SAP 
Sybase 
SQL 
Anywhere 
BigMemory 
Ac/an 
Versant 
DataStax 
Enterprise 
FatDB 
Hypertable 
BerkeleyDB 
HP 
Autonomy 
Oracle 
Exadata 
IBM 
PureData 
Microso^ 
SQL 
Server 
PDW 
SPARQLBASE 
LogicBlox 
SRCH2 
TIBCO 
LogLogic 
Splunk 
Towards 
SIEM 
Loggly 
Sumo 
Logentries 
Logic 
JumboDB 
Ac/an 
PSQL 
Progress 
OpenEdge 
Stardog 
IBM 
InfoSphere 
Streams 
TIBCO 
StreamBase 
Feedzai 
Guavus 
Lokad 
Key: 
General 
purpose 
Specialist 
analy/c 
-­‐as-­‐a-­‐Service 
BigTables 
Graph 
Document 
Key 
value 
stores 
Key 
value 
direct 
access 
Hadoop 
MySQL 
ecosystem 
Advanced 
clustering/sharding 
New 
SQL 
databases 
Data 
caching 
Data 
grid 
Search 
Appliances 
In-­‐memory 
Stream 
processing 
2 
A 
B 
C 
D 
E 
A 
B 
C 
D 
E 
2 
3 
4 
5 
SQream 
Postgres-­‐XL 
Red 
Hat 
JBoss 
Data 
Grid 
4 
5 
6 
MongoDB 
1 
3 
Data 
Platforms 
Map 
NoSQL 
TokuMX 
VectorWise 
Kogni/o 
1010data 
SQLStream 
Infobright 
MySQL 
CortexDB 
Oracle 
Teradata 
Exaly/cs 
24
1 
6 
Non-­‐rela%onal 
zone 
Rela%onal 
zone 
MySQL 
114 
So^ware 
AG 
Sqrrl 
Enterprise 
RethinkDB 
Elas/csearch 
Cassandra 
HBase 
Lotus 
Notes 
Ipedo 
XML 
Database 
Tamino 
XML 
Server 
UniData 
UniVerse 
McObject 
Objec/vity 
MarkLogic 
InterSystems 
Caché 
Starcounter 
ArangoDB 
Aerospike 
Founda/onDB 
Neo4J 
AffinityDB 
Trinity 
Giraph 
InfiniteGraph 
Oracle 
NoSQL 
CouchDB 
Redis 
Handlersocket 
RavenDB 
LevelDB 
Accumulo 
Riak 
Couchbase 
Ac/an 
Ingres 
SAP 
Sybase 
ASE 
EnterpriseDB 
SQL 
Server 
MySQL 
vFabric 
Postgres 
MariaDB 
SAP 
HANA 
Informix 
HPCC 
Teradata 
Aster 
Voldemort 
IBM 
PureData 
for 
Analy/cs 
XtremeData 
Kx 
Systems 
IBM 
InfoSphere 
SAP 
Sybase 
IQ 
HP 
Ver/ca 
Pivotal 
Greenplum 
Exasol 
Ac/an 
Matrix 
ParStream 
TokuDB 
ScaleDB 
ScaleBase 
ScaleArc 
Infobright 
CodeFutures 
Con/nuent 
TransLa[ce 
NuoDB 
Drizzle 
JustOneDB 
Pivotal 
GemFire 
XD 
Zimory 
Scale 
Galera 
Clustrix 
Tesora 
GenieDB 
YarcData 
Datomic 
FlockDB 
Allegrograph 
HypergraphDB 
Grid/cache 
zone 
Memcached 
Ehcache 
ScaleOut 
So^ware 
IBM 
eXtreme 
Scale 
Oracle 
Coherence 
GridGain 
GigaSpaces 
XAP 
Pivotal 
GemFire 
CloudTran 
InfiniSpan 
Hazelcast 
MySQL 
Cluster 
Oracle 
Endeca 
Server 
A[vio 
LucidWorks 
Big 
Data 
Lucene/Solr 
IBM 
InfoSphere 
Data 
Explorer 
Towards 
E-­‐discovery 
Towards 
enterprise 
search 
Documentum 
xDB 
ObjectStore 
LucidDB 
MonetDB 
Firebird 
SciDB 
SQLite 
Oracle 
TimesTen 
solidDB 
Adabas 
IBM 
IMS 
WakandaDB 
RainStor 
OrientDB 
Sparksee 
PostgreSQL 
Percona 
HyperDex 
TIBCO 
Ac/veSpaces 
Titan 
SAP 
Sybase 
SQL 
Anywhere 
BigMemory 
Ac/an 
Versant 
DataStax 
Enterprise 
DeepDB 
FatDB 
Hypertable 
BerkeleyDB 
HP 
Autonomy 
IBM 
PureData 
Microso^ 
SQL 
Server 
PDW 
SPARQLBASE 
Al/base 
HDB 
LogicBlox 
SRCH2 
TIBCO 
LogLogic 
Splunk 
Towards 
SIEM 
Loggly 
Sumo 
Logentries 
Logic 
InfiniSQL 
JumboDB 
Ac/an 
PSQL 
Progress 
OpenEdge 
Al/base 
XDB 
Stardog 
MariaDB 
Enterprise 
IBM 
InfoSphere 
Streams 
TIBCO 
StreamBase 
Feedzai 
Guavus 
Lokad 
Key: 
General 
purpose 
Specialist 
analy/c 
-­‐as-­‐a-­‐Service 
BigTables 
Graph 
Document 
Key 
value 
stores 
Key 
value 
direct 
access 
Hadoop 
MySQL 
ecosystem 
Advanced 
clustering/sharding 
New 
SQL 
databases 
Data 
caching 
Data 
grid 
Search 
Appliances 
In-­‐memory 
Stream 
processing 
WebScaleSQL 
Spider 
Fabric 
2 
A 
B 
C 
D 
E 
A 
B 
C 
D 
E 
2 
3 
4 
5 
SQream 
Postgres-­‐XL 
Red 
Hat 
JBoss 
Data 
Grid 
4 
5 
6 
MongoDB 
1 
3 
Data 
Platforms 
Map 
NewSQL 
TokuMX 
VectorWise 
Kogni/o 
1010data 
SQLStream 
IBM 
DB2 
Oracle 
Database 
VoltDB 
FairCom 
MemSQL 
CockroachDB 
Oracle 
Exadata 
CortexDB 
Oracle 
Teradata 
Exaly/cs 
25
1 
6 
Non-­‐rela%onal 
zone 
CitusDB 
IBM 
Big 
SQL 
Rela%onal 
zone 
MySQL 
114 
SQLStream 
So^ware 
AG 
Sqrrl 
Enterprise 
RethinkDB 
Elas/csearch 
Cassandra 
HBase 
Lotus 
Notes 
Ipedo 
XML 
Database 
Tamino 
XML 
Server 
UniData 
UniVerse 
McObject 
Objec/vity 
MarkLogic 
InterSystems 
Caché 
Starcounter 
ArangoDB 
Aerospike 
Founda/onDB 
Neo4J 
AffinityDB 
Trinity 
Giraph 
InfiniteGraph 
Oracle 
NoSQL 
CouchDB 
Redis 
Handlersocket 
RavenDB 
LevelDB 
Accumulo 
Riak 
Couchbase 
Splice 
Machine 
Ac/an 
Ingres 
SAP 
Sybase 
ASE 
EnterpriseDB 
SQL 
Server 
MySQL 
vFabric 
Postgres 
MariaDB 
SAP 
HANA 
Informix 
HPCC 
Teradata 
Aster 
Cloudera 
MapR 
IBM 
Hortonworks 
BigInsights 
NGDATA 
Voldemort 
IBM 
PureData 
for 
Analy/cs 
XtremeData 
Kx 
Systems 
IBM 
InfoSphere 
SAP 
Sybase 
IQ 
HP 
Ver/ca 
Pivotal 
Greenplum 
Exasol 
Ac/an 
Matrix 
ParStream 
TokuDB 
ScaleDB 
ScaleBase 
ScaleArc 
Infobright 
CodeFutures 
Con/nuent 
TransLa[ce 
NuoDB 
Drizzle 
JustOneDB 
Pivotal 
GemFire 
XD 
Zimory 
Scale 
Galera 
Clustrix 
Tesora 
GenieDB 
YarcData 
Datomic 
FlockDB 
Allegrograph 
HypergraphDB 
Grid/cache 
zone 
Memcached 
Ehcache 
ScaleOut 
So^ware 
IBM 
eXtreme 
Scale 
Oracle 
Coherence 
GridGain 
GigaSpaces 
XAP 
Pivotal 
GemFire 
CloudTran 
InfiniSpan 
Hazelcast 
MySQL 
Cluster 
Oracle 
Endeca 
Server 
A[vio 
LucidWorks 
Big 
Data 
Lucene/Solr 
IBM 
InfoSphere 
Data 
Explorer 
Towards 
E-­‐discovery 
Towards 
enterprise 
search 
Documentum 
xDB 
ObjectStore 
LucidDB 
MonetDB 
Firebird 
SciDB 
SQLite 
Oracle 
TimesTen 
solidDB 
Adabas 
IBM 
IMS 
WakandaDB 
Oracle 
Big 
Data 
Appliance 
RainStor 
OrientDB 
Sparksee 
PostgreSQL 
Percona 
HyperDex 
TIBCO 
Ac/veSpaces 
Titan 
SAP 
Sybase 
SQL 
Anywhere 
JethroData 
Pivotal 
HD/HAWQ 
BigMemory 
Ac/an 
Versant 
DataStax 
Enterprise 
DeepDB 
FatDB 
Hypertable 
BerkeleyDB 
HP 
Autonomy 
IBM 
PureData 
Impala 
Apache 
Drill 
Presto 
Microso^ 
SQL 
Server 
PDW 
Apache 
Tajo 
Apache 
Hive 
SPARQLBASE 
MammothDB 
Al/base 
HDB 
LogicBlox 
SRCH2 
TIBCO 
LogLogic 
Splunk 
Towards 
SIEM 
Loggly 
Sumo 
Logentries 
Logic 
InfiniSQL 
JumboDB 
Ac/an 
PSQL 
Progress 
OpenEdge 
Al/base 
XDB 
Stardog 
MariaDB 
Enterprise 
Apache 
Storm 
Apache 
S4 
IBM 
InfoSphere 
Streams 
TIBCO 
StreamBase 
DataTorrent 
Feedzai 
Guavus 
Lokad 
Key: 
General 
purpose 
Specialist 
analy/c 
-­‐as-­‐a-­‐Service 
BigTables 
Graph 
Document 
Key 
value 
stores 
Key 
value 
direct 
access 
Hadoop 
MySQL 
ecosystem 
Advanced 
clustering/sharding 
New 
SQL 
databases 
Data 
caching 
Data 
grid 
Search 
Appliances 
In-­‐memory 
Stream 
processing 
WebScaleSQL 
Spider 
Fabric 
2 
A 
B 
C 
D 
E 
A 
B 
C 
D 
E 
2 
3 
4 
5 
SQream 
Postgres-­‐XL 
Trafodion 
Hadapt 
Red 
Hat 
JBoss 
Data 
Grid 
4 
5 
6 
MongoDB 
1 
3 
Data 
Platforms 
Map 
Hadoop 
TokuMX 
VectorWise 
Kogni/o 
1010data 
ZeWaset 
IBM 
DB2 
Oracle 
Database 
FairCom 
MemSQL 
CockroachDB 
Oracle 
Exadata 
VoltDB 
CortexDB 
Oracle 
Teradata 
Exaly/cs 
26
1 
6 
Non-­‐rela%onal 
zone 
CitusDB 
Rela%onal 
zone 
MySQL 
114 
SQLStream 
So^ware 
AG 
Sqrrl 
Enterprise 
RethinkDB 
Elas/csearch 
Cassandra 
Hypertable 
HBase 
Compose 
CloudBird 
Lotus 
Notes 
Ipedo 
XML 
Database 
Tamino 
XML 
Server 
UniData 
UniVerse 
McObject 
Objec/vity 
MarkLogic 
InterSystems 
Caché 
Starcounter 
ArangoDB 
Aerospike 
Founda/onDB 
Neo4J 
AffinityDB 
Trinity 
Giraph 
InfiniteGraph 
Oracle 
NoSQL 
CouchDB 
Redis 
Handlersocket 
RavenDB 
Google 
App 
Engine 
Datastore 
LevelDB 
Accumulo 
Riak 
Couchbase 
Splice 
Machine 
Ac/an 
Ingres 
SAP 
Sybase 
ASE 
SQL 
Server 
MySQL 
MariaDB 
SAP 
HANA 
Informix 
Google 
Cloud 
SQL 
HP 
Cloud 
RDB 
for 
MySQL 
ClearDB 
Database.com 
Rackspace 
Cloud 
Databases 
AWS 
RDS 
Azure 
SQL 
Database 
FathomDB 
StormDB 
HPCC 
Teradata 
Aster 
Cloudera 
MapR 
IBM 
Hortonworks 
BigInsights 
ZeWaset 
NGDATA 
Infochimps 
Metascale 
Mortar 
Data 
Rackspace 
Qubole 
Voldemort 
IBM 
PureData 
for 
Analy/cs 
XtremeData 
Metamarkets 
Druid 
Ac/an 
Vector 
Kx 
Systems 
IBM 
InfoSphere 
SAP 
Sybase 
IQ 
HP 
Ver/ca 
Pivotal 
Greenplum 
Exasol 
Ac/an 
Matrix 
ParStream 
TokuDB 
Oracle 
Exadata 
ScaleDB 
ScaleBase 
ScaleArc 
Infobright 
Con/nuent 
VoltDB 
TransLa[ce 
NuoDB 
Drizzle 
JustOneDB 
Pivotal 
GemFire 
XD 
Zimory 
Scale 
Galera 
Clustrix 
Tesora 
GenieDB 
YarcData 
Datomic 
FlockDB 
Allegrograph 
HypergraphDB 
MemCachier 
Redis 
Labs 
Memcached 
Cloud 
BitYota 
IronCache 
Grid/cache 
zone 
Memcached 
Ehcache 
ScaleOut 
So^ware 
IBM 
eXtreme 
Scale 
Oracle 
Coherence 
GridGain 
GigaSpaces 
XAP 
Pivotal 
GemFire 
CloudTran 
InfiniSpan 
Hazelcast 
MySQL 
Cluster 
Oracle 
Endeca 
Server 
A[vio 
LucidWorks 
Big 
Data 
Lucene/Solr 
IBM 
InfoSphere 
Data 
Explorer 
Towards 
E-­‐discovery 
Towards 
enterprise 
search 
Documentum 
xDB 
ObjectStore 
LucidDB 
MonetDB 
Databricks/Spark 
AWS 
Elas/Cache 
Firebird 
SciDB 
SQLite 
Oracle 
TimesTen 
solidDB 
Adabas 
IBM 
IMS 
WakandaDB 
Al/scale 
Oracle 
Big 
Data 
Appliance 
RainStor 
OrientDB 
Sparksee 
Metamarkets 
Treasure 
Data 
PostgreSQL 
Percona 
HyperDex 
TIBCO 
Ac/veSpaces 
Titan 
SAP 
Sybase 
SQL 
Anywhere 
JethroData 
Pivotal 
HD/HAWQ 
BigMemory 
Ac/an 
Versant 
DataStax 
Enterprise 
DeepDB 
FatDB 
Google 
Cloud 
Datastore 
Heroku 
Postgres 
GrapheneDB 
Cassandra.io 
BerkeleyDB 
Microso^ 
HDInsight 
HP 
Autonomy 
IBM 
Big 
SQL 
Impala 
Apache 
Drill 
Presto 
Microso^ 
SQL 
Server 
PDW 
Apache 
Tajo 
Apache 
Hive 
SPARQLBASE 
MammothDB 
Al/base 
HDB 
LogicBlox 
SRCH2 
TIBCO 
LogLogic 
Splunk 
Towards 
SIEM 
Loggly 
Sumo 
Logentries 
Logic 
InfiniSQL 
JumboDB 
Ac/an 
PSQL 
Progress 
OpenEdge 
Kogni/o 
Al/base 
XDB 
Savvis 
So^layer 
Verizon 
xPlenty 
Stardog 
MariaDB 
Enterprise 
Apache 
Storm 
Apache 
S4 
IBM 
InfoSphere 
Streams 
TIBCO 
StreamBase 
DataTorrent 
AWS 
Kinesis 
Feedzai 
Guavus 
Lokad 
Key: 
General 
purpose 
Specialist 
analy/c 
-­‐as-­‐a-­‐Service 
BigTables 
Graph 
Document 
Key 
value 
stores 
Key 
value 
direct 
access 
Hadoop 
MySQL 
ecosystem 
Advanced 
clustering/sharding 
New 
SQL 
databases 
Data 
caching 
Data 
grid 
Search 
Appliances 
In-­‐memory 
Stream 
processing 
OpenStack 
Trove 
1010data 
Google 
BigQuery 
AWS 
Redshi^ 
TempoIQ 
InfluxDB 
WebScaleSQL 
Spider 
Fabric 
2 
A 
B 
C 
D 
E 
T-­‐Systems 
A 
B 
C 
D 
E 
2 
3 
4 
5 
SQream 
SpaceCurve 
Postgres-­‐XL 
Google 
Cloud 
Dataflow 
Trafodion 
Hadapt 
Azure 
Search 
Red 
Hat 
JBoss 
Data 
Grid 
4 
5 
6 
MongoDB 
Cloudant 
Iris 
Couch 
MongoLab 
RedisGreen 
Redis-­‐to-­‐go 
Redis 
Labs 
Redis 
Cloud 
ObjectRocket 
Azure 
DocumentDB 
ObjectRocket 
Redis 
AWS 
Elas/Cache 
with 
Redis 
MagnetoDB 
AWS 
SimpleDB 
1 
3 
Data 
Platforms 
Map 
DBaaS 
AWS 
DynamoDB 
TokuMX 
IBM 
DB2 
Oracle 
Database 
IBM 
PureData 
FairCom 
MemSQL 
CockroachDB 
CortexDB 
EnterpriseDB 
vFabric 
Postgres 
CodeFutures 
Oracle 
Big 
Data 
Cloud 
Oracle 
Teradata 
Exaly/cs 
27 
AWS 
EMR 
Google 
Compute 
Engine 
Stra/o
The big bang: drivers for change 
NoSQL 
NewSQL 
Hadoop 
DBaaS 
28 
Cloud 
Elas/c 
Distributed 
Architecture 
Virtual 
Scalable 
Flexible 
JSON 
Agile 
Schema-­‐on-­‐read 
Developers 
REST 
Schemaless 
Flexible 
Global 
Web 
Mobile 
Interac/ve 
Applica%ons 
Always-­‐on 
Social 
Local
Drivers for change influence each other 
Cloud 
Elas/c 
Distributed 
Schema-­‐on-­‐read 
§ Distributed 
architecture 
encourages 
new 
development 
approaches 
§ New 
applica/ons 
require 
distributed 
architecture 
JSON 
Global 
Web 
Mobile 
Interac/ve 
§ New 
development 
approaches 
demand 
new 
architecture 
§ Distributed 
architecture 
enables 
new 
applica/ons 
§ New 
app 
requirements 
demand 
new 
development 
approaches 
§ New 
dev 
approaches 
enable 
new 
lightweight 
apps 
Architecture 
Virtual 
Scalable 
Flexible 
Agile 
Developers 
REST 
Schemaless 
Flexible 
Applica%ons 
Always-­‐on 
Social 
Local 
29
Drivers for change: applications 
Social, 
mobile, 
global, 
local 
all 
have 
implica/ons 
for 
data 
connec/vity 
§ Social 
– 
increased 
interac/vity 
generates 
data 
§ Mobile 
– 
different 
form 
factors 
and 
access 
methods 
§ Global 
– 
applica/ons 
need 
to 
be 
immediately 
available 
everywhere 
§ Local 
– 
need 
to 
deliver 
localized 
content 
Global 
Web 
Mobile 
Interac/ve 
Applica%ons 
Always-­‐on 
Social 
Local 
30
Drivers for change: developers 
Developers 
increasingly 
drive 
data 
management 
and 
database 
selec/on 
§ Need 
to 
unite 
applica/on 
development 
and 
database 
management 
people/processes 
to 
achieve 
common 
goals 
§ DevOps 
movement 
growing 
apace 
§ Rapid 
development 
and 
con/nuous 
delivery 
is 
inconsistent 
with 
tradi/onal 
database 
management 
processes 
JSON 
Agile 
Schema-­‐on-­‐read 
Developers 
REST 
Schemaless 
Flexible 
31
Drivers for change: DevOps 
JSON 
Agile 
Schema-­‐on-­‐read 
Findings: 
• 51% 
cite 
business 
and 
strategy 
drivers: 
compe//ve, 
business 
produc/vity, 
revenue 
• 49% 
cite 
technology 
func/onality, 
new 
feature 
sets, 
reduced 
development 
costs 
• As 
DevOps 
matures 
as 
a 
discipline, 
aligning 
business 
and 
IT 
goals 
is 
ever 
more 
important. 
451 
Research 
survey 
of 
200 
qualified 
North 
American 
DevOps 
prac//oners 
or 
influencers 
Developers 
REST 
Schemaless 
Flexible 
32
Drivers for change: architecture 
Transi/oning 
from 
a 
tradi/onal 
database 
to 
a 
distributed 
database 
§ Interac/ve 
applica/ons 
means 
the 
pace 
of 
user 
growth 
and 
mul/plicity 
of 
data 
types 
is 
too 
great 
for 
tradi/onal 
rela/onal 
databases 
to 
efficiently 
absorb. 
§ Scalability 
§ Performance 
§ Relaxed 
consistency 
§ Agility 
§ Intricacy 
§ Necessity 
Cloud 
Elas/c 
Distributed 
Architecture 
Virtual 
Scalable 
Flexible 
33
Drivers for change: architecture 
Transi/oning 
from 
a 
tradi/onal 
database 
to 
a 
distributed 
database 
Transi/oning 
from 
on-­‐premises 
compu/ng 
to 
the 
cloud 
Cloud 
Elas/c 
Distributed 
Architecture 
Virtual 
Scalable 
Flexible 
§ Amazon’s top enterprise use cases are 
(in order of popularity starting with the most popular): 
• Development and test 
• New workloads 
Transi/oning 
from 
on-­‐premises 
• Supplement compuexisting /ng 
to 
the 
workloads cloud 
with cloud 
• Migration of existing workloads to the cloud 
• Datacenter migration 
• All-in cloud 
• Top three adoption drivers for public cloud are additive to the existing 
database landscape 
34
Drivers for change: shift to private cloud 
For 
each 
of 
the 
major 
digital 
infrastructure 
deployment 
methods, 
how 
are 
your 
IT 
services 
(% 
of 
applica/ons/workloads 
delivered) 
distributed? 
Internal, 
On-­‐premises, 
Non-­‐cloud 
Digital 
Infrastructure 
(Physical 
or 
Internal, 
On-­‐premises, 
Private 
Cloud-­‐based 
Off-­‐premises 
IT 
Outsourcing, 
Dedicated/Managed 
Transi/oning 
from 
on-­‐premises 
compu/ng 
to 
the 
cloud 
Cloud 
Cloud 
Elas/c 
Elas/c 
Distributed 
Distributed 
Virtual 
Scalable 
Flexible 
Transi/oning 
from 
on-­‐premises 
compu/ng 
to 
the 
cloud 
Architecture 
Virtual 
Scalable 
Flexible 
5% 
1% 
1% 
12% 
9% 
6% 
25% 
13% 
30% 
41% 
1% 
10% 
19% 
62% 
Hybrid 
Cloud 
Off-­‐premises 
Public 
Cloud 
Services 
(SaaS) 
Off-­‐premises 
Public 
Cloud 
Services 
(Other 
Than 
SaaS) 
Other 
Deployment 
Method 
2013 
2015 
The 
majority 
of 
workloads 
today 
are 
on 
internal, 
on-­‐premises, 
non-­‐cloud 
and 
are 
likely 
to 
remain 
on-­‐premises 
albeit 
with 
a 
move 
to 
private 
cloud 
35 
Source: 
Cloud 
Compu<ng 
– 
Wave 
6 
hAps://451research.com/report-­‐long?icid=3021
Drivers for change: public cloud not significant 
For 
each 
of 
the 
major 
digital 
infrastructure 
deployment 
methods, 
how 
are 
your 
IT 
services 
(% 
of 
applica/ons/workloads 
delivered) 
distributed? 
Internal, 
On-­‐premises, 
Non-­‐cloud 
Digital 
Infrastructure 
(Physical 
or 
Internal, 
On-­‐premises, 
Private 
Cloud-­‐based 
Off-­‐premises 
IT 
Outsourcing, 
Dedicated/Managed 
Transi/oning 
from 
on-­‐premises 
Pubic 
cloud 
compuand 
/ng 
SaaS 
to 
the 
are 
cloud 
growing, 
but 
not 
making 
a 
significant 
impact 
Cloud 
Cloud 
Elas/c 
Elas/c 
Distributed 
Distributed 
Virtual 
Scalable 
Flexible 
Transi/oning 
from 
on-­‐premises 
compu/ng 
to 
the 
cloud 
Architecture 
Virtual 
Scalable 
Flexible 
36 
5% 
1% 
1% 
1% 
12% 
13% 
10% 
9% 
6% 
30% 
25% 
41% 
19% 
62% 
Hybrid 
Cloud 
Off-­‐premises 
Public 
Cloud 
Services 
(SaaS) 
Off-­‐premises 
Public 
Cloud 
Services 
(Other 
Than 
SaaS) 
Other 
Deployment 
Method 
2013 
2015 
Source: 
Cloud 
Compu<ng 
– 
Wave 
6 
hAps://451research.com/report-­‐long?icid=3021
For 
each 
of 
the 
major 
digital 
infrastructure 
deployment 
methods, 
how 
are 
your 
IT 
services 
(% 
of 
applica/ons/workloads 
delivered) 
distributed? 
5% 
1% 
1% 
12% 
9% 
6% 
25% 
13% 
30% 
41% 
1% 
10% 
19% 
62% 
Internal, 
On-­‐premises, 
Non-­‐cloud 
Digital 
Infrastructure 
(Physical 
or 
Internal, 
On-­‐premises, 
Private 
Cloud-­‐based 
Off-­‐premises 
IT 
Outsourcing, 
Dedicated/Managed 
Hybrid 
Cloud 
Off-­‐premises 
Public 
Cloud 
Services 
(SaaS) 
Off-­‐premises 
Public 
Cloud 
Services 
(Other 
Than 
SaaS) 
Other 
Deployment 
Method 
2013 
2015 
Drivers for change: hybrid cloud is the future 
Cloud 
Cloud 
Elas/c 
Elas/c 
Distributed 
Distributed 
Virtual 
Scalable 
Flexible 
Transi/oning 
from 
on-­‐premises 
compu/ng 
to 
the 
cloud 
Architecture 
Virtual 
Scalable 
Flexible 
The 
future 
of 
the 
cloud 
is 
hybrid 
37 
Source: 
Cloud 
Compu<ng 
– 
Wave 
6 
hAps://451research.com/report-­‐long?icid=3021
Drivers for change: the route to hybrid is private 
For 
each 
of 
the 
following 
categories 
of 
workload/business 
func/ons, 
what 
is 
your 
primary 
deployment 
method 
likely 
to 
be 
in 
the 
next 
two 
years? 
Cloud 
Elas/c 
Distributed 
Architecture 
Virtual 
Scalable 
Flexible 
Test 
and 
Development 
of 
Applica/ons 
E-­‐business 
Hos/ng 
Customer-­‐facing 
Enterprise 
Applica/ons 
Transi/oning 
from 
on-­‐premises 
compu/ng 
to 
the 
cloud 
Cloud 
Distributed 
Virtual 
Scalable 
Flexible 
Transi/oning 
from 
on-­‐premises 
compu/ng 
to 
the 
cloud 
53% 
68% 
11% 
35% 
53% 
18% 
42% 
18% 
14% 
23% 
17% 
26% 
43% 
27% 
31% 
29% 
18% 
66% 
46% 
21% 
39% 
Collabora/ve 
Applica/ons 
Cloud-­‐na/ve 
Applica/ons 
Batch 
Compu/ng 
Applica/ons 
Back-­‐office 
Enterprise 
Applica/ons 
Internal 
Private 
Cloud 
Hybrid 
Cloud 
External 
Public 
Cloud 
and 
SaaS 
38 
Source: 
Cloud 
Compu<ng 
– 
Wave 
6 
hAps://451research.com/report-­‐long?icid=3021
JSON 
Agile 
ApplicaSchema-­‐on-­‐/read 
ons 
Developers 
Architecture 
REST 
Schemaless 
Flexible 
Drivers for change 
Cloud 
Elas/c 
Distributed 
Architecture 
Virtual 
Scalable 
Flexible 
Global 
Web 
Mobile 
Interac/ve 
NoSQL 
NewSQL 
Hadoop 
DBaaS 
Applica%ons 
Always-­‐on 
Social 
Local 
39
New databases: differences 
40 
Non-­‐rela/onal 
data 
models. 
Trade-­‐off 
consistency 
for 
availability 
NoSQL 
Adds 
availability 
and 
flexibility 
to 
the 
familiar 
rela/onal 
data 
model 
NewSQL 
Any 
of 
the 
above, 
or 
tradi/onal 
RDBMS, 
delivered 
as 
a 
service 
DBaaS 
Batch 
(and 
now 
interac/ve) 
analy/c 
processing 
Hadoop 
of 
unstructured 
data
New databases: use-cases 
Approach 
Details 
Examples 
NoSQL 
MongoDB, 
Couchbase, 
Cassandra, 
Redis, 
Aerospike, 
Cloudant 
Non-­‐transac/onal 
opera/onal 
applica/ons, 
unstructured 
data, 
lightweight 
query 
NewSQL 
NuoDB, 
MemSQL, 
Transla[ce, 
VoltDB, 
Splice 
Machine 
Transac/onal 
opera/onal 
apps, 
structured 
data, 
complex 
query, 
opera/onal 
intelligence 
Hadoop 
Cloudera, 
MapR, 
Hortonworks, 
Pivotal, 
IBM, 
Teradata 
Non-­‐transac/onal 
analy/c 
applica/ons, 
mul/-­‐structured 
data, 
complex 
query 
DBaaS 
ObjectRocket, 
AWS 
DynamoDB, 
AWS 
RDS, 
Al/scale, 
Qubole 
Any 
of 
the 
above, 
or 
tradi/onal 
RDBMS, 
delivered 
as 
a 
service 
41
The big bang 
§ The database market expanded rapidly driven partly by polyglot persistence 
• Specialist databases for specialist use-cases 
42 
Wide-­‐ 
column 
stores 
Key 
Value 
stores 
Graph 
databases 
Document 
stores 
NewSQL 
Hadoop 
DBaaS
The big crunch? 
§ Multi-model databases are a growing trend to support multiple use-cases and 
ease operational complexity – is the database market contracting? 
43 
Wide-­‐ 
column 
stores 
Key 
Value 
stores 
Graph 
databases 
Document 
stores 
Mul/-­‐model 
databases 
Support 
a 
combina/on 
of 
the 
various 
individual 
NoSQL 
data 
models. 
NewSQL 
Hadoop 
DBaaS
§ Increasingly that also means SQL as NoSQL vendors add SQL support and 
SQL vendors add support for other models 
NewSQL 
The big crunch? 
44 
Wide-­‐ 
column 
stores 
Key 
Value 
stores 
Graph 
databases 
Document 
stores 
Mul/-­‐model 
databases 
Support 
a 
combina/on 
of 
various 
individual 
data 
models. 
Hadoop 
DBaaS
§ While aaS is really just a deployment option 
NewSQL 
The big crunch? 
45 
Wide-­‐ 
column 
stores 
Key 
Value 
stores 
Graph 
databases 
Document 
stores 
Mul/-­‐model 
databases 
Support 
a 
combina/on 
of 
various 
individual 
data 
models. 
XaaS 
Hadoop
§ And HDFS could potential become a common substrate for multiple data 
processing models 
NewSQL 
The big crunch? 
46 
Wide-­‐ 
column 
stores 
Key 
Value 
stores 
Graph 
databases 
Document 
stores 
Mul/-­‐model 
databases 
Support 
a 
combina/on 
of 
various 
individual 
data 
models. 
Hadoop 
Distributed 
File 
System 
XaaS
Building blocks of the next-generation data platform? 
§ Distributed, elastic, flexible, new projects 
NewSQL 
47 
Wide-­‐ 
column 
stores 
Key 
Value 
stores 
Graph 
databases 
Document 
stores 
Mul/-­‐model 
databases 
Support 
a 
combina/on 
of 
various 
individual 
data 
models. 
Hadoop 
Distributed 
File 
System 
XaaS
The journey to a next-generation data platform 
? 
Managed 
strategic 
data 
plauorm 
Strategic 
consolida/on 
Tac/cal 
consolida/on 
Status 
quo 
48 
Mul/-­‐tenant, 
mul/-­‐model, 
mul/-­‐data 
center, 
hybrid, 
agile, 
elas/c, 
distributed, 
as-­‐a-­‐service, 
automated 
Strategic 
NewSQL, 
NoSQL, 
Hadoop, 
DBaaS 
produc/on 
deployments, 
strategic 
vendor 
rela/onships 
Managed 
polyglot 
data 
plauorms 
Strategic 
expansion 
Tac/cal 
NewSQL, 
NoSQL, 
Hadoop, 
DBaaS 
PoCs 
and 
departmental 
deployments, 
shadow 
IT 
Managed 
polyglot 
datab 
ases 
Centralized, 
scale-­‐up, 
SQL 
rela/onal 
databases 
Unmanaged 
polyglot 
databases 
Tac/cal 
expansion 
SQL-­‐on-­‐Hadoop, 
federated 
query, 
mul/-­‐model 
NoSQL 
and 
NewSQL, 
managed 
DBaaS, 
hybrid 
cloud
1 
6 
AWS 
EMR 
Non-­‐rela%onal 
zone 
CitusDB 
IBM 
Big 
SQL 
Rela%onal 
zone 
MySQL 
114 
SQLStream 
So^ware 
AG 
Sqrrl 
Enterprise 
RethinkDB 
Elas/csearch 
Cassandra 
Hypertable 
HBase 
Compose 
CloudBird 
Lotus 
Notes 
Ipedo 
XML 
Database 
Tamino 
XML 
Server 
UniData 
UniVerse 
McObject 
Objec/vity 
MarkLogic 
InterSystems 
Caché 
Starcounter 
ArangoDB 
Aerospike 
Founda/onDB 
Neo4J 
AffinityDB 
Trinity 
Giraph 
InfiniteGraph 
Oracle 
NoSQL 
CouchDB 
Redis 
Handlersocket 
RavenDB 
Google 
App 
Engine 
Datastore 
LevelDB 
Accumulo 
Riak 
Couchbase 
Splice 
Machine 
Ac/an 
Ingres 
SAP 
Sybase 
ASE 
EnterpriseDB 
SQL 
Server 
MySQL 
MariaDB 
SAP 
HANA 
Informix 
IBM 
DB2 
Google 
Cloud 
SQL 
HP 
Cloud 
RDB 
for 
MySQL 
ClearDB 
Database.com 
Rackspace 
Cloud 
Databases 
AWS 
RDS 
Azure 
SQL 
Database 
FathomDB 
StormDB 
HPCC 
Teradata 
Aster 
Cloudera 
MapR 
IBM 
Hortonworks 
BigInsights 
Google 
Compute 
Engine 
ZeWaset 
NGDATA 
Infochimps 
Metascale 
Mortar 
Data 
Rackspace 
Qubole 
Voldemort 
IBM 
PureData 
for 
Analy/cs 
Teradata 
XtremeData 
Metamarkets 
Druid 
Ac/an 
Vector 
Kx 
Systems 
IBM 
InfoSphere 
SAP 
Sybase 
IQ 
HP 
Ver/ca 
Pivotal 
Greenplum 
Exasol 
Ac/an 
Matrix 
ParStream 
TokuDB 
ScaleDB 
ScaleBase 
ScaleArc 
Infobright 
Con/nuent 
vFabric 
Postgres 
TransLa[ce 
NuoDB 
Drizzle 
JustOneDB 
Pivotal 
GemFire 
XD 
Zimory 
Scale 
Galera 
Clustrix 
Tesora 
MemSQL 
GenieDB 
YarcData 
Datomic 
FlockDB 
Allegrograph 
HypergraphDB 
MemCachier 
Redis 
Labs 
Memcached 
Cloud 
FairCom 
BitYota 
IronCache 
Grid/cache 
zone 
Memcached 
Ehcache 
ScaleOut 
So^ware 
IBM 
eXtreme 
Scale 
Oracle 
Coherence 
GridGain 
GigaSpaces 
XAP 
Pivotal 
GemFire 
CloudTran 
InfiniSpan 
Hazelcast 
Oracle 
Exaly/cs 
Oracle 
Database 
MySQL 
Cluster 
Oracle 
Endeca 
Server 
A[vio 
LucidWorks 
Big 
Data 
Lucene/Solr 
IBM 
InfoSphere 
Data 
Explorer 
Towards 
E-­‐discovery 
Towards 
enterprise 
search 
Documentum 
xDB 
ObjectStore 
LucidDB 
MonetDB 
Databricks/Spark 
AWS 
Elas/Cache 
Firebird 
SciDB 
SQLite 
Oracle 
TimesTen 
solidDB 
Adabas 
IBM 
IMS 
WakandaDB 
Al/scale 
Oracle 
Big 
Data 
Appliance 
RainStor 
OrientDB 
Sparksee 
Metamarkets 
Treasure 
Data 
PostgreSQL 
Percona 
HyperDex 
TIBCO 
Ac/veSpaces 
Titan 
SAP 
Sybase 
SQL 
Anywhere 
JethroData 
Pivotal 
HD/HAWQ 
BigMemory 
Ac/an 
Versant 
DataStax 
Enterprise 
DeepDB 
FatDB 
Google 
Cloud 
Datastore 
Heroku 
Postgres 
GrapheneDB 
Cassandra.io 
BerkeleyDB 
Microso^ 
HDInsight 
HP 
Autonomy 
Oracle 
Exadata 
IBM 
PureData 
Impala 
Apache 
Drill 
Presto 
Microso^ 
SQL 
Server 
PDW 
Apache 
Tajo 
Apache 
Hive 
SPARQLBASE 
MammothDB 
Al/base 
HDB 
LogicBlox 
SRCH2 
TIBCO 
LogLogic 
Splunk 
Towards 
SIEM 
Loggly 
Sumo 
Logentries 
Logic 
InfiniSQL 
JumboDB 
Ac/an 
PSQL 
Progress 
OpenEdge 
Kogni/o 
Al/base 
XDB 
Savvis 
So^layer 
Verizon 
xPlenty 
Stardog 
MariaDB 
Enterprise 
Apache 
Storm 
Apache 
S4 
IBM 
InfoSphere 
Streams 
TIBCO 
StreamBase 
DataTorrent 
AWS 
Kinesis 
Feedzai 
Guavus 
Lokad 
Key: 
General 
purpose 
Specialist 
analy/c 
-­‐as-­‐a-­‐Service 
BigTables 
Graph 
Document 
Key 
value 
stores 
Key 
value 
direct 
access 
Hadoop 
MySQL 
ecosystem 
Advanced 
clustering/sharding 
New 
SQL 
databases 
Data 
caching 
Data 
grid 
Search 
Appliances 
In-­‐memory 
Stream 
processing 
OpenStack 
Trove 
1010data 
Google 
BigQuery 
AWS 
Redshi^ 
TempoIQ 
InfluxDB 
WebScaleSQL 
Spider 
Fabric 
2 
A 
B 
C 
D 
E 
T-­‐Systems 
A 
B 
C 
D 
E 
2 
3 
4 
5 
SQream 
SpaceCurve 
Postgres-­‐XL 
Google 
Cloud 
Dataflow 
Trafodion 
Hadapt 
Azure 
Search 
Red 
Hat 
JBoss 
Data 
Grid 
4 
5 
6 
MongoDB 
Cloudant 
Iris 
Couch 
MongoLab 
RedisGreen 
Redis-­‐to-­‐go 
Redis 
Labs 
Redis 
Cloud 
ObjectRocket 
Azure 
DocumentDB 
ObjectRocket 
Redis 
AWS 
Elas/Cache 
with 
Redis 
MagnetoDB 
AWS 
SimpleDB 
1 
3 
Data 
Platforms 
Map 
October 2014 
CockroachDB 
AWS 
DynamoDB 
TokuMX 
VoltDB 
CortexDB 
CodeFutures 
Oracle 
Big 
Data 
Cloud 
49 
Stra/o
Thank You! 
matthew.aslett@451research.com 
@maslett 
www.451research.com

Sharing Experiences in Cloud Adoption: Burlington, MA

  • 1.
    Mind the gap:understanding today’s data platforms landscape Matthew Aslett, research director
  • 2.
    451 Research isan information technology research & advisory company Founded in 2000 210+ employees, including over 100 analysts 1,000+ clients: Technology & Service providers, corporate advisory, finance, professional services, and IT decision makers 10,000+ senior IT professionals in our research community Over 52 million data points each quarter 4,500+ reports published each year covering 2,000+ innovative technology & service providers Headquartered in New York City with offices in London, Boston, San Francisco, and Washington D.C. 451 Research and its sister company Uptime Institute comprise the two divisions of The 451 Group Research & Data Advisory Services Events 2
  • 3.
    1 6 AWS EMR Non-­‐rela%onal zone CitusDB IBM Big SQL Rela%onal zone MySQL 114 SQLStream So^ware AG Sqrrl Enterprise RethinkDB Elas/csearch Cassandra Hypertable HBase Compose CloudBird Lotus Notes Ipedo XML Database Tamino XML Server UniData UniVerse McObject Objec/vity MarkLogic InterSystems Caché Starcounter ArangoDB Aerospike Founda/onDB Neo4J AffinityDB Trinity Giraph InfiniteGraph Oracle NoSQL CouchDB Redis Handlersocket RavenDB Google App Engine Datastore LevelDB Accumulo Riak Couchbase Splice Machine Ac/an Ingres SAP Sybase ASE EnterpriseDB SQL Server MySQL MariaDB SAP HANA Informix IBM DB2 Google Cloud SQL HP Cloud RDB for MySQL ClearDB Database.com Rackspace Cloud Databases AWS RDS Azure SQL Database FathomDB StormDB HPCC Teradata Aster Cloudera MapR IBM Hortonworks BigInsights Google Compute Engine ZeWaset NGDATA Infochimps Metascale Mortar Data Rackspace Qubole Voldemort IBM PureData for Analy/cs Teradata XtremeData Metamarkets Druid Ac/an Vector Kx Systems IBM InfoSphere SAP Sybase IQ HP Ver/ca Pivotal Greenplum Exasol Ac/an Matrix ParStream TokuDB ScaleDB ScaleBase ScaleArc Infobright Con/nuent vFabric Postgres TransLa[ce NuoDB Drizzle JustOneDB Pivotal GemFire XD Zimory Scale Galera Clustrix Tesora MemSQL GenieDB YarcData Datomic FlockDB Allegrograph HypergraphDB MemCachier Redis Labs Memcached Cloud FairCom BitYota IronCache Grid/cache zone Memcached Ehcache ScaleOut So^ware IBM eXtreme Scale Oracle Coherence GridGain GigaSpaces XAP Pivotal GemFire CloudTran InfiniSpan Hazelcast Oracle Exaly/cs Oracle Database MySQL Cluster Oracle Endeca Server A[vio LucidWorks Big Data Lucene/Solr IBM InfoSphere Data Explorer Towards E-­‐discovery Towards enterprise search Documentum xDB ObjectStore LucidDB MonetDB Databricks/Spark AWS Elas/Cache Firebird SciDB SQLite Oracle TimesTen solidDB Adabas IBM IMS WakandaDB Al/scale Oracle Big Data Appliance RainStor OrientDB Sparksee Metamarkets Treasure Data PostgreSQL Percona HyperDex TIBCO Ac/veSpaces Titan SAP Sybase SQL Anywhere JethroData Pivotal HD/HAWQ BigMemory Ac/an Versant DataStax Enterprise DeepDB FatDB Google Cloud Datastore Heroku Postgres GrapheneDB Cassandra.io BerkeleyDB Microso^ HDInsight HP Autonomy Oracle Exadata IBM PureData Impala Apache Drill Presto Microso^ SQL Server PDW Apache Tajo Apache Hive SPARQLBASE MammothDB Al/base HDB LogicBlox SRCH2 TIBCO LogLogic Splunk Towards SIEM Loggly Sumo Logentries Logic InfiniSQL JumboDB Ac/an PSQL Progress OpenEdge Kogni/o Al/base XDB Savvis So^layer Verizon xPlenty Stardog MariaDB Enterprise Apache Storm Apache S4 IBM InfoSphere Streams TIBCO StreamBase DataTorrent AWS Kinesis Feedzai Guavus Lokad Key: General purpose Specialist analy/c -­‐as-­‐a-­‐Service BigTables Graph Document Key value stores Key value direct access Hadoop MySQL ecosystem Advanced clustering/sharding New SQL databases Data caching Data grid Search Appliances In-­‐memory Stream processing OpenStack Trove 1010data Google BigQuery AWS Redshi^ TempoIQ InfluxDB WebScaleSQL Spider Fabric 2 A B C D E T-­‐Systems A B C D E 2 3 4 5 SQream SpaceCurve Postgres-­‐XL Google Cloud Dataflow Trafodion Hadapt Azure Search Red Hat JBoss Data Grid 4 5 6 MongoDB Cloudant Iris Couch MongoLab RedisGreen Redis-­‐to-­‐go Redis Labs Redis Cloud ObjectRocket Azure DocumentDB ObjectRocket Redis AWS Elas/Cache with Redis MagnetoDB AWS SimpleDB 1 3 Data Platforms Map October 2014 CockroachDB AWS DynamoDB TokuMX VoltDB CortexDB CodeFutures Oracle Big Data Cloud 3 Stra/o
  • 4.
  • 5.
    The industry analyst’sinsatiable desire to categorize 5 Darrenkw on Flickr: hWps://www.flickr.com/photos/darrenkw/3327185668
  • 6.
    Segmenting the market § NoSQL, NewSQL and Beyond • Assessing the drivers behind the development and adoption of NoSQL and NewSQL databases, as well as data grid/caching technologies • Published April 2011 6
  • 7.
    Rela/onal Non-­‐rela/onal Analy/c Opera/onal Infobright VectorWise Oracle IBM DB2 SQL Server MySQL Ingres PostgreSQL SAP Sybase ASE Hadoop Netezza JustOne EMC Greenplum Aster Data ParAccel HP Ver/ca -­‐as-­‐a-­‐Service App Engine Datastore SimpleDB NewSQL Amazon RDS Xeround Calpont EnterpriseDB GenieDB ScalArc VoltDB InterSystems NoSQL Document Lotus Notes CouchDB MongoDB Graph Key value Big tables Objec/vity MarkLogic Versant Progress McObject Hypertable HBase Riak Redis Membrain Voldemort BerkeleyDB InfiniteGraph Neo4J GraphDB Data Grid/Cache Clustrix SQL Azure FathomDB Schooner MySQL Tokutek Akiban CodeFutures Con/nuent ScaleBase Transla[ce Database.com SAP Sybase IQ IBM InfoSphere NimbusDB HandlerSocket Cassandra Cloudant Memcached IBM eXtreme Scale TerracoWa GigaSpaces Oracle Coherence GridGain ScaleOut Vmware GemFire InfiniSpan CloudTran Couchbase RavenDB Drizzle Piccolo Dryad Hadapt Mapr Brisk MySQL Cluster Teradata 7 The “Drawn to Scale” problem: Company developing a Hadoop-­‐based NewSQL database.
  • 8.
  • 9.
    Inspiration § HarryBeck’s London Underground Tube Map hWp://briWon.disted.camosun.bc.ca/beck_map.jpg 9
  • 10.
    Inspiration Kai Chan Vong on Flickr: hWps://www.flickr.com/photos/kaichanvong/3723850455 10 § Harry Beck’s London Underground Tube Map
  • 11.
  • 12.
    Inspiration 12 §The Real Story Group’s Content Technology Vendor Map hWp://www.realstorygroup.com/vendormap/
  • 13.
    Left to myown devices 13
  • 14.
  • 15.
  • 16.
  • 17.
  • 18.
  • 19.
  • 20.
    1 6 Non-­‐rela%onal zone CitusDB IBM Big SQL Rela%onal zone MySQL 114 SQLStream So^ware AG Sqrrl Enterprise RethinkDB Elas/csearch Cassandra Hypertable HBase Compose CloudBird Lotus Notes Ipedo XML Database Tamino XML Server UniData UniVerse McObject Objec/vity MarkLogic InterSystems Caché Starcounter ArangoDB Aerospike Founda/onDB Neo4J AffinityDB Trinity Giraph InfiniteGraph Oracle NoSQL CouchDB Redis Handlersocket RavenDB Google App Engine Datastore LevelDB Accumulo Riak Couchbase Splice Machine Ac/an Ingres SAP Sybase ASE EnterpriseDB SQL Server MySQL MariaDB SAP HANA Informix IBM DB2 Google Cloud SQL HP Cloud RDB for MySQL ClearDB Database.com Rackspace Cloud Databases AWS RDS Azure SQL Database FathomDB StormDB HPCC Teradata Aster Cloudera MapR IBM Hortonworks BigInsights ZeWaset NGDATA Infochimps Metascale Mortar Data Rackspace Qubole Voldemort IBM PureData for Analy/cs Teradata XtremeData Metamarkets Druid Ac/an Vector Kx Systems IBM InfoSphere SAP Sybase IQ HP Ver/ca Pivotal Greenplum Exasol Ac/an Matrix ParStream TokuDB ScaleDB ScaleBase ScaleArc Infobright Con/nuent vFabric Postgres TransLa[ce NuoDB Drizzle JustOneDB Pivotal GemFire XD Zimory Scale Galera Clustrix Tesora MemSQL GenieDB YarcData Datomic FlockDB Allegrograph HypergraphDB MemCachier Redis Labs Memcached Cloud FairCom BitYota IronCache Grid/cache zone Memcached Ehcache ScaleOut So^ware IBM eXtreme Scale Oracle Coherence GridGain GigaSpaces XAP Pivotal GemFire CloudTran InfiniSpan Hazelcast Oracle Exaly/cs Oracle Database MySQL Cluster Oracle Endeca Server A[vio LucidWorks Big Data Lucene/Solr IBM InfoSphere Data Explorer Towards E-­‐discovery Towards enterprise search Documentum xDB ObjectStore LucidDB MonetDB Databricks/Spark AWS Elas/Cache Firebird SciDB SQLite Oracle TimesTen solidDB Adabas IBM IMS WakandaDB Al/scale Oracle Big Data Appliance RainStor OrientDB Sparksee Metamarkets Treasure Data PostgreSQL Percona HyperDex TIBCO Ac/veSpaces Titan SAP Sybase SQL Anywhere JethroData Pivotal HD/HAWQ BigMemory Ac/an Versant DataStax Enterprise DeepDB FatDB Google Cloud Datastore Heroku Postgres GrapheneDB Cassandra.io BerkeleyDB Microso^ HDInsight HP Autonomy Oracle Exadata IBM PureData Impala Apache Drill Presto Microso^ SQL Server PDW Apache Tajo Apache Hive SPARQLBASE MammothDB Al/base HDB LogicBlox SRCH2 TIBCO LogLogic Splunk Towards SIEM Loggly Sumo Logentries Logic InfiniSQL JumboDB Ac/an PSQL Progress OpenEdge Kogni/o Al/base XDB Savvis So^layer Verizon xPlenty Stardog MariaDB Enterprise Apache Storm Apache S4 IBM InfoSphere Streams TIBCO StreamBase DataTorrent AWS Kinesis Feedzai Guavus Lokad Key: General purpose Specialist analy/c -­‐as-­‐a-­‐Service BigTables Graph Document Key value stores Key value direct access Hadoop MySQL ecosystem Advanced clustering/sharding New SQL databases Data caching Data grid Search Appliances In-­‐memory Stream processing OpenStack Trove 1010data Google BigQuery AWS Redshi^ TempoIQ InfluxDB WebScaleSQL Spider Fabric 2 A B C D E T-­‐Systems A B C D E 2 3 4 5 SQream SpaceCurve Postgres-­‐XL Google Cloud Dataflow Trafodion Hadapt Azure Search Red Hat JBoss Data Grid 4 5 6 MongoDB Cloudant Iris Couch MongoLab RedisGreen Redis-­‐to-­‐go Redis Labs Redis Cloud ObjectRocket Azure DocumentDB ObjectRocket Redis AWS Elas/Cache with Redis MagnetoDB AWS SimpleDB 1 3 Data Platforms Map October 2014 CockroachDB AWS DynamoDB TokuMX VoltDB CortexDB CodeFutures Oracle Big Data Cloud 20 AWS EMR Google Compute Engine Stra/o
  • 21.
  • 22.
    1 6 Non-­‐rela%onal zone Rela%onal zone 114 SQLStream Coral8 Lotus Notes Ipedo XML Database Tamino XML Server UniData UniVerse McObject Objec/vity MarkLogic InterSystems Caché SQLite Ingres Sybase ASE EnterpriseDB SQL Server MySQL Informix Teradata VectorWise Kx Systems ParAccel IBM InfoSphere Calpont Sybase IQ Ver/ca Greenplum Grid/cache zone ScaleOut So^ware IBM eXtreme Scale Tangosol Coherence GigaSpaces GemStone Memcached TerracoWa Oracle Database Endeca A[vio Lucid Imagina/on Lucene/Solr Vivisimo Towards E-­‐discovery Towards enterprise search Documentum xDB Progress ObjectStore MonetDB Firebird Oracle TimesTen IBM solidDB Adabas IBM IMS PostgreSQL TIBCO Ac/veSpaces Sybase SQL Anywhere Versant BerkeleyDB Autonomy IBM DB2 Aster Data LogLogic Splunk Towards SIEM Pervasive PSQL Progress OpenEdge Kogni/o Progress Apama StreamBase TIBCO Aleri BEA 1010data 2 A B C D E A B C D E 2 3 4 5 Infobright 1 3 4 5 6 Data Platforms Map Circa 2008/9 Netezza Key: General purpose Specialist analy/c Data caching Search In-­‐memory Stream processing 22
  • 23.
    1 6 Non-­‐rela%onal zone Rela%onal zone 114 Ipedo XML Database Tamino XML Server UniData UniVerse McObject InterSystems Caché Ac/an Ingres SAP Sybase ASE EnterpriseDB SQL Server SAP HANA Informix IBM DB2 HPCC Teradata Aster IBM PureData for Analy/cs XtremeData Kx Systems IBM InfoSphere SAP Sybase IQ HP Ver/ca Pivotal Greenplum Exasol Ac/an Matrix ParStream Grid/cache zone Memcached Ehcache vFabric Postgres ScaleOut So^ware IBM eXtreme Scale Oracle Coherence GridGain GigaSpaces XAP Pivotal GemFire CloudTran InfiniSpan Hazelcast Oracle Database Oracle Endeca Server Elas/csearch A[vio So^ware AG LucidWorks Big Data Lucene/Solr IBM InfoSphere Data Explorer Towards E-­‐discovery Towards enterprise search Documentum xDB ObjectStore LucidDB MonetDB Firebird SciDB SQLite Oracle TimesTen solidDB Adabas IBM IMS WakandaDB RainStor PostgreSQL TIBCO Ac/veSpaces SAP Sybase SQL Anywhere BigMemory Ac/an Versant HP Autonomy Oracle Exadata IBM PureData Microso^ SQL Server PDW LogicBlox SRCH2 TIBCO LogLogic Splunk Towards SIEM Loggly Sumo Logentries Logic Ac/an PSQL Progress OpenEdge IBM InfoSphere Streams TIBCO StreamBase Feedzai Guavus Lokad Key: General purpose Specialist analy/c -­‐as-­‐a-­‐Service BigTables Graph Document Key value stores Key value direct access Hadoop MySQL ecosystem Advanced clustering/sharding New SQL databases Data caching Data grid Search Appliances In-­‐memory Stream processing 2 A B C D E A B C D E 2 3 4 5 SQream Postgres-­‐XL Red Hat JBoss Data Grid 1 3 4 5 6 Data Platforms Map Expansion VectorWise Kogni/o 1010data SQLStream Infobright MySQL Lotus Notes Objec/vity BerkeleyDB MarkLogic Oracle Teradata Exaly/cs 23
  • 24.
    1 6 Non-­‐rela%onal zone Rela%onal zone 114 So^ware AG Sqrrl Enterprise RethinkDB Elas/csearch Cassandra HBase Lotus Notes Ipedo XML Database Tamino XML Server UniData UniVerse McObject Objec/vity MarkLogic InterSystems Caché ArangoDB Aerospike Founda/onDB Neo4J AffinityDB Trinity Giraph InfiniteGraph Oracle NoSQL CouchDB Redis RavenDB LevelDB Accumulo Riak Couchbase Ac/an Ingres SAP Sybase ASE EnterpriseDB SQL Server SAP HANA Informix IBM DB2 HPCC Teradata Aster Voldemort IBM PureData for Analy/cs XtremeData Kx Systems IBM InfoSphere SAP Sybase IQ HP Ver/ca Pivotal Greenplum Exasol Ac/an Matrix YarcData ParStream FlockDB Allegrograph HypergraphDB Grid/cache zone Memcached Ehcache vFabric Postgres ScaleOut So^ware IBM eXtreme Scale Oracle Coherence GridGain GigaSpaces XAP Pivotal GemFire CloudTran InfiniSpan Hazelcast Oracle Database Oracle Endeca Server A[vio LucidWorks Big Data Lucene/Solr IBM InfoSphere Data Explorer Towards E-­‐discovery Towards enterprise search Documentum xDB ObjectStore LucidDB MonetDB Firebird SciDB SQLite Oracle TimesTen solidDB Adabas IBM IMS WakandaDB RainStor OrientDB Sparksee PostgreSQL HyperDex TIBCO Ac/veSpaces Titan SAP Sybase SQL Anywhere BigMemory Ac/an Versant DataStax Enterprise FatDB Hypertable BerkeleyDB HP Autonomy Oracle Exadata IBM PureData Microso^ SQL Server PDW SPARQLBASE LogicBlox SRCH2 TIBCO LogLogic Splunk Towards SIEM Loggly Sumo Logentries Logic JumboDB Ac/an PSQL Progress OpenEdge Stardog IBM InfoSphere Streams TIBCO StreamBase Feedzai Guavus Lokad Key: General purpose Specialist analy/c -­‐as-­‐a-­‐Service BigTables Graph Document Key value stores Key value direct access Hadoop MySQL ecosystem Advanced clustering/sharding New SQL databases Data caching Data grid Search Appliances In-­‐memory Stream processing 2 A B C D E A B C D E 2 3 4 5 SQream Postgres-­‐XL Red Hat JBoss Data Grid 4 5 6 MongoDB 1 3 Data Platforms Map NoSQL TokuMX VectorWise Kogni/o 1010data SQLStream Infobright MySQL CortexDB Oracle Teradata Exaly/cs 24
  • 25.
    1 6 Non-­‐rela%onal zone Rela%onal zone MySQL 114 So^ware AG Sqrrl Enterprise RethinkDB Elas/csearch Cassandra HBase Lotus Notes Ipedo XML Database Tamino XML Server UniData UniVerse McObject Objec/vity MarkLogic InterSystems Caché Starcounter ArangoDB Aerospike Founda/onDB Neo4J AffinityDB Trinity Giraph InfiniteGraph Oracle NoSQL CouchDB Redis Handlersocket RavenDB LevelDB Accumulo Riak Couchbase Ac/an Ingres SAP Sybase ASE EnterpriseDB SQL Server MySQL vFabric Postgres MariaDB SAP HANA Informix HPCC Teradata Aster Voldemort IBM PureData for Analy/cs XtremeData Kx Systems IBM InfoSphere SAP Sybase IQ HP Ver/ca Pivotal Greenplum Exasol Ac/an Matrix ParStream TokuDB ScaleDB ScaleBase ScaleArc Infobright CodeFutures Con/nuent TransLa[ce NuoDB Drizzle JustOneDB Pivotal GemFire XD Zimory Scale Galera Clustrix Tesora GenieDB YarcData Datomic FlockDB Allegrograph HypergraphDB Grid/cache zone Memcached Ehcache ScaleOut So^ware IBM eXtreme Scale Oracle Coherence GridGain GigaSpaces XAP Pivotal GemFire CloudTran InfiniSpan Hazelcast MySQL Cluster Oracle Endeca Server A[vio LucidWorks Big Data Lucene/Solr IBM InfoSphere Data Explorer Towards E-­‐discovery Towards enterprise search Documentum xDB ObjectStore LucidDB MonetDB Firebird SciDB SQLite Oracle TimesTen solidDB Adabas IBM IMS WakandaDB RainStor OrientDB Sparksee PostgreSQL Percona HyperDex TIBCO Ac/veSpaces Titan SAP Sybase SQL Anywhere BigMemory Ac/an Versant DataStax Enterprise DeepDB FatDB Hypertable BerkeleyDB HP Autonomy IBM PureData Microso^ SQL Server PDW SPARQLBASE Al/base HDB LogicBlox SRCH2 TIBCO LogLogic Splunk Towards SIEM Loggly Sumo Logentries Logic InfiniSQL JumboDB Ac/an PSQL Progress OpenEdge Al/base XDB Stardog MariaDB Enterprise IBM InfoSphere Streams TIBCO StreamBase Feedzai Guavus Lokad Key: General purpose Specialist analy/c -­‐as-­‐a-­‐Service BigTables Graph Document Key value stores Key value direct access Hadoop MySQL ecosystem Advanced clustering/sharding New SQL databases Data caching Data grid Search Appliances In-­‐memory Stream processing WebScaleSQL Spider Fabric 2 A B C D E A B C D E 2 3 4 5 SQream Postgres-­‐XL Red Hat JBoss Data Grid 4 5 6 MongoDB 1 3 Data Platforms Map NewSQL TokuMX VectorWise Kogni/o 1010data SQLStream IBM DB2 Oracle Database VoltDB FairCom MemSQL CockroachDB Oracle Exadata CortexDB Oracle Teradata Exaly/cs 25
  • 26.
    1 6 Non-­‐rela%onal zone CitusDB IBM Big SQL Rela%onal zone MySQL 114 SQLStream So^ware AG Sqrrl Enterprise RethinkDB Elas/csearch Cassandra HBase Lotus Notes Ipedo XML Database Tamino XML Server UniData UniVerse McObject Objec/vity MarkLogic InterSystems Caché Starcounter ArangoDB Aerospike Founda/onDB Neo4J AffinityDB Trinity Giraph InfiniteGraph Oracle NoSQL CouchDB Redis Handlersocket RavenDB LevelDB Accumulo Riak Couchbase Splice Machine Ac/an Ingres SAP Sybase ASE EnterpriseDB SQL Server MySQL vFabric Postgres MariaDB SAP HANA Informix HPCC Teradata Aster Cloudera MapR IBM Hortonworks BigInsights NGDATA Voldemort IBM PureData for Analy/cs XtremeData Kx Systems IBM InfoSphere SAP Sybase IQ HP Ver/ca Pivotal Greenplum Exasol Ac/an Matrix ParStream TokuDB ScaleDB ScaleBase ScaleArc Infobright CodeFutures Con/nuent TransLa[ce NuoDB Drizzle JustOneDB Pivotal GemFire XD Zimory Scale Galera Clustrix Tesora GenieDB YarcData Datomic FlockDB Allegrograph HypergraphDB Grid/cache zone Memcached Ehcache ScaleOut So^ware IBM eXtreme Scale Oracle Coherence GridGain GigaSpaces XAP Pivotal GemFire CloudTran InfiniSpan Hazelcast MySQL Cluster Oracle Endeca Server A[vio LucidWorks Big Data Lucene/Solr IBM InfoSphere Data Explorer Towards E-­‐discovery Towards enterprise search Documentum xDB ObjectStore LucidDB MonetDB Firebird SciDB SQLite Oracle TimesTen solidDB Adabas IBM IMS WakandaDB Oracle Big Data Appliance RainStor OrientDB Sparksee PostgreSQL Percona HyperDex TIBCO Ac/veSpaces Titan SAP Sybase SQL Anywhere JethroData Pivotal HD/HAWQ BigMemory Ac/an Versant DataStax Enterprise DeepDB FatDB Hypertable BerkeleyDB HP Autonomy IBM PureData Impala Apache Drill Presto Microso^ SQL Server PDW Apache Tajo Apache Hive SPARQLBASE MammothDB Al/base HDB LogicBlox SRCH2 TIBCO LogLogic Splunk Towards SIEM Loggly Sumo Logentries Logic InfiniSQL JumboDB Ac/an PSQL Progress OpenEdge Al/base XDB Stardog MariaDB Enterprise Apache Storm Apache S4 IBM InfoSphere Streams TIBCO StreamBase DataTorrent Feedzai Guavus Lokad Key: General purpose Specialist analy/c -­‐as-­‐a-­‐Service BigTables Graph Document Key value stores Key value direct access Hadoop MySQL ecosystem Advanced clustering/sharding New SQL databases Data caching Data grid Search Appliances In-­‐memory Stream processing WebScaleSQL Spider Fabric 2 A B C D E A B C D E 2 3 4 5 SQream Postgres-­‐XL Trafodion Hadapt Red Hat JBoss Data Grid 4 5 6 MongoDB 1 3 Data Platforms Map Hadoop TokuMX VectorWise Kogni/o 1010data ZeWaset IBM DB2 Oracle Database FairCom MemSQL CockroachDB Oracle Exadata VoltDB CortexDB Oracle Teradata Exaly/cs 26
  • 27.
    1 6 Non-­‐rela%onal zone CitusDB Rela%onal zone MySQL 114 SQLStream So^ware AG Sqrrl Enterprise RethinkDB Elas/csearch Cassandra Hypertable HBase Compose CloudBird Lotus Notes Ipedo XML Database Tamino XML Server UniData UniVerse McObject Objec/vity MarkLogic InterSystems Caché Starcounter ArangoDB Aerospike Founda/onDB Neo4J AffinityDB Trinity Giraph InfiniteGraph Oracle NoSQL CouchDB Redis Handlersocket RavenDB Google App Engine Datastore LevelDB Accumulo Riak Couchbase Splice Machine Ac/an Ingres SAP Sybase ASE SQL Server MySQL MariaDB SAP HANA Informix Google Cloud SQL HP Cloud RDB for MySQL ClearDB Database.com Rackspace Cloud Databases AWS RDS Azure SQL Database FathomDB StormDB HPCC Teradata Aster Cloudera MapR IBM Hortonworks BigInsights ZeWaset NGDATA Infochimps Metascale Mortar Data Rackspace Qubole Voldemort IBM PureData for Analy/cs XtremeData Metamarkets Druid Ac/an Vector Kx Systems IBM InfoSphere SAP Sybase IQ HP Ver/ca Pivotal Greenplum Exasol Ac/an Matrix ParStream TokuDB Oracle Exadata ScaleDB ScaleBase ScaleArc Infobright Con/nuent VoltDB TransLa[ce NuoDB Drizzle JustOneDB Pivotal GemFire XD Zimory Scale Galera Clustrix Tesora GenieDB YarcData Datomic FlockDB Allegrograph HypergraphDB MemCachier Redis Labs Memcached Cloud BitYota IronCache Grid/cache zone Memcached Ehcache ScaleOut So^ware IBM eXtreme Scale Oracle Coherence GridGain GigaSpaces XAP Pivotal GemFire CloudTran InfiniSpan Hazelcast MySQL Cluster Oracle Endeca Server A[vio LucidWorks Big Data Lucene/Solr IBM InfoSphere Data Explorer Towards E-­‐discovery Towards enterprise search Documentum xDB ObjectStore LucidDB MonetDB Databricks/Spark AWS Elas/Cache Firebird SciDB SQLite Oracle TimesTen solidDB Adabas IBM IMS WakandaDB Al/scale Oracle Big Data Appliance RainStor OrientDB Sparksee Metamarkets Treasure Data PostgreSQL Percona HyperDex TIBCO Ac/veSpaces Titan SAP Sybase SQL Anywhere JethroData Pivotal HD/HAWQ BigMemory Ac/an Versant DataStax Enterprise DeepDB FatDB Google Cloud Datastore Heroku Postgres GrapheneDB Cassandra.io BerkeleyDB Microso^ HDInsight HP Autonomy IBM Big SQL Impala Apache Drill Presto Microso^ SQL Server PDW Apache Tajo Apache Hive SPARQLBASE MammothDB Al/base HDB LogicBlox SRCH2 TIBCO LogLogic Splunk Towards SIEM Loggly Sumo Logentries Logic InfiniSQL JumboDB Ac/an PSQL Progress OpenEdge Kogni/o Al/base XDB Savvis So^layer Verizon xPlenty Stardog MariaDB Enterprise Apache Storm Apache S4 IBM InfoSphere Streams TIBCO StreamBase DataTorrent AWS Kinesis Feedzai Guavus Lokad Key: General purpose Specialist analy/c -­‐as-­‐a-­‐Service BigTables Graph Document Key value stores Key value direct access Hadoop MySQL ecosystem Advanced clustering/sharding New SQL databases Data caching Data grid Search Appliances In-­‐memory Stream processing OpenStack Trove 1010data Google BigQuery AWS Redshi^ TempoIQ InfluxDB WebScaleSQL Spider Fabric 2 A B C D E T-­‐Systems A B C D E 2 3 4 5 SQream SpaceCurve Postgres-­‐XL Google Cloud Dataflow Trafodion Hadapt Azure Search Red Hat JBoss Data Grid 4 5 6 MongoDB Cloudant Iris Couch MongoLab RedisGreen Redis-­‐to-­‐go Redis Labs Redis Cloud ObjectRocket Azure DocumentDB ObjectRocket Redis AWS Elas/Cache with Redis MagnetoDB AWS SimpleDB 1 3 Data Platforms Map DBaaS AWS DynamoDB TokuMX IBM DB2 Oracle Database IBM PureData FairCom MemSQL CockroachDB CortexDB EnterpriseDB vFabric Postgres CodeFutures Oracle Big Data Cloud Oracle Teradata Exaly/cs 27 AWS EMR Google Compute Engine Stra/o
  • 28.
    The big bang:drivers for change NoSQL NewSQL Hadoop DBaaS 28 Cloud Elas/c Distributed Architecture Virtual Scalable Flexible JSON Agile Schema-­‐on-­‐read Developers REST Schemaless Flexible Global Web Mobile Interac/ve Applica%ons Always-­‐on Social Local
  • 29.
    Drivers for changeinfluence each other Cloud Elas/c Distributed Schema-­‐on-­‐read § Distributed architecture encourages new development approaches § New applica/ons require distributed architecture JSON Global Web Mobile Interac/ve § New development approaches demand new architecture § Distributed architecture enables new applica/ons § New app requirements demand new development approaches § New dev approaches enable new lightweight apps Architecture Virtual Scalable Flexible Agile Developers REST Schemaless Flexible Applica%ons Always-­‐on Social Local 29
  • 30.
    Drivers for change:applications Social, mobile, global, local all have implica/ons for data connec/vity § Social – increased interac/vity generates data § Mobile – different form factors and access methods § Global – applica/ons need to be immediately available everywhere § Local – need to deliver localized content Global Web Mobile Interac/ve Applica%ons Always-­‐on Social Local 30
  • 31.
    Drivers for change:developers Developers increasingly drive data management and database selec/on § Need to unite applica/on development and database management people/processes to achieve common goals § DevOps movement growing apace § Rapid development and con/nuous delivery is inconsistent with tradi/onal database management processes JSON Agile Schema-­‐on-­‐read Developers REST Schemaless Flexible 31
  • 32.
    Drivers for change:DevOps JSON Agile Schema-­‐on-­‐read Findings: • 51% cite business and strategy drivers: compe//ve, business produc/vity, revenue • 49% cite technology func/onality, new feature sets, reduced development costs • As DevOps matures as a discipline, aligning business and IT goals is ever more important. 451 Research survey of 200 qualified North American DevOps prac//oners or influencers Developers REST Schemaless Flexible 32
  • 33.
    Drivers for change:architecture Transi/oning from a tradi/onal database to a distributed database § Interac/ve applica/ons means the pace of user growth and mul/plicity of data types is too great for tradi/onal rela/onal databases to efficiently absorb. § Scalability § Performance § Relaxed consistency § Agility § Intricacy § Necessity Cloud Elas/c Distributed Architecture Virtual Scalable Flexible 33
  • 34.
    Drivers for change:architecture Transi/oning from a tradi/onal database to a distributed database Transi/oning from on-­‐premises compu/ng to the cloud Cloud Elas/c Distributed Architecture Virtual Scalable Flexible § Amazon’s top enterprise use cases are (in order of popularity starting with the most popular): • Development and test • New workloads Transi/oning from on-­‐premises • Supplement compuexisting /ng to the workloads cloud with cloud • Migration of existing workloads to the cloud • Datacenter migration • All-in cloud • Top three adoption drivers for public cloud are additive to the existing database landscape 34
  • 35.
    Drivers for change:shift to private cloud For each of the major digital infrastructure deployment methods, how are your IT services (% of applica/ons/workloads delivered) distributed? Internal, On-­‐premises, Non-­‐cloud Digital Infrastructure (Physical or Internal, On-­‐premises, Private Cloud-­‐based Off-­‐premises IT Outsourcing, Dedicated/Managed Transi/oning from on-­‐premises compu/ng to the cloud Cloud Cloud Elas/c Elas/c Distributed Distributed Virtual Scalable Flexible Transi/oning from on-­‐premises compu/ng to the cloud Architecture Virtual Scalable Flexible 5% 1% 1% 12% 9% 6% 25% 13% 30% 41% 1% 10% 19% 62% Hybrid Cloud Off-­‐premises Public Cloud Services (SaaS) Off-­‐premises Public Cloud Services (Other Than SaaS) Other Deployment Method 2013 2015 The majority of workloads today are on internal, on-­‐premises, non-­‐cloud and are likely to remain on-­‐premises albeit with a move to private cloud 35 Source: Cloud Compu<ng – Wave 6 hAps://451research.com/report-­‐long?icid=3021
  • 36.
    Drivers for change:public cloud not significant For each of the major digital infrastructure deployment methods, how are your IT services (% of applica/ons/workloads delivered) distributed? Internal, On-­‐premises, Non-­‐cloud Digital Infrastructure (Physical or Internal, On-­‐premises, Private Cloud-­‐based Off-­‐premises IT Outsourcing, Dedicated/Managed Transi/oning from on-­‐premises Pubic cloud compuand /ng SaaS to the are cloud growing, but not making a significant impact Cloud Cloud Elas/c Elas/c Distributed Distributed Virtual Scalable Flexible Transi/oning from on-­‐premises compu/ng to the cloud Architecture Virtual Scalable Flexible 36 5% 1% 1% 1% 12% 13% 10% 9% 6% 30% 25% 41% 19% 62% Hybrid Cloud Off-­‐premises Public Cloud Services (SaaS) Off-­‐premises Public Cloud Services (Other Than SaaS) Other Deployment Method 2013 2015 Source: Cloud Compu<ng – Wave 6 hAps://451research.com/report-­‐long?icid=3021
  • 37.
    For each of the major digital infrastructure deployment methods, how are your IT services (% of applica/ons/workloads delivered) distributed? 5% 1% 1% 12% 9% 6% 25% 13% 30% 41% 1% 10% 19% 62% Internal, On-­‐premises, Non-­‐cloud Digital Infrastructure (Physical or Internal, On-­‐premises, Private Cloud-­‐based Off-­‐premises IT Outsourcing, Dedicated/Managed Hybrid Cloud Off-­‐premises Public Cloud Services (SaaS) Off-­‐premises Public Cloud Services (Other Than SaaS) Other Deployment Method 2013 2015 Drivers for change: hybrid cloud is the future Cloud Cloud Elas/c Elas/c Distributed Distributed Virtual Scalable Flexible Transi/oning from on-­‐premises compu/ng to the cloud Architecture Virtual Scalable Flexible The future of the cloud is hybrid 37 Source: Cloud Compu<ng – Wave 6 hAps://451research.com/report-­‐long?icid=3021
  • 38.
    Drivers for change:the route to hybrid is private For each of the following categories of workload/business func/ons, what is your primary deployment method likely to be in the next two years? Cloud Elas/c Distributed Architecture Virtual Scalable Flexible Test and Development of Applica/ons E-­‐business Hos/ng Customer-­‐facing Enterprise Applica/ons Transi/oning from on-­‐premises compu/ng to the cloud Cloud Distributed Virtual Scalable Flexible Transi/oning from on-­‐premises compu/ng to the cloud 53% 68% 11% 35% 53% 18% 42% 18% 14% 23% 17% 26% 43% 27% 31% 29% 18% 66% 46% 21% 39% Collabora/ve Applica/ons Cloud-­‐na/ve Applica/ons Batch Compu/ng Applica/ons Back-­‐office Enterprise Applica/ons Internal Private Cloud Hybrid Cloud External Public Cloud and SaaS 38 Source: Cloud Compu<ng – Wave 6 hAps://451research.com/report-­‐long?icid=3021
  • 39.
    JSON Agile ApplicaSchema-­‐on-­‐/read ons Developers Architecture REST Schemaless Flexible Drivers for change Cloud Elas/c Distributed Architecture Virtual Scalable Flexible Global Web Mobile Interac/ve NoSQL NewSQL Hadoop DBaaS Applica%ons Always-­‐on Social Local 39
  • 40.
    New databases: differences 40 Non-­‐rela/onal data models. Trade-­‐off consistency for availability NoSQL Adds availability and flexibility to the familiar rela/onal data model NewSQL Any of the above, or tradi/onal RDBMS, delivered as a service DBaaS Batch (and now interac/ve) analy/c processing Hadoop of unstructured data
  • 41.
    New databases: use-cases Approach Details Examples NoSQL MongoDB, Couchbase, Cassandra, Redis, Aerospike, Cloudant Non-­‐transac/onal opera/onal applica/ons, unstructured data, lightweight query NewSQL NuoDB, MemSQL, Transla[ce, VoltDB, Splice Machine Transac/onal opera/onal apps, structured data, complex query, opera/onal intelligence Hadoop Cloudera, MapR, Hortonworks, Pivotal, IBM, Teradata Non-­‐transac/onal analy/c applica/ons, mul/-­‐structured data, complex query DBaaS ObjectRocket, AWS DynamoDB, AWS RDS, Al/scale, Qubole Any of the above, or tradi/onal RDBMS, delivered as a service 41
  • 42.
    The big bang § The database market expanded rapidly driven partly by polyglot persistence • Specialist databases for specialist use-cases 42 Wide-­‐ column stores Key Value stores Graph databases Document stores NewSQL Hadoop DBaaS
  • 43.
    The big crunch? § Multi-model databases are a growing trend to support multiple use-cases and ease operational complexity – is the database market contracting? 43 Wide-­‐ column stores Key Value stores Graph databases Document stores Mul/-­‐model databases Support a combina/on of the various individual NoSQL data models. NewSQL Hadoop DBaaS
  • 44.
    § Increasingly thatalso means SQL as NoSQL vendors add SQL support and SQL vendors add support for other models NewSQL The big crunch? 44 Wide-­‐ column stores Key Value stores Graph databases Document stores Mul/-­‐model databases Support a combina/on of various individual data models. Hadoop DBaaS
  • 45.
    § While aaSis really just a deployment option NewSQL The big crunch? 45 Wide-­‐ column stores Key Value stores Graph databases Document stores Mul/-­‐model databases Support a combina/on of various individual data models. XaaS Hadoop
  • 46.
    § And HDFScould potential become a common substrate for multiple data processing models NewSQL The big crunch? 46 Wide-­‐ column stores Key Value stores Graph databases Document stores Mul/-­‐model databases Support a combina/on of various individual data models. Hadoop Distributed File System XaaS
  • 47.
    Building blocks ofthe next-generation data platform? § Distributed, elastic, flexible, new projects NewSQL 47 Wide-­‐ column stores Key Value stores Graph databases Document stores Mul/-­‐model databases Support a combina/on of various individual data models. Hadoop Distributed File System XaaS
  • 48.
    The journey toa next-generation data platform ? Managed strategic data plauorm Strategic consolida/on Tac/cal consolida/on Status quo 48 Mul/-­‐tenant, mul/-­‐model, mul/-­‐data center, hybrid, agile, elas/c, distributed, as-­‐a-­‐service, automated Strategic NewSQL, NoSQL, Hadoop, DBaaS produc/on deployments, strategic vendor rela/onships Managed polyglot data plauorms Strategic expansion Tac/cal NewSQL, NoSQL, Hadoop, DBaaS PoCs and departmental deployments, shadow IT Managed polyglot datab ases Centralized, scale-­‐up, SQL rela/onal databases Unmanaged polyglot databases Tac/cal expansion SQL-­‐on-­‐Hadoop, federated query, mul/-­‐model NoSQL and NewSQL, managed DBaaS, hybrid cloud
  • 49.
    1 6 AWS EMR Non-­‐rela%onal zone CitusDB IBM Big SQL Rela%onal zone MySQL 114 SQLStream So^ware AG Sqrrl Enterprise RethinkDB Elas/csearch Cassandra Hypertable HBase Compose CloudBird Lotus Notes Ipedo XML Database Tamino XML Server UniData UniVerse McObject Objec/vity MarkLogic InterSystems Caché Starcounter ArangoDB Aerospike Founda/onDB Neo4J AffinityDB Trinity Giraph InfiniteGraph Oracle NoSQL CouchDB Redis Handlersocket RavenDB Google App Engine Datastore LevelDB Accumulo Riak Couchbase Splice Machine Ac/an Ingres SAP Sybase ASE EnterpriseDB SQL Server MySQL MariaDB SAP HANA Informix IBM DB2 Google Cloud SQL HP Cloud RDB for MySQL ClearDB Database.com Rackspace Cloud Databases AWS RDS Azure SQL Database FathomDB StormDB HPCC Teradata Aster Cloudera MapR IBM Hortonworks BigInsights Google Compute Engine ZeWaset NGDATA Infochimps Metascale Mortar Data Rackspace Qubole Voldemort IBM PureData for Analy/cs Teradata XtremeData Metamarkets Druid Ac/an Vector Kx Systems IBM InfoSphere SAP Sybase IQ HP Ver/ca Pivotal Greenplum Exasol Ac/an Matrix ParStream TokuDB ScaleDB ScaleBase ScaleArc Infobright Con/nuent vFabric Postgres TransLa[ce NuoDB Drizzle JustOneDB Pivotal GemFire XD Zimory Scale Galera Clustrix Tesora MemSQL GenieDB YarcData Datomic FlockDB Allegrograph HypergraphDB MemCachier Redis Labs Memcached Cloud FairCom BitYota IronCache Grid/cache zone Memcached Ehcache ScaleOut So^ware IBM eXtreme Scale Oracle Coherence GridGain GigaSpaces XAP Pivotal GemFire CloudTran InfiniSpan Hazelcast Oracle Exaly/cs Oracle Database MySQL Cluster Oracle Endeca Server A[vio LucidWorks Big Data Lucene/Solr IBM InfoSphere Data Explorer Towards E-­‐discovery Towards enterprise search Documentum xDB ObjectStore LucidDB MonetDB Databricks/Spark AWS Elas/Cache Firebird SciDB SQLite Oracle TimesTen solidDB Adabas IBM IMS WakandaDB Al/scale Oracle Big Data Appliance RainStor OrientDB Sparksee Metamarkets Treasure Data PostgreSQL Percona HyperDex TIBCO Ac/veSpaces Titan SAP Sybase SQL Anywhere JethroData Pivotal HD/HAWQ BigMemory Ac/an Versant DataStax Enterprise DeepDB FatDB Google Cloud Datastore Heroku Postgres GrapheneDB Cassandra.io BerkeleyDB Microso^ HDInsight HP Autonomy Oracle Exadata IBM PureData Impala Apache Drill Presto Microso^ SQL Server PDW Apache Tajo Apache Hive SPARQLBASE MammothDB Al/base HDB LogicBlox SRCH2 TIBCO LogLogic Splunk Towards SIEM Loggly Sumo Logentries Logic InfiniSQL JumboDB Ac/an PSQL Progress OpenEdge Kogni/o Al/base XDB Savvis So^layer Verizon xPlenty Stardog MariaDB Enterprise Apache Storm Apache S4 IBM InfoSphere Streams TIBCO StreamBase DataTorrent AWS Kinesis Feedzai Guavus Lokad Key: General purpose Specialist analy/c -­‐as-­‐a-­‐Service BigTables Graph Document Key value stores Key value direct access Hadoop MySQL ecosystem Advanced clustering/sharding New SQL databases Data caching Data grid Search Appliances In-­‐memory Stream processing OpenStack Trove 1010data Google BigQuery AWS Redshi^ TempoIQ InfluxDB WebScaleSQL Spider Fabric 2 A B C D E T-­‐Systems A B C D E 2 3 4 5 SQream SpaceCurve Postgres-­‐XL Google Cloud Dataflow Trafodion Hadapt Azure Search Red Hat JBoss Data Grid 4 5 6 MongoDB Cloudant Iris Couch MongoLab RedisGreen Redis-­‐to-­‐go Redis Labs Redis Cloud ObjectRocket Azure DocumentDB ObjectRocket Redis AWS Elas/Cache with Redis MagnetoDB AWS SimpleDB 1 3 Data Platforms Map October 2014 CockroachDB AWS DynamoDB TokuMX VoltDB CortexDB CodeFutures Oracle Big Data Cloud 49 Stra/o
  • 50.
    Thank You! matthew.aslett@451research.com @maslett www.451research.com