Blueflood: Open Source Metrics Processing at CassandraEU 2013

G
Blueflood
Simple Metrics Processing

Gary Dusbabek • Cassandra EU 2013
Motivation
Building Blocks
Future
Future Stuff
Motivation
Get	
  	
  
the Data 	
  
In
Each check
generates 2-20
metrics
Multiply by data
centers
Blueflood: Open Source Metrics Processing at CassandraEU 2013
Currently handling
120 million metrics
per hour
40 million aggregate
Cassandra write
operations per hour
Get	
  the	
  Data	
  Out	
  
Fast	
  Graphs!	
  
Think:	
  Dashboards	
  
SLA	
  is	
  important	
  
Get	
  the	
  Data	
  Out	
  

Get	
  	
  
the Data 	
  
Out

Fast	
  Graphs!	
  
Think:	
  Dashboards	
  
SLA	
  is	
  important	
  
Get	
  the	
  Data	
  Out	
  
Fast	
  Graphs!	
  
Think:	
  Dashboards	
  
SLA	
  is	
  important	
  

Fast
Graphs
Multitenant
Different
SLAs
expectations
Hard
Tenants imply
Metadata
Hampers
generic computing
Systems
Lipstick system
Nice to Have
Not
Mission Critical
Don’t
Break the Bank
Blueflood: Open Source Metrics Processing at CassandraEU 2013
Avoid
Hadoop
HATE
Hadoop
HATE
Hadoop
We Ended Up With This
Ingestion API

Ingestion
Transform

Query API

Metadata + Cache

Rollup Scheduler

State Management
Java Ingestion
Library

Java Rollup
Library
Database
(Cassandra)

Java Query
Library
We Ended Up With This
Ingestion API

Ingestion
Transform

Query API

Metadata + Cache

Rollup Scheduler

State Management
Java Ingestion
Library

Java Rollup
Library
Database
(Cassandra)

Java Query
Library
Cassandra

Database
(Cassandra)
Cassandra
1.0, 1.1, 1.2 Compatible

No 2.0 yet
	
  
Cassandra
Experimented with CQL very early on

CQL 1.0 time frame	
  
Cassandra
Experimented with CQL very early on

CQL 1.0 time frame	
  
Cassandra
Astyanax now
Mostly happy with it
Connection pool implementation is
very sensitive to network bumps
Cassandra



Experimented with various compaction
strategies
No real winner
Leveldb bugs in 1.0 made it almost a
non-starter
Cassandra
CASSANDRA-5685


Per-CF TTLs
Doesn’t help us
Might help you
Cassandra
CASSANDRA-3974


TTL histogram used to give input on
which sstables are good candidates
for compaction (size-tiered only)
Cassandra
CASSANDRA-5228

Track max TTL per sstable to expire the
whole thing.
We could use this by using bucketed CFs
Anatomy of a Metric
One dimensional signal
Has an ID
We call this a locator
Mostly opaque
Tuple of (tenantId [,other things,…]
Anatomy of a Metric
Example:



6335,web01,ping,bytes
Anatomy of a Metric
Stuff whatever you want in there
Just don’t change it
It becomes a key
Anatomy of a Metric
Has a type associated with it:
long, double, string, boolean

Type determines on-disk
serialization
!
{!
"timestamp": 1319222001982,!
"monitoring_zone_id": "mzXXXXXXXX",!
"available": true,!
"status": "code=200,rt=0.257s,bytes=0",!
"metrics": {!
"bytes": {!
"type": "i",!
"data": "0"!
},!
"tt_firstbyte": {!
"type": "I",!
"data": "257"!
},!
"tt_connect": {!
"type": "I",!
"data": "128"!
},!
"code": {!
"type": "s",!
"data": "200"!
},!
"duration": {!
"type": "I",!
"data": "257"!
}!
}!

Example

}!
Anatomy of a Metric
Sometimes has units

Example: seconds, bytes, light
years



We guess on this
Column Families
Metrics
Full resolution
One per granularity (5m, 20m, 60m, 240m,
1440m)
One row per metric
Locator is the key
Column Families
Metrics
No Bucketing
Will be required for high frequency metrics
Solution is easy
Just complicates Locator resolution
Column Families
Metadata


One row per metric

Rollup State


Nasty map for tracking shard state

Active Metrics

Shard to list of locators
Column Families
STRING & BOOLEAN


Speshul

Only updated when values change

Plumbing keeps old values in
memory
Libraries

Java Ingestion
Library

Java Rollup
Library
Database
(Cassandra)

Java Query
Library
Ingestion LibrarY
insert_metrics(list<metric>)!
Ingestion LibrarY
update_state(shard, granularity, slot)!

SLOT == Bucket of time
	
  
Rollup LibrarY
get_active_locators(shard)!
get_state(shard, granularity, slot)!

get_metrics(from, to, locator, granularity)!

write_rollups(list<rollup>)!

update_state(shard, granularity, slot)!

	
  
Rollup LibrarY
Supports bulk operations outside
of the service

Enables tools to be written
	
  
Rollup LibrarY
Rollups contain

count, min, max, mean, variance

Serialization is versioned
	
  
Query LibrarY
get_data(form, to, granularity)!
get_data(from, to, num_points)!
	
  
Metadata & Cache
Metadata + Cache

State Management
Java Ingestion
Library

Java Rollup
Library
Database
(Cassandra)

Java Query
Library
Metadata & Cache
Integrated into services (ingestion
& rollup)



Backed by Cassandra
Supports different eviction
strategies based on needs
Metadata & Cache


Example 1:


TTLs are linked to tenants and are
not known when metrics are
ingested
A separate API must be consulted
Metadata & Cache


Example 2:


Units are valuable only at query
time, but are not included with
metrics
Heuristically guess and store these
Rollup Schedule Service
Metadata + Cache

Rollup Scheduler

State Management
Java Ingestion
Library

Java Rollup
Library
Database
(Cassandra)

Java Query
Library
Rollup Schedule Service
Problem:
Divide time into buckets without
scratching at infinity

Identify them using a finite set of keys
Rollup Schedule Service
Solution:

Order preserving consistent hashing
for timestamps
Rollup Schedule Service


Imagine a two week period divided
into slots the size of each granularity
Rollup Schedule Service
4032 5m slots
1008 20m slots
336 60m slots
84 240m slots
14 1440m slots
Rollup Schedule Service
Gives us a way of consistently
addressing and bucketing time ranges

As time increases, so does the slot it
hashes to (until it wraps to zero)
Rollup Schedule Service

When do we roll up?
Whenever an active slot
a) has not been updated in N seconds
b) is M seconds old
Rollup Schedule Service

What about late data?

Late data can be ingested
for 24 hours
Ingestion	
  Processors
Ingestion
Transform

Metadata + Cache

Rollup Scheduler

State Management
Java Ingestion
Library

Java Rollup
Library
Database
(Cassandra)

Java Query
Library
Ingestion	
  Processors
Every metric is not built the same way
They come from different places
Processors allow you to make them consistent
Can be synchronous or asynchronous
API Endpoints
Ingestion API

Ingestion
Transform

Query API

Metadata + Cache

Rollup Scheduler

State Management
Java Ingestion
Library

Java Rollup
Library
Database
(Cassandra)

Java Query
Library
API Endpoints
Why not ship it with API endpoints?


External forces
API Endpoints
Decided to make them

Modular
API Endpoints
We do ship reference API endpoints

UDP Ingestion
HTTP Ingestion
HTTP Query
API Endpoints

Downside?
More work for you
API Endpoints

Upside?
We ♥ Pull Requests
How Does It Scale?
Ingestion scales linearly 

Add ingestion nodes until
Cassandra is the bottleneck
How Does It Scale?
Two ingestors per DC
Only one per DC is active
Double ingest
How Does It Scale?
Rollups scale [almost] linearly by
spreading out shard ownership


Shards are currently pegged at
128



Ok to have multiple nodes own a
shard
Zookeeper is a soft-dependency
Future Stuff
Local ingestion durability
Future Stuff



Richer metadata API
Example: tag metrics and then use
those tags as a query facet
Will require an index
Experimenting with ElasticSearch
Home-rolled bitmap indexes
Future Stuff
Pre-aggregated Metrics
Histograms (partially
implemented)
Counters, Timers, Gauges, Sets
Future Stuff
Deep statsd and graphite
integration (active work)

Statsd is hard because counts get
reset after a flush
Future Stuff
Graphite is just involved
(new rollup types)
Whisper DB interface
Then hack carbon to support it
Already pluggable, just needs
integration
Thanks!
http://blueflood.io
blueflood-discuss@googlegroups.com
Freenode: #blueflood
gitub.com/rackerlabs/blueflood
Twitter: @gdusbabek
Image Credits



All images for this presentation come from the Flickr commons collection
http://www.flickr.com/commons/
flood
guide
motivation
cows
jet
apartments
groups
lipstick
elephant
containers
anatomy
columns
library
cache money
railyard
processors
terminal
fish
future
thanks

http://www.flickr.com/photos/keenepubliclibrary/2593172720/sizes/z/
http://www.flickr.com/photos/field_museum_library/3796303860/
http://www.flickr.com/photos/statelibraryofnsw/4944459226/sizes/l/in/photolist-8wVDt1/
http://www.flickr.com/photos/nationalarchives/7457004362/sizes/l/
http://www.flickr.com/photos/sdasmarchives/4564334397/sizes/o/
http://www.flickr.com/photos/nypl/3110619126/sizes/o/
http://www.flickr.com/photos/fylkesarkiv/4545544268/sizes/l/ 
http://www.flickr.com/photos/library_of_congress/2179918784/sizes/o/
http://www.flickr.com/photos/statelibraryofnsw/2963006536/sizes/o/
http://www.flickr.com/photos/smu_cul_digitalcollections/9526924556/sizes/l/
http://www.flickr.com/photos/usnationalarchives/5573758997/sizes/l/
http://www.flickr.com/photos/cornelluniversitylibrary/3485933761/sizes/l/
http://www.flickr.com/photos/statelibraryofnsw/4414971043/sizes/l/
http://www.flickr.com/photos/smu_cul_digitalcollections/8519861690/sizes/l/
http://www.flickr.com/photos/nlireland/8443250313/sizes/h/
http://www.flickr.com/photos/national_library_of_australia_commons/6174084474/sizes/l/
http://www.flickr.com/photos/nypl/3110609190/sizes/o/
http://www.flickr.com/photos/hartlepool_museum/4398630456/sizes/o/
http://www.flickr.com/photos/usnationalarchives/7158774350/sizes/l/
http://www.flickr.com/photos/nlireland/9490851253/sizes/l/
1 of 82

Recommended

Blueflood and Beyond: The Future of Metrics - Berlin Buzzwords 2014 by
Blueflood and Beyond: The Future of Metrics - Berlin Buzzwords 2014Blueflood and Beyond: The Future of Metrics - Berlin Buzzwords 2014
Blueflood and Beyond: The Future of Metrics - Berlin Buzzwords 2014gdusbabek
1.3K views73 slides
Airflow @ Agari by
Airflow @ Agari Airflow @ Agari
Airflow @ Agari Sid Anand
2.6K views58 slides
Flink at netflix paypal speaker series by
Flink at netflix   paypal speaker seriesFlink at netflix   paypal speaker series
Flink at netflix paypal speaker seriesMonal Daxini
3.7K views90 slides
Running Presto and Spark on the Netflix Big Data Platform by
Running Presto and Spark on the Netflix Big Data PlatformRunning Presto and Spark on the Netflix Big Data Platform
Running Presto and Spark on the Netflix Big Data PlatformEva Tse
1.8K views68 slides
(BDT403) Netflix's Next Generation Big Data Platform | AWS re:Invent 2014 by
(BDT403) Netflix's Next Generation Big Data Platform | AWS re:Invent 2014(BDT403) Netflix's Next Generation Big Data Platform | AWS re:Invent 2014
(BDT403) Netflix's Next Generation Big Data Platform | AWS re:Invent 2014Amazon Web Services
3.4K views60 slides
Lambda at Weather Scale by Robbie Strickland by
Lambda at Weather Scale by Robbie StricklandLambda at Weather Scale by Robbie Strickland
Lambda at Weather Scale by Robbie StricklandSpark Summit
2K views121 slides

More Related Content

What's hot

Next Generation Big Data Platform at Netflix 2014 by
Next Generation Big Data Platform at Netflix 2014Next Generation Big Data Platform at Netflix 2014
Next Generation Big Data Platform at Netflix 2014Eva Tse
3.8K views60 slides
BDT303 Data Science with Elastic MapReduce - AWS re: Invent 2012 by
BDT303 Data Science with Elastic MapReduce - AWS re: Invent 2012BDT303 Data Science with Elastic MapReduce - AWS re: Invent 2012
BDT303 Data Science with Elastic MapReduce - AWS re: Invent 2012Amazon Web Services
3.1K views38 slides
Resilient Predictive Data Pipelines (QCon London 2016) by
Resilient Predictive Data Pipelines (QCon London 2016)Resilient Predictive Data Pipelines (QCon London 2016)
Resilient Predictive Data Pipelines (QCon London 2016)Sid Anand
2K views60 slides
Cloud Native Predictive Data Pipelines (micro talk) by
Cloud Native Predictive Data Pipelines (micro talk)Cloud Native Predictive Data Pipelines (micro talk)
Cloud Native Predictive Data Pipelines (micro talk)Sid Anand
454 views24 slides
xPatterns on Spark, Shark, Mesos, Tachyon by
xPatterns on Spark, Shark, Mesos, TachyonxPatterns on Spark, Shark, Mesos, Tachyon
xPatterns on Spark, Shark, Mesos, TachyonClaudiu Barbura
8.8K views26 slides
Resilient Predictive Data Pipelines (GOTO Chicago 2016) by
Resilient Predictive Data Pipelines (GOTO Chicago 2016)Resilient Predictive Data Pipelines (GOTO Chicago 2016)
Resilient Predictive Data Pipelines (GOTO Chicago 2016)Sid Anand
1.5K views104 slides

What's hot(20)

Next Generation Big Data Platform at Netflix 2014 by Eva Tse
Next Generation Big Data Platform at Netflix 2014Next Generation Big Data Platform at Netflix 2014
Next Generation Big Data Platform at Netflix 2014
Eva Tse3.8K views
BDT303 Data Science with Elastic MapReduce - AWS re: Invent 2012 by Amazon Web Services
BDT303 Data Science with Elastic MapReduce - AWS re: Invent 2012BDT303 Data Science with Elastic MapReduce - AWS re: Invent 2012
BDT303 Data Science with Elastic MapReduce - AWS re: Invent 2012
Amazon Web Services3.1K views
Resilient Predictive Data Pipelines (QCon London 2016) by Sid Anand
Resilient Predictive Data Pipelines (QCon London 2016)Resilient Predictive Data Pipelines (QCon London 2016)
Resilient Predictive Data Pipelines (QCon London 2016)
Sid Anand2K views
Cloud Native Predictive Data Pipelines (micro talk) by Sid Anand
Cloud Native Predictive Data Pipelines (micro talk)Cloud Native Predictive Data Pipelines (micro talk)
Cloud Native Predictive Data Pipelines (micro talk)
Sid Anand454 views
xPatterns on Spark, Shark, Mesos, Tachyon by Claudiu Barbura
xPatterns on Spark, Shark, Mesos, TachyonxPatterns on Spark, Shark, Mesos, Tachyon
xPatterns on Spark, Shark, Mesos, Tachyon
Claudiu Barbura8.8K views
Resilient Predictive Data Pipelines (GOTO Chicago 2016) by Sid Anand
Resilient Predictive Data Pipelines (GOTO Chicago 2016)Resilient Predictive Data Pipelines (GOTO Chicago 2016)
Resilient Predictive Data Pipelines (GOTO Chicago 2016)
Sid Anand1.5K views
Monitoring, Hold the Infrastructure - Getting the Most out of AWS Lambda – Da... by Amazon Web Services
Monitoring, Hold the Infrastructure - Getting the Most out of AWS Lambda – Da...Monitoring, Hold the Infrastructure - Getting the Most out of AWS Lambda – Da...
Monitoring, Hold the Infrastructure - Getting the Most out of AWS Lambda – Da...
Kafka Summit NYC 2017 - Data Processing at LinkedIn with Apache Kafka by confluent
Kafka Summit NYC 2017 - Data Processing at LinkedIn with Apache KafkaKafka Summit NYC 2017 - Data Processing at LinkedIn with Apache Kafka
Kafka Summit NYC 2017 - Data Processing at LinkedIn with Apache Kafka
confluent3.4K views
Homologous Apache Spark Clusters Using Nomad with Alex Dadgar by Databricks
Homologous Apache Spark Clusters Using Nomad with Alex DadgarHomologous Apache Spark Clusters Using Nomad with Alex Dadgar
Homologous Apache Spark Clusters Using Nomad with Alex Dadgar
Databricks1.1K views
Streaming Analytics with Spark, Kafka, Cassandra and Akka by Helena Edelson
Streaming Analytics with Spark, Kafka, Cassandra and AkkaStreaming Analytics with Spark, Kafka, Cassandra and Akka
Streaming Analytics with Spark, Kafka, Cassandra and Akka
Helena Edelson52.1K views
Lambda Architecture with Spark, Spark Streaming, Kafka, Cassandra, Akka and S... by Helena Edelson
Lambda Architecture with Spark, Spark Streaming, Kafka, Cassandra, Akka and S...Lambda Architecture with Spark, Spark Streaming, Kafka, Cassandra, Akka and S...
Lambda Architecture with Spark, Spark Streaming, Kafka, Cassandra, Akka and S...
Helena Edelson86.2K views
From a kafkaesque story to The Promised Land by Ran Silberman
From a kafkaesque story to The Promised LandFrom a kafkaesque story to The Promised Land
From a kafkaesque story to The Promised Land
Ran Silberman4.7K views
Cloud Native Data Pipelines (DataEngConf SF 2017) by Sid Anand
Cloud Native Data Pipelines (DataEngConf SF 2017)Cloud Native Data Pipelines (DataEngConf SF 2017)
Cloud Native Data Pipelines (DataEngConf SF 2017)
Sid Anand866 views
Spark Streaming: Pushing the throughput limits by Francois Garillot and Gerar... by Spark Summit
Spark Streaming: Pushing the throughput limits by Francois Garillot and Gerar...Spark Streaming: Pushing the throughput limits by Francois Garillot and Gerar...
Spark Streaming: Pushing the throughput limits by Francois Garillot and Gerar...
Spark Summit4.2K views
QCon London 2016 - Patterns of reliable in-stream processing @ Scale by Alexey Kharlamov
QCon London 2016 - Patterns of reliable in-stream processing @ ScaleQCon London 2016 - Patterns of reliable in-stream processing @ Scale
QCon London 2016 - Patterns of reliable in-stream processing @ Scale
Alexey Kharlamov750 views
Real Time Data Processing Using Spark Streaming by Hari Shreedharan
Real Time Data Processing Using Spark StreamingReal Time Data Processing Using Spark Streaming
Real Time Data Processing Using Spark Streaming
Hari Shreedharan2.6K views
Real Time Data Streaming using Kafka & Storm by Ran Silberman
Real Time Data Streaming using Kafka & StormReal Time Data Streaming using Kafka & Storm
Real Time Data Streaming using Kafka & Storm
Ran Silberman16.9K views
Cassandra and SparkSQL: You Don't Need Functional Programming for Fun with Ru... by Databricks
Cassandra and SparkSQL: You Don't Need Functional Programming for Fun with Ru...Cassandra and SparkSQL: You Don't Need Functional Programming for Fun with Ru...
Cassandra and SparkSQL: You Don't Need Functional Programming for Fun with Ru...
Databricks6.2K views
Scalable complex event processing on samza @UBER by Shuyi Chen
Scalable complex event processing on samza @UBERScalable complex event processing on samza @UBER
Scalable complex event processing on samza @UBER
Shuyi Chen4.8K views

Viewers also liked

Flapjack: rethinking monitoring for the cloud by
Flapjack: rethinking monitoring for the cloudFlapjack: rethinking monitoring for the cloud
Flapjack: rethinking monitoring for the cloudLindsay Holmwood
2.7K views142 slides
Introduction to CQL and Data Modeling with Apache Cassandra by
Introduction to CQL and Data Modeling with Apache CassandraIntroduction to CQL and Data Modeling with Apache Cassandra
Introduction to CQL and Data Modeling with Apache CassandraJohnny Miller
7.5K views90 slides
NYC* 2013 — "Using Cassandra for DVR Scheduling at Comcast" by
NYC* 2013 — "Using Cassandra for DVR Scheduling at Comcast"NYC* 2013 — "Using Cassandra for DVR Scheduling at Comcast"
NYC* 2013 — "Using Cassandra for DVR Scheduling at Comcast"DataStax Academy
34.9K views27 slides
Cassandra & puppet, scaling data at $15 per month by
Cassandra & puppet, scaling data at $15 per monthCassandra & puppet, scaling data at $15 per month
Cassandra & puppet, scaling data at $15 per monthdaveconnors
37.3K views31 slides
C* Summit 2013: Time for a New Relationship - Intuit's Journey from RDBMS to ... by
C* Summit 2013: Time for a New Relationship - Intuit's Journey from RDBMS to ...C* Summit 2013: Time for a New Relationship - Intuit's Journey from RDBMS to ...
C* Summit 2013: Time for a New Relationship - Intuit's Journey from RDBMS to ...DataStax Academy
29.3K views14 slides
Data Modeling with Cassandra and Time Series Data by
Data Modeling with Cassandra and Time Series DataData Modeling with Cassandra and Time Series Data
Data Modeling with Cassandra and Time Series DataDani Traphagen
5.1K views41 slides

Viewers also liked(10)

Flapjack: rethinking monitoring for the cloud by Lindsay Holmwood
Flapjack: rethinking monitoring for the cloudFlapjack: rethinking monitoring for the cloud
Flapjack: rethinking monitoring for the cloud
Lindsay Holmwood2.7K views
Introduction to CQL and Data Modeling with Apache Cassandra by Johnny Miller
Introduction to CQL and Data Modeling with Apache CassandraIntroduction to CQL and Data Modeling with Apache Cassandra
Introduction to CQL and Data Modeling with Apache Cassandra
Johnny Miller7.5K views
NYC* 2013 — "Using Cassandra for DVR Scheduling at Comcast" by DataStax Academy
NYC* 2013 — "Using Cassandra for DVR Scheduling at Comcast"NYC* 2013 — "Using Cassandra for DVR Scheduling at Comcast"
NYC* 2013 — "Using Cassandra for DVR Scheduling at Comcast"
DataStax Academy34.9K views
Cassandra & puppet, scaling data at $15 per month by daveconnors
Cassandra & puppet, scaling data at $15 per monthCassandra & puppet, scaling data at $15 per month
Cassandra & puppet, scaling data at $15 per month
daveconnors37.3K views
C* Summit 2013: Time for a New Relationship - Intuit's Journey from RDBMS to ... by DataStax Academy
C* Summit 2013: Time for a New Relationship - Intuit's Journey from RDBMS to ...C* Summit 2013: Time for a New Relationship - Intuit's Journey from RDBMS to ...
C* Summit 2013: Time for a New Relationship - Intuit's Journey from RDBMS to ...
DataStax Academy29.3K views
Data Modeling with Cassandra and Time Series Data by Dani Traphagen
Data Modeling with Cassandra and Time Series DataData Modeling with Cassandra and Time Series Data
Data Modeling with Cassandra and Time Series Data
Dani Traphagen5.1K views
Time series with Apache Cassandra - Long version by Patrick McFadin
Time series with Apache Cassandra - Long versionTime series with Apache Cassandra - Long version
Time series with Apache Cassandra - Long version
Patrick McFadin19.8K views
Cassandra at eBay - Cassandra Summit 2012 by Jay Patel
Cassandra at eBay - Cassandra Summit 2012Cassandra at eBay - Cassandra Summit 2012
Cassandra at eBay - Cassandra Summit 2012
Jay Patel85.8K views
Migrating Netflix from Datacenter Oracle to Global Cassandra by Adrian Cockcroft
Migrating Netflix from Datacenter Oracle to Global CassandraMigrating Netflix from Datacenter Oracle to Global Cassandra
Migrating Netflix from Datacenter Oracle to Global Cassandra
Adrian Cockcroft126.5K views
Cassandra 2.0 and timeseries by Patrick McFadin
Cassandra 2.0 and timeseriesCassandra 2.0 and timeseries
Cassandra 2.0 and timeseries
Patrick McFadin13.7K views

Similar to Blueflood: Open Source Metrics Processing at CassandraEU 2013

透過 Amazon Redshift 打造數據分析服務及 Amazon Redshift 新功能案例介紹 by
透過 Amazon Redshift 打造數據分析服務及 Amazon Redshift 新功能案例介紹透過 Amazon Redshift 打造數據分析服務及 Amazon Redshift 新功能案例介紹
透過 Amazon Redshift 打造數據分析服務及 Amazon Redshift 新功能案例介紹Amazon Web Services
870 views44 slides
SnappyData, the Spark Database. A unified cluster for streaming, transactions... by
SnappyData, the Spark Database. A unified cluster for streaming, transactions...SnappyData, the Spark Database. A unified cluster for streaming, transactions...
SnappyData, the Spark Database. A unified cluster for streaming, transactions...SnappyData
2.5K views42 slides
SnappyData at Spark Summit 2017 by
SnappyData at Spark Summit 2017SnappyData at Spark Summit 2017
SnappyData at Spark Summit 2017Jags Ramnarayan
120 views42 slides
Data & Analytics - Session 2 - Introducing Amazon Redshift by
Data & Analytics - Session 2 - Introducing Amazon RedshiftData & Analytics - Session 2 - Introducing Amazon Redshift
Data & Analytics - Session 2 - Introducing Amazon RedshiftAmazon Web Services
6.2K views47 slides
Building Data Warehouses and Data Lakes in the Cloud - DevDay Austin 2017 Day 2 by
Building Data Warehouses and Data Lakes in the Cloud - DevDay Austin 2017 Day 2Building Data Warehouses and Data Lakes in the Cloud - DevDay Austin 2017 Day 2
Building Data Warehouses and Data Lakes in the Cloud - DevDay Austin 2017 Day 2Amazon Web Services
397 views64 slides
6° Sessione - Ambiti applicativi nella ricerca di tecnologie statistiche avan... by
6° Sessione - Ambiti applicativi nella ricerca di tecnologie statistiche avan...6° Sessione - Ambiti applicativi nella ricerca di tecnologie statistiche avan...
6° Sessione - Ambiti applicativi nella ricerca di tecnologie statistiche avan...Jürgen Ambrosi
538 views33 slides

Similar to Blueflood: Open Source Metrics Processing at CassandraEU 2013(20)

透過 Amazon Redshift 打造數據分析服務及 Amazon Redshift 新功能案例介紹 by Amazon Web Services
透過 Amazon Redshift 打造數據分析服務及 Amazon Redshift 新功能案例介紹透過 Amazon Redshift 打造數據分析服務及 Amazon Redshift 新功能案例介紹
透過 Amazon Redshift 打造數據分析服務及 Amazon Redshift 新功能案例介紹
SnappyData, the Spark Database. A unified cluster for streaming, transactions... by SnappyData
SnappyData, the Spark Database. A unified cluster for streaming, transactions...SnappyData, the Spark Database. A unified cluster for streaming, transactions...
SnappyData, the Spark Database. A unified cluster for streaming, transactions...
SnappyData2.5K views
Data & Analytics - Session 2 - Introducing Amazon Redshift by Amazon Web Services
Data & Analytics - Session 2 - Introducing Amazon RedshiftData & Analytics - Session 2 - Introducing Amazon Redshift
Data & Analytics - Session 2 - Introducing Amazon Redshift
Amazon Web Services6.2K views
Building Data Warehouses and Data Lakes in the Cloud - DevDay Austin 2017 Day 2 by Amazon Web Services
Building Data Warehouses and Data Lakes in the Cloud - DevDay Austin 2017 Day 2Building Data Warehouses and Data Lakes in the Cloud - DevDay Austin 2017 Day 2
Building Data Warehouses and Data Lakes in the Cloud - DevDay Austin 2017 Day 2
6° Sessione - Ambiti applicativi nella ricerca di tecnologie statistiche avan... by Jürgen Ambrosi
6° Sessione - Ambiti applicativi nella ricerca di tecnologie statistiche avan...6° Sessione - Ambiti applicativi nella ricerca di tecnologie statistiche avan...
6° Sessione - Ambiti applicativi nella ricerca di tecnologie statistiche avan...
Jürgen Ambrosi538 views
Simplify and Scale Data Engineering Pipelines with Delta Lake by Databricks
Simplify and Scale Data Engineering Pipelines with Delta LakeSimplify and Scale Data Engineering Pipelines with Delta Lake
Simplify and Scale Data Engineering Pipelines with Delta Lake
Databricks2.3K views
Build 2017 - P4010 - A lap around Azure HDInsight and Cosmos DB Open Source A... by Windows Developer
Build 2017 - P4010 - A lap around Azure HDInsight and Cosmos DB Open Source A...Build 2017 - P4010 - A lap around Azure HDInsight and Cosmos DB Open Source A...
Build 2017 - P4010 - A lap around Azure HDInsight and Cosmos DB Open Source A...
Windows Developer696 views
AWS Webcast - Managing Big Data in the AWS Cloud_20140924 by Amazon Web Services
AWS Webcast - Managing Big Data in the AWS Cloud_20140924AWS Webcast - Managing Big Data in the AWS Cloud_20140924
AWS Webcast - Managing Big Data in the AWS Cloud_20140924
Amazon Web Services4.9K views
(DAT204) NoSQL? No Worries: Build Scalable Apps on AWS NoSQL Services by Amazon Web Services
(DAT204) NoSQL? No Worries: Build Scalable Apps on AWS NoSQL Services(DAT204) NoSQL? No Worries: Build Scalable Apps on AWS NoSQL Services
(DAT204) NoSQL? No Worries: Build Scalable Apps on AWS NoSQL Services
Amazon Web Services2.8K views
IBM Cloud Native Day April 2021: Serverless Data Lake by Torsten Steinbach
IBM Cloud Native Day April 2021: Serverless Data LakeIBM Cloud Native Day April 2021: Serverless Data Lake
IBM Cloud Native Day April 2021: Serverless Data Lake
Torsten Steinbach105 views
Next Generation Data Platforms - Deon Thomas by Thoughtworks
Next Generation Data Platforms - Deon ThomasNext Generation Data Platforms - Deon Thomas
Next Generation Data Platforms - Deon Thomas
Thoughtworks3.2K views
Виталий Бондаренко "Fast Data Platform for Real-Time Analytics. Architecture ... by Fwdays
Виталий Бондаренко "Fast Data Platform for Real-Time Analytics. Architecture ...Виталий Бондаренко "Fast Data Platform for Real-Time Analytics. Architecture ...
Виталий Бондаренко "Fast Data Platform for Real-Time Analytics. Architecture ...
Fwdays515 views
Module 2 - Datalake by Lam Le
Module 2 - DatalakeModule 2 - Datalake
Module 2 - Datalake
Lam Le275 views
Databases in the Cloud - DevDay Austin 2017 Day 2 by Amazon Web Services
Databases in the Cloud - DevDay Austin 2017 Day 2Databases in the Cloud - DevDay Austin 2017 Day 2
Databases in the Cloud - DevDay Austin 2017 Day 2
Serverless Analytics with Amazon Redshift Spectrum, AWS Glue, and Amazon Quic... by Amazon Web Services
Serverless Analytics with Amazon Redshift Spectrum, AWS Glue, and Amazon Quic...Serverless Analytics with Amazon Redshift Spectrum, AWS Glue, and Amazon Quic...
Serverless Analytics with Amazon Redshift Spectrum, AWS Glue, and Amazon Quic...
Amazon Web Services3.9K views
Big Data Goes Airborne. Propelling Your Big Data Initiative with Ironcluster ... by Precisely
Big Data Goes Airborne. Propelling Your Big Data Initiative with Ironcluster ...Big Data Goes Airborne. Propelling Your Big Data Initiative with Ironcluster ...
Big Data Goes Airborne. Propelling Your Big Data Initiative with Ironcluster ...
Precisely855 views

More from gdusbabek

My Futuristic Vision of the Future of Cassandra's Future - NGCC 2015 by
My Futuristic Vision of the Future of Cassandra's Future - NGCC 2015My Futuristic Vision of the Future of Cassandra's Future - NGCC 2015
My Futuristic Vision of the Future of Cassandra's Future - NGCC 2015gdusbabek
1.1K views55 slides
How To (Not) Open Source - Javazone, Oslo 2014 by
How To (Not) Open Source - Javazone, Oslo 2014How To (Not) Open Source - Javazone, Oslo 2014
How To (Not) Open Source - Javazone, Oslo 2014gdusbabek
745 views180 slides
Measure All the Things! - Austin Data Day 2014 by
Measure All the Things! - Austin Data Day 2014Measure All the Things! - Austin Data Day 2014
Measure All the Things! - Austin Data Day 2014gdusbabek
1.3K views106 slides
Introduction to Blueflood at Berlin Buzzwords 2013 by
Introduction to Blueflood at Berlin Buzzwords 2013Introduction to Blueflood at Berlin Buzzwords 2013
Introduction to Blueflood at Berlin Buzzwords 2013gdusbabek
1.6K views164 slides
Rackspace Cloud Monitoring - Strata NYC by
Rackspace Cloud Monitoring - Strata NYCRackspace Cloud Monitoring - Strata NYC
Rackspace Cloud Monitoring - Strata NYCgdusbabek
1.2K views43 slides
Austin cassandra meetup by
Austin cassandra meetupAustin cassandra meetup
Austin cassandra meetupgdusbabek
768 views33 slides

More from gdusbabek(14)

My Futuristic Vision of the Future of Cassandra's Future - NGCC 2015 by gdusbabek
My Futuristic Vision of the Future of Cassandra's Future - NGCC 2015My Futuristic Vision of the Future of Cassandra's Future - NGCC 2015
My Futuristic Vision of the Future of Cassandra's Future - NGCC 2015
gdusbabek1.1K views
How To (Not) Open Source - Javazone, Oslo 2014 by gdusbabek
How To (Not) Open Source - Javazone, Oslo 2014How To (Not) Open Source - Javazone, Oslo 2014
How To (Not) Open Source - Javazone, Oslo 2014
gdusbabek745 views
Measure All the Things! - Austin Data Day 2014 by gdusbabek
Measure All the Things! - Austin Data Day 2014Measure All the Things! - Austin Data Day 2014
Measure All the Things! - Austin Data Day 2014
gdusbabek1.3K views
Introduction to Blueflood at Berlin Buzzwords 2013 by gdusbabek
Introduction to Blueflood at Berlin Buzzwords 2013Introduction to Blueflood at Berlin Buzzwords 2013
Introduction to Blueflood at Berlin Buzzwords 2013
gdusbabek1.6K views
Rackspace Cloud Monitoring - Strata NYC by gdusbabek
Rackspace Cloud Monitoring - Strata NYCRackspace Cloud Monitoring - Strata NYC
Rackspace Cloud Monitoring - Strata NYC
gdusbabek1.2K views
Austin cassandra meetup by gdusbabek
Austin cassandra meetupAustin cassandra meetup
Austin cassandra meetup
gdusbabek768 views
How Rackspace Cloud Monitoring uses Cassandra by gdusbabek
How Rackspace Cloud Monitoring uses CassandraHow Rackspace Cloud Monitoring uses Cassandra
How Rackspace Cloud Monitoring uses Cassandra
gdusbabek1.5K views
Breaking the Relational Headlock: A Survey of NoSQL Datastores by gdusbabek
Breaking the Relational Headlock: A Survey of NoSQL DatastoresBreaking the Relational Headlock: A Survey of NoSQL Datastores
Breaking the Relational Headlock: A Survey of NoSQL Datastores
gdusbabek2K views
Building Rackspace Cloud Monitoring by gdusbabek
Building Rackspace Cloud MonitoringBuilding Rackspace Cloud Monitoring
Building Rackspace Cloud Monitoring
gdusbabek1.4K views
Cassandra Codebase 2011 by gdusbabek
Cassandra Codebase 2011Cassandra Codebase 2011
Cassandra Codebase 2011
gdusbabek2.2K views
Data Modeling with Cassandra Column Families by gdusbabek
Data Modeling with Cassandra Column FamiliesData Modeling with Cassandra Column Families
Data Modeling with Cassandra Column Families
gdusbabek6.1K views
Getting to Know the Cassandra Codebase by gdusbabek
Getting to Know the Cassandra CodebaseGetting to Know the Cassandra Codebase
Getting to Know the Cassandra Codebase
gdusbabek3.3K views
Introduction to Cassandra (June 2010) by gdusbabek
Introduction to Cassandra (June 2010)Introduction to Cassandra (June 2010)
Introduction to Cassandra (June 2010)
gdusbabek8.1K views
Cassandra Presentation for San Antonio JUG by gdusbabek
Cassandra Presentation for San Antonio JUGCassandra Presentation for San Antonio JUG
Cassandra Presentation for San Antonio JUG
gdusbabek4.5K views

Recently uploaded

This talk was not generated with ChatGPT: how AI is changing science by
This talk was not generated with ChatGPT: how AI is changing scienceThis talk was not generated with ChatGPT: how AI is changing science
This talk was not generated with ChatGPT: how AI is changing scienceElena Simperl
34 views13 slides
The Power of Generative AI in Accelerating No Code Adoption.pdf by
The Power of Generative AI in Accelerating No Code Adoption.pdfThe Power of Generative AI in Accelerating No Code Adoption.pdf
The Power of Generative AI in Accelerating No Code Adoption.pdfSaeed Al Dhaheri
44 views18 slides
AI + Memoori = AIM by
AI + Memoori = AIMAI + Memoori = AIM
AI + Memoori = AIMMemoori
15 views9 slides
Discover Aura Workshop (12.5.23).pdf by
Discover Aura Workshop (12.5.23).pdfDiscover Aura Workshop (12.5.23).pdf
Discover Aura Workshop (12.5.23).pdfNeo4j
20 views55 slides
The Coming AI Tsunami.pptx by
The Coming AI Tsunami.pptxThe Coming AI Tsunami.pptx
The Coming AI Tsunami.pptxjohnhandby
14 views12 slides
Future of AR - Facebook Presentation by
Future of AR - Facebook PresentationFuture of AR - Facebook Presentation
Future of AR - Facebook PresentationRob McCarty
66 views27 slides

Recently uploaded(20)

This talk was not generated with ChatGPT: how AI is changing science by Elena Simperl
This talk was not generated with ChatGPT: how AI is changing scienceThis talk was not generated with ChatGPT: how AI is changing science
This talk was not generated with ChatGPT: how AI is changing science
Elena Simperl34 views
The Power of Generative AI in Accelerating No Code Adoption.pdf by Saeed Al Dhaheri
The Power of Generative AI in Accelerating No Code Adoption.pdfThe Power of Generative AI in Accelerating No Code Adoption.pdf
The Power of Generative AI in Accelerating No Code Adoption.pdf
Saeed Al Dhaheri44 views
AI + Memoori = AIM by Memoori
AI + Memoori = AIMAI + Memoori = AIM
AI + Memoori = AIM
Memoori15 views
Discover Aura Workshop (12.5.23).pdf by Neo4j
Discover Aura Workshop (12.5.23).pdfDiscover Aura Workshop (12.5.23).pdf
Discover Aura Workshop (12.5.23).pdf
Neo4j20 views
The Coming AI Tsunami.pptx by johnhandby
The Coming AI Tsunami.pptxThe Coming AI Tsunami.pptx
The Coming AI Tsunami.pptx
johnhandby14 views
Future of AR - Facebook Presentation by Rob McCarty
Future of AR - Facebook PresentationFuture of AR - Facebook Presentation
Future of AR - Facebook Presentation
Rob McCarty66 views
Digital Personal Data Protection (DPDP) Practical Approach For CISOs by Priyanka Aash
Digital Personal Data Protection (DPDP) Practical Approach For CISOsDigital Personal Data Protection (DPDP) Practical Approach For CISOs
Digital Personal Data Protection (DPDP) Practical Approach For CISOs
Priyanka Aash171 views
Don’t Make A Human Do A Robot’s Job! : 6 Reasons Why AI Will Save Us & Not De... by Moses Kemibaro
Don’t Make A Human Do A Robot’s Job! : 6 Reasons Why AI Will Save Us & Not De...Don’t Make A Human Do A Robot’s Job! : 6 Reasons Why AI Will Save Us & Not De...
Don’t Make A Human Do A Robot’s Job! : 6 Reasons Why AI Will Save Us & Not De...
Moses Kemibaro38 views
"Node.js Development in 2024: trends and tools", Nikita Galkin by Fwdays
"Node.js Development in 2024: trends and tools", Nikita Galkin "Node.js Development in 2024: trends and tools", Nikita Galkin
"Node.js Development in 2024: trends and tools", Nikita Galkin
Fwdays37 views
Zero to Cloud Hero: Crafting a Private Cloud from Scratch with XCP-ng, Xen Or... by ShapeBlue
Zero to Cloud Hero: Crafting a Private Cloud from Scratch with XCP-ng, Xen Or...Zero to Cloud Hero: Crafting a Private Cloud from Scratch with XCP-ng, Xen Or...
Zero to Cloud Hero: Crafting a Private Cloud from Scratch with XCP-ng, Xen Or...
ShapeBlue209 views
Redefining the book supply chain: A glimpse into the future - Tech Forum 2023 by BookNet Canada
Redefining the book supply chain: A glimpse into the future - Tech Forum 2023Redefining the book supply chain: A glimpse into the future - Tech Forum 2023
Redefining the book supply chain: A glimpse into the future - Tech Forum 2023
BookNet Canada46 views
Measurecamp Brussels - Synthetic data.pdf by Human37
Measurecamp Brussels - Synthetic data.pdfMeasurecamp Brussels - Synthetic data.pdf
Measurecamp Brussels - Synthetic data.pdf
Human37 27 views
Enabling DPU Hardware Accelerators in XCP-ng Cloud Platform Environment - And... by ShapeBlue
Enabling DPU Hardware Accelerators in XCP-ng Cloud Platform Environment - And...Enabling DPU Hardware Accelerators in XCP-ng Cloud Platform Environment - And...
Enabling DPU Hardware Accelerators in XCP-ng Cloud Platform Environment - And...
ShapeBlue120 views
Cocktail of Environments. How to Mix Test and Development Environments and St... by Aleksandr Tarasov
Cocktail of Environments. How to Mix Test and Development Environments and St...Cocktail of Environments. How to Mix Test and Development Environments and St...
Cocktail of Environments. How to Mix Test and Development Environments and St...
"Node.js vs workers — A comparison of two JavaScript runtimes", James M Snell by Fwdays
"Node.js vs workers — A comparison of two JavaScript runtimes", James M Snell"Node.js vs workers — A comparison of two JavaScript runtimes", James M Snell
"Node.js vs workers — A comparison of two JavaScript runtimes", James M Snell
Fwdays14 views
"Package management in monorepos", Zoltan Kochan by Fwdays
"Package management in monorepos", Zoltan Kochan"Package management in monorepos", Zoltan Kochan
"Package management in monorepos", Zoltan Kochan
Fwdays37 views

Blueflood: Open Source Metrics Processing at CassandraEU 2013