C* Summit 2013: Time-Series Metrics with Cassandra by Mike Heffner

•

5 likes•2,017 views

This document discusses using Cassandra to store time-series metrics data. It describes how the schema was matched to storage by using a measurement column family with rows organized by metric ID and time. It also covers optimizing data expiration through techniques like TTL expiration, synchronized compactions, and leveraging immutable sstable modification times. Effective monitoring is emphasized as well, including dashboards to track the ring and using Cassandra log volumes to identify issues.

Technology Business

#CASSANDRA13
Time-Series Metrics with Cassandra
Mike Heffner

#CASSANDRA13
October 2011
l  Decision: All measurements in Cassandra
l  Single EC2 Ring: 6 * m1.large
l  Cassandra 0.8.x
l  How does this work?

#CASSANDRA13
Today
l  Multiple sharded rings
l  ~250,000 writes / second
l  EC2: m1.xlarge and m2.4xlarge
l  Cassandra 1.1.x
l  Read load: < 1%

#CASSANDRA13
Talk Highlights
l  Matching Schema to Storage
l  Optimally Expiring Data
l  Monitor Everything

#CASSANDRA13
What is a Measurement?
( Metric ID, Source )
(X, Y) => (Time stamp, Value)

#CASSANDRA13
Measurement CF
Example: Select measurements between times [T1, T2]:

#CASSANDRA13
Locating Rows
Let us calculate the maximum row size:
l  1 minute records
l  1 week TTL
l  7 days * 24 hours * 60 minutes => 10,080
l  3 Longs * 8 bytes * 10k => ~240KB (not bad)

#CASSANDRA13
Examining CF SSTables
Metrics/metric_id_epochs_60 histograms
Offset SSTables
1 28821
2 58859
3 201198
4 178326
5 223016
6 154952
7 83289
8 21552
10 81104
1 2 3 4 5 6 7 8 10
nodetool cfhistograms Metrics metric_id_epochs_60

$#CASSANDRA13 Splitting the Rows mget(Rows: [12, EBase_30], [12, EBase_40], Columns: {31->45}) Retrieve Time Bases for Times 31->45 for metric ID 12:$

#CASSANDRA13
Examining CF SSTables
Metrics/metric_id_epochs_60
Offset SSTables
1 28821
2 58859
3 201198
4 178326
5 223016
6 154952
7 83289
8 21552
10 81104
1 2 3 4 5 6 7 8 10
nodetool cfhistograms Metrics metric_id_epochs_60
Metrics/metric_id_epochs_60
Offset SSTables
1 3491820
2 5389762
3 4095760
4 1310741
5 9976 1 2 3 4 5 6 7 8 9 10
Before
After

#CASSANDRA13
TTL Expiration
l  Churn of about 750GB / day
l  12 TB total
l  6% of data set
l  gc_grace = 0
l  STC

#CASSANDRA13
* http://hight3ch.com/garbage-truck-crushing-a-car/

#CASSANDRA13
Cleanup
l  Not just for topology changes
l  Tombstoned rows (not referenced)
l  Rotated row keys decrease references
l  Cons: Must process every sstable.

#CASSANDRA13
Leverage SStable Mod Time
l  If now – mtime > TTL => all data is expired
l  We can quickly eliminate entire sstables:
find -mtime +<TTL> -name *.db | xargs rm
l  Fast and low overhead
l  Cons: Rolling restart
26G 2013-05-17 09:44 Metrics-metrics_60-hf-7209-Data.db

#CASSANDRA13
nodetool setcompactionthreshold

#CASSANDRA13
Increasing minor compactions
l  By default, STC requires a minimum of 4 ssts
l  Leads to large non-compacted sstables
l  Dropping to 2 can flatten the storage growth
nodetool setcompactionthreshold <ks> <cf> 2
l  Cons: CPU/IO increase

#CASSANDRA13
Disk Errors => Throw Away
l  If you ever see this, replace!
end_request: I/O error, dev xvdb, sector 467940617
end_request: I/O error, dev xvdb, sector 467940617
l  Mark node down, bootstrap new
l  No metric for this?

#CASSANDRA13
Cassandra Log Volume
l  Count log lines seen every 10 minutes
l  Track over time
l  Can identify:
-  Unbalanced workloads
-  Schema disagreements
-  Phantom gossip nodes
-  GC activity
l  grep -v '.java' => exceptions

#CASSANDRA13
Q & A
Mike Heffner
/mheffner
/mheffner

What's hot

InfluxDb and Grafana fighting with dataIvan Vaskevych

FWI without tears: a forward modeling-free gradientMarcelo Guarido de Andrade

Redis Day TLV 2018 - Redis as a Time-Series DBRedis Labs

Garbage Collection In MicorosoftSmithaNatarajamurthy

nebulaconfPedro Dias

Cassandra at talkbitsMax Alexejev

MongoDB - Warehouse and Aggregator of EventsMaxim Ligus

ACDKOCHI19 - Become Thanos of the Lambda Land: Wield all the Infinity StonesAWS User Group Kochi

LOFAR - finding transients in the radio spectrumGijs Molenaar

Cassandra 2.1 boot camp, CompactionJoshua McKenzie

10 Devops-Friendly Database Must-Haves - Dor Laor, ScyllaDB - DevOpsDays Tel ...DevOpsDays Tel Aviv

Cassandra&map reducevlaskinvlad

Kubernetes London - Musings on Kubernetes OperatorsStuart Leeks

Debugging and Profiling Rails ApplicationDavid Paluy

Ndb cluster 80_tpc_hmikaelronstrom

SqliteToRealmPluu love

Solr Power FTW: Powering NoSQL the World OverAlex Pinkin

Lab 01 03_16Hao Wu

C* Summit EU 2013: Analytics On Top of Cassandra and HadoopDataStax Academy

CrPhani madhuri

What's hot (20)

InfluxDb and Grafana fighting with data

FWI without tears: a forward modeling-free gradient

Redis Day TLV 2018 - Redis as a Time-Series DB

Garbage Collection In Micorosoft

nebulaconf

Cassandra at talkbits

MongoDB - Warehouse and Aggregator of Events

ACDKOCHI19 - Become Thanos of the Lambda Land: Wield all the Infinity Stones

LOFAR - finding transients in the radio spectrum

Cassandra 2.1 boot camp, Compaction

10 Devops-Friendly Database Must-Haves - Dor Laor, ScyllaDB - DevOpsDays Tel ...

Cassandra&map reduce

Kubernetes London - Musings on Kubernetes Operators

Debugging and Profiling Rails Application

Ndb cluster 80_tpc_h

SqliteToRealm

Solr Power FTW: Powering NoSQL the World Over

Lab 01 03_16

C* Summit EU 2013: Analytics On Top of Cassandra and Hadoop

Similar to C* Summit 2013: Time-Series Metrics with Cassandra by Mike Heffner

C* Summit 2013: No Whistling Required: Cabs, Cassandra, and Hailo by Dave Gar...DataStax Academy

Data Science Lab Meetup: Cassandra and SparkChristopher Batey

C* Summit 2013: Cassandra at Instagram by Rick BransonDataStax Academy

Cassandra 3.x et la future 4.0Alexander DEJANOVSKI

How Opera Syncs Tens of Millions of Browsers and Sleeps Well at NightScyllaDB

Cassandra at Instagram (August 2013)Rick Branson

Use ScyllaDB Alternator to Use Amazon DynamoDB API, Everywhere, Better, More ...ScyllaDB

Big data analytics with Spark & Cassandra Matthias Niehoff

Cassandra Deep Diver & Data ModelingBrian Enochson

Tweaking perfomance on high-load projects_Думанский ДмитрийGeeksLab Odessa

Cassandra To Infinity And BeyondRomain Hardouin

Tweaking performance on high-load projectsDmitriy Dumanskiy

4 use cases for C* to Scylla◄ ★ Jack Pavlov ★ ►

Cassandra Community Webinar | Practice Makes Perfect: Extreme Cassandra Optim...DataStax

C* Summit EU 2013: Practice Makes Perfect: Extreme Cassandra OptimizationDataStax Academy

Mixing Batch and Real-time: Cassandra with Shark (Cassandra Europe 2013)Richard Low

C* Summit EU 2013: Mixing Batch and Real-Time: Cassandra with Shark DataStax Academy

ScyllaDB: NoSQL at Ludicrous SpeedJ On The Beach

TiDB vs Aurora.pdfssuser3fb50b

Cloud Computing in the Cloud (Hadoop.tw Meetup @ 2015/11/23)Jeff Hung

Similar to C* Summit 2013: Time-Series Metrics with Cassandra by Mike Heffner (20)

C* Summit 2013: No Whistling Required: Cabs, Cassandra, and Hailo by Dave Gar...

Data Science Lab Meetup: Cassandra and Spark

C* Summit 2013: Cassandra at Instagram by Rick Branson

Cassandra 3.x et la future 4.0

How Opera Syncs Tens of Millions of Browsers and Sleeps Well at Night

Cassandra at Instagram (August 2013)

Use ScyllaDB Alternator to Use Amazon DynamoDB API, Everywhere, Better, More ...

Big data analytics with Spark & Cassandra

Cassandra Deep Diver & Data Modeling

Tweaking perfomance on high-load projects_Думанский Дмитрий

Cassandra To Infinity And Beyond

Tweaking performance on high-load projects

4 use cases for C* to Scylla

Cassandra Community Webinar | Practice Makes Perfect: Extreme Cassandra Optim...

C* Summit EU 2013: Practice Makes Perfect: Extreme Cassandra Optimization

Mixing Batch and Real-time: Cassandra with Shark (Cassandra Europe 2013)

C* Summit EU 2013: Mixing Batch and Real-Time: Cassandra with Shark

ScyllaDB: NoSQL at Ludicrous Speed

TiDB vs Aurora.pdf

Cloud Computing in the Cloud (Hadoop.tw Meetup @ 2015/11/23)

Recently uploaded

Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticscarlostorres15106

AI as an Interface for Commercial BuildingsMemoori

WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure servicePooja Nehwal

Benefits Of Flutter Compared To Other FrameworksSoftradix Technologies

SIEMENS: RAPUNZEL – A Tale About Knowledge GraphNeo4j

Factors to Consider When Choosing Accounts Payable Services Providers.pptxKatpro Technologies

04-2024-HHUG-Sales-and-Marketing-Alignment.pptxHampshireHUG

Presentation on how to chat with PDF using ChatGPT code interpreternaman860154

Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationSafe Software

Slack Application Development 101 Slidespraypatel2

Azure Monitor & Application Insight to monitor Infrastructure & ApplicationAndikSusilo4

GenCyber Cyber Security Day PresentationMichael W. Hawkins

Human Factors of XR: Using Human Factors to Design XR SystemsMark Billinghurst

Install Stable Diffusion in windows machinePadma Pradeep

08448380779 Call Girls In Civil Lines Women Seeking MenDelhi Call girls

Scaling API-first – The story of a global engineering organizationRadu Cotescu

Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Patryk Bandurski

Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 3652toLead Limited

From Event to Action: Accelerate Your Decision Making with Real-Time AutomationSafe Software

Understanding the Laravel MVC ArchitecturePixlogix Infotech

Recently uploaded (20)

Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics

AI as an Interface for Commercial Buildings

WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service

Benefits Of Flutter Compared To Other Frameworks

SIEMENS: RAPUNZEL – A Tale About Knowledge Graph

Factors to Consider When Choosing Accounts Payable Services Providers.pptx

04-2024-HHUG-Sales-and-Marketing-Alignment.pptx

Presentation on how to chat with PDF using ChatGPT code interpreter

Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation

Slack Application Development 101 Slides

Azure Monitor & Application Insight to monitor Infrastructure & Application

GenCyber Cyber Security Day Presentation

Human Factors of XR: Using Human Factors to Design XR Systems

Install Stable Diffusion in windows machine

08448380779 Call Girls In Civil Lines Women Seeking Men

Scaling API-first – The story of a global engineering organization

Integration and Automation in Practice: CI/CD in Mule Integration and Automat...

Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365

From Event to Action: Accelerate Your Decision Making with Real-Time Automation

Understanding the Laravel MVC Architecture

C* Summit 2013: Time-Series Metrics with Cassandra by Mike Heffner

1. #CASSANDRA13 Time-Series Metrics with Cassandra Mike Heffner

2. #CASSANDRA13 What we do.

3. #CASSANDRA13 October 2011 l  Decision: All measurements in Cassandra l  Single EC2 Ring: 6 * m1.large l  Cassandra 0.8.x l  How does this work?

4. #CASSANDRA13 Today l  Multiple sharded rings l  ~250,000 writes / second l  EC2: m1.xlarge and m2.4xlarge l  Cassandra 1.1.x l  Read load: < 1%

5. #CASSANDRA13 Talk Highlights l  Matching Schema to Storage l  Optimally Expiring Data l  Monitor Everything

6. #CASSANDRA13 Matching Schema to Storage

7. #CASSANDRA13 What is a Measurement? ( Metric ID, Source ) (X, Y) => (Time stamp, Value)

8. #CASSANDRA13 Measurement CF Example: Select measurements between times [T1, T2]:

9. #CASSANDRA13 Locating Rows Let us calculate the maximum row size: l  1 minute records l  1 week TTL l  7 days * 24 hours * 60 minutes => 10,080 l  3 Longs * 8 bytes * 10k => ~240KB (not bad)

10. #CASSANDRA13 Row Storage Over Time

11. #CASSANDRA13 Row Storage Over Time

12. #CASSANDRA13 Seek All The SStables

13. #CASSANDRA13 Examining CF SSTables Metrics/metric_id_epochs_60 histograms Offset SSTables 1 28821 2 58859 3 201198 4 178326 5 223016 6 154952 7 83289 8 21552 10 81104 1 2 3 4 5 6 7 8 10 nodetool cfhistograms Metrics metric_id_epochs_60

14. #CASSANDRA13 Splitting the Rows mget(Rows: [12, EBase_30], [12, EBase_40], Columns: {31->45}) Retrieve Time Bases for Times 31->45 for metric ID 12:

15. #CASSANDRA13 Examining CF SSTables Metrics/metric_id_epochs_60 Offset SSTables 1 28821 2 58859 3 201198 4 178326 5 223016 6 154952 7 83289 8 21552 10 81104 1 2 3 4 5 6 7 8 10 nodetool cfhistograms Metrics metric_id_epochs_60 Metrics/metric_id_epochs_60 Offset SSTables 1 3491820 2 5389762 3 4095760 4 1310741 5 9976 1 2 3 4 5 6 7 8 9 10 Before After

16. #CASSANDRA13 /graph me

17. #CASSANDRA13 Optimally Expiring Data

18. #CASSANDRA13 TTL Expiration l  Churn of about 750GB / day l  12 TB total l  6% of data set l  gc_grace = 0 l  STC

19. #CASSANDRA13 Synchronized Compactions

20. #CASSANDRA13

21. #CASSANDRA13 nodetool compact

22. #CASSANDRA13 * http://hight3ch.com/garbage-truck-crushing-a-car/

23. #CASSANDRA13 nodetool cleanup

24. #CASSANDRA13 Cleanup l  Not just for topology changes l  Tombstoned rows (not referenced) l  Rotated row keys decrease references l  Cons: Must process every sstable.

25. #CASSANDRA13 Immutable SStables

26. #CASSANDRA13 Leverage SStable Mod Time l  If now – mtime > TTL => all data is expired l  We can quickly eliminate entire sstables: find -mtime +<TTL> -name *.db | xargs rm l  Fast and low overhead l  Cons: Rolling restart 26G 2013-05-17 09:44 Metrics-metrics_60-hf-7209-Data.db

27. #CASSANDRA13 nodetool setcompactionthreshold

28. #CASSANDRA13 Increasing minor compactions l  By default, STC requires a minimum of 4 ssts l  Leads to large non-compacted sstables l  Dropping to 2 can flatten the storage growth nodetool setcompactionthreshold <ks> <cf> 2 l  Cons: CPU/IO increase

29. #CASSANDRA13 Result

30. #CASSANDRA13 Effective Monitoring

31. #CASSANDRA13 Ring Dashboards

32. #CASSANDRA13 Disk Errors => Throw Away l  If you ever see this, replace! end_request: I/O error, dev xvdb, sector 467940617 end_request: I/O error, dev xvdb, sector 467940617 l  Mark node down, bootstrap new l  No metric for this?

33. #CASSANDRA13 Cassandra Log Volume l  Count log lines seen every 10 minutes l  Track over time l  Can identify: -  Unbalanced workloads -  Schema disagreements -  Phantom gossip nodes -  GC activity l  grep -v '.java' => exceptions

34. #CASSANDRA13 Q & A Mike Heffner /mheffner /mheffner

C* Summit 2013: Time-Series Metrics with Cassandra by Mike Heffner

Recommended

Recommended

More Related Content

What's hot

What's hot (20)

Similar to C* Summit 2013: Time-Series Metrics with Cassandra by Mike Heffner

Similar to C* Summit 2013: Time-Series Metrics with Cassandra by Mike Heffner (20)

More from DataStax Academy

More from DataStax Academy (20)

Recently uploaded

Recently uploaded (20)

C* Summit 2013: Time-Series Metrics with Cassandra by Mike Heffner