Scaling Twitter with Cassandra

Ryan King

Scaling Twitter
with Cassandra
Ryan King
Storage Team

bit.ly/chirpcassandra

ryan@twitter.com

@rk

Legacy
• vertically & horizontally partitioned MySQL

• memcached (rows, indexes and fragments)

• application managed

Legacy Drawbacks
• many single-points-of-failure

• hardware-intensive

• manpower-intensive

• tight coupling

Apache Cassandra
• Apache top level project

• originally developed at Facebook

• in use at Rackspace, Digg, SimpleGeo, Twitter, etc.

Why Cassandra?
• highly available

• consistent, eventually

• decentralized

• fault tolerant

• elastic

• flexible schema

• high write throughput

What is Cassandra?
• distributed database

• data model from Google's BigTable

• infrastructure from Amazon's Dynamo

Cassandra Data Model
• keyspaces

• column families

• columns

• super columns
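
As a rough mental model (an illustration, not Cassandra's API), the four terms above can be pictured as nested maps: a keyspace holds column families, a column family holds rows, and a row holds columns, with a super column adding one more level of nesting. Every name in this sketch (`Users`, `Timeline`, `get_column`) is hypothetical.

```python
# Illustrative only: nested dicts standing in for the data model.
# All names here are hypothetical and not taken from the talk.
keyspace = {
    "Users": {                      # column family (roughly: table)
        "rk": {                     # row key
            "name": "Ryan King",    # column (roughly: attribute)
            "team": "Storage",
        },
    },
    "Timeline": {                   # column family holding super columns
        "rk": {
            "2010-04-14": {         # super column: a named group of columns
                "tweet_1": "hello",
                "tweet_2": "world",
            },
        },
    },
}


def get_column(cf, row_key, column):
    """Look up a single column value in a standard column family."""
    return keyspace[cf][row_key][column]
```

Reading a value under a super column takes one more key than `get_column` does, which is the only structural difference between the two column family types in this picture.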

Cassandra Infrastructure
• partitioners

• storage

• querying

Partitioners
• order-preserving

• random

• custom
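
A minimal sketch of how a partitioner places keys, assuming the ring layout described in the editor's notes (nodes sit on a ring, a key maps to a position on it and is written to the next N machines). The `Ring` class, one token per node, and the MD5-derived tokens are assumptions made for illustration; this is not Cassandra's implementation.

```python
import bisect
import hashlib


class Ring:
    """Toy token ring: each node owns one token; keys go to the next N nodes."""

    def __init__(self, nodes, replicas=3):
        self.replicas = replicas
        # Random-style partitioning: derive each node's token from a hash.
        self.tokens = sorted((self._token(n), n) for n in nodes)

    @staticmethod
    def _token(value):
        return int(hashlib.md5(value.encode("utf-8")).hexdigest(), 16)

    def replicas_for(self, key):
        """Walk clockwise from the key's position and take the next N nodes."""
        start = bisect.bisect_left(self.tokens, (self._token(key), ""))
        count = min(self.replicas, len(self.tokens))
        return [self.tokens[(start + i) % len(self.tokens)][1]
                for i in range(count)]
```

In this sketch an order-preserving partitioner would use the key itself as the token instead of its hash, which keeps adjacent keys on adjacent nodes at the cost of less even load.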

Storage
• commit log

• memtables

• sstables

• compaction

• bloom filters

• indexes

• key cache

• row cache
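
The storage pieces above fit together roughly as follows. This is a heavily simplified sketch: the `ToyStore` class is hypothetical, plain dicts stand in for on-disk sstables, and bloom filters, indexes, and the caches are reduced to a membership check.

```python
class ToyStore:
    """Toy write/read path: commit log -> memtable -> immutable sstables."""

    def __init__(self, memtable_limit=2):
        self.commit_log = []      # append-only record of every write
        self.memtable = {}        # in-memory table holding the latest writes
        self.sstables = []        # flushed tables, oldest first
        self.memtable_limit = memtable_limit

    def write(self, key, value):
        self.commit_log.append((key, value))   # log first, then apply in memory
        self.memtable[key] = value
        if len(self.memtable) >= self.memtable_limit:
            self.flush()

    def flush(self):
        # A real sstable is a sorted file on disk; a sorted dict stands in here.
        self.sstables.append(dict(sorted(self.memtable.items())))
        self.memtable = {}

    def read(self, key):
        if key in self.memtable:
            return self.memtable[key]
        for table in reversed(self.sstables):   # newest sstable wins
            if key in table:                    # stand-in for bloom filter + index
                return table[key]
        return None

    def compact(self):
        """Merge all sstables into one, keeping the newest value per key."""
        merged = {}
        for table in self.sstables:             # oldest first, so newer overwrites
            merged.update(table)
        self.sstables = [dict(sorted(merged.items()))]
```

Writes never modify an existing sstable in this model, which is why reads may have to consult several tables and why compaction exists to merge them.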

Querying
• get

• multiget

• range

• slice
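
The four query shapes can be illustrated on a toy column family. The function names and the sample rows are hypothetical, and they do not match the signatures of any real Cassandra client; the sketch only shows what each operation selects.

```python
# Hypothetical column family: row key -> {column name: value}.
rows = {
    "alice": {"a": 1, "b": 2, "c": 3},
    "bob":   {"a": 4, "b": 5},
    "carol": {"b": 6, "c": 7},
}


def get(key, column):
    """One column of one row."""
    return rows[key][column]


def multiget(keys, column):
    """The same column from several rows; rows lacking it are skipped."""
    return {k: rows[k][column] for k in keys if column in rows.get(k, {})}


def slice_columns(key, start, finish):
    """Columns of one row whose names fall in [start, finish], in column order."""
    return {c: v for c, v in sorted(rows[key].items()) if start <= c <= finish}


def range_rows(start_key, end_key):
    """Row keys in [start_key, end_key]; relies on rows being stored in key order."""
    return [k for k in sorted(rows) if start_key <= k <= end_key]
```

A slice works because columns within a row are ordered. A range over row keys is only meaningful when the partitioner keeps rows in key order.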

Consistency
• N, R, W

• N = number of replicas

• R = read replicas

• W = write replicas

• send request, wait for specified number

• wait for others in background and perform read repair
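
The "send request, wait for specified number" step might look like the sketch below. The `Replica` class and `write_with_level` function are hypothetical, and threads stand in for network calls: the write goes to every replica, the caller returns as soon as the required number have acknowledged, and the remaining requests finish in the background.

```python
from concurrent.futures import ThreadPoolExecutor, as_completed


class Replica:
    """Hypothetical replica; raises when it is marked as down."""

    def __init__(self, up=True):
        self.up = up
        self.data = {}

    def write(self, key, value):
        if not self.up:
            raise ConnectionError("replica unavailable")
        self.data[key] = value


def write_with_level(replicas, key, value, required):
    """Send the write to all replicas; return once `required` have acknowledged."""
    pool = ThreadPoolExecutor(max_workers=len(replicas))
    futures = [pool.submit(r.write, key, value) for r in replicas]
    try:
        if required <= 0:            # nothing to wait for
            return True
        acks = 0
        for future in as_completed(futures):
            if future.exception() is None:
                acks += 1
            if acks >= required:
                return True          # enough replicas answered
        return False                 # every replica answered, too few succeeded
    finally:
        pool.shutdown(wait=False)    # let the remaining writes finish in the background
```

With three replicas and one of them down, a call with `required=2` succeeds and a call with `required=3` fails, which is the trade-off each request makes when it picks its level.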

Consistency Levels
• ZERO

• ONE

• QUORUM

• ALL

Strong Consistency
• If W + R > N, you will have consistency

• W=1, R=N

• W=N, R=1

• W=Q, R=Q where Q = floor(N / 2) + 1
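
The W + R > N rule can be checked by brute force: if every possible set of W written replicas shares at least one node with every possible set of R read replicas, a read always sees the latest write. The helper names below are made up for this sketch.

```python
from itertools import combinations


def quorum(n):
    """Smallest majority of n replicas (integer division)."""
    return n // 2 + 1


def is_strongly_consistent(n, r, w):
    """The rule from the slide: read and write sets must overlap."""
    return r + w > n


def sets_always_overlap(n, r, w):
    """Brute-force check that every write set intersects every read set."""
    nodes = range(n)
    return all(set(ws) & set(rs)
               for ws in combinations(nodes, w)
               for rs in combinations(nodes, r))
```

For N = 3, the quorum is 2, and W = R = 2 satisfies the rule while W = R = 1 does not: with one write replica and one read replica out of three, the read can land on a node the write never reached.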

Eventuality
• Hinted Handoff

• Read Repair

• Proactive Repair (Merkle trees)

Potential Consistency
• causes

• write-through caching

• master-slave replication failures

Read Repair
• send read to all replicas

• if they differ, resolve conflicts and update (in background)
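
A sketch of read repair, assuming conflicts are resolved by keeping the value with the newest timestamp (the slide does not say how conflicts are resolved, so that rule is an assumption here). Each replica is modelled as a plain dict mapping a key to a `(timestamp, value)` pair, and the repair runs inline instead of in the background.

```python
def read_repair(replicas, key):
    """Read `key` from every replica, return the newest value, fix stale copies.

    Each replica is a dict mapping key -> (timestamp, value).
    """
    versions = [r[key] for r in replicas if key in r]
    if not versions:
        return None
    newest = max(versions, key=lambda tv: tv[0])   # highest timestamp wins
    for r in replicas:
        if r.get(key) != newest:
            r[key] = newest                        # background work in practice
    return newest[1]
```

After one call, every replica that was stale or missing the key holds the newest version, so reads gradually repair whatever data they touch.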

Hinted Handoff
• A wants to write to B

• B is down

• A tells C, "when B is back, send them this update"
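
The A, B, C exchange above can be sketched as follows. The `Node` class and `coordinate_write` function are hypothetical: `coordinate_write` plays the role of A, `target` is B, and `fallback` is C, which holds the hint until B is back.

```python
class Node:
    """Hypothetical node that can store data and hold hints for other nodes."""

    def __init__(self, name):
        self.name = name
        self.up = True
        self.data = {}
        self.hints = []   # (target_node, key, value) held for nodes that were down

    def store(self, key, value):
        self.data[key] = value

    def replay_hints(self):
        """Deliver held updates to targets that are back up; keep the rest."""
        remaining = []
        for target, key, value in self.hints:
            if target.up:
                target.store(key, value)
            else:
                remaining.append((target, key, value))
        self.hints = remaining


def coordinate_write(target, fallback, key, value):
    """A's view: write to B (target), or leave a hint with C (fallback)."""
    if target.up:
        target.store(key, value)
    else:
        fallback.hints.append((target, key, value))
```

The hint is only a delivery promise: B does not have the data until C replays it, so a read served by B alone in the meantime would miss the update.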

Proactive Repair
• use Merkle trees to find inconsistencies

• resolve conflicts

• send repaired data

• triggered manually
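
A sketch of how Merkle trees locate inconsistencies: each replica hashes its ranges of data into a tree, the roots are compared first, and only when they differ are the mismatching ranges identified. The function names are hypothetical, and for brevity this version compares leaf hashes directly once the roots differ, whereas a real implementation would descend the tree level by level.

```python
import hashlib


def _h(data):
    return hashlib.sha256(data).hexdigest()


def merkle_levels(leaves):
    """Build a Merkle tree bottom-up; returns levels from leaf hashes to root."""
    level = [_h(leaf.encode("utf-8")) for leaf in leaves]
    levels = [level]
    while len(level) > 1:
        if len(level) % 2:                 # duplicate the last hash on odd levels
            level = level + [level[-1]]
        level = [_h((level[i] + level[i + 1]).encode("utf-8"))
                 for i in range(0, len(level), 2)]
        levels.append(level)
    return levels


def differing_ranges(leaves_a, leaves_b):
    """Indexes of leaf ranges whose hashes differ (equal-length inputs assumed)."""
    a, b = merkle_levels(leaves_a), merkle_levels(leaves_b)
    if a[-1] == b[-1]:                     # equal roots: nothing to repair
        return []
    return [i for i, (x, y) in enumerate(zip(a[0], b[0])) if x != y]
```

Comparing one root hash is enough to confirm two replicas agree, so the expensive part (sending repaired data) is limited to the ranges that actually differ.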

How we're moving
• parallel deployments

• incremental traffic shifting

Parallel Deployment
1. build new implementation
2. integrate it alongside existing
3. ...with switches to dynamically move/mirror traffic

4. turn up traffic
5. break something
6. fix it

7. GOTO 4
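
One way to picture the switches in step 3 is a small router in front of both stores. This is a guess at the general shape, not Twitter's implementation: the `TrafficSwitch` class is hypothetical, dicts stand in for the old and new backends, writes are mirrored to both, and a dial controls what fraction of reads the new system serves.

```python
import random


class TrafficSwitch:
    """Routes reads to the old or new backend and can mirror writes to both."""

    def __init__(self, old, new, new_read_fraction=0.0, mirror_writes=True, rng=None):
        self.old, self.new = old, new
        self.new_read_fraction = new_read_fraction   # 0.0 = all old, 1.0 = all new
        self.mirror_writes = mirror_writes
        self.rng = rng or random.Random()

    def write(self, key, value):
        self.old[key] = value
        if self.mirror_writes:
            try:
                self.new[key] = value
            except Exception:
                pass   # a failure in the new path must not break the existing one

    def read(self, key):
        if self.rng.random() < self.new_read_fraction:
            return self.new.get(key)
        return self.old.get(key)
```

Steps 4 to 7 then amount to raising `new_read_fraction` a little, watching for breakage, lowering it or fixing the problem, and repeating.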

Editor's Notes

* storage team
* personal background

* began working on this problem last June
* complexity had grown unmanageable
* multiple internal customers
* error domain grows as data size and complexity grow

* every master DB is a SPOF (failover is hard to pull off without strong coordination)
* SPOFs lead to expensive hardware
* app-managed hosts mean tight coupling

* our application is already tolerant of eventual consistency (actually more tolerant...)
* in addition to scale, we want more flexibility than relational data models give us

* keyspace: database
* CF (column family): table
* column: attribute
* SC (super column): collection of attributes

[insert diagrams of ring + tokens]
* nodes are arranged on a ring
* keys are mapped to the ring and written to the next N machines
* partitioners map keys to the ring

[flow chart of how updates happen]

* if OPP (order-preserving partitioner), rows are ordered
* columns are ordered
[diagram of range and slice]

* insert to MySQL
* insert into memcache
* replicate to slave
* update MySQL
* insert into memcache fails
* replication to slave fails

* launching is shifting from roll back to roll forward
