SlideShare a Scribd company logo
1 of 35
Download to read offline
Better Together: Apache
Cassandra and Apache Kafka
1
2 © DataStax, All Rights Reserved.
Confidential
Agenda
1 Apache Cassandra and Apache Kafka
2 Better Together – Common Patterns
3 DataStax Kafka Connector
4 Demonstration
5 Resources
Your Presenters
3 © DataStax, All Rights Reserved.
4 © DataStax, All Rights Reserved.
Confidential
Agenda
1 Apache Cassandra and Apache Kafka
2 Better Together – Common Patterns
3 DataStax Kafka Connector
4 Demonstration
5 Resources
Apache Cassandra Overview
• First developed by Facebook
• Top-level Apache project since 2010
• Partitioned row store
• Distributed, decentralized
• Elastic scalability / high performance
• High availability / fault tolerant
• Tuneable consistency
• Cassandra Query Language (CQL)
© DataStax, All Rights Reserved.5 Apache Cassandra ® Apache Software Foundation
Apache Kafka Overview
• First developed by LinkedIn
• Top-level Apache Project since 2012
• Distributed streaming platform
• Used for real-time data pipelines and
streaming applications
• Horizontal scalability / high performance
• High availability / Fault tolerance
• Stream persistence and querying
(KSQL)
• Connect framework
6 © DataStax, All Rights Reserved. Apache Kafka ® Apache Software Foundation
Kafka Concepts
• Topics
– Collection of key/value pairs
– Append-only
– Can be partitioned
• Producers
• Consumers
– Separate offsets
7 © DataStax, All Rights Reserved.
Kafka Concepts
• Streams applications
– Combined Producer/Consumer
• KSQL
– Query language used by stream
applications
8 © DataStax, All Rights Reserved.
Kafka Concepts
• Brokers
• Clusters
• Connect Framework
– Sources
– Sinks
9 © DataStax, All Rights Reserved.
Cassandra + Kafka – Similarities and Distinctives
• Concepts in common
– Distributed Systems
– Partitioning / Hashing
– Replication
• Slight differences in implementation
– Multi-DC
– Log-structured
– TTL / retention
• Cassandra excels at…
– High volume, write intensive data storage
workloads at scale
– Suitable as a system of record
– High performance searching via DSE
• Kafka excels at…
– Streaming data to/from services and legacy
data sources
– Acting upon changes in data from multiple
sources (aka pipelines)
10 © DataStax, All Rights Reserved.
+
Better Together – using the best of both
12 © DataStax, All Rights Reserved.
Confidential
Agenda
1 Apache Cassandra and Apache Kafka
2 Better Together – Common Patterns
3 DataStax Kafka Connector
4 Demonstration
5 Resources
Pattern 1: Cassandra + Kafka in Microservices
13 © DataStax, All Rights Reserved.
Some
Producer
My
microservice
DataStax Enterprise
• Consume
topic(s)
Other
consumers
• Read /
write data
• Publish to
topic(s)
KillrVideo Services Suggested
Videos
Service
DataStax Enterprise
DSE Graph
• UserCreated
• YouTubeVideoAdded
• UserRatedVideo • Populate graph
• Graph recommender
traversal
• Read and
write data
User Management, Video
Catalog, Ratings
Cassandra + Kafka – KillrVideo Example
Confidential© DataStax, All Rights Reserved.
Pattern 2: Kafka into Cassandra
15
16 © DataStax, All Rights Reserved.
Confidential
Agenda
1 Apache Cassandra and Apache Kafka
2 Better Together – Common Patterns
3 DataStax Kafka Connector
4 Demonstration
5 Resources
Why a Kafka Connector ?
Account &
Product Usage
Geolocation
ClickStream
Mobile Device
System of records
Event Sources
…
Topic: stocks-ticks
Why a Kafka Connector?
• Spark Streaming = PULL
➢ Enable advanced transformations and
computations
➢ Mode Pull with a dedicated runtime (poll)
18
• Kakfa Connector = PUSH
➢ No extra runtime
What is the Kafka Connector ?
Account &
Product Usage
Geolocation
ClickStream
Mobile Device
System of records
Event Sources
…
Topic: stocks-ticks
Kafka Connect
Sources Sinks
HERE
What is the Kafka Connector?
• Automatically ingest from Kafka to DSE
– Simple, Fast, Flexible, Secure
• Deployed in the Kafka Connect framework
– Managed through the built-in REST API
• Visibility into running connectors and tasks
• Endpoints for operator tasks
– Automatic rebalancing
• Useful for availability and scaling
20
Kafka Connect
Sources Sinks
HERE
What is the Kafka Connector?
• Built by DataStax drivers team
– Best practices for writing to DSE
– Resiliency of DS drivers
21
Kafka Connect
Sources Sinks
HERE
What is the Kafka Connector?
22
Kafka Connect
Data Sinks
DataStax Connector
workers
<standalone-worker>.properties
<distributed-worker>.properties
or
reads
Connectors list
<connector>.properties
or
<connector>.json
read
Config (mapping)
instantiate
start
23 © DataStax, All Rights Reserved.
Confidential
Agenda
1 Apache Cassandra and Apache Kafka
2 Better Together – Common Patterns
3 DataStax Kafka Connector
4 Demonstration
5 Resources
Academy.datastax.com/downloads
24
What versions does this work with?
25
DSE 5.0+
Supported Versions
- DSE 5.0+
- Confluent 3.2+
- Apache Kafka 0.10.2+
Supported offerings
- DS Enterprise
- DS Basic
- DDAC
Confluent Apache Kafka
3.2.x+ 0.10.2.x+
3.3.x+ 0.11.0.x+
4.0.x+ 1.0.x+
4.1.x+ 1.1.x+
5.0.x+ 2.0.x+
Docker and Datastax
26 Confidential
• WHERE
– https://hub.docker.com/u/datastax/
– https://github.com/datastax/docker-
images/tree/master/datastax-docker-image-
examples
• We provide
– Dockers images for DSE, studio, Opscenter
– Docker-compose configuration files
– Sample Deployments
• We support
– Installation on dev before 6.7
– Installation on prod from 6.7 (December 2018)
https://github.com/clun/kafka-dse/tree/driver2
27 © DataStax, All Rights Reserved. Confidential
Demonstration Overview
© DataStax, All Rights Reserved.
Kafka-dse-producer
TickGenerator
StockTickProducer
timer
Alpha Vantage
2181
9092
KAFKA-CONNECT
8083
Sources Sinks
DSE
Kafka-dse-webui
Java-dse-driver
Spring Web Flux
29 © DataStax, All Rights Reserved.
Confidential
Agenda
1 Apache Cassandra and Apache Kafka
2 Better Together – Common Patterns
3 DataStax Kafka Connector
4 Demonstration
5 Resources
Documentation and Examples
• Blog
– https://www.datastax.com/2018/12/introducing-the-datastax-apache-kafka-connector
• Download
– https://academy.datastax.com/downloads#connectors
• Docs
– https://docs.datastax.com/en/kafka/doc/index.html
• Demonstration
– https://github.com/clun/kafka-dse/tree/driver2
• Examples
– https://github.com/datastax/kafka-examples
30 © DataStax, All Rights Reserved.
DataStax Academy
• Free self-paced courses
• DS201: Apache Cassandra™
• DS210: Operations
• DS220: Data Modeling
• DS310: Search
• DS320: Analytics
• DS330: Graph
• Kafka Connector Getting Started
31 © DataStax, All Rights Reserved.
https://academy.datastax.com
Live Coding on Twitch
• Live coding sessions with advocates and
guests
• Working through the challenges of
building distributed systems
• Join the conversation and ask questions
• Twitch Rewind: Kafka Connector
– https://www.youtube.com/watch?v=2_BidD
K5zGE
https://www.twitch.tv/datastaxacademy
32 © DataStax, All Rights Reserved.
Join us at Accelerate!
33 © DataStax, All Rights Reserved.
http://www.datastax.com/accelerate
Discount Code: ADVOCATE20
Cassandra + Kafka at Accelerate!
34 © DataStax, All Rights Reserved.
…and additional sessions
Thank you
35 © DataStax, All Rights Reserved.
Confidential

More Related Content

What's hot

Apache Kafka + Apache Mesos + Kafka Streams - Highly Scalable Streaming Micro...
Apache Kafka + Apache Mesos + Kafka Streams - Highly Scalable Streaming Micro...Apache Kafka + Apache Mesos + Kafka Streams - Highly Scalable Streaming Micro...
Apache Kafka + Apache Mesos + Kafka Streams - Highly Scalable Streaming Micro...
Kai Wähner
 
Can Apache Kafka Replace a Database?
Can Apache Kafka Replace a Database?Can Apache Kafka Replace a Database?
Can Apache Kafka Replace a Database?
Kai Wähner
 
Introducing Kafka's Streams API
Introducing Kafka's Streams APIIntroducing Kafka's Streams API
Introducing Kafka's Streams API
confluent
 

What's hot (20)

Apache Kafka + Apache Mesos + Kafka Streams - Highly Scalable Streaming Micro...
Apache Kafka + Apache Mesos + Kafka Streams - Highly Scalable Streaming Micro...Apache Kafka + Apache Mesos + Kafka Streams - Highly Scalable Streaming Micro...
Apache Kafka + Apache Mesos + Kafka Streams - Highly Scalable Streaming Micro...
 
Can Apache Kafka Replace a Database?
Can Apache Kafka Replace a Database?Can Apache Kafka Replace a Database?
Can Apache Kafka Replace a Database?
 
Apache Kafka - Scalable Message-Processing and more !
Apache Kafka - Scalable Message-Processing and more !Apache Kafka - Scalable Message-Processing and more !
Apache Kafka - Scalable Message-Processing and more !
 
APAC Kafka Summit - Best Of
APAC Kafka Summit - Best Of APAC Kafka Summit - Best Of
APAC Kafka Summit - Best Of
 
Introduction to Apache Kafka and Confluent... and why they matter
Introduction to Apache Kafka and Confluent... and why they matterIntroduction to Apache Kafka and Confluent... and why they matter
Introduction to Apache Kafka and Confluent... and why they matter
 
Evolving from Messaging to Event Streaming
Evolving from Messaging to Event StreamingEvolving from Messaging to Event Streaming
Evolving from Messaging to Event Streaming
 
Secure Kafka at scale in true multi-tenant environment ( Vishnu Balusu & Asho...
Secure Kafka at scale in true multi-tenant environment ( Vishnu Balusu & Asho...Secure Kafka at scale in true multi-tenant environment ( Vishnu Balusu & Asho...
Secure Kafka at scale in true multi-tenant environment ( Vishnu Balusu & Asho...
 
Partner Development Guide for Kafka Connect
Partner Development Guide for Kafka ConnectPartner Development Guide for Kafka Connect
Partner Development Guide for Kafka Connect
 
Introducing Kafka's Streams API
Introducing Kafka's Streams APIIntroducing Kafka's Streams API
Introducing Kafka's Streams API
 
Hello, kafka! (an introduction to apache kafka)
Hello, kafka! (an introduction to apache kafka)Hello, kafka! (an introduction to apache kafka)
Hello, kafka! (an introduction to apache kafka)
 
Kafka Summit NYC 2017 - Cloud Native Data Streaming Microservices with Spring...
Kafka Summit NYC 2017 - Cloud Native Data Streaming Microservices with Spring...Kafka Summit NYC 2017 - Cloud Native Data Streaming Microservices with Spring...
Kafka Summit NYC 2017 - Cloud Native Data Streaming Microservices with Spring...
 
Confluent Kafka and KSQL: Streaming Data Pipelines Made Easy
Confluent Kafka and KSQL: Streaming Data Pipelines Made EasyConfluent Kafka and KSQL: Streaming Data Pipelines Made Easy
Confluent Kafka and KSQL: Streaming Data Pipelines Made Easy
 
Tale of two streaming frameworks (Karthik D - Walmart)
Tale of two streaming frameworks (Karthik D - Walmart)Tale of two streaming frameworks (Karthik D - Walmart)
Tale of two streaming frameworks (Karthik D - Walmart)
 
Ten reasons to choose Apache Pulsar over Apache Kafka for Event Sourcing_Robe...
Ten reasons to choose Apache Pulsar over Apache Kafka for Event Sourcing_Robe...Ten reasons to choose Apache Pulsar over Apache Kafka for Event Sourcing_Robe...
Ten reasons to choose Apache Pulsar over Apache Kafka for Event Sourcing_Robe...
 
Apache Kafka and API Management / API Gateway – Friends, Enemies or Frenemies...
Apache Kafka and API Management / API Gateway – Friends, Enemies or Frenemies...Apache Kafka and API Management / API Gateway – Friends, Enemies or Frenemies...
Apache Kafka and API Management / API Gateway – Friends, Enemies or Frenemies...
 
Developing a custom Kafka connector? Make it shine! | Igor Buzatović, Porsche...
Developing a custom Kafka connector? Make it shine! | Igor Buzatović, Porsche...Developing a custom Kafka connector? Make it shine! | Igor Buzatović, Porsche...
Developing a custom Kafka connector? Make it shine! | Igor Buzatović, Porsche...
 
Intro to Apache Kafka
Intro to Apache KafkaIntro to Apache Kafka
Intro to Apache Kafka
 
Creating Connector to Bridge the Worlds of Kafka and gRPC at Wework (Anoop Di...
Creating Connector to Bridge the Worlds of Kafka and gRPC at Wework (Anoop Di...Creating Connector to Bridge the Worlds of Kafka and gRPC at Wework (Anoop Di...
Creating Connector to Bridge the Worlds of Kafka and gRPC at Wework (Anoop Di...
 
Modern Cloud-Native Streaming Platforms: Event Streaming Microservices with A...
Modern Cloud-Native Streaming Platforms: Event Streaming Microservices with A...Modern Cloud-Native Streaming Platforms: Event Streaming Microservices with A...
Modern Cloud-Native Streaming Platforms: Event Streaming Microservices with A...
 
IIoT with Kafka and Machine Learning for Supply Chain Optimization In Real Ti...
IIoT with Kafka and Machine Learning for Supply Chain Optimization In Real Ti...IIoT with Kafka and Machine Learning for Supply Chain Optimization In Real Ti...
IIoT with Kafka and Machine Learning for Supply Chain Optimization In Real Ti...
 

Similar to Webinar | Better Together: Apache Cassandra and Apache Kafka

GSJUG: Mastering Data Streaming Pipelines 09May2023
GSJUG: Mastering Data Streaming Pipelines 09May2023GSJUG: Mastering Data Streaming Pipelines 09May2023
GSJUG: Mastering Data Streaming Pipelines 09May2023
Timothy Spann
 
Real Time Analytics with Dse
Real Time Analytics with DseReal Time Analytics with Dse
Real Time Analytics with Dse
DataStax Academy
 
Apache Kafka - Scalable Message Processing and more!
Apache Kafka - Scalable Message Processing and more!Apache Kafka - Scalable Message Processing and more!
Apache Kafka - Scalable Message Processing and more!
Guido Schmutz
 
Unconference Round Table Notes
Unconference Round Table NotesUnconference Round Table Notes
Unconference Round Table Notes
Timothy Spann
 

Similar to Webinar | Better Together: Apache Cassandra and Apache Kafka (20)

GSJUG: Mastering Data Streaming Pipelines 09May2023
GSJUG: Mastering Data Streaming Pipelines 09May2023GSJUG: Mastering Data Streaming Pipelines 09May2023
GSJUG: Mastering Data Streaming Pipelines 09May2023
 
Data Con LA 2019 - Patterns for Persistence and Streaming in Cloud Architectu...
Data Con LA 2019 - Patterns for Persistence and Streaming in Cloud Architectu...Data Con LA 2019 - Patterns for Persistence and Streaming in Cloud Architectu...
Data Con LA 2019 - Patterns for Persistence and Streaming in Cloud Architectu...
 
Stream your Operational Data with Apache Spark & Kafka into Hadoop using Couc...
Stream your Operational Data with Apache Spark & Kafka into Hadoop using Couc...Stream your Operational Data with Apache Spark & Kafka into Hadoop using Couc...
Stream your Operational Data with Apache Spark & Kafka into Hadoop using Couc...
 
Real time Messages at Scale with Apache Kafka and Couchbase
Real time Messages at Scale with Apache Kafka and CouchbaseReal time Messages at Scale with Apache Kafka and Couchbase
Real time Messages at Scale with Apache Kafka and Couchbase
 
Real Time Analytics with Dse
Real Time Analytics with DseReal Time Analytics with Dse
Real Time Analytics with Dse
 
Patterns for Persistence and Streaming in Microservice Architectures
Patterns for Persistence and Streaming in Microservice ArchitecturesPatterns for Persistence and Streaming in Microservice Architectures
Patterns for Persistence and Streaming in Microservice Architectures
 
Leverage Kafka to build a stream processing platform
Leverage Kafka to build a stream processing platformLeverage Kafka to build a stream processing platform
Leverage Kafka to build a stream processing platform
 
PartnerSkillUp_Enable a Streaming CDC Solution
PartnerSkillUp_Enable a Streaming CDC SolutionPartnerSkillUp_Enable a Streaming CDC Solution
PartnerSkillUp_Enable a Streaming CDC Solution
 
Webinar: What's new in CDAP 3.5?
Webinar: What's new in CDAP 3.5?Webinar: What's new in CDAP 3.5?
Webinar: What's new in CDAP 3.5?
 
NoSQL_Night
NoSQL_NightNoSQL_Night
NoSQL_Night
 
Being Ready for Apache Kafka - Apache: Big Data Europe 2015
Being Ready for Apache Kafka - Apache: Big Data Europe 2015Being Ready for Apache Kafka - Apache: Big Data Europe 2015
Being Ready for Apache Kafka - Apache: Big Data Europe 2015
 
Apache Tez - A unifying Framework for Hadoop Data Processing
Apache Tez - A unifying Framework for Hadoop Data ProcessingApache Tez - A unifying Framework for Hadoop Data Processing
Apache Tez - A unifying Framework for Hadoop Data Processing
 
Solution Brief: Real-Time Pipeline Accelerator
Solution Brief: Real-Time Pipeline AcceleratorSolution Brief: Real-Time Pipeline Accelerator
Solution Brief: Real-Time Pipeline Accelerator
 
Meetup - Brasil - Data In Motion - 2023 September 19
Meetup - Brasil - Data In Motion - 2023 September 19Meetup - Brasil - Data In Motion - 2023 September 19
Meetup - Brasil - Data In Motion - 2023 September 19
 
Meetup - Brasil - Data In Motion - 2023 September 19
Meetup - Brasil - Data In Motion - 2023 September 19Meetup - Brasil - Data In Motion - 2023 September 19
Meetup - Brasil - Data In Motion - 2023 September 19
 
Apache Kafka - Scalable Message Processing and more!
Apache Kafka - Scalable Message Processing and more!Apache Kafka - Scalable Message Processing and more!
Apache Kafka - Scalable Message Processing and more!
 
Couchbase and Apache Kafka - Bridging the gap between RDBMS and NoSQL
Couchbase and Apache Kafka - Bridging the gap between RDBMS and NoSQLCouchbase and Apache Kafka - Bridging the gap between RDBMS and NoSQL
Couchbase and Apache Kafka - Bridging the gap between RDBMS and NoSQL
 
Couchbase Data Pipeline
Couchbase Data PipelineCouchbase Data Pipeline
Couchbase Data Pipeline
 
Building a Hadoop Data Warehouse with Impala
Building a Hadoop Data Warehouse with ImpalaBuilding a Hadoop Data Warehouse with Impala
Building a Hadoop Data Warehouse with Impala
 
Unconference Round Table Notes
Unconference Round Table NotesUnconference Round Table Notes
Unconference Round Table Notes
 

More from DataStax

More from DataStax (20)

Is Your Enterprise Ready to Shine This Holiday Season?
Is Your Enterprise Ready to Shine This Holiday Season?Is Your Enterprise Ready to Shine This Holiday Season?
Is Your Enterprise Ready to Shine This Holiday Season?
 
Designing Fault-Tolerant Applications with DataStax Enterprise and Apache Cas...
Designing Fault-Tolerant Applications with DataStax Enterprise and Apache Cas...Designing Fault-Tolerant Applications with DataStax Enterprise and Apache Cas...
Designing Fault-Tolerant Applications with DataStax Enterprise and Apache Cas...
 
Running DataStax Enterprise in VMware Cloud and Hybrid Environments
Running DataStax Enterprise in VMware Cloud and Hybrid EnvironmentsRunning DataStax Enterprise in VMware Cloud and Hybrid Environments
Running DataStax Enterprise in VMware Cloud and Hybrid Environments
 
Best Practices for Getting to Production with DataStax Enterprise Graph
Best Practices for Getting to Production with DataStax Enterprise GraphBest Practices for Getting to Production with DataStax Enterprise Graph
Best Practices for Getting to Production with DataStax Enterprise Graph
 
Webinar | Data Management for Hybrid and Multi-Cloud: A Four-Step Journey
Webinar | Data Management for Hybrid and Multi-Cloud: A Four-Step JourneyWebinar | Data Management for Hybrid and Multi-Cloud: A Four-Step Journey
Webinar | Data Management for Hybrid and Multi-Cloud: A Four-Step Journey
 
Webinar | How to Understand Apache Cassandra™ Performance Through Read/Writ...
Webinar  |  How to Understand Apache Cassandra™ Performance Through Read/Writ...Webinar  |  How to Understand Apache Cassandra™ Performance Through Read/Writ...
Webinar | How to Understand Apache Cassandra™ Performance Through Read/Writ...
 
Top 10 Best Practices for Apache Cassandra and DataStax Enterprise
Top 10 Best Practices for Apache Cassandra and DataStax EnterpriseTop 10 Best Practices for Apache Cassandra and DataStax Enterprise
Top 10 Best Practices for Apache Cassandra and DataStax Enterprise
 
Introduction to Apache Cassandra™ + What’s New in 4.0
Introduction to Apache Cassandra™ + What’s New in 4.0Introduction to Apache Cassandra™ + What’s New in 4.0
Introduction to Apache Cassandra™ + What’s New in 4.0
 
Webinar: How Active Everywhere Database Architecture Accelerates Hybrid Cloud...
Webinar: How Active Everywhere Database Architecture Accelerates Hybrid Cloud...Webinar: How Active Everywhere Database Architecture Accelerates Hybrid Cloud...
Webinar: How Active Everywhere Database Architecture Accelerates Hybrid Cloud...
 
Webinar | Aligning GDPR Requirements with Today's Hybrid Cloud Realities
Webinar  |  Aligning GDPR Requirements with Today's Hybrid Cloud RealitiesWebinar  |  Aligning GDPR Requirements with Today's Hybrid Cloud Realities
Webinar | Aligning GDPR Requirements with Today's Hybrid Cloud Realities
 
Designing a Distributed Cloud Database for Dummies
Designing a Distributed Cloud Database for DummiesDesigning a Distributed Cloud Database for Dummies
Designing a Distributed Cloud Database for Dummies
 
How to Power Innovation with Geo-Distributed Data Management in Hybrid Cloud
How to Power Innovation with Geo-Distributed Data Management in Hybrid CloudHow to Power Innovation with Geo-Distributed Data Management in Hybrid Cloud
How to Power Innovation with Geo-Distributed Data Management in Hybrid Cloud
 
How to Evaluate Cloud Databases for eCommerce
How to Evaluate Cloud Databases for eCommerceHow to Evaluate Cloud Databases for eCommerce
How to Evaluate Cloud Databases for eCommerce
 
Webinar: DataStax Enterprise 6: 10 Ways to Multiply the Power of Apache Cassa...
Webinar: DataStax Enterprise 6: 10 Ways to Multiply the Power of Apache Cassa...Webinar: DataStax Enterprise 6: 10 Ways to Multiply the Power of Apache Cassa...
Webinar: DataStax Enterprise 6: 10 Ways to Multiply the Power of Apache Cassa...
 
Webinar: DataStax and Microsoft Azure: Empowering the Right-Now Enterprise wi...
Webinar: DataStax and Microsoft Azure: Empowering the Right-Now Enterprise wi...Webinar: DataStax and Microsoft Azure: Empowering the Right-Now Enterprise wi...
Webinar: DataStax and Microsoft Azure: Empowering the Right-Now Enterprise wi...
 
Webinar - Real-Time Customer Experience for the Right-Now Enterprise featurin...
Webinar - Real-Time Customer Experience for the Right-Now Enterprise featurin...Webinar - Real-Time Customer Experience for the Right-Now Enterprise featurin...
Webinar - Real-Time Customer Experience for the Right-Now Enterprise featurin...
 
Datastax - The Architect's guide to customer experience (CX)
Datastax - The Architect's guide to customer experience (CX)Datastax - The Architect's guide to customer experience (CX)
Datastax - The Architect's guide to customer experience (CX)
 
An Operational Data Layer is Critical for Transformative Banking Applications
An Operational Data Layer is Critical for Transformative Banking ApplicationsAn Operational Data Layer is Critical for Transformative Banking Applications
An Operational Data Layer is Critical for Transformative Banking Applications
 
Becoming a Customer-Centric Enterprise Via Real-Time Data and Design Thinking
Becoming a Customer-Centric Enterprise Via Real-Time Data and Design ThinkingBecoming a Customer-Centric Enterprise Via Real-Time Data and Design Thinking
Becoming a Customer-Centric Enterprise Via Real-Time Data and Design Thinking
 
Innovation Around Data and AI for Fraud Detection
Innovation Around Data and AI for Fraud DetectionInnovation Around Data and AI for Fraud Detection
Innovation Around Data and AI for Fraud Detection
 

Recently uploaded

Structuring Teams and Portfolios for Success
Structuring Teams and Portfolios for SuccessStructuring Teams and Portfolios for Success
Structuring Teams and Portfolios for Success
UXDXConf
 

Recently uploaded (20)

Custom Approval Process: A New Perspective, Pavel Hrbacek & Anindya Halder
Custom Approval Process: A New Perspective, Pavel Hrbacek & Anindya HalderCustom Approval Process: A New Perspective, Pavel Hrbacek & Anindya Halder
Custom Approval Process: A New Perspective, Pavel Hrbacek & Anindya Halder
 
ASRock Industrial FDO Solutions in Action for Industrial Edge AI _ Kenny at A...
ASRock Industrial FDO Solutions in Action for Industrial Edge AI _ Kenny at A...ASRock Industrial FDO Solutions in Action for Industrial Edge AI _ Kenny at A...
ASRock Industrial FDO Solutions in Action for Industrial Edge AI _ Kenny at A...
 
Buy Epson EcoTank L3210 Colour Printer Online.pdf
Buy Epson EcoTank L3210 Colour Printer Online.pdfBuy Epson EcoTank L3210 Colour Printer Online.pdf
Buy Epson EcoTank L3210 Colour Printer Online.pdf
 
IESVE for Early Stage Design and Planning
IESVE for Early Stage Design and PlanningIESVE for Early Stage Design and Planning
IESVE for Early Stage Design and Planning
 
Where to Learn More About FDO _ Richard at FIDO Alliance.pdf
Where to Learn More About FDO _ Richard at FIDO Alliance.pdfWhere to Learn More About FDO _ Richard at FIDO Alliance.pdf
Where to Learn More About FDO _ Richard at FIDO Alliance.pdf
 
Buy Epson EcoTank L3210 Colour Printer Online.pptx
Buy Epson EcoTank L3210 Colour Printer Online.pptxBuy Epson EcoTank L3210 Colour Printer Online.pptx
Buy Epson EcoTank L3210 Colour Printer Online.pptx
 
The Metaverse: Are We There Yet?
The  Metaverse:    Are   We  There  Yet?The  Metaverse:    Are   We  There  Yet?
The Metaverse: Are We There Yet?
 
UiPath Test Automation using UiPath Test Suite series, part 1
UiPath Test Automation using UiPath Test Suite series, part 1UiPath Test Automation using UiPath Test Suite series, part 1
UiPath Test Automation using UiPath Test Suite series, part 1
 
Secure Zero Touch enabled Edge compute with Dell NativeEdge via FDO _ Brad at...
Secure Zero Touch enabled Edge compute with Dell NativeEdge via FDO _ Brad at...Secure Zero Touch enabled Edge compute with Dell NativeEdge via FDO _ Brad at...
Secure Zero Touch enabled Edge compute with Dell NativeEdge via FDO _ Brad at...
 
Structuring Teams and Portfolios for Success
Structuring Teams and Portfolios for SuccessStructuring Teams and Portfolios for Success
Structuring Teams and Portfolios for Success
 
SOQL 201 for Admins & Developers: Slice & Dice Your Org’s Data With Aggregate...
SOQL 201 for Admins & Developers: Slice & Dice Your Org’s Data With Aggregate...SOQL 201 for Admins & Developers: Slice & Dice Your Org’s Data With Aggregate...
SOQL 201 for Admins & Developers: Slice & Dice Your Org’s Data With Aggregate...
 
Behind the Scenes From the Manager's Chair: Decoding the Secrets of Successfu...
Behind the Scenes From the Manager's Chair: Decoding the Secrets of Successfu...Behind the Scenes From the Manager's Chair: Decoding the Secrets of Successfu...
Behind the Scenes From the Manager's Chair: Decoding the Secrets of Successfu...
 
Agentic RAG What it is its types applications and implementation.pdf
Agentic RAG What it is its types applications and implementation.pdfAgentic RAG What it is its types applications and implementation.pdf
Agentic RAG What it is its types applications and implementation.pdf
 
Integrating Telephony Systems with Salesforce: Insights and Considerations, B...
Integrating Telephony Systems with Salesforce: Insights and Considerations, B...Integrating Telephony Systems with Salesforce: Insights and Considerations, B...
Integrating Telephony Systems with Salesforce: Insights and Considerations, B...
 
Intro in Product Management - Коротко про професію продакт менеджера
Intro in Product Management - Коротко про професію продакт менеджераIntro in Product Management - Коротко про професію продакт менеджера
Intro in Product Management - Коротко про професію продакт менеджера
 
How Red Hat Uses FDO in Device Lifecycle _ Costin and Vitaliy at Red Hat.pdf
How Red Hat Uses FDO in Device Lifecycle _ Costin and Vitaliy at Red Hat.pdfHow Red Hat Uses FDO in Device Lifecycle _ Costin and Vitaliy at Red Hat.pdf
How Red Hat Uses FDO in Device Lifecycle _ Costin and Vitaliy at Red Hat.pdf
 
What's New in Teams Calling, Meetings and Devices April 2024
What's New in Teams Calling, Meetings and Devices April 2024What's New in Teams Calling, Meetings and Devices April 2024
What's New in Teams Calling, Meetings and Devices April 2024
 
Introduction to FDO and How It works Applications _ Richard at FIDO Alliance.pdf
Introduction to FDO and How It works Applications _ Richard at FIDO Alliance.pdfIntroduction to FDO and How It works Applications _ Richard at FIDO Alliance.pdf
Introduction to FDO and How It works Applications _ Richard at FIDO Alliance.pdf
 
UiPath Test Automation using UiPath Test Suite series, part 2
UiPath Test Automation using UiPath Test Suite series, part 2UiPath Test Automation using UiPath Test Suite series, part 2
UiPath Test Automation using UiPath Test Suite series, part 2
 
THE BEST IPTV in GERMANY for 2024: IPTVreel
THE BEST IPTV in  GERMANY for 2024: IPTVreelTHE BEST IPTV in  GERMANY for 2024: IPTVreel
THE BEST IPTV in GERMANY for 2024: IPTVreel
 

Webinar | Better Together: Apache Cassandra and Apache Kafka

  • 2. 2 © DataStax, All Rights Reserved. Confidential Agenda 1 Apache Cassandra and Apache Kafka 2 Better Together – Common Patterns 3 DataStax Kafka Connector 4 Demonstration 5 Resources
  • 3. Your Presenters 3 © DataStax, All Rights Reserved.
  • 4. 4 © DataStax, All Rights Reserved. Confidential Agenda 1 Apache Cassandra and Apache Kafka 2 Better Together – Common Patterns 3 DataStax Kafka Connector 4 Demonstration 5 Resources
  • 5. Apache Cassandra Overview • First developed by Facebook • Top-level Apache project since 2010 • Partitioned row store • Distributed, decentralized • Elastic scalability / high performance • High availability / fault tolerant • Tuneable consistency • Cassandra Query Language (CQL) © DataStax, All Rights Reserved.5 Apache Cassandra ® Apache Software Foundation
  • 6. Apache Kafka Overview • First developed by LinkedIn • Top-level Apache Project since 2012 • Distributed streaming platform • Used for real-time data pipelines and streaming applications • Horizontal scalability / high performance • High availability / Fault tolerance • Stream persistence and querying (KSQL) • Connect framework 6 © DataStax, All Rights Reserved. Apache Kafka ® Apache Software Foundation
  • 7. Kafka Concepts • Topics – Collection of key/value pairs – Append-only – Can be partitioned • Producers • Consumers – Separate offsets 7 © DataStax, All Rights Reserved.
  • 8. Kafka Concepts • Streams applications – Combined Producer/Consumer • KSQL – Query language used by stream applications 8 © DataStax, All Rights Reserved.
  • 9. Kafka Concepts • Brokers • Clusters • Connect Framework – Sources – Sinks 9 © DataStax, All Rights Reserved.
  • 10. Cassandra + Kafka – Similarities and Distinctives • Concepts in common – Distributed Systems – Partitioning / Hashing – Replication • Slight differences in implementation – Multi-DC – Log-structured – TTL / retention • Cassandra excels at… – High volume, write intensive data storage workloads at scale – Suitable as a system of record – High performance searching via DSE • Kafka excels at… – Streaming data to/from services and legacy data sources – Acting upon changes in data from multiple sources (aka pipelines) 10 © DataStax, All Rights Reserved.
  • 11. + Better Together – using the best of both
  • 12. 12 © DataStax, All Rights Reserved. Confidential Agenda 1 Apache Cassandra and Apache Kafka 2 Better Together – Common Patterns 3 DataStax Kafka Connector 4 Demonstration 5 Resources
  • 13. Pattern 1: Cassandra + Kafka in Microservices 13 © DataStax, All Rights Reserved. Some Producer My microservice DataStax Enterprise • Consume topic(s) Other consumers • Read / write data • Publish to topic(s)
  • 14. KillrVideo Services Suggested Videos Service DataStax Enterprise DSE Graph • UserCreated • YouTubeVideoAdded • UserRatedVideo • Populate graph • Graph recommender traversal • Read and write data User Management, Video Catalog, Ratings Cassandra + Kafka – KillrVideo Example
  • 15. Confidential© DataStax, All Rights Reserved. Pattern 2: Kafka into Cassandra 15
  • 16. 16 © DataStax, All Rights Reserved. Confidential Agenda 1 Apache Cassandra and Apache Kafka 2 Better Together – Common Patterns 3 DataStax Kafka Connector 4 Demonstration 5 Resources
  • 17. Why a Kafka Connector ? Account & Product Usage Geolocation ClickStream Mobile Device System of records Event Sources … Topic: stocks-ticks
  • 18. Why a Kafka Connector? • Spark Streaming = PULL ➢ Enable advanced transformations and computations ➢ Mode Pull with a dedicated runtime (poll) 18 • Kakfa Connector = PUSH ➢ No extra runtime
  • 19. What is the Kafka Connector ? Account & Product Usage Geolocation ClickStream Mobile Device System of records Event Sources … Topic: stocks-ticks Kafka Connect Sources Sinks HERE
  • 20. What is the Kafka Connector? • Automatically ingest from Kafka to DSE – Simple, Fast, Flexible, Secure • Deployed in the Kafka Connect framework – Managed through the built-in REST API • Visibility into running connectors and tasks • Endpoints for operator tasks – Automatic rebalancing • Useful for availability and scaling 20 Kafka Connect Sources Sinks HERE
  • 21. What is the Kafka Connector? • Built by DataStax drivers team – Best practices for writing to DSE – Resiliency of DS drivers 21 Kafka Connect Sources Sinks HERE
  • 22. What is the Kafka Connector? 22 Kafka Connect Data Sinks DataStax Connector workers <standalone-worker>.properties <distributed-worker>.properties or reads Connectors list <connector>.properties or <connector>.json read Config (mapping) instantiate start
  • 23. 23 © DataStax, All Rights Reserved. Confidential Agenda 1 Apache Cassandra and Apache Kafka 2 Better Together – Common Patterns 3 DataStax Kafka Connector 4 Demonstration 5 Resources
  • 25. What versions does this work with? 25 DSE 5.0+ Supported Versions - DSE 5.0+ - Confluent 3.2+ - Apache Kafka 0.10.2+ Supported offerings - DS Enterprise - DS Basic - DDAC Confluent Apache Kafka 3.2.x+ 0.10.2.x+ 3.3.x+ 0.11.0.x+ 4.0.x+ 1.0.x+ 4.1.x+ 1.1.x+ 5.0.x+ 2.0.x+
  • 26. Docker and Datastax 26 Confidential • WHERE – https://hub.docker.com/u/datastax/ – https://github.com/datastax/docker- images/tree/master/datastax-docker-image- examples • We provide – Dockers images for DSE, studio, Opscenter – Docker-compose configuration files – Sample Deployments • We support – Installation on dev before 6.7 – Installation on prod from 6.7 (December 2018)
  • 28. Demonstration Overview © DataStax, All Rights Reserved. Kafka-dse-producer TickGenerator StockTickProducer timer Alpha Vantage 2181 9092 KAFKA-CONNECT 8083 Sources Sinks DSE Kafka-dse-webui Java-dse-driver Spring Web Flux
  • 29. 29 © DataStax, All Rights Reserved. Confidential Agenda 1 Apache Cassandra and Apache Kafka 2 Better Together – Common Patterns 3 DataStax Kafka Connector 4 Demonstration 5 Resources
  • 30. Documentation and Examples • Blog – https://www.datastax.com/2018/12/introducing-the-datastax-apache-kafka-connector • Download – https://academy.datastax.com/downloads#connectors • Docs – https://docs.datastax.com/en/kafka/doc/index.html • Demonstration – https://github.com/clun/kafka-dse/tree/driver2 • Examples – https://github.com/datastax/kafka-examples 30 © DataStax, All Rights Reserved.
  • 31. DataStax Academy • Free self-paced courses • DS201: Apache Cassandra™ • DS210: Operations • DS220: Data Modeling • DS310: Search • DS320: Analytics • DS330: Graph • Kafka Connector Getting Started 31 © DataStax, All Rights Reserved. https://academy.datastax.com
  • 32. Live Coding on Twitch • Live coding sessions with advocates and guests • Working through the challenges of building distributed systems • Join the conversation and ask questions • Twitch Rewind: Kafka Connector – https://www.youtube.com/watch?v=2_BidD K5zGE https://www.twitch.tv/datastaxacademy 32 © DataStax, All Rights Reserved.
  • 33. Join us at Accelerate! 33 © DataStax, All Rights Reserved. http://www.datastax.com/accelerate Discount Code: ADVOCATE20
  • 34. Cassandra + Kafka at Accelerate! 34 © DataStax, All Rights Reserved. …and additional sessions
  • 35. Thank you 35 © DataStax, All Rights Reserved. Confidential