Real-time platforms using Spark, Akka, Cassandra, Kafka, Alpakka, Kafka
Connect, & Kafka Streams.
Realtime Business Platform
Architecture Review
Business Platform Success
We design, build, and manage business
platforms by leveraging DataStax,
Sitecore, Salesforce, Quickbooks and
other cloud software.
Modern Digital Business Technology Platform
Business Use Case for Real-time
The current thinking is that the state of the systems in an
enterprise is synchronous and that analysis must be done
sequentially and iteratively, from beginning to end, in batch.
Old: Synchronous ETL / Batch Pipeline
Modern Business Platform: IoT + API + CX + IS + DA
New: Coordinated Disparate Systems
Reactive Manifesto
Modern: Async via Streams/Queues/Actors
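As a minimal sketch of the queue-based asynchronous style named above, the following uses only Python's standard `asyncio` module. The event names, the `run_pipeline` helper, and the uppercase "processing" step are hypothetical placeholders, not part of any platform described in this deck.

```python
import asyncio


async def producer(queue, events):
    # Put each event on the queue without waiting for it to be processed.
    for event in events:
        await queue.put(event)
    # A None sentinel tells the consumer that no more events are coming.
    await queue.put(None)


async def consumer(queue, handled):
    # Pull events as they arrive and process them independently of the producer.
    while True:
        event = await queue.get()
        if event is None:
            break
        handled.append(event.upper())  # placeholder for real processing


async def run_pipeline(events):
    # A bounded queue makes the producer wait when the consumer falls behind.
    queue = asyncio.Queue(maxsize=2)
    handled = []
    await asyncio.gather(producer(queue, events), consumer(queue, handled))
    return handled


result = asyncio.run(run_pipeline(["order", "payment", "shipment"]))
print(result)
```

The producer and consumer never call each other directly. They only share the queue, which is the property that lets disparate systems be coordinated without synchronous hand-offs.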
Real-time Analytics
• Deterministic
• Available
• Machine Learning
• Data Enrichment
• Business Intelligence Tools
• SQL
● Data
○ S3
○ Cassandra
○ Redshift
○ RDS
● Analysis
○ Spark
■ Python
■ Scala
■ Java
■ R
○ SQL
Real-time + Batch = Lambda Architecture
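The idea behind "Real-time + Batch" can be sketched in a few lines: a batch view recomputed from the full history, a speed view updated per event, and a query that merges the two. This is an illustrative, in-memory sketch only. The page-view events and the function names are assumptions made for the example, and a real deployment would use systems such as Spark and Cassandra rather than Python counters.

```python
from collections import Counter


def build_batch_view(master_events):
    # Batch layer: recompute counts from the complete historical log.
    return Counter(event["page"] for event in master_events)


def update_speed_view(speed_view, event):
    # Speed layer: incrementally count events the last batch run has not seen.
    speed_view[event["page"]] += 1


def query(batch_view, speed_view, page):
    # Serving layer: merge both views at read time.
    return batch_view[page] + speed_view[page]


historical = [{"page": "/home"}, {"page": "/home"}, {"page": "/pricing"}]
recent = [{"page": "/home"}, {"page": "/contact"}]

batch_view = build_batch_view(historical)
speed_view = Counter()
for event in recent:
    update_speed_view(speed_view, event)

home_total = query(batch_view, speed_view, "/home")
print(home_total)
```

When the next batch run absorbs the recent events, the speed view is reset, so the merged answer stays the same while the batch view becomes the source of truth for that period.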
Real-time Analytics
● Message Processing / Materialization
○ KSQL (Kafka SQL)
○ Kafka Connect
○ Samza
○ Akka
○ Alpakka
● Message Enhancement / Computation
○ Kafka Streams
○ Spark
○ Flink
○ Serverless (Lambda/Functions)
● Realtime Distributed Data
○ Cassandra*/Scylla/DSE
○ CockroachDB/TiDB/Galera/YugabyteDB
○ S3
○ DynamoDB
● Colocated Data / Analytics
○ Cassandra + Spark SQL
○ Cassandra + Spark
○ Cassandra + Spark + Solr
○ Snowflake
○ Redshift
Real-time Analytics: SMACK Stack
Apache Kafka
Kafka as Event Store
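Treating a log as an event store means state is derived by replaying an append-only sequence of events from an offset. The sketch below imitates that idea with a plain in-memory list. It does not use a Kafka client, and the `EventLog` class, the account events, and `replay_balances` are hypothetical names chosen for illustration.

```python
class EventLog:
    """In-memory stand-in for an append-only topic partition."""

    def __init__(self):
        self._events = []

    def append(self, event):
        # Events are only ever appended; the returned offset is the position.
        self._events.append(event)
        return len(self._events) - 1

    def read_from(self, offset=0):
        # Return (offset, event) pairs starting at the requested offset.
        return list(enumerate(self._events))[offset:]


def replay_balances(log, from_offset=0, balances=None):
    # Rebuild (or continue building) account balances by replaying events.
    balances = dict(balances or {})
    for _, event in log.read_from(from_offset):
        sign = 1 if event["type"] == "deposit" else -1
        account = event["account"]
        balances[account] = balances.get(account, 0) + sign * event["amount"]
    return balances


log = EventLog()
log.append({"type": "deposit", "account": "acct-1", "amount": 100})
log.append({"type": "withdraw", "account": "acct-1", "amount": 30})
log.append({"type": "deposit", "account": "acct-2", "amount": 50})

balances = replay_balances(log)
print(balances)
```

Because the log is never mutated, a new consumer can start from offset 0 and arrive at the same state, or resume from a saved offset with a saved snapshot.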
Kafka, Spark, Cassandra, SQS, Redshift on AWS
Kafka Streams
Kafka Connect + DataStax Connector
Kafka w/ Machine Learning
Apache Spark
Spark + Streaming + Static Data Sources
Spark + Tableau for Business Intelligence
Apache Kafka + Apache Spark
Spark + Kafka Data Processing
Simplified Lambda with Kafka + Spark + Cassandra
Apache Kafka + Apache Spark + Akka | Alpakka
Canonical Kafka Spark Cassandra Stack
Akka + Spark + Kafka + Cassandra
Alpakka (Akka Streams)
Apache Pulsar
Pulsar Architecture
Pulsar Subscription Modes
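Pulsar documents four subscription types: Exclusive, Failover, Shared, and Key_Shared. The toy simulation below shows only how messages end up distributed across consumers in each case. It is a simplification under stated assumptions, not the Pulsar client API: the `dispatch` function is hypothetical, Exclusive is modeled as "only the first consumer is attached", Failover as "the first consumer is active and never fails", and Key_Shared as a plain hash modulo instead of Pulsar's hash ranges.

```python
import zlib


def dispatch(messages, consumers, mode):
    # messages: list of (key, payload); consumers: ordered list of names.
    assignments = {name: [] for name in consumers}
    for index, (key, payload) in enumerate(messages):
        if mode in ("exclusive", "failover"):
            # One consumer receives everything; others are rejected or on standby.
            target = consumers[0]
        elif mode == "shared":
            # Round-robin across consumers; per-key ordering is not preserved.
            target = consumers[index % len(consumers)]
        elif mode == "key_shared":
            # The same key always maps to the same consumer.
            target = consumers[zlib.crc32(key.encode()) % len(consumers)]
        else:
            raise ValueError(f"unknown mode: {mode}")
        assignments[target].append(payload)
    return assignments


messages = [("a", "m1"), ("b", "m2"), ("a", "m3"), ("c", "m4")]
consumers = ["consumer-1", "consumer-2"]

shared = dispatch(messages, consumers, "shared")
exclusive = dispatch(messages, consumers, "exclusive")
key_shared = dispatch(messages, consumers, "key_shared")
print(shared)
```

Shared spreads load evenly but splits key "a" across consumers, while the key-based mode keeps "m1" and "m3" together at the cost of potentially uneven load.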
Pulsar Geo Replication
Pulsar Tiered Storage
Major Players
Realtime Components
www.anant.us | solutions@anant.us | (855) 262-6826
3 Washington Circle, NW | Suite 301 | Washington, DC 20037
Data & Analytics
Cassandra, DataStax, Kafka, Spark
Customer Experience
Sitecore
Information Systems
Salesforce, Quickbooks, and more
