Why Stream Data as Part
of Data Transformation
Glen Gomez Zuazo, Senior Solutions Architect
Presenter
Glen Gomez Zuazo, Senior Solutions Architect
● Data Science, Machine Learning, Distributed Systems, Full
Stack Development, Blockchain and Enterprise
Architecture
● Passionate involvement in Diversity and Inclusion
● STEM advocate for young people (Middle and High School)
● Teaching technology (CSSE, AWS and Microservices)
● Spending time with his family, including his dog (Bolillo),
running and camping
Event-Driven Data Architecture in 2019
■ Event-driven architectures are increasingly part of a complete data
transformation solution
■ This talk covers
● details of each
● advantages and disadvantages
● how to select the best for your company’s needs
Prevalent examples
■ Apache Kafka
■ Cloud Native Computing Foundation’s NATS
■ Amazon SQS
■ Lightbend Akka
AWS SQS
Amazon Simple Queue Service
■ Fully managed message queuing service
■ Enables decoupling and scaling of microservices, distributed systems, and
serverless applications by moving from synchronous to asynchronous communication
■ Eliminates the complexity and overhead of managing and operating
message-oriented middleware
SQS: two types of message queues
■ Standard queues: maximum throughput, best-effort ordering,
and at-least-once delivery
■ SQS FIFO queues: guarantee that messages are processed exactly
once, in the exact order in which they are sent
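A minimal sketch of sending to a FIFO queue with boto3 (the queue URL, group ID, and
deduplication ID are placeholders); the message group ID scopes the ordering guarantee,
and the deduplication ID drives exactly-once processing within the dedup window:

```python
import boto3

sqs = boto3.client("sqs", region_name="us-east-1")
# FIFO queue names must end in ".fifo"; this URL is a placeholder.
fifo_url = "https://sqs.us-east-1.amazonaws.com/123456789012/example-queue.fifo"

sqs.send_message(
    QueueUrl=fifo_url,
    MessageBody='{"order": 42}',
    MessageGroupId="customer-7",           # ordering is preserved per message group
    MessageDeduplicationId="order-42-v1",  # duplicates within the dedup window are dropped
)
```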
SQS Functionality
■ Unlimited queues and messages
■ Payload
● Up to 256KB of text in any format
● Each 64KB ‘chunk’ of payload is billed as 1 request
● (E.g. 256KB payload is billed as four requests)
● Use Amazon SQS Extended Client Library for Java to send messages >256KB
● Extended Client Library uses Amazon S3 to store the message payload
■ Batches
● Send, receive, or delete messages in batches of up to 10 messages or 256KB
● Batches cost the same amount as single messages
● More cost-effective for customers (see the batch-send sketch below)
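Below is a minimal batch-send sketch with boto3; the queue URL and message bodies are
placeholders:

```python
import boto3

sqs = boto3.client("sqs", region_name="us-east-1")
queue_url = "https://sqs.us-east-1.amazonaws.com/123456789012/example-queue"

# Up to 10 messages per SendMessageBatch call; each Id must be unique within the batch.
entries = [{"Id": str(i), "MessageBody": f"event-{i}"} for i in range(10)]

response = sqs.send_message_batch(QueueUrl=queue_url, Entries=entries)

# Partial failures are reported per entry rather than raised as an exception.
for failed in response.get("Failed", []):
    print("retry needed:", failed["Id"], failed["Message"])
```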
SQS Functionality (cont’d)
■ Long polling
● Reduce extraneous polling to minimize cost while receiving new messages as
quickly as possible.
● When your queue is empty, long-poll requests wait up to 20 seconds for the next
message to arrive
● Long-poll requests cost the same amount as regular requests (see the receive sketch after this slide)
■ Retain messages in queues for up to 14 days.
■ Send and read messages simultaneously
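A minimal long-polling receive sketch with boto3 (same placeholder queue URL);
WaitTimeSeconds=20 holds the request open until a message arrives or the wait elapses:

```python
import boto3

sqs = boto3.client("sqs", region_name="us-east-1")
queue_url = "https://sqs.us-east-1.amazonaws.com/123456789012/example-queue"

# Long poll: the call blocks for up to 20 seconds waiting for messages,
# which avoids a tight loop of empty receives.
resp = sqs.receive_message(
    QueueUrl=queue_url,
    MaxNumberOfMessages=10,
    WaitTimeSeconds=20,
)

for msg in resp.get("Messages", []):
    print("processing:", msg["Body"])
    # Delete only after successful processing; otherwise the message
    # reappears once its visibility timeout expires.
    sqs.delete_message(QueueUrl=queue_url, ReceiptHandle=msg["ReceiptHandle"])
```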
SQS Functionality (cont’d)
■ Message locking
● A received message is locked while a consumer processes it; if processing fails,
the lock expires and the message becomes available again
■ Queue sharing
● Anonymously or with specific AWS accounts
■ Server-side encryption (SSE)
● Keys managed in AWS Key Management Service (AWS KMS)
■ Dead Letter Queues (DLQ)
● Must be the same queue type as the source queue (standard or FIFO); see the
queue-creation sketch below
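A hedged sketch of creating a queue with SSE (using the AWS-managed KMS key) and a
redrive policy pointing at a dead-letter queue; the DLQ ARN, maxReceiveCount, and
visibility timeout are illustrative:

```python
import json
import boto3

sqs = boto3.client("sqs", region_name="us-east-1")

# Assumed: a dead-letter queue already exists; its ARN is a placeholder here.
dlq_arn = "arn:aws:sqs:us-east-1:123456789012:example-queue-dlq"

sqs.create_queue(
    QueueName="example-queue",
    Attributes={
        # Server-side encryption with the AWS-managed SQS key in KMS.
        "KmsMasterKeyId": "alias/aws/sqs",
        # After 5 failed receives, SQS moves the message to the DLQ.
        "RedrivePolicy": json.dumps(
            {"deadLetterTargetArn": dlq_arn, "maxReceiveCount": "5"}
        ),
        # How long a received ("locked") message stays hidden from other
        # consumers while it is being processed.
        "VisibilityTimeout": "30",
    },
)
```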
Publish-Subscribe for Application Integration
● Exchange Data Asynchronously
● Be Independent and fault-tolerant
● Allow Systems to be in different environments (OS, Language)
Messaging Patterns
Message queuing
Publish-subscribe (pub-sub)
NATS
NATS
■ High-performance, cloud-native messaging system
■ Provides a foundational messaging layer on which you can build both
synchronous and asynchronous, reliable, highly available systems
■ The 2.0 release adds major features for high availability and security
● Not to be confused with NGS, the Synadia commercial offering
Let’s cover the details of how we plan to deploy and configure NATS, with
a special focus on HA and security.
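Before diving into HA and security, here is a minimal core pub/sub sketch with the
nats-py client; the server URL and subject name are placeholders:

```python
import asyncio
import nats

async def main():
    # Assumed: a NATS server reachable at the default local address.
    nc = await nats.connect("nats://127.0.0.1:4222")

    # Subscribe: the callback fires for every message on the subject.
    async def handler(msg):
        print(f"received on {msg.subject}: {msg.data.decode()}")

    await nc.subscribe("orders.created", cb=handler)

    # Publish is fire-and-forget; core NATS delivery is at-most-once.
    await nc.publish("orders.created", b'{"id": 42}')

    await nc.flush()
    await nc.drain()

asyncio.run(main())
```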
High Availability
■ Deploy a NATS cluster as a global entity, with NATS gateways used to
connect multiple regions. Both NATS and the system proper will be
deployed active/active.
■ It is assumed that there is a geographically pinned single point of
entry into each cluster in all of these scenarios as per standard AWS
practices.
■ In "classic" active-active scenarios, you have two or more completely
isolated mirrors.
Sharing Streams and Services
■ The NATS account model also comes with an explicit, secure-by-default
means of allowing communication between accounts.
● Account owners can export either a stream (write-only from the account,
read-only to subscribers)
● or a service (read/write)
■ A service or stream can be exported as either
● a public export (any authorized account can import that subject), or
● a private export (requires an explicit, out-of-band delivery of an activation token)
Security and Multi-Tenancy
■ Main considerations / concerns in a multi-tenant system that sits on top of a
central messaging system
● Security of clients and the message traffic
● Configuration maintenance.
● The complexity of multiple multi-tenant systems running in the same
cluster (e.g. K8s tenants co-existing with ECS tenants)
■ In a decentralized model, clients authenticate to NATS with signed user JWTs.
There is a hierarchy that goes from Operator to Account to User.
■ In NATS, an account is a unit of isolation and a user is a unit of client
authentication and authorization.
RabbitMQ
RabbitMQ
■ Messages are published to queues (through exchanges).
■ Multiple consumers can connect to a queue.
■ The message broker distributes messages across all available consumers.
■ Messages can be re-delivered if a consumer fails.
■ Delivery order is guaranteed for queues with a single consumer (this is
not possible when the queue has multiple consumers).
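A minimal sketch with the pika client showing a producer and a competing consumer on
one queue; the host, queue name, and message body are placeholders:

```python
import pika

# Assumed: a RabbitMQ broker running locally with default credentials.
connection = pika.BlockingConnection(pika.ConnectionParameters(host="localhost"))
channel = connection.channel()

# Durable queue; the default ("") exchange routes by queue name.
channel.queue_declare(queue="tasks", durable=True)

# Producer side: publish a persistent message.
channel.basic_publish(
    exchange="",
    routing_key="tasks",
    body=b"do-some-work",
    properties=pika.BasicProperties(delivery_mode=2),  # persist to disk
)

# Consumer side: the broker distributes messages across all consumers on the
# queue; messages that are not acknowledged are redelivered on failure.
def on_message(ch, method, properties, body):
    print("processing:", body)
    ch.basic_ack(delivery_tag=method.delivery_tag)

channel.basic_consume(queue="tasks", on_message_callback=on_message)
channel.start_consuming()
```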
Architecture Considerations
■ Performance:
● RabbitMQ handles around 20,000 messages/second
■ Processing:
● Consumers are FIFO-based, reading from the head of the queue and processing messages one by one
■ HA
● Provides high-availability support
■ Open source
● RabbitMQ is open source under the Mozilla Public License
Architecture Diagram
Apache Kafka
Kafka
■ We use Apache Kafka to enable communication between producers and
consumers using message-based topics. Apache Kafka is a fast, scalable,
fault-tolerant, publish-subscribe messaging system.
■ It provides a platform for a new generation of large-scale distributed
applications, and it supports a large number of permanent or ad-hoc
consumers.
Architecture
■ Kafka Producer API
● Permits an application to publish a stream of records to one or more Kafka topics.
■ Kafka Consumer API
● To subscribe to one or more topics and process the stream of records produced to
them in an application
■ Kafka Streams API
● Allows an application to act as a stream processor
● Consumes an input stream from one or more topics
● Produces an output stream to one or more output topics
● Effectively transforms input streams into output streams
■ Kafka Connector API
● Allows building and running reusable producers or consumers that connect Kafka
topics to existing applications or data systems
● Example: a connector to a relational database might capture every change to a table
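A minimal producer/consumer sketch with the kafka-python client (broker address,
topic, and group ID are placeholders); the Streams and Connect APIs are JVM-side and
not shown here:

```python
from kafka import KafkaConsumer, KafkaProducer

# Assumed: a broker at localhost:9092 and a pre-created topic "events".
producer = KafkaProducer(bootstrap_servers="localhost:9092")
producer.send("events", key=b"user-42", value=b'{"action": "login"}')
producer.flush()  # block until buffered records are delivered

# Consumers sharing a group_id split the topic's partitions (consumer group semantics).
consumer = KafkaConsumer(
    "events",
    bootstrap_servers="localhost:9092",
    group_id="analytics",
    auto_offset_reset="earliest",
)
for record in consumer:
    print(record.topic, record.partition, record.offset, record.value)
```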
Architecture Diagram
Data Transformation - Architecture
Scylla + Kafka Users — just at Scylla Summit!
Scylla Summit 2018 Presenters
■ Discord
■ Faraday Future
■ GE
■ Grab
■ Natura
■ Nauto
■ Numberly
Scylla Summit 2019 Presenters
■ Lookout
■ Nauto
■ Numberly
■ OlaCabs
■ SmartDeployAI
■ Zeotap
Takeaway
Architectural Message Review Example
We follow a process to decide which technologies and patterns will be
applied, based on the specific requirements of the system.
We perform the following steps:
■ System Requirements
■ ASR (Architecturally Significant Requirements)
■ ADR (Architecture Decision Record)
■ System Context and Data Flow
■ PoC
■ MVPx
Architecturally Significant Requirements
Architecturally Significant Requirements (ASR) have a measurable effect on a system's
architecture, which includes application and infrastructure.
ASR Criteria
Requirements that have wide effects, are strict, or are difficult to achieve are often ASRs. Per the Wikipedia article
on ASRs, some common indicators that a requirement is an ASR are:
■ The requirement is associated with high business value and/or technical risk.
■ The requirement is a concern of a particularly important (influential, that is) stakeholder.
■ The requirement has a first-of-a-kind character, e.g. none of the responsibilities of already existing
components in the architecture addresses it.
■ The requirement has QoS/SLA characteristics that deviate from those already satisfied by the
evolving architecture.
■ The requirement has caused budget overruns or client dissatisfaction in a previous project with a similar
context.
Architecturally Significant Requirements
Categories
We have split our ASRs up into categories to make them easier to read and to allow us to
provide more detail for each requirement. These categories are:
■ Availability
■ Maintainability
■ Observability
■ Performance
■ Resiliency
■ Testability
■ Usability
Architecture Decision Record
■ NATS is an open source, powerful, lightweight, secure-by-default
messaging system.
■ Gives the same kind of delivery control as consumer groups in Kafka,
but without the maintenance overhead and operations cost
■ NATS is essentially self-managing: it doesn’t need anyone to create
new partitions to scale up or down
■ Clusters form themselves and self-heal, and clients are immediately
notified of cluster topology changes.
■ NATS supports traditional request/reply, pub/sub, fanout, and many
more messaging patterns.
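A minimal request/reply sketch with nats-py (server URL and subject are placeholders),
showing a synchronous-style call over the asynchronous substrate:

```python
import asyncio
import nats

async def main():
    nc = await nats.connect("nats://127.0.0.1:4222")

    # Responder: reply to each request received on the subject.
    async def responder(msg):
        await msg.respond(b"pong")

    await nc.subscribe("svc.ping", cb=responder)

    # Requester: blocks until a reply arrives or the timeout elapses.
    reply = await nc.request("svc.ping", b"ping", timeout=1)
    print(reply.data)

    await nc.drain()

asyncio.run(main())
```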
Why did we need a message broker?
Our ASRs lean heavily toward:
■ Resiliency,
■ Stability, and
■ Performance
When doing traditional point-to-point communications you have to do a number of things
that introduce points of failure, possible performance degradation, and loss of stability:
■ Service discovery (what's the address for a service?)
■ Retries and Failure Responses
■ Coping with slow connections and intermittent failure
■ Exponential back-off to avoid cascading failures
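As one example of that per-client burden, here is a generic exponential back-off
helper (not tied to any particular service; the parameters are illustrative):

```python
import random
import time

def call_with_backoff(request_fn, max_attempts=5, base_delay=0.1):
    """Retry a flaky point-to-point call with exponential back-off and jitter."""
    for attempt in range(max_attempts):
        try:
            return request_fn()
        except Exception:
            if attempt == max_attempts - 1:
                raise
            # Sleep 0.1s, 0.2s, 0.4s, ... plus jitter to avoid thundering herds.
            time.sleep(base_delay * (2 ** attempt) + random.uniform(0, base_delay))
```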
Why not Kafka?
Once we decided that we wanted to take advantage of a message
broker and utilize all of the asynchronous power that comes with it,
we needed to pick which broker.
■ We require a low operations burden
■ The ability to scale without delicate reconfiguration
■ Fast request-response performance
Why not RabbitMQ?
Rabbit has a reputation for reliability and speed, and some of the team
members had used it before. One of the main reasons we decided against
Rabbit was the explicit nature of its fanout exchanges.
■ Requires explicit definition of queues, subscriptions, and exchanges
■ Not well suited to multi-tenant systems
■ We needed the ability to add instances/subscribers without reconfiguring the broker
NATS Security
Neither Rabbit nor Kafka gave us the kind of security support we
needed. We needed the ability to explicitly control which clients can
publish to which topics and which clients can subscribe to those
topics.
■ Ability to inject security information without taking the broker down
■ Flexibility to work with nkeys, an asymmetric encryption key system
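A hedged sketch of a client authenticating with a credentials file issued under an
operator/account (e.g. generated with the nsc tool); the server URL and .creds path
are placeholders:

```python
import asyncio
import nats

async def main():
    # Assumed: credentials issued under an operator/account hierarchy;
    # the .creds file bundles the signed user JWT and the nkey seed.
    nc = await nats.connect(
        "tls://nats.example.com:4222",
        user_credentials="/etc/nats/creds/tenant-a.creds",
    )
    # The account's permissions decide which subjects this user may publish
    # to or subscribe on; violations are rejected by the server.
    await nc.publish("tenant-a.orders", b"hello")
    await nc.drain()

asyncio.run(main())
```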
Comparison Matrix
The following is a summary of the satisfaction of requirements for each
of the options.
References
■ Apache Kafka Website: https://kafka.apache.org
■ NATS Website: https://nats.io
■ AWS SQS: https://aws.amazon.com/sqs/
■ RabbitMQ Website: https://www.rabbitmq.com
■ Benchmarking Message Queue Latency:
https://bravenewgeek.com/benchmarking-message-queue-latency/
Thank you. Stay in touch!
Any questions?
Glen Gomez Zuazo
g_gomez_zuazo@hotmail.com
@ZuazoGlen

Editor's Notes

  • #3 Event-driven architectures are increasingly part of a complete data transformation solution. Learn how to employ Apache Kafka, Cloud Native Computing Foundation’s NATS, Amazon SQS, or other message queueing technologies. This talks covers the details of each, their advantages and disadvantages and how to select the best for your company’s needs.
  • #5 Notes: Lightbend Akka is beyond the scope of my analysis for this presentation on Capital One applications, but I know that at least one other presenter is going to be speaking about Akka/Scala: Alexandros Bantis from Tubi.tv. Even though it may have been beyond Capital One's consideration, you may wish to mention it in a roundup of popular solutions.
  • #7 Extra Notes: Send, store, and receive messages between software components at any volume, without losing messages or requiring other services to be available. Get started with the AWS console, the Command Line Interface, or an SDK of your choice, and three simple commands.
  • #11 Message locking: When a message is received, it becomes “locked” while being processed. This keeps other computers from processing the message simultaneously. If the message processing fails, the lock will expire and the message will be available again. Queue sharing: Securely share Amazon SQS queues anonymously or with specific AWS accounts. Queue sharing can also be restricted by IP address and time-of-day. Server-side encryption (SSE): Protect the contents of messages in Amazon SQS queues using keys managed in the AWS Key Management Service (AWS KMS). SSE encrypts messages as soon as Amazon SQS receives them. The messages are stored in encrypted form and Amazon SQS decrypts messages only when they are sent to an authorized consumer. Dead Letter Queues (DLQ): Handle messages that have not been successfully processed by a consumer with Dead Letter Queues. When the maximum receive count is exceeded for a message it will be moved to the DLQ associated with the original queue. Set up separate consumer processes for DLQs which can help analyze and understand why messages are getting stuck. DLQs must be of the same type as the source queue (standard or FIFO).
  • #16 In a solution where every service requires NATS to be available in order to function, we clearly need to ensure that NATS meets or exceeds our Top Resiliency Tier level SLAs. To do this, we'll deploy a NATS cluster as a global entity with NATS gateways used to connect east and west. Both NATS and System proper will be deployed active/active. It is assumed that there is a geographically pinned single point of entry into each cluster in all of these scenarios as per standard AWS practices. In "classic" active-active scenarios, you have two or more completely isolated mirrors. These two geolocated clusters are completely unaware of each other. Independent component failure is isolated within a region, and in the case of an entire region failure, routes are updated to direct all traffic to the other remaining regions.
  • #17 The NATS account model also comes with an explicit and secure by default means of allowing communication between accounts. As an account owner, you can export either a stream (write-only from the account, read-only to subscribers) or a service (read/write). When you export your service or stream, you can choose to do so as a public or a private export. A public export allows any authorized account to import that subject. A private export requires an explicit, out of band delivery of an activation token to the account wishing to import. Without this token, an account cannot import a private export. What this boils down to is that, with some facilitation by a service to generate keys and tokens, tenants can manage their own topic namespaces, their own users (connected clients), and their own imports/exports with no manual operations overhead. We get security by default, decentralized configuration, self-service secure message exchange, and a "service marketplace" where account (tenant) owners can browse exported subjects and add requests like a shopping cart.
  • #20 In a multi-tenant system that sits on top of a central messaging system, one of our main concerns was not just the security of clients and the message traffic, but in maintaining configuration. If we had to re-write a configuration file and send an update signal to a server every time we added or removed a tenant, this would become a maintenance nightmare. This would be compounded even more with two multi-tenant systems running in the same cluster (e.g. K8s tenants co-existing with ECS tenants). In a decentralized model, clients authenticate to NATS with signed user JWTs. There is a hierarchy that goes from Operator to Account to User. In NATS, an account is a unit of isolation and a user is a unit of client authentication and authorization. This decentralized security model actually solves a number of other problems we would have inevitably run into.
  • #23 RabbitMQ is an open-source message-broker software (sometimes called message-oriented middleware) that originally implemented the Advanced Message Queuing Protocol (AMQP) and has since been extended with a plug-in architecture to support Streaming Text Oriented Messaging Protocol (STOMP), Message Queuing Telemetry Transport (MQTT), and other protocols. The architecture diagram slide shows the resulting RabbitMQ design.
  • #27 One of the best features of Kafka is that it is highly available, resilient to node failures, and supports automatic recovery. This makes Apache Kafka ideal for communication and integration between components of large-scale, real-world data systems.
  • #37 Point to point operations are generally synchronous, though you can accomplish some decent asynchronous operations with gRPC streaming. Finally, point-to-point means that no interested parties can become aware of communications unless the sender goes out of its way to make multiple P2P connections or emit secondary events. Our thought is if you're going to emit secondary events, why not build the entire substrate out of asynchronous messaging, skipping point to point altogether? Service discovery, especially explicit discovery requiring a discovery broker like Netflix Eureka, introduces a new single point of failure to the entire system and, even when working perfectly, introduces the latency cost of at least one more network hop (if you're caching, then you have to deal with the consequences of outdated discovery data).
  • #38 Because of the history and precedent of using Kafka within Capital One, including its role as the backbone behind the Streaming Data Platform (SDP), we considered using Kafka for our broker. Once we decided that we wanted to take advantage of a message broker and utilize all of the asynchronous power that comes with it, we needed to pick which broker. There are a number of critical reasons why we chose against Kafka. First and foremost, we wanted a low operations burden and Kafka is anything but that. Further, we need the ability to scale our services and to dynamically add new topics and new subscribers live, at runtime, in production, without having to perform delicate reconfiguration. Because of the way Kafka works, we would have to reconfigure partitions and topics manually or through some form of potentially brittle automation. You can't simply scale up and down subscribers and publishers without altering Kafka configuration accordingly. We also needed incredibly fast request-response performance. We wanted the flexibility of an asynchronous substrate without sacrificing synchronous point-to-point performance. We could not get that with Kafka and NATS outperformed Kafka for non-durable messages in every benchmark.
  • #39 Because of the history and precedent of using Kafka within Capital One, including its role as the backbone behind the Streaming Data Platform (SDP), we considered using Kafka for our broker. With Rabbit, clients must explicitly define the queues and subscriptions and exchanges in use when they connect. This can be problematic and create problems in multi-tenant systems. We needed a system where we could dynamically scale the number of instances of a queue subscriber AND add more subscribers to the same queue without negatively impacting existing service or requiring a reconfiguration (manual or automatic) of the message broker.
  • #40 Because the client list is external to the message broker (a 1:1 correlation with tenant services), this security information needs to be injectable into the broker cluster, no matter how many instances of the broker are running, without ever taking the broker down in production. NATS security not only gives us this, but lets us work with nkeys, an incredibly powerful asymmetric encryption key system that is less vulnerable to attack than traditional SSH keys and can allow security information to easily flow from a Kubernetes secret to tenant services and the broker configuration.