SlideShare a Scribd company logo
1 of 33
© 2016 24/7 CUSTOMER, INC.
BIG DATA BANGALORE JAN MEETUP - 24/7 CUSTOMER, INC.
Recipes for building resilient cross-
DC data pipeline with Kafka
Sr. Engineering Manager - Big Data
Platform
Suneet Grover
© 2016 24/7 CUSTOMER, INC. 2
About [24]7
© 2016 24/7 CUSTOMER, INC.
Today’s engagement is not driving successful moments
3
Q&A
IVR
© 2016 24/7 CUSTOMER, INC.
Smart Customer Engagement
4
Data-Driven
Reflecting All
Available Data
Click here to see [24]7 in action
Video available at http://player.vimeo.com/video/85280070
Predictive
Real-time
Decisions
Omni-channel
Across Digital
& Voice
Personalized
User Experience
© 2016 24/7 CUSTOMER, INC.
Intent-driven
engagement
Anticipate consumer intent
Holistic experience across channels
Delivering the right moments
to
Move from
Channel-centric
engagement
Reacting to consumer behavior
Disconnected, fragmented channels
Too many failed experiences
5
© 2016 24/7 CUSTOMER, INC.
[24]7 by the numbers
6
1.2bsmart speech
calls/year
127mvirtual agent
inquiries/year
30magent
chats/year
341mweb visitors
/month
5000+digital chat agents
(#1 WW)
70+data scientists
(most in industry)
100+patents
300+software engineers &
designers
© 2016 24/7 CUSTOMER, INC.
Agenda
• Introduction to Kafka
• Kafka at [24]7
• From problems to solutions
• Transparency and Resiliency
• Metrics Demo
• Design for multiple data centers
7
© 2016 24/7 CUSTOMER, INC. 8
Introduction to Kafka
© 2016 24/7 CUSTOMER, INC.
Apache Kafka
• Distributed
• High performance and throughput
• Streaming platform, pub/sub system
9
© 2016 24/7 CUSTOMER, INC.
Topic and Partitions
10
© 2016 24/7 CUSTOMER, INC.
Producers Consumers
11
© 2016 24/7 CUSTOMER, INC.
Kafka setup across DCs
12
Brokers
Region1 Region 2
Mirrormakers
Zookeepers
Brokers
Mirrormakers
Zookeepers
© 2016 24/7 CUSTOMER, INC. 13
Kafka at [24]7
© 2016 24/7 CUSTOMER, INC. 14
Intent Prediction
Data Analytics
Business Intelligence
© 2016 24/7 CUSTOMER, INC. 15
From problems to solutions
© 2016 24/7 CUSTOMER, INC.
Challenges with Kafka 0.8.0
• Broker partition stickiness does not allow to scale
• ZK load and latencies keep increasing
• Range based mirror-maker algorithm not optimal
• Stale topics cannot be deleted
• Controller can get into a stuck state
• Conflict errors in mirror-makers
• Socket leaks leading to open file descriptors
16
© 2016 24/7 CUSTOMER, INC.
Learnings from Kafka 0.8.0
• If the controller gets into a stuck state, delete the “/controller”
node from zookeeper
• Always do clean shutdown and restart of brokers
• Some issues are not always visible as errors or warnings
• Run ZK on SSD
17
© 2016 24/7 CUSTOMER, INC.
Kafka 0.10
18
• Very stable release
• Easy to do in-place from 0.8.2 onwards
• Better client APIs
• Richer admin operations
© 2016 24/7 CUSTOMER, INC.
Broker configurations that worked for us
19
• default.replication.factor = 3
• num.partitions = 2
• delete.topic.enable = true
• auto.leader.rebalance.enable = true
• controlled.shutdown.enable = true
• queued.max.requests = 1000
© 2016 24/7 CUSTOMER, INC.© 2016 24/7 CUSTOMER, INC. 20
Transparency and
Resiliency
© 2016 24/7 CUSTOMER, INC.
Metrics flow
21
Grafana
Graphite
Kafka Broker
Metrics Reporter
Kafka MM JMXTrans
Zookeeper
Host level
Metrics & Alerts
Lag monitor
ELK
© 2016 24/7 CUSTOMER, INC.
Essential Broker Metrics
• Disk, CPU and throughput utilization
• Ingress, egress volume per broker and topic
• Active controller count
• Offline partitions
• Under replicated partitions
• Partitions per broker
• Log flush rate
22
© 2016 24/7 CUSTOMER, INC.
Basic Alerts
23
• Disk, CPU utilization
• Open file handles
• Controller count
• Controller re-elections
• Under replicated partitions
• Offline partitions
• Stuck pending commands in zookeeper
• Conflicts in mirror-makers
© 2016 24/7 CUSTOMER, INC.
JMXTrans
24
• Push mirror-maker metrics to graphite
• Throughput per topic, per thread, per instance etc.
• WaitOnTake, WaitOnPut
• Push zookeeper metrics to graphite
• Latency, quorum, connections etc.
© 2016 24/7 CUSTOMER, INC.
Data Lag Monitoring
25
• Measures the event level time delay
• Monitors data latencies per cluster, per topic, per partition
• Latencies between multiple steps in Kafka pipeline
• Optimize and configure sampling ratio
• Supports multiple message formats json, avro etc.
• Alerts based on pre-defined thresholds
© 2016 24/7 CUSTOMER, INC.
Indicative Broker Metrics
• Request Metrics
• Local Time
• Remote Time
• Queue Time
• Request Handler Idle Percent
• Network Processor Idle Percent
26
© 2016 24/7 CUSTOMER, INC. 27
Now some demo
© 2016 24/7 CUSTOMER, INC.© 2016 24/7 CUSTOMER, INC. 28
Design for Multiple Data
Centers
© 2016 24/7 CUSTOMER, INC.
Range Based Mirror Makers
1000
181
14
5
1
10
100
1000
Consumer 1 Consumer 2 Consumer 3 Consumer 4
Skewed Partition Assignment
Num Partitions
29
© 2016 24/7 CUSTOMER, INC.
Round Robin Mirror Makers
0
50
100
150
200
250
300
350
Consumer 1 Consumer 2 Consumer 3 Consumer 4
Uniform Partition Assignment
Num Partitions
30
© 2016 24/7 CUSTOMER, INC.
Mirror-maker fine tuning
• Round Robin works better than Range based in most cases
• Spread out the topics in multiple MM consumer groups
• If you have a few large volume topics
• Negative regex works with whitelist parameter
• Doesn’t help to have too many MM consumer threads
• Tune socket buffer size (doesn’t apply unless OS allows)
• MM - socket.receive.buffer.bytes = 1048576
• Broker - socket.send.buffer.bytes = 1048576
31
© 2016 24/7 CUSTOMER, INC. 32
We are hiring!!!
For current open positions, please log onto our careers web page
http://www.247-inc.com/
Company>Careers>Location
For further details, Please reach out to:
Achappa C B - achappa.cb@247-inc.com, M: +91-7338458247
© 2016 24/7 CUSTOMER, INC. 33

More Related Content

What's hot

Testing Event Driven Architectures: How to Broker the Complexity | Frank Kilc...
Testing Event Driven Architectures: How to Broker the Complexity | Frank Kilc...Testing Event Driven Architectures: How to Broker the Complexity | Frank Kilc...
Testing Event Driven Architectures: How to Broker the Complexity | Frank Kilc...HostedbyConfluent
 
A guide through the Azure Messaging services - Update Conference
A guide through the Azure Messaging services - Update ConferenceA guide through the Azure Messaging services - Update Conference
A guide through the Azure Messaging services - Update ConferenceEldert Grootenboer
 
Feed Your SIEM Smart with Kafka Connect (Vitalii Rudenskyi, McKesson Corp) Ka...
Feed Your SIEM Smart with Kafka Connect (Vitalii Rudenskyi, McKesson Corp) Ka...Feed Your SIEM Smart with Kafka Connect (Vitalii Rudenskyi, McKesson Corp) Ka...
Feed Your SIEM Smart with Kafka Connect (Vitalii Rudenskyi, McKesson Corp) Ka...HostedbyConfluent
 
Microservices in the Apache Kafka Ecosystem
Microservices in the Apache Kafka EcosystemMicroservices in the Apache Kafka Ecosystem
Microservices in the Apache Kafka Ecosystemconfluent
 
Agile Data Integration: How is it possible?
Agile Data Integration: How is it possible?Agile Data Integration: How is it possible?
Agile Data Integration: How is it possible?confluent
 
The Data Dichotomy- Rethinking the Way We Treat Data and Services
The Data Dichotomy- Rethinking the Way We Treat Data and ServicesThe Data Dichotomy- Rethinking the Way We Treat Data and Services
The Data Dichotomy- Rethinking the Way We Treat Data and Servicesconfluent
 
Apache Flink, AWS Kinesis, Analytics
Apache Flink, AWS Kinesis, Analytics Apache Flink, AWS Kinesis, Analytics
Apache Flink, AWS Kinesis, Analytics Araf Karsh Hamid
 
Microservices with Kafka Ecosystem
Microservices with Kafka EcosystemMicroservices with Kafka Ecosystem
Microservices with Kafka EcosystemGuido Schmutz
 
Jay Kreps | Kafka Summit NYC 2019 Keynote (Events Everywhere) | CEO, Confluent
Jay Kreps | Kafka Summit NYC 2019 Keynote (Events Everywhere) | CEO, ConfluentJay Kreps | Kafka Summit NYC 2019 Keynote (Events Everywhere) | CEO, Confluent
Jay Kreps | Kafka Summit NYC 2019 Keynote (Events Everywhere) | CEO, Confluentconfluent
 
Live Event Debugging With ksqlDB at Reddit | Hannah Hagen and Paul Kiernan, R...
Live Event Debugging With ksqlDB at Reddit | Hannah Hagen and Paul Kiernan, R...Live Event Debugging With ksqlDB at Reddit | Hannah Hagen and Paul Kiernan, R...
Live Event Debugging With ksqlDB at Reddit | Hannah Hagen and Paul Kiernan, R...HostedbyConfluent
 
James Watters, Pivotal | Kafka Summit NYC 2019 Keynote (Spring Boot+Kafka: Th...
James Watters, Pivotal | Kafka Summit NYC 2019 Keynote (Spring Boot+Kafka: Th...James Watters, Pivotal | Kafka Summit NYC 2019 Keynote (Spring Boot+Kafka: Th...
James Watters, Pivotal | Kafka Summit NYC 2019 Keynote (Spring Boot+Kafka: Th...confluent
 
Data Mess to Data Mesh | Jay Kreps, CEO, Confluent | Kafka Summit Americas 20...
Data Mess to Data Mesh | Jay Kreps, CEO, Confluent | Kafka Summit Americas 20...Data Mess to Data Mesh | Jay Kreps, CEO, Confluent | Kafka Summit Americas 20...
Data Mess to Data Mesh | Jay Kreps, CEO, Confluent | Kafka Summit Americas 20...HostedbyConfluent
 
Removing performance bottlenecks with Kafka Monitoring and topic configuration
Removing performance bottlenecks with Kafka Monitoring and topic configurationRemoving performance bottlenecks with Kafka Monitoring and topic configuration
Removing performance bottlenecks with Kafka Monitoring and topic configurationKnoldus Inc.
 
Building Microservices with Apache Kafka
Building Microservices with Apache KafkaBuilding Microservices with Apache Kafka
Building Microservices with Apache Kafkaconfluent
 
Transformation During a Global Pandemic | Ashish Pandit and Scott Lee, Univer...
Transformation During a Global Pandemic | Ashish Pandit and Scott Lee, Univer...Transformation During a Global Pandemic | Ashish Pandit and Scott Lee, Univer...
Transformation During a Global Pandemic | Ashish Pandit and Scott Lee, Univer...HostedbyConfluent
 
Kafka Summit SF 2017 - Building Event-Driven Services with Stateful Streams
Kafka Summit SF 2017 - Building Event-Driven Services with Stateful StreamsKafka Summit SF 2017 - Building Event-Driven Services with Stateful Streams
Kafka Summit SF 2017 - Building Event-Driven Services with Stateful Streamsconfluent
 
Introducing Confluent Cloud: Apache Kafka as a Service
Introducing Confluent Cloud: Apache Kafka as a Service Introducing Confluent Cloud: Apache Kafka as a Service
Introducing Confluent Cloud: Apache Kafka as a Service confluent
 
Lessons from the field: Catalog of Kafka Deployments | Joseph Niemiec, Cloudera
Lessons from the field: Catalog of Kafka Deployments | Joseph Niemiec, ClouderaLessons from the field: Catalog of Kafka Deployments | Joseph Niemiec, Cloudera
Lessons from the field: Catalog of Kafka Deployments | Joseph Niemiec, ClouderaHostedbyConfluent
 
Developing custom transformation in the Kafka connect to minimize data redund...
Developing custom transformation in the Kafka connect to minimize data redund...Developing custom transformation in the Kafka connect to minimize data redund...
Developing custom transformation in the Kafka connect to minimize data redund...HostedbyConfluent
 
Fan-out, fan-in & the multiplexer: Replication recipes for global platform di...
Fan-out, fan-in & the multiplexer: Replication recipes for global platform di...Fan-out, fan-in & the multiplexer: Replication recipes for global platform di...
Fan-out, fan-in & the multiplexer: Replication recipes for global platform di...HostedbyConfluent
 

What's hot (20)

Testing Event Driven Architectures: How to Broker the Complexity | Frank Kilc...
Testing Event Driven Architectures: How to Broker the Complexity | Frank Kilc...Testing Event Driven Architectures: How to Broker the Complexity | Frank Kilc...
Testing Event Driven Architectures: How to Broker the Complexity | Frank Kilc...
 
A guide through the Azure Messaging services - Update Conference
A guide through the Azure Messaging services - Update ConferenceA guide through the Azure Messaging services - Update Conference
A guide through the Azure Messaging services - Update Conference
 
Feed Your SIEM Smart with Kafka Connect (Vitalii Rudenskyi, McKesson Corp) Ka...
Feed Your SIEM Smart with Kafka Connect (Vitalii Rudenskyi, McKesson Corp) Ka...Feed Your SIEM Smart with Kafka Connect (Vitalii Rudenskyi, McKesson Corp) Ka...
Feed Your SIEM Smart with Kafka Connect (Vitalii Rudenskyi, McKesson Corp) Ka...
 
Microservices in the Apache Kafka Ecosystem
Microservices in the Apache Kafka EcosystemMicroservices in the Apache Kafka Ecosystem
Microservices in the Apache Kafka Ecosystem
 
Agile Data Integration: How is it possible?
Agile Data Integration: How is it possible?Agile Data Integration: How is it possible?
Agile Data Integration: How is it possible?
 
The Data Dichotomy- Rethinking the Way We Treat Data and Services
The Data Dichotomy- Rethinking the Way We Treat Data and ServicesThe Data Dichotomy- Rethinking the Way We Treat Data and Services
The Data Dichotomy- Rethinking the Way We Treat Data and Services
 
Apache Flink, AWS Kinesis, Analytics
Apache Flink, AWS Kinesis, Analytics Apache Flink, AWS Kinesis, Analytics
Apache Flink, AWS Kinesis, Analytics
 
Microservices with Kafka Ecosystem
Microservices with Kafka EcosystemMicroservices with Kafka Ecosystem
Microservices with Kafka Ecosystem
 
Jay Kreps | Kafka Summit NYC 2019 Keynote (Events Everywhere) | CEO, Confluent
Jay Kreps | Kafka Summit NYC 2019 Keynote (Events Everywhere) | CEO, ConfluentJay Kreps | Kafka Summit NYC 2019 Keynote (Events Everywhere) | CEO, Confluent
Jay Kreps | Kafka Summit NYC 2019 Keynote (Events Everywhere) | CEO, Confluent
 
Live Event Debugging With ksqlDB at Reddit | Hannah Hagen and Paul Kiernan, R...
Live Event Debugging With ksqlDB at Reddit | Hannah Hagen and Paul Kiernan, R...Live Event Debugging With ksqlDB at Reddit | Hannah Hagen and Paul Kiernan, R...
Live Event Debugging With ksqlDB at Reddit | Hannah Hagen and Paul Kiernan, R...
 
James Watters, Pivotal | Kafka Summit NYC 2019 Keynote (Spring Boot+Kafka: Th...
James Watters, Pivotal | Kafka Summit NYC 2019 Keynote (Spring Boot+Kafka: Th...James Watters, Pivotal | Kafka Summit NYC 2019 Keynote (Spring Boot+Kafka: Th...
James Watters, Pivotal | Kafka Summit NYC 2019 Keynote (Spring Boot+Kafka: Th...
 
Data Mess to Data Mesh | Jay Kreps, CEO, Confluent | Kafka Summit Americas 20...
Data Mess to Data Mesh | Jay Kreps, CEO, Confluent | Kafka Summit Americas 20...Data Mess to Data Mesh | Jay Kreps, CEO, Confluent | Kafka Summit Americas 20...
Data Mess to Data Mesh | Jay Kreps, CEO, Confluent | Kafka Summit Americas 20...
 
Removing performance bottlenecks with Kafka Monitoring and topic configuration
Removing performance bottlenecks with Kafka Monitoring and topic configurationRemoving performance bottlenecks with Kafka Monitoring and topic configuration
Removing performance bottlenecks with Kafka Monitoring and topic configuration
 
Building Microservices with Apache Kafka
Building Microservices with Apache KafkaBuilding Microservices with Apache Kafka
Building Microservices with Apache Kafka
 
Transformation During a Global Pandemic | Ashish Pandit and Scott Lee, Univer...
Transformation During a Global Pandemic | Ashish Pandit and Scott Lee, Univer...Transformation During a Global Pandemic | Ashish Pandit and Scott Lee, Univer...
Transformation During a Global Pandemic | Ashish Pandit and Scott Lee, Univer...
 
Kafka Summit SF 2017 - Building Event-Driven Services with Stateful Streams
Kafka Summit SF 2017 - Building Event-Driven Services with Stateful StreamsKafka Summit SF 2017 - Building Event-Driven Services with Stateful Streams
Kafka Summit SF 2017 - Building Event-Driven Services with Stateful Streams
 
Introducing Confluent Cloud: Apache Kafka as a Service
Introducing Confluent Cloud: Apache Kafka as a Service Introducing Confluent Cloud: Apache Kafka as a Service
Introducing Confluent Cloud: Apache Kafka as a Service
 
Lessons from the field: Catalog of Kafka Deployments | Joseph Niemiec, Cloudera
Lessons from the field: Catalog of Kafka Deployments | Joseph Niemiec, ClouderaLessons from the field: Catalog of Kafka Deployments | Joseph Niemiec, Cloudera
Lessons from the field: Catalog of Kafka Deployments | Joseph Niemiec, Cloudera
 
Developing custom transformation in the Kafka connect to minimize data redund...
Developing custom transformation in the Kafka connect to minimize data redund...Developing custom transformation in the Kafka connect to minimize data redund...
Developing custom transformation in the Kafka connect to minimize data redund...
 
Fan-out, fan-in & the multiplexer: Replication recipes for global platform di...
Fan-out, fan-in & the multiplexer: Replication recipes for global platform di...Fan-out, fan-in & the multiplexer: Replication recipes for global platform di...
Fan-out, fan-in & the multiplexer: Replication recipes for global platform di...
 

Viewers also liked

Getting started with Azure Event Hubs and Stream Analytics services
Getting started with Azure Event Hubs and Stream Analytics servicesGetting started with Azure Event Hubs and Stream Analytics services
Getting started with Azure Event Hubs and Stream Analytics servicesVladimir Bychkov
 
London Apache Kafka Meetup (Jan 2017)
London Apache Kafka Meetup (Jan 2017)London Apache Kafka Meetup (Jan 2017)
London Apache Kafka Meetup (Jan 2017)Landoop Ltd
 
Not Only Streams for Akademia JLabs
Not Only Streams for Akademia JLabsNot Only Streams for Akademia JLabs
Not Only Streams for Akademia JLabsKonrad Malawski
 
Processing IoT Data with Apache Kafka
Processing IoT Data with Apache KafkaProcessing IoT Data with Apache Kafka
Processing IoT Data with Apache KafkaMatthew Howlett
 
IoT Connected Brewery
IoT Connected BreweryIoT Connected Brewery
IoT Connected BreweryJason Hubbard
 
Strata+Hadoop 2017 San Jose - The Rise of Real Time: Apache Kafka and the Str...
Strata+Hadoop 2017 San Jose - The Rise of Real Time: Apache Kafka and the Str...Strata+Hadoop 2017 San Jose - The Rise of Real Time: Apache Kafka and the Str...
Strata+Hadoop 2017 San Jose - The Rise of Real Time: Apache Kafka and the Str...confluent
 
Apache kafka-a distributed streaming platform
Apache kafka-a distributed streaming platformApache kafka-a distributed streaming platform
Apache kafka-a distributed streaming platformconfluent
 
Extracting Insights from Data at Twitter
Extracting Insights from Data at TwitterExtracting Insights from Data at Twitter
Extracting Insights from Data at TwitterPrasad Wagle
 
Building Reactive Fast Data & the Data Lake with Akka, Kafka, Spark
Building Reactive Fast Data & the Data Lake with Akka, Kafka, SparkBuilding Reactive Fast Data & the Data Lake with Akka, Kafka, Spark
Building Reactive Fast Data & the Data Lake with Akka, Kafka, SparkTodd Fritz
 
Big Data Day LA 2015 - Always-on Ingestion for Data at Scale by Arvind Prabha...
Big Data Day LA 2015 - Always-on Ingestion for Data at Scale by Arvind Prabha...Big Data Day LA 2015 - Always-on Ingestion for Data at Scale by Arvind Prabha...
Big Data Day LA 2015 - Always-on Ingestion for Data at Scale by Arvind Prabha...Data Con LA
 
IoT Innovation Lab Berlin @relayr - Kay Lerch on Getting basics right for you...
IoT Innovation Lab Berlin @relayr - Kay Lerch on Getting basics right for you...IoT Innovation Lab Berlin @relayr - Kay Lerch on Getting basics right for you...
IoT Innovation Lab Berlin @relayr - Kay Lerch on Getting basics right for you...Kay Lerch
 
Introduction to Structured Streaming
Introduction to Structured StreamingIntroduction to Structured Streaming
Introduction to Structured StreamingKnoldus Inc.
 
Dataflow with Apache NiFi - Apache NiFi Meetup - 2016 Hadoop Summit - San Jose
Dataflow with Apache NiFi - Apache NiFi Meetup - 2016 Hadoop Summit - San JoseDataflow with Apache NiFi - Apache NiFi Meetup - 2016 Hadoop Summit - San Jose
Dataflow with Apache NiFi - Apache NiFi Meetup - 2016 Hadoop Summit - San JoseAldrin Piri
 
Comparison of various streaming technologies
Comparison of various streaming technologiesComparison of various streaming technologies
Comparison of various streaming technologiesSachin Aggarwal
 
Learnings Using Spark Streaming and DataFrames for Walmart Search: Spark Summ...
Learnings Using Spark Streaming and DataFrames for Walmart Search: Spark Summ...Learnings Using Spark Streaming and DataFrames for Walmart Search: Spark Summ...
Learnings Using Spark Streaming and DataFrames for Walmart Search: Spark Summ...Spark Summit
 
Real-Time Event & Stream Processing on MS Azure
Real-Time Event & Stream Processing on MS AzureReal-Time Event & Stream Processing on MS Azure
Real-Time Event & Stream Processing on MS AzureKhalid Salama
 
Reactive integrations with Akka Streams
Reactive integrations with Akka StreamsReactive integrations with Akka Streams
Reactive integrations with Akka StreamsKonrad Malawski
 
Achieving Real-time Ingestion and Analysis of Security Events through Kafka a...
Achieving Real-time Ingestion and Analysis of Security Events through Kafka a...Achieving Real-time Ingestion and Analysis of Security Events through Kafka a...
Achieving Real-time Ingestion and Analysis of Security Events through Kafka a...Kevin Mao
 

Viewers also liked (20)

Getting started with Azure Event Hubs and Stream Analytics services
Getting started with Azure Event Hubs and Stream Analytics servicesGetting started with Azure Event Hubs and Stream Analytics services
Getting started with Azure Event Hubs and Stream Analytics services
 
Storm over gearpump
Storm over gearpumpStorm over gearpump
Storm over gearpump
 
London Apache Kafka Meetup (Jan 2017)
London Apache Kafka Meetup (Jan 2017)London Apache Kafka Meetup (Jan 2017)
London Apache Kafka Meetup (Jan 2017)
 
Kafka connect
Kafka connectKafka connect
Kafka connect
 
Not Only Streams for Akademia JLabs
Not Only Streams for Akademia JLabsNot Only Streams for Akademia JLabs
Not Only Streams for Akademia JLabs
 
Processing IoT Data with Apache Kafka
Processing IoT Data with Apache KafkaProcessing IoT Data with Apache Kafka
Processing IoT Data with Apache Kafka
 
IoT Connected Brewery
IoT Connected BreweryIoT Connected Brewery
IoT Connected Brewery
 
Strata+Hadoop 2017 San Jose - The Rise of Real Time: Apache Kafka and the Str...
Strata+Hadoop 2017 San Jose - The Rise of Real Time: Apache Kafka and the Str...Strata+Hadoop 2017 San Jose - The Rise of Real Time: Apache Kafka and the Str...
Strata+Hadoop 2017 San Jose - The Rise of Real Time: Apache Kafka and the Str...
 
Apache kafka-a distributed streaming platform
Apache kafka-a distributed streaming platformApache kafka-a distributed streaming platform
Apache kafka-a distributed streaming platform
 
Extracting Insights from Data at Twitter
Extracting Insights from Data at TwitterExtracting Insights from Data at Twitter
Extracting Insights from Data at Twitter
 
Building Reactive Fast Data & the Data Lake with Akka, Kafka, Spark
Building Reactive Fast Data & the Data Lake with Akka, Kafka, SparkBuilding Reactive Fast Data & the Data Lake with Akka, Kafka, Spark
Building Reactive Fast Data & the Data Lake with Akka, Kafka, Spark
 
Big Data Day LA 2015 - Always-on Ingestion for Data at Scale by Arvind Prabha...
Big Data Day LA 2015 - Always-on Ingestion for Data at Scale by Arvind Prabha...Big Data Day LA 2015 - Always-on Ingestion for Data at Scale by Arvind Prabha...
Big Data Day LA 2015 - Always-on Ingestion for Data at Scale by Arvind Prabha...
 
IoT Innovation Lab Berlin @relayr - Kay Lerch on Getting basics right for you...
IoT Innovation Lab Berlin @relayr - Kay Lerch on Getting basics right for you...IoT Innovation Lab Berlin @relayr - Kay Lerch on Getting basics right for you...
IoT Innovation Lab Berlin @relayr - Kay Lerch on Getting basics right for you...
 
Introduction to Structured Streaming
Introduction to Structured StreamingIntroduction to Structured Streaming
Introduction to Structured Streaming
 
Dataflow with Apache NiFi - Apache NiFi Meetup - 2016 Hadoop Summit - San Jose
Dataflow with Apache NiFi - Apache NiFi Meetup - 2016 Hadoop Summit - San JoseDataflow with Apache NiFi - Apache NiFi Meetup - 2016 Hadoop Summit - San Jose
Dataflow with Apache NiFi - Apache NiFi Meetup - 2016 Hadoop Summit - San Jose
 
Comparison of various streaming technologies
Comparison of various streaming technologiesComparison of various streaming technologies
Comparison of various streaming technologies
 
Learnings Using Spark Streaming and DataFrames for Walmart Search: Spark Summ...
Learnings Using Spark Streaming and DataFrames for Walmart Search: Spark Summ...Learnings Using Spark Streaming and DataFrames for Walmart Search: Spark Summ...
Learnings Using Spark Streaming and DataFrames for Walmart Search: Spark Summ...
 
Real-Time Event & Stream Processing on MS Azure
Real-Time Event & Stream Processing on MS AzureReal-Time Event & Stream Processing on MS Azure
Real-Time Event & Stream Processing on MS Azure
 
Reactive integrations with Akka Streams
Reactive integrations with Akka StreamsReactive integrations with Akka Streams
Reactive integrations with Akka Streams
 
Achieving Real-time Ingestion and Analysis of Security Events through Kafka a...
Achieving Real-time Ingestion and Analysis of Security Events through Kafka a...Achieving Real-time Ingestion and Analysis of Security Events through Kafka a...
Achieving Real-time Ingestion and Analysis of Security Events through Kafka a...
 

Similar to Blr hadoop meetup

Apache Kafka Bay Area Sep Meetup - 24/7 Customer, Inc.
Apache Kafka Bay Area Sep Meetup - 24/7 Customer, Inc.Apache Kafka Bay Area Sep Meetup - 24/7 Customer, Inc.
Apache Kafka Bay Area Sep Meetup - 24/7 Customer, Inc.Suneet Grover
 
Tokyo AK Meetup Speedtest - Share.pdf
Tokyo AK Meetup Speedtest - Share.pdfTokyo AK Meetup Speedtest - Share.pdf
Tokyo AK Meetup Speedtest - Share.pdfssuser2ae721
 
FreeSBC - A New Approach to the SBC
FreeSBC - A New Approach to the SBCFreeSBC - A New Approach to the SBC
FreeSBC - A New Approach to the SBCAlan Percy
 
FreeSBC - A New Approach to the SBC
FreeSBC - A New Approach to the SBCFreeSBC - A New Approach to the SBC
FreeSBC - A New Approach to the SBCTelcoBridges Inc.
 
Getting Started With ThousandEyes Proof of Concepts: End User Digital Experience
Getting Started With ThousandEyes Proof of Concepts: End User Digital ExperienceGetting Started With ThousandEyes Proof of Concepts: End User Digital Experience
Getting Started With ThousandEyes Proof of Concepts: End User Digital ExperienceThousandEyes
 
VMware Certified Professional 5 - Data Center Virtualization Delta Exam
VMware Certified Professional 5 - Data Center Virtualization Delta ExamVMware Certified Professional 5 - Data Center Virtualization Delta Exam
VMware Certified Professional 5 - Data Center Virtualization Delta ExamArchana Parameshwari
 
Praxistaugliche notes strategien 4 cloud
Praxistaugliche notes strategien 4 cloudPraxistaugliche notes strategien 4 cloud
Praxistaugliche notes strategien 4 cloudRoman Weber
 
SRE and GitOps for Building Robust Kubernetes Platforms.pdf
SRE and GitOps for Building Robust Kubernetes Platforms.pdfSRE and GitOps for Building Robust Kubernetes Platforms.pdf
SRE and GitOps for Building Robust Kubernetes Platforms.pdfWeaveworks
 
3 Critical Keys to DevOps Success: Lessons from Forrester Research, Intel, an...
3 Critical Keys to DevOps Success: Lessons from Forrester Research, Intel, an...3 Critical Keys to DevOps Success: Lessons from Forrester Research, Intel, an...
3 Critical Keys to DevOps Success: Lessons from Forrester Research, Intel, an...DevOps.com
 
Flexiant and 451 Research Discuss the Service Provider Path from VPS to IaaS
Flexiant and 451 Research Discuss the Service Provider Path from VPS to IaaSFlexiant and 451 Research Discuss the Service Provider Path from VPS to IaaS
Flexiant and 451 Research Discuss the Service Provider Path from VPS to IaaSFlexiant
 
CNCF On-Demand Webinar_ LitmusChaos Project Updates.pdf
CNCF On-Demand Webinar_ LitmusChaos Project Updates.pdfCNCF On-Demand Webinar_ LitmusChaos Project Updates.pdf
CNCF On-Demand Webinar_ LitmusChaos Project Updates.pdfLibbySchulze
 
Getting Started with ThousandEyes Proof of Concepts
Getting Started with ThousandEyes Proof of ConceptsGetting Started with ThousandEyes Proof of Concepts
Getting Started with ThousandEyes Proof of ConceptsThousandEyes
 
Near real-time statistical modeling and anomaly detection using Flink!
Near real-time statistical modeling and anomaly detection using Flink!Near real-time statistical modeling and anomaly detection using Flink!
Near real-time statistical modeling and anomaly detection using Flink!Flink Forward
 
Highly configurable and extensible data processing framework at PubMatic
Highly configurable and extensible data processing framework at PubMaticHighly configurable and extensible data processing framework at PubMatic
Highly configurable and extensible data processing framework at PubMaticDataWorks Summit
 
Valve CPQ challenges & solutions
Valve CPQ challenges & solutionsValve CPQ challenges & solutions
Valve CPQ challenges & solutionsSanjeev Nadkarni
 
Martin Anderson - Fast We Promise
Martin Anderson - Fast We PromiseMartin Anderson - Fast We Promise
Martin Anderson - Fast We Promisebloodredsun
 
Why Generic Configurators dont work in the valve Industry
Why Generic Configurators dont work in the valve IndustryWhy Generic Configurators dont work in the valve Industry
Why Generic Configurators dont work in the valve IndustrySanjeev Nadkarni
 
How to Get the Fastest Possible Citrix Logon Times?
How to Get the Fastest Possible Citrix Logon Times?How to Get the Fastest Possible Citrix Logon Times?
How to Get the Fastest Possible Citrix Logon Times?eG Innovations
 

Similar to Blr hadoop meetup (20)

Apache Kafka Bay Area Sep Meetup - 24/7 Customer, Inc.
Apache Kafka Bay Area Sep Meetup - 24/7 Customer, Inc.Apache Kafka Bay Area Sep Meetup - 24/7 Customer, Inc.
Apache Kafka Bay Area Sep Meetup - 24/7 Customer, Inc.
 
Tokyo AK Meetup Speedtest - Share.pdf
Tokyo AK Meetup Speedtest - Share.pdfTokyo AK Meetup Speedtest - Share.pdf
Tokyo AK Meetup Speedtest - Share.pdf
 
FreeSBC - A New Approach to the SBC
FreeSBC - A New Approach to the SBCFreeSBC - A New Approach to the SBC
FreeSBC - A New Approach to the SBC
 
Wavefront-by-VMware-April-2019
Wavefront-by-VMware-April-2019Wavefront-by-VMware-April-2019
Wavefront-by-VMware-April-2019
 
FreeSBC - A New Approach to the SBC
FreeSBC - A New Approach to the SBCFreeSBC - A New Approach to the SBC
FreeSBC - A New Approach to the SBC
 
Getting Started With ThousandEyes Proof of Concepts: End User Digital Experience
Getting Started With ThousandEyes Proof of Concepts: End User Digital ExperienceGetting Started With ThousandEyes Proof of Concepts: End User Digital Experience
Getting Started With ThousandEyes Proof of Concepts: End User Digital Experience
 
VMware Certified Professional 5 - Data Center Virtualization Delta Exam
VMware Certified Professional 5 - Data Center Virtualization Delta ExamVMware Certified Professional 5 - Data Center Virtualization Delta Exam
VMware Certified Professional 5 - Data Center Virtualization Delta Exam
 
Praxistaugliche notes strategien 4 cloud
Praxistaugliche notes strategien 4 cloudPraxistaugliche notes strategien 4 cloud
Praxistaugliche notes strategien 4 cloud
 
SRE and GitOps for Building Robust Kubernetes Platforms.pdf
SRE and GitOps for Building Robust Kubernetes Platforms.pdfSRE and GitOps for Building Robust Kubernetes Platforms.pdf
SRE and GitOps for Building Robust Kubernetes Platforms.pdf
 
PEnDAR webinar 2 with notes
PEnDAR webinar 2 with notesPEnDAR webinar 2 with notes
PEnDAR webinar 2 with notes
 
3 Critical Keys to DevOps Success: Lessons from Forrester Research, Intel, an...
3 Critical Keys to DevOps Success: Lessons from Forrester Research, Intel, an...3 Critical Keys to DevOps Success: Lessons from Forrester Research, Intel, an...
3 Critical Keys to DevOps Success: Lessons from Forrester Research, Intel, an...
 
Flexiant and 451 Research Discuss the Service Provider Path from VPS to IaaS
Flexiant and 451 Research Discuss the Service Provider Path from VPS to IaaSFlexiant and 451 Research Discuss the Service Provider Path from VPS to IaaS
Flexiant and 451 Research Discuss the Service Provider Path from VPS to IaaS
 
CNCF On-Demand Webinar_ LitmusChaos Project Updates.pdf
CNCF On-Demand Webinar_ LitmusChaos Project Updates.pdfCNCF On-Demand Webinar_ LitmusChaos Project Updates.pdf
CNCF On-Demand Webinar_ LitmusChaos Project Updates.pdf
 
Getting Started with ThousandEyes Proof of Concepts
Getting Started with ThousandEyes Proof of ConceptsGetting Started with ThousandEyes Proof of Concepts
Getting Started with ThousandEyes Proof of Concepts
 
Near real-time statistical modeling and anomaly detection using Flink!
Near real-time statistical modeling and anomaly detection using Flink!Near real-time statistical modeling and anomaly detection using Flink!
Near real-time statistical modeling and anomaly detection using Flink!
 
Highly configurable and extensible data processing framework at PubMatic
Highly configurable and extensible data processing framework at PubMaticHighly configurable and extensible data processing framework at PubMatic
Highly configurable and extensible data processing framework at PubMatic
 
Valve CPQ challenges & solutions
Valve CPQ challenges & solutionsValve CPQ challenges & solutions
Valve CPQ challenges & solutions
 
Martin Anderson - Fast We Promise
Martin Anderson - Fast We PromiseMartin Anderson - Fast We Promise
Martin Anderson - Fast We Promise
 
Why Generic Configurators dont work in the valve Industry
Why Generic Configurators dont work in the valve IndustryWhy Generic Configurators dont work in the valve Industry
Why Generic Configurators dont work in the valve Industry
 
How to Get the Fastest Possible Citrix Logon Times?
How to Get the Fastest Possible Citrix Logon Times?How to Get the Fastest Possible Citrix Logon Times?
How to Get the Fastest Possible Citrix Logon Times?
 

Recently uploaded

TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processorsdebabhi2
 
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...apidays
 
Exploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with MilvusExploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with MilvusZilliz
 
Cyberprint. Dark Pink Apt Group [EN].pdf
Cyberprint. Dark Pink Apt Group [EN].pdfCyberprint. Dark Pink Apt Group [EN].pdf
Cyberprint. Dark Pink Apt Group [EN].pdfOverkill Security
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoffsammart93
 
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...Zilliz
 
AXA XL - Insurer Innovation Award Americas 2024
AXA XL - Insurer Innovation Award Americas 2024AXA XL - Insurer Innovation Award Americas 2024
AXA XL - Insurer Innovation Award Americas 2024The Digital Insurer
 
Artificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyArtificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyKhushali Kathiriya
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native ApplicationsWSO2
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businesspanagenda
 
[BuildWithAI] Introduction to Gemini.pdf
[BuildWithAI] Introduction to Gemini.pdf[BuildWithAI] Introduction to Gemini.pdf
[BuildWithAI] Introduction to Gemini.pdfSandro Moreira
 
MS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectorsMS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectorsNanddeep Nachan
 
Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...apidays
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes
 
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...Jeffrey Haguewood
 
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...apidays
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfsudhanshuwaghmare1
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FMESafe Software
 
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Victor Rentea
 

Recently uploaded (20)

TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
 
Exploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with MilvusExploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with Milvus
 
Cyberprint. Dark Pink Apt Group [EN].pdf
Cyberprint. Dark Pink Apt Group [EN].pdfCyberprint. Dark Pink Apt Group [EN].pdf
Cyberprint. Dark Pink Apt Group [EN].pdf
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
 
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
 
AXA XL - Insurer Innovation Award Americas 2024
AXA XL - Insurer Innovation Award Americas 2024AXA XL - Insurer Innovation Award Americas 2024
AXA XL - Insurer Innovation Award Americas 2024
 
Artificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyArtificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : Uncertainty
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
 
[BuildWithAI] Introduction to Gemini.pdf
[BuildWithAI] Introduction to Gemini.pdf[BuildWithAI] Introduction to Gemini.pdf
[BuildWithAI] Introduction to Gemini.pdf
 
MS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectorsMS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectors
 
Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
 
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
 

Blr hadoop meetup

  • 1. © 2016 24/7 CUSTOMER, INC. BIG DATA BANGALORE JAN MEETUP - 24/7 CUSTOMER, INC. Recipes for building resilient cross- DC data pipeline with Kafka Sr. Engineering Manager - Big Data Platform Suneet Grover
  • 2. © 2016 24/7 CUSTOMER, INC. 2 About [24]7
  • 3. © 2016 24/7 CUSTOMER, INC. Today’s engagement is not driving successful moments 3 Q&A IVR
  • 4. © 2016 24/7 CUSTOMER, INC. Smart Customer Engagement 4 Data-Driven Reflecting All Available Data Click here to see [24]7 in action Video available at http://player.vimeo.com/video/85280070 Predictive Real-time Decisions Omni-channel Across Digital & Voice Personalized User Experience
  • 5. © 2016 24/7 CUSTOMER, INC. Intent-driven engagement Anticipate consumer intent Holistic experience across channels Delivering the right moments to Move from Channel-centric engagement Reacting to consumer behavior Disconnected, fragmented channels Too many failed experiences 5
  • 6. © 2016 24/7 CUSTOMER, INC. [24]7 by the numbers 6 1.2bsmart speech calls/year 127mvirtual agent inquiries/year 30magent chats/year 341mweb visitors /month 5000+digital chat agents (#1 WW) 70+data scientists (most in industry) 100+patents 300+software engineers & designers
  • 7. © 2016 24/7 CUSTOMER, INC. Agenda • Introduction to Kafka • Kafka at [24]7 • From problems to solutions • Transparency and Resiliency • Metrics Demo • Design for multiple data centers 7
  • 8. © 2016 24/7 CUSTOMER, INC. 8 Introduction to Kafka
  • 9. © 2016 24/7 CUSTOMER, INC. Apache Kafka • Distributed • High performance and throughput • Streaming platform, pub/sub system 9
  • 10. © 2016 24/7 CUSTOMER, INC. Topic and Partitions 10
  • 11. © 2016 24/7 CUSTOMER, INC. Producers Consumers 11
  • 12. © 2016 24/7 CUSTOMER, INC. Kafka setup across DCs 12 Brokers Region1 Region 2 Mirrormakers Zookeepers Brokers Mirrormakers Zookeepers
  • 13. © 2016 24/7 CUSTOMER, INC. 13 Kafka at [24]7
  • 14. © 2016 24/7 CUSTOMER, INC. 14 Intent Prediction Data Analytics Business Intelligence
  • 15. © 2016 24/7 CUSTOMER, INC. 15 From problems to solutions
  • 16. © 2016 24/7 CUSTOMER, INC. Challenges with Kafka 0.8.0 • Broker partition stickiness does not allow to scale • ZK load and latencies keep increasing • Range based mirror-maker algorithm not optimal • Stale topics cannot be deleted • Controller can get into a stuck state • Conflict errors in mirror-makers • Socket leaks leading to open file descriptors 16
  • 17. © 2016 24/7 CUSTOMER, INC. Learnings from Kafka 0.8.0 • If the controller gets into a stuck state, delete the “/controller” node from zookeeper • Always do clean shutdown and restart of brokers • Some issues are not always visible as errors or warnings • Run ZK on SSD 17
  • 18. © 2016 24/7 CUSTOMER, INC. Kafka 0.10 18 • Very stable release • Easy to do in-place from 0.8.2 onwards • Better client APIs • Richer admin operations
  • 19. © 2016 24/7 CUSTOMER, INC. Broker configurations that worked for us 19 • default.replication.factor = 3 • num.partitions = 2 • delete.topic.enable = true • auto.leader.rebalance.enable = true • controlled.shutdown.enable = true • queued.max.requests = 1000
  • 20. © 2016 24/7 CUSTOMER, INC.© 2016 24/7 CUSTOMER, INC. 20 Transparency and Resiliency
  • 21. © 2016 24/7 CUSTOMER, INC. Metrics flow 21 Grafana Graphite Kafka Broker Metrics Reporter Kafka MM JMXTrans Zookeeper Host level Metrics & Alerts Lag monitor ELK
  • 22. © 2016 24/7 CUSTOMER, INC. Essential Broker Metrics • Disk, CPU and throughput utilization • Ingress, egress volume per broker and topic • Active controller count • Offline partitions • Under replicated partitions • Partitions per broker • Log flush rate 22
  • 23. © 2016 24/7 CUSTOMER, INC. Basic Alerts 23 • Disk, CPU utilization • Open file handles • Controller count • Controller re-elections • Under replicated partitions • Offline partitions • Stuck pending commands in zookeeper • Conflicts in mirror-makers
  • 24. © 2016 24/7 CUSTOMER, INC. JMXTrans 24 • Push mirror-maker metrics to graphite • Throughput per topic, per thread, per instance etc. • WaitOnTake, WaitOnPut • Push zookeeper metrics to graphite • Latency, quorum, connections etc.
  • 25. © 2016 24/7 CUSTOMER, INC. Data Lag Monitoring 25 • Measures the event level time delay • Monitors data latencies per cluster, per topic, per partition • Latencies between multiple steps in Kafka pipeline • Optimize and configure sampling ratio • Supports multiple message formats json, avro etc. • Alerts based on pre-defined thresholds
  • 26. © 2016 24/7 CUSTOMER, INC. Indicative Broker Metrics • Request Metrics • Local Time • Remote Time • Queue Time • Request Handler Idle Percent • Network Processor Idle Percent 26
  • 27. © 2016 24/7 CUSTOMER, INC. 27 Now some demo
  • 28. © 2016 24/7 CUSTOMER, INC.© 2016 24/7 CUSTOMER, INC. 28 Design for Multiple Data Centers
  • 29. © 2016 24/7 CUSTOMER, INC. Range Based Mirror Makers 1000 181 14 5 1 10 100 1000 Consumer 1 Consumer 2 Consumer 3 Consumer 4 Skewed Partition Assignment Num Partitions 29
  • 30. © 2016 24/7 CUSTOMER, INC. Round Robin Mirror Makers 0 50 100 150 200 250 300 350 Consumer 1 Consumer 2 Consumer 3 Consumer 4 Uniform Partition Assignment Num Partitions 30
  • 31. © 2016 24/7 CUSTOMER, INC. Mirror-maker fine tuning • Round Robin works better than Range based in most cases • Spread out the topics in multiple MM consumer groups • If you have a few large volume topics • Negative regex works with whitelist parameter • Doesn’t help to have too many MM consumer threads • Tune socket buffer size (doesn’t apply unless OS allows) • MM - socket.receive.buffer.bytes = 1048576 • Broker - socket.send.buffer.bytes = 1048576 31
  • 32. © 2016 24/7 CUSTOMER, INC. 32 We are hiring!!! For current open positions, please log onto our careers web page http://www.247-inc.com/ Company>Careers>Location For further details, Please reach out to: Achappa C B - achappa.cb@247-inc.com, M: +91-7338458247
  • 33. © 2016 24/7 CUSTOMER, INC. 33

Editor's Notes

  1. Intro to yourself Credit to the team
  2. Click and let this do the slow build. The key points are: Consumers find it frustrating to cross channels (web, phone, IVR, etc.) because their content is not preserved. So they have to do things like authenticate (user ID, password) multiple times in the same interaction. These types of experiences turn potential Brand Advocates into Detractors who will move to other brands.
  3. This a KEY slide. Emphasize that today’s leading-edge companies – those that consumers love to engage with and have strong brands – are moving to Intent Driven Engagement
  4. Would like viewers read the slide. Then focus on bottom row: 300+ software engineers and designers Most data scientists in the industry 100+ patents point made earlier We are the #1 provider of digital chat agents in the world.
  5. Other features which we haven’t tried Security, Streams etc.