SlideShare a Scribd company logo
1 of 33
Real-time Data Pipelines
with SAP and Apache
Kafka
© 2017 Attunity, Accenture Analytics and Confluent
Amiya Chand
Sr. Manager, Accenture Digital | SAP Data
& Analytics Lead, Accenture
David Tucker
Director Partner Engineering, Confluent
Jordan Martz
Director of Technology Solutions, Attunity
Guest Speakers
© 2017 Attunity, Accenture Analytics and Confluent
• Accenture Analytics - Business in the moment and
imperatives
• Confluent - Apache Kafka
• Attunity - Migrating Data to a Apache Kafka
• All - Q&A
Agenda
• Real-time Data Pipelines with SAP
and Apache Kafka
© 2017 Attunity, Accenture Analytics and Confluent
News from the frontline
Significant investments have been made in traditional data integration and platforms which have
limitations in storage, data latency and performance. These complex systems inhibit ‘speed to
value’ and ‘speed to deploy’
The digitization of everything has the potential to provide organizations with “data about
everything”. Organizations that are able to effectively harness the power of this data to drive insights
actions (at speed) will have a distinct advantage over their competition.
In order to effectively address these challenges and to capitalize on “new” data sources to gain a competitive
advantage, organizations will have to ingest data at the speed of business.
To unlock value within their data, organizations must start treating data as a supply chain,
enabling it to flow more easily through the organization.
© 2017 Attunity, Accenture Analytics and Confluent
Market trends for data & analytics
Complex data-driven environment with significant opportunities to create business value
DEMOCRATIZATION
OF DATA AND DATA
DISCOVERY
NEW DATA
SOURCES
CHANGING SKILLS
REQUIREMENTS
FOCUS ON
ADVANCED
ANALYTICS
BIG DATA AND
HYBRID
ARCHITECTURES
Self-service
analytics, multi-
channel delivery
Addition of social
media, Internet of
Things, unstructured
content, etc.
Evolving patterns,
data-driven point-of-
interaction decisions
shifting to the front
line
New forms, more
complex
architectures on
Premise or Cloud
Pictorial story-
telling, customers
going digital
© 2017 Attunity, Accenture Analytics and Confluent
The changing landscape in SAP data &
analytics
SAP systems are evolving from a ‘system of record’ to ‘system of innovation’
System of Record
System of Insights
System of Innovation
SpeedandAgility
Level of Integration between People, Process and Platform
Pre 2011
Traditional SAP ERP and BI
Reporting platform on-premise
2013-15
Evolution of SAP HANA as a high
performance data base and Analytics
Engine on-premise / Cloud
SAP BW
Any
Database
Enterprise
BI
Any
Database
Any Database
Enterprise
BI
Data
Visualiza
-tion
Advanced
Analytics
SAP ERP
Any
Database
BW /
Native
HANA
HANA
SAP ERP
Any
Database
BW /
Native
HANA
S4
Innovation
Suite on
HANA
Enterprise
BI
Data
Visualiza-
tion
Advanced
Analytics
BOBJ
Cloud
HANA Hadoop/ Any database
2015+
Maturity of SAP HANA as the
Digital Platform on-premise /
Cloud / As-a-Service
Faster insights to
action
1
Bridge the digital
divide
2
Accelerate data
integration
3
BusinessImperative
© 2017 Attunity, Accenture Analytics and Confluent
Enabling the insight-driven enterprise
DataIngestion
DataAccess
Point
Analytics
&
Insights
SourceSystems
Transactional
systems
Business
Applications
Unstructured
Sources
Real time
Batch
DataMarketplace Data Warehouse/ Data
Mart
Data lake
Operational
Reporting &
Self Service
Data Science lab
Cross
Pollination
Data
Exploration
Guided
Analytics &
Advanced
Analytics
Analytics Engine
A POV and Reference Architecture for enabling the ‘insight driven enterprise’
Load to any
target
Enable real-
time analytics
Ingest/replicate
data on demand
Speed to Data
Ingest any data at
the frequency
needed by
business
Speed to Insight
Access to any
data, anywhere
and anytime
Speed to Value
Integrated
platform that is
scalable and
flexible
Data Ingestion
An abstract layer that allows
various data elements – type &
latency to flow into the storage
environment
Data Marketplace
Includes the components
that allow to store &
transform raw data into
information
Data Access
Semantic layer that allows
access to the data for end
consumption – interactive,
blending, low latency
Apache Kafka
© 2017 Attunity, Accenture Analytics and Confluent
How organizations handle data flow: a giant mess
Data
Warehouse
Hadoop
NoSQL
Oracle
SFDC
Logging
Bloomberg
…any sink/source
Web Custom Apps Microservices Monitoring Analytics
…and more
OLTP
ActiveMQ
App App
Caches
OLTP OLTPAppAppApp
© 2017 Attunity, Accenture Analytics and Confluent
Apache Kafka™: a distributed streaming
platform
Apache Kafka
Offline Batch (+1 Hour)Near-Real Time (>100s ms)Real Time (0-100 ms)
Data
Warehouse
Hadoop
NoSQL
Oracle
SFDC
Twitter
Bloomberg
…any sink/source …any sink/source
…and more
Web Custom Apps Microservices Monitoring Analytics
© 2017 Attunity, Accenture Analytics and Confluent
Over 35% of Fortune 500’s are using Apache Kafka™
6 of top 10
Travel
7 of top 10
Global banks
8 of top 10
Insurance
9 of top 10
Telecom
© 2017 Attunity, Accenture Analytics and Confluent
Confluent: a streaming platform based on Apache
Kafka™
Database
Changes
Log Events loT Data Web Events …
CRM
Data Warehouse
Database
Hadoop
Data
Integration
…
Monitoring
Analytics
Custom Apps
Transformations
Real-time
Applications
…
Apache Open Source Confluent Open Source Confluent Commercial
Confluent Platform
Confluent Platform
Apache Kafka™
Core | Connect| Streams
Data Compatibility
Schema Registry
Monitoring & Administration
Confluent Control Center
Operations
Replicator | Auto Data Balancing
Development and Connectivity
Clients | Connectors | REST Proxy
© 2017 Attunity, Accenture Analytics and Confluent
Kafka adoption spans many industries
Financial Services Enterprise Tech Consumer Tech
Entertainment & Media Telecom Retail Travel & Leisure
© 2017 Attunity, Accenture Analytics and Confluent
Apache KafkaTM Connect API – streaming data capture
JDBC
Mongo
MySQL
Elastic
Cassandra
HDFS
Kafka Connect API
Kafka Pipeline
Connector
Connector
Connector
Connector
Connector
Connector
Sources Sinks
Fault tolerant
Manage hundreds of data
sources and sinks
Preserves data schema
Part of Apache Kafka
project
Integrated within Confluent
Platform’s Control Center
© 2017 Attunity, Accenture Analytics and Confluent
Kafka Connect API -part of the Apache Kafka Project
Connect any source to any target system
Integrated
• 100% compatible with Kafka v0.9 and higher
• Integrated with Confluent’s Schema Registry
• Easy to manage with Confluent Control
Center
Flexible
• 40+ open source connectors available
• Easy to develop additional connectors
• Flexible support for data types and formats
Compatible
• Maintains critical metadata
• Preserves schema information
• Supports schema evolution
Reliable
• Automated failover
• At-least-once guaranteed
• Balances workload between nodes
© 2017 Attunity, Accenture Analytics and Confluent
Architecture of Kafka Streams API, a Part of Apache Kafka
Kafka
Streams
API
Producer
Kafka Cluster
Topic TopicTopic
Consumer Consumer
Key benefits
• No additional cluster
• Easy to run as a service
• Supports large aggregations and joins
• Security & permissions fully integrated from
Kafka
Example Use Cases
• Microservices
• Continuous queries
• Continuous transformations
• Event-triggered processes
© 2017 Attunity, Accenture Analytics and Confluent
Kafka Streams API: easiest way to process data in
Apache Kafka
Example Use Cases
• Microservices
• Large-scale continuous queries and transformations
• Event-triggered processes
• Reactive applications
• Customer 360-degree view, fraud detection, location-based
marketing, smart electrical grids, fleet management, …
Key Benefits of Apache Kafka’s Streams API
• Build Apps, Not Clusters: no additional cluster required
• Elastic, highly-performant, distributed, fault-tolerant, secure
• Equally viable for small, medium, and large-scale use cases
• “Run Everywhere”: integrates with your existing deployment
strategies such as containers, automation, cloud
Your App
Kafka
Streams API
© 2017 Attunity, Accenture Analytics and Confluent
Confluent Control Center
• Monitor all pipelines end-
to-end
• Manage and configure
Kafka connectors
• Administer Kafka cluster
and monitor overall
operational health
© 2017 Attunity, Accenture Analytics and Confluent
Why Confluent? More than just enterprise software
Confluent Platform Professional Services Enterprise Support
Complete support across the entire adoption lifecycle
Kafka Training
SAP Data Integration for Apache
Kafka
© 2017 Attunity, Accenture Analytics and Confluent
Attunity – Corporate Overview
Data Integration & Big Data Management Software
Accelerate data delivery and availability
Automate data readiness for analytics
Optimize data management with intelligence
Hadoop & Big Data
Databases & Data Warehouses
On premise & in the Cloud
Solutions for: Global OfficesOverview
2000 customers in 65 countries
250 people and growing
NASDAQ traded (ATTU)
© 2017 Attunity, Accenture Analytics and Confluent
Load to any
target
Enable real-
time analytics
Ingest/replicate
data on demand
Traditional Data Integration Requirements
© 2017 Attunity, Accenture Analytics and Confluent
UniversalReal-TimeSimpler
Attunity Replicate
Universal Platform for Data Replication/Ingestion
#1 independent provider of streaming CDC
© 2017 Attunity, Accenture Analytics and Confluent
Ingest Data Real-time and at Scale into Hadoop
• Stream data with CDC from any major source
into HDFS or Hive
 Data then available to be used in Hbase or by Spark
• Automatically generate schema in HCatalog
• Leverage same UI and process as for other
platforms
• Integrate with native APIs
 No external adapters or custom code requiredHadoop Data Lake
© 2017 Attunity, Accenture Analytics and Confluent
Attunity Replicate Architecture
Attunity
Replicate
Batch
CDC Incremental
Batch
Cloud
On-
prem
Cloud
On-
premPersistent Store
Files
RDBMS
Mainframe
EDW
© 2017 Attunity, Accenture Analytics and Confluent
Attunity Replicate Server
TransformFilter
Batch
CDC Incremental
In-Memory
File Channel
Batch
Attunity Replicate for SAP
Persistent Store
Extract relationships for Pool and Cluster Tables
RDBMS
(Oracle, DB2, etc.)
Redo/
Archive
logs
or
Journal
File
----------------
Transparent
Tables
On Premises
Hadoop RDBMS
Data
WarehouseKafka
Cloud
Attunity Replicate Agent
for SAP
SAP ECC
(Enterprise Central
Component)
Data Model Mapping
Pool/Cluster table RFC
© 2017 Attunity, Accenture Analytics and Confluent
Easier Integration with Attunity Replicate
Go Agile with Automated Processes
• No manual coding or scripting
• Automated end-to-end
• Optimized and configurable
• Target schema creation
• Heterogeneous data type
mapping
• Batch to CDC transition
• DDL change propagation
• Filtering
• Transformations
Hadoop
File
s
RDBMS
Mainframe
Hadoop
Files
RDBMS
Kafka
EDW EDW
© 2017 Attunity, Accenture Analytics and Confluent
Easier Integration with Attunity Replicate
Zero Footprint Architecture
• CDC identifies source updates by
scanning change logs
• No software agents required on
sources or targets
• Minimizes administrative
overhead
• CDC based on change logs
• No agents on source or target
Low Infrastructure Impact
Hadoop
File
s
RDBMS
Mainframe
Hadoop
Files
RDBMS
Kafka
EDW EDW
© 2017 Attunity, Accenture Analytics and Confluent
Universal Integration with Attunity Replicate
Rapidly Move Data Across Complex Hybrid Environments
• Source
 Log-based CDC
 Batch APIs
• WAN
 Encrypted multi-pathing
• Target
 Transactional/batch CDC options
 Native loader integration
TargetsSources
On Premises
Cloud Platform
HadoopRDBMS
Data Warehouse
Hadoop
RDBMS
Data Warehouse
Optimized for…
© 2017 Attunity, Accenture Analytics and Confluent
Demo
attunity.com
Questions?
Thanks!

More Related Content

What's hot

Real Time Data Strategy and Architecture
Real Time Data Strategy and ArchitectureReal Time Data Strategy and Architecture
Real Time Data Strategy and ArchitectureAlan McSweeney
 
Next generation intelligent data lakes, powered by GraphQL & AWS AppSync - MA...
Next generation intelligent data lakes, powered by GraphQL & AWS AppSync - MA...Next generation intelligent data lakes, powered by GraphQL & AWS AppSync - MA...
Next generation intelligent data lakes, powered by GraphQL & AWS AppSync - MA...Amazon Web Services
 
SAP Cloud Platform Integration Services – L1 Deck
SAP Cloud Platform Integration Services – L1 DeckSAP Cloud Platform Integration Services – L1 Deck
SAP Cloud Platform Integration Services – L1 DeckSAP Cloud Platform
 
Architecting Snowflake for High Concurrency and High Performance
Architecting Snowflake for High Concurrency and High PerformanceArchitecting Snowflake for High Concurrency and High Performance
Architecting Snowflake for High Concurrency and High PerformanceSamanthaBerlant
 
Overview of SAP HANA Cloud Platform
Overview of SAP HANA Cloud PlatformOverview of SAP HANA Cloud Platform
Overview of SAP HANA Cloud PlatformVitaliy Rudnytskiy
 
Archive First: An Intelligent Data Archival Strategy, Part 1 of 3
Archive First: An Intelligent Data Archival Strategy, Part 1 of 3Archive First: An Intelligent Data Archival Strategy, Part 1 of 3
Archive First: An Intelligent Data Archival Strategy, Part 1 of 3Hitachi Vantara
 
Solution deck capgemini cloud assessment
Solution deck capgemini cloud assessmentSolution deck capgemini cloud assessment
Solution deck capgemini cloud assessmentAdobe
 
A 30 day plan to start ending your data struggle with Snowflake
A 30 day plan to start ending your data struggle with SnowflakeA 30 day plan to start ending your data struggle with Snowflake
A 30 day plan to start ending your data struggle with SnowflakeSnowflake Computing
 
[웨비나] 우리가 데이터 메시에 주목해야 할 이유
[웨비나] 우리가 데이터 메시에 주목해야 할 이유[웨비나] 우리가 데이터 메시에 주목해야 할 이유
[웨비나] 우리가 데이터 메시에 주목해야 할 이유confluent
 
Apache Kafka® Use Cases for Financial Services
Apache Kafka® Use Cases for Financial ServicesApache Kafka® Use Cases for Financial Services
Apache Kafka® Use Cases for Financial Servicesconfluent
 
SAP Cloud Platform Extension Suite Overview
SAP Cloud Platform Extension Suite OverviewSAP Cloud Platform Extension Suite Overview
SAP Cloud Platform Extension Suite OverviewSAP Cloud Platform
 
Databricks Platform.pptx
Databricks Platform.pptxDatabricks Platform.pptx
Databricks Platform.pptxAlex Ivy
 
Fundamentals Big Data and AI Architecture
Fundamentals Big Data and AI ArchitectureFundamentals Big Data and AI Architecture
Fundamentals Big Data and AI ArchitectureGuido Schmutz
 
High Availability and Disaster Recovery
High Availability and Disaster RecoveryHigh Availability and Disaster Recovery
High Availability and Disaster RecoveryAkelios
 
Multi Tenancy In The Cloud
Multi Tenancy In The CloudMulti Tenancy In The Cloud
Multi Tenancy In The Cloudrohit_ainapure
 
Enabling the Active Data Warehouse with Apache Kudu
Enabling the Active Data Warehouse with Apache KuduEnabling the Active Data Warehouse with Apache Kudu
Enabling the Active Data Warehouse with Apache KuduGrant Henke
 
Splunk Webinar: Full-Stack End-to-End SAP-Monitoring mit Splunk
Splunk Webinar: Full-Stack End-to-End SAP-Monitoring mit SplunkSplunk Webinar: Full-Stack End-to-End SAP-Monitoring mit Splunk
Splunk Webinar: Full-Stack End-to-End SAP-Monitoring mit SplunkSplunk
 
AWS Community Day 2022 Joe Daly FinOps
AWS Community Day 2022 Joe Daly FinOpsAWS Community Day 2022 Joe Daly FinOps
AWS Community Day 2022 Joe Daly FinOpsAWS Chicago
 

What's hot (20)

Real Time Data Strategy and Architecture
Real Time Data Strategy and ArchitectureReal Time Data Strategy and Architecture
Real Time Data Strategy and Architecture
 
Next generation intelligent data lakes, powered by GraphQL & AWS AppSync - MA...
Next generation intelligent data lakes, powered by GraphQL & AWS AppSync - MA...Next generation intelligent data lakes, powered by GraphQL & AWS AppSync - MA...
Next generation intelligent data lakes, powered by GraphQL & AWS AppSync - MA...
 
SAP Cloud Platform Integration Services – L1 Deck
SAP Cloud Platform Integration Services – L1 DeckSAP Cloud Platform Integration Services – L1 Deck
SAP Cloud Platform Integration Services – L1 Deck
 
Architecting Snowflake for High Concurrency and High Performance
Architecting Snowflake for High Concurrency and High PerformanceArchitecting Snowflake for High Concurrency and High Performance
Architecting Snowflake for High Concurrency and High Performance
 
Overview of SAP HANA Cloud Platform
Overview of SAP HANA Cloud PlatformOverview of SAP HANA Cloud Platform
Overview of SAP HANA Cloud Platform
 
Cloud Migration Workshop
Cloud Migration WorkshopCloud Migration Workshop
Cloud Migration Workshop
 
Archive First: An Intelligent Data Archival Strategy, Part 1 of 3
Archive First: An Intelligent Data Archival Strategy, Part 1 of 3Archive First: An Intelligent Data Archival Strategy, Part 1 of 3
Archive First: An Intelligent Data Archival Strategy, Part 1 of 3
 
Solution deck capgemini cloud assessment
Solution deck capgemini cloud assessmentSolution deck capgemini cloud assessment
Solution deck capgemini cloud assessment
 
A 30 day plan to start ending your data struggle with Snowflake
A 30 day plan to start ending your data struggle with SnowflakeA 30 day plan to start ending your data struggle with Snowflake
A 30 day plan to start ending your data struggle with Snowflake
 
[웨비나] 우리가 데이터 메시에 주목해야 할 이유
[웨비나] 우리가 데이터 메시에 주목해야 할 이유[웨비나] 우리가 데이터 메시에 주목해야 할 이유
[웨비나] 우리가 데이터 메시에 주목해야 할 이유
 
Apache Kafka® Use Cases for Financial Services
Apache Kafka® Use Cases for Financial ServicesApache Kafka® Use Cases for Financial Services
Apache Kafka® Use Cases for Financial Services
 
SAP Cloud Platform Extension Suite Overview
SAP Cloud Platform Extension Suite OverviewSAP Cloud Platform Extension Suite Overview
SAP Cloud Platform Extension Suite Overview
 
Databricks Platform.pptx
Databricks Platform.pptxDatabricks Platform.pptx
Databricks Platform.pptx
 
Fundamentals Big Data and AI Architecture
Fundamentals Big Data and AI ArchitectureFundamentals Big Data and AI Architecture
Fundamentals Big Data and AI Architecture
 
High Availability and Disaster Recovery
High Availability and Disaster RecoveryHigh Availability and Disaster Recovery
High Availability and Disaster Recovery
 
Multi Tenancy In The Cloud
Multi Tenancy In The CloudMulti Tenancy In The Cloud
Multi Tenancy In The Cloud
 
Enabling the Active Data Warehouse with Apache Kudu
Enabling the Active Data Warehouse with Apache KuduEnabling the Active Data Warehouse with Apache Kudu
Enabling the Active Data Warehouse with Apache Kudu
 
Splunk Webinar: Full-Stack End-to-End SAP-Monitoring mit Splunk
Splunk Webinar: Full-Stack End-to-End SAP-Monitoring mit SplunkSplunk Webinar: Full-Stack End-to-End SAP-Monitoring mit Splunk
Splunk Webinar: Full-Stack End-to-End SAP-Monitoring mit Splunk
 
AWS Community Day 2022 Joe Daly FinOps
AWS Community Day 2022 Joe Daly FinOpsAWS Community Day 2022 Joe Daly FinOps
AWS Community Day 2022 Joe Daly FinOps
 
Azure storage
Azure storageAzure storage
Azure storage
 

Similar to Real-time Data Pipelines with SAP and Apache Kafka

SAP Data Hub e SUSE Container as a Service Platform
SAP Data Hub e SUSE Container as a Service PlatformSAP Data Hub e SUSE Container as a Service Platform
SAP Data Hub e SUSE Container as a Service PlatformSUSE Italy
 
Catch the Wave: SAP Event-Driven and Data Streaming for the Intelligence Ente...
Catch the Wave: SAP Event-Driven and Data Streaming for the Intelligence Ente...Catch the Wave: SAP Event-Driven and Data Streaming for the Intelligence Ente...
Catch the Wave: SAP Event-Driven and Data Streaming for the Intelligence Ente...confluent
 
Be the Data Hero in Your Organization with SAP and CA Analytic Solutions
Be the Data Hero in Your Organization with SAP and CA Analytic SolutionsBe the Data Hero in Your Organization with SAP and CA Analytic Solutions
Be the Data Hero in Your Organization with SAP and CA Analytic SolutionsCA Technologies
 
Reduce Operating Costs and Accelerate Efficiency by Migrating Your SAP Applic...
Reduce Operating Costs and Accelerate Efficiency by Migrating Your SAP Applic...Reduce Operating Costs and Accelerate Efficiency by Migrating Your SAP Applic...
Reduce Operating Costs and Accelerate Efficiency by Migrating Your SAP Applic...Amazon Web Services
 
HIF Paris 2014 - SAP - SAP HANA : bien plus qu’une base de données en mémoire
HIF Paris 2014 - SAP - SAP HANA : bien plus qu’une base de données en mémoireHIF Paris 2014 - SAP - SAP HANA : bien plus qu’une base de données en mémoire
HIF Paris 2014 - SAP - SAP HANA : bien plus qu’une base de données en mémoireHitachi Data Systems France
 
SAP on AWS: How UCT is Experiencing Better Performance on AWS While Saving 60...
SAP on AWS: How UCT is Experiencing Better Performance on AWS While Saving 60...SAP on AWS: How UCT is Experiencing Better Performance on AWS While Saving 60...
SAP on AWS: How UCT is Experiencing Better Performance on AWS While Saving 60...Amazon Web Services
 
GPSBUS221_Breaking Barriers Move Enterprise SAP Customers to SAP HANA on AWS ...
GPSBUS221_Breaking Barriers Move Enterprise SAP Customers to SAP HANA on AWS ...GPSBUS221_Breaking Barriers Move Enterprise SAP Customers to SAP HANA on AWS ...
GPSBUS221_Breaking Barriers Move Enterprise SAP Customers to SAP HANA on AWS ...Amazon Web Services
 
SAP on AWS: Big Businesses, Big Workloads, Big Time - ENT202 - Chicago AWS Su...
SAP on AWS: Big Businesses, Big Workloads, Big Time - ENT202 - Chicago AWS Su...SAP on AWS: Big Businesses, Big Workloads, Big Time - ENT202 - Chicago AWS Su...
SAP on AWS: Big Businesses, Big Workloads, Big Time - ENT202 - Chicago AWS Su...Amazon Web Services
 
Navigating Your Data Landscape With Siddharth Desai and Elena Cuevas | Curren...
Navigating Your Data Landscape With Siddharth Desai and Elena Cuevas | Curren...Navigating Your Data Landscape With Siddharth Desai and Elena Cuevas | Curren...
Navigating Your Data Landscape With Siddharth Desai and Elena Cuevas | Curren...HostedbyConfluent
 
SAP on AWS: Big Businesses, Big Workloads, Big Time - ENT202 - Chicago AWS Su...
SAP on AWS: Big Businesses, Big Workloads, Big Time - ENT202 - Chicago AWS Su...SAP on AWS: Big Businesses, Big Workloads, Big Time - ENT202 - Chicago AWS Su...
SAP on AWS: Big Businesses, Big Workloads, Big Time - ENT202 - Chicago AWS Su...Amazon Web Services
 
Cloud Integration Services on SAP HANA Cloud Platform
Cloud Integration Services on SAP HANA Cloud PlatformCloud Integration Services on SAP HANA Cloud Platform
Cloud Integration Services on SAP HANA Cloud PlatformMichael Hill
 
MDS ap_OEM Product Portfolio Intorduction to the DT & Analytics
MDS ap_OEM Product Portfolio Intorduction to the DT & AnalyticsMDS ap_OEM Product Portfolio Intorduction to the DT & Analytics
MDS ap_OEM Product Portfolio Intorduction to the DT & AnalyticsMDS ap
 
Bringing the Power of Big Data Computation to Salesforce
Bringing the Power of Big Data Computation to SalesforceBringing the Power of Big Data Computation to Salesforce
Bringing the Power of Big Data Computation to SalesforceSalesforce Developers
 
Apache Spark – The New Enterprise Backbone for ETL, Batch Processing and Real...
Apache Spark – The New Enterprise Backbone for ETL, Batch Processing and Real...Apache Spark – The New Enterprise Backbone for ETL, Batch Processing and Real...
Apache Spark – The New Enterprise Backbone for ETL, Batch Processing and Real...Impetus Technologies
 
Building Custom Advanced Analytics Applications with SAP HANA
Building Custom Advanced Analytics Applications with SAP HANABuilding Custom Advanced Analytics Applications with SAP HANA
Building Custom Advanced Analytics Applications with SAP HANASAP Technology
 
Development to Deployment with SAP HANA
Development to Deployment with SAP HANADevelopment to Deployment with SAP HANA
Development to Deployment with SAP HANACraig Cmehil
 
Streaming Data and Stream Processing with Apache Kafka
Streaming Data and Stream Processing with Apache KafkaStreaming Data and Stream Processing with Apache Kafka
Streaming Data and Stream Processing with Apache Kafkaconfluent
 
Accelerate Self-Service Analytics with Virtualization and Visualisation (Thai)
Accelerate Self-Service Analytics with Virtualization and Visualisation (Thai)Accelerate Self-Service Analytics with Virtualization and Visualisation (Thai)
Accelerate Self-Service Analytics with Virtualization and Visualisation (Thai)Denodo
 
Bring your SAP and Enterprise Data to Hadoop, Apache Kafka and the Cloud
Bring your SAP and Enterprise Data to Hadoop, Apache Kafka and the CloudBring your SAP and Enterprise Data to Hadoop, Apache Kafka and the Cloud
Bring your SAP and Enterprise Data to Hadoop, Apache Kafka and the CloudDataWorks Summit/Hadoop Summit
 
Business intelligence in the era of big data
Business intelligence in the era of big dataBusiness intelligence in the era of big data
Business intelligence in the era of big dataJC Raveneau
 

Similar to Real-time Data Pipelines with SAP and Apache Kafka (20)

SAP Data Hub e SUSE Container as a Service Platform
SAP Data Hub e SUSE Container as a Service PlatformSAP Data Hub e SUSE Container as a Service Platform
SAP Data Hub e SUSE Container as a Service Platform
 
Catch the Wave: SAP Event-Driven and Data Streaming for the Intelligence Ente...
Catch the Wave: SAP Event-Driven and Data Streaming for the Intelligence Ente...Catch the Wave: SAP Event-Driven and Data Streaming for the Intelligence Ente...
Catch the Wave: SAP Event-Driven and Data Streaming for the Intelligence Ente...
 
Be the Data Hero in Your Organization with SAP and CA Analytic Solutions
Be the Data Hero in Your Organization with SAP and CA Analytic SolutionsBe the Data Hero in Your Organization with SAP and CA Analytic Solutions
Be the Data Hero in Your Organization with SAP and CA Analytic Solutions
 
Reduce Operating Costs and Accelerate Efficiency by Migrating Your SAP Applic...
Reduce Operating Costs and Accelerate Efficiency by Migrating Your SAP Applic...Reduce Operating Costs and Accelerate Efficiency by Migrating Your SAP Applic...
Reduce Operating Costs and Accelerate Efficiency by Migrating Your SAP Applic...
 
HIF Paris 2014 - SAP - SAP HANA : bien plus qu’une base de données en mémoire
HIF Paris 2014 - SAP - SAP HANA : bien plus qu’une base de données en mémoireHIF Paris 2014 - SAP - SAP HANA : bien plus qu’une base de données en mémoire
HIF Paris 2014 - SAP - SAP HANA : bien plus qu’une base de données en mémoire
 
SAP on AWS: How UCT is Experiencing Better Performance on AWS While Saving 60...
SAP on AWS: How UCT is Experiencing Better Performance on AWS While Saving 60...SAP on AWS: How UCT is Experiencing Better Performance on AWS While Saving 60...
SAP on AWS: How UCT is Experiencing Better Performance on AWS While Saving 60...
 
GPSBUS221_Breaking Barriers Move Enterprise SAP Customers to SAP HANA on AWS ...
GPSBUS221_Breaking Barriers Move Enterprise SAP Customers to SAP HANA on AWS ...GPSBUS221_Breaking Barriers Move Enterprise SAP Customers to SAP HANA on AWS ...
GPSBUS221_Breaking Barriers Move Enterprise SAP Customers to SAP HANA on AWS ...
 
SAP on AWS: Big Businesses, Big Workloads, Big Time - ENT202 - Chicago AWS Su...
SAP on AWS: Big Businesses, Big Workloads, Big Time - ENT202 - Chicago AWS Su...SAP on AWS: Big Businesses, Big Workloads, Big Time - ENT202 - Chicago AWS Su...
SAP on AWS: Big Businesses, Big Workloads, Big Time - ENT202 - Chicago AWS Su...
 
Navigating Your Data Landscape With Siddharth Desai and Elena Cuevas | Curren...
Navigating Your Data Landscape With Siddharth Desai and Elena Cuevas | Curren...Navigating Your Data Landscape With Siddharth Desai and Elena Cuevas | Curren...
Navigating Your Data Landscape With Siddharth Desai and Elena Cuevas | Curren...
 
SAP on AWS: Big Businesses, Big Workloads, Big Time - ENT202 - Chicago AWS Su...
SAP on AWS: Big Businesses, Big Workloads, Big Time - ENT202 - Chicago AWS Su...SAP on AWS: Big Businesses, Big Workloads, Big Time - ENT202 - Chicago AWS Su...
SAP on AWS: Big Businesses, Big Workloads, Big Time - ENT202 - Chicago AWS Su...
 
Cloud Integration Services on SAP HANA Cloud Platform
Cloud Integration Services on SAP HANA Cloud PlatformCloud Integration Services on SAP HANA Cloud Platform
Cloud Integration Services on SAP HANA Cloud Platform
 
MDS ap_OEM Product Portfolio Intorduction to the DT & Analytics
MDS ap_OEM Product Portfolio Intorduction to the DT & AnalyticsMDS ap_OEM Product Portfolio Intorduction to the DT & Analytics
MDS ap_OEM Product Portfolio Intorduction to the DT & Analytics
 
Bringing the Power of Big Data Computation to Salesforce
Bringing the Power of Big Data Computation to SalesforceBringing the Power of Big Data Computation to Salesforce
Bringing the Power of Big Data Computation to Salesforce
 
Apache Spark – The New Enterprise Backbone for ETL, Batch Processing and Real...
Apache Spark – The New Enterprise Backbone for ETL, Batch Processing and Real...Apache Spark – The New Enterprise Backbone for ETL, Batch Processing and Real...
Apache Spark – The New Enterprise Backbone for ETL, Batch Processing and Real...
 
Building Custom Advanced Analytics Applications with SAP HANA
Building Custom Advanced Analytics Applications with SAP HANABuilding Custom Advanced Analytics Applications with SAP HANA
Building Custom Advanced Analytics Applications with SAP HANA
 
Development to Deployment with SAP HANA
Development to Deployment with SAP HANADevelopment to Deployment with SAP HANA
Development to Deployment with SAP HANA
 
Streaming Data and Stream Processing with Apache Kafka
Streaming Data and Stream Processing with Apache KafkaStreaming Data and Stream Processing with Apache Kafka
Streaming Data and Stream Processing with Apache Kafka
 
Accelerate Self-Service Analytics with Virtualization and Visualisation (Thai)
Accelerate Self-Service Analytics with Virtualization and Visualisation (Thai)Accelerate Self-Service Analytics with Virtualization and Visualisation (Thai)
Accelerate Self-Service Analytics with Virtualization and Visualisation (Thai)
 
Bring your SAP and Enterprise Data to Hadoop, Apache Kafka and the Cloud
Bring your SAP and Enterprise Data to Hadoop, Apache Kafka and the CloudBring your SAP and Enterprise Data to Hadoop, Apache Kafka and the Cloud
Bring your SAP and Enterprise Data to Hadoop, Apache Kafka and the Cloud
 
Business intelligence in the era of big data
Business intelligence in the era of big dataBusiness intelligence in the era of big data
Business intelligence in the era of big data
 

Recently uploaded

AI+A11Y 11MAY2024 HYDERBAD GAAD 2024 - HelloA11Y (11 May 2024)
AI+A11Y 11MAY2024 HYDERBAD GAAD 2024 - HelloA11Y (11 May 2024)AI+A11Y 11MAY2024 HYDERBAD GAAD 2024 - HelloA11Y (11 May 2024)
AI+A11Y 11MAY2024 HYDERBAD GAAD 2024 - HelloA11Y (11 May 2024)Samir Dash
 
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...Orbitshub
 
Artificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyArtificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyKhushali Kathiriya
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAndrey Devyatkin
 
Exploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with MilvusExploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with MilvusZilliz
 
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...apidays
 
Corporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptxCorporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptxRustici Software
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherRemote DBA Services
 
Introduction to Multilingual Retrieval Augmented Generation (RAG)
Introduction to Multilingual Retrieval Augmented Generation (RAG)Introduction to Multilingual Retrieval Augmented Generation (RAG)
Introduction to Multilingual Retrieval Augmented Generation (RAG)Zilliz
 
Platformless Horizons for Digital Adaptability
Platformless Horizons for Digital AdaptabilityPlatformless Horizons for Digital Adaptability
Platformless Horizons for Digital AdaptabilityWSO2
 
DEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 AmsterdamDEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 AmsterdamUiPathCommunity
 
[BuildWithAI] Introduction to Gemini.pdf
[BuildWithAI] Introduction to Gemini.pdf[BuildWithAI] Introduction to Gemini.pdf
[BuildWithAI] Introduction to Gemini.pdfSandro Moreira
 
DBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDropbox
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FMESafe Software
 
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...apidays
 
JohnPollard-hybrid-app-RailsConf2024.pptx
JohnPollard-hybrid-app-RailsConf2024.pptxJohnPollard-hybrid-app-RailsConf2024.pptx
JohnPollard-hybrid-app-RailsConf2024.pptxJohnPollard37
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century educationjfdjdjcjdnsjd
 
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Victor Rentea
 
Introduction to use of FHIR Documents in ABDM
Introduction to use of FHIR Documents in ABDMIntroduction to use of FHIR Documents in ABDM
Introduction to use of FHIR Documents in ABDMKumar Satyam
 

Recently uploaded (20)

AI+A11Y 11MAY2024 HYDERBAD GAAD 2024 - HelloA11Y (11 May 2024)
AI+A11Y 11MAY2024 HYDERBAD GAAD 2024 - HelloA11Y (11 May 2024)AI+A11Y 11MAY2024 HYDERBAD GAAD 2024 - HelloA11Y (11 May 2024)
AI+A11Y 11MAY2024 HYDERBAD GAAD 2024 - HelloA11Y (11 May 2024)
 
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
 
Artificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyArtificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : Uncertainty
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of Terraform
 
Exploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with MilvusExploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with Milvus
 
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
 
Corporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptxCorporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptx
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a Fresher
 
Introduction to Multilingual Retrieval Augmented Generation (RAG)
Introduction to Multilingual Retrieval Augmented Generation (RAG)Introduction to Multilingual Retrieval Augmented Generation (RAG)
Introduction to Multilingual Retrieval Augmented Generation (RAG)
 
Platformless Horizons for Digital Adaptability
Platformless Horizons for Digital AdaptabilityPlatformless Horizons for Digital Adaptability
Platformless Horizons for Digital Adaptability
 
DEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 AmsterdamDEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
 
[BuildWithAI] Introduction to Gemini.pdf
[BuildWithAI] Introduction to Gemini.pdf[BuildWithAI] Introduction to Gemini.pdf
[BuildWithAI] Introduction to Gemini.pdf
 
DBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor Presentation
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
 
JohnPollard-hybrid-app-RailsConf2024.pptx
JohnPollard-hybrid-app-RailsConf2024.pptxJohnPollard-hybrid-app-RailsConf2024.pptx
JohnPollard-hybrid-app-RailsConf2024.pptx
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
 
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
 
Introduction to use of FHIR Documents in ABDM
Introduction to use of FHIR Documents in ABDMIntroduction to use of FHIR Documents in ABDM
Introduction to use of FHIR Documents in ABDM
 

Real-time Data Pipelines with SAP and Apache Kafka

  • 1. Real-time Data Pipelines with SAP and Apache Kafka
  • 2. © 2017 Attunity, Accenture Analytics and Confluent Amiya Chand Sr. Manager, Accenture Digital | SAP Data & Analytics Lead, Accenture David Tucker Director Partner Engineering, Confluent Jordan Martz Director of Technology Solutions, Attunity Guest Speakers
  • 3. © 2017 Attunity, Accenture Analytics and Confluent • Accenture Analytics - Business in the moment and imperatives • Confluent - Apache Kafka • Attunity - Migrating Data to a Apache Kafka • All - Q&A Agenda
  • 4. • Real-time Data Pipelines with SAP and Apache Kafka
  • 5. © 2017 Attunity, Accenture Analytics and Confluent News from the frontline Significant investments have been made in traditional data integration and platforms which have limitations in storage, data latency and performance. These complex systems inhibit ‘speed to value’ and ‘speed to deploy’ The digitization of everything has the potential to provide organizations with “data about everything”. Organizations that are able to effectively harness the power of this data to drive insights actions (at speed) will have a distinct advantage over their competition. In order to effectively address these challenges and to capitalize on “new” data sources to gain a competitive advantage, organizations will have to ingest data at the speed of business. To unlock value within their data, organizations must start treating data as a supply chain, enabling it to flow more easily through the organization.
  • 6. © 2017 Attunity, Accenture Analytics and Confluent Market trends for data & analytics Complex data-driven environment with significant opportunities to create business value DEMOCRATIZATION OF DATA AND DATA DISCOVERY NEW DATA SOURCES CHANGING SKILLS REQUIREMENTS FOCUS ON ADVANCED ANALYTICS BIG DATA AND HYBRID ARCHITECTURES Self-service analytics, multi- channel delivery Addition of social media, Internet of Things, unstructured content, etc. Evolving patterns, data-driven point-of- interaction decisions shifting to the front line New forms, more complex architectures on Premise or Cloud Pictorial story- telling, customers going digital
  • 7. © 2017 Attunity, Accenture Analytics and Confluent The changing landscape in SAP data & analytics SAP systems are evolving from a ‘system of record’ to ‘system of innovation’ System of Record System of Insights System of Innovation SpeedandAgility Level of Integration between People, Process and Platform Pre 2011 Traditional SAP ERP and BI Reporting platform on-premise 2013-15 Evolution of SAP HANA as a high performance data base and Analytics Engine on-premise / Cloud SAP BW Any Database Enterprise BI Any Database Any Database Enterprise BI Data Visualiza -tion Advanced Analytics SAP ERP Any Database BW / Native HANA HANA SAP ERP Any Database BW / Native HANA S4 Innovation Suite on HANA Enterprise BI Data Visualiza- tion Advanced Analytics BOBJ Cloud HANA Hadoop/ Any database 2015+ Maturity of SAP HANA as the Digital Platform on-premise / Cloud / As-a-Service Faster insights to action 1 Bridge the digital divide 2 Accelerate data integration 3 BusinessImperative
  • 8. © 2017 Attunity, Accenture Analytics and Confluent Enabling the insight-driven enterprise DataIngestion DataAccess Point Analytics & Insights SourceSystems Transactional systems Business Applications Unstructured Sources Real time Batch DataMarketplace Data Warehouse/ Data Mart Data lake Operational Reporting & Self Service Data Science lab Cross Pollination Data Exploration Guided Analytics & Advanced Analytics Analytics Engine A POV and Reference Architecture for enabling the ‘insight driven enterprise’ Load to any target Enable real- time analytics Ingest/replicate data on demand Speed to Data Ingest any data at the frequency needed by business Speed to Insight Access to any data, anywhere and anytime Speed to Value Integrated platform that is scalable and flexible Data Ingestion An abstract layer that allows various data elements – type & latency to flow into the storage environment Data Marketplace Includes the components that allow to store & transform raw data into information Data Access Semantic layer that allows access to the data for end consumption – interactive, blending, low latency
  • 10. © 2017 Attunity, Accenture Analytics and Confluent How organizations handle data flow: a giant mess Data Warehouse Hadoop NoSQL Oracle SFDC Logging Bloomberg …any sink/source Web Custom Apps Microservices Monitoring Analytics …and more OLTP ActiveMQ App App Caches OLTP OLTPAppAppApp
  • 11. © 2017 Attunity, Accenture Analytics and Confluent Apache Kafka™: a distributed streaming platform Apache Kafka Offline Batch (+1 Hour)Near-Real Time (>100s ms)Real Time (0-100 ms) Data Warehouse Hadoop NoSQL Oracle SFDC Twitter Bloomberg …any sink/source …any sink/source …and more Web Custom Apps Microservices Monitoring Analytics
  • 12. © 2017 Attunity, Accenture Analytics and Confluent Over 35% of Fortune 500’s are using Apache Kafka™ 6 of top 10 Travel 7 of top 10 Global banks 8 of top 10 Insurance 9 of top 10 Telecom
  • 13. © 2017 Attunity, Accenture Analytics and Confluent Confluent: a streaming platform based on Apache Kafka™ Database Changes Log Events loT Data Web Events … CRM Data Warehouse Database Hadoop Data Integration … Monitoring Analytics Custom Apps Transformations Real-time Applications … Apache Open Source Confluent Open Source Confluent Commercial Confluent Platform Confluent Platform Apache Kafka™ Core | Connect| Streams Data Compatibility Schema Registry Monitoring & Administration Confluent Control Center Operations Replicator | Auto Data Balancing Development and Connectivity Clients | Connectors | REST Proxy
  • 14. © 2017 Attunity, Accenture Analytics and Confluent Kafka adoption spans many industries Financial Services Enterprise Tech Consumer Tech Entertainment & Media Telecom Retail Travel & Leisure
  • 15. © 2017 Attunity, Accenture Analytics and Confluent Apache KafkaTM Connect API – streaming data capture JDBC Mongo MySQL Elastic Cassandra HDFS Kafka Connect API Kafka Pipeline Connector Connector Connector Connector Connector Connector Sources Sinks Fault tolerant Manage hundreds of data sources and sinks Preserves data schema Part of Apache Kafka project Integrated within Confluent Platform’s Control Center
  • 16. © 2017 Attunity, Accenture Analytics and Confluent Kafka Connect API -part of the Apache Kafka Project Connect any source to any target system Integrated • 100% compatible with Kafka v0.9 and higher • Integrated with Confluent’s Schema Registry • Easy to manage with Confluent Control Center Flexible • 40+ open source connectors available • Easy to develop additional connectors • Flexible support for data types and formats Compatible • Maintains critical metadata • Preserves schema information • Supports schema evolution Reliable • Automated failover • At-least-once guaranteed • Balances workload between nodes
  • 17. © 2017 Attunity, Accenture Analytics and Confluent Architecture of Kafka Streams API, a Part of Apache Kafka Kafka Streams API Producer Kafka Cluster Topic TopicTopic Consumer Consumer Key benefits • No additional cluster • Easy to run as a service • Supports large aggregations and joins • Security & permissions fully integrated from Kafka Example Use Cases • Microservices • Continuous queries • Continuous transformations • Event-triggered processes
  • 18. © 2017 Attunity, Accenture Analytics and Confluent Kafka Streams API: easiest way to process data in Apache Kafka Example Use Cases • Microservices • Large-scale continuous queries and transformations • Event-triggered processes • Reactive applications • Customer 360-degree view, fraud detection, location-based marketing, smart electrical grids, fleet management, … Key Benefits of Apache Kafka’s Streams API • Build Apps, Not Clusters: no additional cluster required • Elastic, highly-performant, distributed, fault-tolerant, secure • Equally viable for small, medium, and large-scale use cases • “Run Everywhere”: integrates with your existing deployment strategies such as containers, automation, cloud Your App Kafka Streams API
  • 19. © 2017 Attunity, Accenture Analytics and Confluent Confluent Control Center • Monitor all pipelines end- to-end • Manage and configure Kafka connectors • Administer Kafka cluster and monitor overall operational health
  • 20. © 2017 Attunity, Accenture Analytics and Confluent Why Confluent? More than just enterprise software Confluent Platform Professional Services Enterprise Support Complete support across the entire adoption lifecycle Kafka Training
  • 21. SAP Data Integration for Apache Kafka
  • 22. © 2017 Attunity, Accenture Analytics and Confluent Attunity – Corporate Overview Data Integration & Big Data Management Software Accelerate data delivery and availability Automate data readiness for analytics Optimize data management with intelligence Hadoop & Big Data Databases & Data Warehouses On premise & in the Cloud Solutions for: Global OfficesOverview 2000 customers in 65 countries 250 people and growing NASDAQ traded (ATTU)
  • 23. © 2017 Attunity, Accenture Analytics and Confluent Load to any target Enable real- time analytics Ingest/replicate data on demand Traditional Data Integration Requirements
  • 24. © 2017 Attunity, Accenture Analytics and Confluent UniversalReal-TimeSimpler Attunity Replicate Universal Platform for Data Replication/Ingestion #1 independent provider of streaming CDC
  • 25. © 2017 Attunity, Accenture Analytics and Confluent Ingest Data Real-time and at Scale into Hadoop • Stream data with CDC from any major source into HDFS or Hive  Data then available to be used in Hbase or by Spark • Automatically generate schema in HCatalog • Leverage same UI and process as for other platforms • Integrate with native APIs  No external adapters or custom code requiredHadoop Data Lake
  • 26. © 2017 Attunity, Accenture Analytics and Confluent Attunity Replicate Architecture Attunity Replicate Batch CDC Incremental Batch Cloud On- prem Cloud On- premPersistent Store Files RDBMS Mainframe EDW
  • 27. © 2017 Attunity, Accenture Analytics and Confluent Attunity Replicate Server TransformFilter Batch CDC Incremental In-Memory File Channel Batch Attunity Replicate for SAP Persistent Store Extract relationships for Pool and Cluster Tables RDBMS (Oracle, DB2, etc.) Redo/ Archive logs or Journal File ---------------- Transparent Tables On Premises Hadoop RDBMS Data WarehouseKafka Cloud Attunity Replicate Agent for SAP SAP ECC (Enterprise Central Component) Data Model Mapping Pool/Cluster table RFC
  • 28. © 2017 Attunity, Accenture Analytics and Confluent Easier Integration with Attunity Replicate Go Agile with Automated Processes • No manual coding or scripting • Automated end-to-end • Optimized and configurable • Target schema creation • Heterogeneous data type mapping • Batch to CDC transition • DDL change propagation • Filtering • Transformations Hadoop File s RDBMS Mainframe Hadoop Files RDBMS Kafka EDW EDW
  • 29. © 2017 Attunity, Accenture Analytics and Confluent Easier Integration with Attunity Replicate Zero Footprint Architecture • CDC identifies source updates by scanning change logs • No software agents required on sources or targets • Minimizes administrative overhead • CDC based on change logs • No agents on source or target Low Infrastructure Impact Hadoop File s RDBMS Mainframe Hadoop Files RDBMS Kafka EDW EDW
  • 30. © 2017 Attunity, Accenture Analytics and Confluent Universal Integration with Attunity Replicate Rapidly Move Data Across Complex Hybrid Environments • Source  Log-based CDC  Batch APIs • WAN  Encrypted multi-pathing • Target  Transactional/batch CDC options  Native loader integration TargetsSources On Premises Cloud Platform HadoopRDBMS Data Warehouse Hadoop RDBMS Data Warehouse Optimized for…
  • 31. © 2017 Attunity, Accenture Analytics and Confluent Demo