Monitoring and Troubleshooting a Real Time Pipeline

Alan Ngai, CTO/Co-Founder, OpsClarity

OpsClarity is a performance monitoring solution for stream processing applications. In addition to deep component monitoring, it uses data science to proactively identify anomalies across the entire data pipeline, and it correlates issues across the data and app tiers to surface common concerns that affect the business. OpsClarity automatically discovers the entire app and data topology, then uses the metadata and network dependency context captured through that topology to provide analysis and fast, correlated troubleshooting. This talk will additionally cover integration with Apache Apex.

Monitoring and Troubleshooting a Real Time Pipeline
Alan Ngai, CTO/Co-Founder, OpsClarity

Businesses are Turning to Data-First Applications
Ad Network – Real-time bidding
DDoS Attack Prevention
Fraud Detection
Internet of Things
Financial Services
Real-time Personalization

Data-First Application: Many Moving Parts!
[Diagram labels]
• Data pipeline: Data Source → Message Broker → Stream Processor → Data Sink → Applications
• Elastic infrastructure
• Business logic as microservices code

Characteristics of Data Pipelines
• Heterogeneous Components
• Extremely Complex
[Figure: Storm component hierarchy. Labels: Storm Master Host; Storm Worker Host; Supervisor Process; Topology; Executor; Spout and Bolts, each with a Task; Metric]

Characteristics of Data Pipelines
• Heterogeneous Components
• Highly Interdependent
• Highly Complex
• Painful to Monitor and Debug

Put Data In One Place (don’t rely on separate per-tool consoles like these)
• Kafka Web Console
• Spark UI
• Marvel (Elasticsearch)
• Ambari (Hadoop)
• Ganglia
• Nagios
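
As an illustration of what "one place" can mean, here is a minimal in-memory sketch in Python. It assumes that samples from each tool have already been collected by some means. The `Sample` and `MetricStore` names, the field layout, and the source labels are all hypothetical and do not come from the talk or from any of the tools listed.

```python
from collections import defaultdict
from dataclasses import dataclass


@dataclass(frozen=True)
class Sample:
    """One measurement, normalized to a common shape (hypothetical schema)."""
    source: str       # tool or component the value came from, e.g. "kafka"
    metric: str       # shared metric name, e.g. "throughput"
    timestamp: float  # seconds since the epoch
    value: float


class MetricStore:
    """Keeps samples from every source together so they can be compared."""

    def __init__(self):
        self._by_metric = defaultdict(list)

    def add(self, sample):
        self._by_metric[sample.metric].append(sample)

    def window(self, metric, start, end):
        """Samples of one metric with start <= timestamp < end, ordered by time."""
        hits = [s for s in self._by_metric.get(metric, [])
                if start <= s.timestamp < end]
        return sorted(hits, key=lambda s: s.timestamp)


if __name__ == "__main__":
    store = MetricStore()
    store.add(Sample("kafka", "throughput", 10.0, 1200.0))
    store.add(Sample("storm", "throughput", 5.0, 1150.0))
    for s in store.window("throughput", 0.0, 20.0):
        print(s.source, s.timestamp, s.value)
```

Keying on a shared metric name rather than on the source is the point of the sketch: a single query returns the same concern across every component.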

Organize Your Concerns Horizontally
• Throughput: stuff per unit of time
• Latency: how long it takes to process stuff
• Error Rate: how frequently bad stuff happens
• Buffered: how much stuff is piled up
• Data Loss: how much stuff is being lost
• Duplication: how much stuff is being duplicated
Matters for all stages in a pipeline!
Matters for all business use cases too!
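
The six horizontal concerns can be computed the same way for any stage. The Python sketch below is one possible formulation, not the speaker's. It assumes a stage that emits at most one record per input record and that exposes the counters shown; the `StageWindow` schema and the `horizontal_metrics` function are hypothetical names introduced for illustration.

```python
from dataclasses import dataclass


@dataclass
class StageWindow:
    """Counters for one pipeline stage over one time window (hypothetical schema)."""
    seconds: float       # length of the window
    received: int        # records that entered the stage
    emitted: int         # records the stage produced, duplicates included
    failed: int          # records that ended in an error
    queued: int          # records still waiting when the window closed
    duplicates: int      # emitted records that repeat an earlier one
    busy_seconds: float  # per-record processing time, summed over emitted records


def horizontal_metrics(w: StageWindow) -> dict:
    """Derive the six horizontal concerns from raw counters."""
    unique_emitted = w.emitted - w.duplicates
    # Assumption: whatever was received but not emitted, failed, or queued is lost.
    lost = max(w.received - unique_emitted - w.failed - w.queued, 0)
    return {
        "throughput_per_s": w.emitted / w.seconds,
        "latency_s": w.busy_seconds / w.emitted if w.emitted else 0.0,
        "error_rate": w.failed / w.received if w.received else 0.0,
        "buffered": w.queued,
        "lost": lost,
        "duplicated": w.duplicates,
    }


if __name__ == "__main__":
    window = StageWindow(seconds=10.0, received=1000, emitted=950, failed=20,
                         queued=25, duplicates=5, busy_seconds=19.0)
    for name, value in horizontal_metrics(window).items():
        print(name, value)
```

Because the function only needs counters, the same call could be applied to a broker, a stream processor, or a sink, which is what makes the comparison horizontal.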

…And Also Vertically
[Figure: Storm component hierarchy. Labels: Storm Master Host; Storm Worker Host; Supervisor Process; Topology; Executor; Spout and Bolts, each with a Task; Metric]

…And Also Vertically
• Data Health: throughput, latency, errors?
• Dependency Health: Are Kafka and Zookeeper healthy?
• Service Health: Is the Storm Master healthy? Are there adequate resources in the cluster?
• Application: Are my application KPIs within normal range?
• Job/Topology Health: Is my job well distributed in the cluster? Are job counters normal?
• Node Service Health: Are all jobs running on this node normal?
• Node System Health: Are key system metrics (CPU, memory, network, disk I/O) normal?
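
The layers on this slide lend themselves to an ordered checklist. Below is a minimal Python sketch, assuming each layer is backed by a caller-supplied check function that returns `True` when the layer is healthy. The `unhealthy_layers` helper is hypothetical, and treating "Application" as its own layer is a reading of the slide rather than something it states.

```python
# Layer names as they appear on the slide, top to bottom.
LAYERS = [
    "Data Health",
    "Dependency Health",
    "Service Health",
    "Application",
    "Job/Topology Health",
    "Node Service Health",
    "Node System Health",
]


def unhealthy_layers(checks):
    """Return the layers whose check reports a problem, in slide order.

    `checks` maps a layer name to a zero-argument callable that returns
    True when the layer is healthy. Layers without a check are skipped.
    """
    failing = []
    for layer in LAYERS:
        check = checks.get(layer)
        if check is not None and not check():
            failing.append(layer)
    return failing


if __name__ == "__main__":
    # Hypothetical checks; real ones would query the monitoring data.
    checks = {
        "Data Health": lambda: False,        # e.g. latency above its normal range
        "Dependency Health": lambda: True,   # e.g. Kafka and Zookeeper respond
        "Node System Health": lambda: False, # e.g. disk I/O saturated on a node
    }
    print(unhealthy_layers(checks))
```

Reporting the failing layers in a fixed order makes it easier to see whether a data-level symptom coincides with a problem further down the stack.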

What We Talked About
• Data-First Applications Are Becoming a Thing
• Monitoring Data-First Applications is Hard!
• Get Your Metrics In One Place
• Organize Your Data Horizontally and Vertically

Questions?
Alan Ngai
alan@opsclarity.com


Amazon Web Services

Many organizations struggle daily with the question - "Where do we stand with our AWS security practices?" With the recent release of the Center for Internet Security's CIS AWS Foundations Benchmark, organizations now have an industry-accepted set of security configuration best practices. These benchmarks, in combination with 3rd party security solutions that support them, can form the foundation for security operations at organizations of all sizes through continuous monitoring and auditing.

Developing a Continuous Automated Approach to Cloud Security

Amazon Web Services

MicroServices-Part-1.pdf

chanhluc2112

The blessing and curse of today's database market? So many choices! While relational databases still dominate the day-to-day business, a host of alternatives has evolved around very specific use cases: graph, document, NoSQL, hybrid (HTAP), column store, the list goes on. And the database tools market is teeming with activity as well. Register for this special Research Webcast to hear Dr. Robin Bloor share his early findings about the evolving database market. He'll be joined by Steve Sarsfield of HPE Vertica, and Robert Reeves of Datical in a roundtable discussion with Bloor Group CEO Eric Kavanagh. Send any questions to info@insideanalysis.com, or tweet with #DBSurvival.

Horses for Courses: Database Roundtable

Eric Kavanagh

Presented by Lee Atchison at the Amazon Web Services Summit in San Francisco on April 18, 2017. Keeping an application running at scale can be a daunting task. When do you need to add more capacity? Larger databases? Additional servers? These questions get harder as the complexity of your application grows. Microservice based architectures and cloud-based dynamic infrastructures are technologies that help you keep your application running with high availability, even during times of extreme scaling. We will discuss some of the best practices we’ve learned working with New Relic customers on how you can manage your applications running at scale, and how technologies such as microservices and dynamic infrastructure can help you with this challenge.

Microservices? Dynamic Infrastructure? - Adventures in Keeping Your Applicati...

New Relic

Monitoring and Troubleshooting a Real Time Pipeline

1. Monitoring and Troubleshooting a Real Time Pipeline
Alan Ngai, CTO/Co-Founder, OpsClarity

2. Businesses are Turning to Data-First Applications
• Ad Network – Real-time bidding
• DDoS Attack Prevention
• Fraud Detection
• Internet of Things
• Financial Services
• Real-time Personalization

3. Data-First Application: Many Moving Parts!
[Diagram labels: Data Source, Message Broker, Stream Processor, Data Sink, Applications, Data Pipeline, Elastic Infrastructure, Business Logic as Microservices Code]

4. OpsClarity Runs on Data Pipelines

5. Characteristics of Data Pipelines
• Heterogeneous Components

6. Characteristics of Data Pipelines
• Heterogeneous Components
• Extremely Complex
[Diagram labels: Storm Master Host, Storm Worker Host, Supervisor Process, Topology, Executor, Spout Task, Bolt Task (x3), Metric, Storm]

7. Characteristics of Data Pipelines
• Heterogeneous Components
• Highly Complex
• Highly Interdependent

8. Characteristics of Data Pipelines
• Heterogeneous Components
• Highly Interdependent
• Highly Complex
• Painful to Monitor and Debug

9. Put Data In One Place (don’t rely on this)
• Kafka Web Console
• Spark UI
• Marvel (Elasticsearch)
• Ambari (Hadoop)
• Ganglia
• Nagios

10. Organize Your Concerns Horizontally
• Throughput: stuff per unit of time
• Latency: how long it takes to process stuff
• Error Rate: how frequently bad stuff happens
• Buffered: how much stuff is piled up
• Data Loss: how much stuff is being lost
• Duplication: how much stuff is being duplicated
Matters for all stages in a pipeline!
Matters for all business use cases too!
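The six horizontal concerns can be derived from a small set of per-stage counters. The following is a minimal sketch, not something shown in the talk: the `StageCounters` schema, its field names, and the way data loss is estimated (records that are neither emitted, failed, nor still buffered) are all assumptions made for illustration, and the loss estimate assumes the buffer was empty at the start of the window.

```python
from dataclasses import dataclass


@dataclass
class StageCounters:
    """Counters sampled from one pipeline stage over one window (hypothetical schema)."""
    name: str
    window_seconds: float
    records_in: int                  # records received by the stage
    records_out: int                 # records emitted, including duplicates
    records_failed: int              # records that raised an error
    duplicates_out: int              # emitted records that were repeats
    total_processing_seconds: float  # summed processing time of all attempts
    queue_depth: int                 # records still buffered at window end


def horizontal_metrics(c: StageCounters) -> dict:
    """Derive the six horizontal concerns from raw counters."""
    attempted = c.records_out + c.records_failed
    unique_out = c.records_out - c.duplicates_out
    # Assumption: anything not emitted, failed, or still buffered was lost.
    unaccounted = max(c.records_in - unique_out - c.records_failed - c.queue_depth, 0)
    return {
        "throughput_per_sec": c.records_out / c.window_seconds,
        "avg_latency_sec": c.total_processing_seconds / attempted if attempted else 0.0,
        "error_rate": c.records_failed / attempted if attempted else 0.0,
        "buffered": c.queue_depth,
        "data_loss": unaccounted,
        "duplication_rate": c.duplicates_out / c.records_out if c.records_out else 0.0,
    }


# Made-up numbers for a single stage over a 60-second window.
sample = StageCounters("stream-processor", 60.0, 6000, 5700, 100, 60, 290.0, 150)
print(horizontal_metrics(sample))
```

Because every stage reports the same six keys, the same dashboard row or alert rule can be reused for the broker, the stream processor, and the sink.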

11. Organize Your Concerns Horizontally
• Throughput
• Latency
• Error Rate
• Buffered
• Data Loss
• Duplication

12. …And Also Vertically
[Diagram labels: Storm Master Host, Storm Worker Host, Supervisor Process, Topology, Executor, Spout Task, Bolt Task (x3), Metric, Storm]

13. …And Also Vertically
• Data Health: throughput, latency, errors?
• Dependency Health: Are Kafka and Zookeeper healthy?
• Service Health: Is the Storm Master healthy? Are there adequate resources in the cluster?
• Application: Are my application KPIs within normal range?
• Job/Topology Health: Is my job well distributed in the cluster? Are job counters normal?
• Node Service Health: Are all jobs running on this node normal?
• Node System Health: Are key system metrics (CPU, memory, network, disk I/O) normal?
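One way to work with these tiers is to record a status per tier and roll them up into an overall status plus an ordered list of the tiers that need attention. The sketch below is only an illustration under assumptions: the three-level `Health` scale, the "worst tier wins" roll-up rule, and the `summarize` function are hypothetical and are not taken from the talk or from any OpsClarity API.

```python
from enum import IntEnum
from typing import Dict, List, Tuple


class Health(IntEnum):
    """Hypothetical three-level health scale; higher is worse."""
    OK = 0
    WARNING = 1
    CRITICAL = 2


# Tier names from the slide, ordered from the data down to the node.
TIERS = [
    "Data Health",
    "Dependency Health",
    "Service Health",
    "Application",
    "Job/Topology Health",
    "Node Service Health",
    "Node System Health",
]


def summarize(status: Dict[str, Health]) -> Tuple[Health, List[str]]:
    """Return the worst status across tiers and the unhealthy tiers in top-down order.

    Tiers missing from `status` are treated as OK (an assumption).
    """
    overall = max(status.get(tier, Health.OK) for tier in TIERS)
    unhealthy = [tier for tier in TIERS if status.get(tier, Health.OK) != Health.OK]
    return overall, unhealthy


# Example: data looks off and one node is short on resources.
overall, unhealthy = summarize({
    "Data Health": Health.WARNING,
    "Node System Health": Health.CRITICAL,
})
print(overall.name, unhealthy)
```

Keeping the tier order fixed means the unhealthy list reads as a path from the symptom (data) down toward a possible cause (node), which matches how the slide asks its questions.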

14. DEMO

15. What We Talked About
• Data-First Applications Are Becoming a Thing
• Monitoring Data-First Applications is Hard!
• Get Your Metrics In One Place
• Organize Your Data Horizontally and Vertically

16. Questions?
Alan Ngai
alan@opsclarity.com

Editor's Notes

Intro: CTO/Co-founder of OpsClarity. At OpsClarity, we've built an intelligent monitoring platform for stream processing applications. Today we'll be talking about lessons we've learned monitoring and troubleshooting real-time pipelines.
Talk time: 1:30. First things first: let's talk about data-first applications, meaning any application architecture whose primary purpose is to extract value out of data. These are business domains where data is key to solving the associated problems. What does an application that solves problems in this space look like?
Let's break down the pieces. Key part: the real-time data pipeline. What makes this even more difficult to monitor is that you often have multiple applications running on top of the same pipeline infrastructure, for example a user interaction report, a price optimizer, and fraud detection, and their input data often overlaps. Elastic infra. Microservices: account management, other web applications. That + business services + infrastructure = data-first app.
Talk time: 2:00 Total: 5:00. Topology Discovery: discover and build the service cluster topology in real time. Anomaly Detection: automatically baseline metrics and discover anomalies in them. Health Aggregation: aggregate health from port checks and HTTP checks, together with metric anomaly detection (AD), into host health and then service cluster health.
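The anomaly detection and health aggregation steps in this note can be sketched as follows. This is a minimal illustration under stated assumptions, not OpsClarity's actual method: the baseline is a plain mean and standard deviation over recent samples, the 3-sigma threshold is arbitrary, and the function names and the "degraded" cutoff of half the hosts are hypothetical.

```python
from statistics import mean, stdev
from typing import Iterable, List


def is_anomalous(history: List[float], value: float, threshold: float = 3.0) -> bool:
    """Flag `value` if it lies more than `threshold` standard deviations from the baseline mean."""
    if len(history) < 2:
        return False  # not enough samples to form a baseline
    mu = mean(history)
    sigma = stdev(history)
    if sigma == 0:
        return value != mu
    return abs(value - mu) / sigma > threshold


def host_healthy(port_ok: bool, http_ok: bool, metric_anomalies: Iterable[bool]) -> bool:
    """A host is healthy only if both checks pass and no metric is anomalous."""
    return port_ok and http_ok and not any(metric_anomalies)


def cluster_health(host_states: List[bool], min_healthy_fraction: float = 0.5) -> str:
    """Roll host health up into a service cluster status (hypothetical labels)."""
    if not host_states:
        return "unknown"
    healthy = sum(host_states)
    if healthy == len(host_states):
        return "healthy"
    if healthy / len(host_states) >= min_healthy_fraction:
        return "degraded"
    return "unhealthy"


# Made-up throughput samples for one host, followed by a sudden spike.
baseline = [100.0, 102.0, 98.0, 101.0, 99.0]
spike = is_anomalous(baseline, 150.0)
hosts = [
    host_healthy(True, True, [spike]),
    host_healthy(True, True, [False]),
    host_healthy(True, True, [False]),
]
print(cluster_health(hosts))  # prints "degraded"
```

A production baseline would typically account for seasonality and trend; the fixed-window mean here is only meant to show where the anomaly signal enters the host and cluster roll-up.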
Talk time: 0:30 Total: 5:30. Any data pipeline that you build today will include a number of these components, as well as some that are not included here. **audience interaction**: Whose companies are using some of these technologies? Ex.: 0MQ, Flume, Flink, DataTorrent, Heron, Solr, etc.
Talk time: 1:30 Total: 7:00. Scenario: device data -> API gateway -> message broker -> data analysis -> data store <- node <- analytics reports. **audience interaction**: What if you notice some wonky data in the reports you're seeing? Where can the problem be? Debugging this is a time-consuming process, and chances are you have different people responsible for different parts of the system, which makes it that much harder to troubleshoot.
Talk time: 0:30 Total: 9:30
Get all your data in one place. Organize your data so that you can look at cross-component concerns. Organize your data so that you can explore it hierarchically (data, dependency, app, etc.).
Talk time: 1:30 Total: 12:00. Throughput: clicks per second, docs per second, network bytes in/out. Latency: processing time, queued time. Error rates: exception count, 500 errors. Regardless of what business problem you're solving, as long as you have a data pipeline, you care about these concerns. These concerns also matter at every stage of your pipeline.
Talk time: 1:00 Total: 13:00. Talk through throughput metrics for each component. Point out that noise from other metrics (CPU, load, memory, etc.) is filtered out. **audience interaction**: Where do you think the problem is? That's right, play! We can do the same for latency, error rate, etc.
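Comparing one horizontal metric across components, as this note describes, can be sketched as a walk down the pipeline looking for the first stage whose throughput drops sharply relative to the stage feeding it. The stage names, the numbers, and the 80% ratio below are made up for illustration, and the sketch assumes roughly one record out per record in between adjacent stages (no aggregation or fan-out).

```python
from typing import List, Optional, Tuple


def first_throughput_drop(
    stages: List[Tuple[str, float]], min_ratio: float = 0.8
) -> Optional[str]:
    """Return the first stage whose throughput is below `min_ratio` times its upstream stage.

    `stages` is an ordered list of (name, records_per_second) from source to sink.
    Returns None when no stage shows such a drop.
    """
    for (_, upstream), (name, current) in zip(stages, stages[1:]):
        if upstream > 0 and current < min_ratio * upstream:
            return name
    return None


# Hypothetical throughput readings, ordered from source to sink.
pipeline = [
    ("gateway", 1000.0),
    ("broker", 990.0),
    ("processor", 400.0),
    ("store", 395.0),
]
print(first_throughput_drop(pipeline))  # prints "processor"
```

The same comparison works for latency or error rate by swapping the metric and the comparison direction, which is the point of organizing concerns horizontally.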
Talk time: 0:30 Total: 13:30 Remember this slide? In order to make sense of the metrics from different sub-components, we need to organize them into tiers
Talk time: 1:30 Total: 15:00 Remember this slide? In order to make sense of the metrics from different sub-components, we need to organize them into tiers
