Video and slides synchronized, mp3 and slide download available at URL http://bit.ly/2A6lx09.
Dave Casper talks about how modern data science and algorithms are being applied to "fight machines with machines". Filmed at qconsf.com.
Dave Casper is CTO at Moogsoft. Previously, he served as Senior Vice President and Executive Director at several of the world's largest financial institutions, including UBS, Merrill Lynch, Bank of America, and Deutsche Bank, and he also co-chaired the Open Data Center Alliance Infrastructure Group.
Splunk’s machine learning framework mixed with Splunk’s Event Management capabilities gives operations teams the opportunity to proactively act and automate on an event before it becomes an IT outage. This session will detail and demonstrate how to predict a health score of your business service, proactively take action based on those predictions and publish to your collaborative messaging and automation solutions.
SplunkLive! London 2017 - DevOps Powered by Splunk (Splunk)
DevOps is powering the computing environments of tomorrow. When properly configured, the Splunk platform allows us to gain real-time visibility into the velocity, quality, and business impact of DevOps-driven application delivery across all roles, departments, process, and systems. Splunk can be used by DevOps practitioners to provide continuous integration/deployment and the real-time feedback to help the organisation with their operational intelligence. Join us for an exciting talk about Splunk’s current approach to DevOps, and for examples of how Splunk is being used by customers today to transform DevOps initiatives.
This session will provide an overview and demo of the features of Splunk Cloud and Splunk Enterprise, including machine learning, data analysis, power user productivity and platform management.
More Than Monitoring: How Observability Takes You From Firefighting to Fire P... (DevOps.com)
For some, observability is just a hollow rebranding of monitoring; for others, it’s monitoring on steroids. But what if we told you observability is the new way to find out why, not just if, your distributed system or application isn’t working as expected? Today, we see that traditional monitoring approaches can fall short if a system or application doesn’t adequately externalize its state.
This is truer as workloads move into the cloud and leverage ephemeral technologies, such as microservices and containers. To reach observability, IT and DevOps teams need to correlate different sources from logs, metrics, traces, events and more. This becomes even more challenging when defining the online revenue impact of a failed container—after all, this is what really matters to the business.
This webinar will cover:
The differences between observability and monitoring
Why it is a bigger challenge in a multicloud and containerized world
How observability results in less firefighting and more fire prevention
How new platforms can help gain observability (on premises and in the cloud) for containers, microservices and even SAP or mainframes
Customers migrating workloads to AWS have a variety of tools to monitor their infrastructure, generating large volumes of alarms from services such as Amazon CloudWatch, AWS Config, and other third-party tools. Without careful curation, events and tickets can multiply exponentially and overwhelm ITSM systems and the teams operating them, obscuring real problems and wasting time. Using advanced machine learning techniques, customers can reduce noise from these events and tickets and increase their service quality. In this presentation, we explore the challenges of adopting AIOps and provide examples of how AIOps can be used to reduce Mean Time To Restore and improve customer outcomes.
SplunkLive! London 2017 - Splunk Enterprise for IT Troubleshooting (Splunk)
If you’re just getting started with Splunk, this session will help you understand how to use Splunk software to turn your silos of data into insights that are actionable. In this session, we’ll dive right into a Splunk environment and show you how to use the simple Splunk search interface to quickly find the needle-in-the-haystack or multiple needles in multiple haystacks. We’ll demonstrate how to perform rapid ad hoc searches to conduct routine investigations across your entire IT infrastructure in one place, whether physical, virtual or in the cloud. We’ll show you how to then convert these searches into real-time alerts and dashboards, so you can proactively monitor for problems before they impact your end user. We’ll also demonstrate how you can use Splunk to connect the dots across heterogeneous systems in your environment for cross-tier, cross-silo visibility. Don’t forget to bring your laptop and install Splunk Enterprise before you join us.
Splunk is a powerful platform for understanding your data. This session will provide an overview of machine learning capabilities available across Splunk’s portfolio. We'll dive deeply into Splunk's Machine Learning Toolkit App, which extends Splunk Enterprise with a rich suite of advanced analytics, machine learning algorithms, and rich visualizations. It also provides customers with a guided model-building and operationalization environment. The demonstration will include the guided model-building UI for tasks such as predictive analytics, outlier detection, event clustering, and anomaly detection. We’ll also review typical use cases and real-world customers who are using the Toolkit to drive business results.
Online Memory Leak Detection in the Cloud-based Infrastructures (Anshul Jindal)
A memory leak in an application deployed on the cloud can affect the availability and reliability of the application, so identifying and resolving it quickly is highly important. However, in a production environment running on the cloud, memory leak detection is a challenge without knowledge of the application or its internal object allocation details. This paper addresses the challenge of online detection of memory leaks in cloud-based infrastructure, without any internal application knowledge, by introducing a novel machine learning-based algorithm, Precog. The algorithm uses a single metric for detection, i.e., the memory utilization of the system on which the application is deployed. Its accuracy was tested on manually labeled memory utilization data from 60 virtual machines, provided by our industry partner Huawei Munich Research Center, and the proposed algorithm was found to achieve an accuracy score of 85% with less than half a second of prediction time per virtual machine.
Enabling Airbus Digital Transformation with Splunk
Learn how Airbus are turning their data into doing across their organisation. From real time monitoring to IT Service Management to security operations – Airbus are maximising their use of data to deliver more services and continuous process improvement.
SplunkLive! London 2017 - Using Machine Learning to Feed Hungry People (Splunk)
There is often too much data to be able to understand it all by hand, and often it is difficult to see the interesting trees in the forest of data. Machine learning gives us an opportunity to get computers to do this heavy lifting, and present us with key actions for operations. Machine learning has many applications across a wide variety of fields – here we demonstrate hands on with donuts how you can use Machine Learning to see deviations from expected donut consumption and either make more donuts or send out a targeted marketing campaign to get donuts off the shelves and into happy customers.
SplunkLive! London 2017 - Happy Apps, Happy Users (Splunk)
No matter what business you’re in, your web applications are front-and-center for your customers. Downtime, or even poor performance, not only creates a spike in costs but often translates into lost customers and revenue. You need immediate insight into the availability, performance and usage of your applications and the infrastructure your applications run on. In this session, you will first learn why you need to take a platform approach to full stack application management, whether your applications reside on-premises or in the cloud. Second, we will show you how you can use Splunk to monitor the usage and performance of your applications, and quickly troubleshoot faults by stepping through some of the most common issues our customers experience. Third, we’ll contrast what Splunk does relative to other APM tools you may already have deployed, and even show you how you can bring APM data into Splunk to gain more insight into application performance.
AIOps: Anomalous Span Detection in Distributed Traces Using Deep Learning (Jorge Cardoso)
The field of AIOps, also known as Artificial Intelligence for IT Operations, uses algorithms and machine learning to dramatically improve the monitoring, operation, and maintenance of distributed systems. Its main premise is that operations can be automated using monitoring data to reduce the workload of operators (e.g., SREs or production engineers). Our current research explores how AIOps – and many related fields such as deep learning, machine learning, distributed traces, graph analysis, time-series analysis, sequence analysis, and log analysis – can be explored to effectively detect, localize, and remediate failures in large-scale cloud infrastructures (>50 regions and AZs). In particular, this lecture will describe how a particular monitoring data structure, called distributed trace, can be analyzed using deep learning to identify anomalies in its spans. This capability empowers operators to quickly identify which components of a distributed system are faulty.
On your marks, get set GO!
Take a more in-depth look at the automation and orchestration journey and the future of SOAR.
Watch the SOCtails video here: https://www.youtube.com/watch?v=YzsGQzqaDYw&t=2s
Looking into 2020 and beyond, we are certainly going to continue this trend of strategic technology investment and architecture evolution. This session aims to highlight the Splunk platform's evolutionary approach to addressing key technology trends. Additionally, many customers are adopting serverless cloud services to deliver their cloud solutions. This session will include a live demo of a new library of functions that provides a Google Cloud Platform (GCP) serverless “push” capability to send data into Splunk via the HTTP Event Collector (HEC).
The Hitchhiker's Guide to Service Intelligence (Splunk)
Providing transformational impact and insight into key business services while maintaining operational oversight is often difficult in organizations. To effectively communicate business value and alignment, organizations must find new methods to bridge the gap between business and operations. This half-day hands-on workshop demonstrates how customers can quickly gain insight into high-value services while aligning business and IT Operations using Splunk’s IT Service Intelligence solution. By leveraging the machine data you are already collecting, the exercise provides a transformational method to model high-value services and rapidly build custom visualizations and dashboards. From executive leaders to administrators, these personalized service-centric views provide powerful analytics and machine learning to transform service intelligence across your organization.
Summary of fast development and cloud native architecture along with cost optimization techniques. Presented as opening keynote at the Utility and Cloud Computing 2014 as part of the Cloud Control Workshop.
Slides of an internal talk given at Twitter, similar to a previous webinar for Red Hat with the same title.
Speeding up development is a key concern; cloud and technology improvements like Docker speed up key steps that make continuous delivery possible. Breaking up the work into many separate microservices and datastores with stable APIs allows teams to make progress independently so that the organization scales. Monolithic apps are preferred for small projects built by small teams, and when very low latency and high efficiency are the primary requirements. Monitoring microservices is currently a challenge, with solutions starting to emerge.
Cisco Live: Containers on Enterprise Compute and Networks (Michael Duarte)
Containers are the next evolutionary step in how applications are managed and consumed, allowing application teams to control and optimize their application deployment process. Along with the advantages they provide to application teams, containers also bring a dynamic shift in data center design, allowing for better resource utilization and management and resulting in both cost savings and faster IT. This session will explain how Cisco IT has delivered this new paradigm in cloud technology by using Cisco ACI, Cisco UCS and open-source solutions like Kubernetes.
Keynote at Dockercon Europe Amsterdam Dec 4th, 2014.
Speeding up development with Docker.
Summary of some interesting web scale microservice architectures.
Please send me updates and corrections to the architecture summaries @adrianco
Thanks Adrian
B5 - Introduction to Microsoft Flow - Isabelle Van Campenhoudt, Sabrine chouk,... (SPS Paris)
Microsoft Flow is a new workflow tool that runs in Azure and provides workflow and automation services to many platforms, including SharePoint. Microsoft Flow goes well beyond what SharePoint Designer offered. The tool's considerable added value grows every week. Microsoft Flow belongs in the toolbox of every organization that uses Office 365.
In this session we will give you a general overview of the tool, illustrate the breadth of its capabilities through numerous demonstrations, and share lessons learned from having used it since its inception.
Governance and data protection will also be covered, as will interaction with PowerApps and PowerBI, and hybrid scenarios (interaction with SharePoint on-prem, SQL Server on-prem, or SAP).
OpenNebulaConf2017US: Welcome and project update by Ignacio M. Llorente and R... (OpenNebula Project)
We’re moving into a world of open cloud, where each organization can find the right cloud for its unique needs. A single cloud management platform cannot be all things to all people; there will be a cloud space with several offerings focused on different environments and/or industries. The OpenNebula commitment to the open cloud flows directly out of its mission (to become the simplest cloud enabling platform) and its purpose (to bring simplicity to the private and hybrid enterprise cloud). OpenNebula exists to help companies build simple, cost-effective, reliable, open enterprise clouds on existing IT infrastructure. The OpenNebula Conference will be a great opportunity to restate our vision and commitment, to look back at how the project has grown in the last 8 years, and to give a peek at what to expect from the project in the near future.
Serverless Security: A pragmatic primer for builders and defenders (James Wickett)
Talk given at O'Reilly's 2017 Velocity Conference in San Jose.
Serverless is the design pattern for writing applications at scale without the necessity of managing infrastructure. This is done across the continuum of the cloud—from storage as a service to database as a service—but the center of serverless is functions as a service (FaaS). (Current FaaS offerings include AWS Lambda, Azure Functions, and Google Cloud Functions.) Now processes run for milliseconds before being destroyed and then get instantiated for subsequent requests.
Serverless adds simplicity and a new economic model to cloud computing, but it creates some unique security challenges. In serverless architectures, technologies like antivirus and intrusion detection become meaningless. James Wickett explores practical security approaches for serverless in four key areas—the software supply chain, the delivery pipeline, data flow, and attack detection—and examines how traditional approaches need to be adapted to serverless.
Even if you don’t have any experience with serverless, don’t worry; this session starts with the basics. You’ll learn what serverless is (hint: it’s still being defined) and practical patterns for serverless adoption.
Microservices architecture has changed how companies develop and deploy applications. This change has affected the testing process as well: new techniques have emerged and others have been enhanced. Does your organization wish to achieve unicorn-like speed, with N deploys per day, or a lead time (time to business value from idea to production) measured in days or weeks instead of months or years? In this session, we will not only describe how to use Istio and Kubernetes but actively demonstrate 3 specific techniques that you can use immediately following the session.
Couchbase Cloud No Equal (Rick Jacobs, Couchbase) Kafka Summit 2020 (HostedbyConfluent)
This session will describe and demonstrate the longstanding integration between Couchbase Server and Apache Kafka and will include descriptions of both the mechanics of the integration and practical situations when combining these products is appropriate.
Many industries are going through a digital transformation as their existing business models are being disrupted and new competitors emerge. The key driver is a need for faster time-to-value as a direct relationship with customers provides analytics that drive personalization and rapid product development. There’s a cultural aspect to the change, as well as new organizational patterns that go along with a migration to cloud native services. Application architectures are evolving from monoliths to microservices and serverless deployments, and they are becoming more distributed, highly available, and resilient. The highly automated practices that have built up around DevOps are moving to the mainstream, and some new techniques are emerging around security red teams and chaos engineering.
Continuous Delivery: Fly the Friendly CI in Pivotal Cloud Foundry with Concourse (VMware Tanzu)
Concourse is an open source continuous integration (CI) system designed for agile development teams. It supports developers who practice test-driven development and continuous delivery (CD) by automating a team's build-to-release process, inclusive of all automated testing.
Concourse provides dependable results for each build run. It allows agile development teams to deliver business value at a much higher velocity. It allows teams to treat every code commit as if it’s about to be deployed to production.
In this webinar, we’ll talk about how teams practice agile development in relation to developing, testing and deploying apps in Cloud Foundry. We’ll also cover the role that Concourse plays in aiding high-velocity delivery of applications.
Our agenda includes:
- What is CI / CD and how do these practices fit into Pivotal's development practices
- Overview of Concourse and how it differs from other CI / CD systems
- Why Pipelines are useful for continuously delivering apps to Pivotal Cloud Foundry
- Why containers are useful for continuously delivering apps to Pivotal Cloud Foundry
- Examples of how these concepts work in practice
- How to get started using Concourse to continuously deliver value
Presenters : Greg Chase, James Ma, Topher Bullock, Pivotal
Similar to Monitoring Modern Architectures with Data Science (20)
Streaming a Million Likes/Second: Real-Time Interactions on Live Video (C4Media)
Video and slides synchronized, mp3 and slide download available at URL https://bit.ly/39NIjLV.
Akhilesh Gupta does a technical deep-dive into how Linkedin uses the Play/Akka Framework and a scalable distributed system to enable live interactions like likes/comments at massive scale at extremely low costs across multiple data centers. Filmed at qconlondon.com.
Akhilesh Gupta is the technical lead for LinkedIn's Real-time delivery infrastructure and LinkedIn Messaging. He has been working on the revamp of LinkedIn’s offerings to instant, real-time experiences. Before this, he was the head of engineering for the Ride Experience program at Uber Technologies in San Francisco.
Next Generation Client APIs in Envoy MobileC4Media
Video and slides synchronized, mp3 and slide download available at URL https://bit.ly/2x0Fav8.
Jose Nino guides the audience through the journey of Mobile APIs at Lyft. He focuses on how the team has reaped the benefits of API generation to experiment with the network transport layer. He also discusses recent developments the team has made with Envoy Mobile and the roadmap ahead. Filmed at qconlondon.com.
Jose Nino works as a Software Engineer at Lyft.
Software Teams and Teamwork Trends Report Q1 2020C4Media
How do we cope with an environment that has been radically disrupted, where people are suddenly thrust into remote work in a chaotic state? What are the emerging good practices and new ideas that are shaping the way in which software development teams work? What can we do to make the workplace a more secure and diverse one while increasing the productivity of our teams? This report aims to assist technical leaders in making mid- to long-term decisions that will have a positive impact on their organisations and teams and help individual contributors find the practices, approaches, tools, techniques, and frameworks that can help them get a better experience at work - irrespective of where they are working from.
Understand the Trade-offs Using Compilers for Java ApplicationsC4Media
Video and slides synchronized, mp3 and slide download available at URL https://bit.ly/2QCmmJ0.
Mark Stoodley examines some of the strengths and weaknesses of the different Java compilation technologies, if one was to apply them in isolation. Stoodley discusses how production JVMs are assembling a combination of these tools that work together to provide excellent performance across the large spectrum of applications written in Java and JVM based languages. Filmed at qconsf.com.
Mark Stoodley joined IBM Canada to build Java JIT compilers for production use and led the team that delivered AOT compilation in the IBM SDK for Java 6. He spent the last five years leading the effort to open source nearly 4.3 million lines of source code from the IBM J9 Java Virtual Machine to create the two open source projects Eclipse OMR and Eclipse OpenJ9, and now co-leads both projects.
Video and slides synchronized, mp3 and slide download available at URL https://bit.ly/2y2yPiS.
Colin McCabe talks about the ongoing effort to replace the use of Zookeeper in Kafka: why they want to do it and how it will work. He discusses the limitations they have found and how Kafka benefits both in terms of stability and scalability by bringing consensus in house. He talks about their progress, what work is remaining, and how contributors can help. Filmed at qconsf.com.
Colin McCabe is a Kafka committer at Confluent, working on the scalability and extensibility of Kafka. Previously, he worked on the Hadoop Distributed Filesystem and the Ceph Filesystem.
Video and slides synchronized, mp3 and slide download available at URL https://bit.ly/2SXXXiD.
Katharina Probst talks about what it means to act like an owner and why teams need ownership to be high-performing. When team members, regardless of whether they have a formal leadership role or not, act like owners, magical things can happen. She shares ideas that we can apply to our own work, and talks about how to recognize when we don’t live up to our own expectations of acting like an owner. Filmed at qconsf.com.
Katharina Probst is a Senior Engineering Leader, Kubernetes & SaaS at Google. Before this, she was leading engineering teams at Netflix, being responsible for the Netflix API, which helps bring Netflix streaming to millions of people around the world. Prior to joining Netflix, she was in the cloud computing team at Google, where she saw cloud computing from the provider side.
Does Java Need Inline Types? What Project Valhalla Can Bring to JavaC4Media
Video and slides synchronized, mp3 and slide download available at URL https://bit.ly/2T04Lw4.
Sergey Kuksenko talks about the performance benefits inline types bring to Java and how to exploit them. Inline/value types are the key part of experimental project Valhalla, which should bring new abilities to the Java language. Filmed at qconsf.com.
Sergey Kuksenko is a Java Performance Engineer at Oracle working on a variety of Java and JVM performance enhancements. He started working as Java Engineer in 1996 and as Java Performance Engineer in 2005. He has had a passion for exploring how Java works on modern hardware.
Do you need service meshes in your tech stack?
This on-line guide aims to answer pertinent questions for software architects and technical leaders, such as: what is a service mesh?, do I need a service mesh?, how do I evaluate the different service mesh offerings? In software architecture, a service mesh is a dedicated infrastructure layer for facilitating service-to-service communications between microservices, often using a sidecar proxy.
Video and slides synchronized, mp3 and slide download available at URL https://bit.ly/2UgQ3BU.
Christie Wilson describes what to expect from CI/CD in 2019, and how Tekton is helping bring that to as many tools as possible, such as Jenkins X and Prow. Wilson talks about Tekton itself and performs a live demo that shows how cloud native CI/CD can help debug, surface and fix mistakes faster. Filmed at qconsf.com.
Christie Wilson is a software engineer at Google, currently leading the Tekton project. Over the past decade, she has worked in the mobile, financial and video game industries. Prior to working at Google she led a team of software developers to build load testing tools for AAA video game titles, and founded the Vancouver chapter of PyLadies.
Video and slides synchronized, mp3 and slide download available at URL https://bit.ly/2S7lDiS.
Sasha Rosenbaum shows how a CI/CD pipeline for Machine Learning can greatly improve both productivity and reliability. Filmed at qconsf.com.
Sasha Rosenbaum is a Program Manager on the Azure DevOps engineering team, focused on improving the alignment of the product with open source software. She is a co-organizer of the DevOps Days Chicago and the DeliveryConf conferences, and recently published a book on Serverless computing in Azure with .NET.
Video and slides synchronized, mp3 and slide download available at URL https://bit.ly/36epVKg.
Todd Montgomery discusses the techniques and lessons learned from implementing Aeron Cluster. His focus is on how Raft can be implemented on Aeron, minimizing the network round trip overhead, and comparing single process to a fully distributed cluster. Filmed at qconsf.com.
Todd Montgomery is a networking hacker who has researched, designed, and built numerous protocols, messaging-oriented middleware systems, and real-time data systems, done research for NASA, contributed to the IETF and IEEE, and co-founded two startups. He currently works as an independent consultant and is active in several open source projects.
Architectures That Scale Deep - Regaining Control in Deep SystemsC4Media
Video and slides synchronized, mp3 and slide download available at URL https://bit.ly/2FWc5Sk.
Ben Sigelman talks about "Deep Systems", their common properties and re-introduces the fundamentals of control theory from the 1960s, including the original conceptualizations of Observability & Controllability. He uses examples from Google & other companies to illustrate how deep systems have damaged people's ability to observe software, and what needs to be done in order to regain control. Filmed at qconsf.com.
Ben Sigelman is a co-founder and the CEO at LightStep, a co-creator of Dapper (Google’s distributed tracing system), and co-creator of the OpenTracing and OpenTelemetry projects (both part of the CNCF). His work and interests gravitate towards observability, especially where microservices, high transaction volumes, and large engineering organizations are involved.
ML in the Browser: Interactive Experiences with Tensorflow.jsC4Media
Video and slides synchronized, mp3 and slide download available at URL https://bit.ly/39SddUL.
Victor Dibia provides a friendly introduction to machine learning, covers concrete steps on how front-end developers can create their own ML models and deploy them as part of web applications. He discusses his experience building Handtrack.js - a library for prototyping real time hand tracking interactions in the browser. Filmed at qconsf.com.
Victor Dibia is a Research Engineer with Cloudera’s Fast Forward Labs. Prior to this, he was a Research Staff Member at the IBM TJ Watson Research Center, New York. His research interests are at the intersection of human computer interaction, computational social science, and applied AI.
Video and slides synchronized, mp3 and slide download available at URL https://bit.ly/2s9T3Vl.
Colin Eberhardt looks at some of the internals of WebAssembly, explores how it works “under the hood”, and looks at how to create a (simple) compiler that targets this runtime. Filmed at qconsf.com.
Colin Eberhardt is the Technology Director at Scott Logic, a UK-based software consultancy where they create complex applications for their financial services clients. He is an avid technology enthusiast, spending his evenings contributing to open source projects, writing blog posts and learning as much as he can.
User & Device Identity for Microservices @ Netflix ScaleC4Media
Video and slides synchronized, mp3 and slide download available at URL https://bit.ly/2S9tOgy.
Satyajit Thadeshwar provides useful insights on how Netflix implemented a secure, token-agnostic, identity solution that works with services operating at a massive scale. He shares some of the lessons learned from this process, both from architectural diagrams and code. Filmed at qconsf.com.
Satyajit Thadeshwar is an engineer on the Product Edge Access Services team at Netflix, where he works on some of the most critical services focusing on user and device authentication. He has more than a decade of experience building fault-tolerant and highly scalable, distributed systems.
Video and slides synchronized, mp3 and slide download available at URL https://bit.ly/2Ezs08q.
Justin Ryan talks about Netflix’s scalability issues and some of the ways they addressed them. He shares successes they’ve had from unintuitively partitioning computation into multiple services to get better runtime characteristics. He introduces us to useful probabilistic data structures, innovative bi-directional data passing, and open-source projects available from Netflix that make this all possible. Filmed at qconsf.com.
Justin Ryan is Playback Edge Engineering at Netflix. He works on some of the most critical services at Netflix, specifically focusing on user and device authentication. Years of building developer tools has also given him a healthy set of opinions on developer productivity.
Make Your Electron App Feel at Home EverywhereC4Media
Video and slides synchronized, mp3 and slide download available at URL https://bit.ly/2Z4ZJjn.
Kilian Valkhof discusses the process of making an Electron app feel at home on all three platforms: Windows, MacOS and Linux, making devs aware of the pitfalls and how to avoid them. Filmed at qconsf.com.
Kilian Valkhof is a Front-end Developer & User-experience Designer at Firstversionist. He writes about various topics, from design to machine learning, on his personal website, kilianvalkhof.com and is a frequent contributor to open source software. He is part of the Electron governance team that oversees the development of the Electron framework.
Video and slides synchronized, mp3 and slide download available at URL https://bit.ly/344PnB1.
Steve Klabnik goes over the deep details of how async/await works in Rust, covering concepts like coroutines, generators, stack-less vs stack-ful, "pinning", and more. Filmed at qconsf.com.
Steve Klabnik is on the core team of Rust, leads the documentation team, and is an author of "The Rust Programming Language." He is a frequent speaker at conferences and is a prolific open source contributor, previously working on projects such as Ruby and Ruby on Rails.
Video and slides synchronized, mp3 and slide download available at URL https://bit.ly/2OUz6dt.
Chris Riccomini talks about the current state-of-the-art in data pipelines and data warehousing, and shares some of the solutions to current problems dealing with data streaming and warehousing. Filmed at qconsf.com.
Chris Riccomini works as a Software Engineer at WePay.
Automated Testing for Terraform, Docker, Packer, Kubernetes, and MoreC4Media
Video and slides synchronized, mp3 and slide download available at URL https://bit.ly/2rm4hFD.
Yevgeniy Brikman talks about how to write automated tests for infrastructure code, including the code written for use with tools such as Terraform, Docker, Packer, and Kubernetes. Topics covered include: unit tests, integration tests, end-to-end tests, dependency injection, test parallelism, retries and error handling, static analysis, property testing and CI / CD for infrastructure code. Filmed at qconsf.com.
Yevgeniy Brikman is the co-founder of Gruntwork, a company that provides DevOps as a Service. He is the author of two books published by O'Reilly Media: Hello, Startup and Terraform: Up & Running. Previously, he worked as a software engineer at LinkedIn, TripAdvisor, Cisco Systems, and Thomson Financial.
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...DanBrown980551
Do you want to learn how to model and simulate an electrical network from scratch in under an hour?
Then welcome to this PowSyBl workshop, hosted by Rte, the French Transmission System Operator (TSO)!
During the webinar, you will discover the PowSyBl ecosystem as well as handle and study an electrical network through an interactive Python notebook.
PowSyBl is an open source project hosted by LF Energy, which offers a comprehensive set of features for electrical grid modelling and simulation. Among other advanced features, PowSyBl provides:
- A fully editable and extendable library for grid component modelling;
- Visualization tools to display your network;
- Grid simulation tools, such as power flows, security analyses (with or without remedial actions) and sensitivity analyses;
The framework is mostly written in Java, with a Python binding so that Python developers can access PowSyBl functionalities as well.
What you will learn during the webinar:
- For beginners: discover PowSyBl's functionalities through a quick general presentation and the notebook, without needing any expert coding skills;
- For advanced developers: master the skills to efficiently apply PowSyBl functionalities to your real-world scenarios.
Epistemic Interaction - tuning interfaces to provide information for AI supportAlan Dix
Paper presented at SYNERGY workshop at AVI 2024, Genoa, Italy. 3rd June 2024
https://alandix.com/academic/papers/synergy2024-epistemic/
As machine learning integrates deeper into human-computer interactions, the concept of epistemic interaction emerges, aiming to refine these interactions to enhance system adaptability. This approach encourages minor, intentional adjustments in user behaviour to enrich the data available for system learning. This paper introduces epistemic interaction within the context of human-system communication, illustrating how deliberate interaction design can improve system understanding and adaptation. Through concrete examples, we demonstrate the potential of epistemic interaction to significantly advance human-computer interaction by leveraging intuitive human communication strategies to inform system design and functionality, offering a novel pathway for enriching user-system engagements.
Connector Corner: Automate dynamic content and events by pushing a buttonDianaGray10
Here is something new! In our next Connector Corner webinar, we will demonstrate how you can use a single workflow to:
Create a campaign using Mailchimp with merge tags/fields
Send an interactive Slack channel message (using buttons)
Have the message received by managers and peers along with a test email for review
But there’s more:
In a second workflow supporting the same use case, you’ll see:
Your campaign sent to target colleagues for approval
If the “Approve” button is clicked, a Jira/Zendesk ticket is created for the marketing design team
But—if the “Reject” button is pushed, colleagues will be alerted via Slack message
Join us to learn more about this new, human-in-the-loop capability, brought to you by Integration Service connectors.
And...
Speakers:
Akshay Agnihotri, Product Manager
Charlie Greenberg, Host
Transcript: Selling digital books in 2024: Insights from industry leaders - T...BookNet Canada
The publishing industry has been selling digital audiobooks and ebooks for over a decade and has found its groove. What’s changed? What has stayed the same? Where do we go from here? Join a group of leading sales peers from across the industry for a conversation about the lessons learned since the popularization of digital books, best practices, digital book supply chain management, and more.
Link to video recording: https://bnctechforum.ca/sessions/selling-digital-books-in-2024-insights-from-industry-leaders/
Presented by BookNet Canada on May 28, 2024, with support from the Department of Canadian Heritage.
State of ICS and IoT Cyber Threat Landscape Report 2024 previewPrayukth K V
The IoT and OT threat landscape report has been prepared by the Threat Research Team at Sectrio using data from Sectrio, cyber threat intelligence farming facilities spread across over 85 cities around the world. In addition, Sectrio also runs AI-based advanced threat and payload engagement facilities that serve as sinks to attract and engage sophisticated threat actors, and newer malware including new variants and latent threats that are at an earlier stage of development.
The latest edition of the OT/ICS and IoT security Threat Landscape Report 2024 also covers:
State of global ICS asset and network exposure
Sectoral targets and attacks as well as the cost of ransom
Global APT activity, AI usage, actor and tactic profiles, and implications
Rise in volumes of AI-powered cyberattacks
Major cyber events in 2024
Malware and malicious payload trends
Cyberattack types and targets
Vulnerability exploit attempts on CVEs
Attacks on counties – USA
Expansion of bot farms – how, where, and why
In-depth analysis of the cyber threat landscape across North America, South America, Europe, APAC, and the Middle East
Why are attacks on smart factories rising?
Cyber risk predictions
Axis of attacks – Europe
Systemic attacks in the Middle East
Download the full report from here:
https://sectrio.com/resources/ot-threat-landscape-reports/sectrio-releases-ot-ics-and-iot-security-threat-landscape-report-2024/
Slack (or Teams) Automation for Bonterra Impact Management (fka Social Soluti...Jeffrey Haguewood
Sidekick Solutions uses Bonterra Impact Management (fka Social Solutions Apricot) and automation solutions to integrate data for business workflows.
We believe integration and automation are essential to user experience and the promise of efficient work through technology. Automation is the critical ingredient to realizing that full vision. We develop integration products and services for Bonterra Case Management software to support the deployment of automations for a variety of use cases.
This video focuses on the notifications, alerts, and approval requests using Slack for Bonterra Impact Management. The solutions covered in this webinar can also be deployed for Microsoft Teams.
Interested in deploying notification automations for Bonterra Impact Management? Contact us at sales@sidekicksolutionsllc.com to discuss next steps.
Neuro-symbolic is not enough, we need neuro-*semantic*Frank van Harmelen
Neuro-symbolic (NeSy) AI is on the rise. However, simply machine learning on just any symbolic structure is not sufficient to really harvest the gains of NeSy. These will only be gained when the symbolic structures have an actual semantics. I give an operational definition of semantics as “predictable inference”.
All of this is illustrated with link prediction over knowledge graphs, but the argument is general.
Builder.ai Founder Sachin Dev Duggal's Strategic Approach to Create an Innova...Ramesh Iyer
In today's fast-changing business world, companies that adapt and embrace new ideas often need help to keep up with the competition. However, fostering a culture of innovation takes much work. It takes vision, leadership and willingness to take risks in the right proportion. Sachin Dev Duggal, co-founder of Builder.ai, has perfected the art of this balance, creating a company culture where creativity and growth are nurtured at each stage.
Key Trends Shaping the Future of Infrastructure.pdfCheryl Hung
Keynote at DIGIT West Expo, Glasgow on 29 May 2024.
Cheryl Hung, ochery.com
Sr Director, Infrastructure Ecosystem, Arm.
The key trends across hardware, cloud and open-source; exploring how these areas are likely to mature and develop over the short and long-term, and then considering how organisations can position themselves to adapt and thrive.
2. InfoQ.com: News & Community Site
• Over 1,000,000 software developers, architects and CTOs read the site worldwide every month
• 250,000 senior developers subscribe to our weekly newsletter
• Published in 4 languages (English, Chinese, Japanese and Brazilian Portuguese)
• Post content from our QCon conferences
• 2 dedicated podcast channels: The InfoQ Podcast, with a focus on Architecture and The Engineering Culture Podcast, with a focus on building
• 96 deep dives on innovative topics packed as downloadable emags and minibooks
• Over 40 new content items per week
Watch the video with slide synchronization on InfoQ.com!
https://www.infoq.com/presentations/modern-architectures-data-science
3. Purpose of QCon
- to empower software development by facilitating the spread of knowledge and innovation
Strategy
- practitioner-driven conference designed for YOU: influencers of change and innovation in your teams
- speakers and topics driving the evolution and innovation
- connecting and catalyzing the influencers and innovators
Highlights
- attended by more than 12,000 delegates since 2007
- held in 9 cities worldwide
Presented at QCon San Francisco
www.qconsf.com
4. NOT FOR GENERAL DISTRIBUTION ONLY INTENDED FOR REGISTERED ATTENDEES OF QCON SF 2017
Much has changed since simple distributed client/server architectures, and so too have the technologies and industry practices around monitoring.
Cloud-native, DevOps, blue/green deployments, serverless, edge/fog, and IoT all fit into a world much better handled by the emerging Artificial Intelligence for IT Operations domain than by traditional ITIL/SDLC approaches.
Abstract
5.
Software continues to eat the world. Software automates, defines.
The world is "going digital" and it's quite exciting, but this always-connected, from-everything-to-everywhere world adds complexity to software systems. This talk will dive into some of that complexity and how modern data science and algorithms are being applied to "fight machines with machines," so to speak.
Abstract
6.
7.
8.
moogsoft
9.
10.
discovery
monitoring
(observing)
analytics
11.
fluid
infrastructure
containers dc/os server-less software defined/dynamic
12.
anything
anywhere
anytime
data/tx from
mobile IoT bots/RUM
13.
14.
"if/else"
rules
algorithms
ML
millions millions
noise filt.
clustering prc
deja
vu
15.
AIOps
AI for IT Ops
16.
customer/
business
perspective
17. COURAGE INSIGHT
CONTEXT VELOCITY
ARE YOU READY
TO GO DIGITAL ?
This slide courtesy Andy Brown, Sandhill East
https://www.linkedin.com/in/andybrown63/
18. “Silicon Valley is coming.
There are hundreds of startups with a lot
of brains and money working on various
alternatives to traditional banking. They
are very good at reducing the ‘pain
points’ …”
JAMIE DIMON
JPMorgan Chase & Co.
Chairman & Chief Executive Officer
April 2015
This slide courtesy Andy Brown, Sandhill East
https://www.linkedin.com/in/andybrown63/
19.
go digital
or die trying
20.
gs wants to
become "google
of wall st."
21.
stanford
PhD
CIO CFO
marquee
data analytics data
api
api
monitor
observe
analyze
22. THE REALLY BIG PICTURE
2020 2021 2022 2023 2024
In 5 – 10 years, every company will be a Digital Software Business
Security, service assurance and consumer centricity become THE BOARD LEVEL PRIORITY
Enterprises going DIGITAL ADOPT HYBRID IT
This slide courtesy Andy Brown, Sandhill East
https://www.linkedin.com/in/andybrown63/
23. Traditional: 40% Change, 60% Run
- Infrastructure Led
- Owns Facilities, Data Centers, Hardware, Networks et al
- Has Refresh Cycles caused by Capital Depreciation
- Still using Waterfall for App Dev
- Thinking led by Inf Technologists (hardware, DB, OS et al)
- Traditional Procurement
- Less Agile, Change resistant
Hybrid: 60% Change, 40% Run
- AppDev starting to lead
- Owns less Facilities, Data Centers, Hardware, Networks, et al
- Still Has Refresh Cycles caused by Capital Depreciation
- Combination Waterfall & Agile for App Dev
- Thinking led by CIO “Move to Cloud”
- Traditional Procurement weakening
- More Agile, Less Change resistant
Digital: 80% Change, 20% Run
- AppDev leads decisioning
- Doesn’t own hardware
- Refresh doesn’t exist
- All Agile for App Dev
- Thinking led by CIO “Move to Cloud”
- Cloud Centric “Marketplace” Procurement
- Embraces Change, Very Agile
This slide courtesy Andy Brown, Sandhill East
https://www.linkedin.com/in/andybrown63/
24.
2045
?
25.
SNMP / traps
or
Daylight Savings
26.
AMRS
EMEA
APAC
EPS
every ip interface globally
AIOps
EdgeOps
EdgeOps
EdgeOps
27.
algorithms we
use
28.
This slide courtesy our Chief Scientist Dr. Rob Harper -- Do check out his great 3-part blog on Machine Learning in Moogsoft AIOps: https://www.moogsoft.com/author/robharper/
29.
By Hui Li on Subconscious Musings April 12, 2017
30.
This slide courtesy our Chief Scientist Dr. Rob Harper -- Do check out his great 3-part blog on Machine Learning in Moogsoft AIOps: https://www.moogsoft.com/author/robharper/
31.
This slide courtesy our Chief Scientist Dr. Rob Harper -- Do check out his great 3-part blog on Machine Learning in Moogsoft AIOps: https://www.moogsoft.com/author/robharper/
32.
regression classification clustering
This slide courtesy our Chief Scientist Dr. Rob Harper -- Do check out his great 3-part blog on Machine Learning in Moogsoft AIOps: https://www.moogsoft.com/author/robharper/
33.
classification supervised
“learn by example” approach. Supervised learning systems need to be given examples of what is “good” and
what is “bad”
This slide courtesy our Chief Scientist Dr. Rob Harper -- Do check out his great 3-part blog on Machine Learning in Moogsoft AIOps: https://www.moogsoft.com/author/robharper/
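The "learn by example" idea can be sketched with a very small supervised classifier. This is only an illustration, not the method used in the talk: it assumes a nearest-centroid model, and the feature names (CPU utilisation, error rate), the sample values, and the function names `train` and `classify` are all hypothetical.

```python
from collections import defaultdict
from math import dist


def train(examples):
    """Learn one centroid per label from (feature_vector, label) examples."""
    sums = {}
    counts = defaultdict(int)
    for features, label in examples:
        if label not in sums:
            sums[label] = [0.0] * len(features)
        sums[label] = [s + f for s, f in zip(sums[label], features)]
        counts[label] += 1
    return {label: [s / counts[label] for s in total] for label, total in sums.items()}


def classify(centroids, features):
    """Return the label whose centroid is closest to the new observation."""
    return min(centroids, key=lambda label: dist(centroids[label], features))


# Hypothetical labelled examples: (cpu_utilisation, error_rate) -> "good" / "bad"
labelled = [
    ((0.20, 0.01), "good"),
    ((0.30, 0.02), "good"),
    ((0.95, 0.40), "bad"),
    ((0.90, 0.35), "bad"),
]
model = train(labelled)
print(classify(model, (0.25, 0.00)))
print(classify(model, (0.85, 0.50)))
```

The point of the sketch is the dependency on labels: without the "good" and "bad" examples there is nothing for the model to learn from.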
34.
classification
35.
clustering unsupervised
Patterns that you didn’t know existed prior. Recommender systems rely heavily on these techniques.
36.
supervised machine learning "hot dog?" "not hot dog?"
37.
38.
algorithms we
use
39.
lua code: https://pastebin.com/ZZmSNaHX
SethBling MarI/O
neural nets
https://www.youtube.com/watch?v=qv6UVOQ0F44
40.
k-means clustering
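The slide only names the technique, so here is a minimal sketch of standard k-means (Lloyd's algorithm) for reference. It assumes 2-D points and, to stay deterministic, takes the first k points as initial centroids; real implementations usually use random or k-means++ initialisation. The function name `kmeans` and the sample points are illustrative only.

```python
from math import dist


def kmeans(points, k, iterations=20):
    """Cluster points into k groups by alternating assignment and centroid update."""
    # Assumption for this sketch: seed the centroids with the first k points.
    centroids = [tuple(p) for p in points[:k]]
    assignment = []
    for _ in range(iterations):
        # Assignment step: each point joins its nearest centroid.
        assignment = [min(range(k), key=lambda i: dist(p, centroids[i])) for p in points]
        # Update step: each centroid moves to the mean of its members.
        new_centroids = []
        for i in range(k):
            members = [p for p, a in zip(points, assignment) if a == i]
            if members:
                new_centroids.append(tuple(sum(c) / len(members) for c in zip(*members)))
            else:
                new_centroids.append(centroids[i])  # keep an empty cluster where it was
        if new_centroids == centroids:
            break  # converged: no centroid moved
        centroids = new_centroids
    return centroids, assignment


points = [(0, 0), (10, 10), (0, 1), (1, 0), (10, 11), (11, 10)]
centroids, assignment = kmeans(points, k=2)
print(assignment)
```

Unlike the supervised case, no labels are supplied; the grouping comes purely from distances between the points.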
41.
matrix factorization
42.
shannon entropy
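For reference, Shannon entropy of a discrete distribution is H = -Σ p_i log2(p_i), measured in bits. The sketch below computes it over the symbols of a sequence; how the talk's product applies entropy to alerts is not stated on the slide, so the function name `shannon_entropy` and the string inputs are illustrative only.

```python
from collections import Counter
from math import log2


def shannon_entropy(symbols):
    """Entropy in bits of the empirical distribution of the given symbols."""
    counts = Counter(symbols)
    total = sum(counts.values())
    return -sum((c / total) * log2(c / total) for c in counts.values())


print(shannon_entropy("aaaa"))  # one symbol only: nothing surprising
print(shannon_entropy("abab"))  # two equally likely symbols
print(shannon_entropy("abcd"))  # four equally likely symbols
```

Low entropy means the data is repetitive and predictable; high entropy means each observation carries more information.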
43.
typical entropy distribution
44.
algorithmic
workflow
45.
millions events
de-duplication
cluster analysis
algorithms
situation room
teams-centric
thousands of alerts
algorithmic
noise filtering
[shannon entropy]
tens of alert clusters
(situations)
"today's warnings are
tomorrow's outages"
"all about the MTTR"
algorithmic
probable root cause
AIOps
Algorithmic IT Operations
knowledge capture
auto-recurrence detect
entropy_threshold
non-noisy alerts
what you're likely
doing today
L1 "Catch & Dispatch"
(automated)
ignore
situation next steps
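The flow on this slide (events, de-duplication, noise filtering against `entropy_threshold`, clustering into situations) can be sketched roughly as follows. This is a toy under stated assumptions, not the product's algorithm: events are hypothetical `(timestamp, source, message)` tuples, the noise score is the surprisal -log2(p) of an alert (the per-event term that Shannon entropy averages), and clustering is a simple time-gap grouping. Only the name `entropy_threshold` comes from the slide.

```python
from math import log2


def deduplicate(events):
    """Collapse events sharing (source, message) into one alert with a count."""
    alerts = {}
    for ts, source, message in events:
        key = (source, message)
        if key not in alerts:
            alerts[key] = {"source": source, "message": message, "first_seen": ts, "count": 0}
        alerts[key]["count"] += 1
    return list(alerts.values())


def filter_noise(alerts, entropy_threshold):
    """Keep alerts whose surprisal -log2(p) meets the threshold (p = share of all events)."""
    total = sum(a["count"] for a in alerts)
    return [a for a in alerts if -log2(a["count"] / total) >= entropy_threshold]


def cluster_by_time(alerts, max_gap):
    """Group alerts whose first_seen is within max_gap seconds of the previous alert."""
    situations = []
    for alert in sorted(alerts, key=lambda a: a["first_seen"]):
        if situations and alert["first_seen"] - situations[-1][-1]["first_seen"] <= max_gap:
            situations[-1].append(alert)
        else:
            situations.append([alert])
    return situations


# Hypothetical event stream: a chatty heartbeat plus three rare alerts.
events = [(t, "web-01", "heartbeat ok") for t in range(0, 100, 2)] + [
    (10, "db-01", "replication lag"),
    (12, "app-03", "connection timeout"),
    (90, "lb-02", "pool exhausted"),
]
alerts = deduplicate(events)
signal = filter_noise(alerts, entropy_threshold=2.0)
situations = cluster_by_time(signal, max_gap=30)
print(len(events), len(alerts), len(signal), len(situations))
```

Each stage reduces volume, which mirrors the funnel on the slide: many events become fewer alerts, the frequent low-information ones are dropped, and the remainder is grouped for a team to work on.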
46.
...speaking of classification
47.
fault vs audit
fix → optimize
48.
monitoring fail-around
analytics analytics
fail-around fail-around
monitor
49.
weld the datacenter doors shut
50.
<lofty_tangent>
non-technical