From Cardinal(ity) Sins to Cost-Efficient Metrics Aggregation

Are you collecting just about every metric under the sun, and the kitchen sink too? Understanding the cost of collecting metrics and the usefulness of those metrics is the only way to scale in a cloud native world. You can’t get away with just collecting everything as you grow. How can you decide what to collect, what to drop, and what to aggregate while still being able to alert, triage, and remediate? Gain immediate insights into high-cost data (DPPS), when to drop time series data, and how to determine when the value of that data is at its lowest.

chronosphere.io
From Cardinal(ity) Sins to Cost Efficient Metrics Aggregation
Paige Cruz, retired SRE
open source observability advocate

CFO looking at the o11y bill

Cloud Native Observability bills are outrageous

Cloud Native Data Growth
[Figure: business increase in scale versus observability data increase in scale across three eras: On-Premises (Data center), 1998 - 2008; Cloud (IaaS, VM-based), 2008 - 2018; Cloud Native (Microservices and Containers), 2018 - ?]
*Source: ESG Distributed Cloud Series: Observability, Feb 2022, Scott Sinclair and Rob Strechay

Most recently [vendor] was looked at to help monitor a small Kubernetes test cluster. 3 nodes.
Now the base rate of $18/mo is fine…except now they charge $1 per container per month past 10 containers per host.
Since K8s (depending on how you install it) runs a bunch of little containers handling various back end things, you might not deploy anything to the cluster and still be WAY over that 10 container limit.
In our case it came out to like $200/mo to monitor 3 nodes - that were nowhere fully loaded.
- Hacker News thread
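
The arithmetic behind a bill like this can be sketched as follows. This is only an illustration: it assumes the $18 base rate is charged per host (the thread does not say), the container counts are hypothetical, and `monthly_cost` is not any vendor's API.

```python
def monthly_cost(containers_per_host, base_per_host=18.0,
                 included_containers=10, per_extra_container=1.0):
    """Estimate a monthly bill under per-host plus per-container pricing.

    Assumption: the base rate applies to each host, and every container
    beyond the included allowance on a host is billed separately.
    """
    total = 0.0
    for containers in containers_per_host:
        extra = max(0, containers - included_containers)
        total += base_per_host + extra * per_extra_container
    return total


# Hypothetical 3-node cluster where system containers alone push each
# node well past the 10-container allowance.
print(monthly_cost([59, 59, 58]))
```

Under these assumptions the base charge is only $54 of the total; the rest comes from containers over the allowance.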

Data volume
Experiment:
- Hello World app on 4 node Kubernetes cluster with Tracing, End User Metrics (EUM), Logs, Metrics (containers / nodes)
- 30 days == +450 GB

“1 in 10 metrics are actually directly queried”
- ServiceNow

Contributing Factors to the Metrics Bill
- # of containers and infra components: how many things you’re monitoring
- Metric Granularity: how often each metric is scraped
- Retention Window: how long you keep the data
- Cardinality: how many unique combos of dimensions on metrics
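
To see how these four factors multiply together, here is a rough sketch. All numbers and the helper names (`series_count`, `samples_stored`) are hypothetical, and it assumes the worst case where every label combination actually occurs.

```python
from math import prod


def series_count(label_values):
    """Upper bound on time series for one metric: the product of the
    number of distinct values of each label (worst case: every
    combination is observed)."""
    return prod(label_values.values())


def samples_stored(series, scrape_interval_s, retention_days):
    """Samples kept for `series` time series scraped every
    `scrape_interval_s` seconds and retained for `retention_days`."""
    samples_per_day = 86_400 // scrape_interval_s
    return series * samples_per_day * retention_days


# Hypothetical HTTP request counter: label name -> distinct values.
labels = {"service": 20, "endpoint": 15, "status_code": 5, "pod": 50}
series = series_count(labels)  # 20 * 15 * 5 * 50
print(series)
print(samples_stored(series, scrape_interval_s=10, retention_days=14))
```

Adding one more label with 10 distinct values would multiply the series count by up to 10, which is why cardinality tends to dominate the other factors.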

# of containers and infra components
Cost of monitoring can be a factor in determining how quickly to deprecate or sunset features/services/environments

Metric Granularity
Emission time = adjust scrape_interval (from 10s samples -> 30s samples)
Ingest time = aggregate
Over time post-storage = downsampling
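
As an illustration of the third option, downsampling can be sketched as bucketing stored samples into wider windows and keeping one aggregate per window. This is a simplified sketch for a gauge-style metric using the mean; the function name `downsample` and the sample data are made up, and real metrics backends use their own algorithms.

```python
from collections import defaultdict


def downsample(samples, window_s):
    """Reduce (timestamp_seconds, value) samples to one averaged sample
    per `window_s`-second window, keyed by the window's start time."""
    buckets = defaultdict(list)
    for ts, value in samples:
        buckets[ts - ts % window_s].append(value)
    return [(start, sum(vals) / len(vals))
            for start, vals in sorted(buckets.items())]


# Six samples at 10s resolution become two samples at 30s resolution.
raw = [(0, 1.0), (10, 2.0), (20, 3.0), (30, 4.0), (40, 5.0), (50, 6.0)]
print(downsample(raw, window_s=30))
```

Moving from 10s to 30s samples, whether at emission time or after storage, leaves roughly a third of the datapoints. Averaging suits gauges; a counter would more likely keep the last value in each window.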

Retention Window
For operational metrics, most queries (99.9%) do not reach back past 7 days, yet average retention at original granularity ranges from 2-4 weeks
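
A back-of-the-envelope reading of these numbers: if nearly all queries stay within 7 days, the share of full-resolution data that queries actually reach shrinks as retention grows. The sketch below assumes a constant ingest rate and uses a hypothetical helper, `queried_share`.

```python
def queried_share(query_window_days, retention_days):
    """Fraction of retained data that falls inside the window most
    queries look at, assuming data arrives at a constant rate."""
    return min(1.0, query_window_days / retention_days)


for retention in (14, 28):
    share = queried_share(7, retention)
    print(f"{retention}-day retention: {share:.0%} of stored data is within 7 days")
```

Under that assumption, between half and three quarters of the data kept at original granularity sits outside the window that almost every query uses.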

Dropping Data
Low-value tags or entire metrics should be dropped as early as possible
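
The effect of dropping a low-value tag can be sketched by removing the label and merging the series that become identical. The example assumes counter-style values that can be summed; `drop_label` and the sample series are hypothetical, and in practice this would happen in the collection or ingest pipeline rather than in application code.

```python
from collections import defaultdict


def drop_label(series, label):
    """Remove `label` from every series and sum the values of series
    whose remaining labels are identical."""
    merged = defaultdict(float)
    for labels, value in series:
        kept = tuple(sorted((k, v) for k, v in labels.items() if k != label))
        merged[kept] += value
    return [(dict(kept), value) for kept, value in merged.items()]


series = [
    ({"service": "api", "instance": "10.0.0.1"}, 5.0),
    ({"service": "api", "instance": "10.0.0.2"}, 7.0),
    ({"service": "web", "instance": "10.0.0.3"}, 2.0),
]
reduced = drop_label(series, "instance")
print(len(series), "->", len(reduced))  # 3 series collapse to 2
print(reduced)
```

The earlier in the pipeline this happens, the fewer series are transmitted, stored, and billed.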

“What is the value of this metric?”
- You, when auditing metrics

Auditing Your Metrics
● Scope what your team is responsible for
○ filter queries with team:YOURS
● Identify easy wins: metrics that aren’t
○ In a monitor definition
○ Directly queried by end users
○ Powering charts for visited dashboards
● Identify labels that are unnecessary
○ e.g. the Prometheus instance label or instance_type
● Share your successes!
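
The "easy wins" check amounts to a set difference: take every metric the team emits and subtract anything referenced by a monitor, a direct query, or a visited dashboard. A minimal sketch, assuming you can export those four name lists from your tooling (the function and metric names below are invented for illustration):

```python
def unused_metrics(emitted, in_monitors, queried, on_dashboards):
    """Return metrics that nothing references, sorted for stable output.
    These are candidates to review for dropping, not automatic deletes."""
    used = set(in_monitors) | set(queried) | set(on_dashboards)
    return sorted(set(emitted) - used)


emitted = ["http_requests_total", "gc_pause_seconds",
           "cache_evictions_total", "queue_depth"]
candidates = unused_metrics(
    emitted,
    in_monitors=["http_requests_total"],
    queried=["queue_depth"],
    on_dashboards=["http_requests_total"],
)
print(candidates)
```

The result is a review list; a metric that nothing references today may still be worth keeping for incident triage.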

CFO looking at the cost efficiency of metrics

Resources
- How Gloo uses the OTel Collector to drop metrics/labels and provide the Minimum Metrics Set
- How to drop and delete metrics in Prometheus
- How can recording and data roll-up rules help your metrics?
- Observability is Too Damn Expensive - DevOpsDays London

Catch up with me:
- Rescuing On-Call Engineers (send your manager)
- KubeCon OTel 101: Let’s Instrument! (tracing) workshop
- There’s No Place Like Production Conf42 Incident Management
paigerduty@
chronosphere.io
hachyderm.io
LinkedIn

From Cardinal(ity) Sins to Cost-Efficient Metrics Aggregation

1. chronosphere.io: From Cardinal(ity) Sins to Cost Efficient Metrics Aggregation. Paige Cruz, retired SRE, open source observability advocate

2. CFO looking at the o11y bill

6. Cloud Native Observability bills are outrageous

7. Cloud Native Data Growth [figure: business increase in scale vs. observability data increase in scale across three eras: On-Premises (data center), 1998 - 2008; Cloud (IaaS, VM-based), 2008 - 2018; Cloud Native (microservices and containers), 2018 - ?] *Source: ESG Distributed Cloud Series: Observability, Feb 2022, Scott Sinclair and Rob Strechay

8. "Most recently [vendor] was looked at to help monitor a small Kubernetes test cluster. 3 nodes. Now the base rate of $18/mo is fine…except now they charge $1 per container per month past 10 containers per host. Since K8s (depending on how you install it) runs a bunch of little containers handling various back end things, you might not deploy anything to the cluster and still be WAY over that 10 container limit. In our case it came out to like $200/mo to monitor 3 nodes - that were nowhere fully loaded." - Hacker News thread

9. Data volume experiment: a Hello World app on a 4-node Kubernetes cluster with Tracing, End User Metrics (EUM), Logs, and Metrics (containers / nodes). 30 days == +450 GB

10. Mighty Metrics

11. “1 in 10 metrics are actually directly queried” - ServiceNow

12. Contributing Factors to the Metrics Bill
- # of containers and infra components: how many things you’re monitoring
- Metric Granularity: how often each metric is scraped
- Retention Window: how long you keep the data
- Cardinality: how many unique combos of dimensions on metrics
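A rough way to see how these four factors multiply together is sketched below. The function names and all the numbers are illustrative assumptions, not figures from the talk, and real vendor pricing will depend on more than raw sample counts.

```python
SECONDS_PER_DAY = 86_400


def data_points_per_second(targets: int, series_per_target: int,
                           scrape_interval_s: int) -> float:
    """Ingest rate: every active series emits one sample per scrape."""
    return targets * series_per_target / scrape_interval_s


def stored_samples(targets: int, series_per_target: int,
                   scrape_interval_s: int, retention_days: int) -> int:
    """Samples held at original granularity for the whole retention window."""
    samples_per_series_per_day = SECONDS_PER_DAY // scrape_interval_s
    return (targets * series_per_target
            * samples_per_series_per_day * retention_days)


# Hypothetical fleet: 100 targets, 500 series each, kept for 14 days.
at_10s = stored_samples(100, 500, scrape_interval_s=10, retention_days=14)
at_30s = stored_samples(100, 500, scrape_interval_s=30, retention_days=14)
print(at_10s, at_30s, at_10s / at_30s)
```

Because the factors multiply, trimming any one of them (fewer targets, fewer series, a longer interval, a shorter window) scales the total down proportionally.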

14. # of containers and infra components: Cost of monitoring can be a factor in determining how quickly to deprecate or sunset features/services/environments

15. Metric Granularity: Emission time = adjust scrape_interval (from 10s samples -> 30s samples); Ingest time = aggregate; Over time post-storage = downsampling
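As an illustration of the post-storage option, here is a minimal downsampling sketch that folds 10s samples into 30s buckets. It assumes a gauge-style metric where averaging is meaningful (a counter would more likely keep the last value per bucket), and the `downsample` helper is hypothetical rather than any particular backend's API.

```python
from collections import defaultdict


def downsample(samples, bucket_s):
    """Average (timestamp_s, value) samples into fixed-width time buckets.

    Returns a list of (bucket_start_s, mean_value) sorted by time.
    """
    buckets = defaultdict(list)
    for ts, value in samples:
        # Align each sample to the start of the bucket it falls in.
        buckets[ts - ts % bucket_s].append(value)
    return [(start, sum(vals) / len(vals))
            for start, vals in sorted(buckets.items())]


# Six samples scraped every 10s become two 30s data points.
raw = [(0, 1.0), (10, 2.0), (20, 3.0), (30, 4.0), (40, 5.0), (50, 6.0)]
print(downsample(raw, 30))
```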

16. Aggregation: Roll Up

17. Retention Window: For operational metrics, most queries (99.9%) do not look back past 7 days, but average retention at original granularity ranges from 2-4 weeks

18. Dropping Data: Low-value tags or entire metrics should be dropped as early as possible
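Dropping a low-value tag amounts to a roll-up: series that differed only in that tag collapse into one. The sketch below assumes summing is the right way to combine the collapsed series (reasonable for counts, not for every metric type), and the data layout and `drop_labels` helper are made up for illustration.

```python
from collections import defaultdict


def drop_labels(series, labels_to_drop):
    """Remove the given labels and sum the series that become identical.

    `series` maps a tuple of (label, value) pairs to a sample value.
    """
    rolled = defaultdict(float)
    for labels, value in series.items():
        kept = tuple(sorted((k, v) for k, v in labels
                            if k not in labels_to_drop))
        rolled[kept] += value
    return dict(rolled)


# Three series keyed by instance and service (hypothetical values).
requests = {
    (("instance", "a"), ("service", "api")): 3.0,
    (("instance", "b"), ("service", "api")): 4.0,
    (("instance", "a"), ("service", "web")): 1.0,
}
print(drop_labels(requests, {"instance"}))
```

The earlier in the pipeline this happens, the fewer series are shipped, ingested, and stored downstream.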

19. Cardinality
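Cardinality, taken here as the number of unique label combinations on a metric, can be bounded by multiplying the number of distinct values each label takes. The label names and counts below are assumptions chosen only to show the effect of one high-cardinality label.

```python
from math import prod


def max_cardinality(distinct_values_per_label):
    """Upper bound on series count: product of distinct values per label."""
    return prod(distinct_values_per_label.values())


def observed_cardinality(label_sets):
    """Count the unique label combinations actually seen."""
    return len({tuple(sorted(labels.items())) for labels in label_sets})


with_instance = {"service": 20, "status_code": 5, "instance": 100}
without_instance = {"service": 20, "status_code": 5}
print(max_cardinality(with_instance), max_cardinality(without_instance))

seen = [
    {"service": "api", "status_code": "200"},
    {"status_code": "200", "service": "api"},  # same combo, different order
    {"service": "api", "status_code": "500"},
]
print(observed_cardinality(seen))
```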

20. Auditing Your Metrics

21. “What is the value of this metric?” - You, when auditing metrics

22. Auditing Your Metrics
● Scope what your team is responsible for
  ○ filter queries with team:YOURS
● Identify easy wins. Metrics that aren’t
  ○ In a monitor definition
  ○ Directly queried by end users
  ○ Powering charts for visited dashboards
● Identify labels that are unnecessary
  ○ e.g. Prometheus instance label or instance_type
● Share your successes!
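The "easy wins" step can be thought of as a set difference: everything emitted, minus anything used by a monitor, a direct query, or a visited dashboard. This is a minimal sketch under the assumption that you can export those four lists of metric names from your own tooling. The function and the metric names are hypothetical.

```python
def unused_metrics(emitted, in_monitors, queried, on_dashboards):
    """Return emitted metric names that no monitor, query, or dashboard uses."""
    used = set(in_monitors) | set(queried) | set(on_dashboards)
    return sorted(set(emitted) - used)


# Hypothetical inventory for one team's services.
emitted = ["http_requests_total", "gc_pause_seconds", "cache_evictions_total",
           "build_info"]
candidates = unused_metrics(
    emitted,
    in_monitors=["http_requests_total"],
    queried=["gc_pause_seconds"],
    on_dashboards=["http_requests_total"],
)
print(candidates)
```

The result is a list of candidates to review, not a list to delete blindly: a metric can be unused today and still be the one needed during the next incident.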

24. CFO looking at the cost efficiency of metrics

25. Resources
- How Gloo uses the OTel Collector to drop metrics/labels and provide the Minimum Metrics Set
- How to drop and delete metrics in Prometheus
- How can recording and data roll-up rules help your metrics?
- Observability is Too Damn Expensive - DevOpsDays London

26. Catch up with me:
- Rescuing On-Call Engineers (send your manager)
- KubeCon OTel 101: Let’s Instrument! (tracing) workshop
- There’s No Place Like Production, Conf42 Incident Management
- paigerduty@ chronosphere.io hachyderm.io LinkedIn

27. Q&A
