Tales of Linux micro-benchmarks

•

0 likes•449 views

The document discusses Linux micro-benchmarks, which measure small portions of a system's performance rather than modeling a real workload. While micro-benchmarks can be misused, they are valid when used correctly to understand bottlenecks by profiling workloads. Several case studies are presented, showing how micro-benchmarks can reveal unexpected behaviors when systems calls are measured, such as a benchmark that actually measured page faults in addition to forks. Micro-benchmarks are most useful when profiling to identify bottlenecks, when they are kept very simple, and when they have been thoroughly tested.

Software

Tales of Linux Micro-benchmarks
Matt Fleming
@fleming_matt

Agenda
- Background: What is a micro-benchmark?
- Why all the hate?
- Case Studies
- When are they useful?
@fleming_mattTales of Linux micro-benchmarks

Micro-what?
Benchmark: A program to measure the
performance of a system, usually for comparsion.
Models a real-life workload.
Micro-benchmark: A program to measure a (small
but important!) portion of a system.
Artificial/synthetic.
@fleming_mattTales of Linux micro-benchmarks

Why all the hate?
You need some understanding of OS and
runtime/toolchain.
Many do not actually test what the author
intended.
But this is simply a bug or user error, it doesn’t
invalidate the concept.
@fleming_mattTales of Linux micro-benchmarks

Why all the hate?
C: simple loop compiled with -O0 and -O2
for (i = 0; i < 1000000000; i++)
val = val * 2;
Time: 2.238s Time: 0.001s
@fleming_mattTales of Linux micro-benchmarks

Why all the hate?
C: simple loop compiled with -O0 and -O2
for (i = 0; i < 1000000000; i++)
val = val * 2;
movl $0x0,-0xc(%rbp)
jmp 2
1:
shlq -0x8(%rbp)
addl $0x1,-0xc(%rbp)
2:
cmpl $999999999,-0xc(%rbp)
jle 1
Time: 2.238s Time: 0.001s
@fleming_mattTales of Linux micro-benchmarks

Case study 1 - Siege
@fleming_mattTales of Linux micro-benchmarks

Case study 1 - Siege
5.62% [kernel] [k] task_cputime
3.33% [kernel] [k] osq_lock
2.58% [kernel] [k] thread_group_cputime
$ perf top
- task_cputime
- 97.35% thread_group_cputime
thread_group_cputime_adjusted
do_sys_times
sys_times
entry_SYSCALL_64_fastpath
@fleming_mattTales of Linux micro-benchmarks

Case study 2 - lmbench
Measures fork() + exit()
@fleming_mattTales of Linux micro-benchmarks

Case study 2 - lmbench
Measures fork() + exit()
Actually measures fork() + page fault + exit()
@fleming_mattTales of Linux micro-benchmarks

Case study 2 - lmbench
Measures fork() + exit()
Actually measures fork() + page fault + exit()
Faulting address
fault_around_bytes#PF
@fleming_mattTales of Linux micro-benchmarks

Case study 3 - hackbench
Message-passing micro-benchmark
Processes or threads
Pipes or sockets
@fleming_mattTales of Linux micro-benchmarks

Case study 3 - hackbench
Message-passing micro-benchmark
Processes or threads
Pipes or sockets
70%
@fleming_mattTales of Linux micro-benchmarks

Case study 4 - pipetest
@fleming_mattTales of Linux micro-benchmarks

When are they useful?
After profiling your workload and identifying
bottlenecks
When they’re super simple
When they’ve been tested
@fleming_mattTales of Linux micro-benchmarks

Questions?
@fleming_mattTales of Linux micro-benchmarks

Gunnar Morling is a software engineer and open-source enthusiast by heart. He is leading the Debezium project, a platform for change data capture (CDC). He is a Java Champion, the spec lead for Bean Validation 2.0 (JSR 380) and has founded multiple open source projects such as Deptective and MapStruct. Prior to joining Red Hat, Gunnar worked on a wide range of Java EE projects in the logistics and retail industries. He's based in Hamburg, Germany.

Tcp repair

Pavel Emelyanov

This document discusses TCP connection repair in Linux. It introduces the problem of relocating one end of a TCP connection to another machine. The solution presented uses a repair mode for TCP connections, which are represented as pairs of sockets. New socket options and syscalls are introduced to manipulate TCP-specific attributes like sequence numbers, timestamps, and queues. This allows the connections to be disassembled and reassembled. Items for future work include improving support for transitional states, out-of-band data, connection shutdown, and connection tracking.

Intro to open source telemetry linux con 2016

Matthew Broberg

Abstract As part of the team delivering Snap, an open telemetry framework, I've run through dozens of use cases where gathering disparate metrics from services can roll up into meaningful diagrams for operations engineers and developers alike. We will use Snap's plugin model to collect, process and publish these measurements into meaningful graphs using open source tools. By joining this session, you can follow along and install industry-standard open source projects, deploy them and then use Snap to collect, process and visualize these metrics. Audience Anyone with an operations-background (or future ahead of them) that wants to see the breadth of available open source tooling around telemetry. This proposal is designed for the hands-on user, who is comfortable running containers or virtual machines locally. Experience Level Intermediate Benefits to the Ecosystem By joining this session, you can follow along and install industry-standard open source projects, deploy them and then use Snap to collect, process and visualize these metrics. This empowers users within the Linux ecosystem to see their knowledge as powerful when visualized next to other layers of the datacenter.

Lightining Talk - Task queue and micro task queues in browser

Jitendra Kasaudhan

Apache Storm based Real Time Analytics for Recommending Trending Topics and S...

Humoyun Ahmedov

Flink Forward Berlin 2017: Matt Zimmer - Custom, Complex Windows at Scale Usi...

Flink Forward

The windowing capabilities offered by most stream processing engines are limited to aligned windows of a fixed duration. However, many real-world event processing use cases don’t fit this rigid structure, resulting in awkward processing pipelines. There haven’t been good alternatives, until recently that is. Apache Flink offers a rich Window API that supports implementing unaligned windows of varying duration. In this talk, Matt Zimmer will discuss using this API at Netflix to aggregate events into windows customized along varying definitions of a session. He will talk about implementation details such as: * Handling out-of-order events * Limiting state build-up while aggregating a subset of events from an event stream * Periodically emitting early results * Creating windows bounded by a type of event Attendees will leave this talk with practical techniques and knowledge to implement their own custom windows in Apache Flink.

BWB Meetup: Storm - distributed realtime computation system

Andrii Gakhov

Storm is a free and open source distributed real-time computation system. It is fault-tolerant, scalable, and guarantees data processing. Storm topologies can integrate data streams from multiple sources and languages, and run computations across computer clusters in a distributed manner. It is used by companies for applications like stream processing, distributed RPCs, and continuous computations.

This document provides an introduction and overview of MATLAB and its Control System Toolbox. It discusses transfer functions, representing systems using poles and zeros, multiplying transfer functions, finding closed-loop transfer functions, and converting between transfer function and state space representations. It also demonstrates how to use MATLAB to analyze and simulate linear time-invariant control systems, including finding step, impulse, and ramp responses.

Snap Telemetry Framework & Plugin Architecture at GrafanaCon 2016

Matthew Broberg

Case study: formal verification of the Brain Fuck Scheduler

Mengxuan Xia

What is Functional Programming?

Eric Normand

This document discusses the problems of complexity in software development and proposes functional programming as a solution. It outlines three main sources of complexity: time, state, and architecture. Functional programming aims to master these through an immutable data and stateless function model where functions take inputs and produce outputs without side effects. This reduces complexity by avoiding unintended interactions between different parts of a program over time and through state changes.

Debugging tricks you wish you knew Tamir Dresher - Odessa 2019

Tamir Dresher

My talk from the Odessa .NET User Group - http://www.usergroup.od.ua/2019/02/microsoft-net-user-group.html Source can be found here: https://github.com/tamirdresher/DebuggingTricks Do you know what developers do most of their day (except for surfing the internet)? Writing code? WRONG! They are debugging. The debugger is a powerful tool, but in this talk you'll learn tricks that will help find bugs in half the time and with less frustration. Because a happy developer is a productive developer. I'll show you to use tools that will point to you to right direction and features didn't know that are even there, for both development time debugging and post-mortem production analysis.

Performance improvement techniques for software distributed shared memory

ZongYing Lyu

Continuous Processing with Apache Flink - Strata London 2016

Stephan Ewen

Your data is in Prometheus, now what? (CurrencyFair Engineering Meetup, 2016)

Brian Brazil

Framingham Go Meetup - October 2016

Matthew Broberg

As part of the team delivering Snap, an open telemetry framework, I've run through dozens of use cases where gathering disparate metrics from services can roll up into meaningful diagrams for operations engineers and developers alike. I will introduce you to the concept of telemetry by talking through the basics then using Snap's plugin model to collect, process and publish these measurements into meaningful graphs using open source tools.

Communicating with Channels

jlongster2

The document discusses using channels for communication between components in React applications as an alternative to traditional event handling. Channels provide a way to compose asynchronous workflows and signals between components using a data flow-based approach with primitives like PUT, TAKE, and GO. This avoids mutating shared state and makes the data and event flows more explicit. The document provides examples of using channels for a simple list, tracking mouse movement, and implementing tooltips. It also discusses how the Flux architecture could be re-envisioned using channels.

Flink Forward Berlin 2017: Aljoscha Krettek - Talk Python to me: Stream Proce...

Flink Forward

Flink is a great stream processor, Python is a great programming language, Apache Beam is a great programming model and portability layer. Using all three together is a great idea! We will demo and discuss writing Beam Python pipelines and running them on Flink. We will cover Beam's portability vision that led here, what you need to know about how Beam Python pipelines are executed on Flink, and where Beam's portability framework is headed next (hint: Python pipelines reading from non-Python connectors)

HA with RelStorage and Postgres

Simone Deponti

Towards an Integration of the Actor Model in an FRP Language for Small-Scale ...

Takuo Watanabe

This paper presents an integration of the Actor model in Emfrp, a functional reactive programming language designed for resource constrained embedded systems. In this integration, actors not only express nodes that represent time-varying values, but also present communication mechanism. The integration provides a higher-level view of the internal representation of nodes, representations of time-varying values, as well as an actor-based inter-device communication mechanism.

What’s eating python performance

Piotr Przymus

Performance and how to measure it - ProgSCon London 2016

Matt Warren

Velocity 2012 - Learning WebOps the Hard Way

Cosimo Streppone

Working in Web Operations means dealing with production systems that in most cases needs to be operational 24×7x365. To reach 99.99999% uptime, you must fail as little as possible. This talk will go through a few real-world incidents and failures experienced by our small WebOps team, and outline what we are learning (the hard way), and how we’re trying to improve. What could possibly go wrong? :-)

The Art Of Performance Tuning

Jonathan Ross

Presented at JavaOne 2017 [CON4027], this presentation takes a practical, hands-on look at Java performance tuning. It discusses methodology (spoiler: it’s the scientific method) and how to apply it to Java SE systems (on any budget). Exploring concrete examples with tools such as the Oracle Java Mission Control feature of Oracle Java SE Advanced, VisualVM, YourKit, and JMH, the presentation focuses on ways of measuring performance, how to interpret data, ways of eliminating bottlenecks, and even how to avoid future performance regressions. A separate version will be uploaded with speaker notes.

Java Micro-Benchmarking

Constantine Nosovsky

- The document discusses best practices for micro-benchmarking in Java, including using frameworks like JMH that account for JVM warmup and avoid benchmark overhead. - It explains common pitfalls like dead code elimination and loop unrolling that can incorrectly optimize away the code being measured. - An example benchmark compares the performance of ArrayList and LinkedList iteration in different Java versions.

Performance is a Feature!

PostSharp Technologies

Starting with the premise that "Performance is a Feature", Matt Warren will show you how to measure, what to measure and how to get the best performance from your .NET code. We will look at real-world examples from the Roslyn code-base and StackOverflow (the product), including how the .NET Garbage Collector needs to be tamed! The presentation covers: Why we should care about performance Pitfalls to avoid when measuring performance How the .NET Garbage Collector can hurt performance Real-world performance lessons from open-source code The webinar recording can be found here: http://www.postsharp.net/blog/post/webinar-recording-performance-is-a-feature

May2010 hex-core-opt

Jeff Larkin

This document discusses porting, scaling, and optimizing applications on Cray XT systems. It covers topics such as choosing compilers, profiling and debugging applications at scale, understanding CPU affinity, and improvements in the Cray Message Passing Toolkit (MPT). The document provides guidance on leveraging different compilers, collecting performance data using hardware counters and CrayPAT, understanding MPI process binding, and enhancements in MPT 4.0 related to MPI standards support and communication optimizations.

Pragmatic model checking: from theory to implementations

Universität Rostock

Trends in Systems and How to Get Efficient Performance

inside-BigData.com

In this video from Switzerland HPC Conference, Martin Hilgeman from Dell presents: HPC Workload Efficiency and the Challenges for System Builders. "With all the advances in massively parallel and multi-core computing with CPUs and accelerators it is often overlooked whether the computational work is being done in an efficient manner. This efficiency is largely being determined at the application level and therefore puts the responsibility of sustaining a certain performance trajectory into the hands of the user. It is observed that the adoption rate of new hardware capabilities is decreasing and lead to a feeling of diminishing returns. This presentation shows the well-known laws of parallel performance from the perspective of a system builder. It also covers through the use of real case studies, examples of how to program for energy efficient parallel application performance." Watch the video: http://wp.me/p3RLHQ-gIS Learn more: http://dell.com and http://www.hpcadvisorycouncil.com/events/2017/swiss-workshop/agenda.php Sign up for our insideHPC Newsletter: http://insidehpc.com/newsletter

What's hot

Control System toolbox in Matlab

Abdul Sami

Snap Telemetry Framework & Plugin Architecture at GrafanaCon 2016

Matthew Broberg

Case study: formal verification of the Brain Fuck Scheduler

Mengxuan Xia

What is Functional Programming?

Eric Normand

Debugging tricks you wish you knew Tamir Dresher - Odessa 2019

Tamir Dresher

Performance improvement techniques for software distributed shared memory

ZongYing Lyu

Continuous Processing with Apache Flink - Strata London 2016

Stephan Ewen

Your data is in Prometheus, now what? (CurrencyFair Engineering Meetup, 2016)

Brian Brazil

Framingham Go Meetup - October 2016

Matthew Broberg

Communicating with Channels

jlongster2

Flink Forward Berlin 2017: Aljoscha Krettek - Talk Python to me: Stream Proce...

Flink Forward

HA with RelStorage and Postgres

Simone Deponti

Towards an Integration of the Actor Model in an FRP Language for Small-Scale ...

Takuo Watanabe

What's hot (13)

Control System toolbox in Matlab

Snap Telemetry Framework & Plugin Architecture at GrafanaCon 2016

Case study: formal verification of the Brain Fuck Scheduler

What is Functional Programming?

Debugging tricks you wish you knew Tamir Dresher - Odessa 2019

Performance improvement techniques for software distributed shared memory

Continuous Processing with Apache Flink - Strata London 2016

Your data is in Prometheus, now what? (CurrencyFair Engineering Meetup, 2016)

Framingham Go Meetup - October 2016

Communicating with Channels

Flink Forward Berlin 2017: Aljoscha Krettek - Talk Python to me: Stream Proce...

HA with RelStorage and Postgres

Towards an Integration of the Actor Model in an FRP Language for Small-Scale ...

Similar to Tales of Linux micro-benchmarks

What’s eating python performance

Piotr Przymus

Performance and how to measure it - ProgSCon London 2016

Matt Warren

Velocity 2012 - Learning WebOps the Hard Way

Cosimo Streppone

The Art Of Performance Tuning

Jonathan Ross

Java Micro-Benchmarking

Constantine Nosovsky

Performance is a Feature!

PostSharp Technologies

May2010 hex-core-opt

Jeff Larkin

Pragmatic model checking: from theory to implementations

Universität Rostock

Trends in Systems and How to Get Efficient Performance

inside-BigData.com

RTOS implementation

Rajan Kumar

The document discusses real-time operating systems for embedded systems. It describes that RTOS are necessary for systems with scheduling of multiple processes and devices. An RTOS kernel manages tasks, inter-task communication, memory allocation, timers and I/O devices. The document provides examples of creating tasks to blink an LED and print to USART ports, using a semaphore for synchronization between tasks. The tasks are run and output is seen on a Minicom terminal.

Lab6 rtos

indirakumar86

This document provides an introduction to using the RTX real-time operating system with the Keil uVision IDE. It describes how to create a basic RTX application with multiple tasks and use RTX functions for task scheduling and synchronization. The document gives an example 4-task application that blinks LEDs, increments counters, and displays text. It also discusses using semaphores, specifically mutexes, to control access to shared resources between tasks in a real-time system.

[COSCUP 2022] 腳踏多條船-利用 Coroutine在 Software Transactional Memory上進行動態排程

littleuniverse24

Matopt

Afaf Soumia Medjden

The document provides tips for improving the performance of MATLAB code. It discusses using the profiler to identify bottlenecks, preallocating arrays to avoid dynamic resizing overhead, and how the Just-In-Time accelerator can speed up loops and functions by avoiding interpretation. Preallocating arrays is shown to improve the speed of examples by over 3 times, and is beneficial for cases where the final array size may vary. The JIT accelerator most effectively accelerates code using supported data types, array shapes, and language elements within loops and conditionals.

Cgc2

Chong-Kuan Chen

The document discusses techniques from the DARPA Cyber Grand Challenge (CGC) and DEFCON CTF for developing automatic attack and defense systems, including fuzzing, symbolic/concolic execution, and software hardening. It provides an overview of the CGC competition format and challenges competitors to analyze binaries to discover vulnerabilities and generate exploits or patches. The competition was won by Team Mayhem from startup ForAllSecure, which utilized techniques like symbolic execution to analyze programs.

Analysis of Algorithms

Amna Saeed

These lecture notes cover algorithms and their analysis over 4 modules. Module I introduces algorithms, their properties, analysis of complexity and asymptotic notations. It covers analysis of sorting algorithms like merge sort, quicksort and binary search. Module II covers dynamic programming and greedy algorithms. Module III covers graph algorithms like BFS, DFS and minimum spanning trees. Module IV covers advanced topics like fast Fourier transform, string matching, NP-completeness and approximation algorithms.

Daa

Dhananjay Singh

These lecture notes cover the design and analysis of algorithms over 4 modules. Module I introduces algorithms, their characteristics, expectations and analysis. It discusses asymptotic analysis using big O, Ω and Θ notations to analyze the growth of algorithms like insertion sort, which has a worst case running time of Θ(n2). Subsequent modules cover dynamic programming, greedy algorithms, graphs, and NP-completeness. The notes provide an overview of key algorithm design and analysis topics.

Design & Analysis of Algorithms Lecture Notes

FellowBuddy.com

FellowBuddy.com is an innovative platform that brings students together to share notes, exam papers, study guides, project reports and presentation for upcoming exams. We connect Students who have an understanding of course material with Students who need help. Benefits:- # Students can catch up on notes they missed because of an absence. # Underachievers can find peer developed notes that break down lecture and study material in a way that they can understand # Students can earn better grades, save time and study effectively Our Vision & Mission – Simplifying Students Life Our Belief – “The great breakthrough in your life comes when you realize it, that you can learn anything you need to learn; to accomplish any goal that you have set for yourself. This means there are no limits on what you can be, have or do.” Like Us - https://www.facebook.com/FellowBuddycom

Python Programming - IX. On Randomness

Ranel Padon

Training ImageNet-1k ResNet50 in 15min pfn

Mila, Université de Montréal

This document summarizes research conducted by Preferred Networks to train a ResNet-50 model on the ImageNet dataset using a minibatch size of 32,000 across 1024 Tesla P100 GPUs. They were able to complete 90 training epochs in 15 minutes, achieving a validation accuracy of 74.9%. To enable training with such a large minibatch, they employed techniques like RMSprop warmup, slow start learning rates, and batch normalization without moving averages. The training was conducted on Preferred Networks' in-house cluster MN-1 which has 128 nodes each with 8 GPUs connected via InfiniBand FDR interconnect.

How to use mtr 2

Eduardo Narvaez

MTR is a network diagnostic tool that combines the functionality of traceroute and ping. It probes routers on the network path by sending packets and listening for responses to determine the quality of each hop. As it runs continuously, it tracks response times and packet loss to identify links that may be causing issues like increased latency or buffering. The MTR output provides statistics on each hop, including the hostname, packet loss percentage, and response times, to help locate potential problems along the route.

Similar to Tales of Linux micro-benchmarks (20)

What’s eating python performance

Performance and how to measure it - ProgSCon London 2016

Velocity 2012 - Learning WebOps the Hard Way

The Art Of Performance Tuning

Java Micro-Benchmarking

Performance is a Feature!

May2010 hex-core-opt

Pragmatic model checking: from theory to implementations

Trends in Systems and How to Get Efficient Performance

RTOS implementation

Lab6 rtos

[COSCUP 2022] 腳踏多條船-利用 Coroutine在 Software Transactional Memory上進行動態排程

Matopt

Cgc2

Analysis of Algorithms

Daa

Design & Analysis of Algorithms Lecture Notes

Python Programming - IX. On Randomness

Training ImageNet-1k ResNet50 in 15min pfn

How to use mtr 2

Recently uploaded

Energy consumption of Database Management - Florina Jonuzi

Green Software Development

Using Xen Hypervisor for Functional Safety

Ayan Halder

UI5con 2024 - Keynote: Latest News about UI5 and it’s Ecosystem

Peter Muessig

在线购买加拿大英属哥伦比亚大学毕业证本科学位证书原版一模一样

mz5nrf0n

原版一模一样【微信：741003700 】【加拿大英属哥伦比亚大学毕业证本科学位证书】【微信：741003700 】学位证，留信认证（真实可查，永久存档）offer、雅思、外壳等材料/诚信可靠,可直接看成品样本，帮您解决无法毕业带来的各种难题！外壳，原版制作，诚信可靠，可直接看成品样本。行业标杆！精益求精，诚心合作，真诚制作！多年品质 ,按需精细制作，24小时接单,全套进口原装设备。十五年致力于帮助留学生解决难题，包您满意。本公司拥有海外各大学样板无数，能完美还原海外各大学 Bachelor Diploma degree, Master Degree Diploma 1:1完美还原海外各大学毕业材料上的工艺：水印，阴影底纹，钢印LOGO烫金烫银，LOGO烫金烫银复合重叠。文字图案浮雕、激光镭射、紫外荧光、温感、复印防伪等防伪工艺。材料咨询办理、认证咨询办理请加学历顾问Q/微741003700 留信网认证的作用: 1:该专业认证可证明留学生真实身份 2:同时对留学生所学专业登记给予评定 3:国家专业人才认证中心颁发入库证书 4:这个认证书并且可以归档倒地方 5:凡事获得留信网入网的信息将会逐步更新到个人身份内，将在公安局网内查询个人身份证信息后，同步读取人才网入库信息 6:个人职称评审加20分 7:个人信誉贷款加10分 8:在国家人才网主办的国家网络招聘大会中纳入资料，供国家高端企业选择人才

Unveiling the Advantages of Agile Software Development.pdf

brainerhub1

J-Spring 2024 - Going serverless with Quarkus, GraalVM native images and AWS ...

Bert Jan Schrijver

UI5con 2024 - Boost Your Development Experience with UI5 Tooling Extensions

Peter Muessig

The UI5 tooling is the development and build tooling of UI5. It is built in a modular and extensible way so that it can be easily extended by your needs. This session will showcase various tooling extensions which can boost your development experience by far so that you can really work offline, transpile your code in your project to use even newer versions of EcmaScript (than 2022 which is supported right now by the UI5 tooling), consume any npm package of your choice in your project, using different kind of proxies, and even stitching UI5 projects during development together to mimic your target environment.

KuberTENes Birthday Bash Guadalajara - Introducción a Argo CD

rodomar2

Measures in SQL (SIGMOD 2024, Santiago, Chile)

Julian Hyde

SQL has attained widespread adoption, but Business Intelligence tools still use their own higher level languages based upon a multidimensional paradigm. Composable calculations are what is missing from SQL, and we propose a new kind of column, called a measure, that attaches a calculation to a table. Like regular tables, tables with measures are composable and closed when used in queries. SQL-with-measures has the power, conciseness and reusability of multidimensional languages but retains SQL semantics. Measure invocations can be expanded in place to simple, clear SQL. To define the evaluation semantics for measures, we introduce context-sensitive expressions (a way to evaluate multidimensional expressions that is consistent with existing SQL semantics), a concept called evaluation context, and several operations for setting and modifying the evaluation context. A talk at SIGMOD, June 9–15, 2024, Santiago, Chile Authors: Julian Hyde (Google) and John Fremlin (Google) https://doi.org/10.1145/3626246.3653374

Microservice Teams - How the cloud changes the way we work

Sven Peters

A lot of technical challenges and complexity come with building a cloud-native and distributed architecture. The way we develop backend software has fundamentally changed in the last ten years. Managing a microservices architecture demands a lot of us to ensure observability and operational resiliency. But did you also change the way you run your development teams? Sven will talk about Atlassian’s journey from a monolith to a multi-tenanted architecture and how it affected the way the engineering teams work. You will learn how we shifted to service ownership, moved to more autonomous teams (and its challenges), and established platform and enablement teams.

Need for Speed: Removing speed bumps from your Symfony projects ⚡️

Łukasz Chruściel

No one wants their application to drag like a car stuck in the slow lane! Yet it’s all too common to encounter bumpy, pothole-filled solutions that slow the speed of any application. Symfony apps are not an exception. In this talk, I will take you for a spin around the performance racetrack. We’ll explore common pitfalls - those hidden potholes on your application that can cause unexpected slowdowns. Learn how to spot these performance bumps early, and more importantly, how to navigate around them to keep your application running at top speed. We will focus in particular on tuning your engine at the application level, making the right adjustments to ensure that your system responds like a well-oiled, high-performance race car.

How Can Hiring A Mobile App Development Company Help Your Business Grow?

ToXSL Technologies

socradar-q1-2024-aviation-industry-report.pdf

SOCRadar

SOCRadar's Aviation Industry Q1 Incident Report is out now! The aviation industry has always been a prime target for cybercriminals due to its critical infrastructure and high stakes. In the first quarter of 2024, the sector faced an alarming surge in cybersecurity threats, revealing its vulnerabilities and the relentless sophistication of cyber attackers. SOCRadar’s Aviation Industry, Quarterly Incident Report, provides an in-depth analysis of these threats, detected and examined through our extensive monitoring of hacker forums, Telegram channels, and dark web platforms.

Everything You Need to Know About X-Sign: The eSign Functionality of XfilesPr...

XfilesPro

Transform Your Communication with Cloud-Based IVR Solutions

TheSMSPoint

Discover the power of Cloud-Based IVR Solutions to streamline communication processes. Embrace scalability and cost-efficiency while enhancing customer experiences with features like automated call routing and voice recognition. Accessible from anywhere, these solutions integrate seamlessly with existing systems, providing real-time analytics for continuous improvement. Revolutionize your communication strategy today with Cloud-Based IVR Solutions. Learn more at: https://thesmspoint.com/channel/cloud-telephony

316895207-SAP-Oil-and-Gas-Downstream-Training.pptx

ssuserad3af4

Lecture 2 - software testing SE 412.pptx

TaghreedAltamimi

Top Benefits of Using Salesforce Healthcare CRM for Patient Management.pdf

VALiNTRY360

Salesforce Healthcare CRM, implemented by VALiNTRY360, revolutionizes patient management by enhancing patient engagement, streamlining administrative processes, and improving care coordination. Its advanced analytics, robust security, and seamless integration with telehealth services ensure that healthcare providers can deliver personalized, efficient, and secure patient care. By automating routine tasks and providing actionable insights, Salesforce Healthcare CRM enables healthcare providers to focus on delivering high-quality care, leading to better patient outcomes and higher satisfaction. VALiNTRY360's expertise ensures a tailored solution that meets the unique needs of any healthcare practice, from small clinics to large hospital systems. For more info visit us https://valintry360.com/solutions/health-life-sciences

WWDC 2024 Keynote Review: For CocoaCoders Austin

Patrick Weigel

Top 9 Trends in Cybersecurity for 2024.pptx

devvsandy

Recently uploaded (20)

Energy consumption of Database Management - Florina Jonuzi

Using Xen Hypervisor for Functional Safety

UI5con 2024 - Keynote: Latest News about UI5 and it’s Ecosystem

在线购买加拿大英属哥伦比亚大学毕业证本科学位证书原版一模一样

Unveiling the Advantages of Agile Software Development.pdf

J-Spring 2024 - Going serverless with Quarkus, GraalVM native images and AWS ...

UI5con 2024 - Boost Your Development Experience with UI5 Tooling Extensions

KuberTENes Birthday Bash Guadalajara - Introducción a Argo CD

Measures in SQL (SIGMOD 2024, Santiago, Chile)

Microservice Teams - How the cloud changes the way we work

Need for Speed: Removing speed bumps from your Symfony projects ⚡️

How Can Hiring A Mobile App Development Company Help Your Business Grow?

socradar-q1-2024-aviation-industry-report.pdf

Everything You Need to Know About X-Sign: The eSign Functionality of XfilesPr...

Transform Your Communication with Cloud-Based IVR Solutions

316895207-SAP-Oil-and-Gas-Downstream-Training.pptx

Lecture 2 - software testing SE 412.pptx

Top Benefits of Using Salesforce Healthcare CRM for Patient Management.pdf

WWDC 2024 Keynote Review: For CocoaCoders Austin

Top 9 Trends in Cybersecurity for 2024.pptx

Tales of Linux micro-benchmarks

1. Tales of Linux Micro-benchmarks Matt Fleming @fleming_matt

2. Agenda - Background: What is a micro-benchmark? - Why all the hate? - Case Studies - When are they useful? @fleming_mattTales of Linux micro-benchmarks

3. Micro-what? Benchmark: A program to measure the performance of a system, usually for comparsion. Models a real-life workload. Micro-benchmark: A program to measure a (small but important!) portion of a system. Artificial/synthetic. @fleming_mattTales of Linux micro-benchmarks

4. Why all the hate? You need some understanding of OS and runtime/toolchain. Many do not actually test what the author intended. But this is simply a bug or user error, it doesn’t invalidate the concept. @fleming_mattTales of Linux micro-benchmarks

5. Why all the hate? C: simple loop compiled with -O0 and -O2 for (i = 0; i < 1000000000; i++) val = val * 2; Time: 2.238s Time: 0.001s @fleming_mattTales of Linux micro-benchmarks

6. Why all the hate? C: simple loop compiled with -O0 and -O2 for (i = 0; i < 1000000000; i++) val = val * 2; movl $0x0,-0xc(%rbp) jmp 2 1: shlq -0x8(%rbp) addl $0x1,-0xc(%rbp) 2: cmpl $999999999,-0xc(%rbp) jle 1 Time: 2.238s Time: 0.001s @fleming_mattTales of Linux micro-benchmarks

7. Why all the hate? C: simple loop compiled with -O0 and -O2 for (i = 0; i < 1000000000; i++) val = val * 2; movl $0x0,-0xc(%rbp) jmp 2 1: shlq -0x8(%rbp) addl $0x1,-0xc(%rbp) 2: cmpl $999999999,-0xc(%rbp) jle 1 Time: 2.238s Time: 0.001s @fleming_mattTales of Linux micro-benchmarks

8. Case study 1 - Siege @fleming_mattTales of Linux micro-benchmarks

9. Case study 1 - Siege 5.62% [kernel] [k] task_cputime 3.33% [kernel] [k] osq_lock 2.58% [kernel] [k] thread_group_cputime $ perf top - task_cputime - 97.35% thread_group_cputime thread_group_cputime_adjusted do_sys_times sys_times entry_SYSCALL_64_fastpath @fleming_mattTales of Linux micro-benchmarks

10. Case study 2 - lmbench Measures fork() + exit() @fleming_mattTales of Linux micro-benchmarks

11. Case study 2 - lmbench Measures fork() + exit() Actually measures fork() + page fault + exit() @fleming_mattTales of Linux micro-benchmarks

12. Case study 2 - lmbench Measures fork() + exit() Actually measures fork() + page fault + exit() Faulting address fault_around_bytes#PF @fleming_mattTales of Linux micro-benchmarks

13. Case study 3 - hackbench Message-passing micro-benchmark Processes or threads Pipes or sockets @fleming_mattTales of Linux micro-benchmarks

14. Case study 3 - hackbench Message-passing micro-benchmark Processes or threads Pipes or sockets 70% @fleming_mattTales of Linux micro-benchmarks

15. Case study 4 - pipetest @fleming_mattTales of Linux micro-benchmarks

16. Case study 4 - pipetest @fleming_mattTales of Linux micro-benchmarks

17. When are they useful? After profiling your workload and identifying bottlenecks When they’re super simple When they’ve been tested @fleming_mattTales of Linux micro-benchmarks

18. Questions? @fleming_mattTales of Linux micro-benchmarks

Tales of Linux micro-benchmarks

Recommended

Recommended

More Related Content

What's hot

What's hot (13)

Similar to Tales of Linux micro-benchmarks

Similar to Tales of Linux micro-benchmarks (20)

Recently uploaded

Recently uploaded (20)

Tales of Linux micro-benchmarks