@Server Density we organize our internal War Games were all engineers practice the processes involved in incident handling. We have seen how this improves the associated human factors, our processes and our tools.
Atmosphere 2016 - Jorge Salamero Sanz - HumanOps, the impact of human health ...PROIDEA
HumanOps is a set of principles which focus on the human aspects of running infrastructure.
It deliberately highlights the importance of the teams running systems, not just the systems themselves.
The health of your infrastructure is not just about hardware, software, automations and uptime - it also includes the health and wellbeing of your team.
The goal of HumanOps is to improve and maintain the good health of your team: easing communication, reducing fatigue and reducing stress.
Ryktene om TDDs død er sterkt overdrevet, men det har skjedd mye inn test-drevet utvikling siden Kent Beck introduserte TDD med Extreme Programming i 1999. TDD, BDD, ATDD, ST, DDT … hvordan passer alt dette sammen? Hva skal til for å lykkes med automatisert testing?
A developers' journey into building automated tests for IT from the ground upstefanorago
Having learned the benefits of applying TDD to application code, my team decided to apply the same technique to infrastructure code. There isn’t quite as much literature available, and we faced many challenges, but we ended up with a rewarding result: a continuous integration server that runs tests at several levels, including acceptance tests. The technology stack involves Chef, Test Kitchen, Vagrant, KVM, Serverspec and Jenkins. Although each one of these parties has at least a good level of maturity, integrating all the blocks has been the interesting part. We discuss how to put together virtualization solutions, configuration management and BDD specs to converge into a fully automated continuous integration system that can and, in our opinion, should become part of the workflow for DevOp teams.
Le 12 pratiche - Un introduzione a XP (Mini Italian Agile Day)Andrea Francia
Una introduzione base ad eXtreme Programming per i nuovi arrivati dell'Agile Day che vogliono cominciare a capire di cosa si tratta e vogliono sapere come viene implementata nelle aziende che la praticano. Alla fine della sessione i partecipanti si saranno fatti un idea di come si fa XP in concreto.
Slide della presentazione fatta al Mini Italian Agile Day di Vimercate - 11 febbraio 2017
@Server Density we organize our internal War Games were all engineers practice the processes involved in incident handling. We have seen how this improves the associated human factors, our processes and our tools.
Atmosphere 2016 - Jorge Salamero Sanz - HumanOps, the impact of human health ...PROIDEA
HumanOps is a set of principles which focus on the human aspects of running infrastructure.
It deliberately highlights the importance of the teams running systems, not just the systems themselves.
The health of your infrastructure is not just about hardware, software, automations and uptime - it also includes the health and wellbeing of your team.
The goal of HumanOps is to improve and maintain the good health of your team: easing communication, reducing fatigue and reducing stress.
Ryktene om TDDs død er sterkt overdrevet, men det har skjedd mye inn test-drevet utvikling siden Kent Beck introduserte TDD med Extreme Programming i 1999. TDD, BDD, ATDD, ST, DDT … hvordan passer alt dette sammen? Hva skal til for å lykkes med automatisert testing?
A developers' journey into building automated tests for IT from the ground upstefanorago
Having learned the benefits of applying TDD to application code, my team decided to apply the same technique to infrastructure code. There isn’t quite as much literature available, and we faced many challenges, but we ended up with a rewarding result: a continuous integration server that runs tests at several levels, including acceptance tests. The technology stack involves Chef, Test Kitchen, Vagrant, KVM, Serverspec and Jenkins. Although each one of these parties has at least a good level of maturity, integrating all the blocks has been the interesting part. We discuss how to put together virtualization solutions, configuration management and BDD specs to converge into a fully automated continuous integration system that can and, in our opinion, should become part of the workflow for DevOp teams.
Le 12 pratiche - Un introduzione a XP (Mini Italian Agile Day)Andrea Francia
Una introduzione base ad eXtreme Programming per i nuovi arrivati dell'Agile Day che vogliono cominciare a capire di cosa si tratta e vogliono sapere come viene implementata nelle aziende che la praticano. Alla fine della sessione i partecipanti si saranno fatti un idea di come si fa XP in concreto.
Slide della presentazione fatta al Mini Italian Agile Day di Vimercate - 11 febbraio 2017
Overview of Site Reliability Engineering (SRE) & best practicesAshutosh Agarwal
In any software organization, stability & innovation are always at loggerheads - the faster you move, the more things will break. This talk defines what SRE org looks like at high-tech organizations (Google, Uber).
Practical DevSecOps: Fundamentals of Successful ProgramsMatt Tesauro
From ONUG Fall 2022:
"Shift Left'' and automation have turned from ideals to meaningless buzzwords. Instead of riding the hype train, let's get real and cover practical and real-world examples taken from actual product security successes. Not every business is the same, neither will their DevSecOps program.
In this talk, I'll cover the fundamentals of common to successful DevSecOps programs as well as a grab bag of useful techniques to consider. These are lessons learned doing AppSec at a wide variety of companies including Rackspace, Pearson, a fortune 500 financial, Duo Security and Cognizant Healthcare. Bruce Lee said "Research your own experience. Absorb what is useful, reject what is useless, add what is essentially your own". The goal of this talk is to provide you with enough examples to build your own pragmatic and practical DevSecOps program or maybe absorb a new technique or two into your existing program.
Winston - Netflix's event driven auto remediation and diagnostics toolVinay Shah
This was a slide deck on Winston presented at a meetup on auto remediation and diagnostics: https://www.meetup.com/Auto-Remediation-and-Event-Driven-Automation/events/234628846/
Aleksej Šipulia - Retrospective – heart of scrumAgile Lietuva
Topic: Retrospective – heart of scrum
Scrum is simple agile framework, but sometimes it’s hard to implement it with some team. Scrum should be understood not as a collection artifacts and meeting, but as a philosophy of your daily work. You need to adopt new habits which improve your performance as a team and try to get rid of waste which slows you down. Retrospective is proper instrument to implement this type of thinking. I’ll talk on how to use this powerful tool and will provide some insights, tips and trick from my long experience being a Scrum Master.
About Aleksej: Aleksej is Experienced Software Developer and Agile Coach with a demonstrated history of working in
the financial services industry. Skilled in Java, Coaching, Agile Methodologies, Spring and other. Strong engineering professional and active leader of Java community in Vilnius
Linkedin: https://www.linkedin.com/in/aleksejsipulia/
Tired of doing upfront test script creation in your testing efforts? Feeling bad for demotivating your testers? Want something to replace this sickening approach to software testing? This presentation outlines why test scripts are not useful, and how test ideas are the new way forward to better testing. Coverage, traceability, reporting, automation and skills are all covered. Take a quick look and see if you can see there is another way to do software testing that is actually pure common sense.
Leading IT and DevOps teams are moving beyond simple alert escalation and oncall management to Incident Response Orchestration (IRO). IRO enables you to acceleration the identification of problems, the notification of the right people, and the facilitation of collaboration across all business units to resolve issues quickly. In this webinar, Berkay Mollamustafaoglu, CEO of OpsGenie, will discuss the challenges faced by modern operations teams and how IRO is empowering organizations to address incidents of any size - before they impact business.
How do you know what to monitor in your environment? Failure modes have become so complex that we need a cross-functional view of the system to identify what failure looks like. This talks walks through the FMEA process as applied to monitoring and metrics collection. The process will help you identify your failure points and the risks associated with a particular failure mode.
Digital Forensics & Incident Response Fundamentals.pdfChristopher Doman
Digital forensics and incident response (DFIR) are crucial areas in the field of cybersecurity, involving the identification, analysis, and response to security incidents. In this talk, we talk about the fundamentals of DFIR, including the key concepts and techniques used to investigate and respond to cyber attacks. We will explore the various phases of incident response, from initial detection/ triage to postincident analysis and remediation. We will also discuss the importance of preserving and analyzing digital evidence, as well as the legal and ethical considerations involved in conducting DFIR investigations. Whether you are a seasoned cybersecurity professional or just starting out in the field thetalk will provide valuable insights into the fundamentals of digital forensics and incident response.
When designing, building, and maintaining a computer system, one can ask the following questions: "Is my data safe from being accidentally deleted or corrupted? How do I ensure data integrity in the long term?"
The main goal of the presentation is to analyze several data integrity pitfalls and review recommended solutions, so that you can construct a data integrity strategy appropriate for your service and your DevOps team.
Exploratory testing is a big part of 'agile' but what exactly does it mean?
How does it differ to other approaches? How do we get value from it? How does the team benefit from it?
How can the whole team participate? What are some misconceptions?
Time, effectiveness and the value of Exploratory Testing can be lost if the team doesn't have a decent understanding of what it is. It is one of the most powerful learning tools your team has and the team can be losing out on the value of this.
During this interactive session lets learn about the tool so we can make the best use of it. And get that time, effectiveness and value back.
Automation is a big part of DevOps, but are you practicing DevOps if you do not automate every single thing? I'd like to talk about where automation fits into DevOps, how to go about scripting "all the things" and why sometimes it is not necessary. Personal story of how one Ruby script saved me and my friends a bunch of time that could be used to do things slightly more exciting than copy/pasting to spreadsheets. Plus my quick API 101 lesson...
What Your Tech Lead Thinks You Know (But Didn't Teach You)Chris Riccomini
Starting out as a new software engineer is daunting. There's so much to learn: semantic versioning, schema compatibility, tracing, working with legacy code, going on-call, having 1:1s, setting OKRs, and so much more. Dmitriy and Chris will discuss some tips to get you on your way.
Software development myths that block your careerPiotr Horzycki
During 15 years of my software development career, I was a victim of numerous myths and fads of the IT industry. "We must have Scrum", "Rewrite everything", Hype-Driven Development, 100% test coverage - just to name a few. You'll learn where do these myths come from, why they're wrong and what are the real-world, battle-tested alternatives. You can skyrocket your career just by focusing on the right things!
Overview of Site Reliability Engineering (SRE) & best practicesAshutosh Agarwal
In any software organization, stability & innovation are always at loggerheads - the faster you move, the more things will break. This talk defines what SRE org looks like at high-tech organizations (Google, Uber).
Practical DevSecOps: Fundamentals of Successful ProgramsMatt Tesauro
From ONUG Fall 2022:
"Shift Left'' and automation have turned from ideals to meaningless buzzwords. Instead of riding the hype train, let's get real and cover practical and real-world examples taken from actual product security successes. Not every business is the same, neither will their DevSecOps program.
In this talk, I'll cover the fundamentals of common to successful DevSecOps programs as well as a grab bag of useful techniques to consider. These are lessons learned doing AppSec at a wide variety of companies including Rackspace, Pearson, a fortune 500 financial, Duo Security and Cognizant Healthcare. Bruce Lee said "Research your own experience. Absorb what is useful, reject what is useless, add what is essentially your own". The goal of this talk is to provide you with enough examples to build your own pragmatic and practical DevSecOps program or maybe absorb a new technique or two into your existing program.
Winston - Netflix's event driven auto remediation and diagnostics toolVinay Shah
This was a slide deck on Winston presented at a meetup on auto remediation and diagnostics: https://www.meetup.com/Auto-Remediation-and-Event-Driven-Automation/events/234628846/
Aleksej Šipulia - Retrospective – heart of scrumAgile Lietuva
Topic: Retrospective – heart of scrum
Scrum is simple agile framework, but sometimes it’s hard to implement it with some team. Scrum should be understood not as a collection artifacts and meeting, but as a philosophy of your daily work. You need to adopt new habits which improve your performance as a team and try to get rid of waste which slows you down. Retrospective is proper instrument to implement this type of thinking. I’ll talk on how to use this powerful tool and will provide some insights, tips and trick from my long experience being a Scrum Master.
About Aleksej: Aleksej is Experienced Software Developer and Agile Coach with a demonstrated history of working in
the financial services industry. Skilled in Java, Coaching, Agile Methodologies, Spring and other. Strong engineering professional and active leader of Java community in Vilnius
Linkedin: https://www.linkedin.com/in/aleksejsipulia/
Tired of doing upfront test script creation in your testing efforts? Feeling bad for demotivating your testers? Want something to replace this sickening approach to software testing? This presentation outlines why test scripts are not useful, and how test ideas are the new way forward to better testing. Coverage, traceability, reporting, automation and skills are all covered. Take a quick look and see if you can see there is another way to do software testing that is actually pure common sense.
Leading IT and DevOps teams are moving beyond simple alert escalation and oncall management to Incident Response Orchestration (IRO). IRO enables you to acceleration the identification of problems, the notification of the right people, and the facilitation of collaboration across all business units to resolve issues quickly. In this webinar, Berkay Mollamustafaoglu, CEO of OpsGenie, will discuss the challenges faced by modern operations teams and how IRO is empowering organizations to address incidents of any size - before they impact business.
How do you know what to monitor in your environment? Failure modes have become so complex that we need a cross-functional view of the system to identify what failure looks like. This talks walks through the FMEA process as applied to monitoring and metrics collection. The process will help you identify your failure points and the risks associated with a particular failure mode.
Digital Forensics & Incident Response Fundamentals.pdfChristopher Doman
Digital forensics and incident response (DFIR) are crucial areas in the field of cybersecurity, involving the identification, analysis, and response to security incidents. In this talk, we talk about the fundamentals of DFIR, including the key concepts and techniques used to investigate and respond to cyber attacks. We will explore the various phases of incident response, from initial detection/ triage to postincident analysis and remediation. We will also discuss the importance of preserving and analyzing digital evidence, as well as the legal and ethical considerations involved in conducting DFIR investigations. Whether you are a seasoned cybersecurity professional or just starting out in the field thetalk will provide valuable insights into the fundamentals of digital forensics and incident response.
When designing, building, and maintaining a computer system, one can ask the following questions: "Is my data safe from being accidentally deleted or corrupted? How do I ensure data integrity in the long term?"
The main goal of the presentation is to analyze several data integrity pitfalls and review recommended solutions, so that you can construct a data integrity strategy appropriate for your service and your DevOps team.
Exploratory testing is a big part of 'agile' but what exactly does it mean?
How does it differ to other approaches? How do we get value from it? How does the team benefit from it?
How can the whole team participate? What are some misconceptions?
Time, effectiveness and the value of Exploratory Testing can be lost if the team doesn't have a decent understanding of what it is. It is one of the most powerful learning tools your team has and the team can be losing out on the value of this.
During this interactive session lets learn about the tool so we can make the best use of it. And get that time, effectiveness and value back.
Automation is a big part of DevOps, but are you practicing DevOps if you do not automate every single thing? I'd like to talk about where automation fits into DevOps, how to go about scripting "all the things" and why sometimes it is not necessary. Personal story of how one Ruby script saved me and my friends a bunch of time that could be used to do things slightly more exciting than copy/pasting to spreadsheets. Plus my quick API 101 lesson...
What Your Tech Lead Thinks You Know (But Didn't Teach You)Chris Riccomini
Starting out as a new software engineer is daunting. There's so much to learn: semantic versioning, schema compatibility, tracing, working with legacy code, going on-call, having 1:1s, setting OKRs, and so much more. Dmitriy and Chris will discuss some tips to get you on your way.
Software development myths that block your careerPiotr Horzycki
During 15 years of my software development career, I was a victim of numerous myths and fads of the IT industry. "We must have Scrum", "Rewrite everything", Hype-Driven Development, 100% test coverage - just to name a few. You'll learn where do these myths come from, why they're wrong and what are the real-world, battle-tested alternatives. You can skyrocket your career just by focusing on the right things!
Similar to Flight training for DevOps & HumanOps - IncontroDevOps 2016 (20)
Scaling humans - Ops teams and incident managementServer Density
100% uptime is impossible. Modern architectures are designed around failure but what does that mean for the human aspect of incident management? This talk considers how to prepare for outages, how to structure the response, and how those experiences and techniques differ for small and large companies.
Presented by David Mytton at dotScale Paris 2015-06-08
Containers seem to have suddenly become the hot new thing everyone is talking about, but what are they?
Why are they important?
How should you use them and what does it mean for cloud infrastructure? This talk will examine the history, technical details and strategy around containerisation from the perspective of developers and operations, consider internal container OSs like Rocket and Ubuntu Core as well as management layers like Docker and Apache Mesos and take a look at why cloud providers are launching their own services around them.
Presented by David Mytton at Datacloud Monaco 2015-06-04
Why Puppet? Why now? Can you get by without using any config management? You probably think don't have time, or that your project is too small. What can using Puppet really add? How can you justify investing time up front? Maybe you can just do it later?
Getting started with config management can often seem like a big project, especially if you only manage a few systems or have a small team. This talk will examine why you should use Puppet from the beginning. It will examine what you can do with Puppet that couldn't do otherwise, how much time it will save and why it's especially important if you think your project has even the smallest chance of scaling in the future.
Presented by David Mytton at Puppet Camp London 2015-04-13
Infrastructure choices - cloud vs colo vs bare metalServer Density
Everyone deploy to the cloud! Unlimited scaling and the best pricing! Or is it? This talk will examine different deployment strategies and how scaling differs between them. It will consider cloud environments, bare metal and building out your own equipment in a colo facility. And it will look at real examples of interesting approaches and war stories from the major infrastructure providers including AWS, Google Cloud and Softlayer.
Presented by David Mytton at CloudConf Turin 2015-03-19
The customer lifecycle - from visitor to customer. Techniques for driving traffic, trials, nurturing, conversion, success monitoring and handling churn.
Presented by David Mytton at Startup Camp Berlin 2015-03-13.
DevOps Incident Handling - Making friends not enemies.Server Density
David Mytton CEO of Server Density presented this talk to the DevOps Meetup in London. It takes you through how to handle DevOps incidents, outages and downtime -- and more specifically how to make friends, not enemies in the process.
Joined by Rick Nelson, Technical Solutions architect from NGINX Server Density take you though the do's and don'ts of monitoring NGINX. Critical and non critical metrics to monitor, important alerts to configure and the best monitoring tools available.
David Mytton is a MongoDB master and the founder of Server Density. In this presentation David delves deeper into what's discussed in our how to monitor MongoDB tutorial (https://blog.serverdensity.com/monitor-mongodb/), with the aim of taking you through:
Key MongoDB metrics to monitor.
Non-critical MongoDB metrics to monitor.
Alerts to set for MongoDB on production.
Tools for monitoring MongoDB.
Puppet can be used as a control panel to perform a wide variety of tasks within your infrastructure. It can be used to trigger failover between hot standby servers or entire data centres. It can be used to deploy package updates across large clusters. It can be used to deploy code to staging and then onto production, ensuring the right versions are present. It can be used to replicate production environments locally for all our engineers. And it can even be used for config management!
This talk will take a hands on technical look at how we use Puppet to achieve all these things. It will include code samples, hints and tips and explain how the flexibility of Puppet can be used to really control your entire infrastructure.
This talk was given by David Mytton at PuppetCamp 2013: Berlin, London and Munich.
NoSQL databases are often touted for their performance and whilst it's true that they usually offer great performance out of the box, it still really depends on how you deploy your infrastructure. Dedicated vs cloud? In memory vs on disk? Spindal vs SSD? Replication lag. Multi data centre deployment.
This talk considers all the infrastructure requirements of a successful high performance infrastructure with hints and tips that can be applied to any NoSQL technology. It includes things like OS tweaks, disk benchmarks, replication, monitoring and backups.
Remote startup - building a company from everywhere in the worldServer Density
The likes of Automattic (Wordpress) and 37signals promote remote working as a way to hire the best talent, regardless of location. Programming can be done from anywhere and saving office costs is a great way to bootstrap. This talk looks at the advantages and disadvantages, tools and methodologies for building a remote company.
This talk was given by David Mytton at Digital Shoreditch London 2013.
NoSQL databases are often touted for their performance and whilst it's true that they usually offer great performance out of the box, it still really depends on how you deploy your infrastructure. Dedicated vs cloud? In memory vs on disk? Spindal vs SSD? Replication lag. Multi data centre deployment.
This talk considers all the infrastructure requirements of a successful high performance infrastructure with hints and tips that can be applied to any NoSQL technology. It includes things like OS tweaks, disk benchmarks, replication, monitoring and backups.
Presented at NoSQL Roadshow Berlin 2013 by David Mytton.
StartOps: Growing an ops team from 1 founderServer Density
Bootstrapped startups don't have the luxury of a full team of ops engineers available to respond to issues 24/7, so how can you survive on your own? This talk will tell the story of how to run your infrastructure as a single founder through to growing that into a team of on call engineers. It will include some interesting war stories as well as tips and suggestions for how to run ops at a startup.
Presented at DevOpsDays London 2013 by David Mytton.
MongoDB: Optimising for Performance, Scale & AnalyticsServer Density
MongoDB is easy to download and run locally but requires some thought and further understanding when deploying to production. At scale, schema design, indexes and query patterns really matter. So does data structure on disk, sharding, replication and data centre awareness. This talk will examine these factors in the context of analytics, and more generally, to help you optimise MongoDB for any scale.
Presented at MongoDB Days London 2013 by David Mytton.
Italy Agriculture Equipment Market Outlook to 2027harveenkaur52
Agriculture and Animal Care
Ken Research has an expertise in Agriculture and Animal Care sector and offer vast collection of information related to all major aspects such as Agriculture equipment, Crop Protection, Seed, Agriculture Chemical, Fertilizers, Protected Cultivators, Palm Oil, Hybrid Seed, Animal Feed additives and many more.
Our continuous study and findings in agriculture sector provide better insights to companies dealing with related product and services, government and agriculture associations, researchers and students to well understand the present and expected scenario.
Our Animal care category provides solutions on Animal Healthcare and related products and services, including, animal feed additives, vaccination
Instagram has become one of the most popular social media platforms, allowing people to share photos, videos, and stories with their followers. Sometimes, though, you might want to view someone's story without them knowing.
APNIC Foundation, presented by Ellisha Heppner at the PNG DNS Forum 2024APNIC
Ellisha Heppner, Grant Management Lead, presented an update on APNIC Foundation to the PNG DNS Forum held from 6 to 10 May, 2024 in Port Moresby, Papua New Guinea.
Gen Z and the marketplaces - let's translate their needsLaura Szabó
The product workshop focused on exploring the requirements of Generation Z in relation to marketplace dynamics. We delved into their specific needs, examined the specifics in their shopping preferences, and analyzed their preferred methods for accessing information and making purchases within a marketplace. Through the study of real-life cases , we tried to gain valuable insights into enhancing the marketplace experience for Generation Z.
The workshop was held on the DMA Conference in Vienna June 2024.
Bridging the Digital Gap Brad Spiegel Macon, GA Initiative.pptxBrad Spiegel Macon GA
Brad Spiegel Macon GA’s journey exemplifies the profound impact that one individual can have on their community. Through his unwavering dedication to digital inclusion, he’s not only bridging the gap in Macon but also setting an example for others to follow.
4. ● Humans are part of any system
● Initial design, ongoing improvements
● Maintenance
● Upgrades
● Issues, Incident response
Humans in DevOps
5. ● System issues = error rates + SLA + ...
● Human issues = alerts out of hours + interruptions + .
● System issues = Human issues
Human issues = system issues
6. ● System health impacts human health
● Human health impacts system health
Humans impact systems
7. ● Downtime = loss of users, reputation, revenue
● Downtime caused by unreliable systems
● Unhealthy teams reduce reliability
● Unhealthy teams = loss of users, reputation, revenue
Humans impact business
8. ● Slip
● Lapse
● Mistake
● Violation
● (Always, again, again)
Human risk
9. ● Prepare and practice
● Respond
● Postmortem
Expect downtime
11. ● Power failure to half of our servers
● Automated failover unavailable
(known failure condition)
● Manual DNS switch required
● Expected impact: 20 min
● Actual impact: 43min
Incident example
16. ● First responder, acknowledge alert
● Load incident response checklist
● Log into #ops-war-room in Slack
● Log incident into JIRA
● Begin investigation
General response process
18. ● The “limits of human memory and
attention”
○ Complexity
○ Stress and fatigue
○ Ego
● Pilots, doctors, divers:
Bruce Willis Ruins All Films
(BCD, weights, releases, air, final)
Pre-flight checklists
19. 1. Extended use of checklists
2. Not to follow blindly, use knowledge
and experience
3. Independent system
4. Searchable
5. List of known issues and
documented workarounds/fixes
Documented procedures
20. ● Replica environment
● or mock command line
● Record actions and timing
● Multiple failures
● Unexpected results
Realistic scenarios: War Games