SlideShare a Scribd company logo
1 of 41
Download to read offline
Distributed Monitoring and
Cloud Scaling for Web
Apps
Fernando Hönig
fernando@honig.com.ar
* Other names and brands may be claimed as the property of others.
About me
- From Córdoba, Argentina
- Operations Engineer / Linux Admin
- Working last 8 years in IT Companies
- Working in Intel IT since April 2011
* Other names and brands may be claimed as the property of others.
Third Party Vendors / Open Source
 This presentation will cover the solution achieved instead
of talking about third party vendors.
 All products used for this are open source.
Best Practices
 With this presentation I would like to show
processes, best practices and how to do it.
* Other names and brands may be claimed as the property of others.
Topics
- Problem Overview
- External Distributed Infrastructure
- Monitoring Architecture
- Cloud Scaling and Automatic monitoring
- Hostgroups and services association
- Nagios Event Brokers
- Dashboards
- Live Demo
- Q/A
* Other names and brands may be claimed as the property of others.
Purpose / Executive Summary
 Provide agility and rapid cycle time of development
 Infrastructure alignment with services demand
 Zero human interaction related to infrastructure setup and
application deployments cycles.
Business Objective
 Reduce 50% operative costs for current infrastructure
 Enable multi-geo applications
 Ensure 99,99% of availability for services
hosted under this architecture
* Other names and brands may be claimed as the property of others.
Why Distributed Monitoring
Infrastructure?
 More than 500 Services Checks per Customer
 Apps from Customer that needs to be reached from diff
GEOs
 Checks every 1 or 5 minutes
 Redundancy / Fast RecoveryWhy do we need a Centralized
Dashboard?
 Automatic Reporting for SLA metrics
 Fast and simple services/commands/hosts view.
 One single view for several regions / hostgroups
* Other names and brands may be claimed as the property of others.
Infrastructure Capabilities
 Solid Network Architecture
 VPN multi-geo secure connection
 Automated Monitoring
 Centralized logging for app services
Infrastructure Components
 Virtual Cloud Infrastructure
 Firewall rules and communication flow
 Public vs Private subnets
 Load Balancers
 DNS Failover
* Other names and brands may be claimed as the property of others.
Start Automation!
* Other names and brands may be claimed as the property of others.
Virtual Cloud Network Infrastructure
* Other names and brands may be claimed as the property of others.
Create VPN Tunnel!
* Other names and brands may be claimed as the property of others.
Virtual Cloud Network Infrastructure
* Other names and brands may be claimed as the property of others.
Virtual Cloud VPN Multi Geo – Floating
ENI
 Elastic Network Interface can be attached to an instance
with an specific private IP Address and a Public IP
Address.
 All subnets need to route traffic via that interface.
 In case of instance failure:
 Interface is detached from failing instance and attached to
the backup one.
 No changes need to be done in all routing tables
 Downtime is less than 5 mins.
* Other names and brands may be claimed as the property of others.
Virtual Cloud Network Infrastructure
* Other names and brands may be claimed as the property of others.
How it works?
* Other names and brands may be claimed as the property of others.
Cloud Formation + AWS cli
* Other names and brands may be claimed as the property of others.
Understanding the Monitoring
pieces
* Other names and brands may be claimed as the property of others.
External Distributed Infrastructure
* Other names and brands may be claimed as the property of others.
Cloud Monitoring Architecture
Hostgroups
Services
Contacts
Scripts
* Other names and brands may be claimed as the property of others.
Cloud Monitoring Architecture - Tools
MK Livestatus
 Opens a socket by which data can be retrieved on demand
 The socket allows you to send a request for hosts, services or other pieces of
data and get an immediate answer
 Scales fairly well to large installations, even beyond 50.000 services
RESTlos
 Is a generic Nagios API (it can be used with every core that understands the
nagios configuration syntax)
 Provides a RESTful api for generating any standard nagios
object, modify it or delete it
 Open Source code
* Other names and brands may be claimed as the property of others.
Cloud Monitoring Architecture - Tools
iwatch
 Written in Perl and based on inotify, a file change notification system, a kernel
feature that allows applications to request the monitoring of a set of files
against a list of events
 Can watch directory recursively
 Can execute command if an event occurs
Webinject
 Is a free tool for automated testing of web applications and
web services.
 It can be used to test individual system components that
have HTTP interfaces.
 Offers real-time results display and may also be used for
monitoring system response times
* Other names and brands may be claimed as the property of others.
Cloud Monitoring Architecture -
Integration
Mklive broker
RESTlos
Plugins
Webinject
iwatch
 Mklive for output data
 RESTlos for adding/removing hosts
 Webinject for Apps monitoring
 Iwatch for files changes
* Other names and brands may be claimed as the property of others.
Cloud Scaling and Automatic monitoring
 Create UserData for every instance based on the host-type (DB, WS,
App)
 [ADD] Use cURL to send a POST call to Nagios server thru RESTlos when server is
starting
 [DEL] Send a DELETE action with cURL when instance is shutting down
 [HOST-TYPE] Use variables to define what type of server are you adding
 [TOOLS] Add snmp and NRPE in your user-data info to install such software
to enable monitoring
* Other names and brands may be claimed as the property of others.
Cloud Scaling and Automatic monitoring
 [ADD] Use cURL to send a POST call to Nagios server thru RESTlos
when server is starting. Also you must save this in a startup script like
rc.local
"sed -i '$icurl -X POST -d @/etc/host-monitor -H "content-type:
application/json" http://admin:password@" ,{ "Ref" : "MonitInstanceIP" }
,"/restlos/host?host_name=new' /etc/rc.localn",[
{
"host_name": "HOSTNAME",
"use": "generic-host",
"alias": "HOSTNAME",
"address": "HOSTNAME",
"hostgroups": "HOSTGROUPS",
"_SNMPCOMMUNITY": "snmpcom",
"check_command": "check_ping!100.0,20%!500.0,60%",
"max_check_attempts": "3",
"check_interval": "5",
"retry_interval": "5",
"check_period": "24x7",
"notification_interval": "60",
"first_notification_delay": "1",
"notification_period": "24x7",
"notification_options": "d,u,r"
}
]
* Other names and brands may be claimed as the property of others.
Cloud Scaling and Automatic monitoring
 [DEL] Send a DELETE action with cURL when instance is shutting
down
 You need to create a script in /etc/rc0.d/ as follow:
"echo -e '#!/bin/bash' > /etc/rc0.d/K99host-monitorn",
"echo -e 'curl -X DELETE -H "content-type: application/json"
http://admin:password@" ,{ "Ref" : "MonitInstanceIP" }
,"/restlos/host?host_name=HOSTNAME' >> /etc/rc0.d/K99host-monitorn",
"chmod +x /etc/rc0.d/K99host-monitorn",
"HOST=$(hostname); sed -i "s/HOSTNAME/$HOST/g" /etc/rc0.d/K99host-monitorn"
* Other names and brands may be claimed as the property of others.
Cloud Scaling and Automatic monitoring
* Other names and brands may be claimed as the property of others.
iWatch Sync and Nagios files
administration
 For adding/removing hosts
 Every time you add or remove a host, that hostfile is
uploaded/removed in a central repository for backup
purposes.
 For new services
 If you have more than 1 nagios, this is perfect to
have all synced. No need to access to the linux
console for edit.
 For new hostgroups or servicegroups
 If you have a new type of server, just add it to
hostgroups.cfg and that file will be delivered across
all your nagios servers.
 For new contacts
* Other names and brands may be claimed as the property of others.
Hostgroups
A host group definition is used to group one or more hosts together for simplifying
configuration
You can put in a host configuration file as many hostgroups as you need for that
particular host.
* Other names and brands may be claimed as the property of others.
Hostgroups
* Other names and brands may be claimed as the property of others.
Hostgroups - Services Association
* Other names and brands may be claimed as the property of others.
Wrap up
* Other names and brands may be claimed as the property of others.
Get Monitoring data from
anywhere!
* Other names and brands may be claimed as the property of others.
Integration Dashboards
* Other names and brands may be claimed as the property of others.
Integration Dashboards
* Other names and brands may be claimed as the property of others.
SLA Reporting
* Other names and brands may be claimed as the property of others.
What was created?
* Other names and brands may be claimed as the property of others.
Demo Components created on the fly!
 2 isolated networks on US (east and west)
 Each one with a public subnet and a NAT instance
for outgoing traffic.
 IPSec tunnel configured between zones to
communicate securely and encrypted.
 2 independent monitoring systems
 Each network with their own scripts to install Nagios
+ MK Live on the fly during bootstrap process.
 2 dashboard systems, 1 single view
 Each one including both Nagios in their config and
showing same information.
 Both were bootstrapped with the scripts to autoinstall
on the fly and configure previous Nagios installed.
* Other names and brands may be claimed as the property of others.
Show some code!
* Other names and brands may be claimed as the property of others.
Live Demo!
* Other names and brands may be claimed as the property of others.
Q/A
* Other names and brands may be claimed as the property of others.
GitHub Repo
https://github.com/fernandohonig/osmc
* Other names and brands may be claimed as the property of others.
Vielen Dank und
auf Wiedersehen
Fernando Hönig
fernando@honig.com.ar
@fernandohonig
www.linkedin.com/in/fernandoh
onig

More Related Content

What's hot

Setting up your virtual infrastructure using fi-lab cloud
Setting up your virtual infrastructure using fi-lab cloudSetting up your virtual infrastructure using fi-lab cloud
Setting up your virtual infrastructure using fi-lab cloudFernando Lopez Aguilar
 
Fiware cloud capabilities_and_setting_up_your_environment
Fiware cloud capabilities_and_setting_up_your_environmentFiware cloud capabilities_and_setting_up_your_environment
Fiware cloud capabilities_and_setting_up_your_environmentMiguel García González
 
Intel- OpenStack Summit 2016/Red Hat NFV Mini Summit
Intel- OpenStack Summit 2016/Red Hat NFV Mini Summit Intel- OpenStack Summit 2016/Red Hat NFV Mini Summit
Intel- OpenStack Summit 2016/Red Hat NFV Mini Summit kimw001
 
3 Years of Puppet at Cisco: The Secrets to Our Success - PuppetConf 2013
3 Years of Puppet at Cisco: The Secrets to Our Success - PuppetConf 20133 Years of Puppet at Cisco: The Secrets to Our Success - PuppetConf 2013
3 Years of Puppet at Cisco: The Secrets to Our Success - PuppetConf 2013Puppet
 
Setting up your virtual infrastructure using FIWARE Lab Cloud
Setting up your virtual infrastructure using FIWARE Lab CloudSetting up your virtual infrastructure using FIWARE Lab Cloud
Setting up your virtual infrastructure using FIWARE Lab CloudFernando Lopez Aguilar
 
Cisco Automation with Puppet and onePK - PuppetConf 2013
Cisco Automation with Puppet and onePK - PuppetConf 2013Cisco Automation with Puppet and onePK - PuppetConf 2013
Cisco Automation with Puppet and onePK - PuppetConf 2013Puppet
 
Exploring the Final Frontier of Data Center Orchestration: Network Elements -...
Exploring the Final Frontier of Data Center Orchestration: Network Elements -...Exploring the Final Frontier of Data Center Orchestration: Network Elements -...
Exploring the Final Frontier of Data Center Orchestration: Network Elements -...Puppet
 
Implementing transparent proxy server with acl
Implementing transparent proxy server with aclImplementing transparent proxy server with acl
Implementing transparent proxy server with aclTakahiro Arai
 
VMworld 2013: Deploying vSphere with OpenStack: What It Means to Your Cloud E...
VMworld 2013: Deploying vSphere with OpenStack: What It Means to Your Cloud E...VMworld 2013: Deploying vSphere with OpenStack: What It Means to Your Cloud E...
VMworld 2013: Deploying vSphere with OpenStack: What It Means to Your Cloud E...VMworld
 
Storage Provisioning for Enterprise Information Applications
Storage Provisioning for Enterprise Information ApplicationsStorage Provisioning for Enterprise Information Applications
Storage Provisioning for Enterprise Information ApplicationsAmbareesh Kulkarni
 
Chapter 2 overview
Chapter 2 overviewChapter 2 overview
Chapter 2 overviewali raza
 
Chapter 8 overview
Chapter 8 overviewChapter 8 overview
Chapter 8 overviewali raza
 
Nagios Conference 2012 - Eric Loyd - Nagios Implementation Case Eastman Kodak...
Nagios Conference 2012 - Eric Loyd - Nagios Implementation Case Eastman Kodak...Nagios Conference 2012 - Eric Loyd - Nagios Implementation Case Eastman Kodak...
Nagios Conference 2012 - Eric Loyd - Nagios Implementation Case Eastman Kodak...Nagios
 
Advanced Container Security - AWS Summit Sydney 2018
Advanced Container Security - AWS Summit Sydney 2018Advanced Container Security - AWS Summit Sydney 2018
Advanced Container Security - AWS Summit Sydney 2018Amazon Web Services
 
How to deploy spark instance using ansible 2.0 in fiware lab v2
How to deploy spark instance using ansible 2.0 in fiware lab v2How to deploy spark instance using ansible 2.0 in fiware lab v2
How to deploy spark instance using ansible 2.0 in fiware lab v2Fernando Lopez Aguilar
 
[OpenStack Day in Korea 2015] Track 3-6 - Archiectural Overview of the Open S...
[OpenStack Day in Korea 2015] Track 3-6 - Archiectural Overview of the Open S...[OpenStack Day in Korea 2015] Track 3-6 - Archiectural Overview of the Open S...
[OpenStack Day in Korea 2015] Track 3-6 - Archiectural Overview of the Open S...OpenStack Korea Community
 
Oracle Solaris 11 platform for ECI Telecom private cloud infrastructure
Oracle Solaris 11 platform for ECI Telecom private cloud infrastructure Oracle Solaris 11 platform for ECI Telecom private cloud infrastructure
Oracle Solaris 11 platform for ECI Telecom private cloud infrastructure Orgad Kimchi
 
Cisco ios on cisco catalyst switches
Cisco ios on cisco catalyst switchesCisco ios on cisco catalyst switches
Cisco ios on cisco catalyst switchesIT Tech
 

What's hot (20)

FIWARE Lab
FIWARE LabFIWARE Lab
FIWARE Lab
 
Setting up your virtual infrastructure using fi-lab cloud
Setting up your virtual infrastructure using fi-lab cloudSetting up your virtual infrastructure using fi-lab cloud
Setting up your virtual infrastructure using fi-lab cloud
 
Fiware cloud capabilities_and_setting_up_your_environment
Fiware cloud capabilities_and_setting_up_your_environmentFiware cloud capabilities_and_setting_up_your_environment
Fiware cloud capabilities_and_setting_up_your_environment
 
Intel- OpenStack Summit 2016/Red Hat NFV Mini Summit
Intel- OpenStack Summit 2016/Red Hat NFV Mini Summit Intel- OpenStack Summit 2016/Red Hat NFV Mini Summit
Intel- OpenStack Summit 2016/Red Hat NFV Mini Summit
 
3 Years of Puppet at Cisco: The Secrets to Our Success - PuppetConf 2013
3 Years of Puppet at Cisco: The Secrets to Our Success - PuppetConf 20133 Years of Puppet at Cisco: The Secrets to Our Success - PuppetConf 2013
3 Years of Puppet at Cisco: The Secrets to Our Success - PuppetConf 2013
 
Setting up your virtual infrastructure using FIWARE Lab Cloud
Setting up your virtual infrastructure using FIWARE Lab CloudSetting up your virtual infrastructure using FIWARE Lab Cloud
Setting up your virtual infrastructure using FIWARE Lab Cloud
 
Cisco Automation with Puppet and onePK - PuppetConf 2013
Cisco Automation with Puppet and onePK - PuppetConf 2013Cisco Automation with Puppet and onePK - PuppetConf 2013
Cisco Automation with Puppet and onePK - PuppetConf 2013
 
Exploring the Final Frontier of Data Center Orchestration: Network Elements -...
Exploring the Final Frontier of Data Center Orchestration: Network Elements -...Exploring the Final Frontier of Data Center Orchestration: Network Elements -...
Exploring the Final Frontier of Data Center Orchestration: Network Elements -...
 
Implementing transparent proxy server with acl
Implementing transparent proxy server with aclImplementing transparent proxy server with acl
Implementing transparent proxy server with acl
 
VMworld 2013: Deploying vSphere with OpenStack: What It Means to Your Cloud E...
VMworld 2013: Deploying vSphere with OpenStack: What It Means to Your Cloud E...VMworld 2013: Deploying vSphere with OpenStack: What It Means to Your Cloud E...
VMworld 2013: Deploying vSphere with OpenStack: What It Means to Your Cloud E...
 
Storage Provisioning for Enterprise Information Applications
Storage Provisioning for Enterprise Information ApplicationsStorage Provisioning for Enterprise Information Applications
Storage Provisioning for Enterprise Information Applications
 
Chapter 2 overview
Chapter 2 overviewChapter 2 overview
Chapter 2 overview
 
Chapter 8 overview
Chapter 8 overviewChapter 8 overview
Chapter 8 overview
 
Nagios Conference 2012 - Eric Loyd - Nagios Implementation Case Eastman Kodak...
Nagios Conference 2012 - Eric Loyd - Nagios Implementation Case Eastman Kodak...Nagios Conference 2012 - Eric Loyd - Nagios Implementation Case Eastman Kodak...
Nagios Conference 2012 - Eric Loyd - Nagios Implementation Case Eastman Kodak...
 
Advanced Container Security - AWS Summit Sydney 2018
Advanced Container Security - AWS Summit Sydney 2018Advanced Container Security - AWS Summit Sydney 2018
Advanced Container Security - AWS Summit Sydney 2018
 
Simple docker hosting in FIWARE Lab
Simple docker hosting in FIWARE LabSimple docker hosting in FIWARE Lab
Simple docker hosting in FIWARE Lab
 
How to deploy spark instance using ansible 2.0 in fiware lab v2
How to deploy spark instance using ansible 2.0 in fiware lab v2How to deploy spark instance using ansible 2.0 in fiware lab v2
How to deploy spark instance using ansible 2.0 in fiware lab v2
 
[OpenStack Day in Korea 2015] Track 3-6 - Archiectural Overview of the Open S...
[OpenStack Day in Korea 2015] Track 3-6 - Archiectural Overview of the Open S...[OpenStack Day in Korea 2015] Track 3-6 - Archiectural Overview of the Open S...
[OpenStack Day in Korea 2015] Track 3-6 - Archiectural Overview of the Open S...
 
Oracle Solaris 11 platform for ECI Telecom private cloud infrastructure
Oracle Solaris 11 platform for ECI Telecom private cloud infrastructure Oracle Solaris 11 platform for ECI Telecom private cloud infrastructure
Oracle Solaris 11 platform for ECI Telecom private cloud infrastructure
 
Cisco ios on cisco catalyst switches
Cisco ios on cisco catalyst switchesCisco ios on cisco catalyst switches
Cisco ios on cisco catalyst switches
 

Similar to OSMC 2013 | Distributed Monitoring and Cloud Scaling for Web Apps by Fernando Hönig

NFF-GO (YANFF) - Yet Another Network Function Framework
NFF-GO (YANFF) - Yet Another Network Function FrameworkNFF-GO (YANFF) - Yet Another Network Function Framework
NFF-GO (YANFF) - Yet Another Network Function FrameworkMichelle Holley
 
Open Source for the 4th Industrial Revolution
Open Source for the 4th Industrial RevolutionOpen Source for the 4th Industrial Revolution
Open Source for the 4th Industrial RevolutionLiz Warner
 
Cloud Native Development
Cloud Native DevelopmentCloud Native Development
Cloud Native DevelopmentManuel Garcia
 
IBM Think Session 8598 Domino and JavaScript Development MasterClass
IBM Think Session 8598 Domino and JavaScript Development MasterClassIBM Think Session 8598 Domino and JavaScript Development MasterClass
IBM Think Session 8598 Domino and JavaScript Development MasterClassPaul Withers
 
New Roles In The Cloud
New Roles In The CloudNew Roles In The Cloud
New Roles In The CloudSteve Loughran
 
Enterprise Node - Securing Your Environment
Enterprise Node - Securing Your EnvironmentEnterprise Node - Securing Your Environment
Enterprise Node - Securing Your EnvironmentKurtis Kemple
 
F5 Meetup presentation automation 2017
F5 Meetup presentation automation 2017F5 Meetup presentation automation 2017
F5 Meetup presentation automation 2017Guy Brown
 
A_Z-_104_T0_0_A-EN_U-Power_Point_04.pptx
A_Z-_104_T0_0_A-EN_U-Power_Point_04.pptxA_Z-_104_T0_0_A-EN_U-Power_Point_04.pptx
A_Z-_104_T0_0_A-EN_U-Power_Point_04.pptxjayshuklatrainer
 
Securing Servers in Public and Hybrid Clouds
Securing Servers in Public and Hybrid CloudsSecuring Servers in Public and Hybrid Clouds
Securing Servers in Public and Hybrid CloudsRightScale
 
Introduction to PaaS and Heroku
Introduction to PaaS and HerokuIntroduction to PaaS and Heroku
Introduction to PaaS and HerokuTapio Rautonen
 
Kamil Kolodziejski_Structura-AWS.pptx
Kamil Kolodziejski_Structura-AWS.pptxKamil Kolodziejski_Structura-AWS.pptx
Kamil Kolodziejski_Structura-AWS.pptxAWS Chicago
 
Super-NetOps Source of Truth
Super-NetOps Source of TruthSuper-NetOps Source of Truth
Super-NetOps Source of TruthJoel W. King
 
Platform configuration on CloudHub 2.0 | MuleSoft Mysore Meetup #29
Platform configuration on CloudHub 2.0 | MuleSoft Mysore Meetup #29Platform configuration on CloudHub 2.0 | MuleSoft Mysore Meetup #29
Platform configuration on CloudHub 2.0 | MuleSoft Mysore Meetup #29MysoreMuleSoftMeetup
 
Build HA Asterisk on Microsoft Azure using DRBD/Heartbeat
Build HA Asterisk on Microsoft Azure using DRBD/HeartbeatBuild HA Asterisk on Microsoft Azure using DRBD/Heartbeat
Build HA Asterisk on Microsoft Azure using DRBD/HeartbeatSanjay Willie
 
OpenStack + Cloud Foundry for the OpenStack Boston Meetup
OpenStack + Cloud Foundry for the OpenStack Boston MeetupOpenStack + Cloud Foundry for the OpenStack Boston Meetup
OpenStack + Cloud Foundry for the OpenStack Boston Meetupragss
 
Automação do físico ao NetSecDevOps
Automação do físico ao NetSecDevOpsAutomação do físico ao NetSecDevOps
Automação do físico ao NetSecDevOpsRaul Leite
 
CISCO - Presentation at Hortonworks Booth - Strata 2014
CISCO - Presentation at Hortonworks Booth - Strata 2014CISCO - Presentation at Hortonworks Booth - Strata 2014
CISCO - Presentation at Hortonworks Booth - Strata 2014Hortonworks
 

Similar to OSMC 2013 | Distributed Monitoring and Cloud Scaling for Web Apps by Fernando Hönig (20)

NFF-GO (YANFF) - Yet Another Network Function Framework
NFF-GO (YANFF) - Yet Another Network Function FrameworkNFF-GO (YANFF) - Yet Another Network Function Framework
NFF-GO (YANFF) - Yet Another Network Function Framework
 
Automation Suite PPT (2).pptx
Automation Suite PPT (2).pptxAutomation Suite PPT (2).pptx
Automation Suite PPT (2).pptx
 
Open Source for the 4th Industrial Revolution
Open Source for the 4th Industrial RevolutionOpen Source for the 4th Industrial Revolution
Open Source for the 4th Industrial Revolution
 
Cloud Native Development
Cloud Native DevelopmentCloud Native Development
Cloud Native Development
 
IBM Think Session 8598 Domino and JavaScript Development MasterClass
IBM Think Session 8598 Domino and JavaScript Development MasterClassIBM Think Session 8598 Domino and JavaScript Development MasterClass
IBM Think Session 8598 Domino and JavaScript Development MasterClass
 
New Roles In The Cloud
New Roles In The CloudNew Roles In The Cloud
New Roles In The Cloud
 
Enterprise Node - Securing Your Environment
Enterprise Node - Securing Your EnvironmentEnterprise Node - Securing Your Environment
Enterprise Node - Securing Your Environment
 
F5 Meetup presentation automation 2017
F5 Meetup presentation automation 2017F5 Meetup presentation automation 2017
F5 Meetup presentation automation 2017
 
A_Z-_104_T0_0_A-EN_U-Power_Point_04.pptx
A_Z-_104_T0_0_A-EN_U-Power_Point_04.pptxA_Z-_104_T0_0_A-EN_U-Power_Point_04.pptx
A_Z-_104_T0_0_A-EN_U-Power_Point_04.pptx
 
Securing Servers in Public and Hybrid Clouds
Securing Servers in Public and Hybrid CloudsSecuring Servers in Public and Hybrid Clouds
Securing Servers in Public and Hybrid Clouds
 
Introduction to PaaS and Heroku
Introduction to PaaS and HerokuIntroduction to PaaS and Heroku
Introduction to PaaS and Heroku
 
Kamil Kolodziejski_Structura-AWS.pptx
Kamil Kolodziejski_Structura-AWS.pptxKamil Kolodziejski_Structura-AWS.pptx
Kamil Kolodziejski_Structura-AWS.pptx
 
citus™ iot ecosystem
citus™ iot ecosystemcitus™ iot ecosystem
citus™ iot ecosystem
 
Super-NetOps Source of Truth
Super-NetOps Source of TruthSuper-NetOps Source of Truth
Super-NetOps Source of Truth
 
Automation day red hat ansible
   Automation day red hat ansible    Automation day red hat ansible
Automation day red hat ansible
 
Platform configuration on CloudHub 2.0 | MuleSoft Mysore Meetup #29
Platform configuration on CloudHub 2.0 | MuleSoft Mysore Meetup #29Platform configuration on CloudHub 2.0 | MuleSoft Mysore Meetup #29
Platform configuration on CloudHub 2.0 | MuleSoft Mysore Meetup #29
 
Build HA Asterisk on Microsoft Azure using DRBD/Heartbeat
Build HA Asterisk on Microsoft Azure using DRBD/HeartbeatBuild HA Asterisk on Microsoft Azure using DRBD/Heartbeat
Build HA Asterisk on Microsoft Azure using DRBD/Heartbeat
 
OpenStack + Cloud Foundry for the OpenStack Boston Meetup
OpenStack + Cloud Foundry for the OpenStack Boston MeetupOpenStack + Cloud Foundry for the OpenStack Boston Meetup
OpenStack + Cloud Foundry for the OpenStack Boston Meetup
 
Automação do físico ao NetSecDevOps
Automação do físico ao NetSecDevOpsAutomação do físico ao NetSecDevOps
Automação do físico ao NetSecDevOps
 
CISCO - Presentation at Hortonworks Booth - Strata 2014
CISCO - Presentation at Hortonworks Booth - Strata 2014CISCO - Presentation at Hortonworks Booth - Strata 2014
CISCO - Presentation at Hortonworks Booth - Strata 2014
 

Recently uploaded

Adobe Marketo Engage Deep Dives: Using Webhooks to Transfer Data
Adobe Marketo Engage Deep Dives: Using Webhooks to Transfer DataAdobe Marketo Engage Deep Dives: Using Webhooks to Transfer Data
Adobe Marketo Engage Deep Dives: Using Webhooks to Transfer DataBradBedford3
 
The Essentials of Digital Experience Monitoring_ A Comprehensive Guide.pdf
The Essentials of Digital Experience Monitoring_ A Comprehensive Guide.pdfThe Essentials of Digital Experience Monitoring_ A Comprehensive Guide.pdf
The Essentials of Digital Experience Monitoring_ A Comprehensive Guide.pdfkalichargn70th171
 
Russian Call Girls in Karol Bagh Aasnvi ➡️ 8264348440 💋📞 Independent Escort S...
Russian Call Girls in Karol Bagh Aasnvi ➡️ 8264348440 💋📞 Independent Escort S...Russian Call Girls in Karol Bagh Aasnvi ➡️ 8264348440 💋📞 Independent Escort S...
Russian Call Girls in Karol Bagh Aasnvi ➡️ 8264348440 💋📞 Independent Escort S...soniya singh
 
The Real-World Challenges of Medical Device Cybersecurity- Mitigating Vulnera...
The Real-World Challenges of Medical Device Cybersecurity- Mitigating Vulnera...The Real-World Challenges of Medical Device Cybersecurity- Mitigating Vulnera...
The Real-World Challenges of Medical Device Cybersecurity- Mitigating Vulnera...ICS
 
Engage Usergroup 2024 - The Good The Bad_The Ugly
Engage Usergroup 2024 - The Good The Bad_The UglyEngage Usergroup 2024 - The Good The Bad_The Ugly
Engage Usergroup 2024 - The Good The Bad_The UglyFrank van der Linden
 
Project Based Learning (A.I).pptx detail explanation
Project Based Learning (A.I).pptx detail explanationProject Based Learning (A.I).pptx detail explanation
Project Based Learning (A.I).pptx detail explanationkaushalgiri8080
 
Alluxio Monthly Webinar | Cloud-Native Model Training on Distributed Data
Alluxio Monthly Webinar | Cloud-Native Model Training on Distributed DataAlluxio Monthly Webinar | Cloud-Native Model Training on Distributed Data
Alluxio Monthly Webinar | Cloud-Native Model Training on Distributed DataAlluxio, Inc.
 
The Evolution of Karaoke From Analog to App.pdf
The Evolution of Karaoke From Analog to App.pdfThe Evolution of Karaoke From Analog to App.pdf
The Evolution of Karaoke From Analog to App.pdfPower Karaoke
 
Professional Resume Template for Software Developers
Professional Resume Template for Software DevelopersProfessional Resume Template for Software Developers
Professional Resume Template for Software DevelopersVinodh Ram
 
EY_Graph Database Powered Sustainability
EY_Graph Database Powered SustainabilityEY_Graph Database Powered Sustainability
EY_Graph Database Powered SustainabilityNeo4j
 
Building Real-Time Data Pipelines: Stream & Batch Processing workshop Slide
Building Real-Time Data Pipelines: Stream & Batch Processing workshop SlideBuilding Real-Time Data Pipelines: Stream & Batch Processing workshop Slide
Building Real-Time Data Pipelines: Stream & Batch Processing workshop SlideChristina Lin
 
Hand gesture recognition PROJECT PPT.pptx
Hand gesture recognition PROJECT PPT.pptxHand gesture recognition PROJECT PPT.pptx
Hand gesture recognition PROJECT PPT.pptxbodapatigopi8531
 
cybersecurity notes for mca students for learning
cybersecurity notes for mca students for learningcybersecurity notes for mca students for learning
cybersecurity notes for mca students for learningVitsRangannavar
 
Asset Management Software - Infographic
Asset Management Software - InfographicAsset Management Software - Infographic
Asset Management Software - InfographicHr365.us smith
 
Salesforce Certified Field Service Consultant
Salesforce Certified Field Service ConsultantSalesforce Certified Field Service Consultant
Salesforce Certified Field Service ConsultantAxelRicardoTrocheRiq
 
5 Signs You Need a Fashion PLM Software.pdf
5 Signs You Need a Fashion PLM Software.pdf5 Signs You Need a Fashion PLM Software.pdf
5 Signs You Need a Fashion PLM Software.pdfWave PLM
 
Advancing Engineering with AI through the Next Generation of Strategic Projec...
Advancing Engineering with AI through the Next Generation of Strategic Projec...Advancing Engineering with AI through the Next Generation of Strategic Projec...
Advancing Engineering with AI through the Next Generation of Strategic Projec...OnePlan Solutions
 
Building a General PDE Solving Framework with Symbolic-Numeric Scientific Mac...
Building a General PDE Solving Framework with Symbolic-Numeric Scientific Mac...Building a General PDE Solving Framework with Symbolic-Numeric Scientific Mac...
Building a General PDE Solving Framework with Symbolic-Numeric Scientific Mac...stazi3110
 
why an Opensea Clone Script might be your perfect match.pdf
why an Opensea Clone Script might be your perfect match.pdfwhy an Opensea Clone Script might be your perfect match.pdf
why an Opensea Clone Script might be your perfect match.pdfjoe51371421
 
Der Spagat zwischen BIAS und FAIRNESS (2024)
Der Spagat zwischen BIAS und FAIRNESS (2024)Der Spagat zwischen BIAS und FAIRNESS (2024)
Der Spagat zwischen BIAS und FAIRNESS (2024)OPEN KNOWLEDGE GmbH
 

Recently uploaded (20)

Adobe Marketo Engage Deep Dives: Using Webhooks to Transfer Data
Adobe Marketo Engage Deep Dives: Using Webhooks to Transfer DataAdobe Marketo Engage Deep Dives: Using Webhooks to Transfer Data
Adobe Marketo Engage Deep Dives: Using Webhooks to Transfer Data
 
The Essentials of Digital Experience Monitoring_ A Comprehensive Guide.pdf
The Essentials of Digital Experience Monitoring_ A Comprehensive Guide.pdfThe Essentials of Digital Experience Monitoring_ A Comprehensive Guide.pdf
The Essentials of Digital Experience Monitoring_ A Comprehensive Guide.pdf
 
Russian Call Girls in Karol Bagh Aasnvi ➡️ 8264348440 💋📞 Independent Escort S...
Russian Call Girls in Karol Bagh Aasnvi ➡️ 8264348440 💋📞 Independent Escort S...Russian Call Girls in Karol Bagh Aasnvi ➡️ 8264348440 💋📞 Independent Escort S...
Russian Call Girls in Karol Bagh Aasnvi ➡️ 8264348440 💋📞 Independent Escort S...
 
The Real-World Challenges of Medical Device Cybersecurity- Mitigating Vulnera...
The Real-World Challenges of Medical Device Cybersecurity- Mitigating Vulnera...The Real-World Challenges of Medical Device Cybersecurity- Mitigating Vulnera...
The Real-World Challenges of Medical Device Cybersecurity- Mitigating Vulnera...
 
Engage Usergroup 2024 - The Good The Bad_The Ugly
Engage Usergroup 2024 - The Good The Bad_The UglyEngage Usergroup 2024 - The Good The Bad_The Ugly
Engage Usergroup 2024 - The Good The Bad_The Ugly
 
Project Based Learning (A.I).pptx detail explanation
Project Based Learning (A.I).pptx detail explanationProject Based Learning (A.I).pptx detail explanation
Project Based Learning (A.I).pptx detail explanation
 
Alluxio Monthly Webinar | Cloud-Native Model Training on Distributed Data
Alluxio Monthly Webinar | Cloud-Native Model Training on Distributed DataAlluxio Monthly Webinar | Cloud-Native Model Training on Distributed Data
Alluxio Monthly Webinar | Cloud-Native Model Training on Distributed Data
 
The Evolution of Karaoke From Analog to App.pdf
The Evolution of Karaoke From Analog to App.pdfThe Evolution of Karaoke From Analog to App.pdf
The Evolution of Karaoke From Analog to App.pdf
 
Professional Resume Template for Software Developers
Professional Resume Template for Software DevelopersProfessional Resume Template for Software Developers
Professional Resume Template for Software Developers
 
EY_Graph Database Powered Sustainability
EY_Graph Database Powered SustainabilityEY_Graph Database Powered Sustainability
EY_Graph Database Powered Sustainability
 
Building Real-Time Data Pipelines: Stream & Batch Processing workshop Slide
Building Real-Time Data Pipelines: Stream & Batch Processing workshop SlideBuilding Real-Time Data Pipelines: Stream & Batch Processing workshop Slide
Building Real-Time Data Pipelines: Stream & Batch Processing workshop Slide
 
Hand gesture recognition PROJECT PPT.pptx
Hand gesture recognition PROJECT PPT.pptxHand gesture recognition PROJECT PPT.pptx
Hand gesture recognition PROJECT PPT.pptx
 
cybersecurity notes for mca students for learning
cybersecurity notes for mca students for learningcybersecurity notes for mca students for learning
cybersecurity notes for mca students for learning
 
Asset Management Software - Infographic
Asset Management Software - InfographicAsset Management Software - Infographic
Asset Management Software - Infographic
 
Salesforce Certified Field Service Consultant
Salesforce Certified Field Service ConsultantSalesforce Certified Field Service Consultant
Salesforce Certified Field Service Consultant
 
5 Signs You Need a Fashion PLM Software.pdf
5 Signs You Need a Fashion PLM Software.pdf5 Signs You Need a Fashion PLM Software.pdf
5 Signs You Need a Fashion PLM Software.pdf
 
Advancing Engineering with AI through the Next Generation of Strategic Projec...
Advancing Engineering with AI through the Next Generation of Strategic Projec...Advancing Engineering with AI through the Next Generation of Strategic Projec...
Advancing Engineering with AI through the Next Generation of Strategic Projec...
 
Building a General PDE Solving Framework with Symbolic-Numeric Scientific Mac...
Building a General PDE Solving Framework with Symbolic-Numeric Scientific Mac...Building a General PDE Solving Framework with Symbolic-Numeric Scientific Mac...
Building a General PDE Solving Framework with Symbolic-Numeric Scientific Mac...
 
why an Opensea Clone Script might be your perfect match.pdf
why an Opensea Clone Script might be your perfect match.pdfwhy an Opensea Clone Script might be your perfect match.pdf
why an Opensea Clone Script might be your perfect match.pdf
 
Der Spagat zwischen BIAS und FAIRNESS (2024)
Der Spagat zwischen BIAS und FAIRNESS (2024)Der Spagat zwischen BIAS und FAIRNESS (2024)
Der Spagat zwischen BIAS und FAIRNESS (2024)
 

OSMC 2013 | Distributed Monitoring and Cloud Scaling for Web Apps by Fernando Hönig

  • 1. Distributed Monitoring and Cloud Scaling for Web Apps Fernando Hönig fernando@honig.com.ar
  • 2. * Other names and brands may be claimed as the property of others. About me - From Córdoba, Argentina - Operations Engineer / Linux Admin - Working last 8 years in IT Companies - Working in Intel IT since April 2011
  • 3. * Other names and brands may be claimed as the property of others. Third Party Vendors / Open Source  This presentation will cover the solution achieved instead of talking about third party vendors.  All products used for this are open source. Best Practices  With this presentation I would like to show processes, best practices and how to do it.
  • 4. * Other names and brands may be claimed as the property of others. Topics - Problem Overview - External Distributed Infrastructure - Monitoring Architecture - Cloud Scaling and Automatic monitoring - Hostgroups and services association - Nagios Event Brokers - Dashboards - Live Demo - Q/A
  • 5. * Other names and brands may be claimed as the property of others. Purpose / Executive Summary  Provide agility and rapid cycle time of development  Infrastructure alignment with services demand  Zero human interaction related to infrastructure setup and application deployments cycles. Business Objective  Reduce 50% operative costs for current infrastructure  Enable multi-geo applications  Ensure 99,99% of availability for services hosted under this architecture
  • 6. * Other names and brands may be claimed as the property of others. Why Distributed Monitoring Infrastructure?  More than 500 Services Checks per Customer  Apps from Customer that needs to be reached from diff GEOs  Checks every 1 or 5 minutes  Redundancy / Fast RecoveryWhy do we need a Centralized Dashboard?  Automatic Reporting for SLA metrics  Fast and simple services/commands/hosts view.  One single view for several regions / hostgroups
  • 7. * Other names and brands may be claimed as the property of others. Infrastructure Capabilities  Solid Network Architecture  VPN multi-geo secure connection  Automated Monitoring  Centralized logging for app services Infrastructure Components  Virtual Cloud Infrastructure  Firewall rules and communication flow  Public vs Private subnets  Load Balancers  DNS Failover
  • 8. * Other names and brands may be claimed as the property of others. Start Automation!
  • 9. * Other names and brands may be claimed as the property of others. Virtual Cloud Network Infrastructure
  • 10. * Other names and brands may be claimed as the property of others. Create VPN Tunnel!
  • 11. * Other names and brands may be claimed as the property of others. Virtual Cloud Network Infrastructure
  • 12. * Other names and brands may be claimed as the property of others. Virtual Cloud VPN Multi Geo – Floating ENI  Elastic Network Interface can be attached to an instance with an specific private IP Address and a Public IP Address.  All subnets need to route traffic via that interface.  In case of instance failure:  Interface is detached from failing instance and attached to the backup one.  No changes need to be done in all routing tables  Downtime is less than 5 mins.
  • 13. * Other names and brands may be claimed as the property of others. Virtual Cloud Network Infrastructure
  • 14. * Other names and brands may be claimed as the property of others. How it works?
  • 15. * Other names and brands may be claimed as the property of others. Cloud Formation + AWS cli
  • 16. * Other names and brands may be claimed as the property of others. Understanding the Monitoring pieces
  • 17. * Other names and brands may be claimed as the property of others. External Distributed Infrastructure
  • 18. * Other names and brands may be claimed as the property of others. Cloud Monitoring Architecture Hostgroups Services Contacts Scripts
  • 19. * Other names and brands may be claimed as the property of others. Cloud Monitoring Architecture - Tools MK Livestatus  Opens a socket by which data can be retrieved on demand  The socket allows you to send a request for hosts, services or other pieces of data and get an immediate answer  Scales fairly well to large installations, even beyond 50.000 services RESTlos  Is a generic Nagios API (it can be used with every core that understands the nagios configuration syntax)  Provides a RESTful api for generating any standard nagios object, modify it or delete it  Open Source code
  • 20. * Other names and brands may be claimed as the property of others. Cloud Monitoring Architecture - Tools iwatch  Written in Perl and based on inotify, a file change notification system, a kernel feature that allows applications to request the monitoring of a set of files against a list of events  Can watch directory recursively  Can execute command if an event occurs Webinject  Is a free tool for automated testing of web applications and web services.  It can be used to test individual system components that have HTTP interfaces.  Offers real-time results display and may also be used for monitoring system response times
  • 21. * Other names and brands may be claimed as the property of others. Cloud Monitoring Architecture - Integration Mklive broker RESTlos Plugins Webinject iwatch  Mklive for output data  RESTlos for adding/removing hosts  Webinject for Apps monitoring  Iwatch for files changes
  • 22. * Other names and brands may be claimed as the property of others. Cloud Scaling and Automatic monitoring  Create UserData for every instance based on the host-type (DB, WS, App)  [ADD] Use cURL to send a POST call to Nagios server thru RESTlos when server is starting  [DEL] Send a DELETE action with cURL when instance is shutting down  [HOST-TYPE] Use variables to define what type of server are you adding  [TOOLS] Add snmp and NRPE in your user-data info to install such software to enable monitoring
  • 23. * Other names and brands may be claimed as the property of others. Cloud Scaling and Automatic monitoring  [ADD] Use cURL to send a POST call to Nagios server thru RESTlos when server is starting. Also you must save this in a startup script like rc.local "sed -i '$icurl -X POST -d @/etc/host-monitor -H "content-type: application/json" http://admin:password@" ,{ "Ref" : "MonitInstanceIP" } ,"/restlos/host?host_name=new' /etc/rc.localn",[ { "host_name": "HOSTNAME", "use": "generic-host", "alias": "HOSTNAME", "address": "HOSTNAME", "hostgroups": "HOSTGROUPS", "_SNMPCOMMUNITY": "snmpcom", "check_command": "check_ping!100.0,20%!500.0,60%", "max_check_attempts": "3", "check_interval": "5", "retry_interval": "5", "check_period": "24x7", "notification_interval": "60", "first_notification_delay": "1", "notification_period": "24x7", "notification_options": "d,u,r" } ]
  • 24. * Other names and brands may be claimed as the property of others. Cloud Scaling and Automatic monitoring  [DEL] Send a DELETE action with cURL when instance is shutting down  You need to create a script in /etc/rc0.d/ as follow: "echo -e '#!/bin/bash' > /etc/rc0.d/K99host-monitorn", "echo -e 'curl -X DELETE -H "content-type: application/json" http://admin:password@" ,{ "Ref" : "MonitInstanceIP" } ,"/restlos/host?host_name=HOSTNAME' >> /etc/rc0.d/K99host-monitorn", "chmod +x /etc/rc0.d/K99host-monitorn", "HOST=$(hostname); sed -i "s/HOSTNAME/$HOST/g" /etc/rc0.d/K99host-monitorn"
  • 25. * Other names and brands may be claimed as the property of others. Cloud Scaling and Automatic monitoring
  • 26. * Other names and brands may be claimed as the property of others. iWatch Sync and Nagios files administration  For adding/removing hosts  Every time you add or remove a host, that hostfile is uploaded/removed in a central repository for backup purposes.  For new services  If you have more than 1 nagios, this is perfect to have all synced. No need to access to the linux console for edit.  For new hostgroups or servicegroups  If you have a new type of server, just add it to hostgroups.cfg and that file will be delivered across all your nagios servers.  For new contacts
  • 27. * Other names and brands may be claimed as the property of others. Hostgroups A host group definition is used to group one or more hosts together for simplifying configuration You can put in a host configuration file as many hostgroups as you need for that particular host.
  • 28. * Other names and brands may be claimed as the property of others. Hostgroups
  • 29. * Other names and brands may be claimed as the property of others. Hostgroups - Services Association
  • 30. * Other names and brands may be claimed as the property of others. Wrap up
  • 31. * Other names and brands may be claimed as the property of others. Get Monitoring data from anywhere!
  • 32. * Other names and brands may be claimed as the property of others. Integration Dashboards
  • 33. * Other names and brands may be claimed as the property of others. Integration Dashboards
  • 34. * Other names and brands may be claimed as the property of others. SLA Reporting
  • 35. * Other names and brands may be claimed as the property of others. What was created?
  • 36. * Other names and brands may be claimed as the property of others. Demo Components created on the fly!  2 isolated networks on US (east and west)  Each one with a public subnet and a NAT instance for outgoing traffic.  IPSec tunnel configured between zones to communicate securely and encrypted.  2 independent monitoring systems  Each network with their own scripts to install Nagios + MK Live on the fly during bootstrap process.  2 dashboard systems, 1 single view  Each one including both Nagios in their config and showing same information.  Both were bootstrapped with the scripts to autoinstall on the fly and configure previous Nagios installed.
  • 37. * Other names and brands may be claimed as the property of others. Show some code!
  • 38. * Other names and brands may be claimed as the property of others. Live Demo!
  • 39. * Other names and brands may be claimed as the property of others. Q/A
  • 40. * Other names and brands may be claimed as the property of others. GitHub Repo https://github.com/fernandohonig/osmc
  • 41. * Other names and brands may be claimed as the property of others. Vielen Dank und auf Wiedersehen Fernando Hönig fernando@honig.com.ar @fernandohonig www.linkedin.com/in/fernandoh onig