An introduction to Stacki-the fastest bare metal Linux server provisioning tool from the Stacki Atlanta kickoff meetup on 2/23/16 at the Microsoft Innovation Center. Greg Bruno is the VP Engineering at StackIQ.
Salesforce at Stacki Atlanta Meetup February 2016StackIQ
Dave Peterson's presentation on how Salesforce uses Stacki and Chef to provision and manage thousands of servers. Stacki Atlanta kickoff Meetup on 2/23/16 at the Microsoft Innovation Center. Dave is a Lead Systems Engineer at Salesforce.
The document discusses setting up a Hadoop cluster with CentOS 6.5 installed on multiple physical servers. It describes the process of installing CentOS via USB, configuring basic OS settings like hostname, users, SSH, firewall. It also covers configuring network settings, Java installation and enabling passwordless SSH login. The document concludes with taking server snapshots for backup/recovery and installing Hadoop services like HDFS, Hive etc using Cloudera Express on the cluster.
Red Hat Enterprise Linux OpenStack Platform on Inktank Ceph EnterpriseRed_Hat_Storage
This document summarizes performance testing of OpenStack with Cinder volumes on Ceph storage. It tested scaling performance with increasing instance counts on a 4-node and 8-node Ceph cluster. Key findings include:
- Large file sequential write performance peaked with a single instance per server due to data striping across OSDs. Read performance peaked at 32 instances per server.
- Large file random I/O performance scaled linearly with increasing instances up to the maximum tested (512 instances).
- Small file operations showed good scaling up to 32 instances per server for creates and reads, but lower performance for renames and deletes.
- Performance tuning like tuned profiles, device readahead, and Ceph journal configuration improved both
StackiFest16: What's Next in Stacki - Mason Katz StackIQ
1. Stacki aims to simplify server provisioning and management by keeping servers under a certain threshold and staying out of the way with tools for deployment, networking, and storage configuration.
2. Upcoming releases will focus on improvements to networking including IPv6 support, a new API, simplifying carts into pallet images, and expanding remote management capabilities to additional hardware.
3. Stacki supports multiple Linux distributions and aims to expand support for operating systems like Ubuntu while encouraging community development and contributions.
[Open Infrastructure & Cloud Native Days Korea 2019]
커뮤니티 버전의 OpenStack 과 Ceph를 활용하여 대고객서비스를 구축한 사례를 공유합니다. 유연성을 확보한 기업용 클라우드 서비스 구축 사례와 높은 수준의 보안을 요구하는 거래소 서비스를 구축, 운영한 사례를 소개합니다. 또한 이 프로젝트에 사용된 기술 스택 및 장애 해결사례와 최적화 방안을 소개합니다. 오픈스택은 역시 오픈소스컨설팅입니다.
#openstack #ceph #openinfraday #cloudnative #opensourceconsulting
Salesforce at Stacki Atlanta Meetup February 2016StackIQ
Dave Peterson's presentation on how Salesforce uses Stacki and Chef to provision and manage thousands of servers. Stacki Atlanta kickoff Meetup on 2/23/16 at the Microsoft Innovation Center. Dave is a Lead Systems Engineer at Salesforce.
The document discusses setting up a Hadoop cluster with CentOS 6.5 installed on multiple physical servers. It describes the process of installing CentOS via USB, configuring basic OS settings like hostname, users, SSH, firewall. It also covers configuring network settings, Java installation and enabling passwordless SSH login. The document concludes with taking server snapshots for backup/recovery and installing Hadoop services like HDFS, Hive etc using Cloudera Express on the cluster.
Red Hat Enterprise Linux OpenStack Platform on Inktank Ceph EnterpriseRed_Hat_Storage
This document summarizes performance testing of OpenStack with Cinder volumes on Ceph storage. It tested scaling performance with increasing instance counts on a 4-node and 8-node Ceph cluster. Key findings include:
- Large file sequential write performance peaked with a single instance per server due to data striping across OSDs. Read performance peaked at 32 instances per server.
- Large file random I/O performance scaled linearly with increasing instances up to the maximum tested (512 instances).
- Small file operations showed good scaling up to 32 instances per server for creates and reads, but lower performance for renames and deletes.
- Performance tuning like tuned profiles, device readahead, and Ceph journal configuration improved both
StackiFest16: What's Next in Stacki - Mason Katz StackIQ
1. Stacki aims to simplify server provisioning and management by keeping servers under a certain threshold and staying out of the way with tools for deployment, networking, and storage configuration.
2. Upcoming releases will focus on improvements to networking including IPv6 support, a new API, simplifying carts into pallet images, and expanding remote management capabilities to additional hardware.
3. Stacki supports multiple Linux distributions and aims to expand support for operating systems like Ubuntu while encouraging community development and contributions.
[Open Infrastructure & Cloud Native Days Korea 2019]
커뮤니티 버전의 OpenStack 과 Ceph를 활용하여 대고객서비스를 구축한 사례를 공유합니다. 유연성을 확보한 기업용 클라우드 서비스 구축 사례와 높은 수준의 보안을 요구하는 거래소 서비스를 구축, 운영한 사례를 소개합니다. 또한 이 프로젝트에 사용된 기술 스택 및 장애 해결사례와 최적화 방안을 소개합니다. 오픈스택은 역시 오픈소스컨설팅입니다.
#openstack #ceph #openinfraday #cloudnative #opensourceconsulting
Out of the Box Replication in Postgres 9.4(pgconfsf)Denish Patel
Denish Patel gave a presentation on PostgreSQL replication. He began by introducing himself and his background. He then discussed PostgreSQL write-ahead logging (WAL), replication history, and how replication is currently setup. The presentation covered replication slots, demoing replication without external tools using pg_basebackup, streaming replication with slots, and pg_receivexlog. Patel also discussed monitoring replication and answered questions from the audience.
The document provides requirements and sample exam questions for the Red Hat Certified Engineer (RHCE) EX294 exam. It outlines 18 exam questions to test Ansible skills. Key requirements include setting up 5 virtual machines, one as the Ansible control node and 4 managed nodes. The questions cover tasks like Ansible installation, ad-hoc commands, playbooks, roles, vaults and more. Detailed solutions are provided for each question/task.
Out of the box replication in postgres 9.4(pg confus)Denish Patel
This document contains notes from a presentation on PostgreSQL replication. It discusses write-ahead logs (WAL), replication history in PostgreSQL from versions 7.0 to 9.4, how to set up basic replication, tools for backups and monitoring replication, and demonstrates setting up replication without third party tools using pg_basebackup, replication slots, and pg_receivexlog. It also includes contact information for the presenter and an invitation to join the PostgreSQL Slack channel.
Red Hat Certified Engineer (RHCE) EX294 Exam QuestionsStudy Material
Do you want to succeed in attaining Red Hat Certified Engineer (RHCE) in one shot? Dumpspedia can do that for you. It’s no joke! We have fantastic set of several RedHat Practice Test Questions Answers to choose from. All of them extracted directly from Red Hat Certified Engineer (RHCE) exam for Red Hat Enterprise Linux 8 Test Questions. EX294 Test Questions are verified and authentic with possibilities highest as they come to be on your actual exam. We put your satisfaction on top while making a perfect collection of valid EX294 Practice Questions. Join us on our website to have a better insight.
https://www.dumpspedia.com/EX294-dumps-questions.html
greenplum installation guide - 4 node VM seungdon Choi
This document provides instructions for installing Pivotal Greenplum Database on virtual machines. It outlines installing the software on the master and segment nodes, configuring the operating system prerequisites, initializing the Greenplum system, and validating the installation. Key steps include running the Greenplum installer, configuring SSH and installing the software on all nodes, creating data directories, and initializing the Greenplum database.
Cassandra Summit 2014: Lesser Known Features of Cassandra 2.1DataStax Academy
This document summarizes some lesser known features in Apache Cassandra 2.1, including:
1) Cassandra's logging was changed to use Logback, allowing for faster and more configurable logging through a logback.xml file.
2) New default paths were added in Cassandra 2.1 for data, commit logs, and configurations to keep directories cleaner.
3) A number of command line parameters and YAML configuration options were added for more control over logging levels, commit log handling, compaction settings, and more.
4) Enhancements were made to the CQL shell cqlsh and nodetool for additional debugging and management capabilities.
To know more, Register for Online Hadoop Training at WizIQ.
Click here : http://www.wiziq.com/course/21308-hadoop-big-data-training
A complete guide to Hadoop Installation that will help you when ever you face problems while installing Hadoop !!
1. The document describes how to set up a PXE kickstart server to automatically install CentOS 5.3 over the network using DHCP and TFTP. It provides instructions for configuring the TFTP, DHCP and NFS servers, creating a kickstart installation tree and kickstart files.
2. Additional clients can be added by modifying the DHCP configuration to assign them a static IP, creating a customized kickstart file, and PXE booting the client to initiate the network installation.
3. Example configurations are provided for the /etc/dhcpd.conf DHCP configuration file and a sample kickstart file.
This presentation provides an overview of the Dell PowerEdge R730xd server performance results with Red Hat Ceph Storage. It covers the advantages of using Red Hat Ceph Storage on Dell servers with their proven hardware components that provide high scalability, enhanced ROI cost benefits, and support of unstructured data.
Transforming the Ceph Integration Tests with OpenStack Ceph Community
This document discusses transforming Ceph tests to use OpenStack. It describes running unit tests locally but running integration tests on OpenStack instances. Developers can now run integration tests on their own OpenStack tenant without waiting for resources. Specifying resources for the OpenStack machines makes the tests more self-service. Future improvements include better multi-cloud support and making archival and setup more convenient.
Open Source Backup Conference 2014: Workshop bareos introduction, by Philipp ...NETWAYS
It gives an introduction to the architecture of Bareos, and how the components of Bareos interact. The configuration of Bareos will be discussed and the main Bareos features will be shown. As a practical part of the workshop the adaption of the preconfigured standard backup scheme to the attendees’ wishes will be developed.
Attendees are kindly asked to contribute configuration tasks that they want to have solved.
This document outlines the steps to install Oracle Grid Infrastructure and configure an Oracle Real Application Clusters (RAC) database with iSCSI high availability on two nodes. It describes pre-requisite tasks like setting up repositories, installing Oracle Grid and database packages, configuring users, directories and environment variables. Specific steps covered include bonding network interfaces, configuring the hosts file, setting swap space and installing Oracle Grid software.
Nagios Conference 2012 - Dan Wittenberg - Case Study: Scaling Nagios Core at ...Nagios
Dan Wittenberg's presentation on using Nagios at a Fortune 50 Company
The presentation was given during the Nagios World Conference North America held Sept 25-28th, 2012 in Saint Paul, MN. For more information on the conference (including photos and videos), visit: http://go.nagios.com/nwcna
Troubleshooting Complex Oracle Performance Problems with Tanel PoderTanel Poder
The document describes troubleshooting a performance issue involving parallel data loads into a data warehouse. It is determined that the slowness is due to recursive locking and buffer busy waits occurring during inserts into the SEG$ table as new segments are created by parallel CREATE TABLE AS SELECT statements. This is causing a nested locking ping-pong effect between the cache, transaction, and I/O layers as sessions repeatedly acquire and release locks and buffers.
图文详解安装Net backup 6.5备份恢复oracle 10g rac 数据库maclean liu
This document describes how to install and configure NetBackup 6.5 to backup an Oracle 10g RAC database. It discusses installing NetBackup server software on a NAS host, defining storage units and backup policies, installing NetBackup client software and the Oracle agent on RAC nodes, and linking the Oracle homes. It also provides an example of using RMAN to backup the control file and archive logs to the NetBackup server.
1. The document provides steps for installing and configuring Oracle Grid Control 10.2.0.5 on Oracle Enterprise Linux 5 (OEL5), beginning with downloading required software packages, modifying configuration files, and installing Grid Control 10.2.0.1 in silent mode without configuring the OMS.
2. It then describes installing the Grid Control 10.2.0.5 patch set, and modifying response files for installation. Finally, it outlines configuring the OMS using a Perl script and monitoring the configuration process in log files.
3. Key steps include preparing the system, installing Grid Control 10.2.0.1, applying the 10.2.0.5 patch set,
Out of the box replication in postgres 9.4Denish Patel
This document provides an overview of setting up out of the box replication in PostgreSQL 9.4 without third party tools. It discusses write-ahead logs (WAL), replication slots, pg_basebackup, and pg_receivexlog. The document then demonstrates setting up replication on VMs with pg_basebackup to initialize a standby server, configuration of primary and standby servers, and monitoring of replication.
This document discusses Checkpoint/Restore In Userspace (CRIU), a tool for live migration of processes and containers. CRIU works by dumping the memory, file descriptors, and other process state of a running process, then restoring it elsewhere. This allows live migration of processes between systems for purposes like load balancing, maintenance, and high performance computing. The document provides details on how CRIU works, its uses cases, limitations, and how to install and use it on Red Hat Enterprise Linux 7.
El documento describe la estructura básica de una página web que incluye etiquetas HTML como <html>, <head>, <title>, <body> para definir el título y contenido. También menciona el nombre de archivo y color de fondo de la página usando códigos hexadecimales.
El documento habla sobre comentarios y etiquetas HTML como <br> para saltos de línea, <pre> para texto preformateado, y <hr> para separadores horizontales. Explica cómo insertar comentarios entre <!-- y --> y cómo usar <br> en lugar de saltos de línea para asegurar que se muestren en el navegador. También cubre cómo <pre> mantiene los espacios en blanco y saltos de línea, y cómo <hr> inserta una regla horizontal sin etiqueta de cierre.
Out of the Box Replication in Postgres 9.4(pgconfsf)Denish Patel
Denish Patel gave a presentation on PostgreSQL replication. He began by introducing himself and his background. He then discussed PostgreSQL write-ahead logging (WAL), replication history, and how replication is currently setup. The presentation covered replication slots, demoing replication without external tools using pg_basebackup, streaming replication with slots, and pg_receivexlog. Patel also discussed monitoring replication and answered questions from the audience.
The document provides requirements and sample exam questions for the Red Hat Certified Engineer (RHCE) EX294 exam. It outlines 18 exam questions to test Ansible skills. Key requirements include setting up 5 virtual machines, one as the Ansible control node and 4 managed nodes. The questions cover tasks like Ansible installation, ad-hoc commands, playbooks, roles, vaults and more. Detailed solutions are provided for each question/task.
Out of the box replication in postgres 9.4(pg confus)Denish Patel
This document contains notes from a presentation on PostgreSQL replication. It discusses write-ahead logs (WAL), replication history in PostgreSQL from versions 7.0 to 9.4, how to set up basic replication, tools for backups and monitoring replication, and demonstrates setting up replication without third party tools using pg_basebackup, replication slots, and pg_receivexlog. It also includes contact information for the presenter and an invitation to join the PostgreSQL Slack channel.
Red Hat Certified Engineer (RHCE) EX294 Exam QuestionsStudy Material
Do you want to succeed in attaining Red Hat Certified Engineer (RHCE) in one shot? Dumpspedia can do that for you. It’s no joke! We have fantastic set of several RedHat Practice Test Questions Answers to choose from. All of them extracted directly from Red Hat Certified Engineer (RHCE) exam for Red Hat Enterprise Linux 8 Test Questions. EX294 Test Questions are verified and authentic with possibilities highest as they come to be on your actual exam. We put your satisfaction on top while making a perfect collection of valid EX294 Practice Questions. Join us on our website to have a better insight.
https://www.dumpspedia.com/EX294-dumps-questions.html
greenplum installation guide - 4 node VM seungdon Choi
This document provides instructions for installing Pivotal Greenplum Database on virtual machines. It outlines installing the software on the master and segment nodes, configuring the operating system prerequisites, initializing the Greenplum system, and validating the installation. Key steps include running the Greenplum installer, configuring SSH and installing the software on all nodes, creating data directories, and initializing the Greenplum database.
Cassandra Summit 2014: Lesser Known Features of Cassandra 2.1DataStax Academy
This document summarizes some lesser known features in Apache Cassandra 2.1, including:
1) Cassandra's logging was changed to use Logback, allowing for faster and more configurable logging through a logback.xml file.
2) New default paths were added in Cassandra 2.1 for data, commit logs, and configurations to keep directories cleaner.
3) A number of command line parameters and YAML configuration options were added for more control over logging levels, commit log handling, compaction settings, and more.
4) Enhancements were made to the CQL shell cqlsh and nodetool for additional debugging and management capabilities.
To know more, Register for Online Hadoop Training at WizIQ.
Click here : http://www.wiziq.com/course/21308-hadoop-big-data-training
A complete guide to Hadoop Installation that will help you when ever you face problems while installing Hadoop !!
1. The document describes how to set up a PXE kickstart server to automatically install CentOS 5.3 over the network using DHCP and TFTP. It provides instructions for configuring the TFTP, DHCP and NFS servers, creating a kickstart installation tree and kickstart files.
2. Additional clients can be added by modifying the DHCP configuration to assign them a static IP, creating a customized kickstart file, and PXE booting the client to initiate the network installation.
3. Example configurations are provided for the /etc/dhcpd.conf DHCP configuration file and a sample kickstart file.
This presentation provides an overview of the Dell PowerEdge R730xd server performance results with Red Hat Ceph Storage. It covers the advantages of using Red Hat Ceph Storage on Dell servers with their proven hardware components that provide high scalability, enhanced ROI cost benefits, and support of unstructured data.
Transforming the Ceph Integration Tests with OpenStack Ceph Community
This document discusses transforming Ceph tests to use OpenStack. It describes running unit tests locally but running integration tests on OpenStack instances. Developers can now run integration tests on their own OpenStack tenant without waiting for resources. Specifying resources for the OpenStack machines makes the tests more self-service. Future improvements include better multi-cloud support and making archival and setup more convenient.
Open Source Backup Conference 2014: Workshop bareos introduction, by Philipp ...NETWAYS
It gives an introduction to the architecture of Bareos, and how the components of Bareos interact. The configuration of Bareos will be discussed and the main Bareos features will be shown. As a practical part of the workshop the adaption of the preconfigured standard backup scheme to the attendees’ wishes will be developed.
Attendees are kindly asked to contribute configuration tasks that they want to have solved.
This document outlines the steps to install Oracle Grid Infrastructure and configure an Oracle Real Application Clusters (RAC) database with iSCSI high availability on two nodes. It describes pre-requisite tasks like setting up repositories, installing Oracle Grid and database packages, configuring users, directories and environment variables. Specific steps covered include bonding network interfaces, configuring the hosts file, setting swap space and installing Oracle Grid software.
Nagios Conference 2012 - Dan Wittenberg - Case Study: Scaling Nagios Core at ...Nagios
Dan Wittenberg's presentation on using Nagios at a Fortune 50 Company
The presentation was given during the Nagios World Conference North America held Sept 25-28th, 2012 in Saint Paul, MN. For more information on the conference (including photos and videos), visit: http://go.nagios.com/nwcna
Troubleshooting Complex Oracle Performance Problems with Tanel PoderTanel Poder
The document describes troubleshooting a performance issue involving parallel data loads into a data warehouse. It is determined that the slowness is due to recursive locking and buffer busy waits occurring during inserts into the SEG$ table as new segments are created by parallel CREATE TABLE AS SELECT statements. This is causing a nested locking ping-pong effect between the cache, transaction, and I/O layers as sessions repeatedly acquire and release locks and buffers.
图文详解安装Net backup 6.5备份恢复oracle 10g rac 数据库maclean liu
This document describes how to install and configure NetBackup 6.5 to backup an Oracle 10g RAC database. It discusses installing NetBackup server software on a NAS host, defining storage units and backup policies, installing NetBackup client software and the Oracle agent on RAC nodes, and linking the Oracle homes. It also provides an example of using RMAN to backup the control file and archive logs to the NetBackup server.
1. The document provides steps for installing and configuring Oracle Grid Control 10.2.0.5 on Oracle Enterprise Linux 5 (OEL5), beginning with downloading required software packages, modifying configuration files, and installing Grid Control 10.2.0.1 in silent mode without configuring the OMS.
2. It then describes installing the Grid Control 10.2.0.5 patch set, and modifying response files for installation. Finally, it outlines configuring the OMS using a Perl script and monitoring the configuration process in log files.
3. Key steps include preparing the system, installing Grid Control 10.2.0.1, applying the 10.2.0.5 patch set,
Out of the box replication in postgres 9.4Denish Patel
This document provides an overview of setting up out of the box replication in PostgreSQL 9.4 without third party tools. It discusses write-ahead logs (WAL), replication slots, pg_basebackup, and pg_receivexlog. The document then demonstrates setting up replication on VMs with pg_basebackup to initialize a standby server, configuration of primary and standby servers, and monitoring of replication.
This document discusses Checkpoint/Restore In Userspace (CRIU), a tool for live migration of processes and containers. CRIU works by dumping the memory, file descriptors, and other process state of a running process, then restoring it elsewhere. This allows live migration of processes between systems for purposes like load balancing, maintenance, and high performance computing. The document provides details on how CRIU works, its uses cases, limitations, and how to install and use it on Red Hat Enterprise Linux 7.
El documento describe la estructura básica de una página web que incluye etiquetas HTML como <html>, <head>, <title>, <body> para definir el título y contenido. También menciona el nombre de archivo y color de fondo de la página usando códigos hexadecimales.
El documento habla sobre comentarios y etiquetas HTML como <br> para saltos de línea, <pre> para texto preformateado, y <hr> para separadores horizontales. Explica cómo insertar comentarios entre <!-- y --> y cómo usar <br> en lugar de saltos de línea para asegurar que se muestren en el navegador. También cubre cómo <pre> mantiene los espacios en blanco y saltos de línea, y cómo <hr> inserta una regla horizontal sin etiqueta de cierre.
This certificate certifies that Ana Marzal Lopez passed the Foundation Certificate in Business Analysis exam from BCS, The Chartered Institute for IT in November 2016. The certificate number is 00300860 and was issued by Paul Fletcher, the Group Chief Executive of BCS on November 6, 2016 after Ana satisfied the requirements for the Foundation Certificate in Business Analysis.
This document discusses Linux server provisioning using Stacki. Stacki is a tool that automates the provisioning of Linux servers at scale from bare metal to a fully configured system. It addresses the exponential complexity of managing large clusters as more servers are added. Stacki handles all aspects of server provisioning from OS installation to configuration of networking, storage, software and more. It provides a fully automated, repeatable process to quickly deploy and manage servers.
Introduction to Stacki - World's fastest Linux server provisioning ToolSuresh Paulraj
Stacki is an open source tool for provisioning and managing Linux servers at scale. It provides fast, reliable provisioning of servers from bare metal to a fully configured system. PayPal uses Stacki to manage their Hadoop infrastructure, which includes over 3,000 nodes spread across multiple datacenters. Stacki automates tasks like disk formatting, partitioning, OS installation, and integration with other tools to quickly provision new servers. It helped PayPal reduce provisioning time from hours to just 14 minutes for 288 servers.
Joe Kaiser, System Engineer at StackIQ at the Seattle Scalability Meetup on April 27, 2016
This presentation was followed by a demo of Kubernetes on Stacki
1) The document provides an overview of OpenStack, an open source cloud computing platform, describing its main components for compute (Nova), object storage (Swift), and history.
2) It discusses different methods for deploying and testing OpenStack, including using Vagrant and the nova.sh script, and considerations for physical deployment like hardware selection and network configuration.
3) The document concludes with information on monitoring, upcoming features, and thanks/questions.
This document outlines the process of implementing OpenStack on 4 machines each with Intel Core processors, 8GB RAM and Ubuntu 14.04. It discusses OpenStack components like Nova, Glance, Swift etc. Devstack was used to deploy OpenStack which pulls code from git repositories. Procedures included installing components without clustering, LVM, MAAS and Trove. Issues faced included TFTP booting, Wake on LAN, image importing and accessing instances. In conclusion, the team was able to understand OpenStack functionality which was the goal of the project.
The document provides an overview of Puppet, including:
- Puppet's architecture with the Puppet Master server and Puppet agents.
- The lifecycle of a Puppet run where facts are sent to the master to compile a catalog and agents report back.
- Core components that Puppet is built on like the Puppet language, Facter, Hiera, MCollective, and PuppetDB.
- Products like Puppet Enterprise that provide additional features on top of open source Puppet.
- How to get involved with the Puppet community through training, contributing code/modules, and asking questions.
20150704 benchmark and user experience in sahara weitingWei Ting Chen
Sahara provides a way to deploy and manage Hadoop clusters within an OpenStack cloud. It addresses common customer needs like providing an elastic environment for data processing jobs, integrating Hadoop with the existing private cloud infrastructure, and reducing costs. Key challenges include speeding up cluster provisioning times, supporting complex data workflows, optimizing storage architectures, and improving performance when using remote object storage.
Java ee7 with apache spark for the world's largest credit card core systems, ...Rakuten Group, Inc.
Financial industry companies need Java EE to power for its business today. Rakuten Card, one of the largest credit card companies in Japan, adopted Java EE 7 for its credit card core systems architecture, from one of the oldest COBOL based mainframe in Japan. Additionally, we chose Apache Spark for super rapid batch execution platform. We completed this big core system migration project successfully.
You can learn why we choose Java EE, and Apache Spark for super rapid batch execution, and our experiences and lessons we learned. How to start such a the big project? Why we choose it, how we ported, how use Apache Spark for performance improvements, and launched with? We’ll answer these questions and any that you may have.
Additionally, we are going to unveil our future roadmap for expanding our systems as well, with the cutting edge technology and standards.
The document provides an overview of Puppet, an open source configuration management tool. It discusses Puppet's architecture including the Puppet Master, agents, Facter for collecting facts, Hiera for hierarchical data, and MCollective for orchestration. It also summarizes Puppet Enterprise features like the PuppetDB, interactive event analysis, and cloud provisioning support. The document encourages joining the Puppet community, getting certified through training, and influencing further product development.
Tharun Kumar Padigala is seeking a challenging position in the IT industry where he can apply his 4.6 years of technical experience. He has experience working with OpenStack, Linux administration, virtualization, networking and monitoring tools. His most recent role was as a Development Engineer at Pramati Technologies where he provided OpenStack deployment, support and development for clients.
Accelerate Your OpenStack Deployment Presented by SolidFire and Red HatNetApp
What would you do if your storage infrastructure weren't a barrier to your cloud? In 'Accelerate your OpenStack Deployment' you'll see how Agile Infrastructure (AI) simplifies deployments and dynamic IT-as-a-Service-style offerings, such as self-service test & development or production-ready private clouds. AI frees you to think up the stack and stop worrying about your infrastructure.
This document summarizes a presentation by Vinay Chella and Joey Lynch from Netflix on building and running cloud native Cassandra. They outline some of Cassandra's limitations for cloud deployments including development friction, packaging issues, cluster startup difficulties, and lack of scaling tools. Their proposals aim to address these by improving documentation, automating builds/tests, packaging for containers/packages, adding cluster control planes, and integrating metrics/monitoring. The speakers believe targeted changes can help Cassandra better support cloud-native principles of flexibility, scalability, and reliability.
This document provides an overview of Puppet, an open source configuration management tool. It summarizes Puppet's key components like the Puppet Master, agents, Facter, Hiera and PuppetDB. It also outlines Puppet Enterprise features like interactive reporting, resource browsing and cloud provisioning. Finally, it encourages engagement with the Puppet community through contributing code/modules, asking questions, helping with documentation and attending training.
Handling Kernel Upgrades at Scale - The Dirty Cow StoryDataWorks Summit
Apache Hadoop at Yahoo is a massive platform with 36 different clusters spread across YARN, Apache HBase, and Apache Storm deployments, totaling 60,000 servers made up of 100s of different hardware configurations accumulated over generations, presenting unique operational challenges and a variety of unforeseen corner cases. In this talk, we will share methods, tips and tricks to deal with large scale kernel upgrade on heterogeneous platforms within tight timeframes with 100% uptime and no service or data loss through the Dirty COW use case (privilege escalation vulnerability found in the Linux Kernel in late 2016).
We will dive deep into our three phased approach that led to eventual success of the program - pre work, kernel upgrade itself, and post work / cleanup. We will share the details on automation tools, UIs, and reporting tools developed and used to achieve the stated objectives of 800+ server upgrades per hour, track the upgrade progress, validate and report data blocks, and recover quickly from bad blocks encountered. Throughout the talk, we will highlight the importance of process management, communicating with 100s of custom teams to ensure they are onboard and aware, and successful coordination tactics with SREs and Site Operations. We will also touch upon some of the unique challenges we faced along with way such as BIOS updates necessary on over 20,000 hosts along the way, and explain system rolling upgrade support we added to HBase and Storm for avoiding service disruption to low latency customer during these upgrades.
The best way to understand the cloud, is to have one of your own to kick around, poke, break, fix, and see what it looks like when it's running. In OpenStack we've got a whole project called Devstack which is designed to quickly bootstrap the latest git versions of all the OpenStack components and create an experimentation friendly OpenStack environment. This talk will introduce Devstack, how to get a running OpenStack with it, and how one might begin making changes and seeing them in action. We'll explore a few of the major OpenStack services, see what's going on, all with the intent to explain what OpenStack is by seeing not only the interface, but the internals at work.
Presented at LinuxCon NA 2014
OSCON 2013 - Planning an OpenStack Cloud - Tom FifieldOSCON Byrum
The flexibility of OpenStack is a dual-edged sword, giving you unprecedented control over your infrastructure, but potentially becoming a nightmare for the indecisive manager, architect or sysadmin!
In this presentation, Tom Fifield – co-author of the OpenStack Operations Guide, and Community Manager at the OpenStack Foundation – takes you through some of the decisions you will face when planning your OpenStack cloud. In addition to a brief introduction on OpenStack and advice on how to interact with the community, he will cover topics such as:
How to approach your deployment, ranging from DIY to a turn-key solution from the ecosystem
Storage and networking decisions, including plugin options
Automating deployment and configuration with popular tools like Puppet and Chef
Through discussion of the ecosystem, customization and scaling, you’ll walk away with an understanding of ‘what it takes’ to build your OpenStack cloud.
A brief overview of what we do at Gruntwork. Learn what we mean by "DevOps as a Service" and how you can get your entire infrastructure, defined as code, in about a day. https://www.gruntwork.io/
Nigel Kersten presented on Puppet and infrastructure automation. Some key points include:
- Puppet is an open source tool for infrastructure automation and application deployment. It uses a declarative language to define and enforce the desired state of systems.
- Puppet Enterprise builds on Puppet to add features for reporting, workflows, access control and more.
- Related tools like MCollective, PuppetDB, Facter and Hiera work together to provide orchestration, inventory, hierarchical data and other capabilities.
- Puppet Labs is continually improving Puppet and related tools while also growing its customer base and hiring more employees.
The document provides an overview of various cloud computing, big data, and web development projects. It summarizes achievements in cloud infrastructure using OpenStack and OpenShift, building Hadoop clusters for big data analytics, and developing web applications. It outlines next steps of integrating OpenShift with OpenStack, implementing real-time data processing using HBase, and automating matching between farmers and food processors for a web application.
Similar to Introduction to Stacki at Atlanta Meetup February 2016 (20)
Installing a Cluster of Raspberry Pis with Stacki AceStackIQ
To fully utilize the power of Raspberry Pis, StackIQ ported Stacki to support the inexpensive single-board computers, creating Stacki Ace: an open-source bare-metal installer for Raspberry Pis.
With the release of Stacki 4.0 comes more improvements in Stacki Ace. The avalanche installer has been added for parallel installation and 20MB images instead of 2GB images are now used to install the backend nodes increasing the installation speed immensely.
This is Greg Bruno's presentation from his webinar on how to create a cluster of Raspberry Pis with Stacki Ace.
Presentation by StackIQ's Director of Open Source Engineering, Joe Kaiser, at StackiFest 2017.
Step 1 of every Hadoop vendor’s documentation reads something like this: “First install a cluster.” Without a consistent group of installed machines, a Hadoop installation is prone to failure. Open source Stacki installs machines to a ping and a prompt enabling the consistency and configuration required for a functioning Hadoop installation.
StackIQ released a new open source Hortonworks bridge pallet to enable the installation of Hortonworks through the Ambari appliance at the beginning of 2017. In this presentation, Joe will show you how to set-up Stacki, the HDP Bridge pallet, Ambari, and then install Hadoop on a running cluster.
Teradata uses Stacki for automated bare-metal provisioning and configuration management of its database servers. Some key points:
- Stacki allows Teradata to provision servers consistently at scale for testing, manufacturing, and customer environments in a standardized way.
- Previously Teradata used customized SUSE DVDs which became outdated, inconsistent, and did not scale well. Stacki addresses these challenges through its stackable software components approach.
- Teradata combines different "pallets" and "carts" in Stacki to create customized software stacks for different server types and roles like database servers, Hadoop servers, and Aster servers.
- Stacki is integrated into Teradata's continuous integration/deployment
Installing a Cluster of Raspberry Pis with Stacki AceStackIQ
Presentation by StackIQ's VP Engineering/Co-Founder, Greg Bruno, at StackiFest 2017.
The Raspberry Pi was originally developed by the Raspberry Pi Foundation to promote the teaching of basic computer science in schools and developing countries. And although these little single-board computers have done just that, they have the ability to do so much more when paired with the correct tools. That’s why StackIQ ported Stacki (their original bare metal x86_64 server installer) to support Raspberry Pis, creating Stacki Ace: an open-source bare-metal installer for Raspberry Pis.
Stacki has released version 4.0 with new features including complete REST API version 2, support for SLES 11 and 12, NetApp storage configuration, OS abstraction to support backend nodes running Ubuntu or SLES, and an open source Kubernetes pallet version 2. Stacki Ace version 1 has also been released which allows Stacki to run on Raspberry Pi and ARM architectures. Future planned developments include REST API version 3, UX updates integrating the REST API and adding user accounts, and support for MicroOS/CaaSP and non-RHEL frontends like SUSE.
Automation of your OpenStack Infrastructure with StackiStackIQ
This document discusses CloudLabs' focus on rack scale reference platforms and integrated solutions. It provides an overview of CloudLabs' investments in rack solutions including CORD, OPNFV, OCP, and Intel RSA architectures. It also summarizes Stacki for baremetal provisioning, OpenStack-Ansible for OpenStack deployment, and CloudLabs' benchmarking framework for validating solutions from baremetal to rack scale.
This document discusses bare metal servers in data centers. Bare metal servers are at an innovative crossroads as containers gain popularity. The Stacki community has grown near linearly to over 150 members managing over 1 million paying customers across more than 2,500 bare metal nodes under Stacki management software.
Provisioning Heterogenous Bare Metal with StackiStackIQ
Justin Senseney of NIST's presentation from StackiFest 2017.
Stacki was used to upgrade a high-performance computing (HPC) cluster at the National Institute of Standards and Technology (NIST) in Gaithersburg, Maryland. NIST is the United States’ federal metrology institute, performing research and creating standards for measurements and technology, including materials, data, and cyber-security. A 1,200 node CentOS5 Maui/Torque cluster was upgraded to CentOS7 with a Slurm queuing system. At the same time, hundreds of servers were removed and added to this cluster. This presentation will show the application of Stacki to this HPC cluster and contrast previous methods used for provisioning. Stacki carts and pallets are used to provision role-based servers, including GPU, high-memory, and multiple login servers. Ideas are proposed to allow us to extend this application to managing multiple clusters. Any mention of commercial products within this presentation, including Stacki, is for information purposes; it does not imply recommendation or endorsement by NIST.
Stacki DC Meetup (11/30/16)
Presenter: Justin Senseney- Senior Computer Scientist, NIST
Description:Stacki was used to upgrade a high performance computing (HPC) cluster at the National Institute of Standards and Technology (NIST) in Gaithersburg, MD. A 1,200 node CentOS5 Maui/Torque cluster was upgraded to CentOS7 with a slurm queuing system. This presentation will show the application of Stacki to this HPC cluster and contrast previous methods used for provisioning. Stacki carts and pallets are used to provision role-based servers. Ideas are presented that would make it easier for multiple clusters to be managed. Any mention of commercial products, including Stacki, within this presentation is for information only; it does not imply recommendation or endorsement by NIST.
Public vs. Private Cloud Performance by FlexStackIQ
This document discusses using Ansible to automate benchmarking of OpenStack clouds. It describes using OpenStack-Ansible to deploy OpenStack, Ansible roles to run benchmarks, and parsing tasks to analyze benchmark results. Benchmarking tasks test performance using tools like Passmark, stress, mprime, and reboot tests. Results are fetched to a control host and system logs are collected. The experience of using Ansible for automation and repeatable benchmarking is positive due to its ease of use, flexibility, and ability to reduce manual work. Future enhancements could expand benchmark coverage and contribute to community benchmarking projects.
StackiFest16: Automation for Event-Driven Infrastructure - Dave Boucha StackIQ
This document provides an overview of SaltStack's event-driven infrastructure and common Salt architectures. It introduces the core Salt components of the salt-master, salt-minion, and salt-ssh. Typical layouts involve salt-minions communicating with a central salt-master over ZeroMQ or SSH. The document also briefly outlines some advanced Salt features like the event bus, beacons, reactors and engines before closing with contact information.
This document outlines a lab session on Phase 1 projects. It discusses what systems and tools are available, what is missing, how CoreOS and Ubuntu are set up, and that there will be a workshop and demo before concluding the session.
Private clouds are gaining popularity over public clouds as they provide more security, reliability, customizability and control over data. A private cloud allows for on-demand scalability and dedicated servers. It is expected that private cloud use will grow to almost 50% of companies in the next two years, compared to only 15% using public clouds primarily. Companies primarily use private clouds for application development, data storage, and core business applications.
There are some things in Stacki that you can only do with Remove commands. This tutorial takes you over the most common remove commands and offers an overview of how they work.
Download Stacki: www.stacki.com
Open source software has emerged as the driving force of technology innovation, from cloud and Big Data to social media and mobile.
Download Stacki, open source bare metal installer, at www.stacki.com
Transform Your Communication with Cloud-Based IVR SolutionsTheSMSPoint
Discover the power of Cloud-Based IVR Solutions to streamline communication processes. Embrace scalability and cost-efficiency while enhancing customer experiences with features like automated call routing and voice recognition. Accessible from anywhere, these solutions integrate seamlessly with existing systems, providing real-time analytics for continuous improvement. Revolutionize your communication strategy today with Cloud-Based IVR Solutions. Learn more at: https://thesmspoint.com/channel/cloud-telephony
Zoom is a comprehensive platform designed to connect individuals and teams efficiently. With its user-friendly interface and powerful features, Zoom has become a go-to solution for virtual communication and collaboration. It offers a range of tools, including virtual meetings, team chat, VoIP phone systems, online whiteboards, and AI companions, to streamline workflows and enhance productivity.
Mobile App Development Company In Noida | Drona InfotechDrona Infotech
Drona Infotech is a premier mobile app development company in Noida, providing cutting-edge solutions for businesses.
Visit Us For : https://www.dronainfotech.com/mobile-application-development/
Hand Rolled Applicative User ValidationCode KataPhilip Schwarz
Could you use a simple piece of Scala validation code (granted, a very simplistic one too!) that you can rewrite, now and again, to refresh your basic understanding of Applicative operators <*>, <*, *>?
The goal is not to write perfect code showcasing validation, but rather, to provide a small, rough-and ready exercise to reinforce your muscle-memory.
Despite its grandiose-sounding title, this deck consists of just three slides showing the Scala 3 code to be rewritten whenever the details of the operators begin to fade away.
The code is my rough and ready translation of a Haskell user-validation program found in a book called Finding Success (and Failure) in Haskell - Fall in love with applicative functors.
What is Master Data Management by PiLog Groupaymanquadri279
PiLog Group's Master Data Record Manager (MDRM) is a sophisticated enterprise solution designed to ensure data accuracy, consistency, and governance across various business functions. MDRM integrates advanced data management technologies to cleanse, classify, and standardize master data, thereby enhancing data quality and operational efficiency.
Neo4j - Product Vision and Knowledge Graphs - GraphSummit ParisNeo4j
Dr. Jesús Barrasa, Head of Solutions Architecture for EMEA, Neo4j
Découvrez les dernières innovations de Neo4j, et notamment les dernières intégrations cloud et les améliorations produits qui font de Neo4j un choix essentiel pour les développeurs qui créent des applications avec des données interconnectées et de l’IA générative.
A Study of Variable-Role-based Feature Enrichment in Neural Models of CodeAftab Hussain
Understanding variable roles in code has been found to be helpful by students
in learning programming -- could variable roles help deep neural models in
performing coding tasks? We do an exploratory study.
- These are slides of the talk given at InteNSE'23: The 1st International Workshop on Interpretability and Robustness in Neural Software Engineering, co-located with the 45th International Conference on Software Engineering, ICSE 2023, Melbourne Australia
When it is all about ERP solutions, companies typically meet their needs with common ERP solutions like SAP, Oracle, and Microsoft Dynamics. These big players have demonstrated that ERP systems can be either simple or highly comprehensive. This remains true today, but there are new factors to consider, including a promising new contender in the market that’s Odoo. This blog compares Odoo ERP with traditional ERP systems and explains why many companies now see Odoo ERP as the best choice.
What are ERP Systems?
An ERP, or Enterprise Resource Planning, system provides your company with valuable information to help you make better decisions and boost your ROI. You should choose an ERP system based on your company’s specific needs. For instance, if you run a manufacturing or retail business, you will need an ERP system that efficiently manages inventory. A consulting firm, on the other hand, would benefit from an ERP system that enhances daily operations. Similarly, eCommerce stores would select an ERP system tailored to their needs.
Because different businesses have different requirements, ERP system functionalities can vary. Among the various ERP systems available, Odoo ERP is considered one of the best in the ERp market with more than 12 million global users today.
Odoo is an open-source ERP system initially designed for small to medium-sized businesses but now suitable for a wide range of companies. Odoo offers a scalable and configurable point-of-sale management solution and allows you to create customised modules for specific industries. Odoo is gaining more popularity because it is built in a way that allows easy customisation, has a user-friendly interface, and is affordable. Here, you will cover the main differences and get to know why Odoo is gaining attention despite the many other ERP systems available in the market.
Do you want Software for your Business? Visit Deuglo
Deuglo has top Software Developers in India. They are experts in software development and help design and create custom Software solutions.
Deuglo follows seven steps methods for delivering their services to their customers. They called it the Software development life cycle process (SDLC).
Requirement — Collecting the Requirements is the first Phase in the SSLC process.
Feasibility Study — after completing the requirement process they move to the design phase.
Design — in this phase, they start designing the software.
Coding — when designing is completed, the developers start coding for the software.
Testing — in this phase when the coding of the software is done the testing team will start testing.
Installation — after completion of testing, the application opens to the live server and launches!
Maintenance — after completing the software development, customers start using the software.
Atelier - Innover avec l’IA Générative et les graphes de connaissancesNeo4j
Atelier - Innover avec l’IA Générative et les graphes de connaissances
Allez au-delà du battage médiatique autour de l’IA et découvrez des techniques pratiques pour utiliser l’IA de manière responsable à travers les données de votre organisation. Explorez comment utiliser les graphes de connaissances pour augmenter la précision, la transparence et la capacité d’explication dans les systèmes d’IA générative. Vous partirez avec une expérience pratique combinant les relations entre les données et les LLM pour apporter du contexte spécifique à votre domaine et améliorer votre raisonnement.
Amenez votre ordinateur portable et nous vous guiderons sur la mise en place de votre propre pile d’IA générative, en vous fournissant des exemples pratiques et codés pour démarrer en quelques minutes.
Graspan: A Big Data System for Big Code AnalysisAftab Hussain
We built a disk-based parallel graph system, Graspan, that uses a novel edge-pair centric computation model to compute dynamic transitive closures on very large program graphs.
We implement context-sensitive pointer/alias and dataflow analyses on Graspan. An evaluation of these analyses on large codebases such as Linux shows that their Graspan implementations scale to millions of lines of code and are much simpler than their original implementations.
These analyses were used to augment the existing checkers; these augmented checkers found 132 new NULL pointer bugs and 1308 unnecessary NULL tests in Linux 4.4.0-rc5, PostgreSQL 8.3.9, and Apache httpd 2.2.18.
- Accepted in ASPLOS ‘17, Xi’an, China.
- Featured in the tutorial, Systemized Program Analyses: A Big Data Perspective on Static Analysis Scalability, ASPLOS ‘17.
- Invited for presentation at SoCal PLS ‘16.
- Invited for poster presentation at PLDI SRC ‘16.
E-commerce Development Services- Hornet DynamicsHornet Dynamics
For any business hoping to succeed in the digital age, having a strong online presence is crucial. We offer Ecommerce Development Services that are customized according to your business requirements and client preferences, enabling you to create a dynamic, safe, and user-friendly online store.
Unveiling the Advantages of Agile Software Development.pdfbrainerhub1
Learn about Agile Software Development's advantages. Simplify your workflow to spur quicker innovation. Jump right in! We have also discussed the advantages.
Introducing Crescat - Event Management Software for Venues, Festivals and Eve...Crescat
Crescat is industry-trusted event management software, built by event professionals for event professionals. Founded in 2017, we have three key products tailored for the live event industry.
Crescat Event for concert promoters and event agencies. Crescat Venue for music venues, conference centers, wedding venues, concert halls and more. And Crescat Festival for festivals, conferences and complex events.
With a wide range of popular features such as event scheduling, shift management, volunteer and crew coordination, artist booking and much more, Crescat is designed for customisation and ease-of-use.
Over 125,000 events have been planned in Crescat and with hundreds of customers of all shapes and sizes, from boutique event agencies through to international concert promoters, Crescat is rigged for success. What's more, we highly value feedback from our users and we are constantly improving our software with updates, new features and improvements.
If you plan events, run a venue or produce festivals and you're looking for ways to make your life easier, then we have a solution for you. Try our software for free or schedule a no-obligation demo with one of our product specialists today at crescat.io
2. Open Source Stack Installer
cki is a very fast and ultra reliable Linux server provisioning tool … at sca
h zero prerequisites for taking systems from bare metal to a ping and prom
4. Hadoop @ PayPal
12 x 2TB SATA
data drives
48 nodes
each rack
1GBE-10GBE
NICs
24 x 900GB 6G SA
10K data drives
24 nodes
each rack
10GBE NIC
8 x 4TB NR-SAS
data drives
10 GBE NIC
Bay Area
Salt Lake City
Las Vegas
DATACENTERS
000 nodes and growing
+ initial server racks
terogeneous HW
oss multiple DCs
Science
structure Footprint
48 nodes
each rack
5. Stacki @ PayPal
Ambari HDP
Health Detectio
Integration
IPMI/iLOOS Disk Network
DHCP / DNS /
TFTP
Ansible
- Disk Array Controller Configuration
- Disk Partitioning Configuration
“Stacki + Ansible = Happiness. :D” – Stacki mailing list 8/11/15
6. Quick, Early Success
14 Minutes*To Fully Provision 6 Racks of Bare Metal (288 Servers)
s wiping all
en fully
ning & formatting
rives
And Now…
Upgrades all firmware
automatically
Executes Ansible
scripts on all hosts
Hadoop packages
installed
us hours with other hyperscale management tools, or days to weeks with traditional tools and proc
10. Download and Boot the ISO
www.stacki.com and download the ISO
s 1.2 GB
“stacki” pallet plus stripped down CentOS 6.7
the ISO on the host that will be your frontend
11.
12.
13.
14.
15.
16.
17. Frontend Services
ces to build backend nodes
HCP
TP
amed (optional)
ces to access backend nodes
SH key management
rallel execution shell
19. Frontend
Network
Backend Backend Backend B
em1 em1
em1
em1
em1
Backend Installation
your Host Configuration spreadsheet as a CSV
t CSV on frontend
tack load hostfile file=hosts.csv”
ackend nodes to install on their next PXE boot
tack set host boot backend action=install”
boot all backend nodes
22. Advanced Networking
ost Configuration spreadsheet, you can configure:
onded interfaces
ANs
idging
y combo of the above
ge hosts in multiple subnets
uild a single cluster from hosts in multiple subnets
anage hosts in multiple datacenters
26. Multiple Distributions
ntend houses a default distribution
ased on stripped down CentOS 6.7 or 7.2
sed to build backend nodes
dd any number of new distributions to a frontend
g., RHEL 6.x based distro, CentOS 6.5, etc.
n any backend node to any distro
28. The “Step 0” Problem
Check namenodes are
empty
Format/start HDFS
Create all directories
Create all metastores
Start services (Hbase, Hive,
Oozie, Sqoop, Impala, etc)
Deploy client configuration Configure database
Setup/assign monitors
(activity, services, and host)
Test database connections
Validate/resolve hostnamesConsistent host timezones
No bad kernel versions
running
(CDH) version consistency
Java version consistency
Daemons versions
consistency
Mgmt Agents versions
consistency
Host specification/SSH
ports
MUCH MORE …
DHCP Server/Client setup TFTP/PXE configuration
Server OS installation
Node OS Install
RAID configuration
Boot configuration
System/data disk
partitioning
Monitoring system setup
and config
Lights Out/IPMI setup
User accounts added and
synced
SSH keys on all hosts
Network node configuration
Config Mgmt install and
configuration
Route configurationOS upgrades/updates
Site specific software and
configuration
Host specification/SSH
ports
Security
Firewall setupCluster Mgmt utility Database install and config
Multiple network configPackage installation MUCH MORE …
29. Clusters are Different
ng new servers does require coordination
y added servers must:
Have same software stack as original
servers
Have same configuration as original
servers
Know about original servers
original servers must:
Know about new servers
lt: The management complexity added to the
ations staff is “exponential”
31. The Pain Curve
Number of Servers
ManagementComplexity
General Data Center
Clusters
PAIN
32. The Pain Threshold
pain threshold differs for every
nization
ion of:
cluster(s) size
number of people in Operations
Operations staff cluster expertise
33. Moore’s Law
50 1 2 3 4
8
1
2
3
4
5
6
7
Time (Years)
Density
18 month
doubling
35. What it Means for You
50 1 2 3 4
100
0
10
20
30
40
50
60
70
80
90
Time (Years)
Value(%)
3 months
90% value
18 months
50% value
36. Time is Money
clock starts ticking when hosts land on your
ng dock
out your applications online, you have an
r weight that consumes power, cooling, and
agement’s attention
39. History
n Diego Supercomputer Center
1986 - National Science Foundation
Along with NCSA only two non-classified centers
Mission: serve computational scientists
cks
2000 - First cluster group inside SDSC
Version 1.0 released that November as open source
10k+ clusters world-wide
ckIQ
2006 - Commercial support for Rocks
2011 - Venture Backed
Focus on next generation clustered systems (Data, Cloud)
cki - 2015
June – released as open source
July – first hyper-scale user
40. Must Haves
it – Automatic
nk about it, test it. Deploy it.
ople don’t scale, software does. Free your people – allow ops guys to be ops/analysis guys, move them from single machine vie
bal machine view.
it – Repeatable
te of the environment is guaranteed. Does not require homogeneity of hardware or functionality. Make compute environments
mogenous on heterogeneous hardware and software.
ally, nothing is homogenous. Environment maybe, behavior of that environment on different machines while predictable will not be
me across all hardware. Stacki gets you flexibility and predictability.
it – Reliable
u always get what you want when you want it. You can make reasonable estimates of need because you’ve made the environmen
dictable and repeatable. Just like science!
it – Comprehensive
nage application layer(s) down to kernels and device configuration with one tool. Never hit the network unconfigured.
vide turn-key deployment with reasonable default settings and ability to customize / re-wire as desired.
41. PayPal’s Options
Bring what we used at former parent company eBay with us.
Build our own soups-to-nuts bespoke bare metal provisioning tool.
Find the perfect open source tool that we can use and grow with.
Not Possible
Not Optimal
Not Likely
42. Quick, Early Success
2 Weeks Instead of 2 Years
To Build a Scale-out Management Solution
stalled Stacki Frontend (base management server)
an test installations of backend servers
Single Server test
Full Rack test (48 nodes)
dated distribution (CentOS 6.6) to install additional
kages
grated IPMI information into Stacki
Can now ssh into all IPMI consoles from the Stacki
frontend host using <hostname>.ipmi
ran with PayPal kickstart changes/additions and was
e to image 6 racks in 14 minutes, including:
Nuking disks/partitions and running a full format of all
data drives
5. Updated the Stacki post-boot piece to do the follo
1. Upgrade firmware if host needs it
2. Runs PayPal Ansible playbook, which:
1. Installs additional packages
2. Creates user accounts
3. Disables unused services
4. Sets up resolver/ntp/syslog-ng/sudoers/lim
d/sysctl/etc.
5. Installs/configures Ambari agents
6. Checks data drive mounts, fstab
7. Prepares the rack to be added to a Hadoo
cluster
development with Stacki includes:
43. DevOps Agnostic
DevOps / Configuration Tool
DHCP /
DNS / TFTP
NetworkDiskOS
In-house
developed
deployment
tools
- Disk Array Controller Configuration
- Disk Partitioning Configuration
44. The “Step 0” Problem
k namenodes are
empty
Format/start HDFS
ate all directories
Create all metastores
Start services (Hbase, Hive,
Oozie, Sqoop, Impala, etc)
client configuration Configure database
Setup/assign monitors
(activity, services, and host)
Test database connections
Validate/resolve hostnamestent host timezones
ad kernel versions
running
version consistency
Java version consistency
Daemons versions
consistency
Mgmt Agents versions
consistency
Host specification/SSH
ports
MUCH MORE …
Server/Client setup TFTP/PXE configuration
er OS installation
Node OS Install
RAID configuration
ot configuration
System/data disk
partitioning
Monitoring system setup
and config
Lights Out/IPMI setup
User accounts added and
synced
keys on all hosts
k node configuration
g Mgmt install and
configuration
Route configurationOS upgrades/updates
Site specific software and
configuration
Host specification/SSH
ports
Security
Firewall setupster Mgmt utility Database install and config
Multiple network configkage installation MUCH MORE …
App Confi
Site Confi
HW Insta
System Performance
Validation
Bare Metal Installers
Hadoop Mgmt Tool
Upgrades/Patching
Disk Configuration
Monitoring Tool
Configuration Tool
Network/Site Config ToolsSystems Mgmt Tool
Others …
MANUAL
SEMI-AUTOMATED
TOOLCHAIN
(w/o StackIQ)
w/Stack
FULLY AUTO
46. Configuration Database
Server appliance types (e.g. data, namenode, tomcat, …)
Number of CPUs
Disk partitioning
Hardware RAID config
PCI bus information
…
And other System Attributes
47. Attributes
Global
◦ stack set attr
Appliance
◦ stack set appliance attr
OS
◦ stack set os attr
Host
◦ stack set host attr