SlideShare a Scribd company logo
1 of 52
Mixing Your 
Open Source Cloud Cocktail 
Mark Hinkle 
Senior Director, Open Source Solutions 
Citrix Inc. 
mark.hinkle@citrix.com 
mrhinkle@gmail.com 
@mrhinkle 
Last updated: 10/13/14
By Mark R. Hinkle 
@mrhinkle 
mrhinkle@gmail.com 
ABOUT ME 
I Help Build Open Source Ecosystems 
Open Source Experience 
• Manage Citrix Open Source Business Office 
• Apache CloudStack Committer and PMC Member 
• Advisory boards Gluster and Xen Project 
• Joined Citrix via Cloud.com acquisition July 2011 
• Zenoss Core open source project to 100,000 users, 
1.5 million downloads 
• Former LinuxWorld Magazine Editor-in-Chief 
• Open Management Consortium organizer 
• Author - “Windows to Linux Business Desktop 
Migration” – Thomson 
• NetDirector Project - Open Source Configuration 
Management 
Cloud Open EU 2014 - Mixing Your Open Source Cloud Cocktail
http://www.slideshare.net/socializedsoftware 
Attribution — You must give appropriate credit, provide a link to the license, and indicate if changes 
were made. You may do so in any reasonable manner, but not in any way that suggests the licensor 
endorses you or your use. 
ShareAlike — If you remix, transform, or build upon the material, you must distribute your contributions 
under the same license as the original. 
By Mark R. Hinkle 
@mrhinkle 
mrhinkle@gmail.com 
Slides Available on Slideshare: 
Creative Commons Attributions-ShareAlike 4.0 International 
Share — copy and redistribute the material in any medium or format 
Adapt — remix, transform, and build upon the material 
for any purpose, even commercially. 
The licensor cannot revoke these freedoms as long as you follow the license terms. 
Cloud Open EU 2014 - Mixing Your Open Source Cloud Cocktail
By Mark R. Hinkle 
@mrhinkle 
mrhinkle@gmail.com 
AGENDA 
• Vetting Open Source Cloud Projects 
• “What is Cloud” in 60 Seconds 
• Virtualization 
• Infrastructure-as-a-Service 
• Platform-as-a-Service 
• SDN 
• Open Source for the Amazon Web Services 
Cloud Open EU 2014 - Mixing Your Open Source Cloud Cocktail
• Code Velocity 
• Committers 
• Committer Reputation 
• User-driven or Vendor-Driven 
Innovation 
• User Activity 
• Corporate Support* 
• Reputation of Foundation* 
By Mark R. Hinkle 
@mrhinkle 
mrhinkle@gmail.com 
VETTING OPEN SOURCE 
HPowR caOn yJouE telCl if TtheSy’re Legit 
Cloud Open EU 2014 - Mixing Your Open Source Cloud Cocktail
http://www.ohloh.net http://activity.openstack.org 
By Mark R. Hinkle 
@mrhinkle 
mrhinkle@gmail.com 
OPEN SOURCE ANALYSIS 
Visualizing Community Activity 
Cloud Open EU 2014 - Mixing Your Open Source Cloud Cocktail
By Mark R. Hinkle 
@mrhinkle 
mrhinkle@gmail.com 
60 SECOND CLOUD DEFINITION 
Just because Software Marketing Guys Think it’s the Internet 
5 CHARACTERISTICS OF CLOUD 
1. On-Demand Self-Service 
2. Broad Network Access 
3. Resource Pooling 
4. Rapid Elasticity 
5. Measured Service 
User Cloud a.k.a. 
SOFTWARE-AS-A-SERVICE 
Developer Cloud a.k.a. 
PLATFORM-AS-A-SERVICE 
Systems Cloud a.k.a. 
INFRASTRUCTURE-AS-A-SERVICE 
Cloud Open EU 2014 - Mixing Your Open Source Cloud Cocktail
By Mark R. Hinkle 
@mrhinkle 
mrhinkle@gmail.com 
SCALE-UP SCALE OUT 
Elasticity and the cloud 
Vertical Scaling (Scale-Up) 
Allocate additional resources to 
VMs, requires a reboot, no need for 
distributed app logic, single-point of 
OS failure 
Horizontal Scaling (Scale-Out) 
Application needs logic to work in 
distributed fashion (e.g. HA-Proxy 
and Apache Hadoop) 
Cloud Open EU 2014 - Mixing Your Open Source Cloud Cocktail
HYPERVISORS AND CONTAINERS 
Differences in virtualization 
By Mark R. Hinkle 
@mrhinkle 
mrhinkle@gmail.com 
Type 1 Hypervisors 
VMware, Xen Project, Hyper-V 
Type 2 Hypervisors 
KVM, VirtualBox 
Cloud Open EU 2014 - Mixing Your Open Source Cloud Cocktail 
Containers 
LXC
By Mark R. Hinkle 
@mrhinkle 
mrhinkle@gmail.com 
VIRTUALIZATION 
Carving up compute resources 
OPEN SOURCE 
• Xen Project 
• Citrix XenServer 
• KVM 
• VirtualBox 
• OpenVZ 
• LXC 
PROPRIETARY 
• VMware 
• Microsoft Hyper-V 
• OracleVM (Based on Xen Project) 
Cloud Open EU 2014 - Mixing Your Open Source Cloud Cocktail
OPEN VIRTUALIZATION FORMATS 
Virtualization Payloads 
Formats for hypervisors/cloud 
technologies: 
• Amazon - AMI 
• KVM – QCOW2 
• VMware – VMDK 
• Xen Project– IMG 
• Hyper-V - VHD – Virtual Hard Disk 
• LXC – local file system/mount point - 
By Mark R. Hinkle 
@mrhinkle 
mrhinkle@gmail.com 
Open Virtualization 
Format (OVF) is an 
open standard for 
packaging and 
distributing virtual 
appliances or more 
generally software to 
be run in virtual 
machines. 
Docker* 
Cloud Open EU 2014 - Mixing Your Open Source Cloud Cocktail
• Lets your run a Linux system within 
• A container is a group of processes on a 
Linux box, put together the provide an 
isolated environment 
• From the inside, it looks like a VM 
• Externally it looks like normal processes 
• “chroot on steroids” 
By Mark R. Hinkle 
@mrhinkle 
mrhinkle@gmail.com 
LINUX CONTAINERS (LXC) 
“Lightweight” Linux Virtualization 
another Linux system 
Cloud Open EU 2014 - Mixing Your Open Source Cloud Cocktail
• Different file formats for virtual machines 
• VMware uses vmdk file format, Xen and Hyper- 
V use VHD, KVM uses Raw or QCOW2 
• Guest images may be “processor architecture” 
• VMware and Xen can manage SCSI devices, but 
• KVM and Xen can use virtio drivers but not 
• VMware uses a proprietary agent inside the 
guest OS (VMware tools) which does not work 
with Xen or KVM 
• Xen uses VirtIo and ParaVirtualized drivers, Xen 
By Mark R. Hinkle 
@mrhinkle 
mrhinkle@gmail.com 
THE PORTABILITY PROBLEM 
Containers compared to Hardware Virtualization 
bound 
KVM cannot 
VMware 
uses 
Cloud Open EU 2014 - Mixing Your Open Source Cloud Cocktail
• Code – Application is stored 
• Build – Code is built (Jenkins) 
• Test – Unit tests are 
By Mark R. Hinkle 
@mrhinkle 
mrhinkle@gmail.com 
CONTINUOUS INTEGRATION 
Rebuild Applications on any Cloud and/or Virtualized Infrastructure 
in a repository 
(Subversion,Git) 
automated (Jenkins) 
• Deploy – Deploy code to 
server various ways 
Code 
Build 
Test 
Deploy 
Thoughtworks Go – Open Source 
Continuous Deliver System 
Cloud Open EU 2014 - Mixing Your Open Source Cloud Cocktail
PACKER MULTIPLATFORM VM 
CREATION 
Packer is easy to use and automates the 
creation of any type of machine image. It 
embraces modern configuration 
management by encouraging you to use 
automated scripts to install and configure 
the software within your Packer-made 
images. 
By Mark R. Hinkle 
@mrhinkle 
mrhinkle@gmail.com 
To learn more please visit: 
www.packer.io 
Open source Automation for VMs 
Cloud Open EU 2014 - Mixing Your Open Source Cloud Cocktail
DOCKER CONTAINER PACKAGING 
Open source LXC Packaging Engine 
Docker is an open-source project to easily 
create lightweight, portable, self-sufficient 
containers from any application. The same 
container that a developer builds and tests 
on a laptop can run at scale, in production, 
on VMs, bare metal, public clouds and 
more. 
By Mark R. Hinkle 
@mrhinkle 
mrhinkle@gmail.com 
To learn more please visit: 
www.docker.io 
Cloud Open EU 2014 - Mixing Your Open Source Cloud Cocktail
• Compliment to LXC not a replacement 
• Managed daemonized processes on Linux 
• Create ability to re-use and manage similar 
• Content agnostic 
• Hardware agnostic 
• Easy to automate 
• Integrated with other tools: Chef, OpenShift, 
By Mark R. Hinkle 
@mrhinkle 
mrhinkle@gmail.com 
WHAT IS DOCKER 
System for Managing and Deploying LXC Containers 
using LXC 
applications 
Puppet, VMware, etc. 
Cloud Open EU 2014 - Mixing Your Open Source Cloud Cocktail
DOCKER’S GROWING 
ECOSYSTEM 
By Mark R. Hinkle 
@mrhinkle 
mrhinkle@gmail.com 
Cloud Open EU 2014 - Mixing Your Open Source Cloud Cocktail
Kubernetes builds on top of Docker to 
construct a clustered container scheduling 
service. Kubernetes enables users to ask 
a cluster to run a set of containers. The 
system will automatically pick worker 
nodes to run those containers on, which 
we think of more as "scheduling" than 
"orchestration” 
To learn more please visit: 
https://github.com/GoogleCloudPlatform/kubernetes Greek for Shipmaster 
By Mark R. Hinkle 
@mrhinkle 
mrhinkle@gmail.com 
KUBERNETES 
Container Cluster Management – Scheduler 
Cloud Open EU 2014 - Mixing Your Open Source Cloud Cocktail
DOCKER RELATED 
PROJECTS 
• Fig -Fast, isolated development environments 
• Flynn - Next-generation application platform 
• Panamax – Drag-and-Drop Docker Containerization 
• Project Atomic – JEOS designed to run Docker 
containers 
• Weave – The Docker Network 
• 13,000+ Docker-related repos on Github 
By Mark R. Hinkle 
@mrhinkle 
mrhinkle@gmail.com 
Cloud Open EU 2014 - Mixing Your Open Source Cloud Cocktail
Apache Mesos is a cluster manager that simplifies the 
complexity of running applications on a shared pool of 
servers. Largely supported by Twitter, used by LinkedIn, 
AirBNB too. 
Features 
• Fault-tolerant replicated master using ZooKeeper 
• Scalability to 10,000s of nodes 
• Isolation between tasks with Linux Containers 
• Multi-resource scheduling (memory and CPU aware) 
• Java, Python and C++ APIs for developing new 
By Mark R. Hinkle 
@mrhinkle 
mrhinkle@gmail.com 
APACHE MESOS 
One to many tools for managing large numbers of devices 
parallel applications 
• Web UI for viewing cluster state 
To learn more please visit: 
http://mesos.apache.org/ 
Cloud Open EU 2014 - Mixing Your Open Source Cloud Cocktail
ZooKeeper is a centralized service for 
maintaining configuration information, 
naming, providing distributed 
synchronization, and providing group 
services. All of these kinds of services 
are used in some form or another by 
distributed applications 
By Mark R. Hinkle 
@mrhinkle 
mrhinkle@gmail.com 
APACHE ZOOKEEPER 
Centralized Server to Service Distributed Apps 
To learn more please visit: 
http://zookeeper.apache.org/ 
Cloud Open EU 2014 - Mixing Your Open Source Cloud Cocktail 
22
INFRASTRUCTURE-AS-A-SERVICE 
Compute Orchestration 
Project Year Started License Virtualization 
By Mark R. Hinkle 
@mrhinkle 
mrhinkle@gmail.com 
Cloud Open EU 2014 - Mixing Your Open Source Cloud Cocktail 
Technologies 
Apache 
CloudStack 
2008 Apache (Bare Metal), Xenserver, 
KVM, LXC VMware Hyper- 
V 
Eucalyptus 2006 GPL Xen, KVM, VMware 
(commercial version) 
OpenNebula 2005 Apache Xen, KVM, VMware 
OpenStack 2010 (Developed by 
NASA by Anso Labs 
previously) 
Apache VMware ESX and ESXi, , 
Xen, XenServer, KVM, 
LXC, QEMU and Virtual 
Box
By Mark R. Hinkle 
@mrhinkle 
mrhinkle@gmail.com 
OPENSTACK 
The Boy Band of the Open Source Cloud  
Cloud Open EU 2014 - Mixing Your Open Source Cloud Cocktail
By Mark R. Hinkle 
@mrhinkle 
mrhinkle@gmail.com 
OPENSTACK SHARED 
SSpaEn CRomVputIe,C StoEragSe and Networking 
IDENTITY 
SERVICE 
IMAGE 
SERVICE 
TELEMETRY 
SERVICE 
Cloud Open EU 2014 - Mixing Your Open Source Cloud Cocktail 
ORCHESTRATION 
SERVICE
EVEN MORE OPENSTACK PROJECTS 
Span Compute, Storage and Networking 
• Trove 
Database Service 
• Ironic 
Bare Metal (Ironic) 
• Marconi 
Queue Service 
By Mark R. Hinkle 
@mrhinkle 
mrhinkle@gmail.com 
• Cinder 
Block Storage Service 
• Ceilometer 
Metering/Monitoring 
• Heat 
Orchestration 
Cloud Open EU 2014 - Mixing Your Open Source Cloud Cocktail
OPENSTACK SOLUTION PROVIDERS 
If you can’t do it yourself 
“OpenStack is not a product. If you are building a large infrastructure, it’s 
more like a tool kit. It gives you a lot of technologies that do take a lot of 
effort to integrate.” 
Chris Kemp, OpenStack Board Member and Co-Founder 
By Mark R. Hinkle 
@mrhinkle 
mrhinkle@gmail.com 
Cloud Open EU 2014 - Mixing Your Open Source Cloud Cocktail 
CEO of Piston Computing
• Deltacloud(ruby) 
• Daisein(java) 
• Jclouds(java) 
• Libcloud(python) 
• Fog(ruby) 
By Mark R. Hinkle 
@mrhinkle 
mrhinkle@gmail.com 
CLOUD APIS 
Everything (should) have an API in the Cloud 
Cloud Open EU 2014 - Mixing Your Open Source Cloud Cocktail
Project Description 
Ceph Distributed file storage system developed by DreamHost -> 
GlusterFS Scale Out NAS system aggregating storage over Ethernet or 
Riak CS Riak CS is open source software designed to provide simple, 
available, distributed cloud storage at any scale. Riak CS is S3- 
API compatible and supports per-tenant reporting for billing and 
metering use cases. (object) 
Sheepdog Distributed storage for KVM hypervisors, distributed iSCSI 
By Mark R. Hinkle 
@mrhinkle 
mrhinkle@gmail.com 
CLOUD STORAGE 
Virtualized, Distributed usually on Commodity Hardware 
InkTank -> Red Hat (block, object, file) 
Infiniband (file) 
OpenStack 
Storage 
Long-term object storage system (object) 
Cloud Open EU 2014 - Mixing Your Open Source Cloud Cocktail
Project Sponsors Languages/Frameworks 
By Mark R. Hinkle 
@mrhinkle 
mrhinkle@gmail.com 
PLATFORM-AS-A-SERVICE 
Abstracted Cloud-Scale Run-Time Environments 
CloudFoundry VMware -> Pivotal -> CloudFoundry 
Foundation 
Cloud Open EU 2014 - Mixing Your Open Source Cloud Cocktail 
Spring for Java, Ruby for Rails and 
Sinatra, node.js, Grails, Scala on 
Lift and more via partners (e.g. 
Python, PHP) 
Cloudify Gigaspaces [Groovy for deployment recipes] 
OpenShift Origin Red Hat Java, Ruby, PHP, Perl and Python 
Apache Stratos WSO2 - >Apache Stratus PHP, Tomcat, MySQL “cartridges”
Decoupling of the control and data planes of the network to 
improve efficiency. Communication from a SDN controller via a 
protocol to network devices both physical and virtual. 
Abstractions allow for programmable networks. 
Network can be changed quickly via a controller 
Network offerings can match virtualization offerings for finer 
grained security in a highly volatile compute landscape. 
By Mark R. Hinkle 
@mrhinkle 
mrhinkle@gmail.com 
SOFTWARE DEFINED 
VNirtuEalizTatiWon mOeetRs thKe neItwNorkG(SDN) 
Automation 
Dynamic Networks 
Security 
Heterogeneous Management 
Single control point for various devices. 
Cloud Open EU 2014 - Mixing Your Open Source Cloud Cocktail
By Mark R. Hinkle 
@mrhinkle 
mrhinkle@gmail.com 
Business Applications 
SDN OVERVIEW 
Cloud Open EU 2014 - Mixing Your Open Source Cloud Cocktail 
Network Services 
SDN 
Control 
Software 
API API 
Network Devices Network Devices Network Devices 
Network Devices Network Devices Network Devices 
Application 
Layer 
Control 
Layer 
Infrastructure 
Layer 
Control Data Plane Interface (e.g. OpenFlow)
BENEFITS OF SDN 
Network Virtualization is the final frontier of Software Defined Datacenter 
By Mark R. Hinkle 
@mrhinkle 
mrhinkle@gmail.com 
• Dynamically update networks 
• Automate network 
functionality 
• “Program” security into the 
network 
• Centrally apply policies to 
network and services 
• Optimize networks 
Cloud Open EU 2014 - Mixing Your Open Source Cloud Cocktail
OpenFlow enables networks to 
evolve, by giving a remote 
controller the power to modify 
the behavior of network 
devices, through a well-defined 
"forwarding instruction set". 
The growing OpenFlow 
ecosystem now includes 
routers, switches, virtual 
switches, and access points 
from a range of vendors. 
By Mark R. Hinkle 
@mrhinkle 
mrhinkle@gmail.com 
OPENFLOW 
Virtualization meets the network 
Cloud Open EU 2014 - Mixing Your Open Source Cloud Cocktail
OPEN SOURCE SDN 
Software Defined Network Controllers and more 
Floodlight The Floodlight Open SDN Controller is an enterprise-class, Apache-licensed, Java-based OpenFlow 
Controller. It is supported by a community of developers including a number of engineers from Big Switch 
Networks. - See more at: http://www.projectfloodlight.org/floodlight/#sthash.9IhA1Ih5.dpuf 
Indigo Indigo is an open source project aimed at enabling support for OpenFlow on physical and hypervisor 
switches. Big Switch has helped numerous companies OpenFlow enable their equipment, and we 
provide firmware for a number of popular switches. Indigo is the basis of Switch Light by Big Switch 
Networks. - See more at: http://www.projectfloodlight.org/indigo/#sthash.K7LiHcqc.dpuf 
Lincx LINCX is a pure OpenFlow software switch written in Erlang. It runs within a separate domain under Xen 
Nox NOX is the original OpenFlow controller, and facilitates development of fast C++ controllers on Linux. 
Open Daylight Linux Foundation Collaborative Project based on Cisco One Controller and plugins from numerous 
Open vSwitch Open vSwitch is a open source (ASL 2.0), multilayer virtual switch designed to enable massive network 
automation through programmatic extension, while still supporting standard management interfaces and 
protocols (e.g. NetFlow, sFlow, SPAN, RSPAN, CLI, LACP, 802.1ag). 
By Mark R. Hinkle 
@mrhinkle 
mrhinkle@gmail.com 
Project Description 
hypervisor using LING (erlangonxen.org). 
vendors in development. E.g IBM DOVE 
Cloud Open EU 2014 - Mixing Your Open Source Cloud Cocktail
Open vSwitch is a production quality, 
multilayer virtual switch licensed under the 
open source Apache 2.0 license. It is 
designed to enable massive network 
automation through programmatic extension, 
while still supporting standard management 
interfaces and protocols (e.g. NetFlow, sFlow, 
SPAN, RSPAN, CLI, LACP, 802.1ag). 
To learn more please visit our website: 
http://openvswitch.org/ 
By Mark R. Hinkle 
@mrhinkle 
mrhinkle@gmail.com 
OPEN VSWITCH 
Cloud Open EU 2014 - Mixing Your Open Source Cloud Cocktail
CONFIGURATION MANAGEMENT 
TOOLS Tools with features for configuring cloud infrastructure 
Project Year Started Language License Client/Server 
Chef 2009 Ruby Apache Chef Solo – No 
By Mark R. Hinkle 
@mrhinkle 
mrhinkle@gmail.com 
CFengine 1993 C Apache Yes 
Cloud Open EU 2014 - Mixing Your Open Source Cloud Cocktail 
Chef Server - Yes 
Puppet 2004 Ruby GPL Yes & standalone 
Salt 2011 Python Apache yes 
Hitchhiker’s Guide to the 
Open Cloud by @mrhinkle 
37
CLOUD AUTOMATION TOOLS 
One to many tools for managing large numbers of devices 
Ansible Ansible's SSH-key based access allows contributors to the Fedora Project to assist in 
automating infrastructure while having access limited appropriately. (Originally authored Func) 
Capistrano Utility and framework for executing commands in parallel on multiple remote machines, via SSH. 
It uses a simple DSL that allows you to define tasks, which may be applied to machines in 
certain roles 
RunDeck Rundeck is an open-source process automation and command orchestration tool with a web 
Func Func provides a two-way authenticated system for generically executing tasks, integrations with 
MCollective The Marionette Collective AKA MCollective is a framework to build server orchestration or 
Salt Execute arbitrary shell commands or choose from dozens of pre-built modules of common (or 
Scalr Provide scaling across multiple cloud computing platforms, integrates with Chef. 
By Mark R. Hinkle 
@mrhinkle 
mrhinkle@gmail.com 
Project Description 
console. 
puppet and cobbler. 
parallel job execution systems. 
complex) commands. 
Cloud Open EU 2014 - Mixing Your Open Source Cloud Cocktail
EUREKA PRIAM SIMIAN ARMY 
By Mark R. Hinkle 
@mrhinkle 
mrhinkle@gmail.com 
ASGARD ASTYANAX EDDA 
Cloud Open EU 2014 - Mixing Your Open Source Cloud Cocktail 
39 
http://netflix.github.com 
NETFLIX AWS TOOLBAG 
Tools developed by a super Amazon Web Services Power User
By Mark R. Hinkle 
@mrhinkle 
mrhinkle@gmail.com 
CONTACT ME 
Happy to Chat about Open Source, Cloud or Pittsburgh Sports 
Professional: mark.hinkle@citrix.com 
Personal: mrhinkle@gmail.com 
Phone: 919.228.8049 
Professional: http://open.citrix.com 
Personal: http://www.socializedsoftware.com 
Twitter: @mrhinkle 
Cloud Open EU 2014 - Mixing Your Open Source Cloud Cocktail
By Mark R. Hinkle 
@mrhinkle 
mrhinkle@gmail.com 
APPENDIX A 
Additional Links to related stuff 
Cloud Open EU 2014 - Mixing Your Open Source Cloud Cocktail
ADDITIONAL LINKS 
• Devops Toolchains Group 
• Software Defined Networking: The New Norm for Networks 
(Whitepaper) 
• DevOps Wikipedia Page 
• NoSQL-Database.org – Ultimate Guide to the Non-Relational Universe 
• Open Cloud Initiative 
• NIST Cloud Computing Platform 
• Open Virtualization Format Specs 
• Clouderati Twitter Account 
• Planet DevOps 
• Nicira Whitepaper – It’s Time to Virtualize the Network 
• Why Open vSwitch FAQ 
• Stanford Seminar - Software-Defined Networking at the Crossroads 
By Mark R. Hinkle 
@mrhinkle 
mrhinkle@gmail.com 
Cloud Open EU 2014 - Mixing Your Open Source Cloud Cocktail
By Mark R. Hinkle 
@mrhinkle 
mrhinkle@gmail.com 
ADDITIONAL LINKS (CONT’D) 
• SDN, NFV, and open source: The Operator’s View 
• Puppet Labs: Build a Toolbox for Continuous Delivery 
Cloud Open EU 2014 - Mixing Your Open Source Cloud Cocktail
By Mark R. Hinkle 
@mrhinkle 
mrhinkle@gmail.com 
APPENDIX B 
Stuff I’d liked to have talked 
about but didn’t have time 
Cloud Open EU 2014 - Mixing Your Open Source Cloud Cocktail
Bitnami BitNami provides free, ready to run environments for your favorite open source 
web applications and frameworks, including Drupal, Joomla!, Wordpress, PHP, 
Rails, Django and many more. 
Boxgrinder BoxGrinder is a set of projects that help you grind out appliances for multiple 
Oz Command-line tool that has the ability to create images for common Linux 
SUSE Studio SUSE Studio supports building and deploying directly to cloud services such as 
By Mark R. Hinkle 
@mrhinkle 
mrhinkle@gmail.com 
SOURCING CLOUD APPLIANCES 
Packaging Engines for VMs 
Tool/Project What you can do with them 
virtualization and Cloud providers 
distributions to run on KVM 
Amazon EC2. 
Cloud Open EU 2014 - Mixing Your Open Source Cloud Cocktail
CLOUD MONITORING TOOLS 
Tools with features for monitoring cloud infrastructure 
Project Type of Monitoring Collection Methods 
Cacti / RRDTool Performance SNMP, syslog 
Nagios Availability SNMP,TCP, ICMP, IPMI, 
By Mark R. Hinkle 
@mrhinkle 
mrhinkle@gmail.com 
Graphite Performance Agent 
Cloud Open EU 2014 - Mixing Your Open Source Cloud Cocktail 
syslog 
Sensu Availability Agent 
Zabbix Availability/ Performance and more SNMP, TCP/ICMP, IPMI, 
Synthetic Transactions 
Zenoss Availability, Performance, Event 
Management 
SNMP, ICMP, SSH, syslog, 
WMI 
Hitchhiker’s Guide to the 
Open Cloud by @mrhinkle 
46
CLOUD PROVISIONING TOOLS 
Packaging Engines for VMs 
Can provision 10s to 1000s of machines on various clouds. 
Cobbler Distributed virtual infrastructure using koan (kickstart of a network to PXE 
boot VMs) for Red Hat, OpenSUSE Fedora, Debian, Ubuntu VMs 
Salt Cloud Tool to provision “salted” VMs that can then be updated by a central server 
By Mark R. Hinkle 
@mrhinkle 
mrhinkle@gmail.com 
Project Installation Targets 
Apache Provisionr 
(incubating) 
Crowbar (Bare metal provisioning) 
JuJu Public Clouds - Amazon Web Services HP Cloud, 
Private OpenStack clouds, Bare Metal via MAAS. 
via ZeroMQ 
Cloud Open EU 2014 - Mixing Your Open Source Cloud Cocktail 
Hitchhiker’s Guide to the 
Open Cloud by @mrhinkle 
47
By Mark R. Hinkle 
@mrhinkle 
mrhinkle@gmail.com 
BIG DATA 
Cloud Open EU 2014 - Mixing Your Open Source Cloud Cocktail
API: many » Query Method: MapReduce, Replicaton: , Written in: Java, Concurrency: eventually 
consistent , Misc: like "Big-Table on Amazon Dynamo alike", initiated by Facebook 
CouchDB Document Store API: Memcached API+protocol (binary and ASCII) , most languages, Protocol: Memcached REST interface 
for cluster conf + management, Written in: C/C++ + Erlang (clustering), Replication: Peer to Peer, fully 
consistent, Misc: Transparent topology changes during operation, provides memcached-compatible 
caching buckets 
API: Java / any writer, Protocol: any write call, Query Method: MapReduce Java / any exec, Replication: 
HDFS Replication, Written in: Java 
PI: Thrift (Java, PHP, Perl, Python, Ruby, etc.), Protocol: Thrift, Query Method: HQL, native Thrift API, 
Replication: HDFS Replication, Concurrency: MVCC, Consistency Model: Fully consistent Misc: High 
performance C++ implementation of Google's Bigtable. 
MongoDB Document Store API: BSON, Protocol: C, Query Method: dynamic object-based language & MapReduce, Replication: 
Redis Key Value/ Tuple Store API: Tons of languages, Written in: C, Concurrency: in memory and saves asynchronous disk after a 
defined time. Append only mode available. Different kinds of fsync policies. Replication: Master / Slave, 
Misc: also lists, sets, sorted sets, hashes, queues. 
Riak Key Value / Tuple Store API: JSON, Protocol: REST, Query Method: MapReduce term matching , Scaling: Multiple Masters; Written 
in: Erlang, Concurrency: eventually consistent (stronger then MVCC via Vector Clocks) 
By Mark R. Hinkle 
@mrhinkle 
mrhinkle@gmail.com 
NOSQL DATABASES 
Horizontally scalable unstructured data retrieval 
Name Type Description 
Apache 
Wide Column 
Cassandra 
Store/Families 
HBase Wide Column 
Store/Families 
Hypertable Wide Column 
Store/Families 
Master Slave & Auto-Sharding, Written in: C++,Concurrency 
Cloud Open EU 2014 - Mixing Your Open Source Cloud Cocktail
By Mark R. Hinkle 
@mrhinkle 
mrhinkle@gmail.com 
MAP REDUCE 
Algorithm for Parallelized Data Set Processing 
Problem 
Data 
Master 
Node 
Cloud Open EU 2014 - Mixing Your Open Source Cloud Cocktail 
Worker 
Node 1 
Worker 
Node 2 
Worker 
Node 3 
Solution 
Data 
Map 
Reduce
• Handles large amounts of 
• Stores data in native format 
• Delivers linear scalability at 
• Resilient in case of 
infrastructure failures 
• Transparent application 
By Mark R. Hinkle 
@mrhinkle 
mrhinkle@gmail.com 
APACHE HADOOP 
Apache Project for Parallelized Data Set Processing 
Overview 
• Handles large amounts of 
data 
• Stores data in native format 
• Delivers linear scalability at 
low cost 
• Resilient in case of 
infrastructure failures 
• Transparent application 
scalability 
Features 
data 
low cost 
scalability 
Cloud Open EU 2014 - Mixing Your Open Source Cloud Cocktail
Machine Learning 
By Mark R. Hinkle 
@mrhinkle 
mrhinkle@gmail.com 
APACHE HADOOP ECOSYSTEM 
Non-Relational DB 
Hadoop Hadoop Common 
HDFS 
Distributes & replicates data 
across machines 
Cloud Open EU 2014 - Mixing Your Open Source Cloud Cocktail 
MapReduce 
Distributes & monitors tasks 
Hive 
Data warehouse that 
provides SQL interface. 
Ad hoc projection of 
data structure to 
unstructured 
MapReduce 
• Parallel programming 
• Handles large data blocks 
HBase 
Column-oriented 
schema-less distributed 
DB modeled after 
Google’s BigTable 
Random real time 
read/write. 
Scripting 
Pig 
Platform for 
manipulating and 
analyzing large data sets. 
Scripting language for 
analysts. 
Mahout 
Machine learning 
libraries for 
recommendations , 
clustering, classifications 
and item sets. 
Chuckwa Zookeeper

More Related Content

What's hot

Cloud Computing and Open Source
Cloud Computing and Open SourceCloud Computing and Open Source
Cloud Computing and Open Source
John Willis
 

What's hot (20)

Future of Open Source in a Cloudy World
Future of Open Source in a Cloudy WorldFuture of Open Source in a Cloudy World
Future of Open Source in a Cloudy World
 
Fossetcon: Crash Course on Open Source Cloud Computing
Fossetcon: Crash Course on Open Source Cloud ComputingFossetcon: Crash Course on Open Source Cloud Computing
Fossetcon: Crash Course on Open Source Cloud Computing
 
Keynote Devops Days Amsterdam - Hacking IT, Culture over Code Bringing Devops...
Keynote Devops Days Amsterdam - Hacking IT, Culture over Code Bringing Devops...Keynote Devops Days Amsterdam - Hacking IT, Culture over Code Bringing Devops...
Keynote Devops Days Amsterdam - Hacking IT, Culture over Code Bringing Devops...
 
Open Source Tool Chains for Cloud Computing
Open Source Tool Chains for Cloud ComputingOpen Source Tool Chains for Cloud Computing
Open Source Tool Chains for Cloud Computing
 
Cloud 2.0: Containers, Microservices and Cloud Hybridization
Cloud 2.0: Containers, Microservices and Cloud HybridizationCloud 2.0: Containers, Microservices and Cloud Hybridization
Cloud 2.0: Containers, Microservices and Cloud Hybridization
 
RICON 2014 - Build a Cloud Day - Crash Course Open Source Cloud Computing
RICON 2014 - Build a Cloud Day - Crash Course Open Source Cloud ComputingRICON 2014 - Build a Cloud Day - Crash Course Open Source Cloud Computing
RICON 2014 - Build a Cloud Day - Crash Course Open Source Cloud Computing
 
Crash Course in Open Source Cloud Computing
Crash Course in Open Source Cloud ComputingCrash Course in Open Source Cloud Computing
Crash Course in Open Source Cloud Computing
 
Linux Foundation Collaboration Summit: Hitchhiker's Guide to the Cloud
Linux Foundation Collaboration Summit: Hitchhiker's Guide to the CloudLinux Foundation Collaboration Summit: Hitchhiker's Guide to the Cloud
Linux Foundation Collaboration Summit: Hitchhiker's Guide to the Cloud
 
Linuxcon Europe 2011: Overview - Building Cloud Computing Environments
Linuxcon Europe 2011:  Overview - Building Cloud Computing EnvironmentsLinuxcon Europe 2011:  Overview - Building Cloud Computing Environments
Linuxcon Europe 2011: Overview - Building Cloud Computing Environments
 
Cloud Computing and Open Source
Cloud Computing and Open SourceCloud Computing and Open Source
Cloud Computing and Open Source
 
Build a Cloud Day SF - Crash Course on Open Source Cloud Computing
Build a Cloud Day SF - Crash Course on Open Source Cloud ComputingBuild a Cloud Day SF - Crash Course on Open Source Cloud Computing
Build a Cloud Day SF - Crash Course on Open Source Cloud Computing
 
LinuxFest Northwest: Crash Course in Open Source Cloud Computing
LinuxFest Northwest: Crash Course in Open Source Cloud Computing LinuxFest Northwest: Crash Course in Open Source Cloud Computing
LinuxFest Northwest: Crash Course in Open Source Cloud Computing
 
OpenCloudConf: It takes an (Open Source) Village to Build a Cloud
OpenCloudConf: It takes an (Open Source) Village to Build a CloudOpenCloudConf: It takes an (Open Source) Village to Build a Cloud
OpenCloudConf: It takes an (Open Source) Village to Build a Cloud
 
Cloud Computing Expo West - Crash Course in Open Source Cloud Computing
Cloud Computing Expo West - Crash Course in Open Source Cloud ComputingCloud Computing Expo West - Crash Course in Open Source Cloud Computing
Cloud Computing Expo West - Crash Course in Open Source Cloud Computing
 
DockerCon 16 - Moby's Cool Hack Session
DockerCon 16 - Moby's Cool Hack SessionDockerCon 16 - Moby's Cool Hack Session
DockerCon 16 - Moby's Cool Hack Session
 
DockerCon EU 2015: Sparebank; a journey towards Docker
DockerCon EU 2015: Sparebank; a journey towards DockerDockerCon EU 2015: Sparebank; a journey towards Docker
DockerCon EU 2015: Sparebank; a journey towards Docker
 
DevSecOps in a cloudnative world
DevSecOps in a cloudnative worldDevSecOps in a cloudnative world
DevSecOps in a cloudnative world
 
The Tao of Docker - ITES 2018
The Tao of Docker - ITES 2018The Tao of Docker - ITES 2018
The Tao of Docker - ITES 2018
 
server to cloud: converting a legacy platform to an open source paas
server to cloud:  converting a legacy platform to an open source paasserver to cloud:  converting a legacy platform to an open source paas
server to cloud: converting a legacy platform to an open source paas
 
Taking the Next Hot Mobile Game Live with Docker and IBM SoftLayer
Taking the Next Hot Mobile Game Live with Docker and IBM SoftLayerTaking the Next Hot Mobile Game Live with Docker and IBM SoftLayer
Taking the Next Hot Mobile Game Live with Docker and IBM SoftLayer
 

Viewers also liked

Infographic - The True Cost of Downtime
Infographic - The True Cost of DowntimeInfographic - The True Cost of Downtime
Infographic - The True Cost of Downtime
AppDynamics
 

Viewers also liked (19)

FLUX - Crash Course in Cloud 2.0
FLUX - Crash Course in Cloud 2.0 FLUX - Crash Course in Cloud 2.0
FLUX - Crash Course in Cloud 2.0
 
Science logic cloudstack london meetup 2015 02-11
Science logic cloudstack london meetup 2015 02-11Science logic cloudstack london meetup 2015 02-11
Science logic cloudstack london meetup 2015 02-11
 
Army PEO EIS Cloud Architecture
Army PEO EIS Cloud Architecture   Army PEO EIS Cloud Architecture
Army PEO EIS Cloud Architecture
 
Agile Cloud Conference 2 Introduction - John Brennan
Agile Cloud Conference 2 Introduction - John BrennanAgile Cloud Conference 2 Introduction - John Brennan
Agile Cloud Conference 2 Introduction - John Brennan
 
ICH Agile Cloud Session 1-Highlights /Prospective Svc Offerings Kevin Jackson
ICH Agile Cloud Session 1-Highlights /Prospective Svc Offerings   Kevin JacksonICH Agile Cloud Session 1-Highlights /Prospective Svc Offerings   Kevin Jackson
ICH Agile Cloud Session 1-Highlights /Prospective Svc Offerings Kevin Jackson
 
Improving Cybersecurity and Resilience Through Acquisition Emile Monette GSA
Improving Cybersecurity and Resilience Through Acquisition   Emile Monette GSAImproving Cybersecurity and Resilience Through Acquisition   Emile Monette GSA
Improving Cybersecurity and Resilience Through Acquisition Emile Monette GSA
 
@AgileCLoud_ICH Presentation - 20140521 US Navy OPNAV - Capt Christopher Page
@AgileCLoud_ICH Presentation - 20140521 US Navy OPNAV - Capt Christopher Page@AgileCLoud_ICH Presentation - 20140521 US Navy OPNAV - Capt Christopher Page
@AgileCLoud_ICH Presentation - 20140521 US Navy OPNAV - Capt Christopher Page
 
The Five Essential Truths of the Application Economy
The Five Essential Truths of the Application EconomyThe Five Essential Truths of the Application Economy
The Five Essential Truths of the Application Economy
 
IT Certifications that Pay Well - And Why to Consider CloudMASTER
IT Certifications that Pay Well - And Why to Consider CloudMASTERIT Certifications that Pay Well - And Why to Consider CloudMASTER
IT Certifications that Pay Well - And Why to Consider CloudMASTER
 
How Data-Driven Continuous Intelligence Benefits Aid the Development and Mana...
How Data-Driven Continuous Intelligence Benefits Aid the Development and Mana...How Data-Driven Continuous Intelligence Benefits Aid the Development and Mana...
How Data-Driven Continuous Intelligence Benefits Aid the Development and Mana...
 
DevOps Transformations
DevOps TransformationsDevOps Transformations
DevOps Transformations
 
Infographic - The True Cost of Downtime
Infographic - The True Cost of DowntimeInfographic - The True Cost of Downtime
Infographic - The True Cost of Downtime
 
Lean Security
Lean SecurityLean Security
Lean Security
 
Shirt Ops: How to make awesome t-shirts for your conference
Shirt Ops: How to make awesome t-shirts for your conferenceShirt Ops: How to make awesome t-shirts for your conference
Shirt Ops: How to make awesome t-shirts for your conference
 
Pragmatic Security and Rugged DevOps - SXSW 2015
Pragmatic Security and Rugged DevOps - SXSW 2015Pragmatic Security and Rugged DevOps - SXSW 2015
Pragmatic Security and Rugged DevOps - SXSW 2015
 
Deep Organisational Transformation a model for a higher Agility
Deep Organisational Transformation a model for a higher AgilityDeep Organisational Transformation a model for a higher Agility
Deep Organisational Transformation a model for a higher Agility
 
Application Security Epistemology in a Continuous Delivery World
Application Security Epistemology in a Continuous Delivery WorldApplication Security Epistemology in a Continuous Delivery World
Application Security Epistemology in a Continuous Delivery World
 
Converged IoT Systems: Bringing the Data Center to the Edge of Everything
Converged IoT Systems: Bringing the Data Center to the Edge of EverythingConverged IoT Systems: Bringing the Data Center to the Edge of Everything
Converged IoT Systems: Bringing the Data Center to the Edge of Everything
 
Containers - (Austin Cloud Meetup April 2016)
Containers - (Austin Cloud Meetup April 2016)Containers - (Austin Cloud Meetup April 2016)
Containers - (Austin Cloud Meetup April 2016)
 

Similar to CloudOpen 2014 - Mixing Your Open Source Cloud Cocktail

LinuxFest NW 2013: Hitchhiker's Guide to Open Source Cloud Computing
LinuxFest NW 2013: Hitchhiker's Guide to Open Source Cloud ComputingLinuxFest NW 2013: Hitchhiker's Guide to Open Source Cloud Computing
LinuxFest NW 2013: Hitchhiker's Guide to Open Source Cloud Computing
Mark Hinkle
 
Docker and OpenStack Boston Meetup
Docker and OpenStack Boston MeetupDocker and OpenStack Boston Meetup
Docker and OpenStack Boston Meetup
Kamesh Pemmaraju
 
OSCON 2013 - The Hitchiker’s Guide to Open Source Cloud Computing
OSCON 2013 - The Hitchiker’s Guide to Open Source Cloud ComputingOSCON 2013 - The Hitchiker’s Guide to Open Source Cloud Computing
OSCON 2013 - The Hitchiker’s Guide to Open Source Cloud Computing
Mark Hinkle
 

Similar to CloudOpen 2014 - Mixing Your Open Source Cloud Cocktail (20)

Crash Course in Cloud Computing
Crash Course in Cloud ComputingCrash Course in Cloud Computing
Crash Course in Cloud Computing
 
Docker Container As A Service - JAX 2016
Docker Container As A Service - JAX 2016Docker Container As A Service - JAX 2016
Docker Container As A Service - JAX 2016
 
0507 057 01 98 * Adana Klima Tamir Servisi
0507 057 01 98 * Adana Klima Tamir Servisi0507 057 01 98 * Adana Klima Tamir Servisi
0507 057 01 98 * Adana Klima Tamir Servisi
 
LinuxFest NW 2013: Hitchhiker's Guide to Open Source Cloud Computing
LinuxFest NW 2013: Hitchhiker's Guide to Open Source Cloud ComputingLinuxFest NW 2013: Hitchhiker's Guide to Open Source Cloud Computing
LinuxFest NW 2013: Hitchhiker's Guide to Open Source Cloud Computing
 
Built in NM - Meetup Talk - CTL Labs
Built in NM - Meetup Talk - CTL LabsBuilt in NM - Meetup Talk - CTL Labs
Built in NM - Meetup Talk - CTL Labs
 
Are VM Passé?
Are VM Passé? Are VM Passé?
Are VM Passé?
 
Are VMs Passé?
Are VMs Passé?Are VMs Passé?
Are VMs Passé?
 
Docker and OpenStack Boston Meetup
Docker and OpenStack Boston MeetupDocker and OpenStack Boston Meetup
Docker and OpenStack Boston Meetup
 
Oscon 2017: Build your own container-based system with the Moby project
Oscon 2017: Build your own container-based system with the Moby projectOscon 2017: Build your own container-based system with the Moby project
Oscon 2017: Build your own container-based system with the Moby project
 
Docker EE 2.0 choice security agility by Erik Tan,Tech Insights Singapore - 2...
Docker EE 2.0 choice security agility by Erik Tan,Tech Insights Singapore - 2...Docker EE 2.0 choice security agility by Erik Tan,Tech Insights Singapore - 2...
Docker EE 2.0 choice security agility by Erik Tan,Tech Insights Singapore - 2...
 
Docker Platform and Ecosystem Nov 2015
Docker Platform and Ecosystem Nov 2015Docker Platform and Ecosystem Nov 2015
Docker Platform and Ecosystem Nov 2015
 
(APP303) Lightning Fast Deploys with Docker Containers and AWS | AWS re:Inven...
(APP303) Lightning Fast Deploys with Docker Containers and AWS | AWS re:Inven...(APP303) Lightning Fast Deploys with Docker Containers and AWS | AWS re:Inven...
(APP303) Lightning Fast Deploys with Docker Containers and AWS | AWS re:Inven...
 
DockerCon SF 2015: Docker Community in China
DockerCon SF 2015: Docker Community in ChinaDockerCon SF 2015: Docker Community in China
DockerCon SF 2015: Docker Community in China
 
Docker open stack boston
Docker open stack bostonDocker open stack boston
Docker open stack boston
 
OpenStack Boston
OpenStack BostonOpenStack Boston
OpenStack Boston
 
Docker Overview - Rise of the Containers
Docker Overview - Rise of the ContainersDocker Overview - Rise of the Containers
Docker Overview - Rise of the Containers
 
OSCON 2013 - The Hitchiker’s Guide to Open Source Cloud Computing
OSCON 2013 - The Hitchiker’s Guide to Open Source Cloud ComputingOSCON 2013 - The Hitchiker’s Guide to Open Source Cloud Computing
OSCON 2013 - The Hitchiker’s Guide to Open Source Cloud Computing
 
Introduction to Docker - VIT Campus
Introduction to Docker - VIT CampusIntroduction to Docker - VIT Campus
Introduction to Docker - VIT Campus
 
Open stack + Containers + Hyper-V
Open stack + Containers + Hyper-VOpen stack + Containers + Hyper-V
Open stack + Containers + Hyper-V
 
Getting Started with Docker - Nick Stinemates
Getting Started with Docker - Nick StinematesGetting Started with Docker - Nick Stinemates
Getting Started with Docker - Nick Stinemates
 

More from Mark Hinkle

Keynote: Community, Code and Companies - Mark Hinkle, Director of Open Source...
Keynote: Community, Code and Companies - Mark Hinkle, Director of Open Source...Keynote: Community, Code and Companies - Mark Hinkle, Director of Open Source...
Keynote: Community, Code and Companies - Mark Hinkle, Director of Open Source...
Mark Hinkle
 
Cloud Expo East 2013: Essential Open Source Software for Building the Open Cloud
Cloud Expo East 2013: Essential Open Source Software for Building the Open CloudCloud Expo East 2013: Essential Open Source Software for Building the Open Cloud
Cloud Expo East 2013: Essential Open Source Software for Building the Open Cloud
Mark Hinkle
 
Hitchhiker's Guide to Open Source Cloud Computing
Hitchhiker's Guide to Open Source Cloud ComputingHitchhiker's Guide to Open Source Cloud Computing
Hitchhiker's Guide to Open Source Cloud Computing
Mark Hinkle
 

More from Mark Hinkle (17)

Serverless 2019 and Beyond
Serverless 2019 and Beyond Serverless 2019 and Beyond
Serverless 2019 and Beyond
 
Triangle Kubernetes Meet-Up - Serverless is FaaS-tastic
Triangle Kubernetes Meet-Up - Serverless is FaaS-tasticTriangle Kubernetes Meet-Up - Serverless is FaaS-tastic
Triangle Kubernetes Meet-Up - Serverless is FaaS-tastic
 
Serverless is FaaS-tastic - Columbia Open Source Meet-Up
Serverless is FaaS-tastic - Columbia Open Source Meet-Up Serverless is FaaS-tastic - Columbia Open Source Meet-Up
Serverless is FaaS-tastic - Columbia Open Source Meet-Up
 
Serverless is FaaS-tastic - All Things Open Meet-up
Serverless is FaaS-tastic - All Things Open Meet-upServerless is FaaS-tastic - All Things Open Meet-up
Serverless is FaaS-tastic - All Things Open Meet-up
 
Keynote - Open Source 101 - How JavaScript Became a Legitimate Open Source En...
Keynote - Open Source 101 - How JavaScript Became a Legitimate Open Source En...Keynote - Open Source 101 - How JavaScript Became a Legitimate Open Source En...
Keynote - Open Source 101 - How JavaScript Became a Legitimate Open Source En...
 
Keynote All Things Open - Open Source: The Punk Rock of the 21st Century
Keynote All Things Open - Open Source: The Punk Rock of the 21st CenturyKeynote All Things Open - Open Source: The Punk Rock of the 21st Century
Keynote All Things Open - Open Source: The Punk Rock of the 21st Century
 
All Things Open SDN, NFV and Open Daylight
All Things Open SDN, NFV and Open Daylight All Things Open SDN, NFV and Open Daylight
All Things Open SDN, NFV and Open Daylight
 
Cloud 2.0 - How Containers, Microservices and Open Source Software are Redefi...
Cloud 2.0 - How Containers, Microservices and Open Source Software are Redefi...Cloud 2.0 - How Containers, Microservices and Open Source Software are Redefi...
Cloud 2.0 - How Containers, Microservices and Open Source Software are Redefi...
 
Keynote: Community, Code and Companies - Mark Hinkle, Director of Open Source...
Keynote: Community, Code and Companies - Mark Hinkle, Director of Open Source...Keynote: Community, Code and Companies - Mark Hinkle, Director of Open Source...
Keynote: Community, Code and Companies - Mark Hinkle, Director of Open Source...
 
ApacheCon 2014; Let Me Help You. Don’t Fear the Man with the Free T-Shirts
ApacheCon 2014; Let Me Help You. Don’t Fear the Man with the Free T-ShirtsApacheCon 2014; Let Me Help You. Don’t Fear the Man with the Free T-Shirts
ApacheCon 2014; Let Me Help You. Don’t Fear the Man with the Free T-Shirts
 
Linuxcon Europe 2013 | Keynote: We Won What's Next
Linuxcon Europe 2013 | Keynote: We Won What's NextLinuxcon Europe 2013 | Keynote: We Won What's Next
Linuxcon Europe 2013 | Keynote: We Won What's Next
 
Cloud Expo Silicon Valley 2013 | Why Lease When You Can Buy Your Cloud
Cloud Expo Silicon Valley 2013 | Why Lease When You Can Buy Your CloudCloud Expo Silicon Valley 2013 | Why Lease When You Can Buy Your Cloud
Cloud Expo Silicon Valley 2013 | Why Lease When You Can Buy Your Cloud
 
LinuxCon North America 2013: Why Lease When You Can Buy Your Cloud
LinuxCon North America 2013: Why Lease When You Can Buy Your CloudLinuxCon North America 2013: Why Lease When You Can Buy Your Cloud
LinuxCon North America 2013: Why Lease When You Can Buy Your Cloud
 
OSCON 2013 - Keynote - Creating Communities of Inclusion
OSCON 2013 - Keynote - Creating Communities of InclusionOSCON 2013 - Keynote - Creating Communities of Inclusion
OSCON 2013 - Keynote - Creating Communities of Inclusion
 
Cloud Expo East 2013: Essential Open Source Software for Building the Open Cloud
Cloud Expo East 2013: Essential Open Source Software for Building the Open CloudCloud Expo East 2013: Essential Open Source Software for Building the Open Cloud
Cloud Expo East 2013: Essential Open Source Software for Building the Open Cloud
 
Cloudstack Users
Cloudstack UsersCloudstack Users
Cloudstack Users
 
Hitchhiker's Guide to Open Source Cloud Computing
Hitchhiker's Guide to Open Source Cloud ComputingHitchhiker's Guide to Open Source Cloud Computing
Hitchhiker's Guide to Open Source Cloud Computing
 

CloudOpen 2014 - Mixing Your Open Source Cloud Cocktail

  • 1. Mixing Your Open Source Cloud Cocktail Mark Hinkle Senior Director, Open Source Solutions Citrix Inc. mark.hinkle@citrix.com mrhinkle@gmail.com @mrhinkle Last updated: 10/13/14
  • 2. By Mark R. Hinkle @mrhinkle mrhinkle@gmail.com ABOUT ME I Help Build Open Source Ecosystems Open Source Experience • Manage Citrix Open Source Business Office • Apache CloudStack Committer and PMC Member • Advisory boards Gluster and Xen Project • Joined Citrix via Cloud.com acquisition July 2011 • Zenoss Core open source project to 100,000 users, 1.5 million downloads • Former LinuxWorld Magazine Editor-in-Chief • Open Management Consortium organizer • Author - “Windows to Linux Business Desktop Migration” – Thomson • NetDirector Project - Open Source Configuration Management Cloud Open EU 2014 - Mixing Your Open Source Cloud Cocktail
  • 3. http://www.slideshare.net/socializedsoftware Attribution — You must give appropriate credit, provide a link to the license, and indicate if changes were made. You may do so in any reasonable manner, but not in any way that suggests the licensor endorses you or your use. ShareAlike — If you remix, transform, or build upon the material, you must distribute your contributions under the same license as the original. By Mark R. Hinkle @mrhinkle mrhinkle@gmail.com Slides Available on Slideshare: Creative Commons Attributions-ShareAlike 4.0 International Share — copy and redistribute the material in any medium or format Adapt — remix, transform, and build upon the material for any purpose, even commercially. The licensor cannot revoke these freedoms as long as you follow the license terms. Cloud Open EU 2014 - Mixing Your Open Source Cloud Cocktail
  • 4. By Mark R. Hinkle @mrhinkle mrhinkle@gmail.com AGENDA • Vetting Open Source Cloud Projects • “What is Cloud” in 60 Seconds • Virtualization • Infrastructure-as-a-Service • Platform-as-a-Service • SDN • Open Source for the Amazon Web Services Cloud Open EU 2014 - Mixing Your Open Source Cloud Cocktail
  • 5. • Code Velocity • Committers • Committer Reputation • User-driven or Vendor-Driven Innovation • User Activity • Corporate Support* • Reputation of Foundation* By Mark R. Hinkle @mrhinkle mrhinkle@gmail.com VETTING OPEN SOURCE HPowR caOn yJouE telCl if TtheSy’re Legit Cloud Open EU 2014 - Mixing Your Open Source Cloud Cocktail
  • 6. http://www.ohloh.net http://activity.openstack.org By Mark R. Hinkle @mrhinkle mrhinkle@gmail.com OPEN SOURCE ANALYSIS Visualizing Community Activity Cloud Open EU 2014 - Mixing Your Open Source Cloud Cocktail
  • 7. By Mark R. Hinkle @mrhinkle mrhinkle@gmail.com 60 SECOND CLOUD DEFINITION Just because Software Marketing Guys Think it’s the Internet 5 CHARACTERISTICS OF CLOUD 1. On-Demand Self-Service 2. Broad Network Access 3. Resource Pooling 4. Rapid Elasticity 5. Measured Service User Cloud a.k.a. SOFTWARE-AS-A-SERVICE Developer Cloud a.k.a. PLATFORM-AS-A-SERVICE Systems Cloud a.k.a. INFRASTRUCTURE-AS-A-SERVICE Cloud Open EU 2014 - Mixing Your Open Source Cloud Cocktail
  • 8. By Mark R. Hinkle @mrhinkle mrhinkle@gmail.com SCALE-UP SCALE OUT Elasticity and the cloud Vertical Scaling (Scale-Up) Allocate additional resources to VMs, requires a reboot, no need for distributed app logic, single-point of OS failure Horizontal Scaling (Scale-Out) Application needs logic to work in distributed fashion (e.g. HA-Proxy and Apache Hadoop) Cloud Open EU 2014 - Mixing Your Open Source Cloud Cocktail
  • 9. HYPERVISORS AND CONTAINERS Differences in virtualization By Mark R. Hinkle @mrhinkle mrhinkle@gmail.com Type 1 Hypervisors VMware, Xen Project, Hyper-V Type 2 Hypervisors KVM, VirtualBox Cloud Open EU 2014 - Mixing Your Open Source Cloud Cocktail Containers LXC
  • 10. By Mark R. Hinkle @mrhinkle mrhinkle@gmail.com VIRTUALIZATION Carving up compute resources OPEN SOURCE • Xen Project • Citrix XenServer • KVM • VirtualBox • OpenVZ • LXC PROPRIETARY • VMware • Microsoft Hyper-V • OracleVM (Based on Xen Project) Cloud Open EU 2014 - Mixing Your Open Source Cloud Cocktail
  • 11. OPEN VIRTUALIZATION FORMATS Virtualization Payloads Formats for hypervisors/cloud technologies: • Amazon - AMI • KVM – QCOW2 • VMware – VMDK • Xen Project– IMG • Hyper-V - VHD – Virtual Hard Disk • LXC – local file system/mount point - By Mark R. Hinkle @mrhinkle mrhinkle@gmail.com Open Virtualization Format (OVF) is an open standard for packaging and distributing virtual appliances or more generally software to be run in virtual machines. Docker* Cloud Open EU 2014 - Mixing Your Open Source Cloud Cocktail
  • 12. • Lets your run a Linux system within • A container is a group of processes on a Linux box, put together the provide an isolated environment • From the inside, it looks like a VM • Externally it looks like normal processes • “chroot on steroids” By Mark R. Hinkle @mrhinkle mrhinkle@gmail.com LINUX CONTAINERS (LXC) “Lightweight” Linux Virtualization another Linux system Cloud Open EU 2014 - Mixing Your Open Source Cloud Cocktail
  • 13. • Different file formats for virtual machines • VMware uses vmdk file format, Xen and Hyper- V use VHD, KVM uses Raw or QCOW2 • Guest images may be “processor architecture” • VMware and Xen can manage SCSI devices, but • KVM and Xen can use virtio drivers but not • VMware uses a proprietary agent inside the guest OS (VMware tools) which does not work with Xen or KVM • Xen uses VirtIo and ParaVirtualized drivers, Xen By Mark R. Hinkle @mrhinkle mrhinkle@gmail.com THE PORTABILITY PROBLEM Containers compared to Hardware Virtualization bound KVM cannot VMware uses Cloud Open EU 2014 - Mixing Your Open Source Cloud Cocktail
  • 14. • Code – Application is stored • Build – Code is built (Jenkins) • Test – Unit tests are By Mark R. Hinkle @mrhinkle mrhinkle@gmail.com CONTINUOUS INTEGRATION Rebuild Applications on any Cloud and/or Virtualized Infrastructure in a repository (Subversion,Git) automated (Jenkins) • Deploy – Deploy code to server various ways Code Build Test Deploy Thoughtworks Go – Open Source Continuous Deliver System Cloud Open EU 2014 - Mixing Your Open Source Cloud Cocktail
  • 15. PACKER MULTIPLATFORM VM CREATION Packer is easy to use and automates the creation of any type of machine image. It embraces modern configuration management by encouraging you to use automated scripts to install and configure the software within your Packer-made images. By Mark R. Hinkle @mrhinkle mrhinkle@gmail.com To learn more please visit: www.packer.io Open source Automation for VMs Cloud Open EU 2014 - Mixing Your Open Source Cloud Cocktail
  • 16. DOCKER CONTAINER PACKAGING Open source LXC Packaging Engine Docker is an open-source project to easily create lightweight, portable, self-sufficient containers from any application. The same container that a developer builds and tests on a laptop can run at scale, in production, on VMs, bare metal, public clouds and more. By Mark R. Hinkle @mrhinkle mrhinkle@gmail.com To learn more please visit: www.docker.io Cloud Open EU 2014 - Mixing Your Open Source Cloud Cocktail
  • 17. • Compliment to LXC not a replacement • Managed daemonized processes on Linux • Create ability to re-use and manage similar • Content agnostic • Hardware agnostic • Easy to automate • Integrated with other tools: Chef, OpenShift, By Mark R. Hinkle @mrhinkle mrhinkle@gmail.com WHAT IS DOCKER System for Managing and Deploying LXC Containers using LXC applications Puppet, VMware, etc. Cloud Open EU 2014 - Mixing Your Open Source Cloud Cocktail
  • 18. DOCKER’S GROWING ECOSYSTEM By Mark R. Hinkle @mrhinkle mrhinkle@gmail.com Cloud Open EU 2014 - Mixing Your Open Source Cloud Cocktail
  • 19. Kubernetes builds on top of Docker to construct a clustered container scheduling service. Kubernetes enables users to ask a cluster to run a set of containers. The system will automatically pick worker nodes to run those containers on, which we think of more as "scheduling" than "orchestration” To learn more please visit: https://github.com/GoogleCloudPlatform/kubernetes Greek for Shipmaster By Mark R. Hinkle @mrhinkle mrhinkle@gmail.com KUBERNETES Container Cluster Management – Scheduler Cloud Open EU 2014 - Mixing Your Open Source Cloud Cocktail
  • 20. DOCKER RELATED PROJECTS • Fig -Fast, isolated development environments • Flynn - Next-generation application platform • Panamax – Drag-and-Drop Docker Containerization • Project Atomic – JEOS designed to run Docker containers • Weave – The Docker Network • 13,000+ Docker-related repos on Github By Mark R. Hinkle @mrhinkle mrhinkle@gmail.com Cloud Open EU 2014 - Mixing Your Open Source Cloud Cocktail
  • 21. Apache Mesos is a cluster manager that simplifies the complexity of running applications on a shared pool of servers. Largely supported by Twitter, used by LinkedIn, AirBNB too. Features • Fault-tolerant replicated master using ZooKeeper • Scalability to 10,000s of nodes • Isolation between tasks with Linux Containers • Multi-resource scheduling (memory and CPU aware) • Java, Python and C++ APIs for developing new By Mark R. Hinkle @mrhinkle mrhinkle@gmail.com APACHE MESOS One to many tools for managing large numbers of devices parallel applications • Web UI for viewing cluster state To learn more please visit: http://mesos.apache.org/ Cloud Open EU 2014 - Mixing Your Open Source Cloud Cocktail
  • 22. ZooKeeper is a centralized service for maintaining configuration information, naming, providing distributed synchronization, and providing group services. All of these kinds of services are used in some form or another by distributed applications By Mark R. Hinkle @mrhinkle mrhinkle@gmail.com APACHE ZOOKEEPER Centralized Server to Service Distributed Apps To learn more please visit: http://zookeeper.apache.org/ Cloud Open EU 2014 - Mixing Your Open Source Cloud Cocktail 22
  • 23. INFRASTRUCTURE-AS-A-SERVICE Compute Orchestration Project Year Started License Virtualization By Mark R. Hinkle @mrhinkle mrhinkle@gmail.com Cloud Open EU 2014 - Mixing Your Open Source Cloud Cocktail Technologies Apache CloudStack 2008 Apache (Bare Metal), Xenserver, KVM, LXC VMware Hyper- V Eucalyptus 2006 GPL Xen, KVM, VMware (commercial version) OpenNebula 2005 Apache Xen, KVM, VMware OpenStack 2010 (Developed by NASA by Anso Labs previously) Apache VMware ESX and ESXi, , Xen, XenServer, KVM, LXC, QEMU and Virtual Box
  • 24. By Mark R. Hinkle @mrhinkle mrhinkle@gmail.com OPENSTACK The Boy Band of the Open Source Cloud  Cloud Open EU 2014 - Mixing Your Open Source Cloud Cocktail
  • 25. By Mark R. Hinkle @mrhinkle mrhinkle@gmail.com OPENSTACK SHARED SSpaEn CRomVputIe,C StoEragSe and Networking IDENTITY SERVICE IMAGE SERVICE TELEMETRY SERVICE Cloud Open EU 2014 - Mixing Your Open Source Cloud Cocktail ORCHESTRATION SERVICE
  • 26. EVEN MORE OPENSTACK PROJECTS Span Compute, Storage and Networking • Trove Database Service • Ironic Bare Metal (Ironic) • Marconi Queue Service By Mark R. Hinkle @mrhinkle mrhinkle@gmail.com • Cinder Block Storage Service • Ceilometer Metering/Monitoring • Heat Orchestration Cloud Open EU 2014 - Mixing Your Open Source Cloud Cocktail
  • 27. OPENSTACK SOLUTION PROVIDERS If you can’t do it yourself “OpenStack is not a product. If you are building a large infrastructure, it’s more like a tool kit. It gives you a lot of technologies that do take a lot of effort to integrate.” Chris Kemp, OpenStack Board Member and Co-Founder By Mark R. Hinkle @mrhinkle mrhinkle@gmail.com Cloud Open EU 2014 - Mixing Your Open Source Cloud Cocktail CEO of Piston Computing
  • 28. • Deltacloud(ruby) • Daisein(java) • Jclouds(java) • Libcloud(python) • Fog(ruby) By Mark R. Hinkle @mrhinkle mrhinkle@gmail.com CLOUD APIS Everything (should) have an API in the Cloud Cloud Open EU 2014 - Mixing Your Open Source Cloud Cocktail
  • 29. Project Description Ceph Distributed file storage system developed by DreamHost -> GlusterFS Scale Out NAS system aggregating storage over Ethernet or Riak CS Riak CS is open source software designed to provide simple, available, distributed cloud storage at any scale. Riak CS is S3- API compatible and supports per-tenant reporting for billing and metering use cases. (object) Sheepdog Distributed storage for KVM hypervisors, distributed iSCSI By Mark R. Hinkle @mrhinkle mrhinkle@gmail.com CLOUD STORAGE Virtualized, Distributed usually on Commodity Hardware InkTank -> Red Hat (block, object, file) Infiniband (file) OpenStack Storage Long-term object storage system (object) Cloud Open EU 2014 - Mixing Your Open Source Cloud Cocktail
  • 30. Project Sponsors Languages/Frameworks By Mark R. Hinkle @mrhinkle mrhinkle@gmail.com PLATFORM-AS-A-SERVICE Abstracted Cloud-Scale Run-Time Environments CloudFoundry VMware -> Pivotal -> CloudFoundry Foundation Cloud Open EU 2014 - Mixing Your Open Source Cloud Cocktail Spring for Java, Ruby for Rails and Sinatra, node.js, Grails, Scala on Lift and more via partners (e.g. Python, PHP) Cloudify Gigaspaces [Groovy for deployment recipes] OpenShift Origin Red Hat Java, Ruby, PHP, Perl and Python Apache Stratos WSO2 - >Apache Stratus PHP, Tomcat, MySQL “cartridges”
  • 31. Decoupling of the control and data planes of the network to improve efficiency. Communication from a SDN controller via a protocol to network devices both physical and virtual. Abstractions allow for programmable networks. Network can be changed quickly via a controller Network offerings can match virtualization offerings for finer grained security in a highly volatile compute landscape. By Mark R. Hinkle @mrhinkle mrhinkle@gmail.com SOFTWARE DEFINED VNirtuEalizTatiWon mOeetRs thKe neItwNorkG(SDN) Automation Dynamic Networks Security Heterogeneous Management Single control point for various devices. Cloud Open EU 2014 - Mixing Your Open Source Cloud Cocktail
  • 32. By Mark R. Hinkle @mrhinkle mrhinkle@gmail.com Business Applications SDN OVERVIEW Cloud Open EU 2014 - Mixing Your Open Source Cloud Cocktail Network Services SDN Control Software API API Network Devices Network Devices Network Devices Network Devices Network Devices Network Devices Application Layer Control Layer Infrastructure Layer Control Data Plane Interface (e.g. OpenFlow)
  • 33. BENEFITS OF SDN Network Virtualization is the final frontier of Software Defined Datacenter By Mark R. Hinkle @mrhinkle mrhinkle@gmail.com • Dynamically update networks • Automate network functionality • “Program” security into the network • Centrally apply policies to network and services • Optimize networks Cloud Open EU 2014 - Mixing Your Open Source Cloud Cocktail
  • 34. OpenFlow enables networks to evolve, by giving a remote controller the power to modify the behavior of network devices, through a well-defined "forwarding instruction set". The growing OpenFlow ecosystem now includes routers, switches, virtual switches, and access points from a range of vendors. By Mark R. Hinkle @mrhinkle mrhinkle@gmail.com OPENFLOW Virtualization meets the network Cloud Open EU 2014 - Mixing Your Open Source Cloud Cocktail
  • 35. OPEN SOURCE SDN Software Defined Network Controllers and more Floodlight The Floodlight Open SDN Controller is an enterprise-class, Apache-licensed, Java-based OpenFlow Controller. It is supported by a community of developers including a number of engineers from Big Switch Networks. - See more at: http://www.projectfloodlight.org/floodlight/#sthash.9IhA1Ih5.dpuf Indigo Indigo is an open source project aimed at enabling support for OpenFlow on physical and hypervisor switches. Big Switch has helped numerous companies OpenFlow enable their equipment, and we provide firmware for a number of popular switches. Indigo is the basis of Switch Light by Big Switch Networks. - See more at: http://www.projectfloodlight.org/indigo/#sthash.K7LiHcqc.dpuf Lincx LINCX is a pure OpenFlow software switch written in Erlang. It runs within a separate domain under Xen Nox NOX is the original OpenFlow controller, and facilitates development of fast C++ controllers on Linux. Open Daylight Linux Foundation Collaborative Project based on Cisco One Controller and plugins from numerous Open vSwitch Open vSwitch is a open source (ASL 2.0), multilayer virtual switch designed to enable massive network automation through programmatic extension, while still supporting standard management interfaces and protocols (e.g. NetFlow, sFlow, SPAN, RSPAN, CLI, LACP, 802.1ag). By Mark R. Hinkle @mrhinkle mrhinkle@gmail.com Project Description hypervisor using LING (erlangonxen.org). vendors in development. E.g IBM DOVE Cloud Open EU 2014 - Mixing Your Open Source Cloud Cocktail
  • 36. Open vSwitch is a production quality, multilayer virtual switch licensed under the open source Apache 2.0 license. It is designed to enable massive network automation through programmatic extension, while still supporting standard management interfaces and protocols (e.g. NetFlow, sFlow, SPAN, RSPAN, CLI, LACP, 802.1ag). To learn more please visit our website: http://openvswitch.org/ By Mark R. Hinkle @mrhinkle mrhinkle@gmail.com OPEN VSWITCH Cloud Open EU 2014 - Mixing Your Open Source Cloud Cocktail
  • 37. CONFIGURATION MANAGEMENT TOOLS Tools with features for configuring cloud infrastructure Project Year Started Language License Client/Server Chef 2009 Ruby Apache Chef Solo – No By Mark R. Hinkle @mrhinkle mrhinkle@gmail.com CFengine 1993 C Apache Yes Cloud Open EU 2014 - Mixing Your Open Source Cloud Cocktail Chef Server - Yes Puppet 2004 Ruby GPL Yes & standalone Salt 2011 Python Apache yes Hitchhiker’s Guide to the Open Cloud by @mrhinkle 37
  • 38. CLOUD AUTOMATION TOOLS One to many tools for managing large numbers of devices Ansible Ansible's SSH-key based access allows contributors to the Fedora Project to assist in automating infrastructure while having access limited appropriately. (Originally authored Func) Capistrano Utility and framework for executing commands in parallel on multiple remote machines, via SSH. It uses a simple DSL that allows you to define tasks, which may be applied to machines in certain roles RunDeck Rundeck is an open-source process automation and command orchestration tool with a web Func Func provides a two-way authenticated system for generically executing tasks, integrations with MCollective The Marionette Collective AKA MCollective is a framework to build server orchestration or Salt Execute arbitrary shell commands or choose from dozens of pre-built modules of common (or Scalr Provide scaling across multiple cloud computing platforms, integrates with Chef. By Mark R. Hinkle @mrhinkle mrhinkle@gmail.com Project Description console. puppet and cobbler. parallel job execution systems. complex) commands. Cloud Open EU 2014 - Mixing Your Open Source Cloud Cocktail
  • 39. EUREKA PRIAM SIMIAN ARMY By Mark R. Hinkle @mrhinkle mrhinkle@gmail.com ASGARD ASTYANAX EDDA Cloud Open EU 2014 - Mixing Your Open Source Cloud Cocktail 39 http://netflix.github.com NETFLIX AWS TOOLBAG Tools developed by a super Amazon Web Services Power User
  • 40. By Mark R. Hinkle @mrhinkle mrhinkle@gmail.com CONTACT ME Happy to Chat about Open Source, Cloud or Pittsburgh Sports Professional: mark.hinkle@citrix.com Personal: mrhinkle@gmail.com Phone: 919.228.8049 Professional: http://open.citrix.com Personal: http://www.socializedsoftware.com Twitter: @mrhinkle Cloud Open EU 2014 - Mixing Your Open Source Cloud Cocktail
  • 41. By Mark R. Hinkle @mrhinkle mrhinkle@gmail.com APPENDIX A Additional Links to related stuff Cloud Open EU 2014 - Mixing Your Open Source Cloud Cocktail
  • 42. ADDITIONAL LINKS • Devops Toolchains Group • Software Defined Networking: The New Norm for Networks (Whitepaper) • DevOps Wikipedia Page • NoSQL-Database.org – Ultimate Guide to the Non-Relational Universe • Open Cloud Initiative • NIST Cloud Computing Platform • Open Virtualization Format Specs • Clouderati Twitter Account • Planet DevOps • Nicira Whitepaper – It’s Time to Virtualize the Network • Why Open vSwitch FAQ • Stanford Seminar - Software-Defined Networking at the Crossroads By Mark R. Hinkle @mrhinkle mrhinkle@gmail.com Cloud Open EU 2014 - Mixing Your Open Source Cloud Cocktail
  • 43. By Mark R. Hinkle @mrhinkle mrhinkle@gmail.com ADDITIONAL LINKS (CONT’D) • SDN, NFV, and open source: The Operator’s View • Puppet Labs: Build a Toolbox for Continuous Delivery Cloud Open EU 2014 - Mixing Your Open Source Cloud Cocktail
  • 44. By Mark R. Hinkle @mrhinkle mrhinkle@gmail.com APPENDIX B Stuff I’d liked to have talked about but didn’t have time Cloud Open EU 2014 - Mixing Your Open Source Cloud Cocktail
  • 45. Bitnami BitNami provides free, ready to run environments for your favorite open source web applications and frameworks, including Drupal, Joomla!, Wordpress, PHP, Rails, Django and many more. Boxgrinder BoxGrinder is a set of projects that help you grind out appliances for multiple Oz Command-line tool that has the ability to create images for common Linux SUSE Studio SUSE Studio supports building and deploying directly to cloud services such as By Mark R. Hinkle @mrhinkle mrhinkle@gmail.com SOURCING CLOUD APPLIANCES Packaging Engines for VMs Tool/Project What you can do with them virtualization and Cloud providers distributions to run on KVM Amazon EC2. Cloud Open EU 2014 - Mixing Your Open Source Cloud Cocktail
  • 46. CLOUD MONITORING TOOLS Tools with features for monitoring cloud infrastructure Project Type of Monitoring Collection Methods Cacti / RRDTool Performance SNMP, syslog Nagios Availability SNMP,TCP, ICMP, IPMI, By Mark R. Hinkle @mrhinkle mrhinkle@gmail.com Graphite Performance Agent Cloud Open EU 2014 - Mixing Your Open Source Cloud Cocktail syslog Sensu Availability Agent Zabbix Availability/ Performance and more SNMP, TCP/ICMP, IPMI, Synthetic Transactions Zenoss Availability, Performance, Event Management SNMP, ICMP, SSH, syslog, WMI Hitchhiker’s Guide to the Open Cloud by @mrhinkle 46
  • 47. CLOUD PROVISIONING TOOLS Packaging Engines for VMs Can provision 10s to 1000s of machines on various clouds. Cobbler Distributed virtual infrastructure using koan (kickstart of a network to PXE boot VMs) for Red Hat, OpenSUSE Fedora, Debian, Ubuntu VMs Salt Cloud Tool to provision “salted” VMs that can then be updated by a central server By Mark R. Hinkle @mrhinkle mrhinkle@gmail.com Project Installation Targets Apache Provisionr (incubating) Crowbar (Bare metal provisioning) JuJu Public Clouds - Amazon Web Services HP Cloud, Private OpenStack clouds, Bare Metal via MAAS. via ZeroMQ Cloud Open EU 2014 - Mixing Your Open Source Cloud Cocktail Hitchhiker’s Guide to the Open Cloud by @mrhinkle 47
  • 48. By Mark R. Hinkle @mrhinkle mrhinkle@gmail.com BIG DATA Cloud Open EU 2014 - Mixing Your Open Source Cloud Cocktail
  • 49. API: many » Query Method: MapReduce, Replicaton: , Written in: Java, Concurrency: eventually consistent , Misc: like "Big-Table on Amazon Dynamo alike", initiated by Facebook CouchDB Document Store API: Memcached API+protocol (binary and ASCII) , most languages, Protocol: Memcached REST interface for cluster conf + management, Written in: C/C++ + Erlang (clustering), Replication: Peer to Peer, fully consistent, Misc: Transparent topology changes during operation, provides memcached-compatible caching buckets API: Java / any writer, Protocol: any write call, Query Method: MapReduce Java / any exec, Replication: HDFS Replication, Written in: Java PI: Thrift (Java, PHP, Perl, Python, Ruby, etc.), Protocol: Thrift, Query Method: HQL, native Thrift API, Replication: HDFS Replication, Concurrency: MVCC, Consistency Model: Fully consistent Misc: High performance C++ implementation of Google's Bigtable. MongoDB Document Store API: BSON, Protocol: C, Query Method: dynamic object-based language & MapReduce, Replication: Redis Key Value/ Tuple Store API: Tons of languages, Written in: C, Concurrency: in memory and saves asynchronous disk after a defined time. Append only mode available. Different kinds of fsync policies. Replication: Master / Slave, Misc: also lists, sets, sorted sets, hashes, queues. Riak Key Value / Tuple Store API: JSON, Protocol: REST, Query Method: MapReduce term matching , Scaling: Multiple Masters; Written in: Erlang, Concurrency: eventually consistent (stronger then MVCC via Vector Clocks) By Mark R. Hinkle @mrhinkle mrhinkle@gmail.com NOSQL DATABASES Horizontally scalable unstructured data retrieval Name Type Description Apache Wide Column Cassandra Store/Families HBase Wide Column Store/Families Hypertable Wide Column Store/Families Master Slave & Auto-Sharding, Written in: C++,Concurrency Cloud Open EU 2014 - Mixing Your Open Source Cloud Cocktail
  • 50. By Mark R. Hinkle @mrhinkle mrhinkle@gmail.com MAP REDUCE Algorithm for Parallelized Data Set Processing Problem Data Master Node Cloud Open EU 2014 - Mixing Your Open Source Cloud Cocktail Worker Node 1 Worker Node 2 Worker Node 3 Solution Data Map Reduce
  • 51. • Handles large amounts of • Stores data in native format • Delivers linear scalability at • Resilient in case of infrastructure failures • Transparent application By Mark R. Hinkle @mrhinkle mrhinkle@gmail.com APACHE HADOOP Apache Project for Parallelized Data Set Processing Overview • Handles large amounts of data • Stores data in native format • Delivers linear scalability at low cost • Resilient in case of infrastructure failures • Transparent application scalability Features data low cost scalability Cloud Open EU 2014 - Mixing Your Open Source Cloud Cocktail
  • 52. Machine Learning By Mark R. Hinkle @mrhinkle mrhinkle@gmail.com APACHE HADOOP ECOSYSTEM Non-Relational DB Hadoop Hadoop Common HDFS Distributes & replicates data across machines Cloud Open EU 2014 - Mixing Your Open Source Cloud Cocktail MapReduce Distributes & monitors tasks Hive Data warehouse that provides SQL interface. Ad hoc projection of data structure to unstructured MapReduce • Parallel programming • Handles large data blocks HBase Column-oriented schema-less distributed DB modeled after Google’s BigTable Random real time read/write. Scripting Pig Platform for manipulating and analyzing large data sets. Scripting language for analysts. Mahout Machine learning libraries for recommendations , clustering, classifications and item sets. Chuckwa Zookeeper

Editor's Notes

  1. ZooKeeper https://cwiki.apache.org/confluence/display/ZOOKEEPER/Index ZooKeeper is a centralized service for maintaining configuration information, naming, providing distributed synchronization, and providing group services. All of these kinds of services are used in some form or another by distributed applications. Each time they are implemented there is a lot of work that goes into fixing the bugs and race conditions that are inevitable. Because of the difficulty of implementing these kinds of services, applications initially usually skimp on them ,which make them brittle in the presence of change and difficult to manage. Even when done correctly, different implementations of these services lead to management complexity when the applications are deployed. ZooKeeper aims at distilling the essence of these different services into a very simple interface to a centralized coordination service. The service itself is distributed and highly reliable. Consensus, group management, and presence protocols will be implemented by the service so that the applications do not need to implement them on their own. Application specific uses of these will consist of a mixture of specific components of Zoo Keeper and application specific conventions. ZooKeeper Recipes shows how this simple service can be used to build much more powerful abstractions. We have Java and C interfaces to Zoo Keeper for the applications themselves. A variety of client bindings is available for a number of languages including Python, Ruby and Go.
  2. OpenStack Shared Services - https://www.openstack.org/software/openstack-shared-services/ Identity Service OpenStack Identity provides a central directory of users mapped to the OpenStack services they can access. It acts as a common authentication system across the cloud operating system and can integrate with existing backend directory services like LDAP. It supports multiple forms of authentication including standard username and password credentials, token-based systems and AWS-style logins. Image Service The OpenStack Image Service provides discovery, registration and delivery services for disk and server images. The ability to copy or snapshot a server image and immediately store it away is a powerful capability of the OpenStack cloud operating system. Stored images can be used as a template to get new servers up and running quickly and more consistently if you are provisioning multiple servers than installing a server operating system and individually configuring additional services. It can also be used to store and catalog an unlimited number of backups. Telemetry Service The OpenStack Telemetry service aggregates usage and performance data across the services deployed in an OpenStack cloud. This powerful capability provides visibility and insight into the usage of the cloud across dozens of data points and allows cloud operators to view metrics globally or by individual deployed resources. Orchestration Service OpenStack Orchestration is a template-driven engine that allows application developers to describe and automate the deployment of infrastructure. The flexible template language can specify compute, storage and networking configurations as well as detailed post-deployment activity to automate the full provisioning of infrastructure as well as services and applications. Through integration with the Telemetry service, the Orchestration engine can also perform auto-scaling of certain infrastructure elements.
  3. OpenStack Shared Services https://www.openstack.org/software/openstack-shared-services/ Identity Service OpenStack Identity provides a central directory of users mapped to the OpenStack services they can access. It acts as a common authentication system across the cloud operating system and can integrate with existing backend directory services like LDAP. It supports multiple forms of authentication including standard username and password credentials, token-based systems and AWS-style logins. Image Service The OpenStack Image Service provides discovery, registration and delivery services for disk and server images. The ability to copy or snapshot a server image and immediately store it away is a powerful capability of the OpenStack cloud operating system. Stored images can be used as a template to get new servers up and running quickly and more consistently if you are provisioning multiple servers than installing a server operating system and individually configuring additional services. It can also be used to store and catalog an unlimited number of backups. Telemetry Service The OpenStack Telemetry service aggregates usage and performance data across the services deployed in an OpenStack cloud. This powerful capability provides visibility and insight into the usage of the cloud across dozens of data points and allows cloud operators to view metrics globally or by individual deployed resources. Orchestration Service OpenStack Orchestration is a template-driven engine that allows application developers to describe and automate the deployment of infrastructure. The flexible template language can specify compute, storage and networking configurations as well as detailed post-deployment activity to automate the full provisioning of infrastructure as well as services and applications. Through integration with the Telemetry service, the Orchestration engine can also perform auto-scaling of certain infrastructure elements.
  4. Debate: How Many Open Source Platforms Are Enough? http://www.enterprisetech.com/2014/06/23/debate-many-open-source-platforms-enough/ OpenStack came in for the most criticism for issues such cost of deployment and maintenance. “OpenStack is not a product,” Kemp responded. “If you are building a large infrastructure, it’s more like a tool kit. It gives you a lot of technologies that do take a lot of effort to integrate.” The tool kit is used to create a product, Kemp stressed. OpenStack Vendors Canonical Ubuntu OpenStack - http://www.ubuntu.com/cloud/tools/openstack CloudScaling – Elastic Cloud Infrastructure - http://www.cloudscaling.com/ Elastic Cloud Infrastructure – built on OpenStack – enables any IT group to deploy cloud services comparable to the capabilities of the world’s largest and most successful public clouds. Cloudscaling solutions allow your organization to rapidly scale resources, achieve new levels of agility and improve market responsiveness. All with full control and governance in the privacy of your on-premise data center. HP Cloud OS - http://www8.hp.com/us/en/business-solutions/solution.html?compURI=1421776#.UzoD3K1dVDo Based on OpenStack technology, HP Cloud OS provides the foundation for the HP Cloud common architecture across private, public, and hybrid cloud delivery. Piston Cloud Computing - http://www.pistoncloud.com/openstack-cloud-software/ Piston OpenStack is a software product that uses advanced systems intelligence to orchestrate an entire private cloud environment using commodity hardware. Starting with an extremely lightweight custom Linux OS called Iocane Micro-OS™, and using an advanced high-availability system called Moxie Runtime Environment™, Piston keeps your cloud running no matter what – through hardware failure, operator error, upgrades, and power outages. Red Hat Distribution of OpenStack - http://openstack.redhat.com/Main_Page RDO is a community of people using and deploying OpenStack on Red Hat Enterprise Linux, Fedora and distributions derived from these (such as CentOS, Scientific Linux and others). We have documentation to help get started, forums where you can connect with other users, and community-supported packages of the most up-to-date OpenStack releases available for download. Rackspace Private Cloud powered by OpenStack - http://www.rackspace.com/cloud/private/
  5. Types of Tasks Accomplished by an API Provisioning (creating, re-creating, moving, or deleting components e.g. virtual machines, vlans) Configuration (assigning or changing attributes of the architecture such as security and network settings) Cloud Providers Daisein - Jclouds – java API Abstraction Libcloud – started by CloudKick (now Rackspace) to abstract clouds, Apache incubator project Deltacloud – started by Red Hat to abstract clouds, Apache incubator project Fog - provider and abstraction level API across compute and storage, written in Ruby
  6. MapReduce is a programming model for processing large data sets with a parallel, distributed algorithm on a cluster. A MapReduce program is composed of a Map() procedure that performs filtering and sorting (such as sorting students by first name into queues, one queue for each name) and a Reduce() procedure that performs a summary operation (such as counting the number of students in each queue, yielding name frequencies). The "MapReduce System" (also called "infrastructure" or "framework") orchestrates by marshalling the distributed servers, running the various tasks in parallel, managing all communications and data transfers between the various parts of the system, and providing for redundancy and fault tolerance.