Why device, WIFI, and ISP insights are crucial to supporting remote Microsoft...
Basics AWS Presentation
1.
2. Cloud Computing!!!
Computing in which large groups of remote servers.
Goal - Use technologies without much deeper knowledge
Cloud computing is Virtualization
Offer Service Models:
a. Infrastructure as a service (IaaS)
b. Platform as a service (PaaS)
c. Software as a service (SaaS)
AWS Simple Icons: Compute & Networking
3. Cloud Computing!!!
Offer Deployment Models:
a. Public Cloud
b. Private Cloud
c. Hybrid Cloud
http://aws.amazon.com/new/reinvent/?sc_ichann
el=ha&sc_ipage=homepage&sc_icountry=en&sc
_isegment=nc&sc_iplace=hero1&sc_icampaignt
ype=event&sc_icampaign=ha_en_reInvent_201
4_News&sc_icategory=none&sc_idetail=ha_en_
292_1&sc_icontent=ha_292&/
AWS Simple Icons: Compute & Networking
4. Compute & Networking
Amazon Elastic Compute Cloud
Amazon EC2 instance instances AMI DB on
Amazon Elastic MapReduce
instance
instance with
CloudWatch
Elastic IP
Amazon EMR cluster
AWS Simple Icons: Compute & Networking
optimized
instance
new!
MapR M3
engine
MapR M5
engine
MapR M7
engine
engine
new! new!
new!
new!
5. Compute & Networking
Amazon Elastic Compute Cloud
Amazon EC2 instance instances AMI DB on
Amazon Elastic MapReduce
instance
instance with
CloudWatch
Elastic IP
Amazon EMR cluster
AWS Simple Icons: Compute & Networking
optimized
instance
new!
MapR M3
engine
MapR M5
engine
MapR M7
engine
engine
new! new!
new!
new!
11. Database
Amazon DynamoDB
DynamoDB table item items attribute attributes
Amazon Relational Database Service
Amazon RDS RDS DB
instance
RDS DB
instance standby
(Multi-AZ)
RDS DB
instance read
replica
MySQL DB
instance
Oracle DB
instance
MS SQL
instance
AWS Simple Icons: Database
global
secondary
index
new!
new! new! new!
18. Non-Service Specific
AWS cloud
AWS Management
Console
virtual private cloud forums
client mobile client multimedia Internet user users
corporate
data center
traditional server disk generic database tape storage
AWS Simple Icons: Non-Service Specific
19. Groups
Auto Scaling group Availability Zone region security group
Elastic Beanstalk container EC2 instance contents VPC subnet server contents
AWS Simple Icons: Groups
21. Example 1: 2-Tier Auto-scalable Web Application Architecture in 1 AZ
web app
server
security group
Auto Scaling group
Availability Zone #1
www.example.com
security group
root volume
data volume
media.example.com
Elastic Load
Balancing
Amazon S3
bucket
logs
Amazon EBS
snapshot
CloudFront
distribution
EC2 instance
AWS Simple Icons: Example 1
Editor's Notes
Computing in which large groups of remote servers are networked to allow centralized data storage and online access to computer services or resources.
goal of cloud computing is to allow users to take benefit from all of these technologies, without the need for deep knowledge about or expertise with each one of them.
The main enabling technology for cloud computing is virtualization. Virtualization software separates a physical computing device into one or more "virtual" devices, each of which can be easily used and managed to perform computing tasks.
http://en.wikipedia.org/wiki/Cloud_computing
Infrastructure as a service (IaaS) : physical or (more often) virtual machines – and other resources. (A hypervisor, such as Xen, Oracle VirtualBox, KVM, VMware ESX/ESXi, or Hyper-V runs the virtual machines as guests. IaaS clouds often offer additional resources such as a virtual-machine disk image library, raw block storage, and file or object storage, firewalls, load balancers, IP addresses, virtual local area networks (VLANs), and software bundles.[49] IaaS-cloud providers supply these resources on-demand from their large pools installed in data centers.
Platform as a service (PaaS) : In the PaaS models, cloud providers deliver a computing platform, typically including operating system, programming language execution environment, database, and web server. With some PaaS offers like Microsoft Azure and Google App Engine, the underlying computer and storage resources scale automatically to match application demand so that the cloud user does not have to allocate resources manually.
Software as a service (SaaS) : SaaS is sometimes referred to as "on-demand software" and is usually priced on a pay-per-use basis.
A public cloud is one in which the services and infrastructure are provided off-site over the Internet. These clouds offer the greatest level of efficiency in shared resources; however, they are also more vulnerable than private clouds. A public cloud is the obvious choice when:-
Your standardized workload for applications is used by lots of people, such as e-mail.
You need to test and develop application code.
You have SaaS (Software as a Service) applications from a vendor who has a well-implemented security strategy.
Public clouds such as Amazon EC2 and Rackspace Cloud
A private cloud is one in which the services and infrastructure are maintained on a private network. These clouds offer the greatest level of security and control, but they require the company to still purchase and maintain all the software and infrastructure, which reduces the cost savings. A private cloud is the obvious choice when:-
Your business is your data and your applications. Therefore, control and security are paramount.
Your business is part of an industry that must conform to strict security and data privacy issues
Your company is large enough to run a next generation cloud data center efficiently and effectively on its own.
VMWare provides cloud services. Rackspace Private Cloud (Powered by OpenStack), CloudBees, Amazon Virtual Private Cloud
====================================================================
Private cloud is cloud infrastructure operated solely for a single organization, whether managed internally or by a third-party, and hosted either internally or externally.[1] Undertaking a private cloud project requires a significant level and degree of engagement to virtualize the business environment, and requires the organization to reevaluate decisions about existing resources. When done right, it can improve business, but every step in the project raises security issues that must be addressed to prevent serious vulnerabilities.[58] Self-run data centers[59] are generally capital intensive. They have a significant physical footprint, requiring allocations of space, hardware, and environmental controls. These assets have to be refreshed periodically, resulting in additional capital expenditures. They have attracted criticism because users "still have to buy, build, and manage them" and thus do not benefit from less hands-on management,[60] essentially "[lacking] the economic model that makes cloud computing such an intriguing concept“
A cloud is called a "public cloud" when the services are rendered over a network that is open for public use. Public cloud services may be free or offered on a pay-per-usage model.[63] Technically there may be little or no difference between public and private cloud architecture, however, security consideration may be substantially different for services (applications, storage, and other resources) that are made available by a service provider for a public audience and when communication is effected over a non-trusted network. Generally, public cloud service providers like Amazon AWS, Microsoft and Google own and operate the infrastructure at their data center and access is generally via the Internet. AWS and Microsoft also offer direct connect services called "AWS Direct Connect" and "Azure ExpressRoute" respectively, such connections require customers to purchase or lease a private connection to a peering point offered by the cloud provider
Hybrid cloud is a composition of two or more clouds (private, community or public) that remain distinct entities but are bound together, offering the benefits of multiple deployment models. Hybrid cloud can also mean the ability to connect collocation, managed and/or dedicated services with cloud resources
Amazon Elastic Compute Cloud (EC2) is a central part of Amazon.com's cloud computing platform, Amazon Web Services (AWS).
Allows users to rent virtual computers on which to run their own computer applications.
Allows scalable deployment of applications by providing a Web service through which a user can boot an Amazon Machine Image to create a virtual machine/Instances, containing any software desired. You can create, launch, and terminate server instances as needed, paying by the hour for active servers, hence the term "elastic".
Provides users with control over the geographical location of instances that allows for latency optimization and high levels of redundancy.
AMI is a supported and maintained Linux image provided by Amazon Web Services for use on Amazon Elastic Compute Cloud (Amazon EC2).
stable, secure, and high performance execution environment for applications running
Instance with Cloudwatch: monitor your Amazon EC2 instances using Amazon CloudWatch, which collects and processes raw data from Amazon EC2 into readable, near real-time metrics.
recorded for a period of two weeks
Elastic IP addresses are static IP addresses designed for dynamic cloud computing. Unlike traditional static IP addresses, however, Elastic IP addresses allow you to mask instance or availability zone failures by programmatically remapping your public IP addresses to any instance associated with your account. Rather than waiting on a data technician to reconfigure or replace your host, or waiting for DNS to propagate to all of your customers, Amazon EC2 enables you to engineer around problems with your instance or software by programmatically remapping your Elastic IP address to a replacement instance.
Amazon Elastic MapReduce (Amazon EMR) is a web service that makes it easy to quickly and cost-effectively process vast amounts of data.
EMR uses Hadoop, an open source framework, to distribute your data and processing across a resizable cluster of Amazon EC2 instances
A computer cluster(distributed system) consists of a set of loosely or tightly connected computers that work together so that, in many respects, they can be viewed as a single system.
Unlike grid, cluster different nodes can perform same tasks/application.
Amazon Elastic MapReduce (Amazon EMR) makes it easy to provision and manage Hadoop in the AWS Cloud.
streaming applications in one unified Big Data platform.
M3 Edition is the free version of our complete distribution for Hadoop. delivers a fully random read-write capable platform that supports industry-standard interfaces (e.g., NFS, ODBC), and provides management, compression and performance advantages.
M5 Edition is also a complete distribution for Apache Hadoop that delivers enterprise-grade features for all file operations on Hadoop. Features include mirroring, snapshots, NFS HA, data placement control, and many more,
M7 Edition is a complete distribution for Apache Hadoop that delivers ease of use, dependability and performance advantages for NoSQL and Hadoop applications.
M7 provides scale, strong consistency, reliability and continuous low latency with an architecture that does not require compactions or background consistency checks.
Amazon Elastic Compute Cloud (EC2) is a central part of Amazon.com's cloud computing platform, Amazon Web Services (AWS).
Allows users to rent virtual computers on which to run their own computer applications.
Allows scalable deployment of applications by providing a Web service through which a user can boot an Amazon Machine Image to create a virtual machine/Instances, containing any software desired. You can create, launch, and terminate server instances as needed, paying by the hour for active servers, hence the term "elastic".
Provides users with control over the geographical location of instances that allows for latency optimization and high levels of redundancy.
AMI is a supported and maintained Linux image provided by Amazon Web Services for use on Amazon Elastic Compute Cloud (Amazon EC2).
stable, secure, and high performance execution environment for applications running
Instance with Cloudwatch: monitor your Amazon EC2 instances using Amazon CloudWatch, which collects and processes raw data from Amazon EC2 into readable, near real-time metrics.
recorded for a period of two weeks
Elastic IP addresses are static IP addresses designed for dynamic cloud computing. Unlike traditional static IP addresses, however, Elastic IP addresses allow you to mask instance or availability zone failures by programmatically remapping your public IP addresses to any instance associated with your account. Rather than waiting on a data technician to reconfigure or replace your host, or waiting for DNS to propagate to all of your customers, Amazon EC2 enables you to engineer around problems with your instance or software by programmatically remapping your Elastic IP address to a replacement instance.
Amazon Elastic MapReduce (Amazon EMR) is a web service that makes it easy to quickly and cost-effectively process vast amounts of data.
EMR uses Hadoop, an open source framework, to distribute your data and processing across a resizable cluster of Amazon EC2 instances
A computer cluster(distributed system) consists of a set of loosely or tightly connected computers that work together so that, in many respects, they can be viewed as a single system.
Unlike grid, cluster different nodes can perform same tasks/application.
Amazon Elastic MapReduce (Amazon EMR) makes it easy to provision and manage Hadoop in the AWS Cloud.
streaming applications in one unified Big Data platform.
M3 Edition is the free version of our complete distribution for Hadoop. delivers a fully random read-write capable platform that supports industry-standard interfaces (e.g., NFS, ODBC), and provides management, compression and performance advantages.
M5 Edition is also a complete distribution for Apache Hadoop that delivers enterprise-grade features for all file operations on Hadoop. Features include mirroring, snapshots, NFS HA, data placement control, and many more,
M7 Edition is a complete distribution for Apache Hadoop that delivers ease of use, dependability and performance advantages for NoSQL and Hadoop applications.
M7 provides scale, strong consistency, reliability and continuous low latency with an architecture that does not require compactions or background consistency checks.
Kinesis is a fully managed service for real-time processing of streaming data at massive scale. Amazon Kinesis can continuously capture and store terabytes of data per hour from hundreds of thousands of sources such as website clickstreams, financial transactions, social media feeds, IT logs, and location-tracking events.
Amazon WorkSpaces offers you an easy way to provide a cloud-based desktop experience to your end-users. You simply select from a choice of WorkSpace bundles that offer a range of different amounts of CPU, memory, storage, and a choice of applications. Then, enter user information and launch the number of WorkSpaces that you require. As soon as the WorkSpaces are ready, users can download the Amazon WorkSpaces client and connect to their WorkSpace. Users can connect from a PC or Mac desktop computer, or an iPad, Kindle, or Android tablet.
Amazon Route 53 is a highly available and scalable cloud Domain Name System (DNS) web service. It is designed to give developers and businesses an extremely reliable and cost effective way to route end users to Internet applications by translating names like www.example.com into the numeric IP addresses like 192.0.2.1 that computers use to connect to each other.
Elastic Load Balancing to automatically distribute incoming web traffic across multiple Amazon Elastic Compute Cloud (Amazon EC2) instances. With Elastic Load Balancing, you can add and remove EC2 instances as your needs change without disrupting the overall flow of information. If one EC2 instance fails, Elastic Load Balancing automatically reroutes the traffic to the remaining running EC2 instances. If the failed EC2 instance is restored, Elastic Load Balancing restores the traffic to that instance.
Direct Connect links your internal network to an AWS Direct Connect location over a standard 1 gigabit or 10 gigabit Ethernet fiber-optic cable. One end of the cable is connected to your router, the other to an AWS Direct Connect router. With this connection in place, you can create virtual interfaces directly to the AWS cloud (for example, to Amazon Elastic Compute Cloud (Amazon EC2) and Amazon Simple Storage Service (Amazon S3)) and to Amazon Virtual Private Cloud (Amazon VPC), bypassing Internet service providers in your network path.
Virtual Private Cloud (Amazon VPC) enables you to launch Amazon Web Services (AWS) resources into a virtual network that you've defined. This virtual network closely resembles a traditional network that you'd operate in your own data center, with the benefits of using the scalable infrastructure of AWS.
A VPC peering connection allows you to route traffic between the peer VPCs using private IP addresses; as if they are part of the same network.
Auto Scaling is a web service designed to launch or terminate Amazon EC2 instances automatically based on user-defined policies, schedules, and health checks.
Auto Scaling is an AWS service that allows you to increase or decrease the number of EC2 instances within your application's architecture. With Auto Scaling, you create collections of EC2 instances, called Auto Scaling groups. You can create these groups from scratch, or from existing EC2 instances that are already in production.
An elastic network interface (ENI) is a virtual network interface that you can attach to an instance in a VPC. An ENI can include the following attributes:
a primary private IP address
1. one or more secondary private IP addresses
2. one Elastic IP address per private IP address
3. one public IP address, which can be auto-assigned to the network interface for eth0 when you launch an instance, but only when you create a network interface for eth0 instead of using an existing network interface
4. one or more security groups
a MAC address
a source/destination check flag
a description
When you move a network interface from one instance to another, network traffic is redirected to the new instance.
Simple Storage Service (Amazon S3), provides developers and IT teams with secure, durable, highly-scalable object storage. Amazon S3 is easy to use, with a simple web services interface to store and retrieve any amount of data from anywhere on the web. With Amazon S3, you pay only for the storage you actually use. There is no minimum fee and no setup cost.
AWS Import/Export accelerates moving large amounts of data into and out of the AWS cloud using portable storage devices for transport. AWS Import/Export transfers your data directly onto and off of storage devices using Amazon’s high-speed internal network and bypassing the Internet.
Amazon EBS provides persistent block level storage volumes for use with Amazon EC2 instances in the AWS Cloud. Each Amazon EBS volume is automatically replicated within its Availability Zone to protect you from component failure, offering high availability and durability. Amazon EBS volumes offer the consistent and low-latency performance needed to run your workloads.
An Amazon EBS snapshot is a point-in-time backup copy of an Amazon EBS volume that is stored in Amazon S3. Snapshots are incremental backups, which means that only the blocks on the device that have changed after your most recent snapshot are saved. When you delete a snapshot, only the data exclusive to that snapshot is removed. Active snapshots contain all of the information needed to restore your data (from the time the snapshot was taken) to a new Amazon EBS volume.
An Amazon EBS volume is a durable, block-level storage device that you can attach to a single EC2 instance. You can use Amazon EBS volumes as primary storage for data that requires frequent updates, such as the system drive for an instance or storage for a database application.
The AWS Storage Gateway is a service connecting an on-premises software appliance with cloud-based storage to provide seamless and secure integration between an organization’s on-premises IT environment and AWS’s storage infrastructure.
allows you to securely store data in the AWS cloud for scalable and cost-effective storage
Gateway-Cached Volumes: You can store your primary data in Amazon S3, and retain your frequently accessed data locally. Gateway-Cached volumes provide substantial cost savings on primary storage, minimize the need to scale your storage on-premises, and retain low-latency access to your frequently accessed data.
Gateway-Virtual Tape Library (Gateway-VTL): With Gateway-VTL you can have a limitless collection of virtual tapes. Each virtual tape can be stored in a Virtual Tape Library backed by Amazon S3 or a Virtual Tape Shelf backed by Amazon Glacier.
Amazon Glacier is an extremely low-cost cloud archive storage service that provides secure and durable storage for data archiving and online backup. In order to keep costs low, Amazon Glacier is optimized for data that is infrequently accessed and for which retrieval times of several hours are suitable.
You use vaults to organize the data you store in Amazon Glacier. Each archive is stored in a vault of your choice. You may control access to your data by setting vault-level access policies using the AWS Identity and Access Management (IAM) service.
Amazon CloudFront is a content delivery web service. It integrates with other Amazon Web Services products to give developers and businesses an easy way to distribute content to end users with low latency, high data transfer speeds, and no minimum usage commitments.
Edge location:
United States(Miami, New York, Jew Jersay,Palo Alto, Seattle WA, San jose)
Europe: paris, london, madrid, milan, amsterdam, frankfurt, dublin
Asia: chennai, Hong Kong, Singapore, Tokyo, Seoul
To stream media files using CloudFront, you provide two types of files to your end users:
Your media files – using HTTP
A media player, for example, JW Player, Flowplayer, or Adobe Flash – using RTMP(real time messaging protocol)
Amazon DynamoDB is a fast and flexible NoSQL database service for all applications that need consistent, single-digit millisecond latency at any scale. It is a fully managed database and supports both document and key-value data models. Its flexible data model and reliable performance make it a great fit for mobile, web, gaming, ad-tech, IoT, and many other applications.
for example, you are storing data that does not lend itself well to a relational schema (tree structures, schema-less JSON representations, etc.) that can be looked up against a single key or a key/range combination then DynamoDB (or some other NoSQL store) would likely be your best bet.
If you have a well-defined schema for your data that can fit well in a relational structure and you need the flexibility to query the data in a number of different ways (adding indexes as necessary of course), then RDS might be a better solution.
ElastiCache is a web service that makes it easy to deploy, operate, and scale an in-memory cache in the cloud. The service improves the performance of web applications by allowing you to retrieve information from fast, managed, in-memory caches, instead of relying entirely on slower disk-based databases. ElastiCache supports two open-source in-memory caching engines:
Amazon SimpleDB is a highly available and flexible non-relational data store that offloads the work of database administration. Developers simply store and query data items via web services requests and Amazon SimpleDB does the rest. Amazon SimpleDB creates and manages multiple geographically distributed replicas of your data automatically to enable high availability and data durability.
Amazon Redshift is a fast, fully managed, petabyte-scale data warehouse solution that makes it simple and cost-effective to efficiently analyze all your data using your existing business intelligence tools. to handle large scale datasets Amazon is making use of massive parallel processing.
AWS Elastic Beanstalk is an easy-to-use service for deploying and scaling web applications and services developed with Java, .NET, PHP, Node.js, Python, Ruby, and Docker on familiar servers such as Apache, Nginx, Passenger, and IIS. You can simply upload your code and Elastic Beanstalk automatically handles the deployment, from capacity provisioning, load balancing, auto-scaling to application health monitoring. At the same time, you retain full control over the AWS resources powering your application and can access the underlying resources at any time.
There is no additional charge for Elastic Beanstalk - you pay only for the AWS resources needed to store and run your applications.
Amazon CloudWatch is a monitoring service for AWS cloud resources and the applications you run on AWS. You can use Amazon CloudWatch to collect and track metrics, collect and monitor log files, and set alarms. Amazon CloudWatch can monitor AWS resources such as Amazon EC2 instances, Amazon DynamoDB tables, and Amazon RDS DB instances, as well as custom metrics generated by your applications and services, and any log files your applications generate.
AWS Data Pipeline is a web service that helps you reliably process and move data between different AWS compute and storage services as well as on-premise data sources at specified intervals.
AWS CloudFormation gives developers and systems administrators an easy way to create and manage a collection of related AWS resources, provisioning and updating them in an orderly and predictable fashion.
You can use AWS CloudFormation’s sample templates or create your own templates to describe the AWS resources, and any associated dependencies or runtime parameters, required to run your application. You don’t need to figure out the order for provisioning AWS services or the subtleties of making those dependencies work. CloudFormation takes care of this for you. After the AWS resources are deployed, you can modify and update them in a controlled and predictable way, in effect applying version control to your AWS infrastructure the same way you do with your software.
AWS CloudTrail is a web service that records AWS API calls for your account and delivers log files to you. The recorded information includes the identity of the API caller, the time of the API call, the source IP address of the API caller, the request parameters, and the response elements returned by the AWS service.
With CloudTrail, you can get a history of AWS API calls for your account, including API calls made via the AWS Management Console, AWS SDKs, command line tools, and higher-level AWS services (such as AWS CloudFormation). The AWS API call history produced by CloudTrail enables security analysis, resource change tracking, and compliance auditing.
AWS Identity and Access Management (IAM) enables you to securely control access to AWS services and resources for your users. Using IAM, you can create and manage AWS users and groups and use permissions to allow and deny their access to AWS resources.
AWS OpsWorks is an application management service that makes it easy to deploy and operate applications of all shapes and sizes. You can define the application’s architecture and the specification of each component including package installation, software configuration and resources such as storage. Start from templates for common technologies like application servers and databases or build your own to perform any task that can be scripted.
Amazon CloudSearch is a managed service in the AWS Cloud that makes it simple and cost-effective to set up, manage, and scale a custom search solution for your website or application. Amazon CloudSearch supports 34 languages and popular search features such as highlighting, autocomplete, and geospatial search.
Amazon Simple Email Service (Amazon SES) is a cost-effective outbound-only email-sending service built on the reliable and scalable infrastructure that Amazon.com has developed to serve its own customer base. With Amazon SES, you can send transactional email, marketing messages, or any other type of high-quality content and you only pay for what you use.
Along with high deliverability, Amazon SES provides easy, real-time access to your sending statistics and built-in notifications for bounces, complaints, and deliveries to help you fine-tune your cloud-based email-sending strategy.
Amazon Simple Notification Service (Amazon SNS) is a fast, flexible, fully managed push messaging service. Amazon SNS makes it simple and cost-effective to push notifications to Apple, Google, Fire OS, and Windows devices, as well as Android devices in China with Baidu Cloud Push. You can also use SNS to push notifications to internet connected smart devices, as well as other distributed services.
Besides pushing cloud notifications directly to mobile devices, Amazon SNS can also deliver notifications by SMS text message or email, to Amazon Simple Queue Service (SQS) queues, or to any HTTP endpoint.
To prevent messages from being lost, all messages published to Amazon SNS are stored redundantly across multiple availability zones.
Amazon Simple Queue Service (SQS) is a fast, reliable, scalable, fully managed message queuing service. SQS makes it simple and cost-effective to decouple the components of a cloud application. You can use SQS to transmit any volume of data, at any level of throughput, without losing messages or requiring other services to be always available.
With SQS, you can offload the administrative burden of operating and scaling a highly available messaging cluster, while paying a low price for only what you use.
Amazon SWF helps developers build, run, and scale background jobs that have parallel or sequential steps. You can think of SWF as a fully-managed state tracker and task coordinator in the Cloud.
If your app's steps take more than 500 milliseconds to complete, you need to track the state of processing, and you need to recover or retry if a task fails, Amazon SWF can help you.
Amazon Elastic Transcoder is media transcoding in the cloud. It is designed to be a highly scalable, easy to use and a cost effective way for developers and businesses to convert (or “transcode”) media files from their source format into versions that will playback on devices like smartphones, tablets and PCs.