Cloud Computing - Availability Issues and Controls

  • Hi everyone. My name is Lisa, and welcome to my presentation. Today I will be talking about cloud computing, specifically its availability issues and controls.
  • I will start by discussing the availability issue associated with cloud outages, with detailed reference to Amazon’s cloud services, as well as the general reaction from consumers. After that, I will cover other availability issues, including data lock-in and vendor shut-down. Finally, I will talk about the role CAs can play in mitigating the risks associated with cloud unavailability.
  • To start us off, let us take a look at Amazon, the well-known cloud services provider. Amazon offers two main types of cloud services, namely Elastic Compute Cloud (EC2) and Amazon Web Services. By most accounts, Amazon had 5 cloud outages during 2008-2009. From 2010 up until now, however, Amazon has had only one outage, which took place on April 21, 2011. Per Amazon, this was an issue of “stuck data volume”. In essence, consumers could not access data stored on Amazon’s cloud servers. This outage lasted for 2 days.
  • To examine the root cause of Amazon’s cloud outages, let us take a close look at Amazon’s cloud infrastructure, which is built on the concept of redundancy. Amazon’s data centers are located in 5 Regions: the East Coast and West Coast of the United States, Ireland, Tokyo, and Singapore. Within each Region, Amazon also has several Availability Zones built in different locations. Amazon explains that by launching instances in separate Availability Zones, consumers’ applications are protected from the failure of a single location, since backups can be retrieved from a different location. This is known as redundancy. However, Amazon’s Availability Zones are structured so that redundancies are built within the same Region, because inter-regional redundancy can introduce latency and is often more expensive.
  • In theory, there is a flaw in Amazon’s redundancy architecture. In the event of a natural disaster, like the recent Japanese earthquake, building redundancies within the same Region no longer helps, since the backup locations would be destroyed as well. Realistically speaking, though, it did not take an act of God to cause the cloud outage of April 2011.
  • In an effort to minimize the negative effects of Amazon’s cloud outage, Quora took matters into its own hands. During the April 21, 2011 outage, Quora took the initiative and brought up a new database from the most recent backup it had performed at the company level, on April 19, 2011. However, not every Amazon client did the same. Reddit, for instance, simply posted a note at the top of its website to inform users that data was inaccessible because of Amazon’s cloud outage.
  • In reality, cloud outages occur quite often at other big-name cloud vendors as well. According to the statistics shown on this slide, Google’s and Microsoft’s clouds are not performing any better than Amazon’s in terms of availability.
  • Interestingly, however, the many cloud outages have not scared companies away. According to an IDC report, expenditure on cloud-related technologies will grow to 45 billion by 2013. In addition, Harris Interactive conducted a survey of IT executives in which 43% said they expect to increase their usage of cloud. Why, you may ask, do people continue to turn to cloud in light of all these damaging outages?
  • One explanation could be the data lock-in effect of using cloud services. This term describes the inability to switch cloud vendors, or to move data back to one’s own data center, because of high conversion costs. For instance, SalesForce.com has a proprietary programming language called Apex that runs only on SalesForce’s platform. Although consumers can retrieve the data that they legitimately own, they will lose all the formatting and the data will become unmanageable. More often than not, consumers choose to remain with the same cloud vendor, since retrieving data from the cloud would put them at risk of non-compliance with industry standards.
  • When EMC shut down its Atmos Online cloud storage service only a year after its launch, consumers started to worry about whether cloud vendors’ businesses are going concerns. Although consumers usually hire cloud vendors with large corporate structures and good reputations, the case of EMC shows that even a big-name vendor can readily shut down its cloud services, putting the consumer’s data, infrastructure, or platform at risk.
  • In recent years, there has been heated debate over whether CAs should have a share in cloud computing by providing assurance services over cloud. While I would personally be against auditing the cloud because of its questionable auditability, I believe that CAs can still have a share of the cloud by providing consultancy services on the preventative measures vendors and consumers can take to mitigate the risks associated with cloud unavailability.
  • This list contains some of the controls vendors should implement to sustain the availability of their cloud services. Some of the important ones include ensuring that redundancies are built in geographically diversified locations to guard against natural disasters. Vendors should also use reliable equipment that has already been stress-tested. Furthermore, vendors should establish and regularly monitor their own Service Health Dashboard so that they can promptly receive and address cloud issues that consumers report. Lastly, vendors should regularly monitor the health of their own virtual servers. As an alternative to doing this internally, the cloud vendor can employ third-party services such as Amazon’s CloudWatch and Nimsoft’s Cloud Monitor to monitor the health of its virtual servers.
  • This slide shows a list of things consumers can do to manage cloud availability issues. Note, however, that the degree of control needed depends on how mission-critical the cloud is to the consumer’s core business processes. For instance, Twitter as a consumer may not feel an urgent need to resolve unavailability issues, as people may not mind much if they cannot access their tweets for a couple of days. In contrast, Netflix may view unavailability as a serious issue, since its success depends on a continuous supply of video streaming. At a minimum, consumers should monitor various media to keep themselves informed of reported cloud outages. It is also important for consumers to fully understand the vendor’s Service Level Agreement, since some vendors, such as Google Apps, provide consumers with service credits when they fail to meet their promised uptime. If cloud services are critical to the success of a consumer’s business, the consumer should set up business continuity plans, such as performing off-cloud backups or employing a second cloud services provider. Lastly, consumers can self-monitor the vendor’s virtual servers by hiring third-party consultants such as uptime for this type of service.
  • It appears that the significant cost reductions associated with cloud are highly attractive to the Chief Information Officers of many organizations, since many of them are either already using cloud or contemplating it, despite all the availability concerns surrounding it. While cloud computing is still a fairly new topic, keep in mind that there are controls to mitigate the risks of cloud unavailability. Since many CIOs do not understand how that can be done, this in turn gives us CAs the opportunity to step into the picture and save the day!
  • That concludes my presentation. I hope you enjoyed it!
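
As a rough illustration of the availability figures discussed in these notes, the Python sketch below turns an outage duration into an availability percentage and shows why redundant Availability Zones help only when their failures are independent. The function names, the one-year measurement window, and the 99% per-zone figure are assumptions made for illustration; none of them come from Amazon or from the slides.

```python
HOURS_PER_YEAR = 365 * 24  # assumed measurement window: one non-leap year


def availability_pct(downtime_hours: float, period_hours: float) -> float:
    """Percentage of the period during which the service was reachable."""
    if period_hours <= 0:
        raise ValueError("period_hours must be positive")
    if not 0 <= downtime_hours <= period_hours:
        raise ValueError("downtime_hours must lie within the period")
    return 100.0 * (1.0 - downtime_hours / period_hours)


def redundant_availability(zone_availability: float, zones: int) -> float:
    """Chance that at least one zone is up, ASSUMING zones fail independently."""
    if not 0.0 <= zone_availability <= 1.0:
        raise ValueError("zone_availability must be between 0 and 1")
    if zones < 1:
        raise ValueError("zones must be at least 1")
    # All zones must be down at once for the service to be unavailable.
    return 1.0 - (1.0 - zone_availability) ** zones


# A single 2-day (48-hour) outage over one year.
print(f"{availability_pct(48, HOURS_PER_YEAR):.2f}%")

# Hypothetical 99%-available zones: one zone versus two independent zones.
print(f"{redundant_availability(0.99, 1):.4f}")
print(f"{redundant_availability(0.99, 2):.4f}")
```

The second function relies on the independence assumption. When every Availability Zone in a Region fails at the same time, as the slides report for the April 21, 2011 outage, the zones behave like a single point of failure and the extra copies add little.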

Presentation Transcript

  • Cloud Computing – Availability Issues and Controls
    By: Lisa Cheng
  • Agenda
    Cloud Outages
    Impact of unavailability on consumers
    Data Lock-in
    Vendor Shut-down
    Linkage to the CA profession
    Preventative measures
  • Amazon Cloud Outages
    Two main types of services
    EC2 and Amazon Web Services
    From 2008-2009
    Had 5 instances of cloud outages
    From 2010-2011
    Had 1 instance of cloud outage
    April 21, 2011 outage
    “Stuck data volume”
    Lasted 2 days
    Data were inaccessible, although websites could still function
  • Amazon cloud structure
    Availability Zones
  • Amazon Cloud Structure Critique
    “Act of God”
    Japanese Earthquake
    Cannot make back-ups within the same Region
    Recent April 21, 2011 outage
    All availability zones within the same region failed simultaneously
    Amazon’s competency in building redundancies is questionable
  • Amazon Clients
    Brought up new database from the latest back-up at the company’s level
    Synchronization issue
    Did nothing
  • Cloud outages elsewhere
    12 outages from 2008-2009
    6 outages from 2010 – 2011 (now)
    4 outages from 2008-2009
    6 outages from 2010-2011 (now)
    Playstation Network
  • Impact of unavailability on consumers
    Companies are still resorting to cloud services to cut down costs
    IDC report shows:
    17 billion spent on cloud-related technologies in 2009
    By 2013, it’ll grow to 45 billion
    Harris Interactive:
    43% of IT executives expect to increase their usage of cloud
  • Data Lock-in
    Vendors with proprietary technologies
    High conversion costs
    Proprietary programming language named Apex
    Microsoft Azure & Amazon Web Services
    Data are the only portable items
    To consumers:
    Risk of paying high prices for poor services
    No compatible technology to retrieve data from cloud
    Risk of non-compliance with standards
  • Vendor Shut-down
    Atmos Online Cloud Storage
    Shut down its business after one month of operation
    Offered multiple migration options
    Potential impact on consumers
    Worried about reliance on 3rd party service providers
  • Linkage to the CA profession
    Heated debate over auditing cloud
    Advantage: opportunities
    Disadvantage: auditability of cloud
    Provide consultancy services over preventative measures vendors/consumers can take to mitigate risks of unavailability
  • Preventative measures: Vendors
    Geographically diversified architecture
    Reliable internet connection
    Reliable and redundant hardware/software
    Effective business continuity plans
    Make web console and API available to consumers
    Establish Service Health Dashboard
    Regularly monitor CCID – cloud outages database
    Regularly monitor the health of virtual servers
  • Preventative measures: consumers
    Monitor vendors’ service health dashboard
    Monitor CCID
    Monitor customer mailing list of recent changes
    Monitor RSS feed hosted by the vendor
    Understand/negotiate Service Level Agreement
    Set up business continuity plans
    Hire a second cloud services provider
    Off-cloud backup
    Periodic updates to reflect expansion
    Self-monitor vendors’ virtual servers
  • Conclusion
  • The end
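
The consumer-side measure of self-monitoring a vendor’s virtual servers, listed under the preventative measures above, might be sketched in Python as follows. This is a minimal illustration under stated assumptions, not any vendor’s or monitoring product’s API: `run_checks`, `breaches_sla`, and `UptimeReport` are hypothetical names, the probe is an injected function standing in for a real network request, and the 99.9% promised uptime is an invented SLA figure.

```python
from dataclasses import dataclass
from typing import Callable


@dataclass
class UptimeReport:
    checks: int
    failures: int

    @property
    def uptime_pct(self) -> float:
        """Share of probes that succeeded, as a percentage."""
        if self.checks == 0:
            return 0.0
        return 100.0 * (self.checks - self.failures) / self.checks


def run_checks(probe: Callable[[], bool], checks: int) -> UptimeReport:
    """Call the probe `checks` times; an exception counts as a failed check."""
    failures = 0
    for _ in range(checks):
        try:
            ok = probe()
        except Exception:
            ok = False
        if not ok:
            failures += 1
    return UptimeReport(checks=checks, failures=failures)


def breaches_sla(report: UptimeReport, promised_uptime_pct: float) -> bool:
    """True when measured uptime falls below the promised figure."""
    return report.uptime_pct < promised_uptime_pct


# Simulated probe results (hypothetical data): 98 successes, then 2 failures.
results = iter([True] * 98 + [False] * 2)
report = run_checks(lambda: next(results), checks=100)
print(report.uptime_pct, breaches_sla(report, 99.9))
```

A real monitor would wait between probes and replace the simulated probe with an actual request to the server being watched. Keeping the probe injectable lets the counting logic be checked without network access.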