2

•Download as DOCX, PDF•

0 likes•175 views

Cloud computing is a new concept that emerged in the late 1990s involving hosting applications and data over the internet. There are two main types - software as a service where vendors host specific applications, and infrastructure as a service where clients run their own software on vendor machines through virtual machines. Large cloud vendors include Amazon and Google. Cloud databases store and retrieve data for large numbers of users and prioritize availability and scalability over consistency. BigTable is Google's cloud database that stores attribute values as strings using a hierarchical key of record identifier and attribute name to retrieve data.

Technology Business

Cloud computing
 A new concept is computing that emerged in the late 1990s and the 2000s.
 First, software as a service
o Vendors of software services provided specific customizable applications that
they hosted on their own machines
 Then, generic computers as a service
o Clients runs its own software, but runs it on vendor’s computers.
o These machines are called virtual machines, which are simulated by software that
allows a single real computer to simulate several independent computers
o Clients can add machines as needed to meet demand and release them at times of
light load.
 Other services
o Data storage services, map services, and other services can be accessed using a
Web-service API.
 Venders of cloud service
o Traditional computing vendors, Amazon, Google
 Cloud-based database
o Web applications need to store and retrieve data for very large numbers of users
o Value availability and scalability over consistency
 Systems for data storage on the cloud
o Bigtable from Google
o Simple Storage Service (S3) from Amazon
o Cassandra from Facebook
o Sherpa/PNUTs from Yahoo!
Data Representation
 It needs to provide flexibility in the set of attributes that a record contains, and the types
of these attributes

 The partitioning is done on the search key, so that a request for a specific key value is
directed to a single tablet.
 The site to which a tablet is assigned acts as the master site.
o All updates are routed through this site, and then propagated to replicas
 The partitioning of data is not fixed, but happens dynamically.
 A tablet controller site tracks the partitioning function, to map a get() request to tablets,
and map from tablets to sites
Architecture of a cloud data storage system
Challenges with Cloud-based Database
 advantages
o Do not need to build a computing infrastructures from scratch
o Essential for certain applications
 Disadvantage
o Additional communication cost like traditional distributed database system

o The physical location of data is under the control of the vendor, which is unaware
 Hard to perform query optimization
o Replication is under the control of the vendor
 Hard to ensure the latest version of data are read
o Data held by another organization are risked in terms of security and legal
liability

What's hot

MongoDB NoSQL database a deep dive -MyWhitePaperRajesh Kumar

SQL vs MongoDBcalltutors

An Intro to NoSQL DatabasesRajith Pemabandu

Knockout Advanced Concepts By Surekha GadkariSurekha Gadkari

Oslo bekk2014Max Neunhöffer

Multi-model databases and node.jsMax Neunhöffer

Cassandra data accesstechblog

NoSQL databasesFilip Ilievski

FIWARE Global Summit - NGSI-LD: Modelling, Linking and Utilizing Context Info...FIWARE

Extensible Database APIs and their role in Software ArchitectureMax Neunhöffer

Mongo db a deep dive of mongodb indexesRajesh Kumar

JSON ApplicationLin Tzu Cheng

Backbone using Extensible Database APIs over HTTPMax Neunhöffer

PostgreSQL - Object Relational DatabaseMubashar Iqbal

DiveDhan V Sagar

Schemaless DatabasesDan Gunter

Jan Steemann: Modelling data in a schema free world (Talk held at Froscon, 2...ArangoDB Database

NOSQL Databases types and UsesSuvradeep Rudra

Android L05 - StorageMohammad Shaker

What's hot (19)

MongoDB NoSQL database a deep dive -MyWhitePaper

SQL vs MongoDB

An Intro to NoSQL Databases

Knockout Advanced Concepts By Surekha Gadkari

Oslo bekk2014

Multi-model databases and node.js

Cassandra data access

NoSQL databases

FIWARE Global Summit - NGSI-LD: Modelling, Linking and Utilizing Context Info...

Extensible Database APIs and their role in Software Architecture

Mongo db a deep dive of mongodb indexes

JSON Application

Backbone using Extensible Database APIs over HTTP

PostgreSQL - Object Relational Database

Dive

Schemaless Databases

Jan Steemann: Modelling data in a schema free world (Talk held at Froscon, 2...

NOSQL Databases types and Uses

Android L05 - Storage

Similar to 2

Technology OverviewLiran Zelkha

Apache Kafka and the Data Mesh | Ben Stopford and Michael Noll, ConfluentHostedbyConfluent

Towards secure and dependable storageKhaja Moiz Uddin

ArcReady - Architecting For The CloudMicrosoft ArcReady

Cloud ComputingKashyap Parmar

Schema-based multi-tenant architecture using Quarkus & Hibernate-ORM.pdfseo18

Privacy Issues of Cloud Computing in the Federal SectorLew Oleinick

Essay On Active DirectoryTammy Moncrief

Apache Kafka and the Data Mesh | Michael Noll, ConfluentHostedbyConfluent

The Proliferation And Advances Of Computer NetworksJessica Deakin

Cloud economics design, capacity and operational concernsMarcos García

Cloud ComputingArwa

Off-Label Data Mesh: A Prescription for Healthier DataHostedbyConfluent

(Speaker Notes Version) Architecting An Enterprise Storage Platform Using Obj...Niraj Tolia

Windows Azure: Lessons From The FieldRob Gillen

EXPLORING WOMEN SECURITY BY DEDUPLICATION OF DATAIRJET Journal

Maximizing Data Lake ROI with Data Virtualization: A Technical DemonstrationDenodo

Presentazione pagano1Francesco Pagano

The Last Frontier- Virtualization, Hybrid Management and the CloudKellyn Pot'Vin-Gorman

databasesystemsconollyslide1-151102101031-lva1-app6892.pptxsalutiontechnology

Similar to 2 (20)

Technology Overview

Apache Kafka and the Data Mesh | Ben Stopford and Michael Noll, Confluent

Towards secure and dependable storage

ArcReady - Architecting For The Cloud

Cloud Computing

Schema-based multi-tenant architecture using Quarkus & Hibernate-ORM.pdf

Privacy Issues of Cloud Computing in the Federal Sector

Essay On Active Directory

Apache Kafka and the Data Mesh | Michael Noll, Confluent

The Proliferation And Advances Of Computer Networks

Cloud economics design, capacity and operational concerns

Cloud Computing

Off-Label Data Mesh: A Prescription for Healthier Data

(Speaker Notes Version) Architecting An Enterprise Storage Platform Using Obj...

Windows Azure: Lessons From The Field

EXPLORING WOMEN SECURITY BY DEDUPLICATION OF DATA

Maximizing Data Lake ROI with Data Virtualization: A Technical Demonstration

Presentazione pagano1

The Last Frontier- Virtualization, Hybrid Management and the Cloud

databasesystemsconollyslide1-151102101031-lva1-app6892.pptx

Recently uploaded

Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...apidays

Boost Fertility New Invention Ups Success Rates.pdfsudhanshuwaghmare1

ICT role in 21st century education and its challengesrafiqahmad00786416

FWD Group - Insurer Innovation Award 2024The Digital Insurer

Exploring Multimodal Embeddings with MilvusZilliz

Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FMESafe Software

DEV meet-up UiPath Document Understanding May 7 2024 AmsterdamUiPathCommunity

Exploring the Future Potential of AI-Enabled Smartphone Processorsdebabhi2

Strategies for Landing an Oracle DBA Job as a FresherRemote DBA Services

Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingEdi Saputra

Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...Zilliz

Apidays New York 2024 - The value of a flexible API Management solution for O...apidays

Architecting Cloud Native ApplicationsWSO2

Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobeapidays

[BuildWithAI] Introduction to Gemini.pdfSandro Moreira

DBX First Quarter 2024 Investor PresentationDropbox

Spring Boot vs Quarkus the ultimate battle - DevoxxUKJago de Vreede

Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdfOrbitshub

AWS Community Day CPH - Three problems of TerraformAndrey Devyatkin

Ransomware_Q4_2023. The report. [EN].pdfOverkill Security

Recently uploaded (20)

Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...

Boost Fertility New Invention Ups Success Rates.pdf

ICT role in 21st century education and its challenges

FWD Group - Insurer Innovation Award 2024

Exploring Multimodal Embeddings with Milvus

Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME

DEV meet-up UiPath Document Understanding May 7 2024 Amsterdam

Exploring the Future Potential of AI-Enabled Smartphone Processors

Strategies for Landing an Oracle DBA Job as a Fresher

Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving

Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...

Apidays New York 2024 - The value of a flexible API Management solution for O...

Architecting Cloud Native Applications

Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe

[BuildWithAI] Introduction to Gemini.pdf

DBX First Quarter 2024 Investor Presentation

Spring Boot vs Quarkus the ultimate battle - DevoxxUK

Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf

AWS Community Day CPH - Three problems of Terraform

Ransomware_Q4_2023. The report. [EN].pdf

2

1. Cloud computing  A new concept is computing that emerged in the late 1990s and the 2000s.  First, software as a service o Vendors of software services provided specific customizable applications that they hosted on their own machines  Then, generic computers as a service o Clients runs its own software, but runs it on vendor’s computers. o These machines are called virtual machines, which are simulated by software that allows a single real computer to simulate several independent computers o Clients can add machines as needed to meet demand and release them at times of light load.  Other services o Data storage services, map services, and other services can be accessed using a Web-service API.  Venders of cloud service o Traditional computing vendors, Amazon, Google  Cloud-based database o Web applications need to store and retrieve data for very large numbers of users o Value availability and scalability over consistency  Systems for data storage on the cloud o Bigtable from Google o Simple Storage Service (S3) from Amazon o Cassandra from Facebook o Sherpa/PNUTs from Yahoo! Data Representation  It needs to provide flexibility in the set of attributes that a record contains, and the types of these attributes

2. o XML, JSON o BigTable has its own data model (the next page)  It does not need extensive query language support. Two primitive functions: o put(key, value): store values with an associated key o get(key): retrieve the stored value associated with the specified key  An example application o The profile of a user needs to be accessible to many different application that are run by an organization. o The profile contains my attributes, and there are frequent additions to the attributes stored in the profile o Some attributes may contain complex data. BigTable  A record is split into component attributes that are stored separately.  The key for an attribute value consists of (record-identifier, attribute-name).  Each attribute value is just a string.  Example: A record with identifier “22222”, can have multiple attribute names such as “name.firstname”, “deptname”, “children[1].firstname”, “children[2].lastname”. (cf the JSON example in chapter 23).  To fetch all attributes of a record, a prefix-match query consisting of just the record identifier, is used.  The record identifier can itself be structured hierarchically  A single instance of Bigtable can store data for multiple application, with multiple tables per application, by simply prefixing the application name and table name to the record identifier Partitioning and Retrieving Data  Unlike regular parallel database, it is usually not possible to decide on a partitioning function ahead of time.  Therefore, it partition data into small units, called tablets.

3.  The partitioning is done on the search key, so that a request for a specific key value is directed to a single tablet.  The site to which a tablet is assigned acts as the master site. o All updates are routed through this site, and then propagated to replicas  The partitioning of data is not fixed, but happens dynamically.  A tablet controller site tracks the partitioning function, to map a get() request to tablets, and map from tablets to sites Architecture of a cloud data storage system Challenges with Cloud-based Database  advantages o Do not need to build a computing infrastructures from scratch o Essential for certain applications  Disadvantage o Additional communication cost like traditional distributed database system

4. o The physical location of data is under the control of the vendor, which is unaware  Hard to perform query optimization o Replication is under the control of the vendor  Hard to ensure the latest version of data are read o Data held by another organization are risked in terms of security and legal liability

2

Recommended