SlideShare a Scribd company logo
© 2009 EMC Corporation. All rights reserved.
EMC Proven Professional
The #1 Certification Program in the information storage
and management industry
Content Addressed Storage
Chapter 9
Section 2 : Storage Networking Technologies and Virtualization
© 2009 EMC Corporation. All rights reserved.
Chapter Objectives
Upon completion of this chapter, you will be able to:
o Describe CAS, fixed content and archives, traditional storage
solutions for archive
o Describe the features and benefits of a CAS based storage
strategy
o List the physical and logical elements of CAS
o Describe the storage and retrieval process for CAS data objects
o Describe the best suited operational environments for CAS
solutions
© 2009 EMC Corporation. All rights reserved.
Lesson: CAS Overview
Upon completion of this lesson, you be able to:
o Define fixed content
o Describe traditional archival solutions and its shortcoming
o Define Content Addressed Storage (CAS)
o List benefits of CAS
© 2009 EMC Corporation. All rights reserved.
What are Fixed Content and Archives
Electronic Documents
•Contracts, claims, etc.
•E-mail and attachments
•Financial spread sheets
•CAD/CAM designs
•Presentations
Digital Records
•Documents
– Checks, securities trades
– Historical preservation
•Photographs
– Personal / professional
•Surveys
– Seismic, astronomic,
geographic
Digital Assets Retained For Active Reference And Value
Leverage
Historical Value
Improve
Service Levels
Generate
New Revenues
Rich Media
•Medical
– X-rays, MRIs, CTI
•Video
– News / media, movies
– Security surveillance
•Audio
– Voicemail
– Radio
© 2009 EMC Corporation. All rights reserved.
Challenges of Storing Fixed Content
o Fixed content is growing at more than 90% annually
o Significant amount of newly created information falls into this category
o New regulations require retention and data protection
o Often, long-term preservation is required (years-decades)
o Simultaneous multi-user online access is preferable to offline
storage
o Need faster access to fixed content
o Need for location independent data, enabling technology
refresh and migration
o Traditional storage methods are inadequate
© 2009 EMC Corporation. All rights reserved.
Traditional storage solutions for Archive
o Three categories of archival solution are:
o Online, nearline, and offline based on the means of access
o Traditional archival solution were offline
o Traditional archival process used optical disks and tapes as media for
archival
o An archive is often stored on a Write Once Read Many (WORM) device,
such as a CD-ROM
© 2009 EMC Corporation. All rights reserved.
Shortcomings of Traditional Archiving Solutions
o Tape is slow, and standards are always changing
o Optical is expensive, and requires vast amounts of media
o Recovering files from tape and optical is often time consuming
o Data on tape and optical is subject to media degradation
o Both solution require sophisticated media management
CAS has emerged as an alternative to traditional
archiving solutions
© 2009 EMC Corporation. All rights reserved.
What is Content Addressed Storage (CAS)
o Object-oriented, location-independent approach to data
storage
o Repository for the “Objects”
o Access mechanism to interface with repository
o Globally unique identifiers provide access to objects
© 2009 EMC Corporation. All rights reserved.
Benefits of CAS
o Content authenticity
o Content integrity
o Location independence
o Single-instance storage (SiS)
o Retention enforcement
o Record-level protection and disposition
o Technology independence
o Fast record retrieval
Additional Task
Research on role of CAS in ILM
Strategy
© 2009 EMC Corporation. All rights reserved.
Lesson Summary
Key points covered in this lesson:
o CAS Definition
o Challenges of Storing Fixed Content
o Shortcomings of Traditional Archiving Solutions
o Benefits of CAS
© 2009 EMC Corporation. All rights reserved.
Lesson: CAS Architecture
Upon completion of this lesson, you will be able to:
o Describe CAS architecture
o Describe Physical and logical elements of CAS
o Describe data storage and retrieval process in CAS
environment
o CAS examples
© 2009 EMC Corporation. All rights reserved.
Physical Elements of CAS
o Storage devices (CAS Based)
o Storage node
o Access node
o Servers (to which storage devices
get connected)
o Client
Server
Private
LAN
Storage
Nodes
Access
Nodes
CAS System
IP
API
© 2009 EMC Corporation. All rights reserved.
CAS Terminology
o Application Programming Interface (API)
o A set of function calls that enables communication
between applications or between an application and
an operating system
o BLOB (Binary Large Object)
o The actual data without the descriptive information
(metadata)
o The Distinct Bit Sequence (DBS) of user data
represents the actual content of a file and is
independent of the filename and physical location
API
© 2009 EMC Corporation. All rights reserved.
CAS Terminology (Cont)
o C-Clip
o A package containing the user's data and associated
metadata
o C-Clip ID (C-Clip handle or C-Clip reference) is the CA that
the system returns to the client application
o Content Address (CA)
o An identifier that uniquely addresses the content of a file
and not its location. Unlike location-based addresses,
content addresses are inherently stable and, once
calculated, they never change and always refer to the same
content
o C-Clip Descriptor File (CDF)
o The additional XML file that the system creates when
making a C-Clip. This file includes the content addresses for
all referenced BLOBs and associated metadata
© 2009 EMC Corporation. All rights reserved.
API
How CAS Stores a Data Object
Unique Content
Address is calculated
Client presents data
to API to be archived CAS System
Client
Application Server
CDF
C-Clip
(Object)
Object is sent
to CAS System via
CAS API over IP
© 2009 EMC Corporation. All rights reserved.
API
How CAS Stores a Data Object
Unique Content
Address is calculated
Client presents data
to API to be archived CAS System
Client
Application Server Object is sent
to CAS System via
CAS API over IP Object
CAS System validates
the Content Address
and stores the object
Acknowledgement
returned to
application
Clip ID is retained and
stored for future use
© 2009 EMC Corporation. All rights reserved.
Application Server
Client
CAS System
Object is needed by
an application
1
CAS authenticates
the request and
delivers the object
4
Application finds
Content Address of
object to be retrieved
2 Retrieval request is
sent to the CAS System via
CAS API over IP
3
How CAS Retrieves a Data Object
API
C-Clip ID
© 2009 EMC Corporation. All rights reserved.
CAS Features
o Features available with most CAS systems are:
o Integrity checking
o Data protection
o Local replication
o Remote replication
o Load balancing
o Scalability
o Self-diagnosis and repair
o Report generation and event notification
o Fault tolerance
o Through the use of redundant components and data protection schemes
o Audit trails
o Documentation of management activities, access and disposition of data
© 2009 EMC Corporation. All rights reserved.
Example 1: CAS Healthcare Solution
o Each X-ray image ranges from about 15MB to over 1GB
o Patient record is stored online for a period of 60-90 days
o Beyond 90 days patient records are archived
Data Stored
on CAS
Patient Studies
Stored locally for
Short-Term Use
(60 Days)
Hospital
CAS SystemApplication
Server
API
© 2009 EMC Corporation. All rights reserved.
Example 2: CAS Financial Solution
o Check image size is about 25KB
o Check imaging service provider may process 50–90 million
check images per month
o Checks are stored online for a period of 60 days
o Beyond 60 days data is archived
Bank
CAS SystemApplication Server
API
© 2009 EMC Corporation. All rights reserved.
Lesson Summary
Key points covered in this lesson:
o CAS architecture
o Physical and logical elements of CAS
o CAS storage and retrieval process
o CAS solution examples
© 2009 EMC Corporation. All rights reserved.
Chapter Summary
Key points covered in this chapter:
o Benefits of CAS based storage strategy
o Overview of physical and logical elements of CAS
o Storing and retrieving data from CAS
o CAS application examples
© 2009 EMC Corporation. All rights reserved.
#1 IT
company
For more information visit http://education.EMC.com

More Related Content

What's hot

Object Relational Database Management System(ORDBMS)
Object Relational Database Management System(ORDBMS)Object Relational Database Management System(ORDBMS)
Object Relational Database Management System(ORDBMS)
Rabin BK
 
Unit 2
Unit 2Unit 2
Unit 2
APARNA P
 
Chapter 4
Chapter 4Chapter 4
Chapter 4
Ali Broumandnia
 
30326851 -operating-system-unit-1-ppt
30326851 -operating-system-unit-1-ppt30326851 -operating-system-unit-1-ppt
30326851 -operating-system-unit-1-ppt
raj732723
 
Congestion control
Congestion controlCongestion control
Congestion control
Aman Jaiswal
 
Recovery Techniques and Need of Recovery
Recovery Techniques and   Need of RecoveryRecovery Techniques and   Need of Recovery
Recovery Techniques and Need of Recovery
Pooja Dixit
 
input/ output in java
input/ output  in javainput/ output  in java
input/ output in java
sharma230399
 
Presentation on tablespaceses segments extends and blocks
Presentation on tablespaceses segments extends and blocksPresentation on tablespaceses segments extends and blocks
Presentation on tablespaceses segments extends and blocks
Vinay Ugave
 
Distributed DBMS - Unit 5 - Semantic Data Control
Distributed DBMS - Unit 5 - Semantic Data ControlDistributed DBMS - Unit 5 - Semantic Data Control
Distributed DBMS - Unit 5 - Semantic Data Control
Gyanmanjari Institute Of Technology
 
9. Input Output in java
9. Input Output in java9. Input Output in java
9. Input Output in java
Nilesh Dalvi
 
TCP - Transmission Control Protocol
TCP - Transmission Control ProtocolTCP - Transmission Control Protocol
TCP - Transmission Control Protocol
Peter R. Egli
 
8th lec flow and error control
8th lec   flow and error control8th lec   flow and error control
8th lec flow and error control
Inocentshuja Ahmad
 
Distributed Query Processing
Distributed Query ProcessingDistributed Query Processing
Distributed Query Processing
Mythili Kannan
 
Chap 12 tcp
Chap 12 tcpChap 12 tcp
Chap 12 tcp
Sparsh Samir
 
Database Systems Security
Database Systems SecurityDatabase Systems Security
Database Systems Security
amiable_indian
 
Congestion on computer network
Congestion on computer networkCongestion on computer network
Congestion on computer networkDisi Dc
 
Threads concept in java
Threads concept in javaThreads concept in java
Threads concept in java
Muthukumaran Subramanian
 
[OOP - Lec 08] Encapsulation (Information Hiding)
[OOP - Lec 08] Encapsulation (Information Hiding)[OOP - Lec 08] Encapsulation (Information Hiding)
[OOP - Lec 08] Encapsulation (Information Hiding)
Muhammad Hammad Waseem
 
Computer network switching
Computer network switchingComputer network switching
Computer network switching
Shivani Godha
 

What's hot (20)

Object Relational Database Management System(ORDBMS)
Object Relational Database Management System(ORDBMS)Object Relational Database Management System(ORDBMS)
Object Relational Database Management System(ORDBMS)
 
Unit 2
Unit 2Unit 2
Unit 2
 
Chapter 4
Chapter 4Chapter 4
Chapter 4
 
30326851 -operating-system-unit-1-ppt
30326851 -operating-system-unit-1-ppt30326851 -operating-system-unit-1-ppt
30326851 -operating-system-unit-1-ppt
 
Congestion control
Congestion controlCongestion control
Congestion control
 
Guided media
Guided mediaGuided media
Guided media
 
Recovery Techniques and Need of Recovery
Recovery Techniques and   Need of RecoveryRecovery Techniques and   Need of Recovery
Recovery Techniques and Need of Recovery
 
input/ output in java
input/ output  in javainput/ output  in java
input/ output in java
 
Presentation on tablespaceses segments extends and blocks
Presentation on tablespaceses segments extends and blocksPresentation on tablespaceses segments extends and blocks
Presentation on tablespaceses segments extends and blocks
 
Distributed DBMS - Unit 5 - Semantic Data Control
Distributed DBMS - Unit 5 - Semantic Data ControlDistributed DBMS - Unit 5 - Semantic Data Control
Distributed DBMS - Unit 5 - Semantic Data Control
 
9. Input Output in java
9. Input Output in java9. Input Output in java
9. Input Output in java
 
TCP - Transmission Control Protocol
TCP - Transmission Control ProtocolTCP - Transmission Control Protocol
TCP - Transmission Control Protocol
 
8th lec flow and error control
8th lec   flow and error control8th lec   flow and error control
8th lec flow and error control
 
Distributed Query Processing
Distributed Query ProcessingDistributed Query Processing
Distributed Query Processing
 
Chap 12 tcp
Chap 12 tcpChap 12 tcp
Chap 12 tcp
 
Database Systems Security
Database Systems SecurityDatabase Systems Security
Database Systems Security
 
Congestion on computer network
Congestion on computer networkCongestion on computer network
Congestion on computer network
 
Threads concept in java
Threads concept in javaThreads concept in java
Threads concept in java
 
[OOP - Lec 08] Encapsulation (Information Hiding)
[OOP - Lec 08] Encapsulation (Information Hiding)[OOP - Lec 08] Encapsulation (Information Hiding)
[OOP - Lec 08] Encapsulation (Information Hiding)
 
Computer network switching
Computer network switchingComputer network switching
Computer network switching
 

Similar to Chapter 9

Presentation atmos architecture overview
Presentation   atmos architecture overviewPresentation   atmos architecture overview
Presentation atmos architecture overview
xKinAnx
 
Chapter 9
Chapter 9Chapter 9
Chapter 9
Ali Broumandnia
 
Rapid_Recovery-T75-v2204j.pdf
Rapid_Recovery-T75-v2204j.pdfRapid_Recovery-T75-v2204j.pdf
Rapid_Recovery-T75-v2204j.pdf
Tony Pearson
 
Object Broker Infrastructure for Wide Area Networks
Object Broker Infrastructure for Wide Area NetworksObject Broker Infrastructure for Wide Area Networks
Object Broker Infrastructure for Wide Area Networks
Vaidas Brundza
 
IDT Replaces On-Premises Appliances with Primary Backup on AWS
 IDT Replaces On-Premises Appliances with Primary Backup on AWS IDT Replaces On-Premises Appliances with Primary Backup on AWS
IDT Replaces On-Premises Appliances with Primary Backup on AWS
Amazon Web Services
 
Se training storage grid webscale technical overview
Se training   storage grid webscale technical overviewSe training   storage grid webscale technical overview
Se training storage grid webscale technical overview
solarisyougood
 
Oracle E-Business Suite On Oracle Cloud
Oracle E-Business Suite On Oracle CloudOracle E-Business Suite On Oracle Cloud
Oracle E-Business Suite On Oracle Cloud
pasalapudi
 
Oracle Cloud Infraestructure Update
Oracle Cloud Infraestructure UpdateOracle Cloud Infraestructure Update
Oracle Cloud Infraestructure Update
RaphaelCampelo
 
Oci meetup v1
Oci meetup v1Oci meetup v1
Oci meetup v1
RaphaelCampelo
 
Frank kramer ibm-data_management-for-adas-scale-usergroup-sin-032018
Frank kramer ibm-data_management-for-adas-scale-usergroup-sin-032018Frank kramer ibm-data_management-for-adas-scale-usergroup-sin-032018
Frank kramer ibm-data_management-for-adas-scale-usergroup-sin-032018
Snowy Chen
 
Oxford Common File Layout (OCFL)
Oxford Common File Layout (OCFL)Oxford Common File Layout (OCFL)
Oxford Common File Layout (OCFL)
Simeon Warner
 
S016825 ibm-cos-nola-v1710d
S016825 ibm-cos-nola-v1710dS016825 ibm-cos-nola-v1710d
S016825 ibm-cos-nola-v1710d
Tony Pearson
 
Storage networking fcf_co_eiscsivsn_technology
Storage networking fcf_co_eiscsivsn_technologyStorage networking fcf_co_eiscsivsn_technology
Storage networking fcf_co_eiscsivsn_technologyEMC
 
Pass DSDSDSDSDSDSDSDSSDSSDSDSSDDSDSDSDS .pdf
Pass DSDSDSDSDSDSDSDSSDSSDSDSSDDSDSDSDS .pdfPass DSDSDSDSDSDSDSDSSDSSDSDSSDDSDSDSDS .pdf
Pass DSDSDSDSDSDSDSDSSDSSDSDSSDDSDSDSDS .pdf
GaganaGowda31
 
20100401 정영임 da 전략 tft_0330
20100401 정영임 da 전략 tft_033020100401 정영임 da 전략 tft_0330
20100401 정영임 da 전략 tft_0330
glorykim
 
20100401 정영임 da 전략 tft_0330
20100401 정영임 da 전략 tft_033020100401 정영임 da 전략 tft_0330
20100401 정영임 da 전략 tft_0330
광영 김
 
NetApp Se training storage grid webscale technical overview
NetApp Se training   storage grid webscale technical overviewNetApp Se training   storage grid webscale technical overview
NetApp Se training storage grid webscale technical overview
solarisyougood
 
S104876 ibm-cos-jburg-v1809b
S104876 ibm-cos-jburg-v1809bS104876 ibm-cos-jburg-v1809b
S104876 ibm-cos-jburg-v1809b
Tony Pearson
 
S100299 ibm-cos-orlando-v1804c
S100299 ibm-cos-orlando-v1804cS100299 ibm-cos-orlando-v1804c
S100299 ibm-cos-orlando-v1804c
Tony Pearson
 

Similar to Chapter 9 (20)

Presentation atmos architecture overview
Presentation   atmos architecture overviewPresentation   atmos architecture overview
Presentation atmos architecture overview
 
Chapter 9
Chapter 9Chapter 9
Chapter 9
 
Rapid_Recovery-T75-v2204j.pdf
Rapid_Recovery-T75-v2204j.pdfRapid_Recovery-T75-v2204j.pdf
Rapid_Recovery-T75-v2204j.pdf
 
Object Broker Infrastructure for Wide Area Networks
Object Broker Infrastructure for Wide Area NetworksObject Broker Infrastructure for Wide Area Networks
Object Broker Infrastructure for Wide Area Networks
 
Chapter 6
Chapter 6Chapter 6
Chapter 6
 
IDT Replaces On-Premises Appliances with Primary Backup on AWS
 IDT Replaces On-Premises Appliances with Primary Backup on AWS IDT Replaces On-Premises Appliances with Primary Backup on AWS
IDT Replaces On-Premises Appliances with Primary Backup on AWS
 
Se training storage grid webscale technical overview
Se training   storage grid webscale technical overviewSe training   storage grid webscale technical overview
Se training storage grid webscale technical overview
 
Oracle E-Business Suite On Oracle Cloud
Oracle E-Business Suite On Oracle CloudOracle E-Business Suite On Oracle Cloud
Oracle E-Business Suite On Oracle Cloud
 
Oracle Cloud Infraestructure Update
Oracle Cloud Infraestructure UpdateOracle Cloud Infraestructure Update
Oracle Cloud Infraestructure Update
 
Oci meetup v1
Oci meetup v1Oci meetup v1
Oci meetup v1
 
Frank kramer ibm-data_management-for-adas-scale-usergroup-sin-032018
Frank kramer ibm-data_management-for-adas-scale-usergroup-sin-032018Frank kramer ibm-data_management-for-adas-scale-usergroup-sin-032018
Frank kramer ibm-data_management-for-adas-scale-usergroup-sin-032018
 
Oxford Common File Layout (OCFL)
Oxford Common File Layout (OCFL)Oxford Common File Layout (OCFL)
Oxford Common File Layout (OCFL)
 
S016825 ibm-cos-nola-v1710d
S016825 ibm-cos-nola-v1710dS016825 ibm-cos-nola-v1710d
S016825 ibm-cos-nola-v1710d
 
Storage networking fcf_co_eiscsivsn_technology
Storage networking fcf_co_eiscsivsn_technologyStorage networking fcf_co_eiscsivsn_technology
Storage networking fcf_co_eiscsivsn_technology
 
Pass DSDSDSDSDSDSDSDSSDSSDSDSSDDSDSDSDS .pdf
Pass DSDSDSDSDSDSDSDSSDSSDSDSSDDSDSDSDS .pdfPass DSDSDSDSDSDSDSDSSDSSDSDSSDDSDSDSDS .pdf
Pass DSDSDSDSDSDSDSDSSDSSDSDSSDDSDSDSDS .pdf
 
20100401 정영임 da 전략 tft_0330
20100401 정영임 da 전략 tft_033020100401 정영임 da 전략 tft_0330
20100401 정영임 da 전략 tft_0330
 
20100401 정영임 da 전략 tft_0330
20100401 정영임 da 전략 tft_033020100401 정영임 da 전략 tft_0330
20100401 정영임 da 전략 tft_0330
 
NetApp Se training storage grid webscale technical overview
NetApp Se training   storage grid webscale technical overviewNetApp Se training   storage grid webscale technical overview
NetApp Se training storage grid webscale technical overview
 
S104876 ibm-cos-jburg-v1809b
S104876 ibm-cos-jburg-v1809bS104876 ibm-cos-jburg-v1809b
S104876 ibm-cos-jburg-v1809b
 
S100299 ibm-cos-orlando-v1804c
S100299 ibm-cos-orlando-v1804cS100299 ibm-cos-orlando-v1804c
S100299 ibm-cos-orlando-v1804c
 

Recently uploaded

GraphRAG is All You need? LLM & Knowledge Graph
GraphRAG is All You need? LLM & Knowledge GraphGraphRAG is All You need? LLM & Knowledge Graph
GraphRAG is All You need? LLM & Knowledge Graph
Guy Korland
 
Software Delivery At the Speed of AI: Inflectra Invests In AI-Powered Quality
Software Delivery At the Speed of AI: Inflectra Invests In AI-Powered QualitySoftware Delivery At the Speed of AI: Inflectra Invests In AI-Powered Quality
Software Delivery At the Speed of AI: Inflectra Invests In AI-Powered Quality
Inflectra
 
ODC, Data Fabric and Architecture User Group
ODC, Data Fabric and Architecture User GroupODC, Data Fabric and Architecture User Group
ODC, Data Fabric and Architecture User Group
CatarinaPereira64715
 
To Graph or Not to Graph Knowledge Graph Architectures and LLMs
To Graph or Not to Graph Knowledge Graph Architectures and LLMsTo Graph or Not to Graph Knowledge Graph Architectures and LLMs
To Graph or Not to Graph Knowledge Graph Architectures and LLMs
Paul Groth
 
PHP Frameworks: I want to break free (IPC Berlin 2024)
PHP Frameworks: I want to break free (IPC Berlin 2024)PHP Frameworks: I want to break free (IPC Berlin 2024)
PHP Frameworks: I want to break free (IPC Berlin 2024)
Ralf Eggert
 
The Future of Platform Engineering
The Future of Platform EngineeringThe Future of Platform Engineering
The Future of Platform Engineering
Jemma Hussein Allen
 
Assuring Contact Center Experiences for Your Customers With ThousandEyes
Assuring Contact Center Experiences for Your Customers With ThousandEyesAssuring Contact Center Experiences for Your Customers With ThousandEyes
Assuring Contact Center Experiences for Your Customers With ThousandEyes
ThousandEyes
 
Builder.ai Founder Sachin Dev Duggal's Strategic Approach to Create an Innova...
Builder.ai Founder Sachin Dev Duggal's Strategic Approach to Create an Innova...Builder.ai Founder Sachin Dev Duggal's Strategic Approach to Create an Innova...
Builder.ai Founder Sachin Dev Duggal's Strategic Approach to Create an Innova...
Ramesh Iyer
 
"Impact of front-end architecture on development cost", Viktor Turskyi
"Impact of front-end architecture on development cost", Viktor Turskyi"Impact of front-end architecture on development cost", Viktor Turskyi
"Impact of front-end architecture on development cost", Viktor Turskyi
Fwdays
 
Mission to Decommission: Importance of Decommissioning Products to Increase E...
Mission to Decommission: Importance of Decommissioning Products to Increase E...Mission to Decommission: Importance of Decommissioning Products to Increase E...
Mission to Decommission: Importance of Decommissioning Products to Increase E...
Product School
 
State of ICS and IoT Cyber Threat Landscape Report 2024 preview
State of ICS and IoT Cyber Threat Landscape Report 2024 previewState of ICS and IoT Cyber Threat Landscape Report 2024 preview
State of ICS and IoT Cyber Threat Landscape Report 2024 preview
Prayukth K V
 
UiPath Test Automation using UiPath Test Suite series, part 4
UiPath Test Automation using UiPath Test Suite series, part 4UiPath Test Automation using UiPath Test Suite series, part 4
UiPath Test Automation using UiPath Test Suite series, part 4
DianaGray10
 
Essentials of Automations: Optimizing FME Workflows with Parameters
Essentials of Automations: Optimizing FME Workflows with ParametersEssentials of Automations: Optimizing FME Workflows with Parameters
Essentials of Automations: Optimizing FME Workflows with Parameters
Safe Software
 
FIDO Alliance Osaka Seminar: Overview.pdf
FIDO Alliance Osaka Seminar: Overview.pdfFIDO Alliance Osaka Seminar: Overview.pdf
FIDO Alliance Osaka Seminar: Overview.pdf
FIDO Alliance
 
Kubernetes & AI - Beauty and the Beast !?! @KCD Istanbul 2024
Kubernetes & AI - Beauty and the Beast !?! @KCD Istanbul 2024Kubernetes & AI - Beauty and the Beast !?! @KCD Istanbul 2024
Kubernetes & AI - Beauty and the Beast !?! @KCD Istanbul 2024
Tobias Schneck
 
AI for Every Business: Unlocking Your Product's Universal Potential by VP of ...
AI for Every Business: Unlocking Your Product's Universal Potential by VP of ...AI for Every Business: Unlocking Your Product's Universal Potential by VP of ...
AI for Every Business: Unlocking Your Product's Universal Potential by VP of ...
Product School
 
When stars align: studies in data quality, knowledge graphs, and machine lear...
When stars align: studies in data quality, knowledge graphs, and machine lear...When stars align: studies in data quality, knowledge graphs, and machine lear...
When stars align: studies in data quality, knowledge graphs, and machine lear...
Elena Simperl
 
Designing Great Products: The Power of Design and Leadership by Chief Designe...
Designing Great Products: The Power of Design and Leadership by Chief Designe...Designing Great Products: The Power of Design and Leadership by Chief Designe...
Designing Great Products: The Power of Design and Leadership by Chief Designe...
Product School
 
De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...
De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...
De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...
Product School
 
Bits & Pixels using AI for Good.........
Bits & Pixels using AI for Good.........Bits & Pixels using AI for Good.........
Bits & Pixels using AI for Good.........
Alison B. Lowndes
 

Recently uploaded (20)

GraphRAG is All You need? LLM & Knowledge Graph
GraphRAG is All You need? LLM & Knowledge GraphGraphRAG is All You need? LLM & Knowledge Graph
GraphRAG is All You need? LLM & Knowledge Graph
 
Software Delivery At the Speed of AI: Inflectra Invests In AI-Powered Quality
Software Delivery At the Speed of AI: Inflectra Invests In AI-Powered QualitySoftware Delivery At the Speed of AI: Inflectra Invests In AI-Powered Quality
Software Delivery At the Speed of AI: Inflectra Invests In AI-Powered Quality
 
ODC, Data Fabric and Architecture User Group
ODC, Data Fabric and Architecture User GroupODC, Data Fabric and Architecture User Group
ODC, Data Fabric and Architecture User Group
 
To Graph or Not to Graph Knowledge Graph Architectures and LLMs
To Graph or Not to Graph Knowledge Graph Architectures and LLMsTo Graph or Not to Graph Knowledge Graph Architectures and LLMs
To Graph or Not to Graph Knowledge Graph Architectures and LLMs
 
PHP Frameworks: I want to break free (IPC Berlin 2024)
PHP Frameworks: I want to break free (IPC Berlin 2024)PHP Frameworks: I want to break free (IPC Berlin 2024)
PHP Frameworks: I want to break free (IPC Berlin 2024)
 
The Future of Platform Engineering
The Future of Platform EngineeringThe Future of Platform Engineering
The Future of Platform Engineering
 
Assuring Contact Center Experiences for Your Customers With ThousandEyes
Assuring Contact Center Experiences for Your Customers With ThousandEyesAssuring Contact Center Experiences for Your Customers With ThousandEyes
Assuring Contact Center Experiences for Your Customers With ThousandEyes
 
Builder.ai Founder Sachin Dev Duggal's Strategic Approach to Create an Innova...
Builder.ai Founder Sachin Dev Duggal's Strategic Approach to Create an Innova...Builder.ai Founder Sachin Dev Duggal's Strategic Approach to Create an Innova...
Builder.ai Founder Sachin Dev Duggal's Strategic Approach to Create an Innova...
 
"Impact of front-end architecture on development cost", Viktor Turskyi
"Impact of front-end architecture on development cost", Viktor Turskyi"Impact of front-end architecture on development cost", Viktor Turskyi
"Impact of front-end architecture on development cost", Viktor Turskyi
 
Mission to Decommission: Importance of Decommissioning Products to Increase E...
Mission to Decommission: Importance of Decommissioning Products to Increase E...Mission to Decommission: Importance of Decommissioning Products to Increase E...
Mission to Decommission: Importance of Decommissioning Products to Increase E...
 
State of ICS and IoT Cyber Threat Landscape Report 2024 preview
State of ICS and IoT Cyber Threat Landscape Report 2024 previewState of ICS and IoT Cyber Threat Landscape Report 2024 preview
State of ICS and IoT Cyber Threat Landscape Report 2024 preview
 
UiPath Test Automation using UiPath Test Suite series, part 4
UiPath Test Automation using UiPath Test Suite series, part 4UiPath Test Automation using UiPath Test Suite series, part 4
UiPath Test Automation using UiPath Test Suite series, part 4
 
Essentials of Automations: Optimizing FME Workflows with Parameters
Essentials of Automations: Optimizing FME Workflows with ParametersEssentials of Automations: Optimizing FME Workflows with Parameters
Essentials of Automations: Optimizing FME Workflows with Parameters
 
FIDO Alliance Osaka Seminar: Overview.pdf
FIDO Alliance Osaka Seminar: Overview.pdfFIDO Alliance Osaka Seminar: Overview.pdf
FIDO Alliance Osaka Seminar: Overview.pdf
 
Kubernetes & AI - Beauty and the Beast !?! @KCD Istanbul 2024
Kubernetes & AI - Beauty and the Beast !?! @KCD Istanbul 2024Kubernetes & AI - Beauty and the Beast !?! @KCD Istanbul 2024
Kubernetes & AI - Beauty and the Beast !?! @KCD Istanbul 2024
 
AI for Every Business: Unlocking Your Product's Universal Potential by VP of ...
AI for Every Business: Unlocking Your Product's Universal Potential by VP of ...AI for Every Business: Unlocking Your Product's Universal Potential by VP of ...
AI for Every Business: Unlocking Your Product's Universal Potential by VP of ...
 
When stars align: studies in data quality, knowledge graphs, and machine lear...
When stars align: studies in data quality, knowledge graphs, and machine lear...When stars align: studies in data quality, knowledge graphs, and machine lear...
When stars align: studies in data quality, knowledge graphs, and machine lear...
 
Designing Great Products: The Power of Design and Leadership by Chief Designe...
Designing Great Products: The Power of Design and Leadership by Chief Designe...Designing Great Products: The Power of Design and Leadership by Chief Designe...
Designing Great Products: The Power of Design and Leadership by Chief Designe...
 
De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...
De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...
De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...
 
Bits & Pixels using AI for Good.........
Bits & Pixels using AI for Good.........Bits & Pixels using AI for Good.........
Bits & Pixels using AI for Good.........
 

Chapter 9

  • 1. © 2009 EMC Corporation. All rights reserved. EMC Proven Professional The #1 Certification Program in the information storage and management industry Content Addressed Storage Chapter 9 Section 2 : Storage Networking Technologies and Virtualization
  • 2. © 2009 EMC Corporation. All rights reserved. Chapter Objectives Upon completion of this chapter, you will be able to: o Describe CAS, fixed content and archives, traditional storage solutions for archive o Describe the features and benefits of a CAS based storage strategy o List the physical and logical elements of CAS o Describe the storage and retrieval process for CAS data objects o Describe the best suited operational environments for CAS solutions
  • 3. © 2009 EMC Corporation. All rights reserved. Lesson: CAS Overview Upon completion of this lesson, you be able to: o Define fixed content o Describe traditional archival solutions and its shortcoming o Define Content Addressed Storage (CAS) o List benefits of CAS
  • 4. © 2009 EMC Corporation. All rights reserved. What are Fixed Content and Archives Electronic Documents •Contracts, claims, etc. •E-mail and attachments •Financial spread sheets •CAD/CAM designs •Presentations Digital Records •Documents – Checks, securities trades – Historical preservation •Photographs – Personal / professional •Surveys – Seismic, astronomic, geographic Digital Assets Retained For Active Reference And Value Leverage Historical Value Improve Service Levels Generate New Revenues Rich Media •Medical – X-rays, MRIs, CTI •Video – News / media, movies – Security surveillance •Audio – Voicemail – Radio
  • 5. © 2009 EMC Corporation. All rights reserved. Challenges of Storing Fixed Content o Fixed content is growing at more than 90% annually o Significant amount of newly created information falls into this category o New regulations require retention and data protection o Often, long-term preservation is required (years-decades) o Simultaneous multi-user online access is preferable to offline storage o Need faster access to fixed content o Need for location independent data, enabling technology refresh and migration o Traditional storage methods are inadequate
  • 6. © 2009 EMC Corporation. All rights reserved. Traditional storage solutions for Archive o Three categories of archival solution are: o Online, nearline, and offline based on the means of access o Traditional archival solution were offline o Traditional archival process used optical disks and tapes as media for archival o An archive is often stored on a Write Once Read Many (WORM) device, such as a CD-ROM
  • 7. © 2009 EMC Corporation. All rights reserved. Shortcomings of Traditional Archiving Solutions o Tape is slow, and standards are always changing o Optical is expensive, and requires vast amounts of media o Recovering files from tape and optical is often time consuming o Data on tape and optical is subject to media degradation o Both solution require sophisticated media management CAS has emerged as an alternative to traditional archiving solutions
  • 8. © 2009 EMC Corporation. All rights reserved. What is Content Addressed Storage (CAS) o Object-oriented, location-independent approach to data storage o Repository for the “Objects” o Access mechanism to interface with repository o Globally unique identifiers provide access to objects
  • 9. © 2009 EMC Corporation. All rights reserved. Benefits of CAS o Content authenticity o Content integrity o Location independence o Single-instance storage (SiS) o Retention enforcement o Record-level protection and disposition o Technology independence o Fast record retrieval Additional Task Research on role of CAS in ILM Strategy
  • 10. © 2009 EMC Corporation. All rights reserved. Lesson Summary Key points covered in this lesson: o CAS Definition o Challenges of Storing Fixed Content o Shortcomings of Traditional Archiving Solutions o Benefits of CAS
  • 11. © 2009 EMC Corporation. All rights reserved. Lesson: CAS Architecture Upon completion of this lesson, you will be able to: o Describe CAS architecture o Describe Physical and logical elements of CAS o Describe data storage and retrieval process in CAS environment o CAS examples
  • 12. © 2009 EMC Corporation. All rights reserved. Physical Elements of CAS o Storage devices (CAS Based) o Storage node o Access node o Servers (to which storage devices get connected) o Client Server Private LAN Storage Nodes Access Nodes CAS System IP API
  • 13. © 2009 EMC Corporation. All rights reserved. CAS Terminology o Application Programming Interface (API) o A set of function calls that enables communication between applications or between an application and an operating system o BLOB (Binary Large Object) o The actual data without the descriptive information (metadata) o The Distinct Bit Sequence (DBS) of user data represents the actual content of a file and is independent of the filename and physical location API
  • 14. © 2009 EMC Corporation. All rights reserved. CAS Terminology (Cont) o C-Clip o A package containing the user's data and associated metadata o C-Clip ID (C-Clip handle or C-Clip reference) is the CA that the system returns to the client application o Content Address (CA) o An identifier that uniquely addresses the content of a file and not its location. Unlike location-based addresses, content addresses are inherently stable and, once calculated, they never change and always refer to the same content o C-Clip Descriptor File (CDF) o The additional XML file that the system creates when making a C-Clip. This file includes the content addresses for all referenced BLOBs and associated metadata
  • 15. © 2009 EMC Corporation. All rights reserved. API How CAS Stores a Data Object Unique Content Address is calculated Client presents data to API to be archived CAS System Client Application Server CDF C-Clip (Object) Object is sent to CAS System via CAS API over IP
  • 16. © 2009 EMC Corporation. All rights reserved. API How CAS Stores a Data Object Unique Content Address is calculated Client presents data to API to be archived CAS System Client Application Server Object is sent to CAS System via CAS API over IP Object CAS System validates the Content Address and stores the object Acknowledgement returned to application Clip ID is retained and stored for future use
  • 17. © 2009 EMC Corporation. All rights reserved. Application Server Client CAS System Object is needed by an application 1 CAS authenticates the request and delivers the object 4 Application finds Content Address of object to be retrieved 2 Retrieval request is sent to the CAS System via CAS API over IP 3 How CAS Retrieves a Data Object API C-Clip ID
  • 18. © 2009 EMC Corporation. All rights reserved. CAS Features o Features available with most CAS systems are: o Integrity checking o Data protection o Local replication o Remote replication o Load balancing o Scalability o Self-diagnosis and repair o Report generation and event notification o Fault tolerance o Through the use of redundant components and data protection schemes o Audit trails o Documentation of management activities, access and disposition of data
  • 19. © 2009 EMC Corporation. All rights reserved. Example 1: CAS Healthcare Solution o Each X-ray image ranges from about 15MB to over 1GB o Patient record is stored online for a period of 60-90 days o Beyond 90 days patient records are archived Data Stored on CAS Patient Studies Stored locally for Short-Term Use (60 Days) Hospital CAS SystemApplication Server API
  • 20. © 2009 EMC Corporation. All rights reserved. Example 2: CAS Financial Solution o Check image size is about 25KB o Check imaging service provider may process 50–90 million check images per month o Checks are stored online for a period of 60 days o Beyond 60 days data is archived Bank CAS SystemApplication Server API
  • 21. © 2009 EMC Corporation. All rights reserved. Lesson Summary Key points covered in this lesson: o CAS architecture o Physical and logical elements of CAS o CAS storage and retrieval process o CAS solution examples
  • 22. © 2009 EMC Corporation. All rights reserved. Chapter Summary Key points covered in this chapter: o Benefits of CAS based storage strategy o Overview of physical and logical elements of CAS o Storing and retrieving data from CAS o CAS application examples
  • 23. © 2009 EMC Corporation. All rights reserved. #1 IT company For more information visit http://education.EMC.com

Editor's Notes

  1. Copyright © 2009 EMC Corporation. Do not Copy - All Rights Reserved. Content Addressed Storage -
  2. Copyright © 2009 EMC Corporation. Do not Copy - All Rights Reserved. Content Addressed Storage - The objectives for this chapter are shown here. Please take a moment to read them.
  3. Copyright © 2009 EMC Corporation. Do not Copy - All Rights Reserved. Content Addressed Storage - The objectives for this lesson are shown here. Please take a moment to read them.
  4. Copyright © 2009 EMC Corporation. Do not Copy - All Rights Reserved. Content Addressed Storage -
  5. Copyright © 2009 EMC Corporation. Do not Copy - All Rights Reserved. Content Addressed Storage -
  6. Copyright © 2009 EMC Corporation. Do not Copy - All Rights Reserved. Content Addressed Storage -
  7. Copyright © 2009 EMC Corporation. Do not Copy - All Rights Reserved. Content Addressed Storage -
  8. Copyright © 2009 EMC Corporation. Do not Copy - All Rights Reserved. Content Addressed Storage -
  9. Copyright © 2009 EMC Corporation. Do not Copy - All Rights Reserved. Content Addressed Storage -
  10. Copyright © 2009 EMC Corporation. Do not Copy - All Rights Reserved. Content Addressed Storage -
  11. Copyright © 2009 EMC Corporation. Do not Copy - All Rights Reserved. Content Addressed Storage -
  12. Copyright © 2009 EMC Corporation. Do not Copy - All Rights Reserved. Content Addressed Storage -
  13. Copyright © 2009 EMC Corporation. Do not Copy - All Rights Reserved. Content Addressed Storage -
  14. Copyright © 2009 EMC Corporation. Do not Copy - All Rights Reserved. Content Addressed Storage -
  15. Copyright © 2009 EMC Corporation. Do not Copy - All Rights Reserved. Content Addressed Storage -
  16. Copyright © 2009 EMC Corporation. Do not Copy - All Rights Reserved. Content Addressed Storage -
  17. Copyright © 2009 EMC Corporation. Do not Copy - All Rights Reserved. Content Addressed Storage -
  18. Copyright © 2009 EMC Corporation. Do not Copy - All Rights Reserved. Content Addressed Storage -
  19. Copyright © 2009 EMC Corporation. Do not Copy - All Rights Reserved. Content Addressed Storage -
  20. Copyright © 2009 EMC Corporation. Do not Copy - All Rights Reserved. Content Addressed Storage -
  21. Copyright © 2009 EMC Corporation. Do not Copy - All Rights Reserved. Content Addressed Storage -
  22. Copyright © 2009 EMC Corporation. Do not Copy - All Rights Reserved. Content Addressed Storage -
  23. Copyright © 2009 EMC Corporation. Do not Copy - All Rights Reserved. Content Addressed Storage -