SlideShare a Scribd company logo
1 of 27
PAGE1
www.DocuLynx.com
Intelligent Cloud Storage Enablement
Presented by Mike Wipperfeld
July 22, 2014
PAGE2
www.DocuLynx.com
Challenges
• Data continues to grow
exponentially – especially
with the advent of social
content
• Approximately 70% of data is
unstructured
• Impact on
- Storage costs and
management
- Data Protection SLAs
• New deployment options
such as cloud provide
alternatives
Corporate knowledge has
grown from 75 Exabyte in
2007 to 580 Exabyte in 2011
or a CAGR of 67%
- Forrester
PAGE3
www.DocuLynx.com
• What files ?
• How ?
• Approx. 65% of the total volume
is rarely used
• Challenge:
– Identify the rarely accessed data
– Transport to target
– Keep available to users /applications
• Results:
– Optimized storage utilization
– Reduced operational and
administrative costs
ON PREMISE STORAGE
Typical Scenario
Window
ServerNAS Filer
NetApp
PAGE4
www.DocuLynx.com
you could classify information based on external
attributes: analyze, monitor, report?
you could select and move files based on centrally defined
policies to second tier storage in the cloud?
you could make it completely transparent?
you could address compliance and access controls
through classification based on content analysis
Challenges
PAGE5
www.DocuLynx.com
• You can migrate files – transparent for the user - into
multiple storage locations
• Users and applications will not notice any difference
• With cloud storage, you can integrate the public
cloud with on-premises storage to:
• reduce datacenter infrastructure complexity
• maximize data protection
• reduce overall storage total cost of ownership (TCO) by
60-80%
• provision storage more rapidly to reclaim IT time cycles
• With the optional DocuClassify feature, the selection
of files can be done based on its content and
business values
Enabling the fast introduction of integrated cloud storage
without disturbing users and IT operations
Intelligent Content Management
More Intelligent Content Management
Window
ServerNAS Filer
NetApp
Cloud Storage
PAGE6
www.DocuLynx.com
• Analysis & Classification
Analysis and classification of the file inventory
with customer-defined rules.
• Migration
Creation of a copy on secondary storage in
accordance with rules based on criteria such
as meta-data or content-based classification.
• Release
Exchange of the original file with a small
reference file which looks and acts just like the
original.
• Recall
Restoration of the original file from secondary
storage upon request.
After the initial set-up, these processes run independent and transparently in the background.
Analysis &
Classification
ReleaseRecall
Migration
Identifying
the right files
Moving
files transparently to
secondary storage
Storage Optimization –
Functional principle in four processes
PAGE7
www.DocuLynx.com
Purpose of the assessment:
•Analyze the current file structure
•Illustrate results in multiple ways for examination
•Make recommendations regarding future HSM or archiving options
•Make recommendations about optimal usage of file classification
Quantitative Analysis
Number and volume of:
•Server
•Disks
•Shares
•Directories
•Capacity Growth
Qualitative Analysis
•Distribution by Age
•Distribution by Size
•HSM/Archiving Simulation
•File types
•Users
•Groups
•Duplicates
File Assessment–
Purpose and scope
PAGE8
www.DocuLynx.com
Analysis –
Date Analysis & Assessment
Quantitative
• Server
• Disks
• Shares
• Directories
• Capacity Growth
• Forecast
Qualitative Analysis
• Distribution by Age
• Distribution by Size
• HSM Simulation
• Analysis of Data Types
• User Distribution
• Group Distribution
• Duplicates
PAGE9
www.DocuLynx.com
Filter for “last modified” > 360 days
> 700 GB can be released
Simulate Results of a File Migration
Quantify Storage Space Savings
PAGE10
www.DocuLynx.com
Continuous Tracking of Storage Growth
Trigger action when threshold is reached
PAGE11
www.DocuLynx.com
Primary Storage
Secondary Storage
Filter Options
PAGE12
www.DocuLynx.com
• Reference files are transparent to end-users and applications
• No change in file name, extension, size or date
• Reference files act like files, meaning they can be copied, moved, renamed…
• Applications can work with the archived files without any customisation requirements
The only visible changes:
Attribute “O” for offline files and a clock symbol on the icon -> NTFS native feature
Reference Files
A Small But Important Detail
PAGE13
www.DocuLynx.com
Migration Adapter for NetApp
Providing HSM functionality for NetApp filers
CC Node
Responsible for all DocuFile Migration Adapter
operations and communication with the
DocuSuite Control Center
FPolicy Node
Responsible for file recalls via the FPolicy
Server
FPolicy Server
Detects access to files residing on the ONTAP
file system of the NetApp file server. In case a
reference file is accessed, the DocuFile
Migration Adapter initiates a recall from
secondary storage
Typical Deployment and Main Components
PAGE14
www.DocuLynx.com
• Meta data based classification enables high
performance on high volumes of data
• Pattern Matching is performant and exact when
using classification criteria that can be expressed
in regular expressions
• Machine Learning with linguistic-statistical
analysis has universal applicability and delivers
strong results with fuzzy classification criteria
• Partnerships with
– KPMG
– Deloitte
– Fontis International
Content Based Classification
Enhances migration precision and effectiveness
metadata-
based
metadata-
based
......
Machine
Learning
Machine
Learning
Pattern
Matching
Pattern
Matching
PAGE15
www.DocuLynx.com
• Critical information that should
not be moved to the cloud.
• Sensitive information that needs to
be encrypted before moving it to
the cloud.
• Important information that needs to
be archived.
• Passive information that is little
used and could be moved to IRM
immediately.
DocuClassifyDocuClassify
Identifying the Right Files
Only Classification Enables Automation
PAGE16
www.DocuLynx.com
Keep it simple
The Classification Cube® is a concept for an
universal classification scheme for companies of
all kinds which…
– is easy enough to be implemented
quickly,
– is flexible enough to be adjusted and
expanded in a simple way and
– meets the most significant and most
urgent requirements.
Our cube is the key guideline which helps to
finish the project successfully together with the
document classes.
A cube as the solution
Easy Implementation with the Classification Cube®
A Universal Technical Approach
PAGE17
www.DocuLynx.com
(1) Any information object brings
Metadata (e.g. location, name, creator
etc.) with it. The actual set can vary
with the type of information.
(2) Classification rule assigns a document
type to the information object. Each DT
comes with a set of properties which
are used by the classification rule. The
property values enrich the metadata of
the information object.
(3) Any application can – based on the
classification expressed in the
properties values of a specific
information object – trigger an action
(e.g. archiving) for this object. This
could be other DocuSuite modules or 3rd
party applications.
Document
Class
Document
Class Classification
Properties
Classification
Properties
MetadataMetadata
(1)
Classification
Rules
Information
Object
(File, Mail, SP, ...)
Information
Object
(File, Mail, SP, ...)
(2)
Property
Values
Property
Values
(3rd
Party)
Actions
(3rd
Party)
Actions
(2)
(3)
Document Class Metadata Properties Actions
Invoice Location Retention If (retention>0) then archive
Engineering Plan (CAD) Location
User
Project ID
Restricted Access
If (Restricted Access) then block access on mobile devices
Document Classes and the Classification Cube
The Document Class Model
PAGE18
www.DocuLynx.com
BC
AC
Security SettingSecurity Setting: Confidential: Confidential
ComplianceCompliance : 5 year retention: 5 year retention
Summary Project Document
Process of Tagging to the Information Object thru the ADS or Security Descriptor
PAGE19
www.DocuLynx.com
SharePoint
Office
SharePoint
…
SharePoint
Office
SharePoint
…
Target Archives
Variable Retention period
1 Year
5 Years
10 Years
25 years
…
Target Archives
Variable Retention period
1 Year
5 Years
10 Years
25 years
…
1. Selection
2. Transport
3. Report
Generation
Classification and
File Movement
File/SharePoint Object Migration
PAGE20
www.DocuLynx.com
Introduce Cloud Storage using
intelligent content
management as an enabler
• Identify files to offload
based on metadata such as
age, location, type, ...
• Immediately start to offload
your primary storage
• Harvest your benefits e.g.
reduced storage TCO, better
backup and DR
Add automatic classification for
even more benefits
• Introduce file classification
classes based on business
demand
• Use self-learning, trainable
classifier to automatically
classify all files
• Automatically attach business
value to every file and use it
as a trigger to offload files
Leverage classification for
security and governance task
• Introduce Dynamic Access
Control to ease access rights
management
• Increase security by
automatic encryption of
confidential files
• Boost effectiveness of any
DLP solution
Starting small you can expand the capabilities of this system
from storage optimization to classification based information
governance.
Starting small you can expand the capabilities of this system
from storage optimization to classification based information
governance.
Progression:
Introduction and Expansion
PAGE21
www.DocuLynx.com
Intelligent Content Management
•Significant cost reduction in storage
infrastructure
•Easy implementation with minimal admin
effort
•High degree of automation
•Permanent availability of all relevant data
•Highly scalable
•Fulfillment of legal requirements
•Higher transparency of unstructured data
•Optimize use of resources and investments
•Implementation of rule-based Information
Lifecycle Management (ILM)
Classification
•Automated classification fulfills regulatory
compliance and information security
requirements relative to encryption and access
control
•Mitigate risk due to knowledge of the value of
your information
•Boosts in productivity due to higher
transparency levels
•Movement toward value-based information
management
Company Confidential
Summary Advantages
PAGE22
www.DocuLynx.com
• Large medium-sized company based in the
automation technology sector
• Head of Information Management Server
• 80 TB on Windows Server 2008 R2 and 2012
• Few relevant datasets should be moved to an
external service provider
• Motivation: optimization of costs and change
in awareness within the business
departments
• No transparency in terms of data relevance
Challenge
• Initial on-site visit: More precise analysis of
the demand, especially in terms of granularity
• Decision: classic HSM (without classification)
vs. intelligent HSM (with classification)
• Presentation file assessment as starting point
for analysis
• Solution:
– migration for transparent moving of data
– classification (external attribute analysis,
reporting) and optional content classification
ApproachCustomer
Concrete Customer Request With Regard to Classifiction
Storage Optimization through moving of “irrelevant” data
PAGE23
www.DocuLynx.com
– Railway Engines
– Trams
– Train Engines
– High Speed Trains
– Transport Solutions
Rail Systems
– Trucks
– Buses
– Engine Brakes
– Special Products
Commercial Brake Systems
The Knorr-Bremse Group is the
world’s leading manufacturer of
braking systems for rail and
commercial vehicles, vehicle air
conditioning systems and
torsional vibration dampers
Customer Reference
PAGE24
www.DocuLynx.com
– 120 TB total storage
– 16.000 users, 120 locations
– 8000 SAP users, 1000 CAD users
Key metrics:
– 6 NetApp V-Filer, 1 Windows File Server
– Growth rate 30% per year
– 24 TB ( 26,4 Million Files) managed by
DocuFile
– Tier 1 Storage growth -> BU / Recovery
cost increase
– Only a fraction of the files is in active use
2012
24 TB
2013
32 TB
2014
43 TB
2015
55 TB
Active Active Active Active
Typical Challenge
PAGE25
www.DocuLynx.com
Secondary Storage Tier
Primary Storage
– 39% of the managed storage
volume has been migrated
– Individual quota between
Server/Filer varies between 21%
and 62%
– Successfully increased migration
quota by fine tuning the migration
policies
– Back-Up and Snapshot performance
vastly improved
21 – 62% migrated
Results
PAGE26
www.DocuLynx.com
DocuLynx Portfolio
Addressing Informance Governance and Lifecycle Challenges
DocuLynx Portfolio
 Enterprise Information Infrastructure
Solutions
• DocuLynx 360- a Content Services Platform for
Smart Applications
• Active Information Archiving Platform
o DocuSuite
o DocHarbor/Haven
• DocuClassify
• DocuSearch
 Enterprise Information Application
Solutions
• AP Automation
• E-Signature
 Information Conversion Services
(eg.Scanning)
PAGE27
www.DocuLynx.com
Thank You!

More Related Content

What's hot

Adopting the open group cloud eco system reference model
Adopting the open group cloud eco system reference modelAdopting the open group cloud eco system reference model
Adopting the open group cloud eco system reference modelKrishna-Kumar
 
Cloud Resource Management
Cloud Resource ManagementCloud Resource Management
Cloud Resource ManagementNASIRSAYYED4
 
Cloud computing 8 cloud service models
Cloud computing 8 cloud service modelsCloud computing 8 cloud service models
Cloud computing 8 cloud service modelsVaibhav Khanna
 
Understanding the cloud computing stack
Understanding the cloud computing stackUnderstanding the cloud computing stack
Understanding the cloud computing stackSatish Chavan
 
Cloud computing Fundamentals - behind the hood of cloud platforms
Cloud computing Fundamentals - behind the hood of cloud platformsCloud computing Fundamentals - behind the hood of cloud platforms
Cloud computing Fundamentals - behind the hood of cloud platformshnandy
 
An introduction to the cloud 11 v1
An introduction to the cloud 11 v1An introduction to the cloud 11 v1
An introduction to the cloud 11 v1charan7575
 
Chap 4 platform as a service (paa s)
Chap 4 platform as a service (paa s)Chap 4 platform as a service (paa s)
Chap 4 platform as a service (paa s)Raj Sarode
 
Gary Homeland Security Presentation 102114
Gary Homeland Security Presentation 102114Gary Homeland Security Presentation 102114
Gary Homeland Security Presentation 102114Gary Dischner
 
Introduction to cloud computing
Introduction to cloud computingIntroduction to cloud computing
Introduction to cloud computingRevathi Ram
 
Lect15 cloud
Lect15 cloudLect15 cloud
Lect15 cloudsree raj
 
Lect15 cloud
Lect15 cloudLect15 cloud
Lect15 cloudVan Pham
 

What's hot (20)

Adopting the open group cloud eco system reference model
Adopting the open group cloud eco system reference modelAdopting the open group cloud eco system reference model
Adopting the open group cloud eco system reference model
 
Cloud Architecture
Cloud ArchitectureCloud Architecture
Cloud Architecture
 
Cloud Resource Management
Cloud Resource ManagementCloud Resource Management
Cloud Resource Management
 
Cloud Computing
Cloud ComputingCloud Computing
Cloud Computing
 
Sla in cloud
Sla in cloudSla in cloud
Sla in cloud
 
Cloud computing 8 cloud service models
Cloud computing 8 cloud service modelsCloud computing 8 cloud service models
Cloud computing 8 cloud service models
 
Understanding the cloud computing stack
Understanding the cloud computing stackUnderstanding the cloud computing stack
Understanding the cloud computing stack
 
Cloud computing Fundamentals - behind the hood of cloud platforms
Cloud computing Fundamentals - behind the hood of cloud platformsCloud computing Fundamentals - behind the hood of cloud platforms
Cloud computing Fundamentals - behind the hood of cloud platforms
 
Multi Cloud Architecture Approach
Multi Cloud Architecture ApproachMulti Cloud Architecture Approach
Multi Cloud Architecture Approach
 
An introduction to the cloud 11 v1
An introduction to the cloud 11 v1An introduction to the cloud 11 v1
An introduction to the cloud 11 v1
 
Chap 4 platform as a service (paa s)
Chap 4 platform as a service (paa s)Chap 4 platform as a service (paa s)
Chap 4 platform as a service (paa s)
 
Gary Homeland Security Presentation 102114
Gary Homeland Security Presentation 102114Gary Homeland Security Presentation 102114
Gary Homeland Security Presentation 102114
 
ERP IN CLOUD
ERP IN CLOUDERP IN CLOUD
ERP IN CLOUD
 
Cloud
CloudCloud
Cloud
 
Lect15 cloud
Lect15 cloudLect15 cloud
Lect15 cloud
 
Introduction to cloud computing
Introduction to cloud computingIntroduction to cloud computing
Introduction to cloud computing
 
Lect15 cloud
Lect15 cloudLect15 cloud
Lect15 cloud
 
Lect15 cloud
Lect15 cloudLect15 cloud
Lect15 cloud
 
cloud computng
cloud computng cloud computng
cloud computng
 
Lect15 cloud
Lect15 cloudLect15 cloud
Lect15 cloud
 

Similar to Intelligent Cloud Enablement

ADV Slides: The Evolution of the Data Platform and What It Means to Enterpris...
ADV Slides: The Evolution of the Data Platform and What It Means to Enterpris...ADV Slides: The Evolution of the Data Platform and What It Means to Enterpris...
ADV Slides: The Evolution of the Data Platform and What It Means to Enterpris...DATAVERSITY
 
Multi-faceted Classification of Big Data Use Cases and Proposed Architecture ...
Multi-faceted Classification of Big Data Use Cases and Proposed Architecture ...Multi-faceted Classification of Big Data Use Cases and Proposed Architecture ...
Multi-faceted Classification of Big Data Use Cases and Proposed Architecture ...Geoffrey Fox
 
Data Fabric - Why Should Organizations Implement a Logical and Not a Physical...
Data Fabric - Why Should Organizations Implement a Logical and Not a Physical...Data Fabric - Why Should Organizations Implement a Logical and Not a Physical...
Data Fabric - Why Should Organizations Implement a Logical and Not a Physical...Denodo
 
Simplifying Your Cloud Architecture with a Logical Data Fabric (APAC)
Simplifying Your Cloud Architecture with a Logical Data Fabric (APAC)Simplifying Your Cloud Architecture with a Logical Data Fabric (APAC)
Simplifying Your Cloud Architecture with a Logical Data Fabric (APAC)Denodo
 
The state of global research data initiatives: observations from a life on th...
The state of global research data initiatives: observations from a life on th...The state of global research data initiatives: observations from a life on th...
The state of global research data initiatives: observations from a life on th...Projeto RCAAP
 
ADV Slides: Building and Growing Organizational Analytics with Data Lakes
ADV Slides: Building and Growing Organizational Analytics with Data LakesADV Slides: Building and Growing Organizational Analytics with Data Lakes
ADV Slides: Building and Growing Organizational Analytics with Data LakesDATAVERSITY
 
The Shifting Landscape of Data Integration
The Shifting Landscape of Data IntegrationThe Shifting Landscape of Data Integration
The Shifting Landscape of Data IntegrationDATAVERSITY
 
DocuClassify - AutoClassification at its best
DocuClassify - AutoClassification at its bestDocuClassify - AutoClassification at its best
DocuClassify - AutoClassification at its bestDocuLynx
 
Big Data Evolution
Big Data EvolutionBig Data Evolution
Big Data Evolutionitnewsafrica
 
Data accessibilityandchallenges
Data accessibilityandchallengesData accessibilityandchallenges
Data accessibilityandchallengesjyotikhadake
 
InfoDNA Everteam houston breakfast 06.29.17
InfoDNA Everteam houston breakfast 06.29.17InfoDNA Everteam houston breakfast 06.29.17
InfoDNA Everteam houston breakfast 06.29.17Everteam
 
Blackboard Learn Deployment: A Detailed Update of Managed Hosting and SaaS De...
Blackboard Learn Deployment: A Detailed Update of Managed Hosting and SaaS De...Blackboard Learn Deployment: A Detailed Update of Managed Hosting and SaaS De...
Blackboard Learn Deployment: A Detailed Update of Managed Hosting and SaaS De...Blackboard APAC
 
Shaping the Role of a Data Lake in a Modern Data Fabric Architecture
Shaping the Role of a Data Lake in a Modern Data Fabric ArchitectureShaping the Role of a Data Lake in a Modern Data Fabric Architecture
Shaping the Role of a Data Lake in a Modern Data Fabric ArchitectureDenodo
 
EMC InfoArchive Overview: Offered by Sigma
EMC InfoArchive Overview: Offered by SigmaEMC InfoArchive Overview: Offered by Sigma
EMC InfoArchive Overview: Offered by SigmaJonathan Simpson
 
Agile enterprise analytics on aws
Agile enterprise analytics on awsAgile enterprise analytics on aws
Agile enterprise analytics on awsDon Gillis
 
ADV Slides: Platforming Your Data for Success – Databases, Hadoop, Managed Ha...
ADV Slides: Platforming Your Data for Success – Databases, Hadoop, Managed Ha...ADV Slides: Platforming Your Data for Success – Databases, Hadoop, Managed Ha...
ADV Slides: Platforming Your Data for Success – Databases, Hadoop, Managed Ha...DATAVERSITY
 
OpenStack Swift In the Enterprise
OpenStack Swift In the EnterpriseOpenStack Swift In the Enterprise
OpenStack Swift In the EnterpriseHostway|HOSTING
 
Research Data Management at the University of Salford
Research Data Management at the University of SalfordResearch Data Management at the University of Salford
Research Data Management at the University of SalfordDavid Clay
 

Similar to Intelligent Cloud Enablement (20)

ADV Slides: The Evolution of the Data Platform and What It Means to Enterpris...
ADV Slides: The Evolution of the Data Platform and What It Means to Enterpris...ADV Slides: The Evolution of the Data Platform and What It Means to Enterpris...
ADV Slides: The Evolution of the Data Platform and What It Means to Enterpris...
 
Data Domain-Driven Design
Data Domain-Driven DesignData Domain-Driven Design
Data Domain-Driven Design
 
Multi-faceted Classification of Big Data Use Cases and Proposed Architecture ...
Multi-faceted Classification of Big Data Use Cases and Proposed Architecture ...Multi-faceted Classification of Big Data Use Cases and Proposed Architecture ...
Multi-faceted Classification of Big Data Use Cases and Proposed Architecture ...
 
Data Fabric - Why Should Organizations Implement a Logical and Not a Physical...
Data Fabric - Why Should Organizations Implement a Logical and Not a Physical...Data Fabric - Why Should Organizations Implement a Logical and Not a Physical...
Data Fabric - Why Should Organizations Implement a Logical and Not a Physical...
 
Simplifying Your Cloud Architecture with a Logical Data Fabric (APAC)
Simplifying Your Cloud Architecture with a Logical Data Fabric (APAC)Simplifying Your Cloud Architecture with a Logical Data Fabric (APAC)
Simplifying Your Cloud Architecture with a Logical Data Fabric (APAC)
 
The state of global research data initiatives: observations from a life on th...
The state of global research data initiatives: observations from a life on th...The state of global research data initiatives: observations from a life on th...
The state of global research data initiatives: observations from a life on th...
 
ADV Slides: Building and Growing Organizational Analytics with Data Lakes
ADV Slides: Building and Growing Organizational Analytics with Data LakesADV Slides: Building and Growing Organizational Analytics with Data Lakes
ADV Slides: Building and Growing Organizational Analytics with Data Lakes
 
The Shifting Landscape of Data Integration
The Shifting Landscape of Data IntegrationThe Shifting Landscape of Data Integration
The Shifting Landscape of Data Integration
 
DocuClassify - AutoClassification at its best
DocuClassify - AutoClassification at its bestDocuClassify - AutoClassification at its best
DocuClassify - AutoClassification at its best
 
Big Data Evolution
Big Data EvolutionBig Data Evolution
Big Data Evolution
 
Data accessibilityandchallenges
Data accessibilityandchallengesData accessibilityandchallenges
Data accessibilityandchallenges
 
InfoDNA Everteam houston breakfast 06.29.17
InfoDNA Everteam houston breakfast 06.29.17InfoDNA Everteam houston breakfast 06.29.17
InfoDNA Everteam houston breakfast 06.29.17
 
The Untethered Enterprise - Synchronizing Content Across Multiple Storage Pla...
The Untethered Enterprise - Synchronizing Content Across Multiple Storage Pla...The Untethered Enterprise - Synchronizing Content Across Multiple Storage Pla...
The Untethered Enterprise - Synchronizing Content Across Multiple Storage Pla...
 
Blackboard Learn Deployment: A Detailed Update of Managed Hosting and SaaS De...
Blackboard Learn Deployment: A Detailed Update of Managed Hosting and SaaS De...Blackboard Learn Deployment: A Detailed Update of Managed Hosting and SaaS De...
Blackboard Learn Deployment: A Detailed Update of Managed Hosting and SaaS De...
 
Shaping the Role of a Data Lake in a Modern Data Fabric Architecture
Shaping the Role of a Data Lake in a Modern Data Fabric ArchitectureShaping the Role of a Data Lake in a Modern Data Fabric Architecture
Shaping the Role of a Data Lake in a Modern Data Fabric Architecture
 
EMC InfoArchive Overview: Offered by Sigma
EMC InfoArchive Overview: Offered by SigmaEMC InfoArchive Overview: Offered by Sigma
EMC InfoArchive Overview: Offered by Sigma
 
Agile enterprise analytics on aws
Agile enterprise analytics on awsAgile enterprise analytics on aws
Agile enterprise analytics on aws
 
ADV Slides: Platforming Your Data for Success – Databases, Hadoop, Managed Ha...
ADV Slides: Platforming Your Data for Success – Databases, Hadoop, Managed Ha...ADV Slides: Platforming Your Data for Success – Databases, Hadoop, Managed Ha...
ADV Slides: Platforming Your Data for Success – Databases, Hadoop, Managed Ha...
 
OpenStack Swift In the Enterprise
OpenStack Swift In the EnterpriseOpenStack Swift In the Enterprise
OpenStack Swift In the Enterprise
 
Research Data Management at the University of Salford
Research Data Management at the University of SalfordResearch Data Management at the University of Salford
Research Data Management at the University of Salford
 

Recently uploaded

Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsKotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticscarlostorres15106
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationSlibray Presentation
 
Pigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping ElbowsPigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping ElbowsPigging Solutions
 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsMemoori
 
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks..."LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...Fwdays
 
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024BookNet Canada
 
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 3652toLead Limited
 
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsKotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsAndrey Dotsenko
 
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersEnhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersThousandEyes
 
Pigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food ManufacturingPigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food ManufacturingPigging Solutions
 
My INSURER PTE LTD - Insurtech Innovation Award 2024
My INSURER PTE LTD - Insurtech Innovation Award 2024My INSURER PTE LTD - Insurtech Innovation Award 2024
My INSURER PTE LTD - Insurtech Innovation Award 2024The Digital Insurer
 
Install Stable Diffusion in windows machine
Install Stable Diffusion in windows machineInstall Stable Diffusion in windows machine
Install Stable Diffusion in windows machinePadma Pradeep
 
Snow Chain-Integrated Tire for a Safe Drive on Winter Roads
Snow Chain-Integrated Tire for a Safe Drive on Winter RoadsSnow Chain-Integrated Tire for a Safe Drive on Winter Roads
Snow Chain-Integrated Tire for a Safe Drive on Winter RoadsHyundai Motor Group
 
costume and set research powerpoint presentation
costume and set research powerpoint presentationcostume and set research powerpoint presentation
costume and set research powerpoint presentationphoebematthew05
 
Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupFlorian Wilhelm
 
New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024BookNet Canada
 
Build your next Gen AI Breakthrough - April 2024
Build your next Gen AI Breakthrough - April 2024Build your next Gen AI Breakthrough - April 2024
Build your next Gen AI Breakthrough - April 2024Neo4j
 

Recently uploaded (20)

Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsKotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck Presentation
 
Pigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping ElbowsPigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping Elbows
 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial Buildings
 
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks..."LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
 
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
 
Hot Sexy call girls in Panjabi Bagh 🔝 9953056974 🔝 Delhi escort Service
Hot Sexy call girls in Panjabi Bagh 🔝 9953056974 🔝 Delhi escort ServiceHot Sexy call girls in Panjabi Bagh 🔝 9953056974 🔝 Delhi escort Service
Hot Sexy call girls in Panjabi Bagh 🔝 9953056974 🔝 Delhi escort Service
 
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
 
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsKotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
 
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersEnhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
 
Pigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food ManufacturingPigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food Manufacturing
 
My INSURER PTE LTD - Insurtech Innovation Award 2024
My INSURER PTE LTD - Insurtech Innovation Award 2024My INSURER PTE LTD - Insurtech Innovation Award 2024
My INSURER PTE LTD - Insurtech Innovation Award 2024
 
Install Stable Diffusion in windows machine
Install Stable Diffusion in windows machineInstall Stable Diffusion in windows machine
Install Stable Diffusion in windows machine
 
Snow Chain-Integrated Tire for a Safe Drive on Winter Roads
Snow Chain-Integrated Tire for a Safe Drive on Winter RoadsSnow Chain-Integrated Tire for a Safe Drive on Winter Roads
Snow Chain-Integrated Tire for a Safe Drive on Winter Roads
 
costume and set research powerpoint presentation
costume and set research powerpoint presentationcostume and set research powerpoint presentation
costume and set research powerpoint presentation
 
Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project Setup
 
The transition to renewables in India.pdf
The transition to renewables in India.pdfThe transition to renewables in India.pdf
The transition to renewables in India.pdf
 
New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
 
Build your next Gen AI Breakthrough - April 2024
Build your next Gen AI Breakthrough - April 2024Build your next Gen AI Breakthrough - April 2024
Build your next Gen AI Breakthrough - April 2024
 
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptxE-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
 

Intelligent Cloud Enablement

  • 1. PAGE1 www.DocuLynx.com Intelligent Cloud Storage Enablement Presented by Mike Wipperfeld July 22, 2014
  • 2. PAGE2 www.DocuLynx.com Challenges • Data continues to grow exponentially – especially with the advent of social content • Approximately 70% of data is unstructured • Impact on - Storage costs and management - Data Protection SLAs • New deployment options such as cloud provide alternatives Corporate knowledge has grown from 75 Exabyte in 2007 to 580 Exabyte in 2011 or a CAGR of 67% - Forrester
  • 3. PAGE3 www.DocuLynx.com • What files ? • How ? • Approx. 65% of the total volume is rarely used • Challenge: – Identify the rarely accessed data – Transport to target – Keep available to users /applications • Results: – Optimized storage utilization – Reduced operational and administrative costs ON PREMISE STORAGE Typical Scenario Window ServerNAS Filer NetApp
  • 4. PAGE4 www.DocuLynx.com you could classify information based on external attributes: analyze, monitor, report? you could select and move files based on centrally defined policies to second tier storage in the cloud? you could make it completely transparent? you could address compliance and access controls through classification based on content analysis Challenges
  • 5. PAGE5 www.DocuLynx.com • You can migrate files – transparent for the user - into multiple storage locations • Users and applications will not notice any difference • With cloud storage, you can integrate the public cloud with on-premises storage to: • reduce datacenter infrastructure complexity • maximize data protection • reduce overall storage total cost of ownership (TCO) by 60-80% • provision storage more rapidly to reclaim IT time cycles • With the optional DocuClassify feature, the selection of files can be done based on its content and business values Enabling the fast introduction of integrated cloud storage without disturbing users and IT operations Intelligent Content Management More Intelligent Content Management Window ServerNAS Filer NetApp Cloud Storage
  • 6. PAGE6 www.DocuLynx.com • Analysis & Classification Analysis and classification of the file inventory with customer-defined rules. • Migration Creation of a copy on secondary storage in accordance with rules based on criteria such as meta-data or content-based classification. • Release Exchange of the original file with a small reference file which looks and acts just like the original. • Recall Restoration of the original file from secondary storage upon request. After the initial set-up, these processes run independent and transparently in the background. Analysis & Classification ReleaseRecall Migration Identifying the right files Moving files transparently to secondary storage Storage Optimization – Functional principle in four processes
  • 7. PAGE7 www.DocuLynx.com Purpose of the assessment: •Analyze the current file structure •Illustrate results in multiple ways for examination •Make recommendations regarding future HSM or archiving options •Make recommendations about optimal usage of file classification Quantitative Analysis Number and volume of: •Server •Disks •Shares •Directories •Capacity Growth Qualitative Analysis •Distribution by Age •Distribution by Size •HSM/Archiving Simulation •File types •Users •Groups •Duplicates File Assessment– Purpose and scope
  • 8. PAGE8 www.DocuLynx.com Analysis – Date Analysis & Assessment Quantitative • Server • Disks • Shares • Directories • Capacity Growth • Forecast Qualitative Analysis • Distribution by Age • Distribution by Size • HSM Simulation • Analysis of Data Types • User Distribution • Group Distribution • Duplicates
  • 9. PAGE9 www.DocuLynx.com Filter for “last modified” > 360 days > 700 GB can be released Simulate Results of a File Migration Quantify Storage Space Savings
  • 10. PAGE10 www.DocuLynx.com Continuous Tracking of Storage Growth Trigger action when threshold is reached
  • 12. PAGE12 www.DocuLynx.com • Reference files are transparent to end-users and applications • No change in file name, extension, size or date • Reference files act like files, meaning they can be copied, moved, renamed… • Applications can work with the archived files without any customisation requirements The only visible changes: Attribute “O” for offline files and a clock symbol on the icon -> NTFS native feature Reference Files A Small But Important Detail
  • 13. PAGE13 www.DocuLynx.com Migration Adapter for NetApp Providing HSM functionality for NetApp filers CC Node Responsible for all DocuFile Migration Adapter operations and communication with the DocuSuite Control Center FPolicy Node Responsible for file recalls via the FPolicy Server FPolicy Server Detects access to files residing on the ONTAP file system of the NetApp file server. In case a reference file is accessed, the DocuFile Migration Adapter initiates a recall from secondary storage Typical Deployment and Main Components
  • 14. PAGE14 www.DocuLynx.com • Meta data based classification enables high performance on high volumes of data • Pattern Matching is performant and exact when using classification criteria that can be expressed in regular expressions • Machine Learning with linguistic-statistical analysis has universal applicability and delivers strong results with fuzzy classification criteria • Partnerships with – KPMG – Deloitte – Fontis International Content Based Classification Enhances migration precision and effectiveness metadata- based metadata- based ...... Machine Learning Machine Learning Pattern Matching Pattern Matching
  • 15. PAGE15 www.DocuLynx.com • Critical information that should not be moved to the cloud. • Sensitive information that needs to be encrypted before moving it to the cloud. • Important information that needs to be archived. • Passive information that is little used and could be moved to IRM immediately. DocuClassifyDocuClassify Identifying the Right Files Only Classification Enables Automation
  • 16. PAGE16 www.DocuLynx.com Keep it simple The Classification Cube® is a concept for an universal classification scheme for companies of all kinds which… – is easy enough to be implemented quickly, – is flexible enough to be adjusted and expanded in a simple way and – meets the most significant and most urgent requirements. Our cube is the key guideline which helps to finish the project successfully together with the document classes. A cube as the solution Easy Implementation with the Classification Cube® A Universal Technical Approach
  • 17. PAGE17 www.DocuLynx.com (1) Any information object brings Metadata (e.g. location, name, creator etc.) with it. The actual set can vary with the type of information. (2) Classification rule assigns a document type to the information object. Each DT comes with a set of properties which are used by the classification rule. The property values enrich the metadata of the information object. (3) Any application can – based on the classification expressed in the properties values of a specific information object – trigger an action (e.g. archiving) for this object. This could be other DocuSuite modules or 3rd party applications. Document Class Document Class Classification Properties Classification Properties MetadataMetadata (1) Classification Rules Information Object (File, Mail, SP, ...) Information Object (File, Mail, SP, ...) (2) Property Values Property Values (3rd Party) Actions (3rd Party) Actions (2) (3) Document Class Metadata Properties Actions Invoice Location Retention If (retention>0) then archive Engineering Plan (CAD) Location User Project ID Restricted Access If (Restricted Access) then block access on mobile devices Document Classes and the Classification Cube The Document Class Model
  • 18. PAGE18 www.DocuLynx.com BC AC Security SettingSecurity Setting: Confidential: Confidential ComplianceCompliance : 5 year retention: 5 year retention Summary Project Document Process of Tagging to the Information Object thru the ADS or Security Descriptor
  • 19. PAGE19 www.DocuLynx.com SharePoint Office SharePoint … SharePoint Office SharePoint … Target Archives Variable Retention period 1 Year 5 Years 10 Years 25 years … Target Archives Variable Retention period 1 Year 5 Years 10 Years 25 years … 1. Selection 2. Transport 3. Report Generation Classification and File Movement File/SharePoint Object Migration
  • 20. PAGE20 www.DocuLynx.com Introduce Cloud Storage using intelligent content management as an enabler • Identify files to offload based on metadata such as age, location, type, ... • Immediately start to offload your primary storage • Harvest your benefits e.g. reduced storage TCO, better backup and DR Add automatic classification for even more benefits • Introduce file classification classes based on business demand • Use self-learning, trainable classifier to automatically classify all files • Automatically attach business value to every file and use it as a trigger to offload files Leverage classification for security and governance task • Introduce Dynamic Access Control to ease access rights management • Increase security by automatic encryption of confidential files • Boost effectiveness of any DLP solution Starting small you can expand the capabilities of this system from storage optimization to classification based information governance. Starting small you can expand the capabilities of this system from storage optimization to classification based information governance. Progression: Introduction and Expansion
  • 21. PAGE21 www.DocuLynx.com Intelligent Content Management •Significant cost reduction in storage infrastructure •Easy implementation with minimal admin effort •High degree of automation •Permanent availability of all relevant data •Highly scalable •Fulfillment of legal requirements •Higher transparency of unstructured data •Optimize use of resources and investments •Implementation of rule-based Information Lifecycle Management (ILM) Classification •Automated classification fulfills regulatory compliance and information security requirements relative to encryption and access control •Mitigate risk due to knowledge of the value of your information •Boosts in productivity due to higher transparency levels •Movement toward value-based information management Company Confidential Summary Advantages
  • 22. PAGE22 www.DocuLynx.com • Large medium-sized company based in the automation technology sector • Head of Information Management Server • 80 TB on Windows Server 2008 R2 and 2012 • Few relevant datasets should be moved to an external service provider • Motivation: optimization of costs and change in awareness within the business departments • No transparency in terms of data relevance Challenge • Initial on-site visit: More precise analysis of the demand, especially in terms of granularity • Decision: classic HSM (without classification) vs. intelligent HSM (with classification) • Presentation file assessment as starting point for analysis • Solution: – migration for transparent moving of data – classification (external attribute analysis, reporting) and optional content classification ApproachCustomer Concrete Customer Request With Regard to Classifiction Storage Optimization through moving of “irrelevant” data
  • 23. PAGE23 www.DocuLynx.com – Railway Engines – Trams – Train Engines – High Speed Trains – Transport Solutions Rail Systems – Trucks – Buses – Engine Brakes – Special Products Commercial Brake Systems The Knorr-Bremse Group is the world’s leading manufacturer of braking systems for rail and commercial vehicles, vehicle air conditioning systems and torsional vibration dampers Customer Reference
  • 24. PAGE24 www.DocuLynx.com – 120 TB total storage – 16.000 users, 120 locations – 8000 SAP users, 1000 CAD users Key metrics: – 6 NetApp V-Filer, 1 Windows File Server – Growth rate 30% per year – 24 TB ( 26,4 Million Files) managed by DocuFile – Tier 1 Storage growth -> BU / Recovery cost increase – Only a fraction of the files is in active use 2012 24 TB 2013 32 TB 2014 43 TB 2015 55 TB Active Active Active Active Typical Challenge
  • 25. PAGE25 www.DocuLynx.com Secondary Storage Tier Primary Storage – 39% of the managed storage volume has been migrated – Individual quota between Server/Filer varies between 21% and 62% – Successfully increased migration quota by fine tuning the migration policies – Back-Up and Snapshot performance vastly improved 21 – 62% migrated Results
  • 26. PAGE26 www.DocuLynx.com DocuLynx Portfolio Addressing Informance Governance and Lifecycle Challenges DocuLynx Portfolio  Enterprise Information Infrastructure Solutions • DocuLynx 360- a Content Services Platform for Smart Applications • Active Information Archiving Platform o DocuSuite o DocHarbor/Haven • DocuClassify • DocuSearch  Enterprise Information Application Solutions • AP Automation • E-Signature  Information Conversion Services (eg.Scanning)

Editor's Notes

  1. Cover today Oftne seen at our customers What files How transport
  2. Using content based classification – later – much higher precision – compliance Metadata based classificatio – fast Content based classification – higher precision Handle files based on the business value – not based on storage location and file naming – may be wrong anyway
  3. Dg file - external meta data Optional dg classification – content ( regex / key words /document cklasses)
  4. Alle Prozesse laufen im Prinzip zeitgesteuert. Bei der Migration gibt es zwei Methoden: Migration nach Archivierung Ereignisgesteuerte Migration (Bei Erreichen von Füllständen)
  5. Part of a policyCreating efficient filters for migration Levels Fiel sets ( type of files ) customized Typical all HSM meta data; Group membership, User name Most effective + precise -> classification Property and values
  6. Dg classification is an add on to dg file. Dg file can operate stand alone Filter can be based not only on external meta data Also on classification Properties / Stanrads FCI is a Standard by Microsoft back in 2008 Further enhanced with server 2012 There are different methods to classify files Even manual classification – has some significant downsides dg classification allows flexible configuration of different methods In our experience: meta data based for high performance – providing fst results Content based using training sets for high precision, automatic and in teh background
  7. After classification comes the follow on process Example for our demo is archiving based on classification properties /values
  8. We typically implement in phases Phase 1 – assessment and use external metadata – fast ersults Phase 2 – use automatic classification for higher precision _ handel files according to busienss value / not only based onname and location