Consolidate More: High Performance Primary Deduplication in the Age of Abundant Capacity

Increase productivity, efficiency and environmental savings by eliminating silos, preventing sprawl and reducing complexity by 50%. Using powerful consolidation systems (Hitachi Unified Storage or Hitachi NAS Platform) lets you consolidate existing file servers and NAS devices onto fewer nodes. You can perform the same or even more work with fewer devices and lower overhead, while reducing floor space and associated power and cooling costs. View this webcast to learn how to:

  • Shrink your primary file data without disrupting performance.
  • Increase productivity and utilization of available capacity.
  • Defer additional storage purchases.
  • Save on power, cooling and space costs.

For more information, please visit: http://www.hds.com/products/file-and-content/network-attached-storage/?WT.ac=us_inside_rm_htchunfds

Published in: Technology
Transcript of "Consolidate More: High Performance Primary Deduplication in the Age of Abundant Capacity"

  1. CONSOLIDATE MORE: HIGH-PERFORMANCE PRIMARY DEDUPLICATION IN THE AGE OF ABUNDANT CAPACITY
     YONG KIM, TECHNICAL DIRECTOR, AMERICAS FILE AND CONTENT SOLUTIONS
  2. CONSOLIDATE MORE: HIGH-PERFORMANCE PRIMARY DEDUPLICATION IN THE AGE OF ABUNDANT CAPACITY (WEBTECH EDUCATIONAL SERIES)
     Increase productivity, efficiency and environmental savings by eliminating silos, preventing sprawl and reducing complexity by 50%. Using powerful consolidation systems (Hitachi Unified Storage or Hitachi NAS Platform) lets you consolidate existing file servers and NAS devices onto fewer nodes. You can perform the same or even more work with fewer devices and lower overhead, while reducing floor space and associated power and cooling costs.
     Attend this webcast to learn how to:
     • Shrink your primary file data without disrupting performance.
     • Increase productivity and utilization of available capacity.
     • Defer additional storage purchases.
     • Save on power, cooling and space costs.
  3. UPCOMING WEBTECHS
     • Cloud and Object Store Series
       ‒ Environmental Pressures Driving an Evolution in File Storage, April 3, 9 a.m. PT, noon ET
     • Big Data Webcast Series Continues
       ‒ Big Data: Shining the Light on Enterprise Dark Data, April 17, 9 a.m. PT, noon ET
       ‒ HDS Big Data Roadmap, May 1, 9 a.m. PT, noon ET
     Check www.hds.com/webtech for:
     • Links to the recording, the presentation, and Q&A (available next week)
     • Schedule and registration for upcoming WebTech sessions
  4. CUSTOMER CHALLENGES
     How do I:
     • Reduce the cost of storing data?
     • Reduce the cost of protecting data?
     • Manage distributed IT more effectively?
     • Mitigate data risk?
     • Gain IT agility?
     • Do more with less?
     Archive first. Back up less. Consolidate more.
  5. TIE IT ALL TOGETHER: THE POWER OF THE PORTFOLIO
     [Diagram labels: edge storage, alternate data center, remote office, applications, HDDS search across, cloud storage, mobile workforce, HUS file module]
     • Reduce the cost of storing data
     • Reduce the cost of protecting data
     • Do more with less
     • Back up less
     • Consolidate more
     • Reduce overall storage costs by reducing the load on primary storage by at least 40%
     • Reduce licensing and management cost, complexity and backup by up to 75%
  6. TIE IT ALL TOGETHER: THE POWER OF THE PORTFOLIO
     [Diagram labels: edge storage, alternate data center, remote office, applications, HDDS search across, cloud storage, mobile workforce, HUS file module]
     • Reduce the cost of storing data
     • Reduce the cost of protecting data
     • Do more with less
     • Consolidate more
     • Streamline backup and restore operations by 50-60%
     • Improve reliability with >24x improvement in RPO, >30x improvement in RTO
     • Simplify management, improve data protection and reduce risk
  7. TIE IT ALL TOGETHER: THE POWER OF THE PORTFOLIO
     [Diagram labels: edge storage, alternate data center, remote office, applications, HDDS search across, cloud storage, mobile workforce, HUS file module]
     • Reduce the cost of storing data
     • Reduce the cost of protecting data
     • Do more with less
     • Simplify management, improve data protection and reduce risk
     • Reduce or eliminate CAPEX, simplify management, offload data from primary storage
  8. DATA GROWTH CONTINUES UNABATED …
     • Data growth: Doubling every 18 months
       ‒ Unstructured data (files) growing even faster
     • Worldwide data creation exceeded 1 zettabyte in 2010 for the first time
     • 40% projected growth in global data generated per year vs. 5% growth in global IT spend *
  9. … FAR LESS OF THE DATA IS UNIQUE
     • 75% duplicate data (IDC)
     • 80% (McKinsey Global Institute)
  10. DATA DEDUPLICATION: WHAT IS IT?
      • A storage-optimization technology
        ‒ Compression, SIS
      • Reduces data by eliminating multiple copies of redundant data and only keeping unique data
      • First made popular for backup (secondary) devices, the technology has been extended to support primary storage
  11. DATA DEDUPLICATION: BUSINESS AND CUSTOMER BENEFITS
      • The challenge for companies today continues to be the cost of storage and its associated costs
      Data storage
      • Optimizing data in place and reducing the on-disk footprint of data as it is stored provides immediate savings in capacity and new disk expenditures
      • Drives down the cost per gigabyte: store more information in the same gigabyte
      Data management
      • Reduces the volume of data that needs to be managed
      • Reduces the frequency of backups
  12. WHAT MARKET RESEARCH FIRMS ARE SAYING
  13. WHAT MARKET RESEARCH FIRMS ARE SAYING
      “… the percentage of deployments with must-have primary dedupe ‘requirements’ will reach anywhere from 5% (conservative) to 22% (aggressive) by 2015”
  14. WHAT HAPPENS DURING DEDUPLICATION
      [Figure panel: After Dedupe]
  15. DEDUPE IN ACTION
      • Owner: Steve; File Name: homesteveworkReport.doc; Size: 15 4K blocks
      • Owner: Paul; File Name: homepaulproforma.xls; Size: 5 4K blocks
      • Owner: John; File Name: homejohntmpMyReport.doc; Size: 17 4K blocks
      • Owner: Mary; File Name: homemarySteveReport.doc; Size: 17 4K blocks
      Without dedupe: 54 4K blocks consumed
      With dedupe: only 24 4K blocks consumed
      30 4K blocks (> 50%) reclaimed
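
The block counts on the slide above can be checked with a few lines of arithmetic. The sketch below is only an illustration: the per-owner counts and the 24-block result are taken from the slide, while the function name `dedupe_savings` is made up for this example. It does not model which blocks are shared between the files, since the slide does not say.

```python
# Block counts per file owner, as listed on the slide (each block is 4K).
blocks_per_file = {"Steve": 15, "Paul": 5, "John": 17, "Mary": 17}

# Blocks consumed after dedupe, as quoted on the slide.
blocks_after_dedupe = 24


def dedupe_savings(blocks_before, blocks_after):
    """Return (blocks reclaimed, fraction of the original space reclaimed)."""
    reclaimed = blocks_before - blocks_after
    return reclaimed, reclaimed / blocks_before


total_blocks = sum(blocks_per_file.values())
reclaimed, fraction = dedupe_savings(total_blocks, blocks_after_dedupe)

print(f"Without dedupe: {total_blocks} blocks")
print(f"With dedupe:    {blocks_after_dedupe} blocks")
print(f"Reclaimed:      {reclaimed} blocks ({fraction:.1%})")
```

The reclaimed fraction works out to 30 of 54 blocks, which is consistent with the "> 50%" figure on the slide.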
  16. HITACHI NAS PLATFORM (HNAS) DEDUPLICATION OVERVIEW
      • Leverages the unique HNAS hybrid core technology
        ‒ Indexing engine via CPU
        ‒ VLSI/FPGA technology
      • A SHA-256 hash of each data block is used to detect duplicates
      • Up to 4 SHA calculators running in parallel
      • Post-processing
        ‒ File system data is analyzed and processed for dedupe after it is written to disk
        ‒ Not in the data path: no risk of losing or being unable to access data
      • A dedupe index is used to store and identify duplicate blocks in a file system
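
As a rough illustration of the post-processing approach described above, the following Python sketch hashes fixed-size blocks with SHA-256 after they have been "written" and keeps one copy of each unique block behind an index. It is not the HNAS implementation, which the slide says relies on VLSI/FPGA hardware; the 4K block size, the in-memory dictionaries and all function names are assumptions chosen for the example.

```python
import hashlib

BLOCK_SIZE = 4096  # assumed 4K blocks, matching the earlier slide example


def split_blocks(data, size=BLOCK_SIZE):
    """Cut a byte string into fixed-size blocks (the last one may be shorter)."""
    return [data[i:i + size] for i in range(0, len(data), size)]


def post_process_dedupe(files):
    """Scan already-written files and keep one copy of each unique block.

    `files` maps a file name to its bytes. Returns the unique block store
    and, per file, the list of positions in that store (the "pointers").
    """
    index = {}     # SHA-256 digest -> position of the block in `store`
    store = []     # unique blocks that stay on disk
    pointers = {}  # file name -> positions in `store`
    for name, data in files.items():
        refs = []
        for block in split_blocks(data):
            digest = hashlib.sha256(block).digest()
            if digest not in index:
                index[digest] = len(store)
                store.append(block)
            refs.append(index[digest])
        pointers[name] = refs
    return store, pointers


def read_file(store, refs):
    """Rebuild a file's contents by following its block pointers."""
    return b"".join(store[i] for i in refs)


# Hypothetical sample data: five logical blocks, three of them unique.
files = {
    "report_a": b"A" * 8192 + b"B" * 4096,
    "report_b": b"A" * 4096 + b"C" * 4096,
}
store, pointers = post_process_dedupe(files)
print(len(store), pointers)
```

Because the scan runs over data that is already stored, reads and writes never wait on it, which is the point the slide makes about staying out of the data path.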
  17. IT’S ALL ABOUT POINTERS
      • Maximum (in theory) dedupe ratio is 239:1
        ‒ An HNAS block can be shared up to 239 times
        ‒ For example, 478 duplicate blocks dedupe down to 2 blocks …
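
The sharing limit lends itself to a quick calculation. Assuming, as the slide states, that one block can be shared up to 239 times, the minimum number of physical blocks for a run of identical blocks is the count divided by 239, rounded up. The helper name below is hypothetical.

```python
import math

MAX_SHARES_PER_BLOCK = 239  # sharing limit quoted on the slide


def physical_blocks_needed(duplicate_blocks, max_shares=MAX_SHARES_PER_BLOCK):
    """Minimum physical blocks needed to hold `duplicate_blocks` identical blocks."""
    if duplicate_blocks <= 0:
        return 0
    return math.ceil(duplicate_blocks / max_shares)


print(physical_blocks_needed(478))  # 2, the example from the slide
print(physical_blocks_needed(239))  # 1
print(physical_blocks_needed(240))  # 2
```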
  18. DEDUPE EFFICIENCY
      • Typical enterprises: 2:1 to 5:1 data reduction
      • Higher in virtual environments
      [Chart: savings by dedupe ratio]
      • 2:1 = 50% savings
      • 4:1 = 75% savings
      • 5:1 = 80% savings
      • 10:1 = 90% savings
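
The savings figures in the chart follow from the ratio: an N:1 reduction leaves 1/N of the data on disk, so the saving is (N - 1)/N. A small sketch, with a hypothetical function name:

```python
def savings_percent(ratio):
    """Percent of capacity saved for a dedupe ratio of `ratio`:1."""
    if ratio < 1:
        raise ValueError("a dedupe ratio below 1:1 is not meaningful")
    return 100 * (ratio - 1) / ratio


for ratio in (2, 4, 5, 10):
    print(f"{ratio}:1 -> {savings_percent(ratio):.0f}% savings")
```

The curve flattens quickly: going from 2:1 to 5:1 adds 30 points of savings, while going from 5:1 to 10:1 adds only 10.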
  19. COMPETITIVE DIFFERENTIATION
      • Extreme performance
        ‒ Up to 450MB/sec post-ingest throughput rate
      • Dynamic quality of service (QoS)
        ‒ Avoid impacting application I/O
        ‒ Less degradation of I/O performance
      • Simple to use
        ‒ Not a hardware appliance
        ‒ Little to no administration required
  20. QOS / THROTTLING MECHANISMS
      • Automatic background process
        ‒ No complex scheduling process
        ‒ 24/7 operation
        ‒ No disruption to workflow
      • QoS/auto throttling
        ‒ When the file serving load passes beyond 50% (of available IOPS or throughput capacity), the engine throttles back
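
One way to read the throttling rule is as a simple threshold check. The sketch below is an interpretation, not the HNAS logic: it assumes load is measured as a fraction of IOPS capacity and of throughput capacity, and that exceeding 50% on either one triggers the throttle. The function and parameter names are invented for the example, and the slide does not say how far the engine backs off.

```python
THROTTLE_THRESHOLD = 0.50  # the 50% figure quoted on the slide


def load_fraction(current, capacity):
    """Current load as a fraction of available capacity."""
    if capacity <= 0:
        raise ValueError("capacity must be positive")
    return current / capacity


def should_throttle(iops, iops_capacity, mbps, mbps_capacity,
                    threshold=THROTTLE_THRESHOLD):
    """True when file-serving load passes the threshold on IOPS or throughput."""
    busiest = max(load_fraction(iops, iops_capacity),
                  load_fraction(mbps, mbps_capacity))
    return busiest > threshold


# Hypothetical load samples against made-up capacities.
print(should_throttle(40_000, 100_000, 300, 1_000))  # both below 50%
print(should_throttle(60_000, 100_000, 300, 1_000))  # IOPS above 50%
```

A background process that checks this condition before each batch of work needs no schedule, which fits the "automatic background process" description on the slide.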
  21. RESULTS FROM A LEADING MANUFACTURER: REAL-WORLD RESULTS
      “The current HNAS algorithm appears to be far better than others (competitors) tested”
  22. TIE IT ALL TOGETHER: THE POWER OF THE PORTFOLIO
      [Diagram labels: edge storage, alternate data center, remote office, applications, cloud storage, HDDS search across]
      • Reduce the cost of storing data
      • Reduce the cost of protecting data
      • Do more with less
      • Archive first
      • Back up less
      • Consolidate more
      The intelligent archive is the strong foundation of the 21st Century data center
  23. QUESTIONS AND DISCUSSION
  24. UPCOMING WEBTECHS
      • Cloud and Object Store Series
        ‒ Environmental Pressures Driving an Evolution in File Storage, April 3, 9 a.m. PT, noon ET
      • Big Data Webcast Series Continues
        ‒ Big Data: Shining the Light on Enterprise Dark Data, April 17, 9 a.m. PT, noon ET
        ‒ HDS Big Data Roadmap, May 1, 9 a.m. PT, noon ET
      Check www.hds.com/webtech for:
      • Links to the recording, the presentation, and Q&A (available next week)
      • Schedule and registration for upcoming WebTech sessions
  25. THANK YOU