SlideShare a Scribd company logo
1 of 36
Download to read offline
Tes$ng	
  Storage	
  Systems:	
  	
  
Methodology	
  &	
  Common	
  Pi8alls	
  
John	
  Constable	
  	
  
Wellcome	
  Trust	
  Sanger	
  Ins$tute	
  
@kript	
  /	
  jc18@sanger.ac.uk	
  
#lisa14storage	
  
	
  
What	
  I	
  wish	
  I	
  knew	
  when	
  I	
  started	
  	
  
tes$ng	
  storage	
  systems	
  three	
  years	
  ago	
  
Intro	
  
Covering;	
  
intro,	
  challenges,	
  experiences,	
  lessons	
  learned,	
  $ps	
  and	
  tricks,	
  denouement	
  	
  
The	
  Sanger	
  Ins,tute	
  
	
  
	
  
Funded	
  by	
  Wellcome	
  Trust	
  
	
  
	
  
2nd	
  largest	
  research	
  charity	
  in	
  the	
  world.	
  
~700	
  employees	
  (~50	
  IT)	
  
	
  
	
  
Large	
  scale	
  genomic	
  research.	
  
Sequenced	
  1/3	
  of	
  the	
  human	
  genome	
  (largest	
  
single	
  contributor)	
  
	
  
Intro	
  
Who	
  are	
  all	
  you	
  people?	
  
Why	
  did	
  you	
  come	
  to	
  this	
  
talk?	
  
I’m	
  part	
  of	
  the	
  Informa$cs	
  Support	
  Group	
  
(HPC),	
  also	
  Storage	
  Team	
  
	
  
3	
  years	
  at	
  Sanger,	
  22	
  years	
  in	
  IT	
  
	
  
What	
  I’ve	
  learned	
  from	
  evalua$ng	
  seven	
  
different	
  storage	
  systems	
  in	
  3	
  years	
  
	
  
Credit:Wikimedia	

 Intro	
  This	
  is	
  not	
  the	
  John	
  Constable	
  you	
  are	
  looking	
  for..	
  
Charity	
  (90%	
  funded	
  by	
  Wellcome	
  Trust)	
  
Our	
  users	
  are	
  scien$sts	
  first,	
  bioinforma$cians	
  second,	
  unix	
  geeks	
  third	
  
	
  
Intro	
  
N.B.	
  a	
  kilobase	
  (kb)	
  is	
  a	
  unit	
  of	
  
measurement	
  in	
  molecular	
  	
  
biology	
  equal	
  to	
  1000	
  base	
  
	
  pairs	
  of	
  DNA	
  or	
  RNA.	
  
	
  
One	
  megabase	
  roughly	
  
equiv	
  1	
  MB	
  data.	
  
	
  
Why	
  yes,	
  the	
  data	
  is	
  
doubling	
  every	
  12	
  
months!	
  
	
  
Yes,	
  that’s	
  faster	
  
than	
  18	
  month	
  disk/
transistor	
  doubling	
  
rate	
  
Challenges	
  
	
  
Currently	
  12	
  staff	
  managing	
  21PB	
  
	
  
Our	
  scien$sts	
  predict	
  up	
  to	
  70PB	
  
within	
  the	
  next	
  5	
  years.	
  	
  
	
  
Its	
  all	
  spinning	
  disk	
  
	
  
NFS/CIFS	
  (7PB),	
  Lustre	
  (10PB),	
  	
  
iRODS	
  (4PB)	
  
‘Only’	
  350TB	
  tape	
  
	
  
(gnuplot	
  FTW)	
  iostat	
  info	
  on	
  average	
  queue	
  size	
  
Challenges	
  
Experiences	
  
Credit:	
  Touring	
  Club	
  Suisse	
  
hlps://www.flickr.com/photos/touring_club/	
  
Can’t	
  make	
  it	
  go	
  fast	
  enough!	
  
	
  
Our	
  test	
  or	
  vendors	
  equipment?	
  
	
  
Our	
  equipment	
  or	
  vendors’	
  informa$on?	
  
	
  
What	
  don’t	
  we	
  know?	
  
	
  
Credit:	
  Bogdan	
  
hlps://www.flickr.com/photos/bogdix	
  
Experiences	
  
Scriptable	
  tests	
  are	
  good,	
  but	
  be	
  
careful..	
  
	
  
25k	
  snapshots	
  in	
  one	
  dir	
  
	
  
It	
  didn’t	
  break	
  (mostly)!	
  
	
  
Credit:"Droste" by Jan (Johannes) Musset? - [4] [5]. Licensed under Public domain viaWikimedia Commons - http://commons.wikimedia.org/wiki/File:Droste.jpg#mediaviewer/File:Droste.jpg	

Experiences	
  
No	
  fit	
  between	
  use	
  case	
  	
  
and	
  hardware	
  provided	
  
	
  
Credit:	
  W	
  Smith:	
  	
  
hlps://www.flickr.com/photos/wsmith	
  
Experiences	
  
Avoid	
  beta	
  hardware	
  if	
  possible	
  
(Try	
  not	
  to	
  be	
  their	
  test	
  lab)	
  
	
  
"Beaker	
  (Muppet)"	
  by	
  Disney.com.	
  	
  
Licensed	
  under	
  Fair	
  use	
  of	
  copyrighted	
  material	
  in	
  the	
  context	
  of	
  Beaker	
  (Muppet)	
  via	
  Wikipedia	
  
hlp://en.wikipedia.org/wiki/File:Beaker_(Muppet).jpg#mediaviewer/File:Beaker_(Muppet).jpg	
   Experiences	
  
‘We	
  never	
  thought	
  of	
  that’	
  	
  
or	
  
not	
  able	
  to	
  monitor	
  disk	
  enclosures	
  
	
  
Experiences	
  
Changes	
  in	
  Test	
  Methodology	
  Over	
  Time	
  
"Greek	
  uc	
  delta".	
  Licensed	
  under	
  Crea$ve	
  Commons	
  Alribu$on-­‐Share	
  Alike	
  3.0	
  via	
  Wikimedia	
  Commons	
  
	
  hlp://commons.wikimedia.org/wiki/File:Greek_uc_delta.png#mediaviewer/File:Greek_uc_delta.png	
  
Test	
  frameworks	
  
Credit:	
  Markus	
  Grossalber	
  
hlps://www.flickr.com/photos/tschiae/8417927326	
  
Experiences:	
  ∆	
  
Streaming	
  IO	
  
Random	
  IO	
  
	
  
Big	
  Files!	
  
Small	
  Files!	
  
Zero	
  Length	
  files!	
  
	
  
Mul$threaded	
  
or	
  mul$	
  streams.	
  
	
  
Sosware	
  and	
  procedural	
  
Credit:	
  Gary	
  
hlps://www.flickr.com/photos/38207720@N00/4159421486	
  
Experiences:	
  ∆	
  
Sharing	
  test	
  plan	
  with	
  vendors	
  
Credit:	
  Richard	
  Masoner	
  
hlps://www.flickr.com/photos/bike/5465305209	
  
Experiences:	
  ∆	
  
Nega$ve	
  tes$ng	
  based	
  on	
  	
  
what	
  went	
  wrong	
  last	
  $me	
  	
  
	
  &	
  in	
  produc$on	
  to	
  date	
  
Credit:	
  Kenny	
  Alexander	
  
hlps://www.flickr.com/photos/boostdependent	
  
Experiences:	
  ∆	
  
Monitoring	
  of	
  test	
  systems	
  and	
  clients	
  
Credit:	
  JP	
  Bousquet	
  
hlps://www.flickr.com/photos/33676704@N04	
  
Experiences:	
  ∆	
  
Project	
  management	
  
Credit:	
  donut2D	
  
hlps://www.flickr.com/photos/donut2d/772256834	
   Experiences:	
  ∆	
  
Lessons	
  Learned	
  
Credit:	
  Aun$e	
  P:	
  
	
  hlps://www.flickr.com/photos/aun$ep	
  
Good	
  rela$onship	
  	
  
with	
  vendors	
  crucial	
  
	
  
Meet	
  the	
  engineers/developers	
  	
  
if	
  possible	
  
Credit:	
  hlps://www.flickr.com/photos/annstheclaf	
  
Lessons	
  Learned	
  
Learn new tech
Credit:	
  James	
  Vaughn	
  (photo)	
  
hlps://www.flickr.com/photos/x-­‐ray_delta_one/	
  
Lessons	
  Learned	
  
You may fix your existing infrastructure along the way
Credit:	
  Stéfan	
  
hlps://www.flickr.com/photos/st3f4n/	
  
Lessons	
  Learned	
  
Management is not a dirty word
Credit:	
  Gary	
  Burke	
  
hlps://www.flickr.com/photos/klingon65/5516037577	
  
Lessons	
  Learned	
  
Project	
  management	
  
(This	
  can	
  be	
  just	
  a	
  regular	
  conference	
  call,	
  	
  
and	
  an	
  end	
  date!)	
  
Credit:	
  Carsten	
  Knoch	
  
hlps://www.flickr.com/photos/netsrac/173500383	
  	
   Lessons	
  Learned	
  
Clear	
  internal	
  communica$on	
  
Credit:	
  Darren	
  Hester	
  
hlps://www.flickr.com/photos/grungetextures/4215941524	
  
Lessons	
  Learned	
  
Tips	
  and	
  Tricks	
  
Credit:	
  	
  Rinogmb	
  off	
  for	
  a	
  few	
  day	
  
hlps://www.flickr.com/photos/rinogmb	
  
‘Begin	
  with	
  the	
  end	
  in	
  mind’	
  
	
  
Management	
  
	
  
Monitoring	
  
	
  
Resilience	
  
	
  
Handover	
  
Tips	
  and	
  Tricks	
  
Nega,ve	
  Tes,ng	
  
Credit:	
  HomeSpot	
  HQ	
  
hlps://www.flickr.com/photos/86639298@N02/	
  
Tips	
  and	
  Tricks	
  
Cable	
  Pulling!	
  
	
  
For	
  e.g.	
  
	
  
•  Network	
  (inc.	
  one	
  pair)	
  
•  Storage	
  interconnects	
  	
  
•  (FC,	
  IB,	
  Ethernet)	
  
•  Chassis	
  interconnects	
  
	
  
Credit:	
  Eric	
  
hlps://www.flickr.com/photos/emagic/	
  
Tips	
  and	
  Tricks:	
  Nega:ve	
  Tes:ng	
  
Credit:	
  Lars	
  Plougmann	
  
hlps://www.flickr.com/photos/criminalintent/	
  
Failover	
  
(Pref.	
  while	
  under	
  load)	
  
Tips	
  and	
  Tricks:	
  Nega:ve	
  Tes:ng	
  
Disk	
  	
  
failures	
  
Credit:	
  John	
  Kaye	
  
hlps://www.flickr.com/photos/johnkay/5200871042	
  
Tips	
  and	
  Tricks:	
  Nega:ve	
  Tes:ng	
  
Useful
Software
Performance deep diving: https://github.com/
brendangregg/perf-tools/
VDBench for IO generation on Block and
filesystems http://www.oracle.com/
technetwork/server-storage/vdbench-
downloads-1901681.html
SNMP browsing GUI:
http://ireasoning.com/mibbrowser.shtml
Useful Links:
monitoring driven infrastructure:
https://devblog.timgroup.com/2014/07/30/mdi-
monitoring-driven-infrastructure/
Jason Dixon’s talk on The state of open source
monitoring;
https://speakerdeck.com/obfuscurity/the-state-of-open-
source-monitoring
JISC Project Management (inc templates):
http://www.jisc.ac.uk/fundingopportunities/
projectmanagement/planning.aspx
LiquidPlanner.com
https://blogs.oracle.com/brendan/entry/
performance_testing_the_7000_series2
Books
The Practice of System and Network
Administration by Limoncelli, Hogan, Chalup
Systems Performance by Brendan Gregg
1.  What	
  you	
  will	
  need	
  to	
  report	
  on	
  when	
  the	
  $ckets	
  come	
  in	
  
2.  Plan	
  your	
  nega$ve	
  tes$ng	
  
3.  Share	
  with	
  your	
  vendor	
  -­‐	
  involve	
  them	
  
4.  Test	
  script	
  your	
  frequent	
  management	
  tasks	
  &	
  ques$ons	
  
5.  Graph	
  all	
  the	
  things	
  	
  
Credit:	
  Dani	
  Figueiredo	
  
hlps://www.flickr.com/photos/maxsummers/	
  
Thank	
  You!	
  
Acknowledgement:	
  Mr	
  Woobey,	
  Dr	
  Coates,	
  James	
  Beal,	
  Dr	
  Clapham	
  	
  
	
  
John	
  Constable	
  	
  
Wellcome	
  Trust	
  Sanger	
  Ins$tute	
  
@kript	
  /	
  jc18@sanger.ac.uk	
  
#keepcalmitsonlystorage	
  

More Related Content

Viewers also liked

Mistair Meadows Ooty Floor Plan
Mistair Meadows Ooty Floor PlanMistair Meadows Ooty Floor Plan
Mistair Meadows Ooty Floor Planmistairrealty
 
nazism and the rise of hitler
nazism and the rise of hitlernazism and the rise of hitler
nazism and the rise of hitlerShujath Baig
 
Happy Recipes for Everyone
Happy Recipes for EveryoneHappy Recipes for Everyone
Happy Recipes for Everyonen0m4n
 
artweb
artwebartweb
artwebDetecd
 
Electronics class10
Electronics class10Electronics class10
Electronics class10Anubhav Jain
 
iRODS/DDN User Group 20140908 Sanger
iRODS/DDN User Group 20140908 SangeriRODS/DDN User Group 20140908 Sanger
iRODS/DDN User Group 20140908 SangerJohn Constable
 
Slide share power point
Slide share power pointSlide share power point
Slide share power pointdmantis
 
Process communication
Process communicationProcess communication
Process communicationcjakhila
 
Top event workshop transparant scrummen
Top event   workshop transparant scrummenTop event   workshop transparant scrummen
Top event workshop transparant scrummenEnschede
 
Програма Смаль В.
Програма Смаль В.Програма Смаль В.
Програма Смаль В.mynizhyn
 

Viewers also liked (13)

Mistair Meadows Ooty Floor Plan
Mistair Meadows Ooty Floor PlanMistair Meadows Ooty Floor Plan
Mistair Meadows Ooty Floor Plan
 
nazism and the rise of hitler
nazism and the rise of hitlernazism and the rise of hitler
nazism and the rise of hitler
 
Representation
RepresentationRepresentation
Representation
 
Happy Recipes for Everyone
Happy Recipes for EveryoneHappy Recipes for Everyone
Happy Recipes for Everyone
 
artweb
artwebartweb
artweb
 
Electronics class10
Electronics class10Electronics class10
Electronics class10
 
iRODS/DDN User Group 20140908 Sanger
iRODS/DDN User Group 20140908 SangeriRODS/DDN User Group 20140908 Sanger
iRODS/DDN User Group 20140908 Sanger
 
Slide share power point
Slide share power pointSlide share power point
Slide share power point
 
Process communication
Process communicationProcess communication
Process communication
 
Top event workshop transparant scrummen
Top event   workshop transparant scrummenTop event   workshop transparant scrummen
Top event workshop transparant scrummen
 
Deficit financing
Deficit financingDeficit financing
Deficit financing
 
Sesión comparación numeros matematica
Sesión comparación numeros matematicaSesión comparación numeros matematica
Sesión comparación numeros matematica
 
Програма Смаль В.
Програма Смаль В.Програма Смаль В.
Програма Смаль В.
 

Similar to Testing Storage Systems: Methodology and Common Pitfalls

Next generation genomics: Petascale data in the life sciences
Next generation genomics: Petascale data in the life sciencesNext generation genomics: Petascale data in the life sciences
Next generation genomics: Petascale data in the life sciencesGuy Coates
 
Predicting & influencing with kanban metrics
Predicting & influencing with kanban metricsPredicting & influencing with kanban metrics
Predicting & influencing with kanban metricsAdam Scott
 
Predicting & Influencing with Kanban Metrics
Predicting & Influencing with Kanban MetricsPredicting & Influencing with Kanban Metrics
Predicting & Influencing with Kanban MetricsSteve Purkis
 
Disaster Planning Backup, Backup, Backup
Disaster Planning Backup, Backup, BackupDisaster Planning Backup, Backup, Backup
Disaster Planning Backup, Backup, BackupTechSoup
 
Faster! Faster! Accelerate your business with blazing prototypes
Faster! Faster! Accelerate your business with blazing prototypesFaster! Faster! Accelerate your business with blazing prototypes
Faster! Faster! Accelerate your business with blazing prototypesOSCON Byrum
 
Mark Hughes Annual Seminar Presentation on Open Source
Mark Hughes Annual Seminar Presentation on Open Source Mark Hughes Annual Seminar Presentation on Open Source
Mark Hughes Annual Seminar Presentation on Open Source Tracy Kent
 
PLNOG16: Mix 2-in-1: IPv6 troubleshooting for helpdesks - and – DANE/DNSSE...
PLNOG16: Mix 2-in-1: IPv6 troubleshooting for helpdesks - and –DANE/DNSSE...PLNOG16: Mix 2-in-1: IPv6 troubleshooting for helpdesks - and –DANE/DNSSE...
PLNOG16: Mix 2-in-1: IPv6 troubleshooting for helpdesks - and – DANE/DNSSE...PROIDEA
 
Can’t Find Your 404s?
Can’t Find Your 404s?Can’t Find Your 404s?
Can’t Find Your 404s?Michael Nelson
 
CT Brown - Doing next-gen sequencing analysis in the cloud
CT Brown - Doing next-gen sequencing analysis in the cloudCT Brown - Doing next-gen sequencing analysis in the cloud
CT Brown - Doing next-gen sequencing analysis in the cloudJan Aerts
 
Talk at Bioinformatics Open Source Conference, 2012
Talk at Bioinformatics Open Source Conference, 2012Talk at Bioinformatics Open Source Conference, 2012
Talk at Bioinformatics Open Source Conference, 2012c.titus.brown
 
Data management for TA's
Data management for TA'sData management for TA's
Data management for TA'saaroncollie
 
Managing large and complex data sets
Managing large and complex data setsManaging large and complex data sets
Managing large and complex data setsdata_management
 
Digital Video File Organization
Digital Video File OrganizationDigital Video File Organization
Digital Video File OrganizationKay Gregg
 
Filtering From the Firehose: Real Time Social Media Streaming
Filtering From the Firehose: Real Time Social Media StreamingFiltering From the Firehose: Real Time Social Media Streaming
Filtering From the Firehose: Real Time Social Media StreamingCloud Elements
 
Digital Asset Management
Digital Asset ManagementDigital Asset Management
Digital Asset ManagementBill Swersey
 
How to Use Agile to Move the Earth
How to Use Agile to Move the EarthHow to Use Agile to Move the Earth
How to Use Agile to Move the EarthRyan Martens
 
Five Ways To Do Data Analytics "The Wrong Way"
Five Ways To Do Data Analytics "The Wrong Way"Five Ways To Do Data Analytics "The Wrong Way"
Five Ways To Do Data Analytics "The Wrong Way"Discover Pinterest
 

Similar to Testing Storage Systems: Methodology and Common Pitfalls (20)

Next generation genomics: Petascale data in the life sciences
Next generation genomics: Petascale data in the life sciencesNext generation genomics: Petascale data in the life sciences
Next generation genomics: Petascale data in the life sciences
 
Predicting & influencing with kanban metrics
Predicting & influencing with kanban metricsPredicting & influencing with kanban metrics
Predicting & influencing with kanban metrics
 
Predicting & Influencing with Kanban Metrics
Predicting & Influencing with Kanban MetricsPredicting & Influencing with Kanban Metrics
Predicting & Influencing with Kanban Metrics
 
2014 aus-agta
2014 aus-agta2014 aus-agta
2014 aus-agta
 
Disaster Planning Backup, Backup, Backup
Disaster Planning Backup, Backup, BackupDisaster Planning Backup, Backup, Backup
Disaster Planning Backup, Backup, Backup
 
Faster! Faster! Accelerate your business with blazing prototypes
Faster! Faster! Accelerate your business with blazing prototypesFaster! Faster! Accelerate your business with blazing prototypes
Faster! Faster! Accelerate your business with blazing prototypes
 
2015 illinois-talk
2015 illinois-talk2015 illinois-talk
2015 illinois-talk
 
Mark Hughes Annual Seminar Presentation on Open Source
Mark Hughes Annual Seminar Presentation on Open Source Mark Hughes Annual Seminar Presentation on Open Source
Mark Hughes Annual Seminar Presentation on Open Source
 
PLNOG16: Mix 2-in-1: IPv6 troubleshooting for helpdesks - and – DANE/DNSSE...
PLNOG16: Mix 2-in-1: IPv6 troubleshooting for helpdesks - and –DANE/DNSSE...PLNOG16: Mix 2-in-1: IPv6 troubleshooting for helpdesks - and –DANE/DNSSE...
PLNOG16: Mix 2-in-1: IPv6 troubleshooting for helpdesks - and – DANE/DNSSE...
 
Can’t Find Your 404s?
Can’t Find Your 404s?Can’t Find Your 404s?
Can’t Find Your 404s?
 
CT Brown - Doing next-gen sequencing analysis in the cloud
CT Brown - Doing next-gen sequencing analysis in the cloudCT Brown - Doing next-gen sequencing analysis in the cloud
CT Brown - Doing next-gen sequencing analysis in the cloud
 
Talk at Bioinformatics Open Source Conference, 2012
Talk at Bioinformatics Open Source Conference, 2012Talk at Bioinformatics Open Source Conference, 2012
Talk at Bioinformatics Open Source Conference, 2012
 
Data management for TA's
Data management for TA'sData management for TA's
Data management for TA's
 
Managing large and complex data sets
Managing large and complex data setsManaging large and complex data sets
Managing large and complex data sets
 
Digital Video File Organization
Digital Video File OrganizationDigital Video File Organization
Digital Video File Organization
 
Filtering From the Firehose: Real Time Social Media Streaming
Filtering From the Firehose: Real Time Social Media StreamingFiltering From the Firehose: Real Time Social Media Streaming
Filtering From the Firehose: Real Time Social Media Streaming
 
Digital Asset Management
Digital Asset ManagementDigital Asset Management
Digital Asset Management
 
How to Use Agile to Move the Earth
How to Use Agile to Move the EarthHow to Use Agile to Move the Earth
How to Use Agile to Move the Earth
 
Five Ways To Do Data Analytics "The Wrong Way"
Five Ways To Do Data Analytics "The Wrong Way"Five Ways To Do Data Analytics "The Wrong Way"
Five Ways To Do Data Analytics "The Wrong Way"
 
2015 genome-center
2015 genome-center2015 genome-center
2015 genome-center
 

Recently uploaded

How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Servicegiselly40
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Scriptwesley chun
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...Neo4j
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking MenDelhi Call girls
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptxHampshireHUG
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationRadu Cotescu
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfsudhanshuwaghmare1
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfEnterprise Knowledge
 
Advantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your BusinessAdvantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your BusinessPixlogix Infotech
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slidespraypatel2
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking MenDelhi Call girls
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUK Journal
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024Rafal Los
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)wesley chun
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘RTylerCroy
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonetsnaman860154
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityPrincipled Technologies
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...Martijn de Jong
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)Gabriella Davis
 

Recently uploaded (20)

How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Service
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
 
Advantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your BusinessAdvantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your Business
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slides
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonets
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivity
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 

Testing Storage Systems: Methodology and Common Pitfalls

  • 1. Tes$ng  Storage  Systems:     Methodology  &  Common  Pi8alls   John  Constable     Wellcome  Trust  Sanger  Ins$tute   @kript  /  jc18@sanger.ac.uk   #lisa14storage    
  • 2. What  I  wish  I  knew  when  I  started     tes$ng  storage  systems  three  years  ago   Intro   Covering;   intro,  challenges,  experiences,  lessons  learned,  $ps  and  tricks,  denouement    
  • 3. The  Sanger  Ins,tute       Funded  by  Wellcome  Trust       2nd  largest  research  charity  in  the  world.   ~700  employees  (~50  IT)       Large  scale  genomic  research.   Sequenced  1/3  of  the  human  genome  (largest   single  contributor)     Intro  
  • 4. Who  are  all  you  people?   Why  did  you  come  to  this   talk?   I’m  part  of  the  Informa$cs  Support  Group   (HPC),  also  Storage  Team     3  years  at  Sanger,  22  years  in  IT     What  I’ve  learned  from  evalua$ng  seven   different  storage  systems  in  3  years     Credit:Wikimedia Intro  This  is  not  the  John  Constable  you  are  looking  for..  
  • 5. Charity  (90%  funded  by  Wellcome  Trust)   Our  users  are  scien$sts  first,  bioinforma$cians  second,  unix  geeks  third     Intro   N.B.  a  kilobase  (kb)  is  a  unit  of   measurement  in  molecular     biology  equal  to  1000  base    pairs  of  DNA  or  RNA.     One  megabase  roughly   equiv  1  MB  data.     Why  yes,  the  data  is   doubling  every  12   months!     Yes,  that’s  faster   than  18  month  disk/ transistor  doubling   rate  
  • 6. Challenges     Currently  12  staff  managing  21PB     Our  scien$sts  predict  up  to  70PB   within  the  next  5  years.       Its  all  spinning  disk     NFS/CIFS  (7PB),  Lustre  (10PB),     iRODS  (4PB)   ‘Only’  350TB  tape    
  • 7. (gnuplot  FTW)  iostat  info  on  average  queue  size   Challenges  
  • 8. Experiences   Credit:  Touring  Club  Suisse   hlps://www.flickr.com/photos/touring_club/  
  • 9. Can’t  make  it  go  fast  enough!     Our  test  or  vendors  equipment?     Our  equipment  or  vendors’  informa$on?     What  don’t  we  know?     Credit:  Bogdan   hlps://www.flickr.com/photos/bogdix   Experiences  
  • 10. Scriptable  tests  are  good,  but  be   careful..     25k  snapshots  in  one  dir     It  didn’t  break  (mostly)!     Credit:"Droste" by Jan (Johannes) Musset? - [4] [5]. Licensed under Public domain viaWikimedia Commons - http://commons.wikimedia.org/wiki/File:Droste.jpg#mediaviewer/File:Droste.jpg Experiences  
  • 11. No  fit  between  use  case     and  hardware  provided     Credit:  W  Smith:     hlps://www.flickr.com/photos/wsmith   Experiences  
  • 12. Avoid  beta  hardware  if  possible   (Try  not  to  be  their  test  lab)     "Beaker  (Muppet)"  by  Disney.com.     Licensed  under  Fair  use  of  copyrighted  material  in  the  context  of  Beaker  (Muppet)  via  Wikipedia   hlp://en.wikipedia.org/wiki/File:Beaker_(Muppet).jpg#mediaviewer/File:Beaker_(Muppet).jpg   Experiences  
  • 13. ‘We  never  thought  of  that’     or   not  able  to  monitor  disk  enclosures     Experiences  
  • 14. Changes  in  Test  Methodology  Over  Time   "Greek  uc  delta".  Licensed  under  Crea$ve  Commons  Alribu$on-­‐Share  Alike  3.0  via  Wikimedia  Commons    hlp://commons.wikimedia.org/wiki/File:Greek_uc_delta.png#mediaviewer/File:Greek_uc_delta.png  
  • 15. Test  frameworks   Credit:  Markus  Grossalber   hlps://www.flickr.com/photos/tschiae/8417927326   Experiences:  ∆   Streaming  IO   Random  IO     Big  Files!   Small  Files!   Zero  Length  files!     Mul$threaded   or  mul$  streams.    
  • 16. Sosware  and  procedural   Credit:  Gary   hlps://www.flickr.com/photos/38207720@N00/4159421486   Experiences:  ∆  
  • 17. Sharing  test  plan  with  vendors   Credit:  Richard  Masoner   hlps://www.flickr.com/photos/bike/5465305209   Experiences:  ∆  
  • 18. Nega$ve  tes$ng  based  on     what  went  wrong  last  $me      &  in  produc$on  to  date   Credit:  Kenny  Alexander   hlps://www.flickr.com/photos/boostdependent   Experiences:  ∆  
  • 19. Monitoring  of  test  systems  and  clients   Credit:  JP  Bousquet   hlps://www.flickr.com/photos/33676704@N04   Experiences:  ∆  
  • 20. Project  management   Credit:  donut2D   hlps://www.flickr.com/photos/donut2d/772256834   Experiences:  ∆  
  • 21. Lessons  Learned   Credit:  Aun$e  P:    hlps://www.flickr.com/photos/aun$ep  
  • 22. Good  rela$onship     with  vendors  crucial     Meet  the  engineers/developers     if  possible   Credit:  hlps://www.flickr.com/photos/annstheclaf   Lessons  Learned  
  • 23. Learn new tech Credit:  James  Vaughn  (photo)   hlps://www.flickr.com/photos/x-­‐ray_delta_one/   Lessons  Learned  
  • 24. You may fix your existing infrastructure along the way Credit:  Stéfan   hlps://www.flickr.com/photos/st3f4n/   Lessons  Learned  
  • 25. Management is not a dirty word Credit:  Gary  Burke   hlps://www.flickr.com/photos/klingon65/5516037577   Lessons  Learned  
  • 26. Project  management   (This  can  be  just  a  regular  conference  call,     and  an  end  date!)   Credit:  Carsten  Knoch   hlps://www.flickr.com/photos/netsrac/173500383     Lessons  Learned  
  • 27. Clear  internal  communica$on   Credit:  Darren  Hester   hlps://www.flickr.com/photos/grungetextures/4215941524   Lessons  Learned  
  • 28. Tips  and  Tricks   Credit:    Rinogmb  off  for  a  few  day   hlps://www.flickr.com/photos/rinogmb  
  • 29. ‘Begin  with  the  end  in  mind’     Management     Monitoring     Resilience     Handover   Tips  and  Tricks  
  • 30. Nega,ve  Tes,ng   Credit:  HomeSpot  HQ   hlps://www.flickr.com/photos/86639298@N02/   Tips  and  Tricks  
  • 31. Cable  Pulling!     For  e.g.     •  Network  (inc.  one  pair)   •  Storage  interconnects     •  (FC,  IB,  Ethernet)   •  Chassis  interconnects     Credit:  Eric   hlps://www.flickr.com/photos/emagic/   Tips  and  Tricks:  Nega:ve  Tes:ng  
  • 32. Credit:  Lars  Plougmann   hlps://www.flickr.com/photos/criminalintent/   Failover   (Pref.  while  under  load)   Tips  and  Tricks:  Nega:ve  Tes:ng  
  • 33. Disk     failures   Credit:  John  Kaye   hlps://www.flickr.com/photos/johnkay/5200871042   Tips  and  Tricks:  Nega:ve  Tes:ng  
  • 34. Useful Software Performance deep diving: https://github.com/ brendangregg/perf-tools/ VDBench for IO generation on Block and filesystems http://www.oracle.com/ technetwork/server-storage/vdbench- downloads-1901681.html SNMP browsing GUI: http://ireasoning.com/mibbrowser.shtml Useful Links: monitoring driven infrastructure: https://devblog.timgroup.com/2014/07/30/mdi- monitoring-driven-infrastructure/ Jason Dixon’s talk on The state of open source monitoring; https://speakerdeck.com/obfuscurity/the-state-of-open- source-monitoring JISC Project Management (inc templates): http://www.jisc.ac.uk/fundingopportunities/ projectmanagement/planning.aspx LiquidPlanner.com https://blogs.oracle.com/brendan/entry/ performance_testing_the_7000_series2 Books The Practice of System and Network Administration by Limoncelli, Hogan, Chalup Systems Performance by Brendan Gregg
  • 35. 1.  What  you  will  need  to  report  on  when  the  $ckets  come  in   2.  Plan  your  nega$ve  tes$ng   3.  Share  with  your  vendor  -­‐  involve  them   4.  Test  script  your  frequent  management  tasks  &  ques$ons   5.  Graph  all  the  things     Credit:  Dani  Figueiredo   hlps://www.flickr.com/photos/maxsummers/  
  • 36. Thank  You!   Acknowledgement:  Mr  Woobey,  Dr  Coates,  James  Beal,  Dr  Clapham       John  Constable     Wellcome  Trust  Sanger  Ins$tute   @kript  /  jc18@sanger.ac.uk   #keepcalmitsonlystorage