Customer Success Story                                             National Institutes of Health              L INST      ...
Customer Success Story: National Institutes of HealthThe ResultFlexibility and Efficiency Advances Discovery.Technology ad...
Upcoming SlideShare
Loading in …5
×

National Institutes of Health Maximize Computing Resources with Panasas

237 views

Published on

The National Center for Biotechnology Information (NCBI), a division of the National Library of Medicine (NLM) at the National Institutes of Health (NIH), serves as a national resource for molecular biology information serving research groups from around the world. Here’s how Panasas works with them to deliver 5X performance and affordable scalability for fast growing archives.

Published in: Technology
0 Comments
0 Likes
Statistics
Notes
  • Be the first to comment

  • Be the first to like this

No Downloads
Views
Total views
237
On SlideShare
0
From Embeds
0
Number of Embeds
1
Actions
Shares
0
Downloads
1
Comments
0
Likes
0
Embeds 0
No embeds

No notes for slide

National Institutes of Health Maximize Computing Resources with Panasas

  1. 1. Customer Success Story National Institutes of Health L INST National Institutes of Health NA The National Center for Biotechnology Information (NCBI), a division of the IT NATIO UTES National Library of Medicine (NLM) at the National Institutes of Health (NIH), F serves as a national resource for molecular biology information serving O H E A LT H research groups from around the world. Established in 1988, NCBI develops new information technologies to aid in the understanding of fundamental molecular and genetic processes that control health and disease. NCBI creates public databases, conducts research in computational biology, develops software tools for analyzing genomic data, and disseminates biomedical information. Some 450 people—ranging from NCBI researchers and staff scientists to programmers, curators, and indexers—generate, store, and access NCBI databases.SUMMARY The Challenge Designed specifically to accelerate theIndustry: Researchers at NCBI depend on high- performance of applications deployedLife Sciences/Government performance compute clusters to run on Linux compute clusters, the Panasas complex analyses of genotyping and storage cluster effectively eliminated theTHE CHALLENGE sequencing data. The existing storage research-impacting I/O bottlenecks.Meet demands of researchers from architecture did not effectively scalearound the globe accessing the NCBI to support such efforts as the 1000 PAS storage now provides scalablepublic database to conduct genome Genomes Project, an ambitious endeavor performance and capacity to multipleresearch. Eliminate I/O bottlenecks andmaximize computing resources for public to sequence the genomes of at least internal production systems (both Linux-databases, including an estimated 1.5 1,000 people from around the world. and Windows-based platforms), includingPB of genetic information for the 1000 NCBI’s 1800-core Dell PowerEdge cluster The project, creating the most detailedGenomes Project. and medically useful picture to date of that provides computing resources to some human genetic variation, is expected to 80 applications used by ten NCBI researchTHE SOLUTION groups. Panasas Storage supports much generate more than 1.5 PB of geneticPAS Storage system with the PanFSTM information. NCBI will be required to of the daily computation that generatesparallel file system, 1800-core DellPowerEdge Cluster, Cisco 6509 archive and provide timely investigator the data for such high-visibility servicesNetwork Switch access to as much as 3 TB of new as NCBI’s PubMed resource that brings genome data arriving weekly from each of together more than 18 million citations fromTHE RESULT the six institutes participating in the 1000 MEDLINE and other life science journals Genomes Project. To accommodate the for biomedical articles. • 5X application performance improvement expected high demand for data access • Timelier database updates with NCBI requires a storage solution that is Most recently, NCBI implemented a PAS faster time-to-results reliable, manageable, and affordable. Storage system that provides economical • High performance irrespective of second-tier storage for the high-density access patterns/dataset size The Solution data requirements of the 1000 Genomes • Affordable scalability for fast- NCBI selected Panasas Storage for the Project. The PAS solution also provides growing archives Center’s Dell PowerEdge compute farm. storage resources to projects such as • Administrative efficiencies across The decision was based in part on testing the NCBI Short Read Archive (SRA), a primary and secondary storage results that indicated the Panasas Storage central repository for short read sequencing solution delivers a significant performance data, and the dbGaP public repository of improvement over existing installed storage. genotypes and phenotypes. 1-888-panasas www.panasas.com
  2. 2. Customer Success Story: National Institutes of HealthThe ResultFlexibility and Efficiency Advances Discovery.Technology advances that have brought down the cost “Technology advances that haveof sequencing—from billions to millions per project and brought down the cost of sequencingfreefalling rapidly to the industry’s goal of $10K or even have contributed to an explosion ofas low as $1K for a single run—have also contributed to an data...Panasas helps NCBI keep paceexplosion of data. Taking advantage of the PAS solution for with the volume and complexity ofreceipt and storage of genome and other project data helps incoming information in aNCBI keep pace with the volume and complexity of incominginformation in a cost-effective manner. cost-effective manner.”Performance, Scalability for Fast-Growing ArchivesPanasas solutions help address the research community’sstorage needs in spite of a very high unpredictability factor.Whether it’s unexpected demand for particular researchfindings, storage requirements that mushroom from 150 TBto 1.5 PB almost overnight, or datasets that vary from 3 TB to30 TB in size, the needs of the scientific community dictatestorage flexibility and maximum uptime. In addition to theinherent administrative efficiencies of a common architecture,the Panasas unified storage platform for Tier 1 and secondarystorage applications gives flexibility to support a scientificuser community striving for discoveries that directly impactunderstanding of genetics and its role in health and diseaseanalysis. NCBI’s mission is to help researchers better leverageand build on the work of the larger biotechnology community,avoiding both the cost and the time penalities of reworkingdata.About PanasasPanasas, Inc., the leader in high-performance scale-out NAS storage solutions, enables enterprise customers to rapidly solvecomplex computing problems, speed innovation and bring new products to market faster. All Panasas solutions leverage thepatented PanFS™ storage operating system to deliver exceptional performance, scalability and manageability. PW-10-21000 | Phone: 1-888-PANASAS | www.panasas.com © 2010 Panasas Incorporated. All rights reserved. Panasas is a trademark of Panasas, Inc. in the United States and other countries.

×