“Determining the Human Gut Microbiome
using Genome Sequencing and Dell’s Cloud Computing”
Dell Webinar
April 29, 2014
Dr. Larry Smarr
Director, California Institute for Telecommunications and Information Technology
Harry E. Gruber Professor,
Dept. of Computer Science and Engineering
Jacobs School of Engineering, UCSD
http://lsmarr.calit2.net
The Human Microbiome Ecology is Critical
to Health and Disease
Inclusion of the Microbiome
Will Radically Change Medicine
99% of Your
DNA Genes
Are in Microbe Cells
Not Human Cells
Your Body Has 10 Times
As Many Microbe Cells As Human Cells
To Map Out the Dynamics of My Microbiome Ecology
I Partnered with the J. Craig Venter Institute
• JCVI Did Metagenomic
Sequencing on Seven of
My Stool Samples
Over 1.5 Years
• Sequencing on
Illumina HiSeq 2000
– Generates 100bp Reads
• JCVI Lab Manager,
Genomic Medicine
– Manolito Torralba
• IRB PI Karen Nelson
– President JCVI
Illumina HiSeq 2000 at JCVI
Manolito Torralba, JCVI Karen Nelson, JCVI
We Downloaded Additional Phenotypes from NIH’s
Human Microbiome Program For Comparative Analysis
“Healthy” Individuals
250 Subjects, 1 Point in Time
“Disease” Patients (Inflammatory Bowel Disease)
5 Ileal Crohn’s Patients, 3 Points in Time
2 Ulcerative Colitis Patients, 6 Points in Time
Larry Smarr
7 Points in Time Over 1.5 Years
Download Raw Reads
~100M Per Person
Total of ~28 Billion Reads
Or 2.8 Trillion DNA Bases
Source: Jerry Sheehan, Calit2
Weizhong Li, Sitao Wu, CRBS, UCSD
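
The read totals on this slide can be cross-checked with a little arithmetic. The sketch below is only an illustration: it assumes the "~100M" figure applies to each sequenced sample, that the 250 single-time-point subjects are the healthy cohort, and that every read is 100 bp long (the HiSeq 2000 read length quoted earlier). The variable names are hypothetical.

```python
# Cohorts as listed on the slide: (number of subjects, time points per subject).
# Grouping the 250 subjects under "healthy" is an assumption about the slide layout.
cohorts = {
    "healthy": (250, 1),
    "ileal_crohns": (5, 3),
    "ulcerative_colitis": (2, 6),
    "larry_smarr": (1, 7),
}

READS_PER_SAMPLE = 100_000_000  # assumption: "~100M" is per sequenced sample
READ_LENGTH_BP = 100            # 100 bp reads from the Illumina HiSeq 2000

# One sample per subject per time point.
total_samples = sum(subjects * points for subjects, points in cohorts.values())
total_reads = total_samples * READS_PER_SAMPLE
total_bases = total_reads * READ_LENGTH_BP

print(f"samples: {total_samples}")
print(f"reads:   {total_reads / 1e9:.1f} billion")
print(f"bases:   {total_bases / 1e12:.2f} trillion")
```

Under these assumptions the estimate comes to 284 samples and about 28.4 billion reads, in line with the "~28 Billion Reads" and "2.8 Trillion DNA Bases" stated on the slide.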
We Created a Reference Database
Of Known Gut Genomes
• NCBI April 2013
– 2471 Complete + 5543 Draft Bacteria & Archaea Genomes
– 2399 Complete Virus Genomes
– 26 Complete Fungi Genomes
– 309 HMP Eukaryote Reference Genomes
• Total 10,741 genomes, ~30 GB of sequences
Now to Align Our 28 Billion Reads
Against the Reference Database
Source: Weizhong Li, Sitao Wu, CRBS, UCSD
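
As a quick sanity check on the reference database figures, the per-category genome counts can be summed and compared with the stated total. This is a minimal sketch using only the numbers on the slide; the dictionary keys are hypothetical labels, and "~30 GB" is treated as 30,000 MB for a rough average.

```python
# Genome counts per category, copied from the slide (NCBI, April 2013).
genome_counts = {
    "bacteria_archaea_complete": 2471,
    "bacteria_archaea_draft": 5543,
    "virus_complete": 2399,
    "fungi_complete": 26,
    "hmp_eukaryote_reference": 309,
}

STATED_TOTAL = 10_741        # total quoted on the slide
DATABASE_SIZE_MB = 30_000    # "~30 GB", taken as 30,000 MB (approximation)

listed_total = sum(genome_counts.values())
difference = listed_total - STATED_TOTAL
avg_genome_mb = DATABASE_SIZE_MB / STATED_TOTAL

print(f"sum of listed categories: {listed_total}")
print(f"stated total:             {STATED_TOTAL}")
print(f"difference:               {difference}")
print(f"average size per genome:  {avg_genome_mb:.1f} MB")
```

The listed categories add up to 10,748, seven more than the 10,741 quoted; the slide does not say why the two differ. Either way, the average works out to roughly 2.8 MB of sequence per genome.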
Computational NextGen Sequencing Pipeline:
From Sequence to Taxonomy and Function
PI: Weizhong Li, CRBS, UCSD
NIH R01HG005978 (2010-2013, $1.1M)
We Used Dell’s Cloud (Sanger) to Analyze
All of Our Human Gut Microbiomes
• Dell’s Sanger Cluster
– 32 Nodes, 512 Cores
– 48GB RAM per Node
– 50GB SSD Local Drive, 390TB Lustre File System
• We Processed the Taxonomic Relative Abundance
– Used ~35,000 Core-Hours on Dell’s Sanger
– With 30 TB of Data
• Full Processing to Function (COGs, KEGGs)
– Would Require ~1-2 Million Core-Hours
Source: Weizhong Li, UCSD
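
To put the core-hour figures in perspective, they can be converted into wall-clock time on the cluster described above. The sketch below assumes perfect parallel scaling across all 512 cores with no queueing or I/O overhead, which is an idealization and not something the slide claims; the function name is hypothetical.

```python
NODES = 32
CORES = 512
RAM_PER_NODE_GB = 48

cores_per_node = CORES // NODES                   # cores on each node
ram_per_core_gb = RAM_PER_NODE_GB / cores_per_node


def wall_clock_hours(core_hours: float, cores: int = CORES) -> float:
    """Ideal wall-clock time if the work spreads evenly over all cores."""
    return core_hours / cores


# Taxonomic relative abundance run: ~35,000 core-hours.
taxonomy_hours = wall_clock_hours(35_000)

# Full functional processing (COGs, KEGGs): ~1-2 million core-hours.
function_days_low = wall_clock_hours(1_000_000) / 24
function_days_high = wall_clock_hours(2_000_000) / 24

print(f"cores per node: {cores_per_node}, RAM per core: {ram_per_core_gb:.0f} GB")
print(f"taxonomy run:   ~{taxonomy_hours:.1f} hours of ideal wall-clock time")
print(f"function run:   ~{function_days_low:.0f} to {function_days_high:.0f} days")
```

Even under these idealized assumptions, the taxonomy run occupies the whole cluster for nearly three days, while full functional annotation would take on the order of three to five months.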
Dell Cloud Results Are Leading
Toward Microbiome Disease Diagnosis
UC 100x Healthy
CD 100x Healthy
We Produced Similar Results for ~2500 Microbial Species
