SlideShare a Scribd company logo
1 of 15
S
Creating a Drosophila
Metabolomics
Database
Andrew Schwitzgebel and William Freeman
Dr. Laura Reed
Vishal Oza
S Metabolite : Metabolome :: Gene : Genome
S Metabolites are the intermediate products of metabolic
reactions catalyzed by various enzymes that naturally
occur within cells1
2
What is Metabolomics?
2/23/2016 Schwitzgebel & Freeman, CBH
Metabolic Syndrome
S MetS
S Suite of symptoms
S Obesity
S Elevated blood lipids
S Diabetes
S Complex interaction between
genetics and lifestyle
2/23/2016 3
Age-Adjusted Rate per 100 of Civilian, Noninstitutionalized Population with
Diagnosed Diabetes, by Race and Sex, United States, 1980–2011
Schwitzgebel & Freeman, CBH
Drosophila as a Human
Analog
S Complex genome
S Cryptic genetic variation8
S ~15,000 genes vs. ~23,000 in humans
S High throughput
S Fast results
S Observe population trends over time
S Accessibility to metabolic data
2/23/2016 4 Schwitzgebel & Freeman, CBH
Database Creation
S Goals for FlyMetLib
S Contain basic molecular data
S Reference other chemical identifiers
S HMDBID, DrugBank, ChemSpider, PubChem, etc.
S ~1,500 PubChem SIDs
S ~100 English synonyms
S Link to other bioinformatic databases
2/23/2016 5 Schwitzgebel & Freeman, CBH
2/23/2016 6
Chemical IDs Example
Schwitzgebel & Freeman, CBH
PubChem SID 1
PubChem SID 2
PubChem SID 3
PubChem SID 4
SLING ID 1
SLING ID 2
Chemical ID Conversion
Procedure
InChiKey read from a .csv file
InChiKey translated into a URL of a chemical translation
service
Other identifiers retrieved using find and string methods
Identifiers stored in a list variable to be added to the
database
Sample (Ethanol)
LFQSCWFLJHTTHZ-UHFFFAOYSA-N
http://cts.fiehnlab.ucdavis.edu/service/compound/LFQSCW
FLJHTTHZ-UHFFFAOYSA-N
“C2H6O”
MolFormList = [“C2H6O”]
2/23/2016 Schwitzgebel & Freeman, CBH7
Structuring the Database
S Creating a unique ID
S FlyMetID
S 5 hexadecimal digits (Ex: FLY00001)
S Allows greater number of unique IDs with fewer digits
S InChiKey for ethanol:
S LFQSCWFLJHTTHZ-UHFFFAOYSA-N
S More human readable
2/23/2016 8 Schwitzgebel & Freeman, CBH
Structuring the Database
One to One
S Chemical data table (Master)
S Molecular weight
S Molecular formula
S Melting point
S Certain IDs
S Hotlinks
S Link to other databases
FlyMetID
Molecular
Weight
Molecular
Formula
FLY00001 16.042 CH4
FLY00002 30.068 C2H6
FLY00003 44.094 C3H8
FLY00004 58.120 C4H10
2/23/2016 9 Schwitzgebel & Freeman, CBH
Structuring the Database
One to Many
S Synonyms
S IUPAC names
S Certain IDs
S PubChem, CAS,
ChemSpider
S Citations
PubChem SIDs
FlyMetID SID 1 SID 2 SID 3
FLY00001 48416772
16385457
6
24112948
1
FLY00002
12728063
0
FLY00003
12885427
1
53776982
FLY00004
12729869
9
12729867
4
99205937
2/23/2016 Schwitzgebel & Freeman, CBH10
Significance
S Researchers can access and contribute to the database
S Open source code
S One central place for data
S Analogous to the Human Metabolome Database (HMDB)
S Cuts down time cost for research
2/23/2016 11 Schwitzgebel & Freeman, CBH
Future Plans
S Making it publically accessible
S Creating a user interface
S Linking to FlyBase6
S Database for Drosophila genes
S and genomes
2/23/2016 12 Schwitzgebel & Freeman, CBH
Questions?
Andrew Schwitzgebel
Department of Biological Sciences
Computer Based Honors Program
ajschwitzgebel@crimson.ua.edu
William Freeman
Department of Chemical Engineering
Computer Based Honors Program
wjfreeman1@crimson.ua.edu
Dr. Laura Reed
Department of Biological Sciences
lreed1@ua.edu
2/23/2016 13 Schwitzgebel & Freeman, CBH
Proposed Metabolite
Identification Metrics
S Ex: Metabolite A
S Identified using: 1H 2D NMR,
Tandem Mass Spectrum, IR
Absorbance Spectrum
S Matched to an authentic reference
compound
S (3.0 + 1.5 + 0.5) (2) = 10
S Sumner suggests a minimum score of
59
2/23/2016 Schwitzgebel & Freeman, CBH14
Works Cited
1. Meštrović, Tomislav. "What Are Metabolites?" News-Medical.net. N.p., 01 June 2010. Web. 06 Nov. 2015.
2. Glycolysis. Digital image. Northwest Metabolomics Research Center. University of Washington, n.d. Web. 6 Nov. 2015.
3. Zhang, Rong, Tong Zhang, Dominika Korzekwa, Shadi Al-Johani, Julian A.T. Dow, and David G. Watson. "A Comparison of the
Metabolome of Male and Female Drosophila Melanogaster.” ResearchGate. Strathclyde Institute of Pharmaceutical and Biomedical
Sciences,, 2014. Web. 06 Nov. 2015.
4. Pedrosa, Diego, Gert Wohlgemuth, Sajjan Mehta, and Oliver Fiehn. "Chemical Translation Service." Chemical Translation Service. U.C.
Davis, 2011. Web. 06 Nov. 2015.
5. "Age-Adjusted Rate per 100 of Civilian, Noninstitutionalized Population with Diagnosed Diabetes, by Race and Sex, United States, 1980–
2011." Centers for Disease Control and Prevention. Centers for Disease Control and Prevention, 05 Sept. 2014. Web. 08 Nov. 2015.
6. "A Database of Drosophila Genes and Genomes." FlyBase. N.p., n.d. Web. 08 Nov. 2015.
7. "What Is Metabolic Syndrome?" National Heart, Lung and Blood Institute. National Institutes of Health, 06 Nov. 2015. Web. 08 Nov. 2015.
8. Gibson, Greg, and Ian Dworkin. "Uncovering Cryptic Genetic Variation." Genetics. Nature.com, Sept. 2004. Web. 8 Nov. 2015.
9. Sumner, LW. "Proposed Quantitative and Alphanumeric Metabolite Identification Metrics." Springer (2014): n. pag. Web. 1 Dec. 2015.
2/23/2016 15 Schwitzgebel & Freeman, CBH

More Related Content

Viewers also liked

Viewers also liked (16)

Final_Greenfield Business Model Canvas Presentation
Final_Greenfield Business Model Canvas PresentationFinal_Greenfield Business Model Canvas Presentation
Final_Greenfield Business Model Canvas Presentation
 
Stay_Authentic
Stay_AuthenticStay_Authentic
Stay_Authentic
 
Taiyab CV
Taiyab CVTaiyab CV
Taiyab CV
 
Huyton 27 ibrochure v2
Huyton 27 ibrochure v2Huyton 27 ibrochure v2
Huyton 27 ibrochure v2
 
SRA report 1
SRA report 1SRA report 1
SRA report 1
 
Dike 14-Doan Brook IBA Avian Survey Project
Dike 14-Doan Brook IBA Avian Survey ProjectDike 14-Doan Brook IBA Avian Survey Project
Dike 14-Doan Brook IBA Avian Survey Project
 
RESUME_NAVEEN
RESUME_NAVEENRESUME_NAVEEN
RESUME_NAVEEN
 
LM6 Runcorn Whitehouse85 3pp
LM6 Runcorn Whitehouse85 3ppLM6 Runcorn Whitehouse85 3pp
LM6 Runcorn Whitehouse85 3pp
 
Irlam Spectra57 ibrochure
Irlam Spectra57 ibrochure Irlam Spectra57 ibrochure
Irlam Spectra57 ibrochure
 
Dogs for çuuem
Dogs for çuuemDogs for çuuem
Dogs for çuuem
 
Portfolio_WeaverMichael
Portfolio_WeaverMichaelPortfolio_WeaverMichael
Portfolio_WeaverMichael
 
Swimming for çuuem
Swimming for çuuemSwimming for çuuem
Swimming for çuuem
 
Antigone prod
Antigone prodAntigone prod
Antigone prod
 
LM6 December 2016 Update
LM6 December 2016 UpdateLM6 December 2016 Update
LM6 December 2016 Update
 
Bla bla for çuuem
Bla bla for çuuemBla bla for çuuem
Bla bla for çuuem
 
LM6 December Update
LM6 December UpdateLM6 December Update
LM6 December Update
 

Similar to CBHLive2015

The Translational Medicine Ontology: Driving personalized medicine by br...
The Translational Medicine Ontology: Driving personalized      medicine by br...The Translational Medicine Ontology: Driving personalized      medicine by br...
The Translational Medicine Ontology: Driving personalized medicine by br...Michel Dumontier
 
PERSPECTIVEUnderstanding Nutritional Epidemiology and Its.docx
PERSPECTIVEUnderstanding Nutritional Epidemiology and Its.docxPERSPECTIVEUnderstanding Nutritional Epidemiology and Its.docx
PERSPECTIVEUnderstanding Nutritional Epidemiology and Its.docxmattjtoni51554
 
CSU_Press_release-Diabetes
CSU_Press_release-DiabetesCSU_Press_release-Diabetes
CSU_Press_release-DiabetesAndrey Ptitsyn
 
Update on genetics and molecular biology.pdf
Update on genetics and molecular biology.pdfUpdate on genetics and molecular biology.pdf
Update on genetics and molecular biology.pdfssuser5b0f5e
 
dkNET Webinar: Illuminating The Druggable Genome With Pharos 10/23/2020
dkNET Webinar: Illuminating The Druggable Genome With Pharos 10/23/2020dkNET Webinar: Illuminating The Druggable Genome With Pharos 10/23/2020
dkNET Webinar: Illuminating The Druggable Genome With Pharos 10/23/2020dkNET
 
Search engine for E NEU network science 080817
Search engine for E NEU network science 080817Search engine for E NEU network science 080817
Search engine for E NEU network science 080817Chirag Patel
 
RunningHead PICOT Question1RunningHead PICOT Question7.docx
RunningHead PICOT Question1RunningHead PICOT Question7.docxRunningHead PICOT Question1RunningHead PICOT Question7.docx
RunningHead PICOT Question1RunningHead PICOT Question7.docxrtodd599
 
Anil Malhotra.pdf.docx
Anil Malhotra.pdf.docxAnil Malhotra.pdf.docx
Anil Malhotra.pdf.docxAnil Malhotra
 
Pathways to wellbeing
Pathways to wellbeingPathways to wellbeing
Pathways to wellbeingAndrew Kemp
 
Life course trajectories of systolic blood pressure using longitudinal data f...
Life course trajectories of systolic blood pressure using longitudinal data f...Life course trajectories of systolic blood pressure using longitudinal data f...
Life course trajectories of systolic blood pressure using longitudinal data f...Diyan Yunanto Setyaji
 
The AIDS Linked to the IntraVenous Experience (ALIVE) Study
The AIDS Linked to the IntraVenous Experience (ALIVE) StudyThe AIDS Linked to the IntraVenous Experience (ALIVE) Study
The AIDS Linked to the IntraVenous Experience (ALIVE) StudyHopkinsCFAR
 
Biomedical Resources Science Teachers
Biomedical Resources Science TeachersBiomedical Resources Science Teachers
Biomedical Resources Science TeachersMary Markland
 
Biomedical Informatics 706: Precision Medicine with exposures
Biomedical Informatics 706: Precision Medicine with exposuresBiomedical Informatics 706: Precision Medicine with exposures
Biomedical Informatics 706: Precision Medicine with exposuresChirag Patel
 
"How Scientific Wellness will Drive The Future of Health" - Nathan Price (Pro...
"How Scientific Wellness will Drive The Future of Health" - Nathan Price (Pro..."How Scientific Wellness will Drive The Future of Health" - Nathan Price (Pro...
"How Scientific Wellness will Drive The Future of Health" - Nathan Price (Pro...Hyper Wellbeing
 

Similar to CBHLive2015 (20)

Hyperlipidemia - etiology epidemiology clinical features
Hyperlipidemia - etiology epidemiology clinical featuresHyperlipidemia - etiology epidemiology clinical features
Hyperlipidemia - etiology epidemiology clinical features
 
The Translational Medicine Ontology: Driving personalized medicine by br...
The Translational Medicine Ontology: Driving personalized      medicine by br...The Translational Medicine Ontology: Driving personalized      medicine by br...
The Translational Medicine Ontology: Driving personalized medicine by br...
 
PERSPECTIVEUnderstanding Nutritional Epidemiology and Its.docx
PERSPECTIVEUnderstanding Nutritional Epidemiology and Its.docxPERSPECTIVEUnderstanding Nutritional Epidemiology and Its.docx
PERSPECTIVEUnderstanding Nutritional Epidemiology and Its.docx
 
CSU_Press_release-Diabetes
CSU_Press_release-DiabetesCSU_Press_release-Diabetes
CSU_Press_release-Diabetes
 
DIET-HEARTfinal
DIET-HEARTfinalDIET-HEARTfinal
DIET-HEARTfinal
 
Update on genetics and molecular biology.pdf
Update on genetics and molecular biology.pdfUpdate on genetics and molecular biology.pdf
Update on genetics and molecular biology.pdf
 
Biomarker.docx
Biomarker.docxBiomarker.docx
Biomarker.docx
 
dkNET Webinar: Illuminating The Druggable Genome With Pharos 10/23/2020
dkNET Webinar: Illuminating The Druggable Genome With Pharos 10/23/2020dkNET Webinar: Illuminating The Druggable Genome With Pharos 10/23/2020
dkNET Webinar: Illuminating The Druggable Genome With Pharos 10/23/2020
 
Biomarker.pdf
Biomarker.pdfBiomarker.pdf
Biomarker.pdf
 
Search engine for E NEU network science 080817
Search engine for E NEU network science 080817Search engine for E NEU network science 080817
Search engine for E NEU network science 080817
 
RunningHead PICOT Question1RunningHead PICOT Question7.docx
RunningHead PICOT Question1RunningHead PICOT Question7.docxRunningHead PICOT Question1RunningHead PICOT Question7.docx
RunningHead PICOT Question1RunningHead PICOT Question7.docx
 
Anil Malhotra.pdf.docx
Anil Malhotra.pdf.docxAnil Malhotra.pdf.docx
Anil Malhotra.pdf.docx
 
Pathways to wellbeing
Pathways to wellbeingPathways to wellbeing
Pathways to wellbeing
 
Life course trajectories of systolic blood pressure using longitudinal data f...
Life course trajectories of systolic blood pressure using longitudinal data f...Life course trajectories of systolic blood pressure using longitudinal data f...
Life course trajectories of systolic blood pressure using longitudinal data f...
 
The AIDS Linked to the IntraVenous Experience (ALIVE) Study
The AIDS Linked to the IntraVenous Experience (ALIVE) StudyThe AIDS Linked to the IntraVenous Experience (ALIVE) Study
The AIDS Linked to the IntraVenous Experience (ALIVE) Study
 
Biomedical Resources Science Teachers
Biomedical Resources Science TeachersBiomedical Resources Science Teachers
Biomedical Resources Science Teachers
 
Biomedical Informatics 706: Precision Medicine with exposures
Biomedical Informatics 706: Precision Medicine with exposuresBiomedical Informatics 706: Precision Medicine with exposures
Biomedical Informatics 706: Precision Medicine with exposures
 
The Case For a True Health Coalition
The Case For a True Health CoalitionThe Case For a True Health Coalition
The Case For a True Health Coalition
 
Schader_Honors_Thesis
Schader_Honors_ThesisSchader_Honors_Thesis
Schader_Honors_Thesis
 
"How Scientific Wellness will Drive The Future of Health" - Nathan Price (Pro...
"How Scientific Wellness will Drive The Future of Health" - Nathan Price (Pro..."How Scientific Wellness will Drive The Future of Health" - Nathan Price (Pro...
"How Scientific Wellness will Drive The Future of Health" - Nathan Price (Pro...
 

CBHLive2015

  • 1. S Creating a Drosophila Metabolomics Database Andrew Schwitzgebel and William Freeman Dr. Laura Reed Vishal Oza
  • 2. S Metabolite : Metabolome :: Gene : Genome S Metabolites are the intermediate products of metabolic reactions catalyzed by various enzymes that naturally occur within cells1 2 What is Metabolomics? 2/23/2016 Schwitzgebel & Freeman, CBH
  • 3. Metabolic Syndrome S MetS S Suite of symptoms S Obesity S Elevated blood lipids S Diabetes S Complex interaction between genetics and lifestyle 2/23/2016 3 Age-Adjusted Rate per 100 of Civilian, Noninstitutionalized Population with Diagnosed Diabetes, by Race and Sex, United States, 1980–2011 Schwitzgebel & Freeman, CBH
  • 4. Drosophila as a Human Analog S Complex genome S Cryptic genetic variation8 S ~15,000 genes vs. ~23,000 in humans S High throughput S Fast results S Observe population trends over time S Accessibility to metabolic data 2/23/2016 4 Schwitzgebel & Freeman, CBH
  • 5. Database Creation S Goals for FlyMetLib S Contain basic molecular data S Reference other chemical identifiers S HMDBID, DrugBank, ChemSpider, PubChem, etc. S ~1,500 PubChem SIDs S ~100 English synonyms S Link to other bioinformatic databases 2/23/2016 5 Schwitzgebel & Freeman, CBH
  • 6. 2/23/2016 6 Chemical IDs Example Schwitzgebel & Freeman, CBH PubChem SID 1 PubChem SID 2 PubChem SID 3 PubChem SID 4 SLING ID 1 SLING ID 2
  • 7. Chemical ID Conversion Procedure InChiKey read from a .csv file InChiKey translated into a URL of a chemical translation service Other identifiers retrieved using find and string methods Identifiers stored in a list variable to be added to the database Sample (Ethanol) LFQSCWFLJHTTHZ-UHFFFAOYSA-N http://cts.fiehnlab.ucdavis.edu/service/compound/LFQSCW FLJHTTHZ-UHFFFAOYSA-N “C2H6O” MolFormList = [“C2H6O”] 2/23/2016 Schwitzgebel & Freeman, CBH7
  • 8. Structuring the Database S Creating a unique ID S FlyMetID S 5 hexadecimal digits (Ex: FLY00001) S Allows greater number of unique IDs with fewer digits S InChiKey for ethanol: S LFQSCWFLJHTTHZ-UHFFFAOYSA-N S More human readable 2/23/2016 8 Schwitzgebel & Freeman, CBH
  • 9. Structuring the Database One to One S Chemical data table (Master) S Molecular weight S Molecular formula S Melting point S Certain IDs S Hotlinks S Link to other databases FlyMetID Molecular Weight Molecular Formula FLY00001 16.042 CH4 FLY00002 30.068 C2H6 FLY00003 44.094 C3H8 FLY00004 58.120 C4H10 2/23/2016 9 Schwitzgebel & Freeman, CBH
  • 10. Structuring the Database One to Many S Synonyms S IUPAC names S Certain IDs S PubChem, CAS, ChemSpider S Citations PubChem SIDs FlyMetID SID 1 SID 2 SID 3 FLY00001 48416772 16385457 6 24112948 1 FLY00002 12728063 0 FLY00003 12885427 1 53776982 FLY00004 12729869 9 12729867 4 99205937 2/23/2016 Schwitzgebel & Freeman, CBH10
  • 11. Significance S Researchers can access and contribute to the database S Open source code S One central place for data S Analogous to the Human Metabolome Database (HMDB) S Cuts down time cost for research 2/23/2016 11 Schwitzgebel & Freeman, CBH
  • 12. Future Plans S Making it publically accessible S Creating a user interface S Linking to FlyBase6 S Database for Drosophila genes S and genomes 2/23/2016 12 Schwitzgebel & Freeman, CBH
  • 13. Questions? Andrew Schwitzgebel Department of Biological Sciences Computer Based Honors Program ajschwitzgebel@crimson.ua.edu William Freeman Department of Chemical Engineering Computer Based Honors Program wjfreeman1@crimson.ua.edu Dr. Laura Reed Department of Biological Sciences lreed1@ua.edu 2/23/2016 13 Schwitzgebel & Freeman, CBH
  • 14. Proposed Metabolite Identification Metrics S Ex: Metabolite A S Identified using: 1H 2D NMR, Tandem Mass Spectrum, IR Absorbance Spectrum S Matched to an authentic reference compound S (3.0 + 1.5 + 0.5) (2) = 10 S Sumner suggests a minimum score of 59 2/23/2016 Schwitzgebel & Freeman, CBH14
  • 15. Works Cited 1. Meštrović, Tomislav. "What Are Metabolites?" News-Medical.net. N.p., 01 June 2010. Web. 06 Nov. 2015. 2. Glycolysis. Digital image. Northwest Metabolomics Research Center. University of Washington, n.d. Web. 6 Nov. 2015. 3. Zhang, Rong, Tong Zhang, Dominika Korzekwa, Shadi Al-Johani, Julian A.T. Dow, and David G. Watson. "A Comparison of the Metabolome of Male and Female Drosophila Melanogaster.” ResearchGate. Strathclyde Institute of Pharmaceutical and Biomedical Sciences,, 2014. Web. 06 Nov. 2015. 4. Pedrosa, Diego, Gert Wohlgemuth, Sajjan Mehta, and Oliver Fiehn. "Chemical Translation Service." Chemical Translation Service. U.C. Davis, 2011. Web. 06 Nov. 2015. 5. "Age-Adjusted Rate per 100 of Civilian, Noninstitutionalized Population with Diagnosed Diabetes, by Race and Sex, United States, 1980– 2011." Centers for Disease Control and Prevention. Centers for Disease Control and Prevention, 05 Sept. 2014. Web. 08 Nov. 2015. 6. "A Database of Drosophila Genes and Genomes." FlyBase. N.p., n.d. Web. 08 Nov. 2015. 7. "What Is Metabolic Syndrome?" National Heart, Lung and Blood Institute. National Institutes of Health, 06 Nov. 2015. Web. 08 Nov. 2015. 8. Gibson, Greg, and Ian Dworkin. "Uncovering Cryptic Genetic Variation." Genetics. Nature.com, Sept. 2004. Web. 8 Nov. 2015. 9. Sumner, LW. "Proposed Quantitative and Alphanumeric Metabolite Identification Metrics." Springer (2014): n. pag. Web. 1 Dec. 2015. 2/23/2016 15 Schwitzgebel & Freeman, CBH

Editor's Notes

  1. Graph taken from the CDC
  2. Cryptic genetic variation – genotype that does not contribute to phenotype in the absence of external stimuli Accessibility – how you access/analyze the metabolites in an organism
  3. DrugBank used in pharmaceuticals, HMDBID used for humans (medicine), anything with Chem deals with organic chemistry mostly
  4. Andrew starts here, demonstrates # of different ways to talk about one chemical
  5. Will starts here
  6. Many to Many with junction table
  7. One place too see all the pieces of the puzzle. Less scattered