X-team #2
High Dimensional
Biological Butterflies
Data Science Workshop 2015
What do we have in common?
High-dimensional biological data
● High-throughput genotyping and phenotyping
● Finding biological meaning in big data with high N (samples) and/or P (features)
The ability to harvest the wealth of information contained in
biomedical Big Data will advance our understanding of
human health and disease; however, lack of appropriate
tools, poor data accessibility, and insufficient training, are
major impediments to rapid translational impact. -NIH BD2K
Problem
● Data fragmentation
  ○ individual vs population
  ○ multiple -omics
  ○ multiple sources
● Discovery and prediction
  ○ genome and functional annotation
● Data quality
  ○ hidden sources of variability
  ○ limitations of short read sequencing
Solution
● Data integration
● Statistical learning methods
● Data annotation
● Genome assembly/error correction
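The "hidden sources of variability" point can be illustrated with a small simulation. This is a minimal sketch, not one of the methods named in this deck: it uses plain principal component analysis via NumPy's SVD on made-up data, and the function name `top_hidden_factor`, the simulated `batch` label, and all sizes are hypothetical choices for illustration.

```python
import numpy as np

def top_hidden_factor(expr):
    """Score each sample on the first principal component.

    expr: samples x features matrix (e.g. samples x genes).
    Returns (per-sample PC1 scores, fraction of variance PC1 explains).
    """
    centered = expr - expr.mean(axis=0)  # center each feature
    u, s, _ = np.linalg.svd(centered, full_matrices=False)
    var_explained = s**2 / np.sum(s**2)
    return u[:, 0] * s[0], var_explained[0]

# Simulated data (assumption): 20 samples, 500 features, so P >> N.
rng = np.random.default_rng(0)
n_samples, n_genes = 20, 500
batch = np.repeat([0.0, 1.0], n_samples // 2)  # unrecorded batch label
loadings = rng.normal(size=n_genes)            # per-feature batch effect
expr = rng.normal(size=(n_samples, n_genes)) + 2.0 * np.outer(batch, loadings)

scores, frac = top_hidden_factor(expr)
corr = abs(np.corrcoef(scores, batch)[0, 1])
print(f"PC1 variance fraction: {frac:.2f}; |correlation with batch|: {corr:.2f}")
```

In this toy setting the leading component tracks the unrecorded batch label closely, which is the kind of hidden structure that dedicated batch-effect tools are designed to model more carefully.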
Success Stories
Domain Science: Data Science Methods
Metabolic pathway
- Ingenuity Pathway Analysis (http://www.ingenuity.com/products/ipa)
Genomic data
- Quality control
  - FastQC (http://www.bioinformatics.babraham.ac.uk/projects/fastqc/)
  - EasyQC for genome-wide association meta-analyses (http://www.nature.com/nprot/journal/v9/n5/full/nprot.2014.071.html)
- Batch effect
  - PEER (http://www.ncbi.nlm.nih.gov/pubmed/22343431)
  - SVA (http://www.ncbi.nlm.nih.gov/pubmed/22257669)
  - scLVM (Buettner et al., 2015)
- Data storage and sharing
  - NCBI (http://www.ncbi.nlm.nih.gov)
  - GitHub (https://github.com)
  - UCSC genome browser (http://genome.ucsc.edu/)
- Gene annotation
  - Gene Ontology (http://geneontology.org/page/documentation)
Proteomics
- Protein Data Bank (PDB) (http://www.rcsb.org/pdb/home/home.do)
Disease survivability
- WEKA (Mark Hall, Eibe Frank, Geoffrey Holmes, Bernhard Pfahringer, Peter Reutemann, Ian H. Witten (2009); The WEKA Data Mining Software: An Update; SIGKDD Explorations, Volume 11, Issue 1.)
Same data, different interpretation
Gilad & Mizrahi-Man 2015
F1000Research, 4:121
Interdisciplinary Research
Interdisciplinary data science essentials
Going Forward
● Create and maintain a HowTo website for Data Science computational tools and methods.
  http://data-science-for-biologists.wikia.com/wiki/Data_Science_for_Biologists_Wikia
● Collaborate via GitHub
Thanks!

Editor's Notes

  1. Half are domain scientists and half are more computationally inclined. Made this word cloud from our notes. Data. Comp bio. Disease. Genetics. Integrative analyses. Disease spread. Social environment and epigenetics. Data privacy, data sharing, and computational genetics. Genetics, proteomics, and statistical tools to understand disease and cancer or individual phenotypic variation. Tool development. RNAseq technology and applications; tools for data reduction and variable selection.
  2. S
  3. Predicting disease survivability for breast cancer patients. Famous example: potential flaws in genomics paper scrutinized on Twitter: http://www.nature.com/news/potential-flaws-in-genomics-paper-scrutinized-on-twitter-1.17591