SlideShare a Scribd company logo
Statistical and Visualization Methods for Metagenomic Analysis
Héctor Corrada Bravo
Center for Bioinformatics and Computational Biology
• metagenomeSeq
– 16S differential abundance
– R/Bioconductor infrastructure for
metagenomic assays
– Longitudinal data
• metagenomicFeatures
– Incipient attempt regularizing 16S feature
annotations in R/Bioconductor
– E.g., greengenes13.5MgDb
• msd16s
– Example data, as infrastructure object
R/Bioconductor Strengths
• Infrastructure objects
– Interoperability, speed up startup time for method development
• Strict development practices
– Documentation, use cases, vignettes
• Annotation infrastructure
– Again, interoperability across experiments and data types
• Exploratory analysis
• Reproducibility
– Vignettes, Rmarkdown, etc.
• Recently, exploratory and interactive visualization
– Shiny, epiviz
Integrative, visual and computational
exploratory analysis of genomic data
• Browser-based
• Interactive
• Integration of data
• Reproducible dissemination
• Communication with R/Bioconductor: epivizr package
software systems to support creative exploratory analysis of large genome-wide datasets...
• Computed Measurements: create new measurements from
integrated measurements and visualize
• Summarization: summarize integrated measurements
(computed on data subsets)
Dynamically extensible: Easily integrate new data sources, data
types and add new visualizations.
Data providers define coordinate
space
One interpretation of Big Data is many sources of relevant
contextual data
• Easily access/integrate contextual data
• Driven by exploratory analysis of immediate
data
• Iterative process
• Visual and computational exploration go
hand in hand
Visualization design goals
Context
• Integrate and align multiple data sources;
navigate; search
• Connect: brushing
• Encode: map visualization properties to
data on the fly
• Reconfigure: multiple views of the same
data
Visualization design goals
Data
• Select and filter: tight-knit integration with
R/Bioconductor
• (current work) filters on visualization
propagate to data environment
Model
• New 'measurements' the result of
modeling; suggested by data context
Metagenomic Visualization
• How to effectively navigate large datasets
where features are organized hierarchically?
• Metaviz: browser-based, interactive
exploratory analysis of metagenomic
data
• Connection to R/Bioconductor with
metavizr package
• Built on metagenomeSeq and
metagenomeFeatures infrastructure
Metaviz
• Exploration of hierarchically organized
features
• Geared towards 16S for now
– Hierarchical organization relevant to WGS
• Integration is a big part of design
– Framework designed for data integration
Acknowledgements
Brianna Lindsey, O. Colin Stine, Owen White, Anup Mahurkar: University of Maryland Baltimore
Jim Nataro: University of Virginia
NIGMS, Genentech
Florin Chelaru
(now @ MIT)
Joseph Paulson
(now @ Harvard)
Mihai Pop
(@ UMD)
Hmp 201512

More Related Content

What's hot

Semantic mediawiki
Semantic mediawikiSemantic mediawiki
Semantic mediawiki
Karsten Krumrück
 
Context-free data analysis with Transcendental Information Cascades.
Context-free data analysis with Transcendental Information Cascades.Context-free data analysis with Transcendental Information Cascades.
Context-free data analysis with Transcendental Information Cascades.
Markus Luczak-Rösch
 
Linked Data media experiment
Linked Data media experimentLinked Data media experiment
Linked Data media experiment
MediArena
 
Big Data (SOCIOMETRIC METHODS FOR RELEVANCY ANALYSIS OF LONG TAIL SCIENCE D...
Big Data (SOCIOMETRIC METHODS FOR  RELEVANCY ANALYSIS OF LONG TAIL  SCIENCE D...Big Data (SOCIOMETRIC METHODS FOR  RELEVANCY ANALYSIS OF LONG TAIL  SCIENCE D...
Big Data (SOCIOMETRIC METHODS FOR RELEVANCY ANALYSIS OF LONG TAIL SCIENCE D...
AKSHAY BHAGAT
 
20170621_System requirements of data journal platform
20170621_System requirements of data journal platform20170621_System requirements of data journal platform
20170621_System requirements of data journal platform
Yasuyuki Minamiyama
 
2013 04 g8opendata-ag_infra
2013 04 g8opendata-ag_infra2013 04 g8opendata-ag_infra
2013 04 g8opendata-ag_infra
Johannes Keizer
 
Are you talking to me? Researching a scenario for linking objects and publica...
Are you talking to me? Researching a scenario for linking objects and publica...Are you talking to me? Researching a scenario for linking objects and publica...
Are you talking to me? Researching a scenario for linking objects and publica...
Ellen Van Keer
 
RDAP 15: “This is just for me”: Researchers on their data documentation pract...
RDAP 15: “This is just for me”: Researchers on their data documentation pract...RDAP 15: “This is just for me”: Researchers on their data documentation pract...
RDAP 15: “This is just for me”: Researchers on their data documentation pract...
ASIS&T
 
Linked data representation
Linked data representationLinked data representation
Linked data representation
haroonrashidlone
 
RDAP 15: Beyond Metadata: Leveraging the “README” to support disciplinary Doc...
RDAP 15: Beyond Metadata: Leveraging the “README” to support disciplinary Doc...RDAP 15: Beyond Metadata: Leveraging the “README” to support disciplinary Doc...
RDAP 15: Beyond Metadata: Leveraging the “README” to support disciplinary Doc...
ASIS&T
 
Towards a comprehensive call ontology for research 2.0
Towards a comprehensive call ontology for research 2.0Towards a comprehensive call ontology for research 2.0
Towards a comprehensive call ontology for research 2.0
Vladimir Tomberg
 
36. data mining techniques
36. data mining techniques36. data mining techniques
36. data mining techniques
奈良先端大 情報科学研究科
 
Integrating repositories and eLab notebooks through an open science framework
Integrating repositories and eLab notebooks through an open science frameworkIntegrating repositories and eLab notebooks through an open science framework
Integrating repositories and eLab notebooks through an open science framework
rmacneil88
 
American Archive of Public Broadcasting: Preservation and Content Continuity
American Archive of Public Broadcasting: Preservation and Content ContinuityAmerican Archive of Public Broadcasting: Preservation and Content Continuity
American Archive of Public Broadcasting: Preservation and Content Continuity
WGBH Media Library and Archives
 

What's hot (15)

Semantic mediawiki
Semantic mediawikiSemantic mediawiki
Semantic mediawiki
 
Context-free data analysis with Transcendental Information Cascades.
Context-free data analysis with Transcendental Information Cascades.Context-free data analysis with Transcendental Information Cascades.
Context-free data analysis with Transcendental Information Cascades.
 
Linked Data media experiment
Linked Data media experimentLinked Data media experiment
Linked Data media experiment
 
Big Data (SOCIOMETRIC METHODS FOR RELEVANCY ANALYSIS OF LONG TAIL SCIENCE D...
Big Data (SOCIOMETRIC METHODS FOR  RELEVANCY ANALYSIS OF LONG TAIL  SCIENCE D...Big Data (SOCIOMETRIC METHODS FOR  RELEVANCY ANALYSIS OF LONG TAIL  SCIENCE D...
Big Data (SOCIOMETRIC METHODS FOR RELEVANCY ANALYSIS OF LONG TAIL SCIENCE D...
 
20170621_System requirements of data journal platform
20170621_System requirements of data journal platform20170621_System requirements of data journal platform
20170621_System requirements of data journal platform
 
Dacena
DacenaDacena
Dacena
 
2013 04 g8opendata-ag_infra
2013 04 g8opendata-ag_infra2013 04 g8opendata-ag_infra
2013 04 g8opendata-ag_infra
 
Are you talking to me? Researching a scenario for linking objects and publica...
Are you talking to me? Researching a scenario for linking objects and publica...Are you talking to me? Researching a scenario for linking objects and publica...
Are you talking to me? Researching a scenario for linking objects and publica...
 
RDAP 15: “This is just for me”: Researchers on their data documentation pract...
RDAP 15: “This is just for me”: Researchers on their data documentation pract...RDAP 15: “This is just for me”: Researchers on their data documentation pract...
RDAP 15: “This is just for me”: Researchers on their data documentation pract...
 
Linked data representation
Linked data representationLinked data representation
Linked data representation
 
RDAP 15: Beyond Metadata: Leveraging the “README” to support disciplinary Doc...
RDAP 15: Beyond Metadata: Leveraging the “README” to support disciplinary Doc...RDAP 15: Beyond Metadata: Leveraging the “README” to support disciplinary Doc...
RDAP 15: Beyond Metadata: Leveraging the “README” to support disciplinary Doc...
 
Towards a comprehensive call ontology for research 2.0
Towards a comprehensive call ontology for research 2.0Towards a comprehensive call ontology for research 2.0
Towards a comprehensive call ontology for research 2.0
 
36. data mining techniques
36. data mining techniques36. data mining techniques
36. data mining techniques
 
Integrating repositories and eLab notebooks through an open science framework
Integrating repositories and eLab notebooks through an open science frameworkIntegrating repositories and eLab notebooks through an open science framework
Integrating repositories and eLab notebooks through an open science framework
 
American Archive of Public Broadcasting: Preservation and Content Continuity
American Archive of Public Broadcasting: Preservation and Content ContinuityAmerican Archive of Public Broadcasting: Preservation and Content Continuity
American Archive of Public Broadcasting: Preservation and Content Continuity
 

Viewers also liked

Indicadores de DESC, su producción y uso
Indicadores de DESC, su producción y usoIndicadores de DESC, su producción y uso
Indicadores de DESC, su producción y uso
Walter Mauricio Barreto
 
EL-NAKHEIL OIL SHALE: A PROMISING RESOURCE OF UNCONVENTIONAL RAW MATERIAL FOR...
EL-NAKHEIL OIL SHALE: A PROMISING RESOURCE OF UNCONVENTIONAL RAW MATERIAL FOR...EL-NAKHEIL OIL SHALE: A PROMISING RESOURCE OF UNCONVENTIONAL RAW MATERIAL FOR...
EL-NAKHEIL OIL SHALE: A PROMISING RESOURCE OF UNCONVENTIONAL RAW MATERIAL FOR...
Ahmed Abd el-Ghany
 
Jung y platon. influencias de platon en jung
Jung  y platon. influencias de platon en jungJung  y platon. influencias de platon en jung
Jung y platon. influencias de platon en jung
Hemil Mora
 
Kernel
KernelKernel
Informe ejecutivi fase1_wilson_pinto
Informe ejecutivi fase1_wilson_pintoInforme ejecutivi fase1_wilson_pinto
Informe ejecutivi fase1_wilson_pinto
wilspinto
 
Datos curiosos
Datos curiososDatos curiosos
Datos curiosos
Linda Castaño
 
Class 8 Cbse Chemistry Sample Paper Term 2 Model 2
Class 8 Cbse Chemistry Sample Paper Term 2 Model 2Class 8 Cbse Chemistry Sample Paper Term 2 Model 2
Class 8 Cbse Chemistry Sample Paper Term 2 Model 2
Sunaina Rawat
 
Datos y probabilidades
Datos y probabilidadesDatos y probabilidades
Datos y probabilidades
Katherina Mosquera
 
New microsoft power point presentation
New microsoft power point presentationNew microsoft power point presentation
New microsoft power point presentationmwincott
 
Pasos para formatear una usb
Pasos para formatear una usbPasos para formatear una usb
Pasos para formatear una usb
arelyUGMEX
 
Arte Contemporáneo
Arte ContemporáneoArte Contemporáneo
Arte Contemporáneo
Candy Mendoza
 
Revision paper en Emerald
Revision paper en EmeraldRevision paper en Emerald
Revision paper en Emeraldatrivinho
 
Kernel
KernelKernel
Kernel
esdeguau27
 
Seguridad informática: virus y otros daños para nuestro PC
Seguridad informática: virus y otros daños para nuestro PCSeguridad informática: virus y otros daños para nuestro PC
Seguridad informática: virus y otros daños para nuestro PC
yireni
 

Viewers also liked (18)

Indicadores de DESC, su producción y uso
Indicadores de DESC, su producción y usoIndicadores de DESC, su producción y uso
Indicadores de DESC, su producción y uso
 
EL-NAKHEIL OIL SHALE: A PROMISING RESOURCE OF UNCONVENTIONAL RAW MATERIAL FOR...
EL-NAKHEIL OIL SHALE: A PROMISING RESOURCE OF UNCONVENTIONAL RAW MATERIAL FOR...EL-NAKHEIL OIL SHALE: A PROMISING RESOURCE OF UNCONVENTIONAL RAW MATERIAL FOR...
EL-NAKHEIL OIL SHALE: A PROMISING RESOURCE OF UNCONVENTIONAL RAW MATERIAL FOR...
 
Jung y platon. influencias de platon en jung
Jung  y platon. influencias de platon en jungJung  y platon. influencias de platon en jung
Jung y platon. influencias de platon en jung
 
Kernel
KernelKernel
Kernel
 
141112pdfrazoneshuelga2 octavilla
141112pdfrazoneshuelga2 octavilla141112pdfrazoneshuelga2 octavilla
141112pdfrazoneshuelga2 octavilla
 
Archivos
ArchivosArchivos
Archivos
 
Informe ejecutivi fase1_wilson_pinto
Informe ejecutivi fase1_wilson_pintoInforme ejecutivi fase1_wilson_pinto
Informe ejecutivi fase1_wilson_pinto
 
Datos curiosos
Datos curiososDatos curiosos
Datos curiosos
 
Class 8 Cbse Chemistry Sample Paper Term 2 Model 2
Class 8 Cbse Chemistry Sample Paper Term 2 Model 2Class 8 Cbse Chemistry Sample Paper Term 2 Model 2
Class 8 Cbse Chemistry Sample Paper Term 2 Model 2
 
Datos y probabilidades
Datos y probabilidadesDatos y probabilidades
Datos y probabilidades
 
G616926US201S_sp
G616926US201S_spG616926US201S_sp
G616926US201S_sp
 
New microsoft power point presentation
New microsoft power point presentationNew microsoft power point presentation
New microsoft power point presentation
 
Mohamed El Nady C.V
Mohamed El Nady C.VMohamed El Nady C.V
Mohamed El Nady C.V
 
Pasos para formatear una usb
Pasos para formatear una usbPasos para formatear una usb
Pasos para formatear una usb
 
Arte Contemporáneo
Arte ContemporáneoArte Contemporáneo
Arte Contemporáneo
 
Revision paper en Emerald
Revision paper en EmeraldRevision paper en Emerald
Revision paper en Emerald
 
Kernel
KernelKernel
Kernel
 
Seguridad informática: virus y otros daños para nuestro PC
Seguridad informática: virus y otros daños para nuestro PCSeguridad informática: virus y otros daños para nuestro PC
Seguridad informática: virus y otros daños para nuestro PC
 

Similar to Hmp 201512

Linked Data Quality Assessment – daQ and Luzzu
Linked Data Quality Assessment – daQ and LuzzuLinked Data Quality Assessment – daQ and Luzzu
Linked Data Quality Assessment – daQ and Luzzu
jerdeb
 
BlueBrain Nexus Technical Introduction
BlueBrain Nexus Technical IntroductionBlueBrain Nexus Technical Introduction
BlueBrain Nexus Technical Introduction
Bogdan Roman
 
A scalable architecture for extracting, aligning, linking, and visualizing mu...
A scalable architecture for extracting, aligning, linking, and visualizing mu...A scalable architecture for extracting, aligning, linking, and visualizing mu...
A scalable architecture for extracting, aligning, linking, and visualizing mu...
Craig Knoblock
 
UNIT - 5: Data Warehousing and Data Mining
UNIT - 5: Data Warehousing and Data MiningUNIT - 5: Data Warehousing and Data Mining
UNIT - 5: Data Warehousing and Data Mining
Nandakumar P
 
Cytoscape Network Visualization and Analysis
Cytoscape Network Visualization and AnalysisCytoscape Network Visualization and Analysis
Cytoscape Network Visualization and Analysis
bdemchak
 
Making project data avalialble eNanomapper through Database
Making project data avalialble eNanomapper through  DatabaseMaking project data avalialble eNanomapper through  Database
Making project data avalialble eNanomapper through Database
Nina Jeliazkova
 
WEB BASED INFORMATION RETRIEVAL SYSTEM
WEB BASED INFORMATION RETRIEVAL SYSTEMWEB BASED INFORMATION RETRIEVAL SYSTEM
WEB BASED INFORMATION RETRIEVAL SYSTEM
Sai Kumar Ale
 
Maximizing AI Performance with Vector Databases: A Comprehensive Guide
Maximizing AI Performance with Vector Databases: A Comprehensive GuideMaximizing AI Performance with Vector Databases: A Comprehensive Guide
Maximizing AI Performance with Vector Databases: A Comprehensive Guide
Bhusan Chettri
 
Experiences In Building Globus Genomics Using Galaxy, Globus Online and AWS
Experiences In Building Globus Genomics Using Galaxy, Globus Online and AWSExperiences In Building Globus Genomics Using Galaxy, Globus Online and AWS
Experiences In Building Globus Genomics Using Galaxy, Globus Online and AWS
Ed Dodds
 
Unit 3 part i Data mining
Unit 3 part i Data miningUnit 3 part i Data mining
Unit 3 part i Data mining
Dhilsath Fathima
 
Adelaide Rhodes Resume March 2023
Adelaide Rhodes Resume March 2023Adelaide Rhodes Resume March 2023
Adelaide Rhodes Resume March 2023
Stacy Taylor
 
Towards Semantic APIs for Research Data Services (Invited Talk)
Towards Semantic APIs for Research Data Services (Invited Talk)Towards Semantic APIs for Research Data Services (Invited Talk)
Towards Semantic APIs for Research Data Services (Invited Talk)
Anna Fensel
 
Building genomic data cyberinfrastructure with the online database software T...
Building genomic data cyberinfrastructure with the online database software T...Building genomic data cyberinfrastructure with the online database software T...
Building genomic data cyberinfrastructure with the online database software T...
mestato
 
ChemSpider – disseminating data and enabling an abundance of chemistry platforms
ChemSpider – disseminating data and enabling an abundance of chemistry platformsChemSpider – disseminating data and enabling an abundance of chemistry platforms
ChemSpider – disseminating data and enabling an abundance of chemistry platforms
Ken Karapetyan
 
Semantic Technologies for Big Sciences including Astrophysics
Semantic Technologies for Big Sciences including AstrophysicsSemantic Technologies for Big Sciences including Astrophysics
Semantic Technologies for Big Sciences including Astrophysics
Artificial Intelligence Institute at UofSC
 
Changing the Curation Equation: A Data Lifecycle Approach to Lowering Costs a...
Changing the Curation Equation: A Data Lifecycle Approach to Lowering Costs a...Changing the Curation Equation: A Data Lifecycle Approach to Lowering Costs a...
Changing the Curation Equation: A Data Lifecycle Approach to Lowering Costs a...
SEAD
 
AIAA Conference - Big Data Session_ Final - Jan 2016
AIAA Conference - Big Data Session_ Final - Jan 2016AIAA Conference - Big Data Session_ Final - Jan 2016
AIAA Conference - Big Data Session_ Final - Jan 2016Manjula Ambur
 
Zudilova-Seinstra-Elsevier-data and the article of the future-nfdp13
Zudilova-Seinstra-Elsevier-data and the article of the future-nfdp13Zudilova-Seinstra-Elsevier-data and the article of the future-nfdp13
Zudilova-Seinstra-Elsevier-data and the article of the future-nfdp13
DataDryad
 
Lec 1 integrating data science and data analytics in various research thrust
Lec 1 integrating data science and data analytics in various research thrustLec 1 integrating data science and data analytics in various research thrust
Lec 1 integrating data science and data analytics in various research thrust
Menchita Falcutila Dumlao
 
COBWEB - infrastructure and platform for Environmental Crowd Sensing and Big ...
COBWEB - infrastructure and platform for Environmental Crowd Sensing and Big ...COBWEB - infrastructure and platform for Environmental Crowd Sensing and Big ...
COBWEB - infrastructure and platform for Environmental Crowd Sensing and Big ...
EDINA, University of Edinburgh
 

Similar to Hmp 201512 (20)

Linked Data Quality Assessment – daQ and Luzzu
Linked Data Quality Assessment – daQ and LuzzuLinked Data Quality Assessment – daQ and Luzzu
Linked Data Quality Assessment – daQ and Luzzu
 
BlueBrain Nexus Technical Introduction
BlueBrain Nexus Technical IntroductionBlueBrain Nexus Technical Introduction
BlueBrain Nexus Technical Introduction
 
A scalable architecture for extracting, aligning, linking, and visualizing mu...
A scalable architecture for extracting, aligning, linking, and visualizing mu...A scalable architecture for extracting, aligning, linking, and visualizing mu...
A scalable architecture for extracting, aligning, linking, and visualizing mu...
 
UNIT - 5: Data Warehousing and Data Mining
UNIT - 5: Data Warehousing and Data MiningUNIT - 5: Data Warehousing and Data Mining
UNIT - 5: Data Warehousing and Data Mining
 
Cytoscape Network Visualization and Analysis
Cytoscape Network Visualization and AnalysisCytoscape Network Visualization and Analysis
Cytoscape Network Visualization and Analysis
 
Making project data avalialble eNanomapper through Database
Making project data avalialble eNanomapper through  DatabaseMaking project data avalialble eNanomapper through  Database
Making project data avalialble eNanomapper through Database
 
WEB BASED INFORMATION RETRIEVAL SYSTEM
WEB BASED INFORMATION RETRIEVAL SYSTEMWEB BASED INFORMATION RETRIEVAL SYSTEM
WEB BASED INFORMATION RETRIEVAL SYSTEM
 
Maximizing AI Performance with Vector Databases: A Comprehensive Guide
Maximizing AI Performance with Vector Databases: A Comprehensive GuideMaximizing AI Performance with Vector Databases: A Comprehensive Guide
Maximizing AI Performance with Vector Databases: A Comprehensive Guide
 
Experiences In Building Globus Genomics Using Galaxy, Globus Online and AWS
Experiences In Building Globus Genomics Using Galaxy, Globus Online and AWSExperiences In Building Globus Genomics Using Galaxy, Globus Online and AWS
Experiences In Building Globus Genomics Using Galaxy, Globus Online and AWS
 
Unit 3 part i Data mining
Unit 3 part i Data miningUnit 3 part i Data mining
Unit 3 part i Data mining
 
Adelaide Rhodes Resume March 2023
Adelaide Rhodes Resume March 2023Adelaide Rhodes Resume March 2023
Adelaide Rhodes Resume March 2023
 
Towards Semantic APIs for Research Data Services (Invited Talk)
Towards Semantic APIs for Research Data Services (Invited Talk)Towards Semantic APIs for Research Data Services (Invited Talk)
Towards Semantic APIs for Research Data Services (Invited Talk)
 
Building genomic data cyberinfrastructure with the online database software T...
Building genomic data cyberinfrastructure with the online database software T...Building genomic data cyberinfrastructure with the online database software T...
Building genomic data cyberinfrastructure with the online database software T...
 
ChemSpider – disseminating data and enabling an abundance of chemistry platforms
ChemSpider – disseminating data and enabling an abundance of chemistry platformsChemSpider – disseminating data and enabling an abundance of chemistry platforms
ChemSpider – disseminating data and enabling an abundance of chemistry platforms
 
Semantic Technologies for Big Sciences including Astrophysics
Semantic Technologies for Big Sciences including AstrophysicsSemantic Technologies for Big Sciences including Astrophysics
Semantic Technologies for Big Sciences including Astrophysics
 
Changing the Curation Equation: A Data Lifecycle Approach to Lowering Costs a...
Changing the Curation Equation: A Data Lifecycle Approach to Lowering Costs a...Changing the Curation Equation: A Data Lifecycle Approach to Lowering Costs a...
Changing the Curation Equation: A Data Lifecycle Approach to Lowering Costs a...
 
AIAA Conference - Big Data Session_ Final - Jan 2016
AIAA Conference - Big Data Session_ Final - Jan 2016AIAA Conference - Big Data Session_ Final - Jan 2016
AIAA Conference - Big Data Session_ Final - Jan 2016
 
Zudilova-Seinstra-Elsevier-data and the article of the future-nfdp13
Zudilova-Seinstra-Elsevier-data and the article of the future-nfdp13Zudilova-Seinstra-Elsevier-data and the article of the future-nfdp13
Zudilova-Seinstra-Elsevier-data and the article of the future-nfdp13
 
Lec 1 integrating data science and data analytics in various research thrust
Lec 1 integrating data science and data analytics in various research thrustLec 1 integrating data science and data analytics in various research thrust
Lec 1 integrating data science and data analytics in various research thrust
 
COBWEB - infrastructure and platform for Environmental Crowd Sensing and Big ...
COBWEB - infrastructure and platform for Environmental Crowd Sensing and Big ...COBWEB - infrastructure and platform for Environmental Crowd Sensing and Big ...
COBWEB - infrastructure and platform for Environmental Crowd Sensing and Big ...
 

Recently uploaded

(May 29th, 2024) Advancements in Intravital Microscopy- Insights for Preclini...
(May 29th, 2024) Advancements in Intravital Microscopy- Insights for Preclini...(May 29th, 2024) Advancements in Intravital Microscopy- Insights for Preclini...
(May 29th, 2024) Advancements in Intravital Microscopy- Insights for Preclini...
Scintica Instrumentation
 
extra-chromosomal-inheritance[1].pptx.pdfpdf
extra-chromosomal-inheritance[1].pptx.pdfpdfextra-chromosomal-inheritance[1].pptx.pdfpdf
extra-chromosomal-inheritance[1].pptx.pdfpdf
DiyaBiswas10
 
Seminar of U.V. Spectroscopy by SAMIR PANDA
 Seminar of U.V. Spectroscopy by SAMIR PANDA Seminar of U.V. Spectroscopy by SAMIR PANDA
Seminar of U.V. Spectroscopy by SAMIR PANDA
SAMIR PANDA
 
Lab report on liquid viscosity of glycerin
Lab report on liquid viscosity of glycerinLab report on liquid viscosity of glycerin
Lab report on liquid viscosity of glycerin
ossaicprecious19
 
GBSN- Microbiology (Lab 3) Gram Staining
GBSN- Microbiology (Lab 3) Gram StainingGBSN- Microbiology (Lab 3) Gram Staining
GBSN- Microbiology (Lab 3) Gram Staining
Areesha Ahmad
 
What is greenhouse gasses and how many gasses are there to affect the Earth.
What is greenhouse gasses and how many gasses are there to affect the Earth.What is greenhouse gasses and how many gasses are there to affect the Earth.
What is greenhouse gasses and how many gasses are there to affect the Earth.
moosaasad1975
 
Circulatory system_ Laplace law. Ohms law.reynaults law,baro-chemo-receptors-...
Circulatory system_ Laplace law. Ohms law.reynaults law,baro-chemo-receptors-...Circulatory system_ Laplace law. Ohms law.reynaults law,baro-chemo-receptors-...
Circulatory system_ Laplace law. Ohms law.reynaults law,baro-chemo-receptors-...
muralinath2
 
Unveiling the Energy Potential of Marshmallow Deposits.pdf
Unveiling the Energy Potential of Marshmallow Deposits.pdfUnveiling the Energy Potential of Marshmallow Deposits.pdf
Unveiling the Energy Potential of Marshmallow Deposits.pdf
Erdal Coalmaker
 
in vitro propagation of plants lecture note.pptx
in vitro propagation of plants lecture note.pptxin vitro propagation of plants lecture note.pptx
in vitro propagation of plants lecture note.pptx
yusufzako14
 
erythropoiesis-I_mechanism& clinical significance.pptx
erythropoiesis-I_mechanism& clinical significance.pptxerythropoiesis-I_mechanism& clinical significance.pptx
erythropoiesis-I_mechanism& clinical significance.pptx
muralinath2
 
Body fluids_tonicity_dehydration_hypovolemia_hypervolemia.pptx
Body fluids_tonicity_dehydration_hypovolemia_hypervolemia.pptxBody fluids_tonicity_dehydration_hypovolemia_hypervolemia.pptx
Body fluids_tonicity_dehydration_hypovolemia_hypervolemia.pptx
muralinath2
 
Structural Classification Of Protein (SCOP)
Structural Classification Of Protein  (SCOP)Structural Classification Of Protein  (SCOP)
Structural Classification Of Protein (SCOP)
aishnasrivastava
 
filosofia boliviana introducción jsjdjd.pptx
filosofia boliviana introducción jsjdjd.pptxfilosofia boliviana introducción jsjdjd.pptx
filosofia boliviana introducción jsjdjd.pptx
IvanMallco1
 
Citrus Greening Disease and its Management
Citrus Greening Disease and its ManagementCitrus Greening Disease and its Management
Citrus Greening Disease and its Management
subedisuryaofficial
 
THE IMPORTANCE OF MARTIAN ATMOSPHERE SAMPLE RETURN.
THE IMPORTANCE OF MARTIAN ATMOSPHERE SAMPLE RETURN.THE IMPORTANCE OF MARTIAN ATMOSPHERE SAMPLE RETURN.
THE IMPORTANCE OF MARTIAN ATMOSPHERE SAMPLE RETURN.
Sérgio Sacani
 
platelets_clotting_biogenesis.clot retractionpptx
platelets_clotting_biogenesis.clot retractionpptxplatelets_clotting_biogenesis.clot retractionpptx
platelets_clotting_biogenesis.clot retractionpptx
muralinath2
 
role of pramana in research.pptx in science
role of pramana in research.pptx in sciencerole of pramana in research.pptx in science
role of pramana in research.pptx in science
sonaliswain16
 
Nucleic Acid-its structural and functional complexity.
Nucleic Acid-its structural and functional complexity.Nucleic Acid-its structural and functional complexity.
Nucleic Acid-its structural and functional complexity.
Nistarini College, Purulia (W.B) India
 
Leaf Initiation, Growth and Differentiation.pdf
Leaf Initiation, Growth and Differentiation.pdfLeaf Initiation, Growth and Differentiation.pdf
Leaf Initiation, Growth and Differentiation.pdf
RenuJangid3
 
Observation of Io’s Resurfacing via Plume Deposition Using Ground-based Adapt...
Observation of Io’s Resurfacing via Plume Deposition Using Ground-based Adapt...Observation of Io’s Resurfacing via Plume Deposition Using Ground-based Adapt...
Observation of Io’s Resurfacing via Plume Deposition Using Ground-based Adapt...
Sérgio Sacani
 

Recently uploaded (20)

(May 29th, 2024) Advancements in Intravital Microscopy- Insights for Preclini...
(May 29th, 2024) Advancements in Intravital Microscopy- Insights for Preclini...(May 29th, 2024) Advancements in Intravital Microscopy- Insights for Preclini...
(May 29th, 2024) Advancements in Intravital Microscopy- Insights for Preclini...
 
extra-chromosomal-inheritance[1].pptx.pdfpdf
extra-chromosomal-inheritance[1].pptx.pdfpdfextra-chromosomal-inheritance[1].pptx.pdfpdf
extra-chromosomal-inheritance[1].pptx.pdfpdf
 
Seminar of U.V. Spectroscopy by SAMIR PANDA
 Seminar of U.V. Spectroscopy by SAMIR PANDA Seminar of U.V. Spectroscopy by SAMIR PANDA
Seminar of U.V. Spectroscopy by SAMIR PANDA
 
Lab report on liquid viscosity of glycerin
Lab report on liquid viscosity of glycerinLab report on liquid viscosity of glycerin
Lab report on liquid viscosity of glycerin
 
GBSN- Microbiology (Lab 3) Gram Staining
GBSN- Microbiology (Lab 3) Gram StainingGBSN- Microbiology (Lab 3) Gram Staining
GBSN- Microbiology (Lab 3) Gram Staining
 
What is greenhouse gasses and how many gasses are there to affect the Earth.
What is greenhouse gasses and how many gasses are there to affect the Earth.What is greenhouse gasses and how many gasses are there to affect the Earth.
What is greenhouse gasses and how many gasses are there to affect the Earth.
 
Circulatory system_ Laplace law. Ohms law.reynaults law,baro-chemo-receptors-...
Circulatory system_ Laplace law. Ohms law.reynaults law,baro-chemo-receptors-...Circulatory system_ Laplace law. Ohms law.reynaults law,baro-chemo-receptors-...
Circulatory system_ Laplace law. Ohms law.reynaults law,baro-chemo-receptors-...
 
Unveiling the Energy Potential of Marshmallow Deposits.pdf
Unveiling the Energy Potential of Marshmallow Deposits.pdfUnveiling the Energy Potential of Marshmallow Deposits.pdf
Unveiling the Energy Potential of Marshmallow Deposits.pdf
 
in vitro propagation of plants lecture note.pptx
in vitro propagation of plants lecture note.pptxin vitro propagation of plants lecture note.pptx
in vitro propagation of plants lecture note.pptx
 
erythropoiesis-I_mechanism& clinical significance.pptx
erythropoiesis-I_mechanism& clinical significance.pptxerythropoiesis-I_mechanism& clinical significance.pptx
erythropoiesis-I_mechanism& clinical significance.pptx
 
Body fluids_tonicity_dehydration_hypovolemia_hypervolemia.pptx
Body fluids_tonicity_dehydration_hypovolemia_hypervolemia.pptxBody fluids_tonicity_dehydration_hypovolemia_hypervolemia.pptx
Body fluids_tonicity_dehydration_hypovolemia_hypervolemia.pptx
 
Structural Classification Of Protein (SCOP)
Structural Classification Of Protein  (SCOP)Structural Classification Of Protein  (SCOP)
Structural Classification Of Protein (SCOP)
 
filosofia boliviana introducción jsjdjd.pptx
filosofia boliviana introducción jsjdjd.pptxfilosofia boliviana introducción jsjdjd.pptx
filosofia boliviana introducción jsjdjd.pptx
 
Citrus Greening Disease and its Management
Citrus Greening Disease and its ManagementCitrus Greening Disease and its Management
Citrus Greening Disease and its Management
 
THE IMPORTANCE OF MARTIAN ATMOSPHERE SAMPLE RETURN.
THE IMPORTANCE OF MARTIAN ATMOSPHERE SAMPLE RETURN.THE IMPORTANCE OF MARTIAN ATMOSPHERE SAMPLE RETURN.
THE IMPORTANCE OF MARTIAN ATMOSPHERE SAMPLE RETURN.
 
platelets_clotting_biogenesis.clot retractionpptx
platelets_clotting_biogenesis.clot retractionpptxplatelets_clotting_biogenesis.clot retractionpptx
platelets_clotting_biogenesis.clot retractionpptx
 
role of pramana in research.pptx in science
role of pramana in research.pptx in sciencerole of pramana in research.pptx in science
role of pramana in research.pptx in science
 
Nucleic Acid-its structural and functional complexity.
Nucleic Acid-its structural and functional complexity.Nucleic Acid-its structural and functional complexity.
Nucleic Acid-its structural and functional complexity.
 
Leaf Initiation, Growth and Differentiation.pdf
Leaf Initiation, Growth and Differentiation.pdfLeaf Initiation, Growth and Differentiation.pdf
Leaf Initiation, Growth and Differentiation.pdf
 
Observation of Io’s Resurfacing via Plume Deposition Using Ground-based Adapt...
Observation of Io’s Resurfacing via Plume Deposition Using Ground-based Adapt...Observation of Io’s Resurfacing via Plume Deposition Using Ground-based Adapt...
Observation of Io’s Resurfacing via Plume Deposition Using Ground-based Adapt...
 

Hmp 201512

  • 1. Statistical and Visualization Methods for Metagenomic Analysis Héctor Corrada Bravo Center for Bioinformatics and Computational Biology
  • 2. • metagenomeSeq – 16S differential abundance – R/Bioconductor infrastructure for metagenomic assays – Longitudinal data • metagenomicFeatures – Incipient attempt regularizing 16S feature annotations in R/Bioconductor – E.g., greengenes13.5MgDb • msd16s – Example data, as infrastructure object
  • 3. R/Bioconductor Strengths • Infrastructure objects – Interoperability, speed up startup time for method development • Strict development practices – Documentation, use cases, vignettes • Annotation infrastructure – Again, interoperability across experiments and data types • Exploratory analysis • Reproducibility – Vignettes, Rmarkdown, etc. • Recently, exploratory and interactive visualization – Shiny, epiviz
  • 4. Integrative, visual and computational exploratory analysis of genomic data • Browser-based • Interactive • Integration of data • Reproducible dissemination • Communication with R/Bioconductor: epivizr package software systems to support creative exploratory analysis of large genome-wide datasets...
  • 5. • Computed Measurements: create new measurements from integrated measurements and visualize
  • 6. • Summarization: summarize integrated measurements (computed on data subsets)
  • 7. Dynamically extensible: Easily integrate new data sources, data types and add new visualizations. Data providers define coordinate space
  • 8. One interpretation of Big Data is many sources of relevant contextual data • Easily access/integrate contextual data • Driven by exploratory analysis of immediate data • Iterative process • Visual and computational exploration go hand in hand
  • 9. Visualization design goals Context • Integrate and align multiple data sources; navigate; search • Connect: brushing • Encode: map visualization properties to data on the fly • Reconfigure: multiple views of the same data
  • 10. Visualization design goals Data • Select and filter: tight-knit integration with R/Bioconductor • (current work) filters on visualization propagate to data environment Model • New 'measurements' the result of modeling; suggested by data context
  • 11. Metagenomic Visualization • How to effectively navigate large datasets where features are organized hierarchically? • Metaviz: browser-based, interactive exploratory analysis of metagenomic data • Connection to R/Bioconductor with metavizr package • Built on metagenomeSeq and metagenomeFeatures infrastructure
  • 12.
  • 13.
  • 14.
  • 15. Metaviz • Exploration of hierarchically organized features • Geared towards 16S for now – Hierarchical organization relevant to WGS • Integration is a big part of design – Framework designed for data integration
  • 16. Acknowledgements Brianna Lindsey, O. Colin Stine, Owen White, Anup Mahurkar: University of Maryland Baltimore Jim Nataro: University of Virginia NIGMS, Genentech Florin Chelaru (now @ MIT) Joseph Paulson (now @ Harvard) Mihai Pop (@ UMD)