SlideShare a Scribd company logo
Rphenoscape:

Connecting the semantics of
evolutionary morphology to
comparative phylogenetics
Hilmar Lapp (Duke University)
Hong Xu (Duke University)
Jim Balhoff (RTI, Inc.)
Evolution Meetings 2016, Austin, TX
RPhenoscape
• A package for accessing the
Phenoscape Knowledgebase from
within R programs
• Programmatic access to:
• Evolutionary character data with
computable semantics
• Machine-reasoning with
computable phenotype data
R features a rich ecosystem
for comparative phylogenetics
CRAN Task View on
Phylogenetics and
Comparative Methods at last
count lists 76 packages.
Comparative analysis needs comparative data
Magee et al (2014), PLOS ONE
See also Drew et al (2013), PLOS Biology; Stoltzfus et al (2012), BMC Research Notes
The lack of reusable digital data
is amplified for morphology
Which matrix are we criticizing?
RC07 published their character list (their appendix 2) and their
matrix (appendix 3). We also have a hitherto unpublished
NEXUS file (presented here as part of Data S3), most likely
sent by [M.R.] to [D.G.] in late 2007 or early 2008, which
purports to contain the same matrix. Surprisingly, the
character list in the paper and that in the file do not agree
on the identities of characters 132–134.
Marjanović and Laurin (2015) Reevaluation of the largest published morphological data matrix for
phylogenetic analysis of Paleozoic limbed vertebrates. PeerJ Preprints 3:e1596v1
Morphology is more
complex than discrete,
disjoint, independent
We know more
about morphology
than authors state
Implied knowledge can be
substantial
Asserted
Inferred
Missing
Digit presence/absence; Sarcopterygii
kb.phenoscape.org
Make	morphology	
computable,	discoverable,	&	
linked	to	genetic	data
9
Phenoscape	Knowledgebase
❖ 4,399 taxa (vertebrates)
❖ 139 publications
(matrices)
❖ 19,024 character states
❖ 651,660 phenotype
annotations
Morphological
matrices
Annotation
Ontologies
anatomy
quality
taxonomy
Phenex software
(Balhoff et al., 2010)
Phenoscape
Knowledgebase
Machine reasoner
(OWL)
KB Interface for humans
KB Interface for machines
KB Interface for machines
RPhenoscape
An R package for API access to the Phenoscape
Knowledgebase
• Evolutionary character data with computable
semantics
• Machine-reasoning with computable
phenotype data:
• Synthetic supermatrix synthesis
• Semantics-based character and state filtering
• Semantic similarity-driven querying and
synthesis
Use-case: Querying studies by
morphology and taxonomy
> slist <- pk_get_study_list(taxon = "Ictaluridae",
entity = "pectoral fin")
> slist[,"label"]
Source: local data frame [10 x 1]
label
<chr>
1 Bockmann, F. A. (1998)
2 Chen, X. (1994)
3 De Pinna, M. C., Ferraris, C. J. J., & Vari, R. P. (2007)
4 Fink, S. V, & Fink, W. L. (1981); Fink, S. V, & Fink, W. L. (1996)
5 Kailola, P. J. (2004)
6 Lundberg, J. G. (1992)
7 Mo, T. (1991)
8 Royero, R. (1999)
9 Vigliotta, T. R. (2008)
10 Wiley, E.O., and Johnson, G.D. (2010)
>
Use-case: Querying studies by
morphology and taxonomy
> nex_list <- pk_get_study_xml(as.matrix
(slist[2:3,c("id")]))
....This might take a while....
https://scholar.google.com/scholar?q=hylogenetic+studies+of+the
+amblycipitid+catfishes+%28Teleostei%2C+Siluriformes%29+with
+species+accounts&btnG=&hl=en&as_sdt=0%2C42
Parse NeXML....
http://dx.doi.org/10.1111/j.1096-3642.2007.00306.x
Parse NeXML....
> nex_list[[1]]
A nexml object representing:
0 phylogenetic tree blocks, where:
block 1 contains NULL phylogenetic trees
block 0 contains phylogenetic trees
155 meta elements
1 character matrices
53 taxonomic units
Taxa: Pseudobagarius leucorhynchus, Liobagrus obesus,
Hypsidoris farsonensis, Erethistes sp. (Chen 1994), Bunocephalus
amaurus, Xyliphius sp. (Chen 1994) ...
Heavy-lifting of
NeXML parsing
is done by
RNeXML
Use-case: Synthesize presence-
absence matrix
> nex <- pk_get_ontotrace_xml(taxon = c("Ictalurus",
"Ameiurus"), entity = "fin spine")
> m <- pk_get_ontotrace(nex)
> m[1:10,]
## Source: local data frame [15 x 5]
##
## taxa otu
## (chr) (chr)
## 1 Ameiurus brunneus VTO_0036273
## 2 Ameiurus catus VTO_0036275
## 3 Ameiurus melas VTO_0036272
## 4 Ameiurus natalis VTO_0036274
## 5 Ameiurus nebulosus VTO_0036278
## 6 Ameiurus platycephalus VTO_0036276
## 7 Ameiurus serracanthus VTO_0036277
## 8 Ictalurus australis VTO_0061495
## 9 Ictalurus balsanus VTO_0036221
## 10 Ictalurus dugesii VTO_0061497
## Variables not shown: otus (chr), anterior dentation of pectoral
fin spine
## (int), anterior distal serration of pectoral fin spine (dbl)
Use-case: Filter matrix using
semantics
> is_desc <- pk_is_descendant('Ictalurus', m$taxa)
## [1] FALSE FALSE FALSE FALSE FALSE FALSE FALSE TRUE TRUE TRUE TRUE
## [12] TRUE TRUE TRUE TRUE
# pk_is_ancestor() also available (and pk_is_extinct())
#
# This is in development for characters, too:
# pk_is_descendant(‘jaw skeleton', m$chars,
# relationships = ‘part of’)
Current limitations
Package is not on CRAN (yet), need to install from
Github:





Data in Phenoscape concentrated on vertebrates, and
skeletal fin-limb characters.
Semantics-driven matrix synthesis currently limited to
presence-absence characters.
library(“devtools”)

install_github(“xu-hong/rphenoscape”)
Summary
Phenoscape KB has an API for machine access to
computable morphology data and computational
semantics services.
RPhenoscape is a bridge between this API and the
ecosystem of comparative phylogenetics packages in R.
Translates between R user (who uses labels, data
matrices) and Phenoscape KB API (which uses identifiers,
ontology terms, NeXML, etc).
Code on Github: https://github.com/xu-hong/rphenoscape
Reproducible data integration:
The rOpenSci ecosystem
Acknowledgements
U.S. National Science Foundation
DBI-1062404, DBI-1062542
National Evolutionary Synthesis Center
(NESCent), NSF #EF-0905606
Phenoscape contributors, Advisory Board, Data
sources (see: http://phenoscape.org/wiki/
Acknowledgments)
RNeXML developers (C. Boettiger, S. Chamberlain)

http://github.com/rOpenSci/RNeXML
Get in touch
Repo: github.com/xu-hong/rphenoscape
Github: github.com/hlapp
Twitter: @hlapp

More Related Content

What's hot

P7 2018 biopython3
P7 2018 biopython3P7 2018 biopython3
P7 2018 biopython3
Prof. Wim Van Criekinge
 
Molecular scaffolds poster
Molecular scaffolds posterMolecular scaffolds poster
Molecular scaffolds poster
Jeremy Yang
 
Molecular scaffolds are special and useful guides to discovery
Molecular scaffolds are special and useful guides to discoveryMolecular scaffolds are special and useful guides to discovery
Molecular scaffolds are special and useful guides to discovery
Jeremy Yang
 
Tony Rees IRMNG 2015 presentation
Tony Rees IRMNG 2015 presentationTony Rees IRMNG 2015 presentation
Tony Rees IRMNG 2015 presentation
Tony Rees
 
Java Introductie
Java IntroductieJava Introductie
Java Introductiembruggen
 
Andrew Polaszek - ZooBank: ICZN’s open-access web-based register of all new a...
Andrew Polaszek - ZooBank: ICZN’s open-access web-based register of all new a...Andrew Polaszek - ZooBank: ICZN’s open-access web-based register of all new a...
Andrew Polaszek - ZooBank: ICZN’s open-access web-based register of all new a...ICZN
 
10 years of global biodiversity databases: are we there yet?
10 years of global biodiversity databases: are we there yet?10 years of global biodiversity databases: are we there yet?
10 years of global biodiversity databases: are we there yet?
Tony Rees
 
Nomenclature for the Future: The power and challenges for stable and sensible...
Nomenclature for the Future: The power and challenges for stable and sensible...Nomenclature for the Future: The power and challenges for stable and sensible...
Nomenclature for the Future: The power and challenges for stable and sensible...ICZN
 
Getting Started with the Hymenoptera Anatomical Ontology
Getting Started with the Hymenoptera Anatomical OntologyGetting Started with the Hymenoptera Anatomical Ontology
Getting Started with the Hymenoptera Anatomical Ontology
Katja C. Seltmann
 
Two graphs, three responses
Two graphs, three responsesTwo graphs, three responses
Two graphs, three responses
Roderic Page
 
Biodiversity Informatics of the Cyperaceae: Where we stand and where we’re he...
Biodiversity Informatics of the Cyperaceae: Where we stand and where we’re he...Biodiversity Informatics of the Cyperaceae: Where we stand and where we’re he...
Biodiversity Informatics of the Cyperaceae: Where we stand and where we’re he...Edward Baker
 
2016 bioinformatics i_io_wim_vancriekinge
2016 bioinformatics i_io_wim_vancriekinge2016 bioinformatics i_io_wim_vancriekinge
2016 bioinformatics i_io_wim_vancriekinge
Prof. Wim Van Criekinge
 
Franz 2015 SPNHC Taxonomic concept resolution for voucher-based biodiversity ...
Franz 2015 SPNHC Taxonomic concept resolution for voucher-based biodiversity ...Franz 2015 SPNHC Taxonomic concept resolution for voucher-based biodiversity ...
Franz 2015 SPNHC Taxonomic concept resolution for voucher-based biodiversity ...
taxonbytes
 
Alignment of Ontology Design Patterns: Class As Property Value, Value Partiti...
Alignment of Ontology Design Patterns: Class As Property Value, Value Partiti...Alignment of Ontology Design Patterns: Class As Property Value, Value Partiti...
Alignment of Ontology Design Patterns: Class As Property Value, Value Partiti...
Bene Rodriguez
 
2008 11 13 Hcls Call
2008 11 13 Hcls Call2008 11 13 Hcls Call
2008 11 13 Hcls Call
Jun Zhao
 
Franz 2014 ESA Aligning Insect Phylogenies Perelleschus and Other Cases
Franz 2014 ESA Aligning Insect Phylogenies Perelleschus and Other CasesFranz 2014 ESA Aligning Insect Phylogenies Perelleschus and Other Cases
Franz 2014 ESA Aligning Insect Phylogenies Perelleschus and Other Cases
taxonbytes
 
Michel digital nomenclature-gna-zoobank-2014-co-namesconfv2
Michel digital nomenclature-gna-zoobank-2014-co-namesconfv2Michel digital nomenclature-gna-zoobank-2014-co-namesconfv2
Michel digital nomenclature-gna-zoobank-2014-co-namesconfv2
Ellinor Michel
 
Biodiversity Informatics Course Presentation
Biodiversity Informatics Course PresentationBiodiversity Informatics Course Presentation
Biodiversity Informatics Course Presentation
Roderic Page
 
Experiences in the biosciences with the open biological ontologies foundry an...
Experiences in the biosciences with the open biological ontologies foundry an...Experiences in the biosciences with the open biological ontologies foundry an...
Experiences in the biosciences with the open biological ontologies foundry an...
Chris Mungall
 
BioSamples Database Linked Data, SWAT4LS Tutorial
BioSamples Database Linked Data, SWAT4LS TutorialBioSamples Database Linked Data, SWAT4LS Tutorial
BioSamples Database Linked Data, SWAT4LS Tutorial
Rothamsted Research, UK
 

What's hot (20)

P7 2018 biopython3
P7 2018 biopython3P7 2018 biopython3
P7 2018 biopython3
 
Molecular scaffolds poster
Molecular scaffolds posterMolecular scaffolds poster
Molecular scaffolds poster
 
Molecular scaffolds are special and useful guides to discovery
Molecular scaffolds are special and useful guides to discoveryMolecular scaffolds are special and useful guides to discovery
Molecular scaffolds are special and useful guides to discovery
 
Tony Rees IRMNG 2015 presentation
Tony Rees IRMNG 2015 presentationTony Rees IRMNG 2015 presentation
Tony Rees IRMNG 2015 presentation
 
Java Introductie
Java IntroductieJava Introductie
Java Introductie
 
Andrew Polaszek - ZooBank: ICZN’s open-access web-based register of all new a...
Andrew Polaszek - ZooBank: ICZN’s open-access web-based register of all new a...Andrew Polaszek - ZooBank: ICZN’s open-access web-based register of all new a...
Andrew Polaszek - ZooBank: ICZN’s open-access web-based register of all new a...
 
10 years of global biodiversity databases: are we there yet?
10 years of global biodiversity databases: are we there yet?10 years of global biodiversity databases: are we there yet?
10 years of global biodiversity databases: are we there yet?
 
Nomenclature for the Future: The power and challenges for stable and sensible...
Nomenclature for the Future: The power and challenges for stable and sensible...Nomenclature for the Future: The power and challenges for stable and sensible...
Nomenclature for the Future: The power and challenges for stable and sensible...
 
Getting Started with the Hymenoptera Anatomical Ontology
Getting Started with the Hymenoptera Anatomical OntologyGetting Started with the Hymenoptera Anatomical Ontology
Getting Started with the Hymenoptera Anatomical Ontology
 
Two graphs, three responses
Two graphs, three responsesTwo graphs, three responses
Two graphs, three responses
 
Biodiversity Informatics of the Cyperaceae: Where we stand and where we’re he...
Biodiversity Informatics of the Cyperaceae: Where we stand and where we’re he...Biodiversity Informatics of the Cyperaceae: Where we stand and where we’re he...
Biodiversity Informatics of the Cyperaceae: Where we stand and where we’re he...
 
2016 bioinformatics i_io_wim_vancriekinge
2016 bioinformatics i_io_wim_vancriekinge2016 bioinformatics i_io_wim_vancriekinge
2016 bioinformatics i_io_wim_vancriekinge
 
Franz 2015 SPNHC Taxonomic concept resolution for voucher-based biodiversity ...
Franz 2015 SPNHC Taxonomic concept resolution for voucher-based biodiversity ...Franz 2015 SPNHC Taxonomic concept resolution for voucher-based biodiversity ...
Franz 2015 SPNHC Taxonomic concept resolution for voucher-based biodiversity ...
 
Alignment of Ontology Design Patterns: Class As Property Value, Value Partiti...
Alignment of Ontology Design Patterns: Class As Property Value, Value Partiti...Alignment of Ontology Design Patterns: Class As Property Value, Value Partiti...
Alignment of Ontology Design Patterns: Class As Property Value, Value Partiti...
 
2008 11 13 Hcls Call
2008 11 13 Hcls Call2008 11 13 Hcls Call
2008 11 13 Hcls Call
 
Franz 2014 ESA Aligning Insect Phylogenies Perelleschus and Other Cases
Franz 2014 ESA Aligning Insect Phylogenies Perelleschus and Other CasesFranz 2014 ESA Aligning Insect Phylogenies Perelleschus and Other Cases
Franz 2014 ESA Aligning Insect Phylogenies Perelleschus and Other Cases
 
Michel digital nomenclature-gna-zoobank-2014-co-namesconfv2
Michel digital nomenclature-gna-zoobank-2014-co-namesconfv2Michel digital nomenclature-gna-zoobank-2014-co-namesconfv2
Michel digital nomenclature-gna-zoobank-2014-co-namesconfv2
 
Biodiversity Informatics Course Presentation
Biodiversity Informatics Course PresentationBiodiversity Informatics Course Presentation
Biodiversity Informatics Course Presentation
 
Experiences in the biosciences with the open biological ontologies foundry an...
Experiences in the biosciences with the open biological ontologies foundry an...Experiences in the biosciences with the open biological ontologies foundry an...
Experiences in the biosciences with the open biological ontologies foundry an...
 
BioSamples Database Linked Data, SWAT4LS Tutorial
BioSamples Database Linked Data, SWAT4LS TutorialBioSamples Database Linked Data, SWAT4LS Tutorial
BioSamples Database Linked Data, SWAT4LS Tutorial
 

Viewers also liked

PhyloCommons: Sharing, annotating, and reusing Phylogenies
PhyloCommons: Sharing, annotating, and reusing PhylogeniesPhyloCommons: Sharing, annotating, and reusing Phylogenies
PhyloCommons: Sharing, annotating, and reusing Phylogenies
Hilmar Lapp
 
BioSQL Reloaded: v1.0 Release, PhyloDB Module, and Future Features
BioSQL Reloaded: v1.0 Release, PhyloDB Module, and Future FeaturesBioSQL Reloaded: v1.0 Release, PhyloDB Module, and Future Features
BioSQL Reloaded: v1.0 Release, PhyloDB Module, and Future FeaturesHilmar Lapp
 
OBF Address at BOSC 2013
OBF Address at BOSC 2013OBF Address at BOSC 2013
OBF Address at BOSC 2013
Hilmar Lapp
 
OOW2008 in China
OOW2008 in ChinaOOW2008 in China
OOW2008 in China
James Hu
 
Integrating data with phylogenies, at scale
Integrating data with phylogenies, at scaleIntegrating data with phylogenies, at scale
Integrating data with phylogenies, at scale
Hilmar Lapp
 
Global Sourcing Updates 2010
Global Sourcing Updates 2010Global Sourcing Updates 2010
Global Sourcing Updates 2010
pagnew
 

Viewers also liked (6)

PhyloCommons: Sharing, annotating, and reusing Phylogenies
PhyloCommons: Sharing, annotating, and reusing PhylogeniesPhyloCommons: Sharing, annotating, and reusing Phylogenies
PhyloCommons: Sharing, annotating, and reusing Phylogenies
 
BioSQL Reloaded: v1.0 Release, PhyloDB Module, and Future Features
BioSQL Reloaded: v1.0 Release, PhyloDB Module, and Future FeaturesBioSQL Reloaded: v1.0 Release, PhyloDB Module, and Future Features
BioSQL Reloaded: v1.0 Release, PhyloDB Module, and Future Features
 
OBF Address at BOSC 2013
OBF Address at BOSC 2013OBF Address at BOSC 2013
OBF Address at BOSC 2013
 
OOW2008 in China
OOW2008 in ChinaOOW2008 in China
OOW2008 in China
 
Integrating data with phylogenies, at scale
Integrating data with phylogenies, at scaleIntegrating data with phylogenies, at scale
Integrating data with phylogenies, at scale
 
Global Sourcing Updates 2010
Global Sourcing Updates 2010Global Sourcing Updates 2010
Global Sourcing Updates 2010
 

Similar to Rphenoscape: 
Connecting the semantics of evolutionary morphology to comparative phylogenetics

Welch Wordifier Bosc2009
Welch Wordifier Bosc2009Welch Wordifier Bosc2009
Welch Wordifier Bosc2009bosc
 
Expressed sequence tag (EST), molecular marker
Expressed sequence tag (EST), molecular markerExpressed sequence tag (EST), molecular marker
Expressed sequence tag (EST), molecular marker
KAUSHAL SAHU
 
Semantic tools for aggregation of morphological characters across studies
Semantic tools for aggregation of morphological characters across studiesSemantic tools for aggregation of morphological characters across studies
Semantic tools for aggregation of morphological characters across studies
balhoff
 
CRISPR PROJECT.pptx
CRISPR PROJECT.pptxCRISPR PROJECT.pptx
CRISPR PROJECT.pptx
AcSni
 
R Analytics in the Cloud
R Analytics in the CloudR Analytics in the Cloud
R Analytics in the CloudDataMine Lab
 
Semantics of and for the diversity of life:
 Opportunities and perils of tryi...
Semantics of and for the diversity of life:
 Opportunities and perils of tryi...Semantics of and for the diversity of life:
 Opportunities and perils of tryi...
Semantics of and for the diversity of life:
 Opportunities and perils of tryi...
Hilmar Lapp
 
2012 03 01_bioinformatics_ii_les1
2012 03 01_bioinformatics_ii_les12012 03 01_bioinformatics_ii_les1
2012 03 01_bioinformatics_ii_les1
Prof. Wim Van Criekinge
 
Interactive Analysis of Large-Scale Sequencing Genomics Data Sets using a Rea...
Interactive Analysis of Large-Scale Sequencing Genomics Data Sets using a Rea...Interactive Analysis of Large-Scale Sequencing Genomics Data Sets using a Rea...
Interactive Analysis of Large-Scale Sequencing Genomics Data Sets using a Rea...
Dominic Suciu
 
Thesis def
Thesis defThesis def
Thesis def
Jay Vyas
 
Finding Needles in Genomic Haystacks with “Wide” Random Forest: Spark Summit ...
Finding Needles in Genomic Haystacks with “Wide” Random Forest: Spark Summit ...Finding Needles in Genomic Haystacks with “Wide” Random Forest: Spark Summit ...
Finding Needles in Genomic Haystacks with “Wide” Random Forest: Spark Summit ...
Spark Summit
 
Crispr handbook 2015
Crispr handbook 2015Crispr handbook 2015
Crispr handbook 2015
rochonf
 
Multi-omics infrastructure and data for R/Bioconductor
Multi-omics infrastructure and data for R/BioconductorMulti-omics infrastructure and data for R/Bioconductor
Multi-omics infrastructure and data for R/Bioconductor
Levi Waldron
 
Comparative genomics to the rescue: How complete is your plant genome sequence?
Comparative genomics to the rescue: How complete is your plant genome sequence?Comparative genomics to the rescue: How complete is your plant genome sequence?
Comparative genomics to the rescue: How complete is your plant genome sequence?
Klaas Vandepoele
 
2011linked science4mccuskermcguinnessfinal
2011linked science4mccuskermcguinnessfinal2011linked science4mccuskermcguinnessfinal
2011linked science4mccuskermcguinnessfinal
Deborah McGuinness
 
Phylogenetics Analysis in R
Phylogenetics Analysis in RPhylogenetics Analysis in R
Phylogenetics Analysis in RKlaus Schliep
 
2 md2016 annotation
2 md2016 annotation2 md2016 annotation
2 md2016 annotation
Scott Dawson
 
Poster genome engineering & Synthetic Biology 2016
Poster genome engineering & Synthetic Biology 2016Poster genome engineering & Synthetic Biology 2016
Poster genome engineering & Synthetic Biology 2016
Michiel Stock
 
Comparing the Amount and Quality of Information from Different Sequencing Str...
Comparing the Amount and Quality of Information from Different Sequencing Str...Comparing the Amount and Quality of Information from Different Sequencing Str...
Comparing the Amount and Quality of Information from Different Sequencing Str...
jembrown
 
Araport Data Integration - 2015 UMD Minisymposium
Araport Data Integration - 2015 UMD MinisymposiumAraport Data Integration - 2015 UMD Minisymposium
Araport Data Integration - 2015 UMD Minisymposium
Vivek Krishnakumar
 

Similar to Rphenoscape: 
Connecting the semantics of evolutionary morphology to comparative phylogenetics (20)

Welch Wordifier Bosc2009
Welch Wordifier Bosc2009Welch Wordifier Bosc2009
Welch Wordifier Bosc2009
 
Expressed sequence tag (EST), molecular marker
Expressed sequence tag (EST), molecular markerExpressed sequence tag (EST), molecular marker
Expressed sequence tag (EST), molecular marker
 
Semantic tools for aggregation of morphological characters across studies
Semantic tools for aggregation of morphological characters across studiesSemantic tools for aggregation of morphological characters across studies
Semantic tools for aggregation of morphological characters across studies
 
CRISPR PROJECT.pptx
CRISPR PROJECT.pptxCRISPR PROJECT.pptx
CRISPR PROJECT.pptx
 
R Analytics in the Cloud
R Analytics in the CloudR Analytics in the Cloud
R Analytics in the Cloud
 
Semantics of and for the diversity of life:
 Opportunities and perils of tryi...
Semantics of and for the diversity of life:
 Opportunities and perils of tryi...Semantics of and for the diversity of life:
 Opportunities and perils of tryi...
Semantics of and for the diversity of life:
 Opportunities and perils of tryi...
 
2012 03 01_bioinformatics_ii_les1
2012 03 01_bioinformatics_ii_les12012 03 01_bioinformatics_ii_les1
2012 03 01_bioinformatics_ii_les1
 
Interactive Analysis of Large-Scale Sequencing Genomics Data Sets using a Rea...
Interactive Analysis of Large-Scale Sequencing Genomics Data Sets using a Rea...Interactive Analysis of Large-Scale Sequencing Genomics Data Sets using a Rea...
Interactive Analysis of Large-Scale Sequencing Genomics Data Sets using a Rea...
 
Thesis def
Thesis defThesis def
Thesis def
 
Satya Sahoo Thesis Defense
Satya Sahoo Thesis DefenseSatya Sahoo Thesis Defense
Satya Sahoo Thesis Defense
 
Finding Needles in Genomic Haystacks with “Wide” Random Forest: Spark Summit ...
Finding Needles in Genomic Haystacks with “Wide” Random Forest: Spark Summit ...Finding Needles in Genomic Haystacks with “Wide” Random Forest: Spark Summit ...
Finding Needles in Genomic Haystacks with “Wide” Random Forest: Spark Summit ...
 
Crispr handbook 2015
Crispr handbook 2015Crispr handbook 2015
Crispr handbook 2015
 
Multi-omics infrastructure and data for R/Bioconductor
Multi-omics infrastructure and data for R/BioconductorMulti-omics infrastructure and data for R/Bioconductor
Multi-omics infrastructure and data for R/Bioconductor
 
Comparative genomics to the rescue: How complete is your plant genome sequence?
Comparative genomics to the rescue: How complete is your plant genome sequence?Comparative genomics to the rescue: How complete is your plant genome sequence?
Comparative genomics to the rescue: How complete is your plant genome sequence?
 
2011linked science4mccuskermcguinnessfinal
2011linked science4mccuskermcguinnessfinal2011linked science4mccuskermcguinnessfinal
2011linked science4mccuskermcguinnessfinal
 
Phylogenetics Analysis in R
Phylogenetics Analysis in RPhylogenetics Analysis in R
Phylogenetics Analysis in R
 
2 md2016 annotation
2 md2016 annotation2 md2016 annotation
2 md2016 annotation
 
Poster genome engineering & Synthetic Biology 2016
Poster genome engineering & Synthetic Biology 2016Poster genome engineering & Synthetic Biology 2016
Poster genome engineering & Synthetic Biology 2016
 
Comparing the Amount and Quality of Information from Different Sequencing Str...
Comparing the Amount and Quality of Information from Different Sequencing Str...Comparing the Amount and Quality of Information from Different Sequencing Str...
Comparing the Amount and Quality of Information from Different Sequencing Str...
 
Araport Data Integration - 2015 UMD Minisymposium
Araport Data Integration - 2015 UMD MinisymposiumAraport Data Integration - 2015 UMD Minisymposium
Araport Data Integration - 2015 UMD Minisymposium
 

More from Hilmar Lapp

Of Trees and Owl: 
The challenges of reasoning over the semantics of shared d...
Of Trees and Owl: 
The challenges of reasoning over the semantics of shared d...Of Trees and Owl: 
The challenges of reasoning over the semantics of shared d...
Of Trees and Owl: 
The challenges of reasoning over the semantics of shared d...
Hilmar Lapp
 
Open Bioinformatics Foundation: 2014 Update & Some Introspection
Open Bioinformatics Foundation: 2014 Update & Some IntrospectionOpen Bioinformatics Foundation: 2014 Update & Some Introspection
Open Bioinformatics Foundation: 2014 Update & Some Introspection
Hilmar Lapp
 
Reproducible Science - Panel at iEvoBio 2014
Reproducible Science - Panel at iEvoBio 2014 Reproducible Science - Panel at iEvoBio 2014
Reproducible Science - Panel at iEvoBio 2014
Hilmar Lapp
 
The Dryad Digital Repository: Published data as part of the greater data ecos...
The Dryad Digital Repository: Published data as part of the greater data ecos...The Dryad Digital Repository: Published data as part of the greater data ecos...
The Dryad Digital Repository: Published data as part of the greater data ecos...
Hilmar Lapp
 
The MIAPA ontology: An annotation ontology for validating minimum metadata re...
The MIAPA ontology: An annotation ontology for validating minimum metadata re...The MIAPA ontology: An annotation ontology for validating minimum metadata re...
The MIAPA ontology: An annotation ontology for validating minimum metadata re...Hilmar Lapp
 
The blessing and the curse: handshaking between general and specialist data r...
The blessing and the curse: handshaking between general and specialist data r...The blessing and the curse: handshaking between general and specialist data r...
The blessing and the curse: handshaking between general and specialist data r...
Hilmar Lapp
 
Bringing reason to phenotype diversity, character change, and common descent
Bringing reason to phenotype diversity, character change, and common descentBringing reason to phenotype diversity, character change, and common descent
Bringing reason to phenotype diversity, character change, and common descent
Hilmar Lapp
 
Phyloinformatics VoCamp
Phyloinformatics VoCampPhyloinformatics VoCamp
Phyloinformatics VoCamp
Hilmar Lapp
 
Reasoning over phenotype diversity, character change, and evolutionary descent
Reasoning over phenotype diversity, character change, and evolutionary descentReasoning over phenotype diversity, character change, and evolutionary descent
Reasoning over phenotype diversity, character change, and evolutionary descent
Hilmar Lapp
 
Open science, open-source, and open data: Collaboration as an emergent property?
Open science, open-source, and open data: Collaboration as an emergent property?Open science, open-source, and open data: Collaboration as an emergent property?
Open science, open-source, and open data: Collaboration as an emergent property?
Hilmar Lapp
 
Liberating Our Beautiful Trees: A Call to Arms.
Liberating Our Beautiful Trees: A Call to Arms.Liberating Our Beautiful Trees: A Call to Arms.
Liberating Our Beautiful Trees: A Call to Arms.
Hilmar Lapp
 
OBF Address at BOSC 2012
OBF Address at BOSC 2012OBF Address at BOSC 2012
OBF Address at BOSC 2012
Hilmar Lapp
 
Towards a Simple, Standards-Compliant, and Generic Phylogenetic Database
Towards a Simple, Standards-Compliant, and Generic Phylogenetic DatabaseTowards a Simple, Standards-Compliant, and Generic Phylogenetic Database
Towards a Simple, Standards-Compliant, and Generic Phylogenetic DatabaseHilmar Lapp
 
Lapp, ISCB Software Sharing Symposium
Lapp, ISCB Software Sharing SymposiumLapp, ISCB Software Sharing Symposium
Lapp, ISCB Software Sharing SymposiumHilmar Lapp
 

More from Hilmar Lapp (14)

Of Trees and Owl: 
The challenges of reasoning over the semantics of shared d...
Of Trees and Owl: 
The challenges of reasoning over the semantics of shared d...Of Trees and Owl: 
The challenges of reasoning over the semantics of shared d...
Of Trees and Owl: 
The challenges of reasoning over the semantics of shared d...
 
Open Bioinformatics Foundation: 2014 Update & Some Introspection
Open Bioinformatics Foundation: 2014 Update & Some IntrospectionOpen Bioinformatics Foundation: 2014 Update & Some Introspection
Open Bioinformatics Foundation: 2014 Update & Some Introspection
 
Reproducible Science - Panel at iEvoBio 2014
Reproducible Science - Panel at iEvoBio 2014 Reproducible Science - Panel at iEvoBio 2014
Reproducible Science - Panel at iEvoBio 2014
 
The Dryad Digital Repository: Published data as part of the greater data ecos...
The Dryad Digital Repository: Published data as part of the greater data ecos...The Dryad Digital Repository: Published data as part of the greater data ecos...
The Dryad Digital Repository: Published data as part of the greater data ecos...
 
The MIAPA ontology: An annotation ontology for validating minimum metadata re...
The MIAPA ontology: An annotation ontology for validating minimum metadata re...The MIAPA ontology: An annotation ontology for validating minimum metadata re...
The MIAPA ontology: An annotation ontology for validating minimum metadata re...
 
The blessing and the curse: handshaking between general and specialist data r...
The blessing and the curse: handshaking between general and specialist data r...The blessing and the curse: handshaking between general and specialist data r...
The blessing and the curse: handshaking between general and specialist data r...
 
Bringing reason to phenotype diversity, character change, and common descent
Bringing reason to phenotype diversity, character change, and common descentBringing reason to phenotype diversity, character change, and common descent
Bringing reason to phenotype diversity, character change, and common descent
 
Phyloinformatics VoCamp
Phyloinformatics VoCampPhyloinformatics VoCamp
Phyloinformatics VoCamp
 
Reasoning over phenotype diversity, character change, and evolutionary descent
Reasoning over phenotype diversity, character change, and evolutionary descentReasoning over phenotype diversity, character change, and evolutionary descent
Reasoning over phenotype diversity, character change, and evolutionary descent
 
Open science, open-source, and open data: Collaboration as an emergent property?
Open science, open-source, and open data: Collaboration as an emergent property?Open science, open-source, and open data: Collaboration as an emergent property?
Open science, open-source, and open data: Collaboration as an emergent property?
 
Liberating Our Beautiful Trees: A Call to Arms.
Liberating Our Beautiful Trees: A Call to Arms.Liberating Our Beautiful Trees: A Call to Arms.
Liberating Our Beautiful Trees: A Call to Arms.
 
OBF Address at BOSC 2012
OBF Address at BOSC 2012OBF Address at BOSC 2012
OBF Address at BOSC 2012
 
Towards a Simple, Standards-Compliant, and Generic Phylogenetic Database
Towards a Simple, Standards-Compliant, and Generic Phylogenetic DatabaseTowards a Simple, Standards-Compliant, and Generic Phylogenetic Database
Towards a Simple, Standards-Compliant, and Generic Phylogenetic Database
 
Lapp, ISCB Software Sharing Symposium
Lapp, ISCB Software Sharing SymposiumLapp, ISCB Software Sharing Symposium
Lapp, ISCB Software Sharing Symposium
 

Recently uploaded

ESR_factors_affect-clinic significance-Pathysiology.pptx
ESR_factors_affect-clinic significance-Pathysiology.pptxESR_factors_affect-clinic significance-Pathysiology.pptx
ESR_factors_affect-clinic significance-Pathysiology.pptx
muralinath2
 
in vitro propagation of plants lecture note.pptx
in vitro propagation of plants lecture note.pptxin vitro propagation of plants lecture note.pptx
in vitro propagation of plants lecture note.pptx
yusufzako14
 
Citrus Greening Disease and its Management
Citrus Greening Disease and its ManagementCitrus Greening Disease and its Management
Citrus Greening Disease and its Management
subedisuryaofficial
 
Multi-source connectivity as the driver of solar wind variability in the heli...
Multi-source connectivity as the driver of solar wind variability in the heli...Multi-source connectivity as the driver of solar wind variability in the heli...
Multi-source connectivity as the driver of solar wind variability in the heli...
Sérgio Sacani
 
Astronomy Update- Curiosity’s exploration of Mars _ Local Briefs _ leadertele...
Astronomy Update- Curiosity’s exploration of Mars _ Local Briefs _ leadertele...Astronomy Update- Curiosity’s exploration of Mars _ Local Briefs _ leadertele...
Astronomy Update- Curiosity’s exploration of Mars _ Local Briefs _ leadertele...
NathanBaughman3
 
Observation of Io’s Resurfacing via Plume Deposition Using Ground-based Adapt...
Observation of Io’s Resurfacing via Plume Deposition Using Ground-based Adapt...Observation of Io’s Resurfacing via Plume Deposition Using Ground-based Adapt...
Observation of Io’s Resurfacing via Plume Deposition Using Ground-based Adapt...
Sérgio Sacani
 
NuGOweek 2024 Ghent - programme - final version
NuGOweek 2024 Ghent - programme - final versionNuGOweek 2024 Ghent - programme - final version
NuGOweek 2024 Ghent - programme - final version
pablovgd
 
Unveiling the Energy Potential of Marshmallow Deposits.pdf
Unveiling the Energy Potential of Marshmallow Deposits.pdfUnveiling the Energy Potential of Marshmallow Deposits.pdf
Unveiling the Energy Potential of Marshmallow Deposits.pdf
Erdal Coalmaker
 
GBSN - Microbiology (Lab 4) Culture Media
GBSN - Microbiology (Lab 4) Culture MediaGBSN - Microbiology (Lab 4) Culture Media
GBSN - Microbiology (Lab 4) Culture Media
Areesha Ahmad
 
Structural Classification Of Protein (SCOP)
Structural Classification Of Protein  (SCOP)Structural Classification Of Protein  (SCOP)
Structural Classification Of Protein (SCOP)
aishnasrivastava
 
EY - Supply Chain Services 2018_template.pptx
EY - Supply Chain Services 2018_template.pptxEY - Supply Chain Services 2018_template.pptx
EY - Supply Chain Services 2018_template.pptx
AlguinaldoKong
 
Leaf Initiation, Growth and Differentiation.pdf
Leaf Initiation, Growth and Differentiation.pdfLeaf Initiation, Growth and Differentiation.pdf
Leaf Initiation, Growth and Differentiation.pdf
RenuJangid3
 
Lab report on liquid viscosity of glycerin
Lab report on liquid viscosity of glycerinLab report on liquid viscosity of glycerin
Lab report on liquid viscosity of glycerin
ossaicprecious19
 
GBSN- Microbiology (Lab 3) Gram Staining
GBSN- Microbiology (Lab 3) Gram StainingGBSN- Microbiology (Lab 3) Gram Staining
GBSN- Microbiology (Lab 3) Gram Staining
Areesha Ahmad
 
Nutraceutical market, scope and growth: Herbal drug technology
Nutraceutical market, scope and growth: Herbal drug technologyNutraceutical market, scope and growth: Herbal drug technology
Nutraceutical market, scope and growth: Herbal drug technology
Lokesh Patil
 
Lateral Ventricles.pdf very easy good diagrams comprehensive
Lateral Ventricles.pdf very easy good diagrams comprehensiveLateral Ventricles.pdf very easy good diagrams comprehensive
Lateral Ventricles.pdf very easy good diagrams comprehensive
silvermistyshot
 
Comparative structure of adrenal gland in vertebrates
Comparative structure of adrenal gland in vertebratesComparative structure of adrenal gland in vertebrates
Comparative structure of adrenal gland in vertebrates
sachin783648
 
The ASGCT Annual Meeting was packed with exciting progress in the field advan...
The ASGCT Annual Meeting was packed with exciting progress in the field advan...The ASGCT Annual Meeting was packed with exciting progress in the field advan...
The ASGCT Annual Meeting was packed with exciting progress in the field advan...
Health Advances
 
Body fluids_tonicity_dehydration_hypovolemia_hypervolemia.pptx
Body fluids_tonicity_dehydration_hypovolemia_hypervolemia.pptxBody fluids_tonicity_dehydration_hypovolemia_hypervolemia.pptx
Body fluids_tonicity_dehydration_hypovolemia_hypervolemia.pptx
muralinath2
 
Mammalian Pineal Body Structure and Also Functions
Mammalian Pineal Body Structure and Also FunctionsMammalian Pineal Body Structure and Also Functions
Mammalian Pineal Body Structure and Also Functions
YOGESH DOGRA
 

Recently uploaded (20)

ESR_factors_affect-clinic significance-Pathysiology.pptx
ESR_factors_affect-clinic significance-Pathysiology.pptxESR_factors_affect-clinic significance-Pathysiology.pptx
ESR_factors_affect-clinic significance-Pathysiology.pptx
 
in vitro propagation of plants lecture note.pptx
in vitro propagation of plants lecture note.pptxin vitro propagation of plants lecture note.pptx
in vitro propagation of plants lecture note.pptx
 
Citrus Greening Disease and its Management
Citrus Greening Disease and its ManagementCitrus Greening Disease and its Management
Citrus Greening Disease and its Management
 
Multi-source connectivity as the driver of solar wind variability in the heli...
Multi-source connectivity as the driver of solar wind variability in the heli...Multi-source connectivity as the driver of solar wind variability in the heli...
Multi-source connectivity as the driver of solar wind variability in the heli...
 
Astronomy Update- Curiosity’s exploration of Mars _ Local Briefs _ leadertele...
Astronomy Update- Curiosity’s exploration of Mars _ Local Briefs _ leadertele...Astronomy Update- Curiosity’s exploration of Mars _ Local Briefs _ leadertele...
Astronomy Update- Curiosity’s exploration of Mars _ Local Briefs _ leadertele...
 
Observation of Io’s Resurfacing via Plume Deposition Using Ground-based Adapt...
Observation of Io’s Resurfacing via Plume Deposition Using Ground-based Adapt...Observation of Io’s Resurfacing via Plume Deposition Using Ground-based Adapt...
Observation of Io’s Resurfacing via Plume Deposition Using Ground-based Adapt...
 
NuGOweek 2024 Ghent - programme - final version
NuGOweek 2024 Ghent - programme - final versionNuGOweek 2024 Ghent - programme - final version
NuGOweek 2024 Ghent - programme - final version
 
Unveiling the Energy Potential of Marshmallow Deposits.pdf
Unveiling the Energy Potential of Marshmallow Deposits.pdfUnveiling the Energy Potential of Marshmallow Deposits.pdf
Unveiling the Energy Potential of Marshmallow Deposits.pdf
 
GBSN - Microbiology (Lab 4) Culture Media
GBSN - Microbiology (Lab 4) Culture MediaGBSN - Microbiology (Lab 4) Culture Media
GBSN - Microbiology (Lab 4) Culture Media
 
Structural Classification Of Protein (SCOP)
Structural Classification Of Protein  (SCOP)Structural Classification Of Protein  (SCOP)
Structural Classification Of Protein (SCOP)
 
EY - Supply Chain Services 2018_template.pptx
EY - Supply Chain Services 2018_template.pptxEY - Supply Chain Services 2018_template.pptx
EY - Supply Chain Services 2018_template.pptx
 
Leaf Initiation, Growth and Differentiation.pdf
Leaf Initiation, Growth and Differentiation.pdfLeaf Initiation, Growth and Differentiation.pdf
Leaf Initiation, Growth and Differentiation.pdf
 
Lab report on liquid viscosity of glycerin
Lab report on liquid viscosity of glycerinLab report on liquid viscosity of glycerin
Lab report on liquid viscosity of glycerin
 
GBSN- Microbiology (Lab 3) Gram Staining
GBSN- Microbiology (Lab 3) Gram StainingGBSN- Microbiology (Lab 3) Gram Staining
GBSN- Microbiology (Lab 3) Gram Staining
 
Nutraceutical market, scope and growth: Herbal drug technology
Nutraceutical market, scope and growth: Herbal drug technologyNutraceutical market, scope and growth: Herbal drug technology
Nutraceutical market, scope and growth: Herbal drug technology
 
Lateral Ventricles.pdf very easy good diagrams comprehensive
Lateral Ventricles.pdf very easy good diagrams comprehensiveLateral Ventricles.pdf very easy good diagrams comprehensive
Lateral Ventricles.pdf very easy good diagrams comprehensive
 
Comparative structure of adrenal gland in vertebrates
Comparative structure of adrenal gland in vertebratesComparative structure of adrenal gland in vertebrates
Comparative structure of adrenal gland in vertebrates
 
The ASGCT Annual Meeting was packed with exciting progress in the field advan...
The ASGCT Annual Meeting was packed with exciting progress in the field advan...The ASGCT Annual Meeting was packed with exciting progress in the field advan...
The ASGCT Annual Meeting was packed with exciting progress in the field advan...
 
Body fluids_tonicity_dehydration_hypovolemia_hypervolemia.pptx
Body fluids_tonicity_dehydration_hypovolemia_hypervolemia.pptxBody fluids_tonicity_dehydration_hypovolemia_hypervolemia.pptx
Body fluids_tonicity_dehydration_hypovolemia_hypervolemia.pptx
 
Mammalian Pineal Body Structure and Also Functions
Mammalian Pineal Body Structure and Also FunctionsMammalian Pineal Body Structure and Also Functions
Mammalian Pineal Body Structure and Also Functions
 

Rphenoscape: 
Connecting the semantics of evolutionary morphology to comparative phylogenetics

  • 1. Rphenoscape:
 Connecting the semantics of evolutionary morphology to comparative phylogenetics Hilmar Lapp (Duke University) Hong Xu (Duke University) Jim Balhoff (RTI, Inc.) Evolution Meetings 2016, Austin, TX
  • 2. RPhenoscape • A package for accessing the Phenoscape Knowledgebase from within R programs • Programmatic access to: • Evolutionary character data with computable semantics • Machine-reasoning with computable phenotype data
  • 3. R features a rich ecosystem for comparative phylogenetics CRAN Task View on Phylogenetics and Comparative Methods at last count lists 76 packages.
  • 4. Comparative analysis needs comparative data Magee et al (2014), PLOS ONE See also Drew et al (2013), PLOS Biology; Stoltzfus et al (2012), BMC Research Notes
  • 5. The lack of reusable digital data is amplified for morphology Which matrix are we criticizing? RC07 published their character list (their appendix 2) and their matrix (appendix 3). We also have a hitherto unpublished NEXUS file (presented here as part of Data S3), most likely sent by [M.R.] to [D.G.] in late 2007 or early 2008, which purports to contain the same matrix. Surprisingly, the character list in the paper and that in the file do not agree on the identities of characters 132–134. Marjanović and Laurin (2015) Reevaluation of the largest published morphological data matrix for phylogenetic analysis of Paleozoic limbed vertebrates. PeerJ Preprints 3:e1596v1
  • 6. Morphology is more complex than discrete, disjoint, independent
  • 7. We know more about morphology than authors state
  • 8. Implied knowledge can be substantial Asserted Inferred Missing Digit presence/absence; Sarcopterygii
  • 10. Phenoscape Knowledgebase ❖ 4,399 taxa (vertebrates) ❖ 139 publications (matrices) ❖ 19,024 character states ❖ 651,660 phenotype annotations Morphological matrices Annotation Ontologies anatomy quality taxonomy Phenex software (Balhoff et al., 2010) Phenoscape Knowledgebase Machine reasoner (OWL)
  • 12. KB Interface for machines
  • 13. KB Interface for machines
  • 14. RPhenoscape An R package for API access to the Phenoscape Knowledgebase • Evolutionary character data with computable semantics • Machine-reasoning with computable phenotype data: • Synthetic supermatrix synthesis • Semantics-based character and state filtering • Semantic similarity-driven querying and synthesis
  • 15. Use-case: Querying studies by morphology and taxonomy > slist <- pk_get_study_list(taxon = "Ictaluridae", entity = "pectoral fin") > slist[,"label"] Source: local data frame [10 x 1] label <chr> 1 Bockmann, F. A. (1998) 2 Chen, X. (1994) 3 De Pinna, M. C., Ferraris, C. J. J., & Vari, R. P. (2007) 4 Fink, S. V, & Fink, W. L. (1981); Fink, S. V, & Fink, W. L. (1996) 5 Kailola, P. J. (2004) 6 Lundberg, J. G. (1992) 7 Mo, T. (1991) 8 Royero, R. (1999) 9 Vigliotta, T. R. (2008) 10 Wiley, E.O., and Johnson, G.D. (2010) >
  • 16. Use-case: Querying studies by morphology and taxonomy > nex_list <- pk_get_study_xml(as.matrix (slist[2:3,c("id")])) ....This might take a while.... https://scholar.google.com/scholar?q=hylogenetic+studies+of+the +amblycipitid+catfishes+%28Teleostei%2C+Siluriformes%29+with +species+accounts&btnG=&hl=en&as_sdt=0%2C42 Parse NeXML.... http://dx.doi.org/10.1111/j.1096-3642.2007.00306.x Parse NeXML.... > nex_list[[1]] A nexml object representing: 0 phylogenetic tree blocks, where: block 1 contains NULL phylogenetic trees block 0 contains phylogenetic trees 155 meta elements 1 character matrices 53 taxonomic units Taxa: Pseudobagarius leucorhynchus, Liobagrus obesus, Hypsidoris farsonensis, Erethistes sp. (Chen 1994), Bunocephalus amaurus, Xyliphius sp. (Chen 1994) ...
  • 18. Use-case: Synthesize presence- absence matrix > nex <- pk_get_ontotrace_xml(taxon = c("Ictalurus", "Ameiurus"), entity = "fin spine") > m <- pk_get_ontotrace(nex) > m[1:10,] ## Source: local data frame [15 x 5] ## ## taxa otu ## (chr) (chr) ## 1 Ameiurus brunneus VTO_0036273 ## 2 Ameiurus catus VTO_0036275 ## 3 Ameiurus melas VTO_0036272 ## 4 Ameiurus natalis VTO_0036274 ## 5 Ameiurus nebulosus VTO_0036278 ## 6 Ameiurus platycephalus VTO_0036276 ## 7 Ameiurus serracanthus VTO_0036277 ## 8 Ictalurus australis VTO_0061495 ## 9 Ictalurus balsanus VTO_0036221 ## 10 Ictalurus dugesii VTO_0061497 ## Variables not shown: otus (chr), anterior dentation of pectoral fin spine ## (int), anterior distal serration of pectoral fin spine (dbl)
  • 19. Use-case: Filter matrix using semantics > is_desc <- pk_is_descendant('Ictalurus', m$taxa) ## [1] FALSE FALSE FALSE FALSE FALSE FALSE FALSE TRUE TRUE TRUE TRUE ## [12] TRUE TRUE TRUE TRUE # pk_is_ancestor() also available (and pk_is_extinct()) # # This is in development for characters, too: # pk_is_descendant(‘jaw skeleton', m$chars, # relationships = ‘part of’)
  • 20. Current limitations Package is not on CRAN (yet), need to install from Github:
 
 
 Data in Phenoscape concentrated on vertebrates, and skeletal fin-limb characters. Semantics-driven matrix synthesis currently limited to presence-absence characters. library(“devtools”)
 install_github(“xu-hong/rphenoscape”)
  • 21. Summary Phenoscape KB has an API for machine access to computable morphology data and computational semantics services. RPhenoscape is a bridge between this API and the ecosystem of comparative phylogenetics packages in R. Translates between R user (who uses labels, data matrices) and Phenoscape KB API (which uses identifiers, ontology terms, NeXML, etc). Code on Github: https://github.com/xu-hong/rphenoscape
  • 23. Acknowledgements U.S. National Science Foundation DBI-1062404, DBI-1062542 National Evolutionary Synthesis Center (NESCent), NSF #EF-0905606 Phenoscape contributors, Advisory Board, Data sources (see: http://phenoscape.org/wiki/ Acknowledgments) RNeXML developers (C. Boettiger, S. Chamberlain)
 http://github.com/rOpenSci/RNeXML
  • 24. Get in touch Repo: github.com/xu-hong/rphenoscape Github: github.com/hlapp Twitter: @hlapp