Presentation for the Symposium: Building the Biodiversity Knowledge Graph for Insects – Components, Progress, and Challenges; 2016 XXV International Congress of Entomology, Orlando, FL – September 26, 2016 (#ICE2016). See https://esa.confex.com/esa/ice2016/meetingapp.cgi/Session/24482
Franz et al evol 2016 aligning multipe incongruent phylogenies with the euler...taxonbytes
Lightning talk at iEvoBio 2016 (http://www.ievobio.org/), given on June 21, 2016, at Evolution Meetings in Austin, Texas. Brief overview of using Euler/X to align phylogenies. See https://github.com/EulerProject
Franz sterner tdwg 2016 new power balance needed for trustworthy biodiversity...taxonbytes
View a video recording here: https://vimeo.com/195024485
Franz & Sterner @ #TDWG16 - "A new power balance is needed for trustworthy biodiversity data". Talk # 1134, Friday, December 09, 2016, 11:30 am. Session Contributed Papers 05: Data Gaps, Trust, Knowledge Acquisition. See https://mbgserv18.mobot.org/ocs/index.php/tdwg/tdwg2016/schedConf/program
Persistent homology and organismal theory: Quantifying the branching topologi...DanChitwood
The Botany 2017 Donald R. Kaplan Memorial Lecture in Comparative Development, Fort Worth, Texas, June 27, 2017. Dan Chitwood, Independent Researcher (Santa Rosa, CA).
lightning talk for iEvoBio2013, June 25, 2013, delivered by Arlin Stoltzfus on behalf of HIP and the hackathon participants. Phylotastic is a distributed delivery system for expert knowledge of species phylogeny (the tree of life).
Franz et al evol 2016 aligning multipe incongruent phylogenies with the euler...taxonbytes
Lightning talk at iEvoBio 2016 (http://www.ievobio.org/), given on June 21, 2016, at Evolution Meetings in Austin, Texas. Brief overview of using Euler/X to align phylogenies. See https://github.com/EulerProject
Franz sterner tdwg 2016 new power balance needed for trustworthy biodiversity...taxonbytes
View a video recording here: https://vimeo.com/195024485
Franz & Sterner @ #TDWG16 - "A new power balance is needed for trustworthy biodiversity data". Talk # 1134, Friday, December 09, 2016, 11:30 am. Session Contributed Papers 05: Data Gaps, Trust, Knowledge Acquisition. See https://mbgserv18.mobot.org/ocs/index.php/tdwg/tdwg2016/schedConf/program
Persistent homology and organismal theory: Quantifying the branching topologi...DanChitwood
The Botany 2017 Donald R. Kaplan Memorial Lecture in Comparative Development, Fort Worth, Texas, June 27, 2017. Dan Chitwood, Independent Researcher (Santa Rosa, CA).
lightning talk for iEvoBio2013, June 25, 2013, delivered by Arlin Stoltzfus on behalf of HIP and the hackathon participants. Phylotastic is a distributed delivery system for expert knowledge of species phylogeny (the tree of life).
Franz 2017 uiuc cirss non unitary syntheses of systematic knowledgetaxonbytes
Invited Presentation given at the University of Illinois Urbana Champaign iSchool, Center for Informatics Research in Science and Scholarship, CIRSS Seminar, Friday, February 17, 2017.
NISO Virtual Conference: Expanding the Assessment Toolbox: Blending the Old and New Assessment Practices
Dismantling a Single-Discipline Journal Bundle: A Triangulation Method for Assessment Diane (DeDe) Dawson MSc, MLIS, Science Liaison Librarian, Science Library, University of Saskatchewan
Given at the NIH stock center directors meeting, August 8, 2016. Author: Anita Bandrowski
Project: Resource Identification Initiative http://scicrunch.org/resources
Topic: How is model organism data being used in literature
Publishing Germplasm Vocabularies as Linked DataValeria Pesce
What has already been published?
What may still be needed?
How to do it?
This presentation is a part of the 3rd Session of the 1st International e-Conference on Germplasm Data Interoperability https://sites.google.com/site/germplasminteroperability/
The repository ecology: an approach to understanding repository and service i...R. John Robertson
An increasing number of university institutions and other organisations are deciding to deploy repositories and a growing number of formal and informal distributed services are supporting or capitalising on the information these repositories provide. Despite reasonably well understood technical architectures, early majority adopters may struggle to articulate their place within the actualities of a wider information environment. The idea of a repository ecology provides developers and administrators with a useful way of articulating and analysing their place in the information environment, and the technical and organisational interactions they have, or are developing, with other parts of such an environment. This presentation will provide an overview of the concept of a repository ecology and examine some examples from the domains of scholarly communications and elearning.
Knowledge Organization System (KOS) for biodiversity information resources, G...Dag Endresen
Slides from a presentation on the Knowledge Organization System (KOS) work program for GBIF. KOS developments for biodiversity information resources and input to the emerging Vocabulary Management Task Group (VoMaG).
Links
GBIF KOS prototype tools, http://kos.gbif.org/
Tool: Semantic Wiki prototype, http://terms.gbif.org/wiki/
Tool: ISOcat prototype demo, http://kos.gbif.org/isocat/
GBIF concept vocabulary term browser, http://kos.gbif.org/termbrowser/
GBIF Resources Repository, http://rs.gbif.org/terms/
GBIF Vocabulary Server, http://vocabularies.gbif.org/
GBIF Resources Browser, http://tools.gbif.org/resource-browser/
Connecting life sciences data at the European Bioinformatics InstituteConnected Data World
Tony Burdett's slides from his talk at Connected Data London. Tony is a Senior Software Engineer at The European Bioinformatics Institute. He presented the complexity of data at the EMBL-EBI and what is their solution to make sense of all this data.
II-SDV 2016 Srinivasan Parthiban - KOL Analytics from Biomedical LiteratureDr. Haxel Consult
Strategic partnerships between pharmaceutical companies and medical experts lead to more effective medical and marketing activities throughout a product life cycle. Identification of such medical experts, that is, key opinion leaders (KOLs) from bibliometric analysis is challenging due to volume and variety of data. Today, the research community is flooded with scientific literature, with thousands of journals and over 20 million abstracts in PubMed. Developing a holistic framework to identify, profile and update the KOLs is the need of the hour. Customers want digestible information – everything relevant. In this talk, we will present case studies on how we used the ontologies and disambiguation techniques to address KOL identification for different therapeutic areas.
De-centralized but global: Redesigning biodiversity data aggregation for impr...taxonbytes
Biodiversity data pose fundamental challenges for unification-based paradigms of data science. In particular, a hierarchical, backbone-driven approach to aggregating global biodiversity data tends to limit community engagement. Data quality, trust, fitness for use, and impact are similarly reduced. This presentation will outline an alternative, de-centralized design for aggregating biodiversity data globally. The design requires a coordinative approach to representing and reconciling evolving systematic perspectives, and further social but technologically mediated coordination between regionally and taxonomically constrained "communities of practice" (sensu Wenger, 2000, https://doi.org/10.1177/135050840072002). Important next steps in this direction include the development of use cases that quantify the benefits of a de-centralized biodiversity data aggregation - in terms of lowering costs to expert engagement, raising efficiency of curation, validating novel integration services, and improving reproducibility and provenance tracking across heterogenous data structures and portals.
Anzaldo franz 2017 ecn your daily weeviltaxonbytes
Slides of the presentation "#YourDailyWeevil - a story of modest but gratifying social media success", given at the 2017 Annual Meeting of the Entomological Collections Network, November 05, 2017, Denver, Colorado.
More Related Content
Similar to Franz et al ice 2016 addressing the name meaning drift challenge in open ended biodiversity information environments
Franz 2017 uiuc cirss non unitary syntheses of systematic knowledgetaxonbytes
Invited Presentation given at the University of Illinois Urbana Champaign iSchool, Center for Informatics Research in Science and Scholarship, CIRSS Seminar, Friday, February 17, 2017.
NISO Virtual Conference: Expanding the Assessment Toolbox: Blending the Old and New Assessment Practices
Dismantling a Single-Discipline Journal Bundle: A Triangulation Method for Assessment Diane (DeDe) Dawson MSc, MLIS, Science Liaison Librarian, Science Library, University of Saskatchewan
Given at the NIH stock center directors meeting, August 8, 2016. Author: Anita Bandrowski
Project: Resource Identification Initiative http://scicrunch.org/resources
Topic: How is model organism data being used in literature
Publishing Germplasm Vocabularies as Linked DataValeria Pesce
What has already been published?
What may still be needed?
How to do it?
This presentation is a part of the 3rd Session of the 1st International e-Conference on Germplasm Data Interoperability https://sites.google.com/site/germplasminteroperability/
The repository ecology: an approach to understanding repository and service i...R. John Robertson
An increasing number of university institutions and other organisations are deciding to deploy repositories and a growing number of formal and informal distributed services are supporting or capitalising on the information these repositories provide. Despite reasonably well understood technical architectures, early majority adopters may struggle to articulate their place within the actualities of a wider information environment. The idea of a repository ecology provides developers and administrators with a useful way of articulating and analysing their place in the information environment, and the technical and organisational interactions they have, or are developing, with other parts of such an environment. This presentation will provide an overview of the concept of a repository ecology and examine some examples from the domains of scholarly communications and elearning.
Knowledge Organization System (KOS) for biodiversity information resources, G...Dag Endresen
Slides from a presentation on the Knowledge Organization System (KOS) work program for GBIF. KOS developments for biodiversity information resources and input to the emerging Vocabulary Management Task Group (VoMaG).
Links
GBIF KOS prototype tools, http://kos.gbif.org/
Tool: Semantic Wiki prototype, http://terms.gbif.org/wiki/
Tool: ISOcat prototype demo, http://kos.gbif.org/isocat/
GBIF concept vocabulary term browser, http://kos.gbif.org/termbrowser/
GBIF Resources Repository, http://rs.gbif.org/terms/
GBIF Vocabulary Server, http://vocabularies.gbif.org/
GBIF Resources Browser, http://tools.gbif.org/resource-browser/
Connecting life sciences data at the European Bioinformatics InstituteConnected Data World
Tony Burdett's slides from his talk at Connected Data London. Tony is a Senior Software Engineer at The European Bioinformatics Institute. He presented the complexity of data at the EMBL-EBI and what is their solution to make sense of all this data.
II-SDV 2016 Srinivasan Parthiban - KOL Analytics from Biomedical LiteratureDr. Haxel Consult
Strategic partnerships between pharmaceutical companies and medical experts lead to more effective medical and marketing activities throughout a product life cycle. Identification of such medical experts, that is, key opinion leaders (KOLs) from bibliometric analysis is challenging due to volume and variety of data. Today, the research community is flooded with scientific literature, with thousands of journals and over 20 million abstracts in PubMed. Developing a holistic framework to identify, profile and update the KOLs is the need of the hour. Customers want digestible information – everything relevant. In this talk, we will present case studies on how we used the ontologies and disambiguation techniques to address KOL identification for different therapeutic areas.
De-centralized but global: Redesigning biodiversity data aggregation for impr...taxonbytes
Biodiversity data pose fundamental challenges for unification-based paradigms of data science. In particular, a hierarchical, backbone-driven approach to aggregating global biodiversity data tends to limit community engagement. Data quality, trust, fitness for use, and impact are similarly reduced. This presentation will outline an alternative, de-centralized design for aggregating biodiversity data globally. The design requires a coordinative approach to representing and reconciling evolving systematic perspectives, and further social but technologically mediated coordination between regionally and taxonomically constrained "communities of practice" (sensu Wenger, 2000, https://doi.org/10.1177/135050840072002). Important next steps in this direction include the development of use cases that quantify the benefits of a de-centralized biodiversity data aggregation - in terms of lowering costs to expert engagement, raising efficiency of curation, validating novel integration services, and improving reproducibility and provenance tracking across heterogenous data structures and portals.
Anzaldo franz 2017 ecn your daily weeviltaxonbytes
Slides of the presentation "#YourDailyWeevil - a story of modest but gratifying social media success", given at the 2017 Annual Meeting of the Entomological Collections Network, November 05, 2017, Denver, Colorado.
Franz et al tdwg 2016 new developments for libraries of lifetaxonbytes
Franz et al. @ #TDWG16 - "New developments for the Libraries of Life project and app". Talk # 1138, Friday, December 09, 2016, 02:45 pm. Session Lightning Talks. See https://mbgserv18.mobot.org/ocs/index.php/tdwg/tdwg2016/schedConf/program
Franz et al tdwg 2016 introducing lep nettaxonbytes
Franz et al. @ #TDWG16 - "Introducing LepNet – the Lepidoptera of North America Network". Talk # 1139, Friday, December 09, 2016, 02:40 pm. Session Lightning Talks. See https://mbgserv18.mobot.org/ocs/index.php/tdwg/tdwg2016/schedConf/program
Zhang et al ecn 2016 building an accessible weevil tissue collection for geno...taxonbytes
Poster describing the origin and function of the ASUHIC Weevil Tissue Collection (WTC), see tinyurl.com/weeviltissuecollection; presented at the 2016 Entomological Collections Network Meeting, September 23, 2016, Orlando, Florida. ECN website: http://ecnweb.org/
Johnston ESA 2014 Trogloderus Sand Dune Speciationtaxonbytes
Andrew Johnston's presentation on Trogloderus (Coleoptera: Tenebrionidae) systematics and speciation in Southwestern United States sand dune habitat, given at the 2014 Annual Meeting of the Entomological Society of America in Portland, OR. http://www.entsoc.org/entomology2014
Franz 2014 BIGCB Tracking Change across Classifications and Phylogeniestaxonbytes
Slides presented on the Euler/X toolkit at the "Understanding Taxon Ranges in Space and Time" Workshop – Berkeley Initiative in Global Change Biology (BIGCB); held on November 07-09, 2014, University of California at Berkeley, CA. See also http://taxonbytes.org/bigcb-workshop-at-uc-berkeley-tackling-the-taxon-concept-problem/
Arizona State University Natural History Collections - Moving to Alameda (201...taxonbytes
A collection of photos showing the transition of the Natural History Collections (School of Life Sciences, Arizona State University) from the Tempe Campus to the Alameda location. May, 2011 to August, 2014. See also http://taxonbytes.org/impressions-alameda-grand-opening/
Cobb, Seltmann, Franz. 2014. The Current State of Arthropod Biodiversity Data...taxonbytes
Cobb et al. 2014. The Current State of Arthropod Biodiversity Data: Addressing Impacts of Global Change. Presented at https://www.idigbio.org/content/collections-21st-century-symposium Program available at https://www.idigbio.org/wiki/index.php/Collections_for_the_21st_Century
Franz. 2014. Explaining taxonomy's legacy to computers – how and why?taxonbytes
Slides presented on the Euler/X projected (http://taxonbytes.org/prior-work-on-concept-taxonomy-2013/ & https://bitbucket.org/eulerx/euler-project) - for the conference "The Meaning of Names: Naming Diversity in the 21st Century", CU Natural History Museum, September 30, 2014.
Ludäscher et al. 2014 - A Hybrid Diagnosis Approach Combining Black-Box and W...taxonbytes
Presentation given at RuleML 2014 conference (http://ruleml2014.vse.cz/) with updates on the Euler/X toolkit; see also http://taxonbytes.org/prior-work-on-concept-taxonomy-2013/
THE IMPORTANCE OF MARTIAN ATMOSPHERE SAMPLE RETURN.Sérgio Sacani
The return of a sample of near-surface atmosphere from Mars would facilitate answers to several first-order science questions surrounding the formation and evolution of the planet. One of the important aspects of terrestrial planet formation in general is the role that primary atmospheres played in influencing the chemistry and structure of the planets and their antecedents. Studies of the martian atmosphere can be used to investigate the role of a primary atmosphere in its history. Atmosphere samples would also inform our understanding of the near-surface chemistry of the planet, and ultimately the prospects for life. High-precision isotopic analyses of constituent gases are needed to address these questions, requiring that the analyses are made on returned samples rather than in situ.
Observation of Io’s Resurfacing via Plume Deposition Using Ground-based Adapt...Sérgio Sacani
Since volcanic activity was first discovered on Io from Voyager images in 1979, changes
on Io’s surface have been monitored from both spacecraft and ground-based telescopes.
Here, we present the highest spatial resolution images of Io ever obtained from a groundbased telescope. These images, acquired by the SHARK-VIS instrument on the Large
Binocular Telescope, show evidence of a major resurfacing event on Io’s trailing hemisphere. When compared to the most recent spacecraft images, the SHARK-VIS images
show that a plume deposit from a powerful eruption at Pillan Patera has covered part
of the long-lived Pele plume deposit. Although this type of resurfacing event may be common on Io, few have been detected due to the rarity of spacecraft visits and the previously low spatial resolution available from Earth-based telescopes. The SHARK-VIS instrument ushers in a new era of high resolution imaging of Io’s surface using adaptive
optics at visible wavelengths.
Nutraceutical market, scope and growth: Herbal drug technologyLokesh Patil
As consumer awareness of health and wellness rises, the nutraceutical market—which includes goods like functional meals, drinks, and dietary supplements that provide health advantages beyond basic nutrition—is growing significantly. As healthcare expenses rise, the population ages, and people want natural and preventative health solutions more and more, this industry is increasing quickly. Further driving market expansion are product formulation innovations and the use of cutting-edge technology for customized nutrition. With its worldwide reach, the nutraceutical industry is expected to keep growing and provide significant chances for research and investment in a number of categories, including vitamins, minerals, probiotics, and herbal supplements.
Slide 1: Title Slide
Extrachromosomal Inheritance
Slide 2: Introduction to Extrachromosomal Inheritance
Definition: Extrachromosomal inheritance refers to the transmission of genetic material that is not found within the nucleus.
Key Components: Involves genes located in mitochondria, chloroplasts, and plasmids.
Slide 3: Mitochondrial Inheritance
Mitochondria: Organelles responsible for energy production.
Mitochondrial DNA (mtDNA): Circular DNA molecule found in mitochondria.
Inheritance Pattern: Maternally inherited, meaning it is passed from mothers to all their offspring.
Diseases: Examples include Leber’s hereditary optic neuropathy (LHON) and mitochondrial myopathy.
Slide 4: Chloroplast Inheritance
Chloroplasts: Organelles responsible for photosynthesis in plants.
Chloroplast DNA (cpDNA): Circular DNA molecule found in chloroplasts.
Inheritance Pattern: Often maternally inherited in most plants, but can vary in some species.
Examples: Variegation in plants, where leaf color patterns are determined by chloroplast DNA.
Slide 5: Plasmid Inheritance
Plasmids: Small, circular DNA molecules found in bacteria and some eukaryotes.
Features: Can carry antibiotic resistance genes and can be transferred between cells through processes like conjugation.
Significance: Important in biotechnology for gene cloning and genetic engineering.
Slide 6: Mechanisms of Extrachromosomal Inheritance
Non-Mendelian Patterns: Do not follow Mendel’s laws of inheritance.
Cytoplasmic Segregation: During cell division, organelles like mitochondria and chloroplasts are randomly distributed to daughter cells.
Heteroplasmy: Presence of more than one type of organellar genome within a cell, leading to variation in expression.
Slide 7: Examples of Extrachromosomal Inheritance
Four O’clock Plant (Mirabilis jalapa): Shows variegated leaves due to different cpDNA in leaf cells.
Petite Mutants in Yeast: Result from mutations in mitochondrial DNA affecting respiration.
Slide 8: Importance of Extrachromosomal Inheritance
Evolution: Provides insight into the evolution of eukaryotic cells.
Medicine: Understanding mitochondrial inheritance helps in diagnosing and treating mitochondrial diseases.
Agriculture: Chloroplast inheritance can be used in plant breeding and genetic modification.
Slide 9: Recent Research and Advances
Gene Editing: Techniques like CRISPR-Cas9 are being used to edit mitochondrial and chloroplast DNA.
Therapies: Development of mitochondrial replacement therapy (MRT) for preventing mitochondrial diseases.
Slide 10: Conclusion
Summary: Extrachromosomal inheritance involves the transmission of genetic material outside the nucleus and plays a crucial role in genetics, medicine, and biotechnology.
Future Directions: Continued research and technological advancements hold promise for new treatments and applications.
Slide 11: Questions and Discussion
Invite Audience: Open the floor for any questions or further discussion on the topic.
Earliest Galaxies in the JADES Origins Field: Luminosity Function and Cosmic ...Sérgio Sacani
We characterize the earliest galaxy population in the JADES Origins Field (JOF), the deepest
imaging field observed with JWST. We make use of the ancillary Hubble optical images (5 filters
spanning 0.4−0.9µm) and novel JWST images with 14 filters spanning 0.8−5µm, including 7 mediumband filters, and reaching total exposure times of up to 46 hours per filter. We combine all our data
at > 2.3µm to construct an ultradeep image, reaching as deep as ≈ 31.4 AB mag in the stack and
30.3-31.0 AB mag (5σ, r = 0.1” circular aperture) in individual filters. We measure photometric
redshifts and use robust selection criteria to identify a sample of eight galaxy candidates at redshifts
z = 11.5 − 15. These objects show compact half-light radii of R1/2 ∼ 50 − 200pc, stellar masses of
M⋆ ∼ 107−108M⊙, and star-formation rates of SFR ∼ 0.1−1 M⊙ yr−1
. Our search finds no candidates
at 15 < z < 20, placing upper limits at these redshifts. We develop a forward modeling approach to
infer the properties of the evolving luminosity function without binning in redshift or luminosity that
marginalizes over the photometric redshift uncertainty of our candidate galaxies and incorporates the
impact of non-detections. We find a z = 12 luminosity function in good agreement with prior results,
and that the luminosity function normalization and UV luminosity density decline by a factor of ∼ 2.5
from z = 12 to z = 14. We discuss the possible implications of our results in the context of theoretical
models for evolution of the dark matter halo mass function.
Comparing Evolved Extractive Text Summary Scores of Bidirectional Encoder Rep...University of Maribor
Slides from:
11th International Conference on Electrical, Electronics and Computer Engineering (IcETRAN), Niš, 3-6 June 2024
Track: Artificial Intelligence
https://www.etran.rs/2024/en/home-english/
Deep Behavioral Phenotyping in Systems Neuroscience for Functional Atlasing a...Ana Luísa Pinho
Functional Magnetic Resonance Imaging (fMRI) provides means to characterize brain activations in response to behavior. However, cognitive neuroscience has been limited to group-level effects referring to the performance of specific tasks. To obtain the functional profile of elementary cognitive mechanisms, the combination of brain responses to many tasks is required. Yet, to date, both structural atlases and parcellation-based activations do not fully account for cognitive function and still present several limitations. Further, they do not adapt overall to individual characteristics. In this talk, I will give an account of deep-behavioral phenotyping strategies, namely data-driven methods in large task-fMRI datasets, to optimize functional brain-data collection and improve inference of effects-of-interest related to mental processes. Key to this approach is the employment of fast multi-functional paradigms rich on features that can be well parametrized and, consequently, facilitate the creation of psycho-physiological constructs to be modelled with imaging data. Particular emphasis will be given to music stimuli when studying high-order cognitive mechanisms, due to their ecological nature and quality to enable complex behavior compounded by discrete entities. I will also discuss how deep-behavioral phenotyping and individualized models applied to neuroimaging data can better account for the subject-specific organization of domain-general cognitive systems in the human brain. Finally, the accumulation of functional brain signatures brings the possibility to clarify relationships among tasks and create a univocal link between brain systems and mental functions through: (1) the development of ontologies proposing an organization of cognitive processes; and (2) brain-network taxonomies describing functional specialization. To this end, tools to improve commensurability in cognitive science are necessary, such as public repositories, ontology-based platforms and automated meta-analysis tools. I will thus discuss some brain-atlasing resources currently under development, and their applicability in cognitive as well as clinical neuroscience.
Franz et al ice 2016 addressing the name meaning drift challenge in open ended biodiversity information environments
1. Addressing the name:meaning drift
challenge in open biodiversity
information environments
Please
@taxonbytes
Nico M. Franz1 , Salvatore A. Anzaldo1, Edward E. Gilbert1,
M. Andrew Jansen1, M. Andrew Johnston1 & Bertram Ludäscher2
1 School of Life Sciences, Arizona State University
2 iSchool, University of Illinois at Urbana-Champaign
Symposium: Building the Biodiversity Knowledge Graph for Insects – Components, Progress, and Challenges
2016 XXV International Congress of Entomology, Orlando, FL – September 26, 2016 (#ICE2016)
Presentation available @ SlideShare: http://tinyurl.com/franz-et-al-ice-2016
2. Our biodiversity informatics research program, summarized
• We are no longer just putting articles and monographs on library shelves.
91dd0ee1-8a37-4efc-85b7-8176874cf5be
3. Our biodiversity informatics research program, summarized
• We are no longer just putting articles and monographs on library shelves.
• This is more than 'just technology'; we must develop new systematic theory
to deal with inherently dynamic, open data systems.
91dd0ee1-8a37-4efc-85b7-8176874cf5be
4. Our biodiversity informatics research program, summarized
• We are no longer just putting articles and monographs on library shelves.
• This is more than 'just technology'; we must develop new systematic theory
to deal with inherently dynamic, open data systems.
• The concept taxonomy approach has practical implications for strengthening
the roles that individual experts play in big biodiversity data environments.
91dd0ee1-8a37-4efc-85b7-8176874cf5be
5. Products – concept taxonomy in theory and in practice
ZooKeys. doi:10.3897/zookeys.528.6001
Semantic Web. doi:10.3233/SW-160220
Biological Theory (in review). doi:10.1101/022145
PloS ONE. doi:10.1371/journal.pone.0118247
Systematics Biodiv. doi:10.1080/14772000.2013.806371
Systematic Biology. doi:10.1093/sysbio/syw023
Biodiversity Data Journal (in review). #6093
Research Ideas and Outcomes (in review). #6302
6. Premise: We're lucky that insect revisions are not so frequent
"In biology, there are many taxa that are so under-studied that
they are only known from their original description and
none or very few subsequent references […].
The name alone, so long as it is a unique name,
is sufficient to locate all related material."
– David Remsen 2016: 213
Source: Remsen. 2016. The use and limits of scientific names […]. ZooKeys 550: 207–223. doi:10.3897/zookeys.550.9546
8. Snapshot of a more frequently revised organismal lineage
Source: Franz et al. 2016. Controlling the taxonomic variable […]. Research Ideas and Outcomes (RIO). (In Review)
• 9 schemata for the NA Cleistes/Cleistesiopsis complex (orchids)
9. Snapshot of a more frequently revised organismal lineage
Source: Franz et al. 2016. Controlling the taxonomic variable […]. Research Ideas and Outcomes (RIO). (In Review)
• 9 schemata for the NA Cleistes/Cleistesiopsis complex (orchids)
• Vertical sections identify taxonomic concept regions
10. Snapshot of a more frequently revised organismal lineage
Source: Franz et al. 2016. Controlling the taxonomic variable […]. Research Ideas and Outcomes (RIO). (In Review)
• 9 schemata for the NA Cleistes/Cleistesiopsis complex (orchids)
• Vertical sections identify taxonomic concept regions
• Colors identify lineages of taxonomic names (epithets) in use
11. Snapshot of a more frequently revised organismal lineage
Source: Franz et al. 2016. Controlling the taxonomic variable […]. Research Ideas and Outcomes (RIO). (In Review)
• 9 schemata for the NA Cleistes/Cleistesiopsis complex (orchids)
• Vertical sections identify taxonomic concept regions
• Colors identify lineages of taxonomic names (epithets) in use
• There is no consensus! Five incongruent schemata are used concurrently
12. Premise:
If incongruent taxonomies are endorsed
– locally, provisionally, and democratically –
then what is the impact for
aggregated biodiversity data?
14. Source: Franz et al. 2016. Controlling the taxonomic variable […]. Research Ideas and Outcomes (RIO). (In Review)
The 'consensus'
• Query: "Where do these orchid
species occur?"
• Same set of 250 orchid specimens,
according to 4 taxonomies.
"Controllingthetaxonomicvariable" Example: the Cleistes use case
15. Source: Franz et al. 2016. Controlling the taxonomic variable […]. Research Ideas and Outcomes (RIO). (In Review)
The 'consensus' The 'bible'
"Controllingthetaxonomicvariable"
• Query: "Where do these orchid
species occur?"
• Same set of 250 orchid specimens,
according to 4 taxonomies.
Example: the Cleistes use case
16. Source: Franz et al. 2016. Controlling the taxonomic variable […]. Research Ideas and Outcomes (RIO). (In Review)
The 'consensus' The 'bible'
The (formerly)
federal 'standard'
"Controllingthetaxonomicvariable"
17. Source: Franz et al. 2016. Controlling the taxonomic variable […]. Research Ideas and Outcomes (RIO). (In Review)
The 'consensus' The 'bible'
The (formerly)
federal 'standard'
The 'best', latest
regional flora
"Controllingthetaxonomicvariable"
18. Source: Franz et al. 2016. Controlling the taxonomic variable […]. Research Ideas and Outcomes (RIO). (In Review)
The 'consensus' The 'bible'
The (formerly)
federal 'standard'
The 'best', latest
regional flora
"Controllingthetaxonomicvariable"
Expert views
are in conflict
19. Source: Franz et al. 2016. Controlling the taxonomic variable […]. Research Ideas and Outcomes (RIO). (In Review)
The 'consensus' The 'bible'
The (formerly)
federal 'standard'
The 'best', latest
regional flora
"Controllingthetaxonomicvariable"
Expert views
are in conflict
"Just bad"
20. Source: Franz et al. 2016. Controlling the taxonomic variable […]. Research Ideas and Outcomes (RIO). (In Review)
The 'consensus' The 'bible'
The (formerly)
federal 'standard'
The 'best', latest
regional flora
Impact:
Name-based aggregation has created
a novel synthesis that nobody believes in
"Controllingthetaxonomicvariable"
"Just bad"
21. Source: Franz et al. 2016. Controlling the taxonomic variable […]. Research Ideas and Outcomes (RIO). (In Review)
The 'consensus' The 'bible'
The (formerly)
federal 'standard'
The 'best', latest
regional flora
"Controllingthetaxonomicvariable"
"Just
bad"
Expert views
are in conflict
Solution:
Instead of aggregating
an artificial 'consensus',
…
22. Source: Franz et al. 2016. Controlling the taxonomic variable […]. Research Ideas and Outcomes (RIO). (In Review)
The 'consensus' The 'bible'
The (formerly)
federal 'standard'
The 'best', latest
regional flora
"Controllingthetaxonomicvariable"
"Just
bad"
Expert views
are reconciled
Solution:
Instead of aggregating
an artificial 'consensus',
build translation services
23. Challenges:
How can we redesign aggregation to yield
high-quality biodiversity data packages?
24. Challenges:
How can we redesign aggregation to yield
high-quality biodiversity data packages?
What does this mean for Darwin Core1
and how we use this aggregation standard?
1 Wieczorek et al. 2012. Darwin Core: an evolving […]. PLoS ONE 7(1): e29715. doi:10.1371/journal.pone.0029715
25. Preview of solution with 8 steps
• DwC is insufficient, and part of the problem
Step 7:
26. # 1: Represent only taxonomic concept labels (TCLs) 1
• Syntax (TCL): taxonomic name [author, year, page] sec. source
1 Multi-taxonomy input/alignment visualizations generated with Euler/X toolkit: https://github.com/EulerProject/EulerX
Cleistes divaricata
sec. Gregg & Catling 1993
Pogonia
sec. Brown & Wunderlin 1997
27. # 1: DwC score keeping TCLs are optional; < 1% realized?
• TCL ~ DwC: nameAccordingTo
• SCAN: 19,722 of nearly 9 million records have TCLs (0.2%)
• Lack of enforcement to use TCLs makes standard less big data-ready
DwC record with nameAccordingTo (TCL)
(BDJ)
"Who authors GBIF's Backbone?"
https://storify.com/taxonbytes/who-authors-gbif-s-backbone
28. # 2: Represent each source coherently (Parent-Child relationships)
• Syntax (PC): TCL1 is a child/parent of TCL2 [where TCL1/2 = same source]
Cleistesiopsis bifaria sec. Pans. & de Barr. 2008
is a child of
Cleistesiopsis sec. Pans. & de Barr. 2008
29. # 2: DwC score keeping Not (adequately) represented
• PC ~ DwC: genus, family, order (etc.; higherClassification)
• However, higher-level names in DwC are not modeled as TCLs
• Taxonomic coherence of sources cannot be preserved with DwC alone
DwC record with higherClassification
(BDJ)
30. # 3: Do not force a single hierarchy onto all tip-level TCLs
• Syntax (PC): Tip-level TCL1 , TCL2 , etc. [where TCL1/2 = different sources]
31. # 3: DwC score keeping Optional Not (ever?) practiced
• No PC ~ DwC: infra-/specificEpithet only
• Typically, a single, 'unitary' higher-level classification is represented
• Combinations of algorithmic and social practices achieve the single hierarchy
"Who authors GBIF's Backbone?"
https://storify.com/taxonbytes/who-authors-gbif-s-backbone
32. # 4: Link TCLs via expert-provided RCC–5 articulations
• Syntax (RCC–5): TCL1 {==, >, <, ><, !} TCL2 [where TCL1/2 = diff. sources]
• RCC–5 = Region Connection Calculus
• 14 articulations provided by: http://tinyurl.com/Weakley-Flora-2015
Cleistes bifaria "Coastal Populations" sec. Smith et al. 2004
== (is congruent with)
Cleistesiopsis oricamporum sec. Brown & Pans. 2009
==
33. Source: Thau, D.M. 2010. Reasoning about taxonomies. Thesis, UC Davis. http://gradworks.proquest.com/3422778.pdf
Region Connection Calculus (semantics: set constraints)
== < > >< !
• Two regions N, M are either:
• congruent (N == M)
• properly inclusive (N < M)
• inversely properly inclusive (N > M)
• overlapping (N >< M)
• exclusive of each other (N ! M)
34. Source: Thau, D.M. 2010. Reasoning about taxonomies. Thesis, UC Davis. http://gradworks.proquest.com/3422778.pdf
Region Connection Calculus (semantics: set constraints)
== < > >< !
• Two regions N, M are either:
• congruent (N == M)
• properly inclusive (N < M)
• inversely properly inclusive (N > M)
• overlapping (N >< M)
• exclusive of each other (N ! M)
• RCC–5 articulations answer the query: "can we join regions N and M?"
• Taxonomies have multiple RCC–5 alignable components: nodes (parents,
children), node-associated traits, even node-anchoring specimens
36. Oscillating meanings of the epithet hyalites – 1911 to 2003
Phenotypicdiversity
Type-anchorednameidentityrelations
Source: Vane-Wright. 2003. Indifferent philosophy versus […]. Syst. Biodiv. 1: 3–11. doi:10.1017/S1477200003001063
37. # 5: Identify occurrence records only to TCLs
Records:
EKY39235
MTSU003611
NCSC00040204
…
Records:
BOON8098
CLEMS0061133
WILLI39399
…
Records:
GMUF-0039355
IBE006808
USCH58399
…
Records:
CONV0006268
MDKY00006482
NCU00038930
…
Records:
BRYV0023582, BRYV0023584
KHD00032030, MISS0016604
MMNS000227, NCSC00040206
USMS_000002923, USMS_000002924
VSC0053223, VSC0065528
…
Records:
ARIZ393087
DBG39049
USCH51217
…
Records:
NCU00040710
USCH96248
VSC0053218
…
Records:
CLEMS0012881
FUGR0003293
GA023130
…
Records:
BOON8100
NCSC00040210
SJNM45487
…
Records:
GA023144
LSU00012494
MISS0016608
…
Records:
IBE006810, IND-0012374, MMNS000227
Records:
NY8654
• Syntax (ID): Occurrence / organism is identified to TCL
"CLEMS0012881"
is identified to
Cleistes divaricata sec. Smith et al. 2004
[additional ID metadata]
38. DwC record with Identification metadata
(BDJ)
# 5: DwC score keeping ID metadata optional; > 50% realized
• ID ~ DwC: Identification, (date)identified(By), identificationReference
• SCAN: 4,715,277 of nearly 9 million records have ID metadata (52.5%)
• Enforcement…still also require use of TCLs
39. # 6: Generate comprehensive, consistent RCC–5 alignments
• Euler/X is a toolkit that infers logically consistent RCC–5 alignments
40. # 6: Generate comprehensive, consistent RCC–5 alignments
• Valued-added: MIR – set of Maximally Informative Relations containing
the RCC–5 articulation for every possible TCL pair scalability
Reasonerinference
42. Source: Franz et al. 2016. Controlling the taxonomic variable […]. Research Ideas and Outcomes (RIO). (In Review)
The 'consensus' The 'bible'
The (formerly)
federal 'standard'
The 'best', latest
regional flora
"Controllingthetaxonomicvariable"
Impact:
"Please select your preference (A – D);
we can perform all translations"
43. • We can now respond to queries such as:
• "Show all specimens identified to the taxonomic name Cleistes divaricata"
• Returns many records resolves incongruent lineage of name usages
# 8: "Do you trust us now?" Aggregation as a translational service
44. • We can now respond to queries such as:
• "Show all specimens identified to the taxonomic name Cleistes divaricata"
• Returns many records resolves incongruent lineage of name usages
• "Now show specimens with the TCL Cleistesiopsis divaricata sec. Weakley 2015"
• Returns record subset resolving only one narrowly circumscribed concept
# 8: "Do you trust us now?" Aggregation as a translational service
45. # 8: "Do you trust us now?" Aggregation as a translational service
• We can now respond to queries such as:
• "Show all specimens identified to the taxonomic name Cleistes divaricata"
• Returns many records resolves incongruent lineage of name usages
• "Now show specimens with the TCL Cleistesiopsis divaricata sec. Weakley 2015"
• Returns record subset resolving only one narrowly circumscribed concept
• "Now show specimens identified to the TCL Cleistes divaricata sec. RAB 1968,
yet translated into the more granular TCLs sec. Weakley 2015"
• Returns (again) many records, yet represents and contrasts two treatments,
as opposed to providing the ambiguous lineage view (above)
• "Show all specimens with ambiguous 2010/2015 TCL identifications…" (etc.)
46. Conclusions – designing trusted biodiversity data services
• The Darwin Core standard for aggregating biodiversity data:
(1) Has under-utilized options for better representing taxonomic expertise
(2) Is part of a design paradigm that undermines the plurality of expertise
47. • The Darwin Core standard for aggregating biodiversity data:
(1) Has under-utilized options for better representing taxonomic expertise
(2) Is part of a design paradigm that undermines the plurality of expertise
• We are developing new solutions – including TCLs, PC relations, RCC–5, and
scalable logic applications – that realize data aggregation via translational
services, without disrupting the formation of expert-licensed, high-quality
biodiversity data packages
Conclusions – designing trusted biodiversity data services
48. • The Darwin Core standard for aggregating biodiversity data:
(1) Has under-utilized options for better representing taxonomic expertise
(2) Is part of a design paradigm that undermines the plurality of expertise
• We are developing new solutions – including TCLs, PC relations, RCC–5, and
scalable logic applications – that realize data aggregation via translational
services, without disrupting the formation of expert-licensed, high-quality
biodiversity data packages
• All of us – not just aggregators – "own" the responsibility of designing
systems where the plurality of taxonomic expertise is fairly accommodated
Conclusions – designing trusted biodiversity data services
49. Acknowledgments & links to products
• Cleistes use case: Alan Weakley (UNC)
• Euler/X toolkit: Shizhuo Yu (UC Davis)
• Data trajectories: Beckett Sterner (ASU)
• OBKMS design: Viktor Senderov (Pensoft)
• NSF DEB–1155984, DBI–1342595 (PI Franz)
• NSF IIS–118088, DBI–1147273 (PI Ludäscher)
• Euler/X code @ https://github.com/EulerProject/EulerX
• Franz et al. 2016. Two influential primate classifications logically aligned.
Systematic Biology 65(4): 561–582. Link
The more one looks, the more complicated it gets. Notice also the node labeling, or lack thereof.
The more one looks, the more complicated it gets. Notice also the node labeling, or lack thereof.
The more one looks, the more complicated it gets. Notice also the node labeling, or lack thereof.
The more one looks, the more complicated it gets. Notice also the node labeling, or lack thereof.
The more one looks, the more complicated it gets. Notice also the node labeling, or lack thereof.
The more one looks, the more complicated it gets. Notice also the node labeling, or lack thereof.
The more one looks, the more complicated it gets. Notice also the node labeling, or lack thereof.
The more one looks, the more complicated it gets. Notice also the node labeling, or lack thereof.
The more one looks, the more complicated it gets. Notice also the node labeling, or lack thereof.
The more one looks, the more complicated it gets. Notice also the node labeling, or lack thereof.
The more one looks, the more complicated it gets. Notice also the node labeling, or lack thereof.
The more one looks, the more complicated it gets. Notice also the node labeling, or lack thereof.
The more one looks, the more complicated it gets. Notice also the node labeling, or lack thereof.
The more one looks, the more complicated it gets. Notice also the node labeling, or lack thereof.
The more one looks, the more complicated it gets. Notice also the node labeling, or lack thereof.
The more one looks, the more complicated it gets. Notice also the node labeling, or lack thereof.
The more one looks, the more complicated it gets. Notice also the node labeling, or lack thereof.
The more one looks, the more complicated it gets. Notice also the node labeling, or lack thereof.
The more one looks, the more complicated it gets. Notice also the node labeling, or lack thereof.
The more one looks, the more complicated it gets. Notice also the node labeling, or lack thereof.
The more one looks, the more complicated it gets. Notice also the node labeling, or lack thereof.
The more one looks, the more complicated it gets. Notice also the node labeling, or lack thereof.
The more one looks, the more complicated it gets. Notice also the node labeling, or lack thereof.
The more one looks, the more complicated it gets. Notice also the node labeling, or lack thereof.
The more one looks, the more complicated it gets. Notice also the node labeling, or lack thereof.
The more one looks, the more complicated it gets. Notice also the node labeling, or lack thereof.
The more one looks, the more complicated it gets. Notice also the node labeling, or lack thereof.
The more one looks, the more complicated it gets. Notice also the node labeling, or lack thereof.
The more one looks, the more complicated it gets. Notice also the node labeling, or lack thereof.
The more one looks, the more complicated it gets. Notice also the node labeling, or lack thereof.
The more one looks, the more complicated it gets. Notice also the node labeling, or lack thereof.
The simple semantics of RCC-5 makes this a rather generic vocabulary for representing advancement in phylogenetic knowledge. At the same time, the onus is on the phylogeneticists to apply the articulations in auch ways that the desired query services are actually obtained.
The simple semantics of RCC-5 makes this a rather generic vocabulary for representing advancement in phylogenetic knowledge. At the same time, the onus is on the phylogeneticists to apply the articulations in auch ways that the desired query services are actually obtained.
The more one looks, the more complicated it gets. Notice also the node labeling, or lack thereof.
The more one looks, the more complicated it gets. Notice also the node labeling, or lack thereof.
The more one looks, the more complicated it gets. Notice also the node labeling, or lack thereof.
The more one looks, the more complicated it gets. Notice also the node labeling, or lack thereof.
The more one looks, the more complicated it gets. Notice also the node labeling, or lack thereof.
The more one looks, the more complicated it gets. Notice also the node labeling, or lack thereof.
The more one looks, the more complicated it gets. Notice also the node labeling, or lack thereof.
The more one looks, the more complicated it gets. Notice also the node labeling, or lack thereof.
The more one looks, the more complicated it gets. Notice also the node labeling, or lack thereof.
The more one looks, the more complicated it gets. Notice also the node labeling, or lack thereof.
The more one looks, the more complicated it gets. Notice also the node labeling, or lack thereof.
The more one looks, the more complicated it gets. Notice also the node labeling, or lack thereof.