We have a working infrastructure as well as more than 200 partners, We harvest and sort text and multimedia by topic and by species and put it on our pages. Curation + user-added content from the crowds is added to the mix.This is fed back to providers, giving them traffic, quality control on their own content, and new content for them to use And, we are already seeing spinoff products. We make it easy for developers, and everything is either public domain or CC-licensed so it can be re-used.
We now have over a million pages with content, some of it is even in other languages like Arabic, Spanish, and Chinese. And we are getting traffic mostly from the general public, from all over the world.
Most of our 5.4 million content objects are text blobs and here are the subjects of that text. Most often, our text objects are about distribution. But there are many other subjects involved including essays that include multiple subjects.
Except for the first, links for that one on request
Information Visualization MOOC (Massive Open Online Course) led by Dr. Katy Börner of Indiana University, students TwyBethard (United States), Andrew Miles (United Kingdom), Edward Kok (Netherlands) and Mattia Della Libera (Italy) used GloBI data to create an insightful visualization of spatial marine food webs in the Gulf of Mexico.
Starting with marine dataIn the most simplistic view, we’ll be storing triplesThis data will be organized on a data tab, sorting out the data into the 35 or so “topics” that we currently have text chapters for, and we will also allow powerful downloading and searching capabilityFinally we’ll be setting up ways for other applications to grab the data and do interesting things with it. We already have a tool for making field guides,The approach here builds on our innovations for EOL and adds some proven technology called the “semantic web” to our domain. The next step takes this chain of innovation even further.
Drawing data from the literature, from online databases, and from published datasets as in Dryad, summarizing collections databases
Everyone wants to know theattributes of organismsPeople exploring the world find something and want to be able to search on characteristics they can seeTeachers want their students to become adept at analyzing data, and how better than to work with real numerical information about the size of organisms or their behavior or what their sensitivity is to temperature and what might happen in the face of climate changeSo while scientists were saying they needed us to provide data they could analyze, we heard the same thing from our educators, too.
Phenoscape is a database that is looking at anatomical traits in fishes. Looking just at 57 publications they have more than 500K descriptions for 2500 kinds of organisms.ZFIN is a model organism database for zebrafish, a common model organism for developmental biologists. In just this one species they have captured nearly 40,000 traits – just for ONE very well-studied SPECIES
How the Encyclopedia of Life is wrangling organismal attribute data
eol.org@eol@cydparrHow the Encyclopedia of Life iswrangling organismal attribute data
How EOL worksEOLCrowdsHarvestThird party applications
EOL TodayKey Milestones in 20131.1 million species pages240+ content providers3.3 million unique annualvisitors from 235countries
0 100000 200000 300000 400000 500000 600000 700000 800000DistributionMolecularBiologyMultiple topicsTypeInformationHabitatConservationStatusThreatsMorphologyConservationManagementTrendsSizeAssociationsUsesTrophicStrategyCyclicity & Life CyclePopulationBiologyReproductionMigrationTaxonomyLifeExpectancyIdentificationBehaviourEcologyDiseasesNumber of text objectsSubjectoftextobject
Text mining, crowdsourcing, standardizingsee http://eol.org/info/fellowsCo-occurrence, term extraction &linked dataThessen & DevriesEnvO habitat terms Pafilis et al.Altitude Specificity of FlowerColorationWrightMorphological impacts of extinctionrisk in fishChangButterfly-hostplant associations Ferrer-Parris et al.Species Interactions Poelen & Mungallet al.
14 datasets containing 25ktaxa, 422kinteractions, for 3klocationsalpha version ofingestion, normalization,aggregationalpha version of web APIalpha version of dataexportsDr. Katy Börner ledInformation VisualizationMOOCGLoBI http://globalbioticinteractions.wordpress.com/
EOL TraitBankFunded: Marine focusVirtuoso triple store, re-using URIs where possible5 datasets 128,050 data points for 20,896 taxaHarvest and display on data tabDownloads, fancy searchingMachine access
Uploads & harvests will be by spreadsheetand Darwin Core ArchiveSupport for annotation and curationPlease contact me to be part of the private beta
Easy access to analyzable trait data“Are blue organisms more common in high altitudes?”“Does the evolution of mammalian bacula appear to berelated to the pattern of promiscuous mating?”“What organisms should I collect to fill in gaps in genomequality tissue collections?”• Look for trait, download for all taxa• Create a collection of taxa, download all data• Use Reol: an R interface to EOL (Banbury, O’Meara)http://reolblog.wordpress.com/• Find more specialized data repositories
ThanksFunding & other contributionsSloan FoundationSmithsonian InstitutionDavid RubensteinMarine Biological LaboratoryHarvard UniversityOur content partnersThousands of individualcontributors, and hundreds ofvolunteer curatorsImage creditsJenny from TaipeiCynthia ParrChief Scientist @eol@cydparr firstname.lastname@example.orgAlexandria Archive: Sarah Kansa, Eric Kansa, 34 othezooarchaeologistsGLoBI: Jorrit Poelen (lead/software), Chris Mungall(ontologies), James Simons (biologist) and RobertReiz (software). Datasets shared by: Peter D.Roopnarine, Rachel Hertog, Carlos García-Robledo, James Simons, Jenny L. Wrast, C.Barnes, International Council for the Exploration ofthe Sea (ICES), Jose R. Ferrer Paris, SenolAkin, Malcolm Storey (BioInfo.org.uk), Ivy E.Baremore, Joel Sachs (SPIRE), Colt W. Cook, David A.Blewett
Quick mathIn Phenoscape57 publications had 565,158 anatomical traitdescriptions for 2,527 kinds of organisms= 223 traits/organismIn ZFIN38,189 trait descriptions for 4,727 genes for ZebraFish1.9 million species on the planet= LOTS OF TRAITS
Anatolia Zooarchaeology Case Study led byAlexandria Archive Institute1. 14 different sites2. 34+ zooarchaeologists3. Decoding, cleanup, metadata documentation4. 220,000+ specimens5. 450 entities linked to 143 EOL taxon concepts6. Anatomical entities linked to Uberon.org7. Biometrics linked to measurement ontology8. Collaborative analysishttp://opencontext.org/