FilteredPush                                       PI James Hanken                                       Maureen A. Kelly ...
Annotate What?●    Curatorial and scientific metadata for an    estimated 3 bn. specimens in the worlds    natural history...
http://www.flickr.com/photos/nhm_beetle_id/5556112706/
●    Ongoing programs in U.S., Europe,    Australia,...to digitize all.●    In U.S., 10 year, $500 M effort now underway; ...
The Problem: Data Quality● Collections & occurrence data is  all over the map   – … literally (off the map!)   – … even af...
(1) Kvetch about data                           (2) Push to interested parties                               (3) Filter   ...
Modeling distributed annotations ofmutable, actionable, distributed                data●    OA is almost adequate+ clear d...
Modeling distributed annotations ofmutable, actionable, distributed                data●    OA is almost adequate- Model o...
:anAnnotation a oa:Annotation;  oa:hasTarget :nsptarget1;  oa:hasBody <http://fp.org/rangeViolation> ;  oa:hasBody :invali...
Clients                                        AnnotationProcessor                                                   Annot...
Who asserted photo:owner ???
No confusion about provenance of photo:owner assertion ?
ENDhttp://wiki.filteredpush.orgVideo there in a few(?) weeks
Specimen record annotation
West coastrollout
West coastrollout
Upcoming SlideShare
Loading in...5
×

West coastrollout

177

Published on

FilteredPush use of Open Annotation Ontology presented at Stanford University, April 8 2013

Published in: Technology, Education, Business
0 Comments
0 Likes
Statistics
Notes
  • Be the first to comment

  • Be the first to like this

No Downloads
Views
Total Views
177
On Slideshare
0
From Embeds
0
Number of Embeds
0
Actions
Shares
0
Downloads
0
Comments
0
Likes
0
Embeds 0
No embeds

No notes for slide

West coastrollout

  1. 1. FilteredPush PI James Hanken Maureen A. Kelly David B. Lowery Paul J. Morris Robert A. Morris James A. MacklinUniversity of California, Davis Bertram Ludacher Tianhong Song Sven KoehlerHarvard University Herbaria Former Project Participants Lei Dou Chinua IloabachieUniversity of Massachusetts, Boston Timothy McPhillips Donna Tremonte Zhimin Wang NSF: DBI #0960535 (Production, Year 3 of 3) NSF: DBI #0646266 (Prototype, Complete) http://wiki.filteredpush.org
  2. 2. Annotate What?● Curatorial and scientific metadata for an estimated 3 bn. specimens in the worlds natural history collections. 1 bn in U.S.● Digital record of about 1.5% at most. See gbif.org (Also includes non-captured observations).
  3. 3. http://www.flickr.com/photos/nhm_beetle_id/5556112706/
  4. 4. ● Ongoing programs in U.S., Europe, Australia,...to digitize all.● In U.S., 10 year, $500 M effort now underway; automated; semi-automated; – QC issues; correlations to other paper and digital resources; intentionally changing data
  5. 5. The Problem: Data Quality● Collections & occurrence data is all over the map – … literally (off the map!) – … even after “digitization” (sensu stricto) – … and after “digitization” (sensu lato), a.k.a. “computerization”● Issues: – Lat/Long transposition, coordinate & projection issues – Data entry/creation, “fuzzy” data, naming issues, bit rot, data conversions and transformations, schema mappings, … (you name it) 6
  6. 6. (1) Kvetch about data (2) Push to interested parties (3) Filter http://xkcd.com/386/ (4) Change data in databases(5) Store all assertions
  7. 7. Modeling distributed annotations ofmutable, actionable, distributed data● OA is almost adequate+ clear distinction between annotation model and domain models;+ provenance support+ web document-centric features are optional+ SpecificResource model fits data annotation well+ Use of CNT for machine-opaque resources+ (RDF is friendly to semantic pub/sub)
  8. 8. Modeling distributed annotations ofmutable, actionable, distributed data● OA is almost adequate- Model of Evidence for assertions in Body.- Model to convey annotators Expectation to consuming apps- Model use of queries as Selectors(-) Model “transcription” as an oa:Motivation(?) Model domain associations between Target and Body
  9. 9. :anAnnotation a oa:Annotation; oa:hasTarget :nsptarget1; oa:hasBody <http://fp.org/rangeViolation> ; oa:hasBody :invalidLatitudeText.:nsptarget1 a oad:AnySuchResource; oa:hasSelector :findInvalidLatitudes.:findInvalidLatitudes a oad:SparqlQuerySelector; oad:hasQuery “select distinct ?x WHERE { ?x a dwcFP:Occurrence. ?x dwc:hasLatitude ?lat. filter (?lat > 90 || ?lat < 90).}”; […] .
  10. 10. Clients AnnotationProcessor AnnotationProcessor Specify 6 Mapper ClientLibrary Specify6 Driver SparqlRules Specify6-HUH Driver OA/OAD SQL Driver dwcFP FP-oauth-consumerMorphbank FP-oauth-providerFilteredPush instance Client Tools FP Network Triage FP-PHP-Library ClientLibrary FP Access Point SparqlRules Symbiota OA/OAD Knowledge Messaging Analysis FilteredPush instance dwcFP JMS-KVP Kepler MySQL RDF Handler Fuseki SPARQL Push Fedora Kepler Kuration Mongo
  11. 11. Who asserted photo:owner ???
  12. 12. No confusion about provenance of photo:owner assertion ?
  13. 13. ENDhttp://wiki.filteredpush.orgVideo there in a few(?) weeks
  14. 14. Specimen record annotation
  1. A particular slide catching your eye?

    Clipping is a handy way to collect important slides you want to go back to later.

×