WOW13_RPITWC_Web Observatories

230 views

Published on

Presentation during the WOW 2013 workshop featuring Web Observatory works created by the RPI Tetherless World Constellation

Published in: Technology
0 Comments
0 Likes
Statistics
Notes
  • Be the first to comment

  • Be the first to like this

No Downloads
Views
Total views
230
On SlideShare
0
From Embeds
0
Number of Embeds
2
Actions
Shares
0
Downloads
5
Comments
0
Likes
0
Embeds 0
No embeds

No notes for slide
  • Examples from each of these observatories: 1. Science Data Observatory: A. SemantEco B. SemantAqua
  • Examples from each of these observatories: 1. Open Government Observatory: A.Linked Open Government Data Portal B. International Open Government Dataset
  • Semantically-enabled environmental monitoring – in this case monitoring water quality. Done initially as a student project in McGuinness’ Semantic eScience class, attracted interest of USGS and has an extension done with USGS. Currently working on a cooperative agreement with USGS to continue. Also used as a model for semantically enabling monitoring of air, soil, food, etc. Project page: http://tw.rpi.edu/web/project/SemantAQUA
  • Examples from each of these observatories: 1. Healthy & Life Sciences Observatory: A. HealthData Challenge
  • Examples from each of these observatories: 1. Social Spaces Data Observatory: A. Twitter Network Observatory B. First Responder Twitter Network
  • The RPI group has been developing Twitter Network Observatory to explore the relationships of people and semantics in the graph database. The basic functions have been fulfilled,including     Users could visualize and analyze different types of sub-graphs based on the selections of topic, time range.     The Twitter Network observatory performs a set of basic analyses for other COSMIC groups and users to support their purposes. We have been working on adding new functions including     The selection based on time range, location, and sentiments.     Network (and the topological properties) can be exported to various formats to be used in other software (GraphML, XGMML, SVG, etc.).
  • Introduction First Responders , including Emergency Medical Personnel, Firefighters, and Police Officers, have active online communities on Social Media websites. How can we leverage Social Media sites … to gather requirements for active First Responders? … to identify stakeholders within those First Responder communities? * http://www.digitalbuzzblog.com/infographic-24-hours-on-the-internet/
  • WOW13_RPITWC_Web Observatories

    1. 1. Exploration in Web Science:Instruments for WebObservatoriesObservatoriesPresented by:Kristine GloriaCo-authors: Deborah McGuinness and Joanne LucianoThe Tetherless World ConstellationRensselaer Polytechnic Institute, Troy, NYWith thanks to the extended RPI Tetherless World Team
    2. 2. Agenda6I. Web Observatories at RPI’s Web ScienceResearch CenterII. Web Observatory ThemesIII. Science DataIV. Health and Life Sciences,V. Open GovernmentVI. Social Spaces
    3. 3. Web Observatories @ WSRCAt RPI WSRC, our observatories present bothtools and methodologies that empowerresearchers to study the web and to make adifference in the world
    4. 4. Web Observatories ThemesScience Data ObservatoryHealth & Life SciencesObservatoryOpen Government ObservatorySocial Spaces Observatory
    5. 5. Web Observatory ThemeOpen Government Observatory
    6. 6. Open Government DataTWC –Intl Open Government Data Sets
    7. 7. Web Observatories ThemesScience Data Observatory
    8. 8. SemantAqua• Enable/Empower citizens &scientists to explore pollutionsites, facilities, regulations, andhealth impacts along withprovenance• Demonstrates semanticmonitoring possibilities• Extend to endangered speciesand resource mgr issues• Explanations and Provenanceavailable12 3451. Map view of analyzed results2. Explanation of pollution3. Possible health effect of contaminant (from EPA)4. Filtering by facet to select type of data5. Link for reporting problems6. Extended with input from USGS, with population counts for birds & fish
    9. 9. Example Workflow(SemantAqua)ArchiveArchiveCSV2RDF4LODEnhanceCSV2RDF4LODEnhancederive deriveintegratearchivePublishPublishCSV2RDF4LODDirectCSV2RDF4LODDirect visualizevisualize8
    10. 10. Semantic Methodology andSemantic Application Evolution5Originally developed for Virtual Observatories (in solarterrestrial) , now in water quality, Sea ice, volcanology,mycology, oceans…. …McGuinness, Fox, West, Garcia, Cinquini, Benedict,Middleton The Virtual Solar-Terrestrial Observatory: ADeployed Semantic Web Application Case Study forScientific Research. Proc. 19 Conf. on InnovativeApplications of Artificial Intelligence (IAAI-07),http://www.vsto.orgSemantAqua -> SemantEco -> DataOnemodularizing, broadening,provenance, interactionVSTO -> SESDI -> SPCDIS- modularizing, provenance,broadening, interaction
    11. 11. Web Observatory ThemeHealth & Life SciencesObservatory
    12. 12. Department of Health and Human ServicesDeveloper ChallengeDeveloper Challenge6In June 2012, HHS issued the first of its seven challenges calling fordevelopers “to make high value health data more accessible toentrepreneurs, researchers, and policy makers in the hopes of betterhealth outcomes for all.”A group from RPI TWC won first place in the competition, by usingsemantic technologies and in-house developed software, such ascsv2rdf4lod, LODSPeaKr, Farrah and DataFAQS.HHS wanted Metadata"... application of existing voluntary consensusstandards for metadata common to all opengovernment data"RPI TWC submitted:•DCAT - W3C Data Catalog◦Version controlled on github.◦Extracted from their CKAN as input toconverter.•VoID - W3C Vocabulary of InterlinkedData◦Organized datasets by source, dataset,version.◦Provided links to data dumps, Linksets toLOD.•PROV - W3C Provenance InterchangeModel◦Captured during CKAN extraction, retrieval,conversion, and publishing.•Dublin Core Metadata Terms◦Annotated subjects based on descriptions.HHS wanted Classification"...classify datasets in our growing catalog,creating entities, attributes and relations that formthe foundations for better discovery,integration..."RPI TWC presented:•Bottom-up vocabulary and entity reuse◦Vocabulary created for each dataset◦Enhanced datasets shifted to reuse vocabularyand entities from other datasets.◦Three stub vocabularies for top-level reuse.•NCBO (Nat. Center for Biomedical Ont.)Annotations◦annotator/annotator.py SADI service◦data/source/bioontology-org/annotator-description-subject/version/retrieve.shHHS wanted Liquidity"new designs ... that form the foundations for ... liquidity"RPI TWC provided: 2B triples among 1M URIs•Dataset Linked Data◦Machine and Human views (via conneg)◦Faceted search of datasets•Dataset dumps (.ttl.gz)◦For each dataset, and for the whole thing.Dataset query (http://healthdata.tw.rpi.edu/sparql)Text https://github.com/jimmccusker/twc-h
    13. 13. Web Observatory ThemesSocial Spaces Observatory
    14. 14. Twitter Network ObservatoryMakani, B. & Zhang, Q.Makani, B. & Zhang, Q.• Explores the relationshipsof people and semantics inthe graph database• Basic functions:• Users can visualize andanalyze different types ofsub-graphs• Preforms a set of basicanalyses for otherCOSMIC Groups
    15. 15. How can we leverage SocialMedia sites…to identify these communities, andstakeholders within them?to gather requirements from thesecommunities?First Responders, including Emergency Medical Personnel,Firefighters, and Police Officers, have active online communities onSocial Media websites.First Responders (with NIST)McGuinness, Erickson, Chastain, Fry, Yan, Zhuhttp://tw.rpi.edu/web/project/FirstRespondersFind Topics:Find Users:How can we leverage SocialMedia sites…to identify these communities, andstakeholders within them?to gather requirements from thesecommunities?
    16. 16. Questions?6

    ×