SlideShare a Scribd company logo
Exploration in Web Science:
Instruments for Web
Observatories
Observatories
Presented by:
Kristine Gloria
Co-authors: Deborah McGuinness and Joanne Luciano
The Tetherless World Constellation
Rensselaer Polytechnic Institute, Troy, NY
With thanks to the extended RPI Tetherless World Team
Agenda
6
I. Web Observatories at RPI’s Web Science
Research Center
II. Web Observatory Themes
III. Science Data
IV. Health and Life Sciences,
V. Open Government
VI. Social Spaces
Web Observatories @ WSRC
At RPI WSRC, our observatories present both
tools and methodologies that empower
researchers to study the web and to make a
difference in the world
Web Observatories Themes
Science Data Observatory
Health & Life Sciences
Observatory
Open Government Observatory
Social Spaces Observatory
Web Observatory Theme
Open Government Observatory
Open Government Data
TWC –Intl Open Government Data Sets
Web Observatories Themes
Science Data Observatory
SemantAqua
• Enable/Empower citizens &
scientists to explore pollution
sites, facilities, regulations, and
health impacts along with
provenance
• Demonstrates semantic
monitoring possibilities
• Extend to endangered species
and resource mgr issues
• Explanations and Provenance
available
1
2 3
45
1. Map view of analyzed results
2. Explanation of pollution
3. Possible health effect of contaminant (from EPA)
4. Filtering by facet to select type of data
5. Link for reporting problems
6. Extended with input from USGS, with population counts for birds & fish
Example Workflow
(SemantAqua)
ArchiveArchive
CSV2RDF4LOD
Enhance
CSV2RDF4LOD
Enhance
derive derive
integrate
archive
PublishPublish
CSV2RDF4LOD
Direct
CSV2RDF4LOD
Direct visualizevisualize
8
Semantic Methodology and
Semantic Application Evolution
5
Originally developed for Virtual Observatories (in solar
terrestrial) , now in water quality, Sea ice, volcanology,
mycology, oceans…. …
McGuinness, Fox, West, Garcia, Cinquini, Benedict,
Middleton The Virtual Solar-Terrestrial Observatory: A
Deployed Semantic Web Application Case Study for
Scientific Research. Proc. 19 Conf. on Innovative
Applications of Artificial Intelligence (IAAI-07),
http://www.vsto.org
SemantAqua -> SemantEco -> DataOne
modularizing, broadening,
provenance, interaction
VSTO -> SESDI -> SPCDIS
- modularizing, provenance,
broadening, interaction
Web Observatory Theme
Health & Life Sciences
Observatory
Department of Health and Human Services'
Developer Challenge
Developer Challenge
6
In June 2012, HHS issued the first of its seven challenges calling for
developers “to make high value health data more accessible to
entrepreneurs, researchers, and policy makers in the hopes of better
health outcomes for all.”
A group from RPI TWC won first place in the competition, by using
semantic technologies and in-house developed software, such as
csv2rdf4lod, LODSPeaKr, Farrah and DataFAQS.
HHS wanted Metadata
"... application of existing voluntary consensus
standards for metadata common to all open
government data"
RPI TWC submitted:
•DCAT - W3C Data Catalog
◦Version controlled on github.
◦Extracted from their CKAN as input to
converter.
•VoID - W3C Vocabulary of Interlinked
Data
◦Organized datasets by source, dataset,
version.
◦Provided links to data dumps, Linksets to
LOD.
•PROV - W3C Provenance Interchange
Model
◦Captured during CKAN extraction, retrieval,
conversion, and publishing.
•Dublin Core Metadata Terms
◦Annotated subjects based on descriptions.
HHS wanted Classification
"...classify datasets in our growing catalog,
creating entities, attributes and relations that form
the foundations for better discovery,
integration..."
RPI TWC presented:
•Bottom-up vocabulary and entity reuse
◦Vocabulary created for each dataset
◦Enhanced datasets shifted to reuse vocabulary
and entities from other datasets.
◦Three stub vocabularies for top-level reuse.
•NCBO (Nat. Center for Biomedical Ont.)
Annotations
◦annotator/annotator.py SADI service
◦data/source/bioontology-org/annotator-
description-subject/version/retrieve.sh
HHS wanted Liquidity
"new designs ... that form the foundations for ... liquidity"
RPI TWC provided: 2B triples among 1M URIs
•Dataset Linked Data
◦Machine and Human views (via conneg)
◦Faceted search of datasets
•Dataset dumps (.ttl.gz)
◦For each dataset, and for the whole thing.
Dataset query (http://healthdata.tw.rpi.edu/sparql)
Text https://github.com/jimmccusker/twc-h
Web Observatory Themes
Social Spaces Observatory
Twitter Network Observatory
Makani, B. & Zhang, Q.
Makani, B. & Zhang, Q.
• Explores the relationships
of people and semantics in
the graph database
• Basic functions:
• Users can visualize and
analyze different types of
sub-graphs
• Preforms a set of basic
analyses for other
COSMIC Groups
How can we leverage Social
Media sites…
to identify these communities, and
stakeholders within them?
to gather requirements from these
communities?
First Responders, including Emergency Medical Personnel,
Firefighters, and Police Officers, have active online communities on
Social Media websites.
First Responders (with NIST)
McGuinness, Erickson, Chastain, Fry, Yan, Zhu
http://tw.rpi.edu/web/project/FirstResponders
Find Topics:
Find Users:
How can we leverage Social
Media sites…
to identify these communities, and
stakeholders within them?
to gather requirements from these
communities?
Questions?
6

More Related Content

What's hot

decentralization: a trend in biomedical research
decentralization: a trend in biomedical researchdecentralization: a trend in biomedical research
decentralization: a trend in biomedical research
Brian Bot
 
Building on the Atlas (of Living Australia)
Building on the Atlas (of Living Australia)Building on the Atlas (of Living Australia)
Building on the Atlas (of Living Australia)
Andrew Treloar
 
Research Data Infrastructure for Geochemistry (DFG Roundtable)
Research Data Infrastructure for Geochemistry (DFG Roundtable)Research Data Infrastructure for Geochemistry (DFG Roundtable)
Research Data Infrastructure for Geochemistry (DFG Roundtable)
Kerstin Lehnert
 
research participation as a social contract
research participation as a social contractresearch participation as a social contract
research participation as a social contract
Brian Bot
 
GeoChronos: An On-line Collaborative Platform for Earth Observation Scientists
GeoChronos: An On-line Collaborative Platform for Earth Observation ScientistsGeoChronos: An On-line Collaborative Platform for Earth Observation Scientists
GeoChronos: An On-line Collaborative Platform for Earth Observation Scientists
GeoChronos
 
Scratchpad 2014-introduction
Scratchpad 2014-introductionScratchpad 2014-introduction
Scratchpad 2014-introduction
Vince Smith
 
GeneLab Final Submitted Abstract
GeneLab Final Submitted AbstractGeneLab Final Submitted Abstract
GeneLab Final Submitted AbstractVictoria Rael
 
Filtergraph: A fast, flexible and sharable service for visualization in big d...
Filtergraph: A fast, flexible and sharable service for visualization in big d...Filtergraph: A fast, flexible and sharable service for visualization in big d...
Filtergraph: A fast, flexible and sharable service for visualization in big d...
Dan Burger
 
FAIRy stories: the FAIR Data principles in theory and in practice
FAIRy stories: the FAIR Data principles in theory and in practiceFAIRy stories: the FAIR Data principles in theory and in practice
FAIRy stories: the FAIR Data principles in theory and in practice
Carole Goble
 
The future of the commons
The future of the commonsThe future of the commons
The future of the commons
George Komatsoulis
 
Aaas Data Intensive Science And Grid
Aaas Data Intensive Science And GridAaas Data Intensive Science And Grid
Aaas Data Intensive Science And Grid
Ian Foster
 
enabling transparent, reproducible research
enabling transparent, reproducible researchenabling transparent, reproducible research
enabling transparent, reproducible research
Brian Bot
 
20160811 Big Data for Health and Medicine
20160811 Big Data for Health and Medicine20160811 Big Data for Health and Medicine
20160811 Big Data for Health and Medicine
Brian Bot
 
AusCover
AusCoverAusCover
AusCover
TERN Australia
 
2008 04 22 Jun Zhao Ldow
2008 04 22 Jun Zhao Ldow2008 04 22 Jun Zhao Ldow
2008 04 22 Jun Zhao Ldow
Jun Zhao
 
Collaborative Data Management at the University of California
Collaborative Data Management at the University of CaliforniaCollaborative Data Management at the University of California
Collaborative Data Management at the University of California
University of California Curation Center
 
LSST Education and Public Outreach (EPO)
LSST Education and Public Outreach (EPO) LSST Education and Public Outreach (EPO)
LSST Education and Public Outreach (EPO)
Amanda Bauer
 

What's hot (20)

decentralization: a trend in biomedical research
decentralization: a trend in biomedical researchdecentralization: a trend in biomedical research
decentralization: a trend in biomedical research
 
Building on the Atlas (of Living Australia)
Building on the Atlas (of Living Australia)Building on the Atlas (of Living Australia)
Building on the Atlas (of Living Australia)
 
Research Data Infrastructure for Geochemistry (DFG Roundtable)
Research Data Infrastructure for Geochemistry (DFG Roundtable)Research Data Infrastructure for Geochemistry (DFG Roundtable)
Research Data Infrastructure for Geochemistry (DFG Roundtable)
 
PhD Overview
PhD OverviewPhD Overview
PhD Overview
 
research participation as a social contract
research participation as a social contractresearch participation as a social contract
research participation as a social contract
 
FINAL POSTER
FINAL POSTERFINAL POSTER
FINAL POSTER
 
GeoChronos: An On-line Collaborative Platform for Earth Observation Scientists
GeoChronos: An On-line Collaborative Platform for Earth Observation ScientistsGeoChronos: An On-line Collaborative Platform for Earth Observation Scientists
GeoChronos: An On-line Collaborative Platform for Earth Observation Scientists
 
Scratchpad 2014-introduction
Scratchpad 2014-introductionScratchpad 2014-introduction
Scratchpad 2014-introduction
 
GeneLab Final Submitted Abstract
GeneLab Final Submitted AbstractGeneLab Final Submitted Abstract
GeneLab Final Submitted Abstract
 
Filtergraph: A fast, flexible and sharable service for visualization in big d...
Filtergraph: A fast, flexible and sharable service for visualization in big d...Filtergraph: A fast, flexible and sharable service for visualization in big d...
Filtergraph: A fast, flexible and sharable service for visualization in big d...
 
FAIRy stories: the FAIR Data principles in theory and in practice
FAIRy stories: the FAIR Data principles in theory and in practiceFAIRy stories: the FAIR Data principles in theory and in practice
FAIRy stories: the FAIR Data principles in theory and in practice
 
The future of the commons
The future of the commonsThe future of the commons
The future of the commons
 
Aaas Data Intensive Science And Grid
Aaas Data Intensive Science And GridAaas Data Intensive Science And Grid
Aaas Data Intensive Science And Grid
 
enabling transparent, reproducible research
enabling transparent, reproducible researchenabling transparent, reproducible research
enabling transparent, reproducible research
 
20160811 Big Data for Health and Medicine
20160811 Big Data for Health and Medicine20160811 Big Data for Health and Medicine
20160811 Big Data for Health and Medicine
 
E research overview gahegan bioinformatics workshop 2010
E research overview gahegan bioinformatics workshop 2010E research overview gahegan bioinformatics workshop 2010
E research overview gahegan bioinformatics workshop 2010
 
AusCover
AusCoverAusCover
AusCover
 
2008 04 22 Jun Zhao Ldow
2008 04 22 Jun Zhao Ldow2008 04 22 Jun Zhao Ldow
2008 04 22 Jun Zhao Ldow
 
Collaborative Data Management at the University of California
Collaborative Data Management at the University of CaliforniaCollaborative Data Management at the University of California
Collaborative Data Management at the University of California
 
LSST Education and Public Outreach (EPO)
LSST Education and Public Outreach (EPO) LSST Education and Public Outreach (EPO)
LSST Education and Public Outreach (EPO)
 

Viewers also liked

Instruction for installing scanner under xp
Instruction for installing scanner under xpInstruction for installing scanner under xp
Instruction for installing scanner under xpSantos Turrones
 
Building A Web Observatory Extension: Schema.org
Building A Web Observatory Extension: Schema.orgBuilding A Web Observatory Extension: Schema.org
Building A Web Observatory Extension: Schema.org
gloriakt
 
барсегян арсен
барсегян арсенбарсегян арсен
барсегян арсенKsushaVasileva
 
Covalent bonding
Covalent bondingCovalent bonding
Covalent bondingSarah Annez
 
Rapport de stage amicus anglais
Rapport de stage amicus   anglaisRapport de stage amicus   anglais
Rapport de stage amicus anglaisKillian Vaillant
 
Multiple Truths of the Semantic Web - Web Science 2013
Multiple Truths of the Semantic Web - Web Science 2013Multiple Truths of the Semantic Web - Web Science 2013
Multiple Truths of the Semantic Web - Web Science 2013
gloriakt
 
Studying Cybercrime: Raising Awareness of Objectivity & Bias
Studying Cybercrime: Raising Awareness of Objectivity & BiasStudying Cybercrime: Raising Awareness of Objectivity & Bias
Studying Cybercrime: Raising Awareness of Objectivity & Bias
gloriakt
 
A Case for Expectation Informed Design
A Case for Expectation Informed DesignA Case for Expectation Informed Design
A Case for Expectation Informed Design
gloriakt
 
Performativity of Data
Performativity of Data Performativity of Data
Performativity of Data
gloriakt
 
Big Data: A Survey of Technical and Sociotechnical Concepts
Big Data: A Survey of Technical and Sociotechnical ConceptsBig Data: A Survey of Technical and Sociotechnical Concepts
Big Data: A Survey of Technical and Sociotechnical Concepts
gloriakt
 
CAPURGANA Y SUS ALREDEDORES 2
CAPURGANA Y SUS ALREDEDORES 2CAPURGANA Y SUS ALREDEDORES 2
CAPURGANA Y SUS ALREDEDORES 2
Hostal capurgana
 
A Case for Expectation Informed Design - Full
A Case for Expectation Informed Design - FullA Case for Expectation Informed Design - Full
A Case for Expectation Informed Design - Full
gloriakt
 
Issues: What the Web Can Tell us About Human Behavior
Issues: What the Web Can Tell us About Human BehaviorIssues: What the Web Can Tell us About Human Behavior
Issues: What the Web Can Tell us About Human Behavior
gloriakt
 

Viewers also liked (16)

Instruction for installing scanner under xp
Instruction for installing scanner under xpInstruction for installing scanner under xp
Instruction for installing scanner under xp
 
Building A Web Observatory Extension: Schema.org
Building A Web Observatory Extension: Schema.orgBuilding A Web Observatory Extension: Schema.org
Building A Web Observatory Extension: Schema.org
 
soalan latih tubi
soalan latih tubisoalan latih tubi
soalan latih tubi
 
барсегян арсен
барсегян арсенбарсегян арсен
барсегян арсен
 
Covalent bonding
Covalent bondingCovalent bonding
Covalent bonding
 
Rapport de stage amicus anglais
Rapport de stage amicus   anglaisRapport de stage amicus   anglais
Rapport de stage amicus anglais
 
Multiple Truths of the Semantic Web - Web Science 2013
Multiple Truths of the Semantic Web - Web Science 2013Multiple Truths of the Semantic Web - Web Science 2013
Multiple Truths of the Semantic Web - Web Science 2013
 
Studying Cybercrime: Raising Awareness of Objectivity & Bias
Studying Cybercrime: Raising Awareness of Objectivity & BiasStudying Cybercrime: Raising Awareness of Objectivity & Bias
Studying Cybercrime: Raising Awareness of Objectivity & Bias
 
A Case for Expectation Informed Design
A Case for Expectation Informed DesignA Case for Expectation Informed Design
A Case for Expectation Informed Design
 
Performativity of Data
Performativity of Data Performativity of Data
Performativity of Data
 
Big Data: A Survey of Technical and Sociotechnical Concepts
Big Data: A Survey of Technical and Sociotechnical ConceptsBig Data: A Survey of Technical and Sociotechnical Concepts
Big Data: A Survey of Technical and Sociotechnical Concepts
 
CAPURGANA Y SUS ALREDEDORES 2
CAPURGANA Y SUS ALREDEDORES 2CAPURGANA Y SUS ALREDEDORES 2
CAPURGANA Y SUS ALREDEDORES 2
 
Ip ser
Ip serIp ser
Ip ser
 
A Case for Expectation Informed Design - Full
A Case for Expectation Informed Design - FullA Case for Expectation Informed Design - Full
A Case for Expectation Informed Design - Full
 
Issues: What the Web Can Tell us About Human Behavior
Issues: What the Web Can Tell us About Human BehaviorIssues: What the Web Can Tell us About Human Behavior
Issues: What the Web Can Tell us About Human Behavior
 
Indrakshi Dutta_Resume
Indrakshi Dutta_ResumeIndrakshi Dutta_Resume
Indrakshi Dutta_Resume
 

Similar to WOW13_RPITWC_Web Observatories

British Library Datasets Programme Feb 2011
British Library Datasets Programme Feb 2011British Library Datasets Programme Feb 2011
British Library Datasets Programme Feb 2011
Datasets at the British Library
 
How Data Commons are Changing the Way that Large Datasets Are Analyzed and Sh...
How Data Commons are Changing the Way that Large Datasets Are Analyzed and Sh...How Data Commons are Changing the Way that Large Datasets Are Analyzed and Sh...
How Data Commons are Changing the Way that Large Datasets Are Analyzed and Sh...
Robert Grossman
 
Tragedy of the (Data) Commons
Tragedy of the (Data) CommonsTragedy of the (Data) Commons
Tragedy of the (Data) Commons
James Hendler
 
Data Facilities Workshop - Panel on Current Concepts in Data Sharing & Intero...
Data Facilities Workshop - Panel on Current Concepts in Data Sharing & Intero...Data Facilities Workshop - Panel on Current Concepts in Data Sharing & Intero...
Data Facilities Workshop - Panel on Current Concepts in Data Sharing & Intero...
EarthCube
 
2021-01-27--biodiversity-informatics-gbif-(52slides)
2021-01-27--biodiversity-informatics-gbif-(52slides)2021-01-27--biodiversity-informatics-gbif-(52slides)
2021-01-27--biodiversity-informatics-gbif-(52slides)
Dag Endresen
 
The role of biodiversity informatics in GBIF, 2021-05-18
The role of biodiversity informatics in GBIF, 2021-05-18The role of biodiversity informatics in GBIF, 2021-05-18
The role of biodiversity informatics in GBIF, 2021-05-18
Dag Endresen
 
Scott Edmunds: GigaScience - a journal or a database? Lessons learned from th...
Scott Edmunds: GigaScience - a journal or a database? Lessons learned from th...Scott Edmunds: GigaScience - a journal or a database? Lessons learned from th...
Scott Edmunds: GigaScience - a journal or a database? Lessons learned from th...
GigaScience, BGI Hong Kong
 
Gobinda Chowdhury
Gobinda ChowdhuryGobinda Chowdhury
Gobinda Chowdhury
maredata
 
A VIVO VIEW OF CANCER RESEARCH: Dream, Vision and Reality
A VIVO VIEW OF CANCER RESEARCH: Dream, Vision and RealityA VIVO VIEW OF CANCER RESEARCH: Dream, Vision and Reality
A VIVO VIEW OF CANCER RESEARCH: Dream, Vision and Reality
Paul Courtney
 
2 Discovery and Acquisition of Data1.pptx
2 Discovery and Acquisition of Data1.pptx2 Discovery and Acquisition of Data1.pptx
2 Discovery and Acquisition of Data1.pptx
vijayapraba1
 
Understanding the Big Picture of e-Science
Understanding the Big Picture of e-ScienceUnderstanding the Big Picture of e-Science
Understanding the Big Picture of e-Science
Andrew Sallans
 
Biodiversity Informatics: An Interdisciplinary Challenge
Biodiversity Informatics: An Interdisciplinary ChallengeBiodiversity Informatics: An Interdisciplinary Challenge
Biodiversity Informatics: An Interdisciplinary Challenge
Bryan Heidorn
 
HKU Data Curation MLIM7350 Class 8
HKU Data Curation MLIM7350 Class 8HKU Data Curation MLIM7350 Class 8
HKU Data Curation MLIM7350 Class 8
Scott Edmunds
 
Open Data in a Big Data World: easy to say, but hard to do?
Open Data in a Big Data World: easy to say, but hard to do?Open Data in a Big Data World: easy to say, but hard to do?
Open Data in a Big Data World: easy to say, but hard to do?
LEARN Project
 
dkNET Poster ENDO 2016
dkNET Poster ENDO 2016 dkNET Poster ENDO 2016
dkNET Poster ENDO 2016
dkNET
 
High Performance Cyberinfrastructure to Support Data-Intensive Biomedical Res...
High Performance Cyberinfrastructure to Support Data-Intensive Biomedical Res...High Performance Cyberinfrastructure to Support Data-Intensive Biomedical Res...
High Performance Cyberinfrastructure to Support Data-Intensive Biomedical Res...
Larry Smarr
 
Talk at OHSU, September 25, 2013
Talk at OHSU, September 25, 2013Talk at OHSU, September 25, 2013
Talk at OHSU, September 25, 2013
Anita de Waard
 
A Data Biosphere for Biomedical Research
A Data Biosphere for Biomedical ResearchA Data Biosphere for Biomedical Research
A Data Biosphere for Biomedical Research
Robert Grossman
 
Tragedy of the Data Commons (ODSC-East, 2021)
Tragedy of the Data Commons (ODSC-East, 2021)Tragedy of the Data Commons (ODSC-East, 2021)
Tragedy of the Data Commons (ODSC-East, 2021)
James Hendler
 
Scratchpads: Building web communities supporting biodiversity science
Scratchpads: Building web communities supporting biodiversity scienceScratchpads: Building web communities supporting biodiversity science
Scratchpads: Building web communities supporting biodiversity science
Vince Smith
 

Similar to WOW13_RPITWC_Web Observatories (20)

British Library Datasets Programme Feb 2011
British Library Datasets Programme Feb 2011British Library Datasets Programme Feb 2011
British Library Datasets Programme Feb 2011
 
How Data Commons are Changing the Way that Large Datasets Are Analyzed and Sh...
How Data Commons are Changing the Way that Large Datasets Are Analyzed and Sh...How Data Commons are Changing the Way that Large Datasets Are Analyzed and Sh...
How Data Commons are Changing the Way that Large Datasets Are Analyzed and Sh...
 
Tragedy of the (Data) Commons
Tragedy of the (Data) CommonsTragedy of the (Data) Commons
Tragedy of the (Data) Commons
 
Data Facilities Workshop - Panel on Current Concepts in Data Sharing & Intero...
Data Facilities Workshop - Panel on Current Concepts in Data Sharing & Intero...Data Facilities Workshop - Panel on Current Concepts in Data Sharing & Intero...
Data Facilities Workshop - Panel on Current Concepts in Data Sharing & Intero...
 
2021-01-27--biodiversity-informatics-gbif-(52slides)
2021-01-27--biodiversity-informatics-gbif-(52slides)2021-01-27--biodiversity-informatics-gbif-(52slides)
2021-01-27--biodiversity-informatics-gbif-(52slides)
 
The role of biodiversity informatics in GBIF, 2021-05-18
The role of biodiversity informatics in GBIF, 2021-05-18The role of biodiversity informatics in GBIF, 2021-05-18
The role of biodiversity informatics in GBIF, 2021-05-18
 
Scott Edmunds: GigaScience - a journal or a database? Lessons learned from th...
Scott Edmunds: GigaScience - a journal or a database? Lessons learned from th...Scott Edmunds: GigaScience - a journal or a database? Lessons learned from th...
Scott Edmunds: GigaScience - a journal or a database? Lessons learned from th...
 
Gobinda Chowdhury
Gobinda ChowdhuryGobinda Chowdhury
Gobinda Chowdhury
 
A VIVO VIEW OF CANCER RESEARCH: Dream, Vision and Reality
A VIVO VIEW OF CANCER RESEARCH: Dream, Vision and RealityA VIVO VIEW OF CANCER RESEARCH: Dream, Vision and Reality
A VIVO VIEW OF CANCER RESEARCH: Dream, Vision and Reality
 
2 Discovery and Acquisition of Data1.pptx
2 Discovery and Acquisition of Data1.pptx2 Discovery and Acquisition of Data1.pptx
2 Discovery and Acquisition of Data1.pptx
 
Understanding the Big Picture of e-Science
Understanding the Big Picture of e-ScienceUnderstanding the Big Picture of e-Science
Understanding the Big Picture of e-Science
 
Biodiversity Informatics: An Interdisciplinary Challenge
Biodiversity Informatics: An Interdisciplinary ChallengeBiodiversity Informatics: An Interdisciplinary Challenge
Biodiversity Informatics: An Interdisciplinary Challenge
 
HKU Data Curation MLIM7350 Class 8
HKU Data Curation MLIM7350 Class 8HKU Data Curation MLIM7350 Class 8
HKU Data Curation MLIM7350 Class 8
 
Open Data in a Big Data World: easy to say, but hard to do?
Open Data in a Big Data World: easy to say, but hard to do?Open Data in a Big Data World: easy to say, but hard to do?
Open Data in a Big Data World: easy to say, but hard to do?
 
dkNET Poster ENDO 2016
dkNET Poster ENDO 2016 dkNET Poster ENDO 2016
dkNET Poster ENDO 2016
 
High Performance Cyberinfrastructure to Support Data-Intensive Biomedical Res...
High Performance Cyberinfrastructure to Support Data-Intensive Biomedical Res...High Performance Cyberinfrastructure to Support Data-Intensive Biomedical Res...
High Performance Cyberinfrastructure to Support Data-Intensive Biomedical Res...
 
Talk at OHSU, September 25, 2013
Talk at OHSU, September 25, 2013Talk at OHSU, September 25, 2013
Talk at OHSU, September 25, 2013
 
A Data Biosphere for Biomedical Research
A Data Biosphere for Biomedical ResearchA Data Biosphere for Biomedical Research
A Data Biosphere for Biomedical Research
 
Tragedy of the Data Commons (ODSC-East, 2021)
Tragedy of the Data Commons (ODSC-East, 2021)Tragedy of the Data Commons (ODSC-East, 2021)
Tragedy of the Data Commons (ODSC-East, 2021)
 
Scratchpads: Building web communities supporting biodiversity science
Scratchpads: Building web communities supporting biodiversity scienceScratchpads: Building web communities supporting biodiversity science
Scratchpads: Building web communities supporting biodiversity science
 

Recently uploaded

To Graph or Not to Graph Knowledge Graph Architectures and LLMs
To Graph or Not to Graph Knowledge Graph Architectures and LLMsTo Graph or Not to Graph Knowledge Graph Architectures and LLMs
To Graph or Not to Graph Knowledge Graph Architectures and LLMs
Paul Groth
 
GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...
GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...
GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...
Sri Ambati
 
De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...
De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...
De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...
Product School
 
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdfFIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
FIDO Alliance
 
From Daily Decisions to Bottom Line: Connecting Product Work to Revenue by VP...
From Daily Decisions to Bottom Line: Connecting Product Work to Revenue by VP...From Daily Decisions to Bottom Line: Connecting Product Work to Revenue by VP...
From Daily Decisions to Bottom Line: Connecting Product Work to Revenue by VP...
Product School
 
Neuro-symbolic is not enough, we need neuro-*semantic*
Neuro-symbolic is not enough, we need neuro-*semantic*Neuro-symbolic is not enough, we need neuro-*semantic*
Neuro-symbolic is not enough, we need neuro-*semantic*
Frank van Harmelen
 
Assuring Contact Center Experiences for Your Customers With ThousandEyes
Assuring Contact Center Experiences for Your Customers With ThousandEyesAssuring Contact Center Experiences for Your Customers With ThousandEyes
Assuring Contact Center Experiences for Your Customers With ThousandEyes
ThousandEyes
 
Securing your Kubernetes cluster_ a step-by-step guide to success !
Securing your Kubernetes cluster_ a step-by-step guide to success !Securing your Kubernetes cluster_ a step-by-step guide to success !
Securing your Kubernetes cluster_ a step-by-step guide to success !
KatiaHIMEUR1
 
Epistemic Interaction - tuning interfaces to provide information for AI support
Epistemic Interaction - tuning interfaces to provide information for AI supportEpistemic Interaction - tuning interfaces to provide information for AI support
Epistemic Interaction - tuning interfaces to provide information for AI support
Alan Dix
 
FIDO Alliance Osaka Seminar: Passkeys at Amazon.pdf
FIDO Alliance Osaka Seminar: Passkeys at Amazon.pdfFIDO Alliance Osaka Seminar: Passkeys at Amazon.pdf
FIDO Alliance Osaka Seminar: Passkeys at Amazon.pdf
FIDO Alliance
 
Monitoring Java Application Security with JDK Tools and JFR Events
Monitoring Java Application Security with JDK Tools and JFR EventsMonitoring Java Application Security with JDK Tools and JFR Events
Monitoring Java Application Security with JDK Tools and JFR Events
Ana-Maria Mihalceanu
 
How world-class product teams are winning in the AI era by CEO and Founder, P...
How world-class product teams are winning in the AI era by CEO and Founder, P...How world-class product teams are winning in the AI era by CEO and Founder, P...
How world-class product teams are winning in the AI era by CEO and Founder, P...
Product School
 
PCI PIN Basics Webinar from the Controlcase Team
PCI PIN Basics Webinar from the Controlcase TeamPCI PIN Basics Webinar from the Controlcase Team
PCI PIN Basics Webinar from the Controlcase Team
ControlCase
 
Builder.ai Founder Sachin Dev Duggal's Strategic Approach to Create an Innova...
Builder.ai Founder Sachin Dev Duggal's Strategic Approach to Create an Innova...Builder.ai Founder Sachin Dev Duggal's Strategic Approach to Create an Innova...
Builder.ai Founder Sachin Dev Duggal's Strategic Approach to Create an Innova...
Ramesh Iyer
 
FIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdf
FIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdfFIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdf
FIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdf
FIDO Alliance
 
Connector Corner: Automate dynamic content and events by pushing a button
Connector Corner: Automate dynamic content and events by pushing a buttonConnector Corner: Automate dynamic content and events by pushing a button
Connector Corner: Automate dynamic content and events by pushing a button
DianaGray10
 
Generating a custom Ruby SDK for your web service or Rails API using Smithy
Generating a custom Ruby SDK for your web service or Rails API using SmithyGenerating a custom Ruby SDK for your web service or Rails API using Smithy
Generating a custom Ruby SDK for your web service or Rails API using Smithy
g2nightmarescribd
 
FIDO Alliance Osaka Seminar: FIDO Security Aspects.pdf
FIDO Alliance Osaka Seminar: FIDO Security Aspects.pdfFIDO Alliance Osaka Seminar: FIDO Security Aspects.pdf
FIDO Alliance Osaka Seminar: FIDO Security Aspects.pdf
FIDO Alliance
 
Unsubscribed: Combat Subscription Fatigue With a Membership Mentality by Head...
Unsubscribed: Combat Subscription Fatigue With a Membership Mentality by Head...Unsubscribed: Combat Subscription Fatigue With a Membership Mentality by Head...
Unsubscribed: Combat Subscription Fatigue With a Membership Mentality by Head...
Product School
 
Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...
Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...
Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...
UiPathCommunity
 

Recently uploaded (20)

To Graph or Not to Graph Knowledge Graph Architectures and LLMs
To Graph or Not to Graph Knowledge Graph Architectures and LLMsTo Graph or Not to Graph Knowledge Graph Architectures and LLMs
To Graph or Not to Graph Knowledge Graph Architectures and LLMs
 
GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...
GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...
GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...
 
De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...
De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...
De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...
 
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdfFIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
 
From Daily Decisions to Bottom Line: Connecting Product Work to Revenue by VP...
From Daily Decisions to Bottom Line: Connecting Product Work to Revenue by VP...From Daily Decisions to Bottom Line: Connecting Product Work to Revenue by VP...
From Daily Decisions to Bottom Line: Connecting Product Work to Revenue by VP...
 
Neuro-symbolic is not enough, we need neuro-*semantic*
Neuro-symbolic is not enough, we need neuro-*semantic*Neuro-symbolic is not enough, we need neuro-*semantic*
Neuro-symbolic is not enough, we need neuro-*semantic*
 
Assuring Contact Center Experiences for Your Customers With ThousandEyes
Assuring Contact Center Experiences for Your Customers With ThousandEyesAssuring Contact Center Experiences for Your Customers With ThousandEyes
Assuring Contact Center Experiences for Your Customers With ThousandEyes
 
Securing your Kubernetes cluster_ a step-by-step guide to success !
Securing your Kubernetes cluster_ a step-by-step guide to success !Securing your Kubernetes cluster_ a step-by-step guide to success !
Securing your Kubernetes cluster_ a step-by-step guide to success !
 
Epistemic Interaction - tuning interfaces to provide information for AI support
Epistemic Interaction - tuning interfaces to provide information for AI supportEpistemic Interaction - tuning interfaces to provide information for AI support
Epistemic Interaction - tuning interfaces to provide information for AI support
 
FIDO Alliance Osaka Seminar: Passkeys at Amazon.pdf
FIDO Alliance Osaka Seminar: Passkeys at Amazon.pdfFIDO Alliance Osaka Seminar: Passkeys at Amazon.pdf
FIDO Alliance Osaka Seminar: Passkeys at Amazon.pdf
 
Monitoring Java Application Security with JDK Tools and JFR Events
Monitoring Java Application Security with JDK Tools and JFR EventsMonitoring Java Application Security with JDK Tools and JFR Events
Monitoring Java Application Security with JDK Tools and JFR Events
 
How world-class product teams are winning in the AI era by CEO and Founder, P...
How world-class product teams are winning in the AI era by CEO and Founder, P...How world-class product teams are winning in the AI era by CEO and Founder, P...
How world-class product teams are winning in the AI era by CEO and Founder, P...
 
PCI PIN Basics Webinar from the Controlcase Team
PCI PIN Basics Webinar from the Controlcase TeamPCI PIN Basics Webinar from the Controlcase Team
PCI PIN Basics Webinar from the Controlcase Team
 
Builder.ai Founder Sachin Dev Duggal's Strategic Approach to Create an Innova...
Builder.ai Founder Sachin Dev Duggal's Strategic Approach to Create an Innova...Builder.ai Founder Sachin Dev Duggal's Strategic Approach to Create an Innova...
Builder.ai Founder Sachin Dev Duggal's Strategic Approach to Create an Innova...
 
FIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdf
FIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdfFIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdf
FIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdf
 
Connector Corner: Automate dynamic content and events by pushing a button
Connector Corner: Automate dynamic content and events by pushing a buttonConnector Corner: Automate dynamic content and events by pushing a button
Connector Corner: Automate dynamic content and events by pushing a button
 
Generating a custom Ruby SDK for your web service or Rails API using Smithy
Generating a custom Ruby SDK for your web service or Rails API using SmithyGenerating a custom Ruby SDK for your web service or Rails API using Smithy
Generating a custom Ruby SDK for your web service or Rails API using Smithy
 
FIDO Alliance Osaka Seminar: FIDO Security Aspects.pdf
FIDO Alliance Osaka Seminar: FIDO Security Aspects.pdfFIDO Alliance Osaka Seminar: FIDO Security Aspects.pdf
FIDO Alliance Osaka Seminar: FIDO Security Aspects.pdf
 
Unsubscribed: Combat Subscription Fatigue With a Membership Mentality by Head...
Unsubscribed: Combat Subscription Fatigue With a Membership Mentality by Head...Unsubscribed: Combat Subscription Fatigue With a Membership Mentality by Head...
Unsubscribed: Combat Subscription Fatigue With a Membership Mentality by Head...
 
Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...
Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...
Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...
 

WOW13_RPITWC_Web Observatories

  • 1. Exploration in Web Science: Instruments for Web Observatories Observatories Presented by: Kristine Gloria Co-authors: Deborah McGuinness and Joanne Luciano The Tetherless World Constellation Rensselaer Polytechnic Institute, Troy, NY With thanks to the extended RPI Tetherless World Team
  • 2. Agenda 6 I. Web Observatories at RPI’s Web Science Research Center II. Web Observatory Themes III. Science Data IV. Health and Life Sciences, V. Open Government VI. Social Spaces
  • 3. Web Observatories @ WSRC At RPI WSRC, our observatories present both tools and methodologies that empower researchers to study the web and to make a difference in the world
  • 4. Web Observatories Themes Science Data Observatory Health & Life Sciences Observatory Open Government Observatory Social Spaces Observatory
  • 5. Web Observatory Theme Open Government Observatory
  • 6. Open Government Data TWC –Intl Open Government Data Sets
  • 8. SemantAqua • Enable/Empower citizens & scientists to explore pollution sites, facilities, regulations, and health impacts along with provenance • Demonstrates semantic monitoring possibilities • Extend to endangered species and resource mgr issues • Explanations and Provenance available 1 2 3 45 1. Map view of analyzed results 2. Explanation of pollution 3. Possible health effect of contaminant (from EPA) 4. Filtering by facet to select type of data 5. Link for reporting problems 6. Extended with input from USGS, with population counts for birds & fish
  • 10. Semantic Methodology and Semantic Application Evolution 5 Originally developed for Virtual Observatories (in solar terrestrial) , now in water quality, Sea ice, volcanology, mycology, oceans…. … McGuinness, Fox, West, Garcia, Cinquini, Benedict, Middleton The Virtual Solar-Terrestrial Observatory: A Deployed Semantic Web Application Case Study for Scientific Research. Proc. 19 Conf. on Innovative Applications of Artificial Intelligence (IAAI-07), http://www.vsto.org SemantAqua -> SemantEco -> DataOne modularizing, broadening, provenance, interaction VSTO -> SESDI -> SPCDIS - modularizing, provenance, broadening, interaction
  • 11. Web Observatory Theme Health & Life Sciences Observatory
  • 12. Department of Health and Human Services' Developer Challenge Developer Challenge 6 In June 2012, HHS issued the first of its seven challenges calling for developers “to make high value health data more accessible to entrepreneurs, researchers, and policy makers in the hopes of better health outcomes for all.” A group from RPI TWC won first place in the competition, by using semantic technologies and in-house developed software, such as csv2rdf4lod, LODSPeaKr, Farrah and DataFAQS. HHS wanted Metadata "... application of existing voluntary consensus standards for metadata common to all open government data" RPI TWC submitted: •DCAT - W3C Data Catalog ◦Version controlled on github. ◦Extracted from their CKAN as input to converter. •VoID - W3C Vocabulary of Interlinked Data ◦Organized datasets by source, dataset, version. ◦Provided links to data dumps, Linksets to LOD. •PROV - W3C Provenance Interchange Model ◦Captured during CKAN extraction, retrieval, conversion, and publishing. •Dublin Core Metadata Terms ◦Annotated subjects based on descriptions. HHS wanted Classification "...classify datasets in our growing catalog, creating entities, attributes and relations that form the foundations for better discovery, integration..." RPI TWC presented: •Bottom-up vocabulary and entity reuse ◦Vocabulary created for each dataset ◦Enhanced datasets shifted to reuse vocabulary and entities from other datasets. ◦Three stub vocabularies for top-level reuse. •NCBO (Nat. Center for Biomedical Ont.) Annotations ◦annotator/annotator.py SADI service ◦data/source/bioontology-org/annotator- description-subject/version/retrieve.sh HHS wanted Liquidity "new designs ... that form the foundations for ... liquidity" RPI TWC provided: 2B triples among 1M URIs •Dataset Linked Data ◦Machine and Human views (via conneg) ◦Faceted search of datasets •Dataset dumps (.ttl.gz) ◦For each dataset, and for the whole thing. Dataset query (http://healthdata.tw.rpi.edu/sparql) Text https://github.com/jimmccusker/twc-h
  • 13. Web Observatory Themes Social Spaces Observatory
  • 14. Twitter Network Observatory Makani, B. & Zhang, Q. Makani, B. & Zhang, Q. • Explores the relationships of people and semantics in the graph database • Basic functions: • Users can visualize and analyze different types of sub-graphs • Preforms a set of basic analyses for other COSMIC Groups
  • 15. How can we leverage Social Media sites… to identify these communities, and stakeholders within them? to gather requirements from these communities? First Responders, including Emergency Medical Personnel, Firefighters, and Police Officers, have active online communities on Social Media websites. First Responders (with NIST) McGuinness, Erickson, Chastain, Fry, Yan, Zhu http://tw.rpi.edu/web/project/FirstResponders Find Topics: Find Users: How can we leverage Social Media sites… to identify these communities, and stakeholders within them? to gather requirements from these communities?

Editor's Notes

  1. Examples from each of these observatories: 1. Science Data Observatory: A. SemantEco B. SemantAqua
  2. Examples from each of these observatories: 1. Open Government Observatory: A.Linked Open Government Data Portal B. International Open Government Dataset
  3. Semantically-enabled environmental monitoring – in this case monitoring water quality. Done initially as a student project in McGuinness’ Semantic eScience class, attracted interest of USGS and has an extension done with USGS. Currently working on a cooperative agreement with USGS to continue. Also used as a model for semantically enabling monitoring of air, soil, food, etc. Project page: http://tw.rpi.edu/web/project/SemantAQUA
  4. Examples from each of these observatories: 1. Healthy & Life Sciences Observatory: A. HealthData Challenge
  5. Examples from each of these observatories: 1. Social Spaces Data Observatory: A. Twitter Network Observatory B. First Responder Twitter Network
  6. The RPI group has been developing Twitter Network Observatory to explore the relationships of people and semantics in the graph database. The basic functions have been fulfilled,including     Users could visualize and analyze different types of sub-graphs based on the selections of topic, time range.     The Twitter Network observatory performs a set of basic analyses for other COSMIC groups and users to support their purposes. We have been working on adding new functions including     The selection based on time range, location, and sentiments.     Network (and the topological properties) can be exported to various formats to be used in other software (GraphML, XGMML, SVG, etc.).
  7. Introduction First Responders , including Emergency Medical Personnel, Firefighters, and Police Officers, have active online communities on Social Media websites. How can we leverage Social Media sites … to gather requirements for active First Responders? … to identify stakeholders within those First Responder communities? * http://www.digitalbuzzblog.com/infographic-24-hours-on-the-internet/