Ontology of citizen science @ Siena 2016 11 24
1. Workshop on “crowdsourced information &
citizen science: critical aspects and the future”
November 25th, 2016 – Luigi Ceccaroni
ENERGIC IC1203 COST action
Towards an ontology of citizen science
The representation of crowdsourced information
Luigi Ceccaroni (1000001 Labs)
Siena, November 24th, 2016
2. Index
• An ontology of citizen science
– Projects
– Tools
3. An ontology of citizen science
4. Knowledge organization
• Systems with no overall organizing rationale
– Not incorporating any organizing principle for data, information and knowledge
• Systems with inward organization
– Incorporating an organizing principle (such as a standard-based metadata schema) to bolster categorization and processing capabilities
– However, imposing constraints on adopting organizations
5. Knowledge organization
• Systems with outward organization
– Based on standards that are already accepted or in use
– Facilitating future interaction between diverse organizations by providing data to other participants in predictable and mutually agreed upon formats
– In some cases, based on the specifications of a single system (with inward organization) that became accepted over time
6. Shared standards for project metadata
• Influenced by the way that information is structured in the SciStarter database:
– US Federal Crowdsourcing and Citizen Science Catalog developed by the Wilson Center
– Atlas of Living Australia
• Data shared through a set of custom-designed APIs (at the most basic level)
7. Benefits of outward organization
• Readable by a computer
• Can enhance inter-organizational communication through a standard set of definitions based on a format like:
– RDF/XML OWL
– JavaScript Object Notation for Linked Data (JSON-LD, a method of encoding Linked Data using JSON)
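As a minimal sketch of what a machine-readable, outward-facing record could look like, the Python snippet below serializes a few project fields as JSON-LD using only the standard library. The property names (`dct:identifier`, `dct:title`, `dct:description`) are the Dublin Core terms that appear in the metadata crosswalk later in this deck; the function name `to_jsonld` and the `urn:example:project:` identifier scheme are assumptions made for illustration, not part of any of the systems discussed.

```python
import json

# Namespace of the Dublin Core terms vocabulary.
DCT = "http://purl.org/dc/terms/"


def to_jsonld(identifier: str, title: str, description: str) -> str:
    """Serialize a minimal project record as a JSON-LD string.

    Illustrative only: the "@id" scheme below is hypothetical.
    """
    document = {
        "@context": {"dct": DCT},
        "@id": f"urn:example:project:{identifier}",  # hypothetical identifier scheme
        "dct:identifier": identifier,
        "dct:title": title,
        "dct:description": description,
    }
    return json.dumps(document, indent=2, ensure_ascii=False)


if __name__ == "__main__":
    # Values taken from the Citclops instance shown in the crosswalk table.
    print(
        to_jsonld(
            "Citclops",
            "Citizens' observatory for coast and ocean optical monitoring",
            "Natural-waters optical monitoring",
        )
    )
```

Because the `@context` maps the `dct` prefix to a published namespace, any consumer that understands JSON-LD can resolve the keys to shared definitions without knowing the producing system's internal schema.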
8. An ontology of project metadata
9. CSA’s Data and metadata working group
• An international WG
Crosswalk table columns, in order:
– Description
– ALA - BioCollect (Database Name, Type, Mandatory/Optional)
– SciStarter (Database Name, Type, Required)
– PPSR-CORE (CitSci.org) (Database Name, Type, Required)
– The Federal Crowdsourcing and Citizen Science Catalog (Type, Required)
– Dublin Core
– GBIF (IPT)
– POD v1.1
– CKAN API
– DCAT
– Schema.org
– OGC
– CobWeb
– ADIwg
– ISO 19115/19110, with Data Type and Multiplicity
– Instance (Citclops) (Value, Type)
IDENTIFIERS, DESCRIPTORS & VERSIONS
• Globally unique identifier (GUID) for the project; system generated
– Mapped fields: project:projectId text M; id integer always; ProjectGUID GUID Y; cartodb_id integer y; collectionID; alternateIdentifier; identifier; id; dct:identifier
– Citclops instance: Citclops (text)
• Type of identifier indicating the remote repository
• The short name of the project that led to the creation of the dataset
– Citclops instance: Citclops (text)
• The title of the project that led to the creation of the dataset
– Mapped fields: project:name text M; title string always; ProjectName text Y; project.title; title; title; dct:title; name; gmd:name text 0..1
– Citclops instance: Citizens' observatory for coast and ocean optical monitoring (text)
• A persistent identifier of the dataset in an external repository
– Mapped fields: activity:projectActivityId text M; alternateIdentifier
– Citclops instance: Citclops (text)
• Type of identifier indicating the remote repository
• The edition or version number of the submitted dataset
– Mapped fields: additionalMetadata.hierarchyLevel; gmd:edition text 0..1
– Citclops instance: 2015_09_30 (text)
• The activity status of the project (This automatically updates based on serverDate relative to project start/end dates.)
– Mapped fields: project:status enumeration M; Derived from date range; expired boolean always; ProjectStatus text / categorical Y; project_status string Y; temporal; dct:temporal; temporal
– Citclops instance: ended (enumeration: pending, active/ongoing, ended/complete, undefined)
• How often the project information or dataset is updated
– Mapped fields: maintenance.maintenanceUpdateFrequency (controlled vocabulary) & maintenance.description.para (free text); accrualPeriodicity; dct:accrualPeriodicity
• Short text name or title of the project; title used to identify the submission
– Mapped fields: project:name + activity:name text M; project_name string y
– Citclops instance: Citclops (text)
• The unique ID for the submission
– Mapped fields: gmd:identifier text 0..n
– Citclops instance: Citclops
• The DataCite DOI minted for the submission
– Mapped fields: citation@identifier
• Instructions on how the dataset may be reused
– Mapped fields: intellectualRights.para
• ID/Name(s) of datasets related to this one
• The name of the dataset for citation purposes
– Mapped fields: project:organisationName + activity:name text M; title; gmd:title text 1..1
– Citclops instance: Citclops
• Alternative or other name given to the dataset
– Mapped fields: title@xml:lang (titles in other languages)
– Citclops instance: EyeOnWater
• Free text description of the aim, objectives or expected/intended outcomes of the project; description of what the project should accomplish
– Mapped fields: project:aim text M; goal string[64] always; intended_outcomes string y; abstract; gmd:abstract
– Citclops instance: Natural-waters optical monitoring (text)
• Project outcomes
• Suggested Dataset Objective
– Mapped fields: purpose
• Free text description of the project
– Mapped fields: project:description text M; description string always; ProjectDescription text Y; project_description string y; project.designDescription; description; notes; dct:description; description; gmd:description text 0..1
– Citclops instance: The Citclops project developed systems to retrieve and use data on natural-waters colour, transparency and fluorescence, using low-cost sensors and contextual information combined with citizen participation. (text)
• Short description of what needs to be done by the participant
– Mapped fields: project:task text M; task string[64] always; participation_tasks string y
– Citclops instance: To retrieve and use data on natural-waters colour, transparency and fluorescence, using low-cost sensors (text)
• Catch-all for any project-specific data administrators want to make available
– Mapped fields: ProjectMetadata text N; additionalInfo
– Citclops instance: Citclops is supported by the EC-FP7 Programme, grant agreement nº 308469
• International Standard Book Number (ISBN)
– Mapped fields: bibliography.citation.identifier; gmd:ISBN text 0..1
• International Standard Serial Number (ISSN)
– Mapped fields: bibliography.citation.identifier; gmd:ISSN text 0..1
DATE FIELDS
• The date the submission was published into the receiving system
– Mapped fields: pubDate
• The date and time that the project was created in the database
– Mapped fields: project:dateCreated ISODate M; date datetime always; created_at string y; issued; dct:issued; datePublished
• The date and time that project metadata was last updated
– Mapped fields: project:lastUpdated ISODate M; updated datetime always; ProjectDateLastUpdated ISO 8601 DateTime (UTC) Y; updated_at string y; additionalMetadata.dateStamp; modified; dct:modified; dateModified; gmd:date CI_Date 1..n
– Citclops instance: 2015-09-30 ("YYYY-MM-DD")
• The date that the project is planned to commence. The date on which the project began or will begin.
– Mapped fields: project:plannedStartDate ISODate M; begin_date date optional; ProjectStartYear ISO 8601 Year (UTC) N; start_date string y
– Citclops instance: 2012-10-01 ("YYYY-MM-DD")
• The date that the project is planned to end. Applicable for projects operating over a defined period of time. The date on which the project ended or will end.
– Mapped fields: project:plannedEndDate ISODate O; end_date date optional; ProjectEndDate ISO 8601 Date (UTC) N
– Citclops instance: 2015-09-30 ("YYYY-MM-DD")
• Actual start date for project
– Mapped fields: project:startDate ISODate M
– Citclops instance: 2012-10-01 ("YYYY-MM-DD")
• Actual end date for project
– Mapped fields: project:endDate ISODate O
– Citclops instance: 2015-09-30 ("YYYY-MM-DD")
• The date that the activity/survey is planned to commence.
– Mapped fields: activity:startDate ISODate M
• The date that the activity/survey is planned to end.
– Mapped fields: activity:endDate ISODate O
CONTACTS, OWNERS, SUBMITTERS & PARTICIPANTS
• Primary project coordinator: first and last name(s) of person, or name of organization
– Mapped fields: project:manager O; project_owner_name string[64] optional; ProjectCoordinator Person Object/Construct N; project_contact string y; contact; contactPoint → fn; maintainer; dcat:contactPoint → vcard:fn; provider → Person:name
– Citclops instance: Luigi Ceccaroni (text)
• Primary dataset contact: first and last name(s) of person
– Mapped fields: project:manager O; ProjectContactName Person Object/Construct Y; gov_contact string y; personnel.individualName
– Citclops instance: Luigi Ceccaroni (text)
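The table above works as a crosswalk: one concept, several schema-specific field names. A small sketch of how such a crosswalk might be used to translate a record from one schema to another follows. The field names are copied from the table; attributing `ProjectGUID`, `ProjectName` and `ProjectDescription` to PPSR-CORE is inferred from the column order, and the `CROSSWALK` structure and `translate` function are hypothetical helpers, not an API of any of the listed systems.

```python
from typing import Any, Dict

# Concept -> field name per schema. Names come from the crosswalk table;
# the PPSR-CORE attribution is an inference from the column order.
CROSSWALK: Dict[str, Dict[str, str]] = {
    "project_id": {
        "ala": "project:projectId",
        "scistarter": "id",
        "ppsr_core": "ProjectGUID",
    },
    "title": {
        "ala": "project:name",
        "scistarter": "title",
        "ppsr_core": "ProjectName",
    },
    "description": {
        "ala": "project:description",
        "scistarter": "description",
        "ppsr_core": "ProjectDescription",
    },
}


def translate(record: Dict[str, Any], source: str, target: str) -> Dict[str, Any]:
    """Rename the keys of `record` from the source schema to the target schema.

    Keys with no entry in the crosswalk are dropped rather than guessed.
    """
    rename = {names[source]: names[target] for names in CROSSWALK.values()}
    return {rename[key]: value for key, value in record.items() if key in rename}


if __name__ == "__main__":
    scistarter_record = {"id": 42, "title": "Citclops", "unmapped_field": "ignored"}
    print(translate(scistarter_record, "scistarter", "ala"))
```

Dropping unmapped keys is a deliberate choice in this sketch: where the table has an empty cell, there is no agreed equivalent, so a translation should not invent one.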
10. CSA’s Data and metadata working group
Columns shown: Description; ALA - BioCollect (Database Name, Type, Mandatory/Optional); SciStarter (Database Name, Type, Required)
IDENTIFIERS, DESCRIPTORS & VERSIONS
• Globally unique identifier (GUID) for the project; system generated
– Mapped fields: project:projectId text M; id integer always
• Type of identifier indicating the remote repository
• The short name of the project that led to the creation of the dataset
• The title of the project that led to the creation of the dataset
– Mapped fields: project:name text M; title string always
• A persistent identifier of the dataset in an external repository
– Mapped fields: activity:projectActivityId text M
• Type of identifier indicating the remote repository
• The edition or version number of the submitted dataset
• The activity status of the project (This automatically updates based on serverDate relative to project start/end dates.)
– Mapped fields: project:status enumeration M; Derived from date range; expired boolean always
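The status field is described as being derived from the server date relative to the project start and end dates. A minimal sketch of that derivation is shown below, using the enumeration values listed in the full table (pending, active/ongoing, ended/complete, undefined). The function name `project_status` and the boundary behaviour (a project is still active on its end date, and a missing start date yields "undefined") are assumptions, since the slides do not specify them.

```python
from datetime import date
from typing import Optional


def project_status(today: date, start: Optional[date], end: Optional[date]) -> str:
    """Derive a project status from a reference date and the project date range.

    Assumptions (not stated in the source): a missing start date gives
    "undefined", and the end date itself still counts as active.
    """
    if start is None:
        return "undefined"
    if today < start:
        return "pending"
    if end is not None and today > end:
        return "ended/complete"
    return "active/ongoing"


if __name__ == "__main__":
    # Citclops dates from the table: 2012-10-01 to 2015-09-30.
    print(project_status(date(2016, 11, 24), date(2012, 10, 1), date(2015, 9, 30)))
```

With the Citclops dates and a reference date after September 2015, the sketch returns "ended/complete", which agrees with the "ended" value shown for the Citclops instance.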
11. CSA’s Data and metadata working group
• ALA - BioCollect
• SciStarter
• PPSR-CORE (CitSci.org)
• The Federal Crowdsourcing and Citizen Science Catalog
• Dublin Core
• GBIF (IPT)
• POD v1.1
• CKAN API
• DCAT
• Schema.org
• OGC
• CobWeb
• ADIwg
• ISO 19115/19110
12. CSA’s Data and metadata working group
• [http://citizenscience.org/2015/11/12/introducing-the-data-and-metadata-working-group/]
• Contact people:
– Anne Bowser (co-chair), Woodrow Wilson International Center for Scholars
– Peter Brenton (ACSA liaison), Atlas of Living Australia
– Luigi Ceccaroni (ECSA liaison), 1000001 Labs
13. An ontology of tools
14. SciStarter’s tools database
Columns: Definition; Format; Values; Notes
• Note: What about Tool/Device Accessories?
• Tool Name: name of specific tool (give usable examples to define the name)
• Description
• Image
• Manufacturer: maker/producer of tool
• Model
• Vendor: provider of tool
• Domain: field of study
• Medium: sample type
• Equipment/Sensor: e.g., sensor, etc.
• Function: measure, model, analyze, observe, support data collection, recording
• Measures
• Cost: price
• Availability: Manufacturer - Build, Buy, Borrow; free text fields; time to build and ship, effort to obtain the thing, recommender system
• Accessibility: ease of use for different populations
• Portability: size/weight (shipping vs final)
• Size
• Weight: total weight of assembled tool
• Technical requirements/Add-ons
• Response Time
• Ideal Conditions
• Range of Error
• How long to set up
• Expertise needed to operate/Instructions
• Training Required?
• Skills Needed?
• Calibration Needed?
• Ages appropriate
• Detection capability
• Response time
• Active
• Frequency of use: how often do you need to check the tool, data upload
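To make the shape of a tool record concrete, here is a hedged sketch of how a subset of these fields could be typed in Python. The class and attribute names, the choice of which fields to include, and the kilogram unit for weight are all assumptions for illustration; only the six function values are taken from the slide.

```python
from dataclasses import dataclass, field
from enum import Enum
from typing import List, Optional


class Function(Enum):
    """Controlled vocabulary for the Function field, as listed on the slide."""

    MEASURE = "measure"
    MODEL = "model"
    ANALYZE = "analyze"
    OBSERVE = "observe"
    SUPPORT_DATA_COLLECTION = "support data collection"
    RECORDING = "recording"


@dataclass
class Tool:
    """Hypothetical record covering a subset of the tools-database fields."""

    name: str
    description: str = ""
    manufacturer: Optional[str] = None
    model: Optional[str] = None
    vendor: Optional[str] = None
    domain: Optional[str] = None   # field of study
    medium: Optional[str] = None   # sample type
    functions: List[Function] = field(default_factory=list)
    cost: Optional[float] = None   # price; currency not specified on the slide
    weight_kg: Optional[float] = None  # unit is an assumption
    calibration_needed: Optional[bool] = None


def parse_functions(text: str) -> List[Function]:
    """Parse a comma-separated string into Function values.

    Raises ValueError for a value outside the controlled vocabulary.
    """
    return [Function(part.strip().lower()) for part in text.split(",") if part.strip()]


if __name__ == "__main__":
    # Hypothetical example tool, not taken from the SciStarter database.
    sensor = Tool(
        name="Example water-colour sensor",
        domain="Water quality",
        functions=parse_functions("measure, Observe"),
    )
    print(sensor)
```

Modelling Function as an enumeration while leaving fields such as Availability as free text mirrors the distinction the slide draws between fixed value lists and free-text fields.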
15. SciStarter’s tools database
• [http://scistarter.com/finder]
• Contact people:
– Darlene Cavalier, SciStarter, Arizona State University
– Anne Bowser, Woodrow Wilson International Center for Scholars
16. Towards an ontology of citizen science
Luigi Ceccaroni
1000001 Labs, Research lead
Citizen science COST action 15212, Interoperability WG chair
ECSA, Board of Directors
CSA, Data and metadata working group
luigi@1000001labs.org
http://www.1000001labs.org/