SlideShare a Scribd company logo
Towards linked data conventions for
delivery of environmental data using
netCDF
LAND AND WATER FLAGSHIP | OCEANS AND ATMOSPHERE FLAGSHIP
Jonathan Yu, Nicholas Car, Adam Leadbetter*, Bruce A. Simons, and Simon J.D. Cox
CSIRO LWF and BODC/ Marine Institute (Galway)
25 March 2014 | ISESS
… netCDF-LD
Outline
1. Big data and netCDF
2. Metadata challenges
3. Linked Data, JSON-LD & CSV-on-the-web
4. netCDF-LD
5. Applications
Towards linked data conventions for delivery of environmental data using netCDF | Jonathan Yu2 |
Big data
• “90% of the world’s data has been produced over the last two
years”
• Environmental and earth observation has always been big-data…
Towards linked data conventions for delivery of environmental data using netCDF | Jonathan Yu3 |
National Soil and Landscape Grid
Towards linked data conventions for delivery of environmental data using netCDF | Jonathan Yu4 |
Hydrodynamic models
Towards linked data conventions for delivery of environmental data using netCDF | Jonathan Yu5 |
Towards linked data conventions for delivery of environmental data using netCDF | Jonathan Yu6 |
Real-time wind data
netCDF
• Persistence layer
• Simple: self-describing, user
accessible “flat file”
• Java, C++, python, others
• Interfaces to the Data Model
Towards linked data conventions for delivery of environmental data using netCDF | Jonathan Yu7 |
File format
Software library
API
netCDF-4 Data Model
netCDF cont’
• Really good at handling array-oriented data – efficient subsetting
• Multi-dimensional grids
• CF conventions
climatology and
weather forecasting
domains
Towards linked data conventions for delivery of environmental data using netCDF | Jonathan Yu9 |
netCDF metadata example
Towards linked data conventions for delivery of environmental data using netCDF | Jonathan Yu10 |
float Nap_MIM(time, latitude, longitude) ;
Nap_MIM:_FillValue = -999.f ;
Nap_MIM:long_name = "TSS, MIM SVDC on Rrs" ;
Nap_MIM:units = "mg/L" ;
Nap_MIM:valid_min = 0.01209607f ;
Nap_MIM:valid_max = 226.9626f ;
Nap_MIM:standard_name =
”total_suspended_solids”;
Use of netCDF for Chlorophyll observations
Towards linked data conventions for delivery of environmental data using netCDF | Jonathan Yu11 |
netCDF challenges
• Encoding format and framework
• “Domain agnostic” but conventions exist for communities – e.g.
Climate and Forecasting (CF conventions)
• CF conventions limited
• Use of ‘standard_name’ attribute to denote the combination of
medium/transformation/substance/state/quantity using a particular syntax
• Assumes English
• Standard names take a while to be adopted
• Standard names variations of each other
• Units are from UDUNITS – Unidata managed
• Narrow scope of scientific domain – can’t reuse for other domains
Towards linked data conventions for delivery of environmental data using netCDF | Jonathan Yu12 |
netCDF Metadata challenges – Semantic heterogeneity
Towards linked data conventions for delivery of environmental data using netCDF | Jonathan Yu13 |
Enviro
Application
#1
Data
DB
Chl_MIM
Enviro
Application
#2
Data
DB
mass_conc_
chlorophyll_
In_sea_water
Enviro
Application
#3
Data
DB
mass_conc_
chlorophyll_a_
In_sea_water
Semantic Heterogeneity… leads to data silos
Towards linked data conventions for delivery of environmental data using netCDF | Jonathan Yu14 |
Enviro
Application
#1
Enviro
Application
#2
Enviro
Application
#3
Data Data Data
DB DBDB
Chl_MIM
mass_conc_
chlorophyll_
In_sea_water
mass_conc_
chlorophyll_a_
In_sea_water
X X
Meetings
Approaches to address netCDF metadata challenge
1. Strict naming grammars for standard_name
Towards linked data conventions for delivery of environmental data using netCDF | Jonathan Yu15 |
(Peckham 2014) CSDMS Standard
Naming Conventions
Approaches to address netCDF metadata challenge (2)
2. Break up various concerns in standard_name into multiple
attributes – ref (Yu et al. 2014)
float Nap_MIM(time, latitude, longitude) ;
Nap_MIM:_FillValue = -999.f ;
Nap_MIM:long_name = "TSS, MIM SVDC on Rrs" ;
Nap_MIM:units = "mg/L" ;
Nap_MIM:valid_min = 0.01209607f ;
Nap_MIM:valid_max = 226.9626f ;
Nap_MIM:scaledQuantityKind_id
= "http://environment.data.gov.au/water/quality/def/property/solids-
total_suspended" ;
Nap_MIM:unit_id =
"http://environment.data.gov.au/water/quality/def/unit/MilliGramsPerLitre" ;
Nap_MIM:substanceOrTaxon_id =
"http://environment.data.gov.au/water/quality/def/object/solids";
Nap_MIM:medium_id =
"http://environment.data.gov.au/water/quality/def/object/ocean"
Nap_MIM:procedure_id = "http://data.ereefs.org.au/ocean-colour/MIM_SVDC_RRS" ;
Towards linked data conventions for delivery of environmental data using netCDF | Jonathan Yu16 |
Harmonised Publish, Discovery, Access and Use
Towards linked data conventions for delivery of environmental data using netCDF | Jonathan Yu17 |
Relies on community
agreed vocabularies
Describe/Publish
the data
Query/Use data
Enviro
Application
#1
Enviro
Application
#2
Enviro
Application
#3
Data Data Data
DB DBDB
substanceOrTaxon=
http://environment.data.gov.au/def/object/chlorophyll
scaledQuantityKind =
http://environment.data.gov.au/def/property/chlorophyll_
concentration
Need to communicate more consistently
Requires shared, precise, agreed semantics
Linked Data, JSON-LD & CSV-on-the-web
Towards linked data conventions for delivery of environmental data using netCDF | Jonathan Yu18 |
"LOD Cloud Diagram as of September 2011" by Anja Jentzsch
Linked Data
• Method of connect related data, existing vocabularies and other
semantics using web links (URIs) and a language for describing
resources (RDF)
• Applications can then
query data, follow the links
and infer new insights
from the data
Towards linked data conventions for delivery of environmental data using netCDF | Jonathan Yu19 |
JSON-LD
{
"name": "John Lennon",
"born": "1940-10-09",
"spouse":
"http://dbpedia.org/resource/Cynthia_Lennon"
}
Towards linked data conventions for delivery of environmental data using netCDF | Jonathan Yu20 |
JSON-LD
Decorators
JSON-LD and Semantic Web
Towards linked data conventions for delivery of environmental data using netCDF | Jonathan Yu21 |
http://dbpedia.org/re
source/John_Lennon
http://dbpedia.org/res
ource/Cynthia_Lennon
“John Lennon”
1940-10-09
CSV-on-the-web
• Linked data for CSV
tabular data
• Add context to
tables
Towards linked data conventions for delivery of environmental data using netCDF | Jonathan Yu22 |
Example: Domain vocab term
Towards linked data conventions for delivery of environmental data using netCDF | Jonathan Yu23 |
Example: Domain vocab term
Towards linked data conventions for delivery of environmental data using netCDF | Jonathan Yu24 |
...
wqp:chlorophyll_a_concentration
a skos:Concept, op:ScaledQuantityKind,
qudt:ChemistryQuantityKind ;
skos:broader wqp:chlorophyll_concentration ;
skos:prefLabel "chlorophyll a concentration"@en ;
netCDF-LD
Take the already successful and popular encoding
format and…
‘linkify’ netCDF!
25 |
netCDF-LD: Linkifying netCDF
Towards linked data conventions for delivery of environmental data using netCDF | Jonathan Yu26 |
‘Context’ boilerplates
netCDF-LD: Linkifying netCDF
Towards linked data conventions for delivery of environmental data using netCDF | Jonathan Yu27 |
Air Temperature
Definition
Air
Temperature
Medium
Quantity
Kind
Kelvin
UoM
eReefs Vocabularies
netCDF-LD: Global Attributes
Towards linked data conventions for delivery of environmental data using netCDF | Jonathan Yu28 |
Assigning URIs to variable level attributes
z:units = "meters";
z:units_ref = "http://qudt.org/vocab/unit#Meter";
z:a =
"http://environment.data.gov.au/def/op#quantityKind";
z:dcPartOf = "http://foo.bar/linked_netCDF_example";
z:valid_range = 0., 5000.;
Towards linked data conventions for delivery of environmental data using netCDF | Jonathan Yu29 |
netCDF-LD to RDF
@prefix unit: <http://qudt.org/vocab/unit#> .
@prefix qudt: <http://qudt.org/1.1/schema/qudt#> .
@prefix op: <http://environment.data.gov.au/def/op#> .
@prefix rdf: <http://www.w3.org/1999/02/22-rdf-syntax-ns#> .
@prefix dcterms: <http://purl.org/dc/terms/> .
@prefix xsd: <http://www.w3.org/2001/XMLSchema#> .
_:z qudt:unit unit:Meter;
a op:ScaledQuantityKind;
dcterms:isPartOf <http://foo.bar/linked_netCDF_example>.
Towards linked data conventions for delivery of environmental data using netCDF | Jonathan Yu30 |
Use of netCDF-LD to support Data Discovery
Towards linked data conventions for delivery of environmental data using netCDF | Jonathan Yu31 |
Data annotated with bindings to vocab URIs
Towards linked data conventions for delivery of environmental data using netCDF | Jonathan Yu32 |
THREDDS
THREDDS
Catalog
Domain Vocabs
(Water Quality at
environment.data.gov.au)
Quantities/ Units ontology
(QUDT)
substanceOrTaxon=
http://environment.data.gov.au
/def/object/chlorophyll
scaledQuantityKind
= http://environment.data.gov.au
/def/property/chlorophyll_concentra
tion
unit
= http://qudt.org/vocab/unit#Unitless
medium
= http://environment.data.gov.au
/def/feature/ocean
DBL Harvesting and End Use
Towards linked data conventions for delivery of environmental data using netCDF | Jonathan Yu33 |
Data Brokering
Layer
THREDDS
THREDDS
Catalog
Domain Vocabs
(Water Quality at
environment.data.gov.au)
Quantities/ Units ontology
(QUDT)
substanceOrTaxon=
http://environment.data.gov.au
/def/object/chlorophyll
scaledQuantityKind
= http://environment.data.gov.au
/def/property/chlorophyll_concentra
tion
unit
= http://qudt.org/vocab/unit#Unitless
medium
= http://environment.data.gov.au
/def/feature/ocean
End users
Client
application
chlorophyll
netCDF-LD benefits
• Enhanced metadata – better self-description for any domain
• Allows binding to rich semantics from existing vocabularies being
published via SKOS/OWL/RDF in multiple domains
• Linked Data approach – consistent with Semantic Web
technologies, RDF, JSON-LD, CSV-on-the-web
• Hook into toolkits and software libraries for data discovery via
Semantic Web technologies and other Linked data already being
published online
Towards linked data conventions for delivery of environmental data using netCDF | Jonathan Yu34 |
Future/Current Work
1. netCDF-LD requires adoption at community level – need to test
the proposal at OGC, Unidata, CF communities
2. Tooling for parsing netCDF-LD
3. Encoding large volumes using netCDF-LD
4. Demonstrate Data Broker concept using netCDF-LD
Towards linked data conventions for delivery of environmental data using netCDF | Jonathan Yu35 |
Summary
• netCDF is really good at big data – esp. array-oriented data
• Limitations with the current netCDF metadata structure
• *-LD pattern is emerging across multiple formats (JSON, CSV)
• Propose netCDF-LD which is an unintrusive extension on netCDF to
facilitate improved self-described datasets
• netCDF-LD has potential to enhance data discovery from
environmental dataset across to other Linked Data being
published online
Towards linked data conventions for delivery of environmental data using netCDF | Jonathan Yu36 |
LAND AND WATER
Thank you
Land and Water
Jonathan Yu
Research Software
Engineer
t +61 3 9252 6440
e jonathan.yu@csiro.au

More Related Content

What's hot

Global Research Platforms: Past, Present, Future
Global Research Platforms: Past, Present, FutureGlobal Research Platforms: Past, Present, Future
Global Research Platforms: Past, Present, Future
Larry Smarr
 
PRP, NRP, GRP & the Path Forward
PRP, NRP, GRP & the Path ForwardPRP, NRP, GRP & the Path Forward
PRP, NRP, GRP & the Path Forward
Larry Smarr
 
Peering The Pacific Research Platform With The Great Plains Network
Peering The Pacific Research Platform With The Great Plains NetworkPeering The Pacific Research Platform With The Great Plains Network
Peering The Pacific Research Platform With The Great Plains Network
Larry Smarr
 
PRP, CHASE-CI, TNRP and OSG
PRP, CHASE-CI, TNRP and OSGPRP, CHASE-CI, TNRP and OSG
PRP, CHASE-CI, TNRP and OSG
Larry Smarr
 
Panel 5. CCS projects in action - Professor Jon Gibbins, University of Edinburgh
Panel 5. CCS projects in action - Professor Jon Gibbins, University of EdinburghPanel 5. CCS projects in action - Professor Jon Gibbins, University of Edinburgh
Panel 5. CCS projects in action - Professor Jon Gibbins, University of Edinburgh
Global CCS Institute
 
The Pacific Research Platform: Building a Distributed Big-Data Machine-Learni...
The Pacific Research Platform: Building a Distributed Big-Data Machine-Learni...The Pacific Research Platform: Building a Distributed Big-Data Machine-Learni...
The Pacific Research Platform: Building a Distributed Big-Data Machine-Learni...
Larry Smarr
 
The Pacific Research Platform Enables Distributed Big-Data Machine-Learning
The Pacific Research Platform Enables Distributed Big-Data Machine-LearningThe Pacific Research Platform Enables Distributed Big-Data Machine-Learning
The Pacific Research Platform Enables Distributed Big-Data Machine-Learning
Larry Smarr
 
Open Science Data Cloud - CCA 11
Open Science Data Cloud - CCA 11Open Science Data Cloud - CCA 11
Open Science Data Cloud - CCA 11
Robert Grossman
 
The Pacific Research Platform: Building a Distributed Big Data Machine Learni...
The Pacific Research Platform: Building a Distributed Big Data Machine Learni...The Pacific Research Platform: Building a Distributed Big Data Machine Learni...
The Pacific Research Platform: Building a Distributed Big Data Machine Learni...
Larry Smarr
 
High Performance Cyberinfrastructure Is Required for the Era of Big Data
High Performance Cyberinfrastructure Is Required for the Era of Big DataHigh Performance Cyberinfrastructure Is Required for the Era of Big Data
High Performance Cyberinfrastructure Is Required for the Era of Big Data
Larry Smarr
 
2004-07-28 Fast Aerosol Sensing Tools for Natural Event Tracking FASTNET
2004-07-28 Fast Aerosol Sensing Tools for Natural Event Tracking FASTNET2004-07-28 Fast Aerosol Sensing Tools for Natural Event Tracking FASTNET
2004-07-28 Fast Aerosol Sensing Tools for Natural Event Tracking FASTNETRudolf Husar
 
Virtual Marine Geophysical Data Center July 2010 Original
Virtual Marine Geophysical Data Center July 2010 OriginalVirtual Marine Geophysical Data Center July 2010 Original
Virtual Marine Geophysical Data Center July 2010 Original
Brad Ilg
 
Semantically-Enabled Environmental Data Discovery and Integration: Demonstrat...
Semantically-Enabled Environmental Data Discovery and Integration: Demonstrat...Semantically-Enabled Environmental Data Discovery and Integration: Demonstrat...
Semantically-Enabled Environmental Data Discovery and Integration: Demonstrat...
Tatiana Tarasova
 
Lessons Learned from a Year's Worth of Benchmarking Large Data Clouds (Robert...
Lessons Learned from a Year's Worth of Benchmarking Large Data Clouds (Robert...Lessons Learned from a Year's Worth of Benchmarking Large Data Clouds (Robert...
Lessons Learned from a Year's Worth of Benchmarking Large Data Clouds (Robert...
Robert Grossman
 
Physics Research in an Era of Global Cyberinfrastructure
Physics Research in an Era of Global CyberinfrastructurePhysics Research in an Era of Global Cyberinfrastructure
Physics Research in an Era of Global Cyberinfrastructure
Larry Smarr
 
My Other Computer is a Data Center: The Sector Perspective on Big Data
My Other Computer is a Data Center: The Sector Perspective on Big DataMy Other Computer is a Data Center: The Sector Perspective on Big Data
My Other Computer is a Data Center: The Sector Perspective on Big Data
Robert Grossman
 
CENIC: Pacific Wave and PRP Update Big News for Big Data
CENIC: Pacific Wave and PRP Update Big News for Big DataCENIC: Pacific Wave and PRP Update Big News for Big Data
CENIC: Pacific Wave and PRP Update Big News for Big Data
Larry Smarr
 
The Pacific Research Platform: A Regional-Scale Big Data Analytics Cyberinfra...
The Pacific Research Platform: A Regional-Scale Big Data Analytics Cyberinfra...The Pacific Research Platform: A Regional-Scale Big Data Analytics Cyberinfra...
The Pacific Research Platform: A Regional-Scale Big Data Analytics Cyberinfra...
Larry Smarr
 
Bionimbus Cambridge Workshop (3-28-11, v7)
Bionimbus Cambridge Workshop (3-28-11, v7)Bionimbus Cambridge Workshop (3-28-11, v7)
Bionimbus Cambridge Workshop (3-28-11, v7)
Robert Grossman
 
Large Scale On-Demand Image Processing For Disaster Relief
Large Scale On-Demand Image Processing For Disaster ReliefLarge Scale On-Demand Image Processing For Disaster Relief
Large Scale On-Demand Image Processing For Disaster Relief
Robert Grossman
 

What's hot (20)

Global Research Platforms: Past, Present, Future
Global Research Platforms: Past, Present, FutureGlobal Research Platforms: Past, Present, Future
Global Research Platforms: Past, Present, Future
 
PRP, NRP, GRP & the Path Forward
PRP, NRP, GRP & the Path ForwardPRP, NRP, GRP & the Path Forward
PRP, NRP, GRP & the Path Forward
 
Peering The Pacific Research Platform With The Great Plains Network
Peering The Pacific Research Platform With The Great Plains NetworkPeering The Pacific Research Platform With The Great Plains Network
Peering The Pacific Research Platform With The Great Plains Network
 
PRP, CHASE-CI, TNRP and OSG
PRP, CHASE-CI, TNRP and OSGPRP, CHASE-CI, TNRP and OSG
PRP, CHASE-CI, TNRP and OSG
 
Panel 5. CCS projects in action - Professor Jon Gibbins, University of Edinburgh
Panel 5. CCS projects in action - Professor Jon Gibbins, University of EdinburghPanel 5. CCS projects in action - Professor Jon Gibbins, University of Edinburgh
Panel 5. CCS projects in action - Professor Jon Gibbins, University of Edinburgh
 
The Pacific Research Platform: Building a Distributed Big-Data Machine-Learni...
The Pacific Research Platform: Building a Distributed Big-Data Machine-Learni...The Pacific Research Platform: Building a Distributed Big-Data Machine-Learni...
The Pacific Research Platform: Building a Distributed Big-Data Machine-Learni...
 
The Pacific Research Platform Enables Distributed Big-Data Machine-Learning
The Pacific Research Platform Enables Distributed Big-Data Machine-LearningThe Pacific Research Platform Enables Distributed Big-Data Machine-Learning
The Pacific Research Platform Enables Distributed Big-Data Machine-Learning
 
Open Science Data Cloud - CCA 11
Open Science Data Cloud - CCA 11Open Science Data Cloud - CCA 11
Open Science Data Cloud - CCA 11
 
The Pacific Research Platform: Building a Distributed Big Data Machine Learni...
The Pacific Research Platform: Building a Distributed Big Data Machine Learni...The Pacific Research Platform: Building a Distributed Big Data Machine Learni...
The Pacific Research Platform: Building a Distributed Big Data Machine Learni...
 
High Performance Cyberinfrastructure Is Required for the Era of Big Data
High Performance Cyberinfrastructure Is Required for the Era of Big DataHigh Performance Cyberinfrastructure Is Required for the Era of Big Data
High Performance Cyberinfrastructure Is Required for the Era of Big Data
 
2004-07-28 Fast Aerosol Sensing Tools for Natural Event Tracking FASTNET
2004-07-28 Fast Aerosol Sensing Tools for Natural Event Tracking FASTNET2004-07-28 Fast Aerosol Sensing Tools for Natural Event Tracking FASTNET
2004-07-28 Fast Aerosol Sensing Tools for Natural Event Tracking FASTNET
 
Virtual Marine Geophysical Data Center July 2010 Original
Virtual Marine Geophysical Data Center July 2010 OriginalVirtual Marine Geophysical Data Center July 2010 Original
Virtual Marine Geophysical Data Center July 2010 Original
 
Semantically-Enabled Environmental Data Discovery and Integration: Demonstrat...
Semantically-Enabled Environmental Data Discovery and Integration: Demonstrat...Semantically-Enabled Environmental Data Discovery and Integration: Demonstrat...
Semantically-Enabled Environmental Data Discovery and Integration: Demonstrat...
 
Lessons Learned from a Year's Worth of Benchmarking Large Data Clouds (Robert...
Lessons Learned from a Year's Worth of Benchmarking Large Data Clouds (Robert...Lessons Learned from a Year's Worth of Benchmarking Large Data Clouds (Robert...
Lessons Learned from a Year's Worth of Benchmarking Large Data Clouds (Robert...
 
Physics Research in an Era of Global Cyberinfrastructure
Physics Research in an Era of Global CyberinfrastructurePhysics Research in an Era of Global Cyberinfrastructure
Physics Research in an Era of Global Cyberinfrastructure
 
My Other Computer is a Data Center: The Sector Perspective on Big Data
My Other Computer is a Data Center: The Sector Perspective on Big DataMy Other Computer is a Data Center: The Sector Perspective on Big Data
My Other Computer is a Data Center: The Sector Perspective on Big Data
 
CENIC: Pacific Wave and PRP Update Big News for Big Data
CENIC: Pacific Wave and PRP Update Big News for Big DataCENIC: Pacific Wave and PRP Update Big News for Big Data
CENIC: Pacific Wave and PRP Update Big News for Big Data
 
The Pacific Research Platform: A Regional-Scale Big Data Analytics Cyberinfra...
The Pacific Research Platform: A Regional-Scale Big Data Analytics Cyberinfra...The Pacific Research Platform: A Regional-Scale Big Data Analytics Cyberinfra...
The Pacific Research Platform: A Regional-Scale Big Data Analytics Cyberinfra...
 
Bionimbus Cambridge Workshop (3-28-11, v7)
Bionimbus Cambridge Workshop (3-28-11, v7)Bionimbus Cambridge Workshop (3-28-11, v7)
Bionimbus Cambridge Workshop (3-28-11, v7)
 
Large Scale On-Demand Image Processing For Disaster Relief
Large Scale On-Demand Image Processing For Disaster ReliefLarge Scale On-Demand Image Processing For Disaster Relief
Large Scale On-Demand Image Processing For Disaster Relief
 

Similar to netCDF-LD - Towards linked data conventions for delivery of environmental data using netCDF

Using the Data Cube vocabulary for Publishing Environmental Linked Data on la...
Using the Data Cube vocabulary for Publishing Environmental Linked Data on la...Using the Data Cube vocabulary for Publishing Environmental Linked Data on la...
Using the Data Cube vocabulary for Publishing Environmental Linked Data on la...
Laurent Lefort
 
First online hangout SC5 - Big Data Europe first pilot-presentation-hangout
First online hangout SC5 - Big Data Europe  first pilot-presentation-hangoutFirst online hangout SC5 - Big Data Europe  first pilot-presentation-hangout
First online hangout SC5 - Big Data Europe first pilot-presentation-hangout
BigData_Europe
 
Foss4G 2009 Scenz Grid
Foss4G 2009 Scenz GridFoss4G 2009 Scenz Grid
Foss4G 2009 Scenz Grid
noho
 
EcoTas13 BradEvans e-MAST framework
EcoTas13 BradEvans e-MAST frameworkEcoTas13 BradEvans e-MAST framework
EcoTas13 BradEvans e-MAST framework
TERN Australia
 
Knowledge Discovery in Environmental Management
Knowledge Discovery in Environmental Management Knowledge Discovery in Environmental Management
Knowledge Discovery in Environmental Management
Dr. Aparna Varde
 
The Department of Energy's Integrated Research Infrastructure (IRI)
The Department of Energy's Integrated Research Infrastructure (IRI)The Department of Energy's Integrated Research Infrastructure (IRI)
The Department of Energy's Integrated Research Infrastructure (IRI)
Globus
 
Pacific Wave and PRP Update Big News for Big Data
Pacific Wave and PRP Update Big News for Big DataPacific Wave and PRP Update Big News for Big Data
Pacific Wave and PRP Update Big News for Big Data
Larry Smarr
 
High Performance Cyberinfrastructure for Data-Intensive Research
High Performance Cyberinfrastructure for Data-Intensive ResearchHigh Performance Cyberinfrastructure for Data-Intensive Research
High Performance Cyberinfrastructure for Data-Intensive Research
Larry Smarr
 
The Pacific Research Platform
The Pacific Research PlatformThe Pacific Research Platform
The Pacific Research Platform
Larry Smarr
 
AusCover portal presentation
AusCover portal presentationAusCover portal presentation
AusCover portal presentationTERN Australia
 
ESRI DevMeetup 201100607
ESRI DevMeetup 201100607ESRI DevMeetup 201100607
Conflation, Data Quality and MADness (David Smith)
Conflation, Data Quality and MADness (David Smith)Conflation, Data Quality and MADness (David Smith)
Conflation, Data Quality and MADness (David Smith)geeknixta
 
SPatially Explicit Data Discovery, Extraction and Evaluation Services (SPEDDE...
SPatially Explicit Data Discovery, Extraction and Evaluation Services (SPEDDE...SPatially Explicit Data Discovery, Extraction and Evaluation Services (SPEDDE...
SPatially Explicit Data Discovery, Extraction and Evaluation Services (SPEDDE...
aceas13tern
 
2004-10-15 SHAirED: Services for Helping the Air-quality Community use ESE Data
2004-10-15 SHAirED: Services for Helping the Air-quality Community use ESE Data2004-10-15 SHAirED: Services for Helping the Air-quality Community use ESE Data
2004-10-15 SHAirED: Services for Helping the Air-quality Community use ESE DataRudolf Husar
 
Seeds Poster2
Seeds Poster2Seeds Poster2
Seeds Poster2
Rudolf Husar
 
2008-02-11: EPA DataFed Presentation
2008-02-11: EPA DataFed Presentation2008-02-11: EPA DataFed Presentation
2008-02-11: EPA DataFed PresentationRudolf Husar
 
Andrews, Arlyn: Towards an International Reference Network for Greenhouse Gases
Andrews, Arlyn: Towards an International Reference Network for Greenhouse GasesAndrews, Arlyn: Towards an International Reference Network for Greenhouse Gases
Andrews, Arlyn: Towards an International Reference Network for Greenhouse Gases
Integrated Carbon Observation System (ICOS)
 
TERN eMAST : Observations and terrestrial ecosystem models : Terrestrial Ecos...
TERN eMAST : Observations and terrestrial ecosystem models : Terrestrial Ecos...TERN eMAST : Observations and terrestrial ecosystem models : Terrestrial Ecos...
TERN eMAST : Observations and terrestrial ecosystem models : Terrestrial Ecos...
Brad Evans
 
EcoTas13 BradEvans e-MAST
EcoTas13 BradEvans e-MASTEcoTas13 BradEvans e-MAST
EcoTas13 BradEvans e-MAST
TERN Australia
 

Similar to netCDF-LD - Towards linked data conventions for delivery of environmental data using netCDF (20)

Using the Data Cube vocabulary for Publishing Environmental Linked Data on la...
Using the Data Cube vocabulary for Publishing Environmental Linked Data on la...Using the Data Cube vocabulary for Publishing Environmental Linked Data on la...
Using the Data Cube vocabulary for Publishing Environmental Linked Data on la...
 
First online hangout SC5 - Big Data Europe first pilot-presentation-hangout
First online hangout SC5 - Big Data Europe  first pilot-presentation-hangoutFirst online hangout SC5 - Big Data Europe  first pilot-presentation-hangout
First online hangout SC5 - Big Data Europe first pilot-presentation-hangout
 
Foss4G 2009 Scenz Grid
Foss4G 2009 Scenz GridFoss4G 2009 Scenz Grid
Foss4G 2009 Scenz Grid
 
EcoTas13 BradEvans e-MAST framework
EcoTas13 BradEvans e-MAST frameworkEcoTas13 BradEvans e-MAST framework
EcoTas13 BradEvans e-MAST framework
 
Knowledge Discovery in Environmental Management
Knowledge Discovery in Environmental Management Knowledge Discovery in Environmental Management
Knowledge Discovery in Environmental Management
 
The Department of Energy's Integrated Research Infrastructure (IRI)
The Department of Energy's Integrated Research Infrastructure (IRI)The Department of Energy's Integrated Research Infrastructure (IRI)
The Department of Energy's Integrated Research Infrastructure (IRI)
 
Pacific Wave and PRP Update Big News for Big Data
Pacific Wave and PRP Update Big News for Big DataPacific Wave and PRP Update Big News for Big Data
Pacific Wave and PRP Update Big News for Big Data
 
Andy Fox DART
Andy Fox DARTAndy Fox DART
Andy Fox DART
 
High Performance Cyberinfrastructure for Data-Intensive Research
High Performance Cyberinfrastructure for Data-Intensive ResearchHigh Performance Cyberinfrastructure for Data-Intensive Research
High Performance Cyberinfrastructure for Data-Intensive Research
 
The Pacific Research Platform
The Pacific Research PlatformThe Pacific Research Platform
The Pacific Research Platform
 
AusCover portal presentation
AusCover portal presentationAusCover portal presentation
AusCover portal presentation
 
ESRI DevMeetup 201100607
ESRI DevMeetup 201100607ESRI DevMeetup 201100607
ESRI DevMeetup 201100607
 
Conflation, Data Quality and MADness (David Smith)
Conflation, Data Quality and MADness (David Smith)Conflation, Data Quality and MADness (David Smith)
Conflation, Data Quality and MADness (David Smith)
 
SPatially Explicit Data Discovery, Extraction and Evaluation Services (SPEDDE...
SPatially Explicit Data Discovery, Extraction and Evaluation Services (SPEDDE...SPatially Explicit Data Discovery, Extraction and Evaluation Services (SPEDDE...
SPatially Explicit Data Discovery, Extraction and Evaluation Services (SPEDDE...
 
2004-10-15 SHAirED: Services for Helping the Air-quality Community use ESE Data
2004-10-15 SHAirED: Services for Helping the Air-quality Community use ESE Data2004-10-15 SHAirED: Services for Helping the Air-quality Community use ESE Data
2004-10-15 SHAirED: Services for Helping the Air-quality Community use ESE Data
 
Seeds Poster2
Seeds Poster2Seeds Poster2
Seeds Poster2
 
2008-02-11: EPA DataFed Presentation
2008-02-11: EPA DataFed Presentation2008-02-11: EPA DataFed Presentation
2008-02-11: EPA DataFed Presentation
 
Andrews, Arlyn: Towards an International Reference Network for Greenhouse Gases
Andrews, Arlyn: Towards an International Reference Network for Greenhouse GasesAndrews, Arlyn: Towards an International Reference Network for Greenhouse Gases
Andrews, Arlyn: Towards an International Reference Network for Greenhouse Gases
 
TERN eMAST : Observations and terrestrial ecosystem models : Terrestrial Ecos...
TERN eMAST : Observations and terrestrial ecosystem models : Terrestrial Ecos...TERN eMAST : Observations and terrestrial ecosystem models : Terrestrial Ecos...
TERN eMAST : Observations and terrestrial ecosystem models : Terrestrial Ecos...
 
EcoTas13 BradEvans e-MAST
EcoTas13 BradEvans e-MASTEcoTas13 BradEvans e-MAST
EcoTas13 BradEvans e-MAST
 

More from Jonathan Yu

Visualising the Australian open data and research data landscape
Visualising the Australian open data and research data landscapeVisualising the Australian open data and research data landscape
Visualising the Australian open data and research data landscape
Jonathan Yu
 
OzNome 5-star Data Ratings
OzNome 5-star Data RatingsOzNome 5-star Data Ratings
OzNome 5-star Data Ratings
Jonathan Yu
 
Australian Open government and research data pilot survey 2017
Australian Open government and research data pilot survey 2017Australian Open government and research data pilot survey 2017
Australian Open government and research data pilot survey 2017
Jonathan Yu
 
WESCML: A Data Standard for Exchanging Water and Energy Supply and Consumptio...
WESCML: A Data Standard for Exchanging Water and Energy Supply and Consumptio...WESCML: A Data Standard for Exchanging Water and Energy Supply and Consumptio...
WESCML: A Data Standard for Exchanging Water and Energy Supply and Consumptio...
Jonathan Yu
 
An OzNome as Infrastructure Value Proposition
An OzNome as Infrastructure Value PropositionAn OzNome as Infrastructure Value Proposition
An OzNome as Infrastructure Value Proposition
Jonathan Yu
 
eReefs Data Brokering Layer
eReefs Data Brokering LayereReefs Data Brokering Layer
eReefs Data Brokering Layer
Jonathan Yu
 

More from Jonathan Yu (6)

Visualising the Australian open data and research data landscape
Visualising the Australian open data and research data landscapeVisualising the Australian open data and research data landscape
Visualising the Australian open data and research data landscape
 
OzNome 5-star Data Ratings
OzNome 5-star Data RatingsOzNome 5-star Data Ratings
OzNome 5-star Data Ratings
 
Australian Open government and research data pilot survey 2017
Australian Open government and research data pilot survey 2017Australian Open government and research data pilot survey 2017
Australian Open government and research data pilot survey 2017
 
WESCML: A Data Standard for Exchanging Water and Energy Supply and Consumptio...
WESCML: A Data Standard for Exchanging Water and Energy Supply and Consumptio...WESCML: A Data Standard for Exchanging Water and Energy Supply and Consumptio...
WESCML: A Data Standard for Exchanging Water and Energy Supply and Consumptio...
 
An OzNome as Infrastructure Value Proposition
An OzNome as Infrastructure Value PropositionAn OzNome as Infrastructure Value Proposition
An OzNome as Infrastructure Value Proposition
 
eReefs Data Brokering Layer
eReefs Data Brokering LayereReefs Data Brokering Layer
eReefs Data Brokering Layer
 

Recently uploaded

Data_and_Analytics_Essentials_Architect_an_Analytics_Platform.pptx
Data_and_Analytics_Essentials_Architect_an_Analytics_Platform.pptxData_and_Analytics_Essentials_Architect_an_Analytics_Platform.pptx
Data_and_Analytics_Essentials_Architect_an_Analytics_Platform.pptx
AnirbanRoy608946
 
一比一原版(BCU毕业证书)伯明翰城市大学毕业证如何办理
一比一原版(BCU毕业证书)伯明翰城市大学毕业证如何办理一比一原版(BCU毕业证书)伯明翰城市大学毕业证如何办理
一比一原版(BCU毕业证书)伯明翰城市大学毕业证如何办理
dwreak4tg
 
办(uts毕业证书)悉尼科技大学毕业证学历证书原版一模一样
办(uts毕业证书)悉尼科技大学毕业证学历证书原版一模一样办(uts毕业证书)悉尼科技大学毕业证学历证书原版一模一样
办(uts毕业证书)悉尼科技大学毕业证学历证书原版一模一样
apvysm8
 
一比一原版(CBU毕业证)卡普顿大学毕业证如何办理
一比一原版(CBU毕业证)卡普顿大学毕业证如何办理一比一原版(CBU毕业证)卡普顿大学毕业证如何办理
一比一原版(CBU毕业证)卡普顿大学毕业证如何办理
ahzuo
 
Influence of Marketing Strategy and Market Competition on Business Plan
Influence of Marketing Strategy and Market Competition on Business PlanInfluence of Marketing Strategy and Market Competition on Business Plan
Influence of Marketing Strategy and Market Competition on Business Plan
jerlynmaetalle
 
一比一原版(Coventry毕业证书)考文垂大学毕业证如何办理
一比一原版(Coventry毕业证书)考文垂大学毕业证如何办理一比一原版(Coventry毕业证书)考文垂大学毕业证如何办理
一比一原版(Coventry毕业证书)考文垂大学毕业证如何办理
74nqk8xf
 
The affect of service quality and online reviews on customer loyalty in the E...
The affect of service quality and online reviews on customer loyalty in the E...The affect of service quality and online reviews on customer loyalty in the E...
The affect of service quality and online reviews on customer loyalty in the E...
jerlynmaetalle
 
Enhanced Enterprise Intelligence with your personal AI Data Copilot.pdf
Enhanced Enterprise Intelligence with your personal AI Data Copilot.pdfEnhanced Enterprise Intelligence with your personal AI Data Copilot.pdf
Enhanced Enterprise Intelligence with your personal AI Data Copilot.pdf
GetInData
 
Learn SQL from basic queries to Advance queries
Learn SQL from basic queries to Advance queriesLearn SQL from basic queries to Advance queries
Learn SQL from basic queries to Advance queries
manishkhaire30
 
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
Timothy Spann
 
Machine learning and optimization techniques for electrical drives.pptx
Machine learning and optimization techniques for electrical drives.pptxMachine learning and optimization techniques for electrical drives.pptx
Machine learning and optimization techniques for electrical drives.pptx
balafet
 
一比一原版(UofS毕业证书)萨省大学毕业证如何办理
一比一原版(UofS毕业证书)萨省大学毕业证如何办理一比一原版(UofS毕业证书)萨省大学毕业证如何办理
一比一原版(UofS毕业证书)萨省大学毕业证如何办理
v3tuleee
 
一比一原版(Adelaide毕业证书)阿德莱德大学毕业证如何办理
一比一原版(Adelaide毕业证书)阿德莱德大学毕业证如何办理一比一原版(Adelaide毕业证书)阿德莱德大学毕业证如何办理
一比一原版(Adelaide毕业证书)阿德莱德大学毕业证如何办理
slg6lamcq
 
ViewShift: Hassle-free Dynamic Policy Enforcement for Every Data Lake
ViewShift: Hassle-free Dynamic Policy Enforcement for Every Data LakeViewShift: Hassle-free Dynamic Policy Enforcement for Every Data Lake
ViewShift: Hassle-free Dynamic Policy Enforcement for Every Data Lake
Walaa Eldin Moustafa
 
做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样
做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样
做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样
axoqas
 
The Building Blocks of QuestDB, a Time Series Database
The Building Blocks of QuestDB, a Time Series DatabaseThe Building Blocks of QuestDB, a Time Series Database
The Building Blocks of QuestDB, a Time Series Database
javier ramirez
 
一比一原版(UniSA毕业证书)南澳大学毕业证如何办理
一比一原版(UniSA毕业证书)南澳大学毕业证如何办理一比一原版(UniSA毕业证书)南澳大学毕业证如何办理
一比一原版(UniSA毕业证书)南澳大学毕业证如何办理
slg6lamcq
 
Unleashing the Power of Data_ Choosing a Trusted Analytics Platform.pdf
Unleashing the Power of Data_ Choosing a Trusted Analytics Platform.pdfUnleashing the Power of Data_ Choosing a Trusted Analytics Platform.pdf
Unleashing the Power of Data_ Choosing a Trusted Analytics Platform.pdf
Enterprise Wired
 
原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样
原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样
原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样
u86oixdj
 
原版制作(swinburne毕业证书)斯威本科技大学毕业证毕业完成信一模一样
原版制作(swinburne毕业证书)斯威本科技大学毕业证毕业完成信一模一样原版制作(swinburne毕业证书)斯威本科技大学毕业证毕业完成信一模一样
原版制作(swinburne毕业证书)斯威本科技大学毕业证毕业完成信一模一样
u86oixdj
 

Recently uploaded (20)

Data_and_Analytics_Essentials_Architect_an_Analytics_Platform.pptx
Data_and_Analytics_Essentials_Architect_an_Analytics_Platform.pptxData_and_Analytics_Essentials_Architect_an_Analytics_Platform.pptx
Data_and_Analytics_Essentials_Architect_an_Analytics_Platform.pptx
 
一比一原版(BCU毕业证书)伯明翰城市大学毕业证如何办理
一比一原版(BCU毕业证书)伯明翰城市大学毕业证如何办理一比一原版(BCU毕业证书)伯明翰城市大学毕业证如何办理
一比一原版(BCU毕业证书)伯明翰城市大学毕业证如何办理
 
办(uts毕业证书)悉尼科技大学毕业证学历证书原版一模一样
办(uts毕业证书)悉尼科技大学毕业证学历证书原版一模一样办(uts毕业证书)悉尼科技大学毕业证学历证书原版一模一样
办(uts毕业证书)悉尼科技大学毕业证学历证书原版一模一样
 
一比一原版(CBU毕业证)卡普顿大学毕业证如何办理
一比一原版(CBU毕业证)卡普顿大学毕业证如何办理一比一原版(CBU毕业证)卡普顿大学毕业证如何办理
一比一原版(CBU毕业证)卡普顿大学毕业证如何办理
 
Influence of Marketing Strategy and Market Competition on Business Plan
Influence of Marketing Strategy and Market Competition on Business PlanInfluence of Marketing Strategy and Market Competition on Business Plan
Influence of Marketing Strategy and Market Competition on Business Plan
 
一比一原版(Coventry毕业证书)考文垂大学毕业证如何办理
一比一原版(Coventry毕业证书)考文垂大学毕业证如何办理一比一原版(Coventry毕业证书)考文垂大学毕业证如何办理
一比一原版(Coventry毕业证书)考文垂大学毕业证如何办理
 
The affect of service quality and online reviews on customer loyalty in the E...
The affect of service quality and online reviews on customer loyalty in the E...The affect of service quality and online reviews on customer loyalty in the E...
The affect of service quality and online reviews on customer loyalty in the E...
 
Enhanced Enterprise Intelligence with your personal AI Data Copilot.pdf
Enhanced Enterprise Intelligence with your personal AI Data Copilot.pdfEnhanced Enterprise Intelligence with your personal AI Data Copilot.pdf
Enhanced Enterprise Intelligence with your personal AI Data Copilot.pdf
 
Learn SQL from basic queries to Advance queries
Learn SQL from basic queries to Advance queriesLearn SQL from basic queries to Advance queries
Learn SQL from basic queries to Advance queries
 
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
 
Machine learning and optimization techniques for electrical drives.pptx
Machine learning and optimization techniques for electrical drives.pptxMachine learning and optimization techniques for electrical drives.pptx
Machine learning and optimization techniques for electrical drives.pptx
 
一比一原版(UofS毕业证书)萨省大学毕业证如何办理
一比一原版(UofS毕业证书)萨省大学毕业证如何办理一比一原版(UofS毕业证书)萨省大学毕业证如何办理
一比一原版(UofS毕业证书)萨省大学毕业证如何办理
 
一比一原版(Adelaide毕业证书)阿德莱德大学毕业证如何办理
一比一原版(Adelaide毕业证书)阿德莱德大学毕业证如何办理一比一原版(Adelaide毕业证书)阿德莱德大学毕业证如何办理
一比一原版(Adelaide毕业证书)阿德莱德大学毕业证如何办理
 
ViewShift: Hassle-free Dynamic Policy Enforcement for Every Data Lake
ViewShift: Hassle-free Dynamic Policy Enforcement for Every Data LakeViewShift: Hassle-free Dynamic Policy Enforcement for Every Data Lake
ViewShift: Hassle-free Dynamic Policy Enforcement for Every Data Lake
 
做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样
做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样
做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样
 
The Building Blocks of QuestDB, a Time Series Database
The Building Blocks of QuestDB, a Time Series DatabaseThe Building Blocks of QuestDB, a Time Series Database
The Building Blocks of QuestDB, a Time Series Database
 
一比一原版(UniSA毕业证书)南澳大学毕业证如何办理
一比一原版(UniSA毕业证书)南澳大学毕业证如何办理一比一原版(UniSA毕业证书)南澳大学毕业证如何办理
一比一原版(UniSA毕业证书)南澳大学毕业证如何办理
 
Unleashing the Power of Data_ Choosing a Trusted Analytics Platform.pdf
Unleashing the Power of Data_ Choosing a Trusted Analytics Platform.pdfUnleashing the Power of Data_ Choosing a Trusted Analytics Platform.pdf
Unleashing the Power of Data_ Choosing a Trusted Analytics Platform.pdf
 
原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样
原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样
原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样
 
原版制作(swinburne毕业证书)斯威本科技大学毕业证毕业完成信一模一样
原版制作(swinburne毕业证书)斯威本科技大学毕业证毕业完成信一模一样原版制作(swinburne毕业证书)斯威本科技大学毕业证毕业完成信一模一样
原版制作(swinburne毕业证书)斯威本科技大学毕业证毕业完成信一模一样
 

netCDF-LD - Towards linked data conventions for delivery of environmental data using netCDF

  • 1. Towards linked data conventions for delivery of environmental data using netCDF LAND AND WATER FLAGSHIP | OCEANS AND ATMOSPHERE FLAGSHIP Jonathan Yu, Nicholas Car, Adam Leadbetter*, Bruce A. Simons, and Simon J.D. Cox CSIRO LWF and BODC/ Marine Institute (Galway) 25 March 2014 | ISESS … netCDF-LD
  • 2. Outline 1. Big data and netCDF 2. Metadata challenges 3. Linked Data, JSON-LD & CSV-on-the-web 4. netCDF-LD 5. Applications Towards linked data conventions for delivery of environmental data using netCDF | Jonathan Yu2 |
  • 3. Big data • “90% of the world’s data has been produced over the last two years” • Environmental and earth observation has always been big-data… Towards linked data conventions for delivery of environmental data using netCDF | Jonathan Yu3 |
  • 4. National Soil and Landscape Grid Towards linked data conventions for delivery of environmental data using netCDF | Jonathan Yu4 |
  • 5. Hydrodynamic models Towards linked data conventions for delivery of environmental data using netCDF | Jonathan Yu5 |
  • 6. Towards linked data conventions for delivery of environmental data using netCDF | Jonathan Yu6 | Real-time wind data
  • 7. netCDF • Persistence layer • Simple: self-describing, user accessible “flat file” • Java, C++, python, others • Interfaces to the Data Model Towards linked data conventions for delivery of environmental data using netCDF | Jonathan Yu7 | File format Software library API
  • 9. netCDF cont’ • Really good at handling array-oriented data – efficient subsetting • Multi-dimensional grids • CF conventions climatology and weather forecasting domains Towards linked data conventions for delivery of environmental data using netCDF | Jonathan Yu9 |
  • 10. netCDF metadata example Towards linked data conventions for delivery of environmental data using netCDF | Jonathan Yu10 | float Nap_MIM(time, latitude, longitude) ; Nap_MIM:_FillValue = -999.f ; Nap_MIM:long_name = "TSS, MIM SVDC on Rrs" ; Nap_MIM:units = "mg/L" ; Nap_MIM:valid_min = 0.01209607f ; Nap_MIM:valid_max = 226.9626f ; Nap_MIM:standard_name = ”total_suspended_solids”;
  • 11. Use of netCDF for Chlorophyll observations Towards linked data conventions for delivery of environmental data using netCDF | Jonathan Yu11 |
  • 12. netCDF challenges • Encoding format and framework • “Domain agnostic” but conventions exist for communities – e.g. Climate and Forecasting (CF conventions) • CF conventions limited • Use of ‘standard_name’ attribute to denote the combination of medium/transformation/substance/state/quantity using a particular syntax • Assumes English • Standard names take a while to be adopted • Standard names variations of each other • Units are from UDUNITS – Unidata managed • Narrow scope of scientific domain – can’t reuse for other domains Towards linked data conventions for delivery of environmental data using netCDF | Jonathan Yu12 |
  • 13. netCDF Metadata challenges – Semantic heterogeneity Towards linked data conventions for delivery of environmental data using netCDF | Jonathan Yu13 | Enviro Application #1 Data DB Chl_MIM Enviro Application #2 Data DB mass_conc_ chlorophyll_ In_sea_water Enviro Application #3 Data DB mass_conc_ chlorophyll_a_ In_sea_water
  • 14. Semantic Heterogeneity… leads to data silos Towards linked data conventions for delivery of environmental data using netCDF | Jonathan Yu14 | Enviro Application #1 Enviro Application #2 Enviro Application #3 Data Data Data DB DBDB Chl_MIM mass_conc_ chlorophyll_ In_sea_water mass_conc_ chlorophyll_a_ In_sea_water X X Meetings
  • 15. Approaches to address netCDF metadata challenge 1. Strict naming grammars for standard_name Towards linked data conventions for delivery of environmental data using netCDF | Jonathan Yu15 | (Peckham 2014) CSDMS Standard Naming Conventions
  • 16. Approaches to address netCDF metadata challenge (2) 2. Break up various concerns in standard_name into multiple attributes – ref (Yu et al. 2014) float Nap_MIM(time, latitude, longitude) ; Nap_MIM:_FillValue = -999.f ; Nap_MIM:long_name = "TSS, MIM SVDC on Rrs" ; Nap_MIM:units = "mg/L" ; Nap_MIM:valid_min = 0.01209607f ; Nap_MIM:valid_max = 226.9626f ; Nap_MIM:scaledQuantityKind_id = "http://environment.data.gov.au/water/quality/def/property/solids- total_suspended" ; Nap_MIM:unit_id = "http://environment.data.gov.au/water/quality/def/unit/MilliGramsPerLitre" ; Nap_MIM:substanceOrTaxon_id = "http://environment.data.gov.au/water/quality/def/object/solids"; Nap_MIM:medium_id = "http://environment.data.gov.au/water/quality/def/object/ocean" Nap_MIM:procedure_id = "http://data.ereefs.org.au/ocean-colour/MIM_SVDC_RRS" ; Towards linked data conventions for delivery of environmental data using netCDF | Jonathan Yu16 |
  • 17. Harmonised Publish, Discovery, Access and Use Towards linked data conventions for delivery of environmental data using netCDF | Jonathan Yu17 | Relies on community agreed vocabularies Describe/Publish the data Query/Use data Enviro Application #1 Enviro Application #2 Enviro Application #3 Data Data Data DB DBDB substanceOrTaxon= http://environment.data.gov.au/def/object/chlorophyll scaledQuantityKind = http://environment.data.gov.au/def/property/chlorophyll_ concentration Need to communicate more consistently Requires shared, precise, agreed semantics
  • 18. Linked Data, JSON-LD & CSV-on-the-web Towards linked data conventions for delivery of environmental data using netCDF | Jonathan Yu18 | "LOD Cloud Diagram as of September 2011" by Anja Jentzsch
  • 19. Linked Data • Method of connect related data, existing vocabularies and other semantics using web links (URIs) and a language for describing resources (RDF) • Applications can then query data, follow the links and infer new insights from the data Towards linked data conventions for delivery of environmental data using netCDF | Jonathan Yu19 |
  • 20. JSON-LD { "name": "John Lennon", "born": "1940-10-09", "spouse": "http://dbpedia.org/resource/Cynthia_Lennon" } Towards linked data conventions for delivery of environmental data using netCDF | Jonathan Yu20 | JSON-LD Decorators
  • 21. JSON-LD and Semantic Web Towards linked data conventions for delivery of environmental data using netCDF | Jonathan Yu21 | http://dbpedia.org/re source/John_Lennon http://dbpedia.org/res ource/Cynthia_Lennon “John Lennon” 1940-10-09
  • 22. CSV-on-the-web • Linked data for CSV tabular data • Add context to tables Towards linked data conventions for delivery of environmental data using netCDF | Jonathan Yu22 |
  • 23. Example: Domain vocab term Towards linked data conventions for delivery of environmental data using netCDF | Jonathan Yu23 |
  • 24. Example: Domain vocab term Towards linked data conventions for delivery of environmental data using netCDF | Jonathan Yu24 | ... wqp:chlorophyll_a_concentration a skos:Concept, op:ScaledQuantityKind, qudt:ChemistryQuantityKind ; skos:broader wqp:chlorophyll_concentration ; skos:prefLabel "chlorophyll a concentration"@en ;
  • 25. netCDF-LD Take the already successful and popular encoding format and… ‘linkify’ netCDF! 25 |
  • 26. netCDF-LD: Linkifying netCDF Towards linked data conventions for delivery of environmental data using netCDF | Jonathan Yu26 | ‘Context’ boilerplates
  • 27. netCDF-LD: Linkifying netCDF Towards linked data conventions for delivery of environmental data using netCDF | Jonathan Yu27 | Air Temperature Definition Air Temperature Medium Quantity Kind Kelvin UoM eReefs Vocabularies
  • 28. netCDF-LD: Global Attributes Towards linked data conventions for delivery of environmental data using netCDF | Jonathan Yu28 |
  • 29. Assigning URIs to variable level attributes z:units = "meters"; z:units_ref = "http://qudt.org/vocab/unit#Meter"; z:a = "http://environment.data.gov.au/def/op#quantityKind"; z:dcPartOf = "http://foo.bar/linked_netCDF_example"; z:valid_range = 0., 5000.; Towards linked data conventions for delivery of environmental data using netCDF | Jonathan Yu29 |
  • 30. netCDF-LD to RDF @prefix unit: <http://qudt.org/vocab/unit#> . @prefix qudt: <http://qudt.org/1.1/schema/qudt#> . @prefix op: <http://environment.data.gov.au/def/op#> . @prefix rdf: <http://www.w3.org/1999/02/22-rdf-syntax-ns#> . @prefix dcterms: <http://purl.org/dc/terms/> . @prefix xsd: <http://www.w3.org/2001/XMLSchema#> . _:z qudt:unit unit:Meter; a op:ScaledQuantityKind; dcterms:isPartOf <http://foo.bar/linked_netCDF_example>. Towards linked data conventions for delivery of environmental data using netCDF | Jonathan Yu30 |
  • 31. Use of netCDF-LD to support Data Discovery Towards linked data conventions for delivery of environmental data using netCDF | Jonathan Yu31 |
  • 32. Data annotated with bindings to vocab URIs Towards linked data conventions for delivery of environmental data using netCDF | Jonathan Yu32 | THREDDS THREDDS Catalog Domain Vocabs (Water Quality at environment.data.gov.au) Quantities/ Units ontology (QUDT) substanceOrTaxon= http://environment.data.gov.au /def/object/chlorophyll scaledQuantityKind = http://environment.data.gov.au /def/property/chlorophyll_concentra tion unit = http://qudt.org/vocab/unit#Unitless medium = http://environment.data.gov.au /def/feature/ocean
  • 33. DBL Harvesting and End Use Towards linked data conventions for delivery of environmental data using netCDF | Jonathan Yu33 | Data Brokering Layer THREDDS THREDDS Catalog Domain Vocabs (Water Quality at environment.data.gov.au) Quantities/ Units ontology (QUDT) substanceOrTaxon= http://environment.data.gov.au /def/object/chlorophyll scaledQuantityKind = http://environment.data.gov.au /def/property/chlorophyll_concentra tion unit = http://qudt.org/vocab/unit#Unitless medium = http://environment.data.gov.au /def/feature/ocean End users Client application chlorophyll
  • 34. netCDF-LD benefits • Enhanced metadata – better self-description for any domain • Allows binding to rich semantics from existing vocabularies being published via SKOS/OWL/RDF in multiple domains • Linked Data approach – consistent with Semantic Web technologies, RDF, JSON-LD, CSV-on-the-web • Hook into toolkits and software libraries for data discovery via Semantic Web technologies and other Linked data already being published online Towards linked data conventions for delivery of environmental data using netCDF | Jonathan Yu34 |
  • 35. Future/Current Work 1. netCDF-LD requires adoption at community level – need to test the proposal at OGC, Unidata, CF communities 2. Tooling for parsing netCDF-LD 3. Encoding large volumes using netCDF-LD 4. Demonstrate Data Broker concept using netCDF-LD Towards linked data conventions for delivery of environmental data using netCDF | Jonathan Yu35 |
  • 36. Summary • netCDF is really good at big data – esp. array-oriented data • Limitations with the current netCDF metadata structure • *-LD pattern is emerging across multiple formats (JSON, CSV) • Propose netCDF-LD which is an unintrusive extension on netCDF to facilitate improved self-described datasets • netCDF-LD has potential to enhance data discovery from environmental dataset across to other Linked Data being published online Towards linked data conventions for delivery of environmental data using netCDF | Jonathan Yu36 |
  • 37. LAND AND WATER Thank you Land and Water Jonathan Yu Research Software Engineer t +61 3 9252 6440 e jonathan.yu@csiro.au