EPA Linked Data Seminar Overview
1. Linked Data at EPA. U.S. Environmental Protection Agency, June 29, 2011. David G. Smith, PE, PLS, USEPA Office of Environmental Information, 202-566-0797, SmithG.David@epa.gov
3. History… RDF via Data.Gov
    <rdf:Description rdf:about="#entry9985">
      <hdatum_desc>NAD83</hdatum_desc>
      <state_name>NEBRASKA</state_name>
      <latitude83>40.944623</latitude83>
      <interest_types>STATE MASTER</interest_types>
      <city_name>GARLAND</city_name>
      <create_date>01-MAR-00</create_date>
      <frs_facility_detail_report_url rdf:resource="http://iaspub.epa.gov/enviro/fii_query_detail.disp_program_facility?p_registry_id=110006555085"/>
      <congressional_dist_num>01</congressional_dist_num>
      <pgm_sys_acrnms>NE-IIS</pgm_sys_acrnms>
      <epa_region_code>07</epa_region_code>
      <country_name>USA</country_name>
      <fips_code>31159</fips_code>
      <huc_code>10200203</huc_code>
      <collect_desc>ADDRESS MATCHING-HOUSE NUMBER</collect_desc>
      <primary_name>TERRI KELLER RESIDENCE</primary_name>
      <rdf:type rdf:resource="http://data-gov.tw.rpi.edu/2009/data-gov-twc.rdf#DataEntry"/>
      <ref_point_desc>ENTRANCE POINT OF A FACILITY OR STATION</ref_point_desc>
      <postal_code>683609338</postal_code>
      <registry_id>110006555085</registry_id>
      <location_address>1976 OLD MILL RD</location_address>
      <accuracy_value>30</accuracy_value>
      <update_date>06-AUG-01</update_date>
      <county_name>SEWARD</county_name>
      <conveyor>FRS</conveyor>
      <longitude83>-96.990306</longitude83>
      <state_code>NE</state_code>
      <site_type_name>STATIONARY</site_type_name>
    </rdf:Description>
A very basic serialization to RDF, lacking deeper geospatial enablement and semantic contextualization.
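To make the "lacking deeper geospatial enablement" point concrete, here is a minimal Python sketch that reads such a record and turns its plain-text coordinates into a geometry string. It is an illustration only, not EPA's tooling: the `rdf:RDF` wrapper, the absence of a default namespace, and the helper names `facility_points` and `to_wkt` are all assumptions, and the sample is a trimmed copy of the record above.

```python
import xml.etree.ElementTree as ET

RDF_NS = "http://www.w3.org/1999/02/22-rdf-syntax-ns#"

# Trimmed copy of the record above, wrapped in an rdf:RDF root so that it
# parses on its own. The wrapper and the lack of a default namespace are
# assumptions made for this sketch.
SAMPLE = f"""<rdf:RDF xmlns:rdf="{RDF_NS}">
  <rdf:Description rdf:about="#entry9985">
    <hdatum_desc>NAD83</hdatum_desc>
    <latitude83>40.944623</latitude83>
    <longitude83>-96.990306</longitude83>
    <registry_id>110006555085</registry_id>
    <primary_name>TERRI KELLER RESIDENCE</primary_name>
  </rdf:Description>
</rdf:RDF>"""


def facility_points(rdf_xml):
    """Return one dict per rdf:Description, with coordinates as floats."""
    root = ET.fromstring(rdf_xml)
    points = []
    for desc in root.findall(f"{{{RDF_NS}}}Description"):
        points.append({
            "registry_id": desc.findtext("registry_id"),
            "name": desc.findtext("primary_name"),
            "datum": desc.findtext("hdatum_desc"),
            "lat": float(desc.findtext("latitude83")),
            "lon": float(desc.findtext("longitude83")),
        })
    return points


def to_wkt(point):
    """Format a point as WKT; WKT lists longitude (x) before latitude (y)."""
    return f"POINT({point['lon']} {point['lat']})"


points = facility_points(SAMPLE)
print(to_wkt(points[0]))
```

The coordinates arrive as untyped strings in ad hoc elements, so a consumer has to know which element names to look for and which datum applies before any spatial use is possible.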
4. FRS Overview. FRS is a data aggregator. FRS performs integration, validation and QA across 32 federal databases and 57 state, territory and tribal databases. FRS contains information on 2.8 million facilities. More than 80% of facilities have lat/long information.
5. High Level Data Model (diagram). Entities shown: Organization, Industrial Classification, Affiliation, Individual, Supplemental Interest, Mailing Address, Alternative Name, Facility/Site, Geospatial, Environmental Interest.
6. FRS Scope. Major Programs Represented in FRS
    Air: AFS, AQS, CAMDBS, EGRID, NEI, RBLC, RFS (Ethanol)
    Water: PCS, ICIS-NPDES, SDWIS, CWNS
    Chemical Releases: TRIS, RMP, TSCA, SSTS, FRP, BRAC
    Hazardous Waste: ACRES, CERCLIS, RCRAINFO, RADINFO
    Enforcement/Compliance: ICIS, ECRM, NCDB
    Schools: NCES, GNIS, BIA INDIAN SCHOOL
    Other: LANDFILL
    http://www.epa.gov/enviro/html/frs_demo/new_crosswalks.html
7. FRS Modeling. More robust, multifaceted, standards-driven, contextualized representation.
8. FRS Data. Where is EPA now?
    http://epa.gov.clients.talis.com/.html
    Ontology: http://epa.gov.clients.talis.com/schema.html
    Example: http://epa.gov.clients.talis.com/facilities/110000868918.html
    Search: http://api.talis.com/stores/epagov-dev1/items
    Mashup: https://wiki.us.talis.com/dev/demos/epa_frs/brownfields.html
9. FRS Data. EPA is currently testing functionality to meet business needs, tweaking models and exploring opportunities.
    count(?site) {
      ?site rdf:type epa:BrownfieldsSite .
      ?site epa:region region:04
    }
    SELECT ?state ?name (count(?site) as ?count)
    WHERE {
      ?site a epa:NuclearElectricityGenerator .
      ?site place:in ?state .
      ?state a place:State .
      ?state foaf:name ?name .
      FILTER regex(?name, ".+")
    }
    GROUP BY ?state ?name
    ORDER BY DESC(?count)
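For readers less familiar with SPARQL aggregates, the second query on this slide (count sites of one type per state, most sites first) can be sketched in plain Python over an in-memory list of triples. This is only an illustration of the query's logic: the triples, the `site:` and `state:` identifiers, the state names and the function `count_by_state` are hypothetical placeholders, not EPA data or an EPA API.

```python
from collections import Counter

# Hypothetical (subject, predicate, object) triples; every identifier and
# name here is a placeholder invented for this sketch.
TRIPLES = [
    ("site:1", "rdf:type", "epa:NuclearElectricityGenerator"),
    ("site:1", "place:in", "state:A"),
    ("site:2", "rdf:type", "epa:NuclearElectricityGenerator"),
    ("site:2", "place:in", "state:A"),
    ("site:3", "rdf:type", "epa:NuclearElectricityGenerator"),
    ("site:3", "place:in", "state:B"),
    ("site:4", "rdf:type", "epa:BrownfieldsSite"),
    ("site:4", "place:in", "state:B"),
    ("state:A", "rdf:type", "place:State"),
    ("state:A", "foaf:name", "Example State A"),
    ("state:B", "rdf:type", "place:State"),
    ("state:B", "foaf:name", "Example State B"),
]


def subjects(triples, predicate, obj):
    """Return the set of subjects that have the given predicate and object."""
    return {s for s, p, o in triples if p == predicate and o == obj}


def count_by_state(triples, site_type):
    """Mimic the GROUP BY ?state ?name ... ORDER BY DESC(?count) query."""
    sites = subjects(triples, "rdf:type", site_type)
    states = subjects(triples, "rdf:type", "place:State")
    # FILTER regex(?name, ".+") keeps only non-empty names.
    # Simplification: this sketch assumes one foaf:name per state.
    names = {s: o for s, p, o in triples
             if p == "foaf:name" and s in states and o}
    counts = Counter()
    for s, p, o in triples:
        if p == "place:in" and s in sites and o in names:
            counts[(o, names[o])] += 1
    # most_common() sorts by descending count, like ORDER BY DESC(?count).
    return counts.most_common()


for (state, name), count in count_by_state(TRIPLES, "epa:NuclearElectricityGenerator"):
    print(state, name, count)
```

Each joined triple pattern in the `WHERE` clause becomes a set-membership check here, which is roughly the work a triple store performs before grouping and counting.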
10. EPA Linked Data Efforts. What else is EPA doing?
    Soon: Substance Registry
    Next…: Toxic Release Inventory, Corporate Entities? Regulations?
    Collaborating: CIO Committee, DAS, SOCOP (http://socop.org/), http://www.w3.org/2011/gld/charter, EcoInformatics, VoCampDC, GeoSPARQL, others
11. Thanks/Questions. David G. Smith, PE, PLS, USEPA Office of Environmental Information, 202-566-0797, SmithG.David@epa.gov