On-the-fly Integration of Static and Dynamic Linked Data

A
On-the-fly Integration of Static and Dynamic Linked
Data
Andreas Harth (KIT), Craig Knoblock (USC), Steffen Stadtmüller (KIT), Rudi Studer
(KIT), Pedro Szekely (USC)

INSTITUTE OF APPLIED INFORMATICS AND FORMAL DESCRIPTION METHODS (AIFB)

KIT – University of the State of Baden-Wuerttemberg and
National Research Center of the Helmholtz Association

www.kit.edu
Outline
Motivation
Scenario and Overview
Modelling Sources: Karma
Accessing and Integrating Sources: Data-Fu
Demo
Conclusion

2

On-the-fly Integration of Static and Dynamic Linked Data
Motivation
The relevance of many types of data perishes or degrades over time
(e.g., weather information, moving objects)
Timely decision making requires
access to live data and
inclusion of new sources in a flexible manner.

Our goals
(Near) real-time access to a variety of data sources in a range of data
formats and access modalities
Rapidly integrate sources via modeling and to generate a Linked Data
interface to live sources

3

On-the-fly Integration of Static and Dynamic Linked Data
Static vs. Dynamic Sources
Various sources have different update intervals (from minutes to
weeks)
We treat the access to all sources in the same way via polling (HTTP
GETs)
Thus, the only distinction between „static“ and „dynamic“ sources is
how fast we refresh the query results for each source

4

On-the-fly Integration of Static and Dynamic Linked Data
Scenario

POIs
(Crunchbase, OS
M, Wikimapia)
5

Venues/Events

Buses/Stops

(Eventful, LastFM)

(LA Metro)

On-the-fly Integration of Static and Dynamic Linked Data

Vehicles
(Campus
Cruisers)

Marine Vessels
(AIS)
Architecture

6

On-the-fly Integration of Static and Dynamic Linked Data
Karma
Interactive tool for rapidly
extracting, cleaning, transforming, integrating, and publishing data

Tabular
Sources

Karma

Hierarchical
Sources
Database

Services

Model

…
See http://isi.edu/integration/karma/ for more info and download
7

On-the-fly Integration of Static and Dynamic Linked Data
Modelling Sources with Karma
Karma is a data integration tool

Linked API

Map data onto an ontology to generate Linked Data
Karma extension to enable the on-the-fly lifting of API I/O
data according to a pre-defined mapping model
Web
API

Vehicles
(Campus
Cruisers)

8

On-the-fly Integration of Static and Dynamic Linked Data
Linked Data Access to Event APIs
Venues/Events

LastFM API

(Eventful, LastFM)

Given a lat/lon of a location, return a list of event identifiers
http://km.aifb.kit.edu/services/lastfmwrap/geo.getevents?
lat={?lat}&long={?lon}
Given an event identifier, return details about the event
http://lastfm.rdfize.com/events/{event-id}
Eventful API
List events given a keyword search term and a date range
http://km.aifb.kit.edu/services/eventfulwrap/search?locat
ion={?loc}&date={?date}

9

On-the-fly Integration of Static and Dynamic Linked Data
LastFM Data-Fu Program (I)
Program at http://km.aifb.kit.edu/services/data-fu/lastfm
with input lat/lon (in RDF via HTTP POST)
Rule to search for events at given location:
{ ?p geo:long ?lon .
?p geo:lat ?lat . }
=>
{ [] http:mthd http:GET ;
http:requestURI
<http://km.aifb.kit.edu/services/lastfm
wrap/geo.getevents?lat={?lat}&long={?lo
n}> . } .

10

On-the-fly Integration of Static and Dynamic Linked Data

“For the input point with lat/long

perform an HTTP GET
at the KIT LastFM Wrapper URI
constructed with the lat/long”
LastFM Data-Fu Program (II)
Rule for retrieving information about the found events, including
geolocation of event:
{ ?e rdf:type lode:Event. }
=>
{ [] http:mthd http:GET ;
http:requestURI ?e . } .

“For every resource of type event
perform an HTTP GET
at the resource URI”

Query to return a table with lat/lon and label to transform to
KML/Google Earth:
:q1 qrl:select ( ?event ?place ?label ?lat ?lon ) ;
qrl:where {
?event <http://purl.org/NET/c4dm/event.owl#place> ?place .
?event rdfs:label ?label .
“Output is every entity with
?place geo:lat ?lat .
latitude, longitude and associated
?place geo:long ?lon .
label”
} .
11

On-the-fly Integration of Static and Dynamic Linked Data
Data Source Characteristics

12

On-the-fly Integration of Static and Dynamic Linked Data
Demo
Load http://people.aifb.kit.edu/aha/2013/d3/index.kml into Google Earth
Location of buses and ships are updated

13

On-the-fly Integration of Static and Dynamic Linked Data
Conclusion
System interoperation in distributed environments with Linked Data as
interface
Rapid integration of new sources (via Karma models and Data-Fu
scripts)
Realtime access to networked data via Data-Fu scripts/programs
http://code.google.com/p/data-fu/

Ability to rapidy integrate new sources via Karma models
http://www.isi.edu/integration/karma/

Future work
Modular organisation of programs
Manipulating resource state (Read-Write Linked Data)
Optimisations for limited bandwidth environments
14

On-the-fly Integration of Static and Dynamic Linked Data
Challenges
Data is provided at different places, by different owners, often over the
web (decentralised data publishing)
Data and links are provided in a many different formats/protocols
Developers have to gain a deep understanding of every API by reading
textual descriptions

Applications (user agents) are supposed to follow links as found during
runtime of the application
Developers have to define their desired interaction at design time
Developers have to write individually tailored code to consume services in
applications

15

On-the-fly Integration of Static and Dynamic Linked Data
1 of 15

Recommended

Field Data Collecting, Processing and Sharing: Using web Service Technologies by
Field Data Collecting, Processing and Sharing: Using web Service TechnologiesField Data Collecting, Processing and Sharing: Using web Service Technologies
Field Data Collecting, Processing and Sharing: Using web Service TechnologiesNiroshan Sanjaya
865 views1 slide
CKANへの空間情報機能拡張実装の試み by
CKANへの空間情報機能拡張実装の試みCKANへの空間情報機能拡張実装の試み
CKANへの空間情報機能拡張実装の試みYoichi Kayama
2.3K views42 slides
Latest Developments in Oceanographic Applications of GIS, including Near-real... by
Latest Developments in Oceanographic Applications of GIS, including Near-real...Latest Developments in Oceanographic Applications of GIS, including Near-real...
Latest Developments in Oceanographic Applications of GIS, including Near-real...Dawn Wright
204 views53 slides
A Study on New York City Taxi Rides by
A Study on New York City Taxi RidesA Study on New York City Taxi Rides
A Study on New York City Taxi RidesCaglar Subasi
154 views16 slides
ccalendar by
ccalendarccalendar
ccalendarPraveen AP
433 views12 slides
Data Sources by
Data SourcesData Sources
Data SourcesJoe Larson
607 views20 slides

More Related Content

What's hot

Advait kulkarni by
Advait kulkarniAdvait kulkarni
Advait kulkarniAdvait Kulkarni
91 views1 slide
Ross McDonald - PgRouting in QGIS by
Ross McDonald - PgRouting in QGISRoss McDonald - PgRouting in QGIS
Ross McDonald - PgRouting in QGISRoss McDonald
1.3K views18 slides
Geolocation analysis using HiveQL by
Geolocation analysis using HiveQLGeolocation analysis using HiveQL
Geolocation analysis using HiveQLPriyanka Kale
736 views17 slides
Chek mate geolocation analyzer by
Chek mate geolocation analyzerChek mate geolocation analyzer
Chek mate geolocation analyzerpriyal mistry
510 views23 slides
PowerStream: Propelling Energy Innovation with Predictive Analytics by
PowerStream: Propelling Energy Innovation with Predictive Analytics PowerStream: Propelling Energy Innovation with Predictive Analytics
PowerStream: Propelling Energy Innovation with Predictive Analytics SingleStore
386 views28 slides
Web Services Emissions 2006 Falke by
Web Services Emissions 2006 FalkeWeb Services Emissions 2006 Falke
Web Services Emissions 2006 FalkeRudolf Husar
449 views13 slides

What's hot(19)

Ross McDonald - PgRouting in QGIS by Ross McDonald
Ross McDonald - PgRouting in QGISRoss McDonald - PgRouting in QGIS
Ross McDonald - PgRouting in QGIS
Ross McDonald1.3K views
Geolocation analysis using HiveQL by Priyanka Kale
Geolocation analysis using HiveQLGeolocation analysis using HiveQL
Geolocation analysis using HiveQL
Priyanka Kale736 views
Chek mate geolocation analyzer by priyal mistry
Chek mate geolocation analyzerChek mate geolocation analyzer
Chek mate geolocation analyzer
priyal mistry510 views
PowerStream: Propelling Energy Innovation with Predictive Analytics by SingleStore
PowerStream: Propelling Energy Innovation with Predictive Analytics PowerStream: Propelling Energy Innovation with Predictive Analytics
PowerStream: Propelling Energy Innovation with Predictive Analytics
SingleStore386 views
Web Services Emissions 2006 Falke by Rudolf Husar
Web Services Emissions 2006 FalkeWeb Services Emissions 2006 Falke
Web Services Emissions 2006 Falke
Rudolf Husar449 views
060525AGU_ESSI CAPITA Poster by Rudolf Husar
060525AGU_ESSI CAPITA Poster060525AGU_ESSI CAPITA Poster
060525AGU_ESSI CAPITA Poster
Rudolf Husar188 views
Collecting Endpoint Security Logs Through Big Data Technology - Dedi Dwianto by idsecconf
Collecting Endpoint Security Logs Through Big Data Technology - Dedi DwiantoCollecting Endpoint Security Logs Through Big Data Technology - Dedi Dwianto
Collecting Endpoint Security Logs Through Big Data Technology - Dedi Dwianto
idsecconf374 views
060730 Igarss06 Denver Husar by Rudolf Husar
060730 Igarss06 Denver Husar060730 Igarss06 Denver Husar
060730 Igarss06 Denver Husar
Rudolf Husar324 views
Serving Ireland's Geospatial Information as Linked Data by Christophe Debruyne
Serving Ireland's Geospatial Information as Linked DataServing Ireland's Geospatial Information as Linked Data
Serving Ireland's Geospatial Information as Linked Data
Participatory Cyber Physical System in Public Transport Application by John Lau
Participatory Cyber Physical System in Public Transport ApplicationParticipatory Cyber Physical System in Public Transport Application
Participatory Cyber Physical System in Public Transport Application
John Lau608 views
The habitats approach to build the inspire infrastructure by Karel Charvat
The habitats approach to build the inspire infrastructureThe habitats approach to build the inspire infrastructure
The habitats approach to build the inspire infrastructure
Karel Charvat807 views
Inspire hack 2017-linked-data by Raul Palma
Inspire hack 2017-linked-dataInspire hack 2017-linked-data
Inspire hack 2017-linked-data
Raul Palma212 views
Project on nypd accident analysis using hadoop environment by Siddharth Chaudhary
Project on nypd accident analysis using hadoop environmentProject on nypd accident analysis using hadoop environment
Project on nypd accident analysis using hadoop environment
Visualising statistical Linked Data with Plone by Eau de Web
Visualising statistical Linked Data with PloneVisualising statistical Linked Data with Plone
Visualising statistical Linked Data with Plone
Eau de Web475 views
2013 open analytics-meetup-mortar by Open Analytics
2013 open analytics-meetup-mortar2013 open analytics-meetup-mortar
2013 open analytics-meetup-mortar
Open Analytics1.6K views

Similar to On-the-fly Integration of Static and Dynamic Linked Data

Traffic Data Analysis and Prediction using Big Data by
Traffic Data Analysis and Prediction using Big DataTraffic Data Analysis and Prediction using Big Data
Traffic Data Analysis and Prediction using Big DataJongwook Woo
1.2K views29 slides
Data dissemination and materials informatics at LBNL by
Data dissemination and materials informatics at LBNLData dissemination and materials informatics at LBNL
Data dissemination and materials informatics at LBNLAnubhav Jain
348 views1 slide
Presentation by
PresentationPresentation
Presentationbolu804
510 views41 slides
Ws For Aqm by
Ws For AqmWs For Aqm
Ws For AqmRudolf Husar
382 views9 slides
Streaming Weather Data from Web APIs to Jupyter through Kafka by
Streaming Weather Data from Web APIs to Jupyter through KafkaStreaming Weather Data from Web APIs to Jupyter through Kafka
Streaming Weather Data from Web APIs to Jupyter through KafkaLeo Salemann
910 views11 slides
Streaming Weather Data from Web APIs to Jupyter through Kafka by
Streaming Weather Data from Web APIs to Jupyter through KafkaStreaming Weather Data from Web APIs to Jupyter through Kafka
Streaming Weather Data from Web APIs to Jupyter through KafkaWenfan Xu
64 views11 slides

Similar to On-the-fly Integration of Static and Dynamic Linked Data(20)

Traffic Data Analysis and Prediction using Big Data by Jongwook Woo
Traffic Data Analysis and Prediction using Big DataTraffic Data Analysis and Prediction using Big Data
Traffic Data Analysis and Prediction using Big Data
Jongwook Woo1.2K views
Data dissemination and materials informatics at LBNL by Anubhav Jain
Data dissemination and materials informatics at LBNLData dissemination and materials informatics at LBNL
Data dissemination and materials informatics at LBNL
Anubhav Jain348 views
Presentation by bolu804
PresentationPresentation
Presentation
bolu804510 views
Streaming Weather Data from Web APIs to Jupyter through Kafka by Leo Salemann
Streaming Weather Data from Web APIs to Jupyter through KafkaStreaming Weather Data from Web APIs to Jupyter through Kafka
Streaming Weather Data from Web APIs to Jupyter through Kafka
Leo Salemann910 views
Streaming Weather Data from Web APIs to Jupyter through Kafka by Wenfan Xu
Streaming Weather Data from Web APIs to Jupyter through KafkaStreaming Weather Data from Web APIs to Jupyter through Kafka
Streaming Weather Data from Web APIs to Jupyter through Kafka
Wenfan Xu64 views
Lemmens kessler-agile-linked data v3-slideshare by Rob Lemmens
Lemmens kessler-agile-linked data v3-slideshareLemmens kessler-agile-linked data v3-slideshare
Lemmens kessler-agile-linked data v3-slideshare
Rob Lemmens3.4K views
Andrew Murdoch Avian Influenza 20080414 by a_murdoch
Andrew Murdoch Avian Influenza 20080414Andrew Murdoch Avian Influenza 20080414
Andrew Murdoch Avian Influenza 20080414
a_murdoch831 views
2003-11-02 Combined Aerosol Trajectory Tool, CATT by Rudolf Husar
2003-11-02 Combined Aerosol Trajectory Tool, CATT2003-11-02 Combined Aerosol Trajectory Tool, CATT
2003-11-02 Combined Aerosol Trajectory Tool, CATT
Rudolf Husar332 views
LarKC Tutorial at ISWC 2009 - Urban Computing by LarKC
LarKC Tutorial at ISWC 2009 - Urban ComputingLarKC Tutorial at ISWC 2009 - Urban Computing
LarKC Tutorial at ISWC 2009 - Urban Computing
LarKC399 views
Diversity++2015 talk: R2R+BCO-DMO - Linked Oceanographic Datasets by Adila Krisnadhi
Diversity++2015 talk: R2R+BCO-DMO - Linked Oceanographic DatasetsDiversity++2015 talk: R2R+BCO-DMO - Linked Oceanographic Datasets
Diversity++2015 talk: R2R+BCO-DMO - Linked Oceanographic Datasets
Adila Krisnadhi558 views
070726 Igarss07 Barcelona by Rudolf Husar
070726 Igarss07 Barcelona070726 Igarss07 Barcelona
070726 Igarss07 Barcelona
Rudolf Husar304 views
Emerging Dynamic TUW-ASE Summer 2015 - Distributed Systems and Challenges for... by Hong-Linh Truong
Emerging Dynamic TUW-ASE Summer 2015 - Distributed Systems and Challenges for...Emerging Dynamic TUW-ASE Summer 2015 - Distributed Systems and Challenges for...
Emerging Dynamic TUW-ASE Summer 2015 - Distributed Systems and Challenges for...
Hong-Linh Truong1.7K views
070416 Egu Vienna Husar by Rudolf Husar
070416 Egu Vienna Husar070416 Egu Vienna Husar
070416 Egu Vienna Husar
Rudolf Husar377 views
Godiva2 Overview by jonblower
Godiva2 OverviewGodiva2 Overview
Godiva2 Overview
jonblower1.2K views
Active Data: Managing Data-Life Cycle on Heterogeneous Systems and Infrastruc... by Gilles Fedak
Active Data: Managing Data-Life Cycle on Heterogeneous Systems and Infrastruc...Active Data: Managing Data-Life Cycle on Heterogeneous Systems and Infrastruc...
Active Data: Managing Data-Life Cycle on Heterogeneous Systems and Infrastruc...
Gilles Fedak2.6K views
Science Services and Science Platforms: Using the Cloud to Accelerate and Dem... by Ian Foster
Science Services and Science Platforms: Using the Cloud to Accelerate and Dem...Science Services and Science Platforms: Using the Cloud to Accelerate and Dem...
Science Services and Science Platforms: Using the Cloud to Accelerate and Dem...
Ian Foster816 views

Recently uploaded

DRaaS using Snapshot copy and destination selection (DRaaS) - Alexandre Matti... by
DRaaS using Snapshot copy and destination selection (DRaaS) - Alexandre Matti...DRaaS using Snapshot copy and destination selection (DRaaS) - Alexandre Matti...
DRaaS using Snapshot copy and destination selection (DRaaS) - Alexandre Matti...ShapeBlue
141 views29 slides
Initiating and Advancing Your Strategic GIS Governance Strategy by
Initiating and Advancing Your Strategic GIS Governance StrategyInitiating and Advancing Your Strategic GIS Governance Strategy
Initiating and Advancing Your Strategic GIS Governance StrategySafe Software
184 views68 slides
LLMs in Production: Tooling, Process, and Team Structure by
LLMs in Production: Tooling, Process, and Team StructureLLMs in Production: Tooling, Process, and Team Structure
LLMs in Production: Tooling, Process, and Team StructureAggregage
57 views77 slides
The Role of Patterns in the Era of Large Language Models by
The Role of Patterns in the Era of Large Language ModelsThe Role of Patterns in the Era of Large Language Models
The Role of Patterns in the Era of Large Language ModelsYunyao Li
91 views65 slides
NTGapps NTG LowCode Platform by
NTGapps NTG LowCode Platform NTGapps NTG LowCode Platform
NTGapps NTG LowCode Platform Mustafa Kuğu
437 views30 slides
Ransomware is Knocking your Door_Final.pdf by
Ransomware is Knocking your Door_Final.pdfRansomware is Knocking your Door_Final.pdf
Ransomware is Knocking your Door_Final.pdfSecurity Bootcamp
98 views46 slides

Recently uploaded(20)

DRaaS using Snapshot copy and destination selection (DRaaS) - Alexandre Matti... by ShapeBlue
DRaaS using Snapshot copy and destination selection (DRaaS) - Alexandre Matti...DRaaS using Snapshot copy and destination selection (DRaaS) - Alexandre Matti...
DRaaS using Snapshot copy and destination selection (DRaaS) - Alexandre Matti...
ShapeBlue141 views
Initiating and Advancing Your Strategic GIS Governance Strategy by Safe Software
Initiating and Advancing Your Strategic GIS Governance StrategyInitiating and Advancing Your Strategic GIS Governance Strategy
Initiating and Advancing Your Strategic GIS Governance Strategy
Safe Software184 views
LLMs in Production: Tooling, Process, and Team Structure by Aggregage
LLMs in Production: Tooling, Process, and Team StructureLLMs in Production: Tooling, Process, and Team Structure
LLMs in Production: Tooling, Process, and Team Structure
Aggregage57 views
The Role of Patterns in the Era of Large Language Models by Yunyao Li
The Role of Patterns in the Era of Large Language ModelsThe Role of Patterns in the Era of Large Language Models
The Role of Patterns in the Era of Large Language Models
Yunyao Li91 views
NTGapps NTG LowCode Platform by Mustafa Kuğu
NTGapps NTG LowCode Platform NTGapps NTG LowCode Platform
NTGapps NTG LowCode Platform
Mustafa Kuğu437 views
Developments to CloudStack’s SDN ecosystem: Integration with VMWare NSX 4 - P... by ShapeBlue
Developments to CloudStack’s SDN ecosystem: Integration with VMWare NSX 4 - P...Developments to CloudStack’s SDN ecosystem: Integration with VMWare NSX 4 - P...
Developments to CloudStack’s SDN ecosystem: Integration with VMWare NSX 4 - P...
ShapeBlue196 views
Import Export Virtual Machine for KVM Hypervisor - Ayush Pandey - University ... by ShapeBlue
Import Export Virtual Machine for KVM Hypervisor - Ayush Pandey - University ...Import Export Virtual Machine for KVM Hypervisor - Ayush Pandey - University ...
Import Export Virtual Machine for KVM Hypervisor - Ayush Pandey - University ...
ShapeBlue120 views
"Running students' code in isolation. The hard way", Yurii Holiuk by Fwdays
"Running students' code in isolation. The hard way", Yurii Holiuk "Running students' code in isolation. The hard way", Yurii Holiuk
"Running students' code in isolation. The hard way", Yurii Holiuk
Fwdays36 views
The Power of Heat Decarbonisation Plans in the Built Environment by IES VE
The Power of Heat Decarbonisation Plans in the Built EnvironmentThe Power of Heat Decarbonisation Plans in the Built Environment
The Power of Heat Decarbonisation Plans in the Built Environment
IES VE84 views
ESPC 2023 - Protect and Govern your Sensitive Data with Microsoft Purview in ... by Jasper Oosterveld
ESPC 2023 - Protect and Govern your Sensitive Data with Microsoft Purview in ...ESPC 2023 - Protect and Govern your Sensitive Data with Microsoft Purview in ...
ESPC 2023 - Protect and Govern your Sensitive Data with Microsoft Purview in ...
KVM Security Groups Under the Hood - Wido den Hollander - Your.Online by ShapeBlue
KVM Security Groups Under the Hood - Wido den Hollander - Your.OnlineKVM Security Groups Under the Hood - Wido den Hollander - Your.Online
KVM Security Groups Under the Hood - Wido den Hollander - Your.Online
ShapeBlue225 views
"Node.js Development in 2024: trends and tools", Nikita Galkin by Fwdays
"Node.js Development in 2024: trends and tools", Nikita Galkin "Node.js Development in 2024: trends and tools", Nikita Galkin
"Node.js Development in 2024: trends and tools", Nikita Galkin
Fwdays33 views
Zero to Cloud Hero: Crafting a Private Cloud from Scratch with XCP-ng, Xen Or... by ShapeBlue
Zero to Cloud Hero: Crafting a Private Cloud from Scratch with XCP-ng, Xen Or...Zero to Cloud Hero: Crafting a Private Cloud from Scratch with XCP-ng, Xen Or...
Zero to Cloud Hero: Crafting a Private Cloud from Scratch with XCP-ng, Xen Or...
ShapeBlue199 views
Webinar : Desperately Seeking Transformation - Part 2: Insights from leading... by The Digital Insurer
Webinar : Desperately Seeking Transformation - Part 2:  Insights from leading...Webinar : Desperately Seeking Transformation - Part 2:  Insights from leading...
Webinar : Desperately Seeking Transformation - Part 2: Insights from leading...
Setting Up Your First CloudStack Environment with Beginners Challenges - MD R... by ShapeBlue
Setting Up Your First CloudStack Environment with Beginners Challenges - MD R...Setting Up Your First CloudStack Environment with Beginners Challenges - MD R...
Setting Up Your First CloudStack Environment with Beginners Challenges - MD R...
ShapeBlue178 views
Redefining the book supply chain: A glimpse into the future - Tech Forum 2023 by BookNet Canada
Redefining the book supply chain: A glimpse into the future - Tech Forum 2023Redefining the book supply chain: A glimpse into the future - Tech Forum 2023
Redefining the book supply chain: A glimpse into the future - Tech Forum 2023
BookNet Canada44 views
Optimizing Communication to Optimize Human Behavior - LCBM by Yaman Kumar
Optimizing Communication to Optimize Human Behavior - LCBMOptimizing Communication to Optimize Human Behavior - LCBM
Optimizing Communication to Optimize Human Behavior - LCBM
Yaman Kumar38 views

On-the-fly Integration of Static and Dynamic Linked Data

  • 1. On-the-fly Integration of Static and Dynamic Linked Data Andreas Harth (KIT), Craig Knoblock (USC), Steffen Stadtmüller (KIT), Rudi Studer (KIT), Pedro Szekely (USC) INSTITUTE OF APPLIED INFORMATICS AND FORMAL DESCRIPTION METHODS (AIFB) KIT – University of the State of Baden-Wuerttemberg and National Research Center of the Helmholtz Association www.kit.edu
  • 2. Outline Motivation Scenario and Overview Modelling Sources: Karma Accessing and Integrating Sources: Data-Fu Demo Conclusion 2 On-the-fly Integration of Static and Dynamic Linked Data
  • 3. Motivation The relevance of many types of data perishes or degrades over time (e.g., weather information, moving objects) Timely decision making requires access to live data and inclusion of new sources in a flexible manner. Our goals (Near) real-time access to a variety of data sources in a range of data formats and access modalities Rapidly integrate sources via modeling and to generate a Linked Data interface to live sources 3 On-the-fly Integration of Static and Dynamic Linked Data
  • 4. Static vs. Dynamic Sources Various sources have different update intervals (from minutes to weeks) We treat the access to all sources in the same way via polling (HTTP GETs) Thus, the only distinction between „static“ and „dynamic“ sources is how fast we refresh the query results for each source 4 On-the-fly Integration of Static and Dynamic Linked Data
  • 5. Scenario POIs (Crunchbase, OS M, Wikimapia) 5 Venues/Events Buses/Stops (Eventful, LastFM) (LA Metro) On-the-fly Integration of Static and Dynamic Linked Data Vehicles (Campus Cruisers) Marine Vessels (AIS)
  • 6. Architecture 6 On-the-fly Integration of Static and Dynamic Linked Data
  • 7. Karma Interactive tool for rapidly extracting, cleaning, transforming, integrating, and publishing data Tabular Sources Karma Hierarchical Sources Database Services Model … See http://isi.edu/integration/karma/ for more info and download 7 On-the-fly Integration of Static and Dynamic Linked Data
  • 8. Modelling Sources with Karma Karma is a data integration tool Linked API Map data onto an ontology to generate Linked Data Karma extension to enable the on-the-fly lifting of API I/O data according to a pre-defined mapping model Web API Vehicles (Campus Cruisers) 8 On-the-fly Integration of Static and Dynamic Linked Data
  • 9. Linked Data Access to Event APIs Venues/Events LastFM API (Eventful, LastFM) Given a lat/lon of a location, return a list of event identifiers http://km.aifb.kit.edu/services/lastfmwrap/geo.getevents? lat={?lat}&long={?lon} Given an event identifier, return details about the event http://lastfm.rdfize.com/events/{event-id} Eventful API List events given a keyword search term and a date range http://km.aifb.kit.edu/services/eventfulwrap/search?locat ion={?loc}&date={?date} 9 On-the-fly Integration of Static and Dynamic Linked Data
  • 10. LastFM Data-Fu Program (I) Program at http://km.aifb.kit.edu/services/data-fu/lastfm with input lat/lon (in RDF via HTTP POST) Rule to search for events at given location: { ?p geo:long ?lon . ?p geo:lat ?lat . } => { [] http:mthd http:GET ; http:requestURI <http://km.aifb.kit.edu/services/lastfm wrap/geo.getevents?lat={?lat}&long={?lo n}> . } . 10 On-the-fly Integration of Static and Dynamic Linked Data “For the input point with lat/long perform an HTTP GET at the KIT LastFM Wrapper URI constructed with the lat/long”
  • 11. LastFM Data-Fu Program (II) Rule for retrieving information about the found events, including geolocation of event: { ?e rdf:type lode:Event. } => { [] http:mthd http:GET ; http:requestURI ?e . } . “For every resource of type event perform an HTTP GET at the resource URI” Query to return a table with lat/lon and label to transform to KML/Google Earth: :q1 qrl:select ( ?event ?place ?label ?lat ?lon ) ; qrl:where { ?event <http://purl.org/NET/c4dm/event.owl#place> ?place . ?event rdfs:label ?label . “Output is every entity with ?place geo:lat ?lat . latitude, longitude and associated ?place geo:long ?lon . label” } . 11 On-the-fly Integration of Static and Dynamic Linked Data
  • 12. Data Source Characteristics 12 On-the-fly Integration of Static and Dynamic Linked Data
  • 13. Demo Load http://people.aifb.kit.edu/aha/2013/d3/index.kml into Google Earth Location of buses and ships are updated 13 On-the-fly Integration of Static and Dynamic Linked Data
  • 14. Conclusion System interoperation in distributed environments with Linked Data as interface Rapid integration of new sources (via Karma models and Data-Fu scripts) Realtime access to networked data via Data-Fu scripts/programs http://code.google.com/p/data-fu/ Ability to rapidy integrate new sources via Karma models http://www.isi.edu/integration/karma/ Future work Modular organisation of programs Manipulating resource state (Read-Write Linked Data) Optimisations for limited bandwidth environments 14 On-the-fly Integration of Static and Dynamic Linked Data
  • 15. Challenges Data is provided at different places, by different owners, often over the web (decentralised data publishing) Data and links are provided in a many different formats/protocols Developers have to gain a deep understanding of every API by reading textual descriptions Applications (user agents) are supposed to follow links as found during runtime of the application Developers have to define their desired interaction at design time Developers have to write individually tailored code to consume services in applications 15 On-the-fly Integration of Static and Dynamic Linked Data