SlideShare a Scribd company logo
1 of 28
Download to read offline
Peter Haase, Michael Schmidt
fluid Operations AG
Cloud-based Linked Data Management
for
Self-service Application Development
International Workshop on Scalable Semantic Computing
Hangzhou, November 6, 2010
Increasing Popularity of Linked Open Data
• LOD cloud as of May 2009
• 4.7 billion triples
• 142 million RDF links
• LOD cloud as of Sep 2010
• 25 billion triples
• 395 million RDF links
• Covering various domains
• Media
• Life Science
• Geography
• Publications
• …
Linking Open Data cloud diagram, by Richard
Cyganiak and Anja Jentzsch. http://lod-cloud.net/
Agenda
• Linked Data Application Development
Opportunities and Challenges
• Information Workbench as Platform for
Linked Data Application Development
• Accessing Linked Data as a Service
Vision and First Experiences
• Conclusions
New Opportunities
• Established standards define common data models,
vocabularies, semantics
• RDF/RDFS, OWL, SPARQL
• From data silos to a web of data
• Ease of specifying relationships in a decentralized way
• Innovative applications that integrate data from various
domains and sources
• Linked Government Data
• Linked Open Data
• Benefits of Linked Data in the enterprise
• Semantically integrate and interlink data scattered among systems
• Cross the chasm between enterprise-internal and public data
• Leverage semantic technologies for improved search and presentation
Challenges in Building Linked Data Applications
• Heterogeneity in various dimensions
 Location of data (internal / external, open / closed)
 Identifiers, structure and vocabularies
 Ownership of data
• Structured and unstructered data
• Quality of Linked Data
• Various forms of imperfection (erroneous, incomplete, imprecise data)
• Trustworthiness
• End-user oriented interfaces and interaction paradigms
• Interfaces that operate over large amounts of data, flexible and dynamic schemas
• Meaningful aggregation of the data
• Support for expressive queries, while retaining intuitive interfaces
• User-generated content
• Collaborative annotation and knowledge acquisition
The Information Workbench
• Platform for Linked Data application development
• Base functionality to build applications without any programming
• SDK for easy extensions
• Covering the entire lifecycle of interacting with Linked Data
 Discovery of data sources
 Integration of data sources
 Visualization
 Search and Exploration
 Collaborative generation of data
• Targeted at
• Semantic Web Community
• Linked Open Data community
• Innovative Enterprises
• Demo and source available at http://iwb.fluidops.com/.
The IWB Application Development Process
Linked Open Data Discovery
• Visually explore data sets
registered to global registries
• Sort/filter data sets by domain,
location, and many more facets
to identify relevant data
1
LOD Discovery with the Information Workbench
The IWB Application Development Process
Linked Open Data Discovery
• Visually explore data sets
registered to global registries
• Sort/filter data sets by domain,
location, and many more facets
to identify relevant data
Data Integration
• Integrate discovered Linked Data
• Add providers for internal and external
legacy data sources
• Improve data quality, e.g. via
incremental refinement of ontology
1 2
The IWB Application Development Process
Linked Open Data Discovery
• Visually explore data sets
registered to global registries
• Sort/filter data sets by domain,
location, and many more facets
to identify relevant data
Data Integration
• Integrate discovered Linked Data
• Add providers for internal and external
legacy data sources
• Improve data quality, e.g. via
incremental refinement of ontology
Customization
• Declaratively specify UI
based on available pool of
widgets
• Embed reports and charts into
wiki pages and wiki page
templates
• Semantically annotate and
interlink connected resources
1
3
2
The IWB Application Development Process
Linked Open Data Discovery
• Visually explore data sets
registered to global registries
• Sort/filter data sets by domain,
location, and many more facets
to identify relevant data
Data Integration
• Integrate discovered Linked Data
• Add providers for internal and external
legacy data sources
• Improve data quality, e.g. via
incremental refinement of ontology
Customization
• Declaratively specify UI
based on available pool of
widgets
• Embed reports and charts into
wiki pages and wiki page
templates
• Semantically annotate and
interlink connected resources
Advanced System Configuration
and Extensions
• Use APIs and SDKs to implement own
widgets and mashups
• Script data providers to integrate data
behind non-standard interfaces
• Develop and integrate own modules,
e.g. for customized search and
information extraction
1 2
3 4
Information Workbench Architecture
• Extensible, widget-based UI
• Resource-centric presentation
• Living UI, which exploits semantics
of underlying data
• Large collection of predefined
widgets, easily extendable
• Search and information Access
• Coexistence of structured and
unstructured data
• Different search paradigms (keyword
and faceted search, semantic query
completion)
• Data integration through providers
• Convert data from a data source into
the RDF data format
• Customizable, easily extensible
• Use of public LOD registries
Information Workbench Architecture
In the remainder of the talk
• Focus on challenges in data
integration layer
• In particular: virtualized, cloud-
based integration of data
sources
Linked Data Integration – Where we are
• Non-RDF data stored locally in the repository
• On demand, this data can be updated periodically
• RDF data can be…
• persisted in repository, or
• connected via naive federation layer (where possible)
Linked Data Integration – Our Vision
• Current way of publishing
• Authors provide RDF dumps linked on some homepage
• Provisioning information missing (data zipped, splitted, available in
different formats, …)
• Often also SPARQL endpoints (typically with poor response times)
• How it should be done
• Rich meta-data describing content, structure, properties of the data
• Enable exploration of data via meta repositories
• Efforts have been made (see CKAN), but…
• … poor quality of meta data and data
• Possibility for end-users to buy service guarantees
• Integration details should be irrelevant to the end-user
Software Components
• Definition of „Software Components“
"A software component is a unit of composition with contractually
specified interfaces and explicit context dependencies only. A software
component can be deployed independently and is subject to
composition by third parties." (wikipedia.org)
Data Components
• What we need for Linked Data: „Data Components“
• Interfaces: data components with precise interfaces and metadata
• Deployment: easy provisioning and integration in applications
• Composition: transparent access to atomic or composite units
• Definition of „Software Components“
"A software component is a unit of composition with contractually
specified interfaces and explicit context dependencies only. A software
component can be deployed independently and is subject to
composition by third parties." (wikipedia.org)
Next Step: Data-as-a-Service
• Idea
• Producer provides data components
• Consumers can access data components as a service
Next Step: Data-as-a-Service
• Idea
• Producer provides data components
• Consumers can access data components as a service
• Possible realization: use cloud technology!
• Sold on demand
• Elastic
• Fully managed by provider
characteristics of cloud services,
like e.g. AWS, exactly match the
needs (just like it is the case for
Software-as-a-Service)
Next Step: Data-as-a-Service
Virtualized Semantic Repositories
Identification, composition, and use of (fragments of) datasets in manners
that abstract the applications from the specific setup of the data
management service (such as local vs. remote, federation, and distribution)
• Idea
• Producer provides data components
• Consumers can access data components as a service
• Possible realization: use cloud technology!
• Sold on demand
• Elastic
• Fully managed by provider
characteristics of cloud services,
like e.g. AWS, exactly match the
needs (just like it is the case for
Software-as-a-Service)
Challenge 1: Precise Interfaces
• Standardization efforts for RDF meta data descriptions
• Statistical Core Vocabulary (SCOVO)
• Very flexible
• Forms a good basis for describing RDF statistics
• Vocabulary of Interlinked Data Sets (voiD)
• Based on SCOVO
• Used to publish meta information about Linked Data Sources
• voiD 2 (in progress)
• Dataset meta information, like source, description, dump, license
• Used vocabularies/ontologies
• Dataset interlinking
• Statistics (e.g. distinct subject count, triples with given predicate etc.)
• Open data registries
• Comprehensive Knowledge Archive Network
• Based on DublinCore and DERI‘s data catalog vocabulary (dcat)
Challenge 2: Deployment
• Based on Interfaces
• Possibly based on cloud technologies
• State-of-the-art not satisfying
• URLs pointing to human readable description, but not the actual endpoint
• Various forms of syntax errors in RDF documents
• MIME types incorrect or missing
• Endpoints/servers not reachable
• Endpoint/file password protected
Some Statistics
Based on subset of LOD cloud
(excluding a few extremely large datasets)
Challenge 3: Composition
Query Processing over Federation: State-of-the-Art
• First public implementations exists
• AliBaba federation layer on top of Sesame
• Benchmark results show severy bottlenecks
• Efficiency issues
• Which data sets deliver results for which graph patterns?
• Localized execution of subqueries
• Global estimation of subquery result sizes
• Join oder optimization
• Incremental processing with completeness/correctness guarantees
Peter Haase, Tobias Mathäß, Michael Ziller: An Evaluation to Approaches for Federated
Query Processing over Linked Data. In Proc. I-Semantics 2010.
Linked Data Federation: Vision
Data Source Data Source Data Source Data Source
SPARQL
Endpoint
Virtualized Federation Layer
Consumer
Publisher
Local
Repository
RDF
Dump
Data
Component
RDF
Dump
Data
Component
Self-service Data Provisioning (Data-as-a-Service)
Challenge 3: Composition
Rich theory in database community for Federated Query
Processing exists
• Data Statistics
• Accuracy vs. index size
• Updating statistics
• Query Optimization
• Join types (e.g., semi-joins)
• Minimizing communication cost
• Optimizing execution localization
• Streaming results
Olaf Görlitz, Steffen Staab: Federated Data Management and Query Optimization for
Linked Open Data. In „New Directions of Web Data Management“, to appear.
Challenges
• Satisfying and standardized statistics framework for RDF
• void 2.0 not yet fully satisfying (e.g. histograms missing)
• Therefore:
• Establish comprehensive, standardized statistics framework for RDF
• Should also be tailored to query optimization
• Address specifics of RDF and SPARQL
• Graph-structured data model
• Importance of efficient merge joins
• OPTIONAL queries
• Exploit built-in semantics of RDFS
• Semantic Query Optimization
Michael Schmidt, Michael Meier, Georg Lausen: Foundations of SPARQL Query
Optimization. In Proc. ICDT 2010.
Conclusion
• Clear benefits of Linked Data application development platform
• Discovery of relevant data
• Virtualized integration of data sources as a key step to success
• Fast customization and extensions
• Information Workbench addressing these needs
• Still some work left to do
• Metadata quality and standardization
• Data quality in general, trust
• Data-as-a-Service
• Efficient federated query processing
Thank you for your attention!
CONTACT
fluid Operations AG Email: info@fluidOps.com
Altrottstr. 31 Website: www.fluidOps.com
Walldorf, Germany Tel.: +49 6227 3849-567

More Related Content

What's hot

Migrating On-Premises DBs to Cloud Systems
Migrating On-Premises DBs to Cloud SystemsMigrating On-Premises DBs to Cloud Systems
Migrating On-Premises DBs to Cloud SystemsChristopher Foot
 
Glynn Bird – Cloudant – Building applications for success.- NoSQL matters Bar...
Glynn Bird – Cloudant – Building applications for success.- NoSQL matters Bar...Glynn Bird – Cloudant – Building applications for success.- NoSQL matters Bar...
Glynn Bird – Cloudant – Building applications for success.- NoSQL matters Bar...NoSQLmatters
 
Big Data Quickstart Series 3: Perform Data Integration
Big Data Quickstart Series 3: Perform Data IntegrationBig Data Quickstart Series 3: Perform Data Integration
Big Data Quickstart Series 3: Perform Data IntegrationAlibaba Cloud
 
Machine learning services with SQL Server 2017
Machine learning services with SQL Server 2017Machine learning services with SQL Server 2017
Machine learning services with SQL Server 2017Mark Tabladillo
 
Continus sql with sql stream builder
Continus sql with sql stream builderContinus sql with sql stream builder
Continus sql with sql stream builderTimothy Spann
 
0812 2014 01_toronto-smac meetup_i_os_cloudant_worklight_part2
0812 2014 01_toronto-smac meetup_i_os_cloudant_worklight_part20812 2014 01_toronto-smac meetup_i_os_cloudant_worklight_part2
0812 2014 01_toronto-smac meetup_i_os_cloudant_worklight_part2Raul Chong
 
BlueData Integration with Cloudera Manager
BlueData Integration with Cloudera ManagerBlueData Integration with Cloudera Manager
BlueData Integration with Cloudera ManagerBlueData, Inc.
 
Introduction to Microsoft's Big Data Platform and Hadoop Primer
Introduction to Microsoft's Big Data Platform and Hadoop PrimerIntroduction to Microsoft's Big Data Platform and Hadoop Primer
Introduction to Microsoft's Big Data Platform and Hadoop PrimerDenny Lee
 
BlueData EPIC 2.0 Overview
BlueData EPIC 2.0 OverviewBlueData EPIC 2.0 Overview
BlueData EPIC 2.0 OverviewBlueData, Inc.
 
Cloudant Overview Bluemix Meetup from Lisa Neddam
Cloudant Overview Bluemix Meetup from Lisa NeddamCloudant Overview Bluemix Meetup from Lisa Neddam
Cloudant Overview Bluemix Meetup from Lisa NeddamRomeo Kienzler
 
Exploring microservices in a Microsoft landscape
Exploring microservices in a Microsoft landscapeExploring microservices in a Microsoft landscape
Exploring microservices in a Microsoft landscapeAlex Thissen
 
Accelerate Business Agility with PaaS
Accelerate Business Agility with PaaS Accelerate Business Agility with PaaS
Accelerate Business Agility with PaaS WSO2
 
Streaming Data Analytics with ksqlDB and Superset | Robert Stolz, Preset
Streaming Data Analytics with ksqlDB and Superset | Robert Stolz, PresetStreaming Data Analytics with ksqlDB and Superset | Robert Stolz, Preset
Streaming Data Analytics with ksqlDB and Superset | Robert Stolz, PresetHostedbyConfluent
 
Manage Microservices & Fast Data Systems on One Platform w/ DC/OS
Manage Microservices & Fast Data Systems on One Platform w/ DC/OSManage Microservices & Fast Data Systems on One Platform w/ DC/OS
Manage Microservices & Fast Data Systems on One Platform w/ DC/OSMesosphere Inc.
 
It's a wrap - closing keynote for nlOUG Tech Experience 2017 (16th June, The ...
It's a wrap - closing keynote for nlOUG Tech Experience 2017 (16th June, The ...It's a wrap - closing keynote for nlOUG Tech Experience 2017 (16th June, The ...
It's a wrap - closing keynote for nlOUG Tech Experience 2017 (16th June, The ...Lucas Jellema
 
Building Data Warehouses and Data Lakes in the Cloud - DevDay Austin 2017 Day 2
Building Data Warehouses and Data Lakes in the Cloud - DevDay Austin 2017 Day 2Building Data Warehouses and Data Lakes in the Cloud - DevDay Austin 2017 Day 2
Building Data Warehouses and Data Lakes in the Cloud - DevDay Austin 2017 Day 2Amazon Web Services
 
Dell/EMC Technical Validation of BlueData EPIC with Isilon
Dell/EMC Technical Validation of BlueData EPIC with IsilonDell/EMC Technical Validation of BlueData EPIC with Isilon
Dell/EMC Technical Validation of BlueData EPIC with IsilonGreg Kirchoff
 
Leveraging ApsaraDB to Deploy Business Data on the Cloud
Leveraging ApsaraDB to Deploy Business Data on the CloudLeveraging ApsaraDB to Deploy Business Data on the Cloud
Leveraging ApsaraDB to Deploy Business Data on the CloudOliver Theobald
 
Stateful Interaction In Serverless Architecture With Redis: Pyounguk Cho
Stateful Interaction In Serverless Architecture With Redis: Pyounguk ChoStateful Interaction In Serverless Architecture With Redis: Pyounguk Cho
Stateful Interaction In Serverless Architecture With Redis: Pyounguk ChoRedis Labs
 

What's hot (20)

Migrating On-Premises DBs to Cloud Systems
Migrating On-Premises DBs to Cloud SystemsMigrating On-Premises DBs to Cloud Systems
Migrating On-Premises DBs to Cloud Systems
 
Glynn Bird – Cloudant – Building applications for success.- NoSQL matters Bar...
Glynn Bird – Cloudant – Building applications for success.- NoSQL matters Bar...Glynn Bird – Cloudant – Building applications for success.- NoSQL matters Bar...
Glynn Bird – Cloudant – Building applications for success.- NoSQL matters Bar...
 
Big Data Quickstart Series 3: Perform Data Integration
Big Data Quickstart Series 3: Perform Data IntegrationBig Data Quickstart Series 3: Perform Data Integration
Big Data Quickstart Series 3: Perform Data Integration
 
Machine learning services with SQL Server 2017
Machine learning services with SQL Server 2017Machine learning services with SQL Server 2017
Machine learning services with SQL Server 2017
 
Continus sql with sql stream builder
Continus sql with sql stream builderContinus sql with sql stream builder
Continus sql with sql stream builder
 
0812 2014 01_toronto-smac meetup_i_os_cloudant_worklight_part2
0812 2014 01_toronto-smac meetup_i_os_cloudant_worklight_part20812 2014 01_toronto-smac meetup_i_os_cloudant_worklight_part2
0812 2014 01_toronto-smac meetup_i_os_cloudant_worklight_part2
 
BlueData Integration with Cloudera Manager
BlueData Integration with Cloudera ManagerBlueData Integration with Cloudera Manager
BlueData Integration with Cloudera Manager
 
Introduction to Microsoft's Big Data Platform and Hadoop Primer
Introduction to Microsoft's Big Data Platform and Hadoop PrimerIntroduction to Microsoft's Big Data Platform and Hadoop Primer
Introduction to Microsoft's Big Data Platform and Hadoop Primer
 
BlueData EPIC 2.0 Overview
BlueData EPIC 2.0 OverviewBlueData EPIC 2.0 Overview
BlueData EPIC 2.0 Overview
 
Cloudant Overview Bluemix Meetup from Lisa Neddam
Cloudant Overview Bluemix Meetup from Lisa NeddamCloudant Overview Bluemix Meetup from Lisa Neddam
Cloudant Overview Bluemix Meetup from Lisa Neddam
 
Exploring microservices in a Microsoft landscape
Exploring microservices in a Microsoft landscapeExploring microservices in a Microsoft landscape
Exploring microservices in a Microsoft landscape
 
Accelerate Business Agility with PaaS
Accelerate Business Agility with PaaS Accelerate Business Agility with PaaS
Accelerate Business Agility with PaaS
 
Streaming Data Analytics with ksqlDB and Superset | Robert Stolz, Preset
Streaming Data Analytics with ksqlDB and Superset | Robert Stolz, PresetStreaming Data Analytics with ksqlDB and Superset | Robert Stolz, Preset
Streaming Data Analytics with ksqlDB and Superset | Robert Stolz, Preset
 
Manage Microservices & Fast Data Systems on One Platform w/ DC/OS
Manage Microservices & Fast Data Systems on One Platform w/ DC/OSManage Microservices & Fast Data Systems on One Platform w/ DC/OS
Manage Microservices & Fast Data Systems on One Platform w/ DC/OS
 
Red Hat Storage Roadmap
Red Hat Storage RoadmapRed Hat Storage Roadmap
Red Hat Storage Roadmap
 
It's a wrap - closing keynote for nlOUG Tech Experience 2017 (16th June, The ...
It's a wrap - closing keynote for nlOUG Tech Experience 2017 (16th June, The ...It's a wrap - closing keynote for nlOUG Tech Experience 2017 (16th June, The ...
It's a wrap - closing keynote for nlOUG Tech Experience 2017 (16th June, The ...
 
Building Data Warehouses and Data Lakes in the Cloud - DevDay Austin 2017 Day 2
Building Data Warehouses and Data Lakes in the Cloud - DevDay Austin 2017 Day 2Building Data Warehouses and Data Lakes in the Cloud - DevDay Austin 2017 Day 2
Building Data Warehouses and Data Lakes in the Cloud - DevDay Austin 2017 Day 2
 
Dell/EMC Technical Validation of BlueData EPIC with Isilon
Dell/EMC Technical Validation of BlueData EPIC with IsilonDell/EMC Technical Validation of BlueData EPIC with Isilon
Dell/EMC Technical Validation of BlueData EPIC with Isilon
 
Leveraging ApsaraDB to Deploy Business Data on the Cloud
Leveraging ApsaraDB to Deploy Business Data on the CloudLeveraging ApsaraDB to Deploy Business Data on the Cloud
Leveraging ApsaraDB to Deploy Business Data on the Cloud
 
Stateful Interaction In Serverless Architecture With Redis: Pyounguk Cho
Stateful Interaction In Serverless Architecture With Redis: Pyounguk ChoStateful Interaction In Serverless Architecture With Redis: Pyounguk Cho
Stateful Interaction In Serverless Architecture With Redis: Pyounguk Cho
 

Viewers also liked

Presentation deploying cloud based services
Presentation   deploying cloud based servicesPresentation   deploying cloud based services
Presentation deploying cloud based servicesxKinAnx
 
Autonomic Management of Cloud Applications with Tonomi, Gluecon Keynote, 2015
Autonomic Management of Cloud Applications with Tonomi, Gluecon Keynote, 2015Autonomic Management of Cloud Applications with Tonomi, Gluecon Keynote, 2015
Autonomic Management of Cloud Applications with Tonomi, Gluecon Keynote, 2015Victoria Livschitz
 
Understand AWS Pricing
Understand AWS PricingUnderstand AWS Pricing
Understand AWS PricingLynn Langit
 
Deploying in the Cloud: Why and How
Deploying in the Cloud: Why and HowDeploying in the Cloud: Why and How
Deploying in the Cloud: Why and HowMatt Small
 
7 Things Testers Should Know About The Cloud with Bill Wilder & XBOSoft March...
7 Things Testers Should Know About The Cloud with Bill Wilder & XBOSoft March...7 Things Testers Should Know About The Cloud with Bill Wilder & XBOSoft March...
7 Things Testers Should Know About The Cloud with Bill Wilder & XBOSoft March...XBOSoft
 
Using JMeter and Google Analytics for Software Performance Testing
Using JMeter and Google Analytics for Software Performance TestingUsing JMeter and Google Analytics for Software Performance Testing
Using JMeter and Google Analytics for Software Performance TestingXBOSoft
 

Viewers also liked (7)

Presentation deploying cloud based services
Presentation   deploying cloud based servicesPresentation   deploying cloud based services
Presentation deploying cloud based services
 
Autonomic Management of Cloud Applications with Tonomi, Gluecon Keynote, 2015
Autonomic Management of Cloud Applications with Tonomi, Gluecon Keynote, 2015Autonomic Management of Cloud Applications with Tonomi, Gluecon Keynote, 2015
Autonomic Management of Cloud Applications with Tonomi, Gluecon Keynote, 2015
 
Understand AWS Pricing
Understand AWS PricingUnderstand AWS Pricing
Understand AWS Pricing
 
Deploying in the Cloud: Why and How
Deploying in the Cloud: Why and HowDeploying in the Cloud: Why and How
Deploying in the Cloud: Why and How
 
Pragmatic portfolio management, 25th september 2012
Pragmatic portfolio management, 25th september 2012Pragmatic portfolio management, 25th september 2012
Pragmatic portfolio management, 25th september 2012
 
7 Things Testers Should Know About The Cloud with Bill Wilder & XBOSoft March...
7 Things Testers Should Know About The Cloud with Bill Wilder & XBOSoft March...7 Things Testers Should Know About The Cloud with Bill Wilder & XBOSoft March...
7 Things Testers Should Know About The Cloud with Bill Wilder & XBOSoft March...
 
Using JMeter and Google Analytics for Software Performance Testing
Using JMeter and Google Analytics for Software Performance TestingUsing JMeter and Google Analytics for Software Performance Testing
Using JMeter and Google Analytics for Software Performance Testing
 

Similar to Cloud-based Linked Data Management for Self-service Application Development

Linked Data for the Masses: The approach and the Software
Linked Data for the Masses: The approach and the SoftwareLinked Data for the Masses: The approach and the Software
Linked Data for the Masses: The approach and the SoftwareIMC Technologies
 
Linked open data project
Linked open data projectLinked open data project
Linked open data projectFaathima Fayaza
 
Link Sets And Why They Are Important (EDF2012)
Link Sets And Why They Are Important (EDF2012)Link Sets And Why They Are Important (EDF2012)
Link Sets And Why They Are Important (EDF2012)Anja Jentzsch
 
Semantic Technologies for Enterprise Cloud Management
Semantic Technologies for Enterprise Cloud ManagementSemantic Technologies for Enterprise Cloud Management
Semantic Technologies for Enterprise Cloud ManagementPeter Haase
 
The Web of data and web data commons
The Web of data and web data commonsThe Web of data and web data commons
The Web of data and web data commonsJesse Wang
 
Architect’s Open-Source Guide for a Data Mesh Architecture
Architect’s Open-Source Guide for a Data Mesh ArchitectureArchitect’s Open-Source Guide for a Data Mesh Architecture
Architect’s Open-Source Guide for a Data Mesh ArchitectureDatabricks
 
COMSODE networking session at ICT Lisbon 2015
COMSODE networking session at ICT Lisbon 2015COMSODE networking session at ICT Lisbon 2015
COMSODE networking session at ICT Lisbon 2015Comsode - FP7 project
 
Linked Data (1st Linked Data Meetup Malmö)
Linked Data (1st Linked Data Meetup Malmö)Linked Data (1st Linked Data Meetup Malmö)
Linked Data (1st Linked Data Meetup Malmö)Anja Jentzsch
 
Building Linked Data Applications
Building Linked Data ApplicationsBuilding Linked Data Applications
Building Linked Data ApplicationsEUCLID project
 
BlueBrain Nexus Technical Introduction
BlueBrain Nexus Technical IntroductionBlueBrain Nexus Technical Introduction
BlueBrain Nexus Technical IntroductionBogdan Roman
 
Connected development data
Connected development dataConnected development data
Connected development dataRob Worthington
 
IoT Interoperability: a Hub-based Approach
IoT Interoperability: a Hub-based ApproachIoT Interoperability: a Hub-based Approach
IoT Interoperability: a Hub-based ApproachMichael Blackstock
 
Linked Data Platform as a novel approach for Enterprise Application Integra...
Linked Data Platform as a novel approach for Enterprise Application Integra...Linked Data Platform as a novel approach for Enterprise Application Integra...
Linked Data Platform as a novel approach for Enterprise Application Integra...Nandana Mihindukulasooriya
 
Linked Energy Data Generation
Linked Energy Data GenerationLinked Energy Data Generation
Linked Energy Data GenerationFilip Radulovic
 
Linked Services for the Web of Data
Linked Services for the Web of DataLinked Services for the Web of Data
Linked Services for the Web of DataCarlos Pedrinaci
 
Mobile Offline First for inclusive data that spans the data divide
Mobile Offline First for inclusive data that spans the data divideMobile Offline First for inclusive data that spans the data divide
Mobile Offline First for inclusive data that spans the data divideRob Worthington
 

Similar to Cloud-based Linked Data Management for Self-service Application Development (20)

Linked Data for the Masses: The approach and the Software
Linked Data for the Masses: The approach and the SoftwareLinked Data for the Masses: The approach and the Software
Linked Data for the Masses: The approach and the Software
 
Linked open data project
Linked open data projectLinked open data project
Linked open data project
 
Link Sets And Why They Are Important (EDF2012)
Link Sets And Why They Are Important (EDF2012)Link Sets And Why They Are Important (EDF2012)
Link Sets And Why They Are Important (EDF2012)
 
Linked data 20171106
Linked data 20171106Linked data 20171106
Linked data 20171106
 
Semantic Technologies for Enterprise Cloud Management
Semantic Technologies for Enterprise Cloud ManagementSemantic Technologies for Enterprise Cloud Management
Semantic Technologies for Enterprise Cloud Management
 
The Web of data and web data commons
The Web of data and web data commonsThe Web of data and web data commons
The Web of data and web data commons
 
Architect’s Open-Source Guide for a Data Mesh Architecture
Architect’s Open-Source Guide for a Data Mesh ArchitectureArchitect’s Open-Source Guide for a Data Mesh Architecture
Architect’s Open-Source Guide for a Data Mesh Architecture
 
COMSODE networking session at ICT Lisbon 2015
COMSODE networking session at ICT Lisbon 2015COMSODE networking session at ICT Lisbon 2015
COMSODE networking session at ICT Lisbon 2015
 
Linked (Open) Data
Linked (Open) DataLinked (Open) Data
Linked (Open) Data
 
Echoes Project
Echoes ProjectEchoes Project
Echoes Project
 
Linked Data (1st Linked Data Meetup Malmö)
Linked Data (1st Linked Data Meetup Malmö)Linked Data (1st Linked Data Meetup Malmö)
Linked Data (1st Linked Data Meetup Malmö)
 
Building Linked Data Applications
Building Linked Data ApplicationsBuilding Linked Data Applications
Building Linked Data Applications
 
BlueBrain Nexus Technical Introduction
BlueBrain Nexus Technical IntroductionBlueBrain Nexus Technical Introduction
BlueBrain Nexus Technical Introduction
 
Connected development data
Connected development dataConnected development data
Connected development data
 
IoT Interoperability: a Hub-based Approach
IoT Interoperability: a Hub-based ApproachIoT Interoperability: a Hub-based Approach
IoT Interoperability: a Hub-based Approach
 
Linked Data Platform as a novel approach for Enterprise Application Integra...
Linked Data Platform as a novel approach for Enterprise Application Integra...Linked Data Platform as a novel approach for Enterprise Application Integra...
Linked Data Platform as a novel approach for Enterprise Application Integra...
 
Linked Energy Data Generation
Linked Energy Data GenerationLinked Energy Data Generation
Linked Energy Data Generation
 
Linked Data
Linked DataLinked Data
Linked Data
 
Linked Services for the Web of Data
Linked Services for the Web of DataLinked Services for the Web of Data
Linked Services for the Web of Data
 
Mobile Offline First for inclusive data that spans the data divide
Mobile Offline First for inclusive data that spans the data divideMobile Offline First for inclusive data that spans the data divide
Mobile Offline First for inclusive data that spans the data divide
 

More from Peter Haase

Visual Ontology Modeling for Domain Experts and Business Users with metaphactory
Visual Ontology Modeling for Domain Experts and Business Users with metaphactoryVisual Ontology Modeling for Domain Experts and Business Users with metaphactory
Visual Ontology Modeling for Domain Experts and Business Users with metaphactoryPeter Haase
 
Hybrid Enterprise Knowledge Graphs
Hybrid Enterprise Knowledge GraphsHybrid Enterprise Knowledge Graphs
Hybrid Enterprise Knowledge GraphsPeter Haase
 
Ephedra: efficiently combining RDF data and services using SPARQL federation
Ephedra: efficiently combining RDF data and services using SPARQL federationEphedra: efficiently combining RDF data and services using SPARQL federation
Ephedra: efficiently combining RDF data and services using SPARQL federationPeter Haase
 
Building Enterprise-Ready Knowledge Graph Applications in the Cloud
Building Enterprise-Ready Knowledge Graph Applications in the CloudBuilding Enterprise-Ready Knowledge Graph Applications in the Cloud
Building Enterprise-Ready Knowledge Graph Applications in the CloudPeter Haase
 
ESWC 2017 Tutorial Knowledge Graphs
ESWC 2017 Tutorial Knowledge GraphsESWC 2017 Tutorial Knowledge Graphs
ESWC 2017 Tutorial Knowledge GraphsPeter Haase
 
Getting Started with Knowledge Graphs
Getting Started with Knowledge GraphsGetting Started with Knowledge Graphs
Getting Started with Knowledge GraphsPeter Haase
 
Smart Data Applications powered by the Wikidata Knowledge Graph
Smart Data Applications powered by the Wikidata Knowledge GraphSmart Data Applications powered by the Wikidata Knowledge Graph
Smart Data Applications powered by the Wikidata Knowledge GraphPeter Haase
 
Discovering Related Data Sources in Data Portals
Discovering Related Data Sources in Data PortalsDiscovering Related Data Sources in Data Portals
Discovering Related Data Sources in Data PortalsPeter Haase
 
Mapping, Interlinking and Exposing MusicBrainz as Linked Data
Mapping, Interlinking and Exposing MusicBrainz as Linked DataMapping, Interlinking and Exposing MusicBrainz as Linked Data
Mapping, Interlinking and Exposing MusicBrainz as Linked DataPeter Haase
 
The Information Workbench - Linked Data and Semantic Wikis in the Enterprise
The Information Workbench - Linked Data and Semantic Wikis in the EnterpriseThe Information Workbench - Linked Data and Semantic Wikis in the Enterprise
The Information Workbench - Linked Data and Semantic Wikis in the EnterprisePeter Haase
 
On demand access to Big Data through Semantic Technologies
 On demand access to Big Data through Semantic Technologies On demand access to Big Data through Semantic Technologies
On demand access to Big Data through Semantic TechnologiesPeter Haase
 
Linked Data as a Service
Linked Data as a ServiceLinked Data as a Service
Linked Data as a ServicePeter Haase
 
Fedbench - A Benchmark Suite for Federated Semantic Data Processing
Fedbench - A Benchmark Suite for Federated Semantic Data ProcessingFedbench - A Benchmark Suite for Federated Semantic Data Processing
Fedbench - A Benchmark Suite for Federated Semantic Data ProcessingPeter Haase
 
Everything Self-Service:Linked Data Applications with the Information Workbench
Everything Self-Service:Linked Data Applications with the Information WorkbenchEverything Self-Service:Linked Data Applications with the Information Workbench
Everything Self-Service:Linked Data Applications with the Information WorkbenchPeter Haase
 
The Information Workbench as a Self-Service Platform for Linked Data Applicat...
The Information Workbench as a Self-Service Platform for Linked Data Applicat...The Information Workbench as a Self-Service Platform for Linked Data Applicat...
The Information Workbench as a Self-Service Platform for Linked Data Applicat...Peter Haase
 

More from Peter Haase (15)

Visual Ontology Modeling for Domain Experts and Business Users with metaphactory
Visual Ontology Modeling for Domain Experts and Business Users with metaphactoryVisual Ontology Modeling for Domain Experts and Business Users with metaphactory
Visual Ontology Modeling for Domain Experts and Business Users with metaphactory
 
Hybrid Enterprise Knowledge Graphs
Hybrid Enterprise Knowledge GraphsHybrid Enterprise Knowledge Graphs
Hybrid Enterprise Knowledge Graphs
 
Ephedra: efficiently combining RDF data and services using SPARQL federation
Ephedra: efficiently combining RDF data and services using SPARQL federationEphedra: efficiently combining RDF data and services using SPARQL federation
Ephedra: efficiently combining RDF data and services using SPARQL federation
 
Building Enterprise-Ready Knowledge Graph Applications in the Cloud
Building Enterprise-Ready Knowledge Graph Applications in the CloudBuilding Enterprise-Ready Knowledge Graph Applications in the Cloud
Building Enterprise-Ready Knowledge Graph Applications in the Cloud
 
ESWC 2017 Tutorial Knowledge Graphs
ESWC 2017 Tutorial Knowledge GraphsESWC 2017 Tutorial Knowledge Graphs
ESWC 2017 Tutorial Knowledge Graphs
 
Getting Started with Knowledge Graphs
Getting Started with Knowledge GraphsGetting Started with Knowledge Graphs
Getting Started with Knowledge Graphs
 
Smart Data Applications powered by the Wikidata Knowledge Graph
Smart Data Applications powered by the Wikidata Knowledge GraphSmart Data Applications powered by the Wikidata Knowledge Graph
Smart Data Applications powered by the Wikidata Knowledge Graph
 
Discovering Related Data Sources in Data Portals
Discovering Related Data Sources in Data PortalsDiscovering Related Data Sources in Data Portals
Discovering Related Data Sources in Data Portals
 
Mapping, Interlinking and Exposing MusicBrainz as Linked Data
Mapping, Interlinking and Exposing MusicBrainz as Linked DataMapping, Interlinking and Exposing MusicBrainz as Linked Data
Mapping, Interlinking and Exposing MusicBrainz as Linked Data
 
The Information Workbench - Linked Data and Semantic Wikis in the Enterprise
The Information Workbench - Linked Data and Semantic Wikis in the EnterpriseThe Information Workbench - Linked Data and Semantic Wikis in the Enterprise
The Information Workbench - Linked Data and Semantic Wikis in the Enterprise
 
On demand access to Big Data through Semantic Technologies
 On demand access to Big Data through Semantic Technologies On demand access to Big Data through Semantic Technologies
On demand access to Big Data through Semantic Technologies
 
Linked Data as a Service
Linked Data as a ServiceLinked Data as a Service
Linked Data as a Service
 
Fedbench - A Benchmark Suite for Federated Semantic Data Processing
Fedbench - A Benchmark Suite for Federated Semantic Data ProcessingFedbench - A Benchmark Suite for Federated Semantic Data Processing
Fedbench - A Benchmark Suite for Federated Semantic Data Processing
 
Everything Self-Service:Linked Data Applications with the Information Workbench
Everything Self-Service:Linked Data Applications with the Information WorkbenchEverything Self-Service:Linked Data Applications with the Information Workbench
Everything Self-Service:Linked Data Applications with the Information Workbench
 
The Information Workbench as a Self-Service Platform for Linked Data Applicat...
The Information Workbench as a Self-Service Platform for Linked Data Applicat...The Information Workbench as a Self-Service Platform for Linked Data Applicat...
The Information Workbench as a Self-Service Platform for Linked Data Applicat...
 

Recently uploaded

Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Scott Keck-Warren
 
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...HostedbyConfluent
 
Pigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food ManufacturingPigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food ManufacturingPigging Solutions
 
Hyderabad Call Girls Khairatabad ✨ 7001305949 ✨ Cheap Price Your Budget
Hyderabad Call Girls Khairatabad ✨ 7001305949 ✨ Cheap Price Your BudgetHyderabad Call Girls Khairatabad ✨ 7001305949 ✨ Cheap Price Your Budget
Hyderabad Call Girls Khairatabad ✨ 7001305949 ✨ Cheap Price Your BudgetEnjoy Anytime
 
Pigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping ElbowsPigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping ElbowsPigging Solutions
 
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsKotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticscarlostorres15106
 
Next-generation AAM aircraft unveiled by Supernal, S-A2
Next-generation AAM aircraft unveiled by Supernal, S-A2Next-generation AAM aircraft unveiled by Supernal, S-A2
Next-generation AAM aircraft unveiled by Supernal, S-A2Hyundai Motor Group
 
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...shyamraj55
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationMichael W. Hawkins
 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationRidwan Fadjar
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slidespraypatel2
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonetsnaman860154
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking MenDelhi Call girls
 
Understanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitectureUnderstanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitecturePixlogix Infotech
 
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024BookNet Canada
 
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationBeyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationSafe Software
 
SIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge GraphSIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge GraphNeo4j
 
Azure Monitor & Application Insight to monitor Infrastructure & Application
Azure Monitor & Application Insight to monitor Infrastructure & ApplicationAzure Monitor & Application Insight to monitor Infrastructure & Application
Azure Monitor & Application Insight to monitor Infrastructure & ApplicationAndikSusilo4
 
Maximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptxMaximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptxOnBoard
 

Recently uploaded (20)

Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024
 
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
 
Pigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food ManufacturingPigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food Manufacturing
 
Hyderabad Call Girls Khairatabad ✨ 7001305949 ✨ Cheap Price Your Budget
Hyderabad Call Girls Khairatabad ✨ 7001305949 ✨ Cheap Price Your BudgetHyderabad Call Girls Khairatabad ✨ 7001305949 ✨ Cheap Price Your Budget
Hyderabad Call Girls Khairatabad ✨ 7001305949 ✨ Cheap Price Your Budget
 
Pigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping ElbowsPigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping Elbows
 
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsKotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
 
Next-generation AAM aircraft unveiled by Supernal, S-A2
Next-generation AAM aircraft unveiled by Supernal, S-A2Next-generation AAM aircraft unveiled by Supernal, S-A2
Next-generation AAM aircraft unveiled by Supernal, S-A2
 
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 Presentation
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slides
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonets
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men
 
Understanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitectureUnderstanding the Laravel MVC Architecture
Understanding the Laravel MVC Architecture
 
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
 
Vulnerability_Management_GRC_by Sohang Sengupta.pptx
Vulnerability_Management_GRC_by Sohang Sengupta.pptxVulnerability_Management_GRC_by Sohang Sengupta.pptx
Vulnerability_Management_GRC_by Sohang Sengupta.pptx
 
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationBeyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
 
SIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge GraphSIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge Graph
 
Azure Monitor & Application Insight to monitor Infrastructure & Application
Azure Monitor & Application Insight to monitor Infrastructure & ApplicationAzure Monitor & Application Insight to monitor Infrastructure & Application
Azure Monitor & Application Insight to monitor Infrastructure & Application
 
Maximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptxMaximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptx
 

Cloud-based Linked Data Management for Self-service Application Development

  • 1. Peter Haase, Michael Schmidt fluid Operations AG Cloud-based Linked Data Management for Self-service Application Development International Workshop on Scalable Semantic Computing Hangzhou, November 6, 2010
  • 2. Increasing Popularity of Linked Open Data • LOD cloud as of May 2009 • 4.7 billion triples • 142 million RDF links • LOD cloud as of Sep 2010 • 25 billion triples • 395 million RDF links • Covering various domains • Media • Life Science • Geography • Publications • … Linking Open Data cloud diagram, by Richard Cyganiak and Anja Jentzsch. http://lod-cloud.net/
  • 3. Agenda • Linked Data Application Development Opportunities and Challenges • Information Workbench as Platform for Linked Data Application Development • Accessing Linked Data as a Service Vision and First Experiences • Conclusions
  • 4. New Opportunities • Established standards define common data models, vocabularies, semantics • RDF/RDFS, OWL, SPARQL • From data silos to a web of data • Ease of specifying relationships in a decentralized way • Innovative applications that integrate data from various domains and sources • Linked Government Data • Linked Open Data • Benefits of Linked Data in the enterprise • Semantically integrate and interlink data scattered among systems • Cross the chasm between enterprise-internal and public data • Leverage semantic technologies for improved search and presentation
  • 5. Challenges in Building Linked Data Applications • Heterogeneity in various dimensions  Location of data (internal / external, open / closed)  Identifiers, structure and vocabularies  Ownership of data • Structured and unstructered data • Quality of Linked Data • Various forms of imperfection (erroneous, incomplete, imprecise data) • Trustworthiness • End-user oriented interfaces and interaction paradigms • Interfaces that operate over large amounts of data, flexible and dynamic schemas • Meaningful aggregation of the data • Support for expressive queries, while retaining intuitive interfaces • User-generated content • Collaborative annotation and knowledge acquisition
  • 6. The Information Workbench • Platform for Linked Data application development • Base functionality to build applications without any programming • SDK for easy extensions • Covering the entire lifecycle of interacting with Linked Data  Discovery of data sources  Integration of data sources  Visualization  Search and Exploration  Collaborative generation of data • Targeted at • Semantic Web Community • Linked Open Data community • Innovative Enterprises • Demo and source available at http://iwb.fluidops.com/.
  • 7. The IWB Application Development Process Linked Open Data Discovery • Visually explore data sets registered to global registries • Sort/filter data sets by domain, location, and many more facets to identify relevant data 1 LOD Discovery with the Information Workbench
  • 8. The IWB Application Development Process Linked Open Data Discovery • Visually explore data sets registered to global registries • Sort/filter data sets by domain, location, and many more facets to identify relevant data Data Integration • Integrate discovered Linked Data • Add providers for internal and external legacy data sources • Improve data quality, e.g. via incremental refinement of ontology 1 2
  • 9. The IWB Application Development Process Linked Open Data Discovery • Visually explore data sets registered to global registries • Sort/filter data sets by domain, location, and many more facets to identify relevant data Data Integration • Integrate discovered Linked Data • Add providers for internal and external legacy data sources • Improve data quality, e.g. via incremental refinement of ontology Customization • Declaratively specify UI based on available pool of widgets • Embed reports and charts into wiki pages and wiki page templates • Semantically annotate and interlink connected resources 1 3 2
  • 10. The IWB Application Development Process Linked Open Data Discovery • Visually explore data sets registered to global registries • Sort/filter data sets by domain, location, and many more facets to identify relevant data Data Integration • Integrate discovered Linked Data • Add providers for internal and external legacy data sources • Improve data quality, e.g. via incremental refinement of ontology Customization • Declaratively specify UI based on available pool of widgets • Embed reports and charts into wiki pages and wiki page templates • Semantically annotate and interlink connected resources Advanced System Configuration and Extensions • Use APIs and SDKs to implement own widgets and mashups • Script data providers to integrate data behind non-standard interfaces • Develop and integrate own modules, e.g. for customized search and information extraction 1 2 3 4
  • 11. Information Workbench Architecture • Extensible, widget-based UI • Resource-centric presentation • Living UI, which exploits semantics of underlying data • Large collection of predefined widgets, easily extendable • Search and information Access • Coexistence of structured and unstructured data • Different search paradigms (keyword and faceted search, semantic query completion) • Data integration through providers • Convert data from a data source into the RDF data format • Customizable, easily extensible • Use of public LOD registries
  • 12. Information Workbench Architecture In the remainder of the talk • Focus on challenges in data integration layer • In particular: virtualized, cloud- based integration of data sources
  • 13. Linked Data Integration – Where we are • Non-RDF data stored locally in the repository • On demand, this data can be updated periodically • RDF data can be… • persisted in repository, or • connected via naive federation layer (where possible)
  • 14. Linked Data Integration – Our Vision • Current way of publishing • Authors provide RDF dumps linked on some homepage • Provisioning information missing (data zipped, splitted, available in different formats, …) • Often also SPARQL endpoints (typically with poor response times) • How it should be done • Rich meta-data describing content, structure, properties of the data • Enable exploration of data via meta repositories • Efforts have been made (see CKAN), but… • … poor quality of meta data and data • Possibility for end-users to buy service guarantees • Integration details should be irrelevant to the end-user
  • 15. Software Components • Definition of „Software Components“ "A software component is a unit of composition with contractually specified interfaces and explicit context dependencies only. A software component can be deployed independently and is subject to composition by third parties." (wikipedia.org)
  • 16. Data Components • What we need for Linked Data: „Data Components“ • Interfaces: data components with precise interfaces and metadata • Deployment: easy provisioning and integration in applications • Composition: transparent access to atomic or composite units • Definition of „Software Components“ "A software component is a unit of composition with contractually specified interfaces and explicit context dependencies only. A software component can be deployed independently and is subject to composition by third parties." (wikipedia.org)
  • 17. Next Step: Data-as-a-Service • Idea • Producer provides data components • Consumers can access data components as a service
  • 18. Next Step: Data-as-a-Service • Idea • Producer provides data components • Consumers can access data components as a service • Possible realization: use cloud technology! • Sold on demand • Elastic • Fully managed by provider characteristics of cloud services, like e.g. AWS, exactly match the needs (just like it is the case for Software-as-a-Service)
  • 19. Next Step: Data-as-a-Service Virtualized Semantic Repositories Identification, composition, and use of (fragments of) datasets in manners that abstract the applications from the specific setup of the data management service (such as local vs. remote, federation, and distribution) • Idea • Producer provides data components • Consumers can access data components as a service • Possible realization: use cloud technology! • Sold on demand • Elastic • Fully managed by provider characteristics of cloud services, like e.g. AWS, exactly match the needs (just like it is the case for Software-as-a-Service)
  • 20. Challenge 1: Precise Interfaces • Standardization efforts for RDF meta data descriptions • Statistical Core Vocabulary (SCOVO) • Very flexible • Forms a good basis for describing RDF statistics • Vocabulary of Interlinked Data Sets (voiD) • Based on SCOVO • Used to publish meta information about Linked Data Sources • voiD 2 (in progress) • Dataset meta information, like source, description, dump, license • Used vocabularies/ontologies • Dataset interlinking • Statistics (e.g. distinct subject count, triples with given predicate etc.) • Open data registries • Comprehensive Knowledge Archive Network • Based on DublinCore and DERI‘s data catalog vocabulary (dcat)
  • 21. Challenge 2: Deployment • Based on Interfaces • Possibly based on cloud technologies • State-of-the-art not satisfying • URLs pointing to human readable description, but not the actual endpoint • Various forms of syntax errors in RDF documents • MIME types incorrect or missing • Endpoints/servers not reachable • Endpoint/file password protected
  • 22. Some Statistics Based on subset of LOD cloud (excluding a few extremely large datasets)
  • 23. Challenge 3: Composition Query Processing over Federation: State-of-the-Art • First public implementations exists • AliBaba federation layer on top of Sesame • Benchmark results show severy bottlenecks • Efficiency issues • Which data sets deliver results for which graph patterns? • Localized execution of subqueries • Global estimation of subquery result sizes • Join oder optimization • Incremental processing with completeness/correctness guarantees Peter Haase, Tobias Mathäß, Michael Ziller: An Evaluation to Approaches for Federated Query Processing over Linked Data. In Proc. I-Semantics 2010.
  • 24. Linked Data Federation: Vision Data Source Data Source Data Source Data Source SPARQL Endpoint Virtualized Federation Layer Consumer Publisher Local Repository RDF Dump Data Component RDF Dump Data Component Self-service Data Provisioning (Data-as-a-Service)
  • 25. Challenge 3: Composition Rich theory in database community for Federated Query Processing exists • Data Statistics • Accuracy vs. index size • Updating statistics • Query Optimization • Join types (e.g., semi-joins) • Minimizing communication cost • Optimizing execution localization • Streaming results Olaf Görlitz, Steffen Staab: Federated Data Management and Query Optimization for Linked Open Data. In „New Directions of Web Data Management“, to appear.
  • 26. Challenges • Satisfying and standardized statistics framework for RDF • void 2.0 not yet fully satisfying (e.g. histograms missing) • Therefore: • Establish comprehensive, standardized statistics framework for RDF • Should also be tailored to query optimization • Address specifics of RDF and SPARQL • Graph-structured data model • Importance of efficient merge joins • OPTIONAL queries • Exploit built-in semantics of RDFS • Semantic Query Optimization Michael Schmidt, Michael Meier, Georg Lausen: Foundations of SPARQL Query Optimization. In Proc. ICDT 2010.
  • 27. Conclusion • Clear benefits of Linked Data application development platform • Discovery of relevant data • Virtualized integration of data sources as a key step to success • Fast customization and extensions • Information Workbench addressing these needs • Still some work left to do • Metadata quality and standardization • Data quality in general, trust • Data-as-a-Service • Efficient federated query processing
  • 28. Thank you for your attention! CONTACT fluid Operations AG Email: info@fluidOps.com Altrottstr. 31 Website: www.fluidOps.com Walldorf, Germany Tel.: +49 6227 3849-567