LOD2 Webinar . 29.11.2011 . Page 1 http://lod2.eu
Creating Knowledge out of Interlinked Data
http://lod2.eu
LOD2 is a large-scale integrating project co-funded by the European Commission
within the FP7 Information and Communication Technologies Work Programme.
This 4-year project comprises leading Linked Open Data technology researchers,
companies, and service providers. Coming from across 12 countries the partners
are coordinated by the Agile Knowledge Engineering and Semantic Web Research
Group at the University of Leipzig, Germany.
LOD2 will integrate and syndicate Linked Data with existing large-scale
applications. The project shows the benefits in the scenarios of Media and
Publishing, Corporate Data intranets and eGovernment.
Once per month the LOD2 webinar series offers a free webinar about tools and services along the Linked Open Data Life Cycle.

Stay with us and learn more about acquisition, editing, composing, connected applications, and finally publishing Linked Open Data.
© 2012 OpenLink Software, All rights reserved.
Virtuoso 7.0
Enabling Massively Scalable Big Data Analytics
for RDF & SQL Data Management
By Orri Erling, Virtuoso Program Manager
& Hugh Williams, Professional Services Manager
Making Technology Work For You
Company Overview
OpenLink Company Overview
n  OpenLink Software is a privately-held company founded in 1992 by its President &
CEO, Kingsley Idehen. The company is an industry acclaimed technology innovator
in the following areas:
§  ODBC, JDBC, ADO.NET, and OLE-DB compliant Data Access Drivers for Oracle,
SQL Server, Informix, Ingres, Sybase, Progress, MySQL, and PostgreSQL
§  High-Performance & Scalable Multi-Model (Relational & Graph) Database
Technology
§  Data Integration Middleware (Data Virtualization Technology across a wide variety of
Protocols & Formats)
§  Web Application Server Technology
§  Linked Data Deployment & Management
§  Socially-enhanced Distributed Collaborative Applications Platforms (Weblogs, Wikis,
Feed Aggregation and Syndication, Web File Systems, Discussion Forums, etc.)
§  Identity Management.
Products & Services
Software Products
•  OpenLink Universal Data Access Drivers (UDA) -
High-performance data access drivers for ODBC, JDBC, ADO.NET, and OLE DB that provide
transparent access to enterprise databases.
•  OpenLink Virtuoso - available in single server and cluster editions that are deployed in cloud
and/or enterprise modes.
•  OpenLink Data Spaces Platform and Applications
•  OpenLink Ajax Toolkit
•  OpenLink Data Explorer
•  An Open Source Data Access SDK for ODBC
All OpenLink products are delivered by download from the Internet (http, ftp, etc.).
Temporary licenses are issued upon download and may be extended as needed, on a case-
by-case basis. Permanent licenses are issued once payment is received.
Products & Services
Professional and Support Services
•  OpenLink Product Support provides front-line email and phone support,
web-based online support, and a variety of premium services such as phone,
emergency, and onsite support.
•  Our Support staff is comprised of individuals with extensive knowledge of data
access, data migration, database administration, programming APIs, and other
relevant skills.
•  Services are sold in either Standard "Bronze" or Premium "Platinum" Support
packages, with varying hours of availability, response times, etc.
•  We also offer Custom Development, Training, and other Consultancy services.
These services can be offered on- or off-site. Expenses for travel, accommodations,
food, etc., associated with on-site services are charged separately.
Customers
OpenLink's installed base is in excess of 10,000 customers worldwide.
Examples include:
n  Data.Gov (U.S. Govt. Open
Linked Data initiative)
n  Verizon
n  Raytheon
n  Bank of America
n  CGI Federal
n  Elsevier
n  French National Library
n  Globo
n  Scottish Government
n  St Jude's Medical
n  Barclays Bank
n  Wells Fargo
n  and many more
Office Locations
USA
OpenLink Software, Inc
10 Burlington Mall Road
Suite 265
Burlington, MA 01803
Tel.: +1 781 273 0900
Fax: +1 781 229 8030
UK
OpenLink Software Ltd.
Airport House
Purley Way
Croydon, Surrey CR0 0XZ
Tel.: +44 (0)20 8681 7701
Fax: +44 (0)20 8681 7702
Virtuoso Universal Server
Overview
Situation Analysis
Data is growing exponentially along the following dimensions:
•  Volume
•  Velocity
•  Variety
All of this happens while the number of hours in a day remains 24.
Product Value Proposition
Enterprise and Individual Agility
via Data Access, Integration, and
Management, without
compromising performance,
scalability, security, and platform
independence.
Virtuoso locks you into an experience
(openness, performance, and scale) not
the platform itself.
-- Kingsley Idehen, Founder & CEO, OpenLink
Software
Product Architecture
A high-performance, scalable,
secure, and operating-system-
independent server designed
to handle contemporary
challenges associated with
standards compliant data
access, data integration, and
data management.
Data Virtualization Middleware
An in-built middleware layer
(“Sponger”) for creating
Transient & Persistent
Views over Heterogeneous
Data Sources.
Sophisticated Content Crawler
DBMS-hosted Content Crawler that leverages loosely coupled binding to
the Sponger Middleware component for transformation of unstructured
and semi-structured data into Linked Data.
Core Platform behind LOD Cloud
Core Platform (Graph DBMS and Linked Data Deployment) behind DBpedia, many
bubbles in the LOD Cloud, and the LOD Cloud cache itself.
Virtuoso Linked Data projects
•  DBpedia - public SPARQL endpoint over the DBpedia data
(and international Chapters)
•  LOD Cloud Cache - public server hosting LOD cloud datasets
•  URIBurner - Linked Data generation & transformation service
•  Linked Geo Data - OpenStreetMap Spatial data as Linked Data
•  Sindice - SPARQL endpoint behind its Semantic Web Index
•  Data.gov - US Government Linked Data
•  Health.data.gov - Clinical Quality Linked Data on health.data.gov
•  Seevl - Linked Data music discovery service
•  Bio2RDF - Life science data mapped to Linked Data
•  Neurocommons - Life science data mapped to Linked Data
•  Musicbrainz - MusicBrainz database published as Linked Data
•  Open PHACTS - DBpedia-like Linked Data Space for Pharma
•  Others - Many others …
Powerful Standards Support
ODBC compliance enables use of client applications (e.g. Microsoft Access) as front-
ends for Virtuoso, 3rd party RDBMS engines, and the World Wide Web hosted Linked
Open Data Cloud.
Powerful Standards Support Cont’d
ODBC & HTML5 compliance enables development of rich client apps that
leverage the WebDB-ODBC bridge for accessing data across: Virtuoso, 3rd party
RDBMS engines, and the World Wide Web hosted Linked Open Data Cloud.
Insight Discovery & Exploration
Native Faceted Browsing that enables multi-dimensional drill-downs via any browser
Insight Discovery & Exploration
Microsoft Silverlight or HTML5 based PivotViewer Front-End for SPARQL and SPARQL-FED
Queries
Powerful SPARQL Query Service
Basic SPARQL Endpoint for Creating Query Definitions & Sharing Query Results.
Example: health.data.gov data directly from a Web Browser.
Powerful SPARQL Query Builder
Use Query By Example (QBE) Patterns to Construct & Share Query
Results.
How Do I Get Going?
n  Download, install, and experience the power of coherent
integration of disparate data sources, data access protocols,
and data representation formats.
n  In an nutshell, commence exploitation of powerful business
intelligence, socially enhanced collaboration, data virtualization,
and entity analytics without writing a line of code!
n  Turn "Big Data" into exploitable "Smart Data"
without compromise!
n  Will be integrated into the next release of the LOD2 Stack
Virtuoso 7.0
Flexible Big Data Challenge
n  Data Agility is challenged by Volume, Velocity,
and Variety
n  “Schema Last” is great - if the price is right
n  RDF, graphs promise powerful querying with the
flexibility and scale of NoSQL key-value stores
n  Inference may be good for integration, if can
express the right things, beyond OWL
n  RDF data management technology must learn
from the lessons of SQL RDBMS, everything
applies
Virtuoso 7.0 Mission Statement
Destruction of the following items as impediments to
Big (Open) Linked Data exploitation:
n Performance
n Scalability
n Platform Independence
n Security & Privacy
n Price
Virtuoso 7.0 & Big Data Myths
Myths put to rest:
n Scalable Open Ended SPARQL Endpoints
n Scalable Open Ended Read-Write SPARQL
Endpoints
n Fine-grained Access Controls underlying Read-
Only or Read-Write endpoints.
Virtuoso Column Store Features
n  Supports SQL and SPARQL query languages
n  Compact column-wise storage
n  Vectored execution of commands
n  Shared nothing scale out for clusters
n  Powerful procedure language with parallel,
distributed control structures
n  Full-text and geospatial indexes
Storage Engine
n  Freely mix column-, and row-wise indices
n  All SQL and RDF data types natively supported , single
execution engine for SQL/SPARQL
n  Column compression 3x more space efficient than row-
wise compression for RDF
n  Column stores are not only for big scans, random access
surpasses rows as as soon as there is some locality
n  9 B/quad with DBpedia, 7 B/quad with BSBM or RDF-H,
14 B/quad with web crawls (PSOG, POSG, SP, OP, GS,
excluding literals)
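
The bytes-per-quad figures above translate directly into index footprint. A minimal sketch of that arithmetic, assuming decimal gigabytes and a hypothetical helper name `index_size_gb`; literals are excluded, as on the slide:

```python
# Bytes per quad as quoted on the slide (index data only, excluding literals).
BYTES_PER_QUAD = {"DBpedia": 9, "BSBM or RDF-H": 7, "web crawl": 14}


def index_size_gb(quads: int, bytes_per_quad: float) -> float:
    """Estimated index size in decimal gigabytes (hypothetical helper)."""
    return quads * bytes_per_quad / 1e9


for dataset, bytes_per_quad in BYTES_PER_QUAD.items():
    size = index_size_gb(1_000_000_000, bytes_per_quad)
    print(f"{dataset}: about {size:.0f} GB per billion quads")
```

This only scales the quoted figures; real footprints also depend on literals and on the data itself.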
Execution Engine
n  Vectoring is not only for column stores
n  Vectoring makes a random access into a linear merge
join if there is any locality: Always a win, mileage
depends on run time factors
n  Vectoring eliminates interpretation overhead and
makes CPU friendly code possible
n  Even with run time data typing, vectoring allows use of
type-specific operators on homogenous data, e.g.
arithmetic
n  Dynamically adjust vector size: Larger vector may not
fit in cache but will get better locality for random access
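
The claim that vectoring turns random access into a linear merge join can be illustrated with a toy index. This is a sketch under stated assumptions, not Virtuoso's implementation: the index is a sorted Python list of unique `(key, value)` pairs, and both function names are hypothetical.

```python
from bisect import bisect_left


def lookup_one_by_one(index, keys):
    """Tuple-at-a-time: every key triggers an independent binary search."""
    index_keys = [k for k, _ in index]
    found = {}
    for key in keys:
        i = bisect_left(index_keys, key)
        if i < len(index) and index[i][0] == key:
            found[key] = index[i][1]
    return found


def lookup_vectored(index, keys):
    """Vectored: sort the whole batch of keys, then make a single forward
    pass over the sorted index, exactly like a merge join."""
    found = {}
    pos = 0
    for key in sorted(set(keys)):
        # The cursor only moves forward, so nearby keys touch nearby entries.
        while pos < len(index) and index[pos][0] < key:
            pos += 1
        if pos < len(index) and index[pos][0] == key:
            found[key] = index[pos][1]
    return found


index = [(i, i * i) for i in range(0, 100, 2)]  # sorted, unique keys
batch = [40, 3, 8, 40, 98, 101]
print(lookup_vectored(index, batch) == lookup_one_by_one(index, batch))
```

Both functions return the same matches; the vectored version reads the index sequentially, which is where the locality benefit comes from.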
Graph operations
n  Run time computation plus caching instead of
materialization
n  SPARQL/SQL extension for arbitrary transitive subqueries:
n  Flexible options for returning shortest paths, all paths, all /
distinct reachable, attributes of steps on paths etc.
n  Efficient execution, searching the graph from both ends if
looking for a path with ends given
n  Query operators for RDF hierarchy traversal
n  Special query operator for OWL sameAs and IFP based
identity
n  Taking OWL sameAs / IFP identity into account for
DISTINCT /GROUP BY
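
Searching the graph from both ends when both path endpoints are given can be sketched as a bidirectional breadth-first search. This is a generic textbook version over a directed edge list, not Virtuoso's operator; all function names are hypothetical.

```python
from collections import defaultdict


def shortest_path(edges, start, goal):
    """Return one shortest directed path from start to goal, or None."""
    if start == goal:
        return [start]
    forward, backward = defaultdict(list), defaultdict(list)
    for source, target in edges:
        forward[source].append(target)
        backward[target].append(source)
    parent_f = {start: None}  # node -> previous node on the way from start
    parent_b = {goal: None}   # node -> next node on the way to goal
    frontier_f, frontier_b = [start], [goal]
    while frontier_f and frontier_b:
        # Always expand the smaller frontier by one full level.
        if len(frontier_f) <= len(frontier_b):
            frontier_f, meet = _expand(frontier_f, forward, parent_f, parent_b)
        else:
            frontier_b, meet = _expand(frontier_b, backward, parent_b, parent_f)
        if meet is not None:
            return _join(meet, parent_f, parent_b)
    return None


def _expand(frontier, adjacency, seen, other_seen):
    """Expand one level; stop as soon as the two searches touch."""
    next_frontier = []
    for node in frontier:
        for neighbour in adjacency[node]:
            if neighbour in seen:
                continue
            seen[neighbour] = node
            if neighbour in other_seen:
                return next_frontier, neighbour
            next_frontier.append(neighbour)
    return next_frontier, None


def _join(meet, parent_f, parent_b):
    """Stitch the start->meet and meet->goal halves together."""
    path = []
    node = meet
    while node is not None:
        path.append(node)
        node = parent_f[node]
    path.reverse()
    node = parent_b[meet]
    while node is not None:
        path.append(node)
        node = parent_b[node]
    return path


edges = [("a", "b"), ("b", "c"), ("c", "d"), ("a", "e"), ("e", "d"), ("d", "f")]
print(shortest_path(edges, "a", "f"))
```

Each side explores only about half the path depth, so far fewer nodes are visited than in a one-sided search on graphs with high fanout.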
Query Optimization Challenges
n  Typical SQL stats do not help
n  Need to measure data cardinalities starting from
constants in the query
n  Need to sample fanout predicate by predicate, as
needed
n  Predicate and class hierarchies are easy to
handle in sampling
n  sameAs or IFP inference voids all guesses
n  Is hash join worthwhile? High setup cost means
that one must be sure of cardinalities first
Deep Sampling
n  Everything is a join -> sampling must also do joins
n  As the candidate plan grows, the cost model
executes all the ops on a sample of the data
n  Actual cardinality and locality are known, also when
search conditions are correlated
n  Having high confidence in the cost model, hash join
plans become safe and attractive
n  Even though there is an indexed access path for all,
a scan can be better because it produces results in
order. Need to be sure of selectivity before taking the
risk
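
The idea of running the candidate plan's joins on a sample can be sketched as follows. This is a toy illustration only: join steps are modelled as dictionaries from a value to its matching values, and the function name, parameters, and sample data are all hypothetical.

```python
import random


def estimate_join_cardinality(bindings, join_steps, sample_size=100, seed=0):
    """Estimate the result size of a chain of joins by actually executing
    the joins on a sample of the starting bindings and scaling up."""
    if not bindings:
        return 0.0
    rng = random.Random(seed)
    if len(bindings) <= sample_size:
        sample = list(bindings)
    else:
        sample = rng.sample(bindings, sample_size)
    scale = len(bindings) / len(sample)
    rows = sample
    for step in join_steps:
        # Each step is value -> list of matching values (the fanout).
        rows = [match for value in rows for match in step.get(value, [])]
    return len(rows) * scale


people = [f"p{i}" for i in range(10)]
works_at = {f"p{i}": [f"c{i % 2}"] for i in range(10)}  # everyone has one employer
located_in = {"c0": ["x", "y"]}                          # only c0 has locations
print(estimate_join_cardinality(people, [works_at, located_in]))
```

Because the joins are executed rather than multiplied out from per-predicate statistics, correlation between the steps is reflected in the estimate.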
Elastic Cluster
n  Data is partitioned by key, different indices may
have different partition keys
n  Partitions may split and migrate between servers
n  Partitions may be kept in duplicate for fault
tolerance/load balancing
n  Actual access stats drive partition split and
placement
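
Partitioning by key with a different partition key per index can be sketched like this. The choice of subject for PSOG and object for POSG is an assumption made for illustration, as are the hash function and the function names; the slides do not state which column each index is partitioned on.

```python
import zlib


def partition_of(key: str, n_partitions: int) -> int:
    """Stable hash partitioning (CRC32 is an illustrative choice)."""
    return zlib.crc32(key.encode("utf-8")) % n_partitions


def place_quad(quad, n_partitions):
    """Return the partition holding this quad in each index.

    Assumption for illustration: PSOG is partitioned on the subject and
    POSG on the object, so the same quad may live on different servers
    depending on the index."""
    subject, _predicate, obj, _graph = quad
    return {
        "PSOG": partition_of(subject, n_partitions),
        "POSG": partition_of(obj, n_partitions),
    }


first = place_quad(("s1", "knows", "o1", "g"), 4)
second = place_quad(("s1", "likes", "o2", "g"), 4)
print(first, second)
```

Quads that share a subject land in the same PSOG partition, so a lookup by subject touches a single server.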
Optimizing for Cluster
n  Vectored execution is natural in a cluster since single-tuple
messages are not an option
n  Keep max ops in flight at all times, always send long messages
n  Fully distributed query coordination:
¡  Any node can service a client request. Correlated subqueries, stored
procedures may execute anywhere, arbitrary parallelism and recursion
between partitions
¡  On single shared memory box, cluster is approximately even with single
process multithreading, low overhead
¡  1.8x more throughput in BSBM BI when going from 1 to 2 machines
¡  Distributed stored procedures, send the proc to the data, as in map-
reduce, except that there are no limits on cross partition calling/recursion
¡  Choice of transactional and auto-commit update semantics, can have
atomic ops without global transaction
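
Why vectoring suits a cluster can be shown by grouping a batch of lookups by destination, so that each partition receives one long message instead of many single-tuple ones. A minimal sketch; the hash function and the name `batch_by_partition` are illustrative assumptions.

```python
import zlib
from collections import defaultdict


def batch_by_partition(keys, n_partitions):
    """Group lookup keys by destination partition: one message per partition."""
    batches = defaultdict(list)
    for key in keys:
        destination = zlib.crc32(key.encode("utf-8")) % n_partitions
        batches[destination].append(key)
    return dict(batches)


keys = [f"k{i}" for i in range(1000)]
batches = batch_by_partition(keys, 4)
print(f"{len(keys)} lookups sent as {len(batches)} messages")
```

With four partitions, a thousand lookups need at most four messages, which keeps per-message overhead small relative to the work shipped.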
Cluster Architecture Diagrams
n  55 billion triples in LOD cache, only 384 GB of
RAM, 2TB disk
n  2 x 384 GB of RAM, 4TB SSD
n  Most of Linked Open Data and Web Crawls
n  http://lod.openlinksw.com
n  http://lod.openlinksw.com/sparql
LOD Cache
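
A SPARQL endpoint such as http://lod.openlinksw.com/sparql accepts queries over plain HTTP. The sketch below only builds the GET request, using the `query` parameter from the SPARQL Protocol and an `Accept` header for JSON results; the helper name and the example query are illustrative, and whether the endpoint is still reachable is not something this deck can confirm.

```python
from urllib.parse import urlencode
from urllib.request import Request

ENDPOINT = "http://lod.openlinksw.com/sparql"  # URL as listed on the slide


def build_sparql_request(query: str, endpoint: str = ENDPOINT) -> Request:
    """Build (but do not send) a SPARQL Protocol GET request."""
    url = endpoint + "?" + urlencode({"query": query})
    return Request(url, headers={"Accept": "application/sparql-results+json"})


request = build_sparql_request("SELECT * WHERE { ?s ?p ?o } LIMIT 5")
print(request.full_url)
# To execute it, pass the request to urllib.request.urlopen(request)
# and parse the response body as JSON.
```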
Independent Benchmark Report from CWI: Berlin SPARQL Benchmark

#Triples    | Source File Size | Compressed Source File Size | Source Data Files Per Loader Node | Final Database File Size | Load Time
50 Billion  | 2.8 TB           | 240 GB                      | 30 GB                             | 1.8 TB                   | 10h 54s
150 Billion | 8.5 TB           | 728 GB                      | 91 GB                             | 5.6 TB                   | n/a
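
Two ratios can be read off the load figures: how much the source files compress, and how many database bytes each triple ends up costing. A small sketch of that arithmetic, assuming TB and GB are decimal units (the slides do not say); the variable and function names are illustrative.

```python
TB, GB = 1e12, 1e9  # assumption: decimal units

# Triple count -> sizes quoted in the table above.
LOADS = {
    50e9: {"source": 2.8 * TB, "compressed": 240 * GB, "database": 1.8 * TB},
    150e9: {"source": 8.5 * TB, "compressed": 728 * GB, "database": 5.6 * TB},
}


def compression_ratio(load):
    """Uncompressed source size divided by compressed source size."""
    return load["source"] / load["compressed"]


def database_bytes_per_triple(triples, load):
    """Final database file size divided by the number of triples."""
    return load["database"] / triples


for triples, load in LOADS.items():
    print(
        f"{triples / 1e9:.0f} billion triples: "
        f"source compresses {compression_ratio(load):.1f}x, "
        f"{database_bytes_per_triple(triples, load):.1f} database bytes per triple"
    )
```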
Store Comparisons Summary: Exploration oriented queries (QMpH)
Berlin SPARQL Benchmark

Dataset sizes: 100 Million Triples | 200 Million Triples | 1 Billion Triples
Virtuoso 6: 37,678.319 | 32,969.006 | 8,984.789
Virtuoso 7: 47,178.820 27,933.682
Store Comparisons Summary: Business Intelligence oriented queries (QMpH)
Berlin SPARQL Benchmark

Dataset sizes: 10 Million Triples | 100 Million Triples | 1 Billion Triples
Virtuoso 6: 431.465 | 35.342 | 2.383
Virtuoso 7: 996.795 75.236
Store Comparisons Summary: Exploration oriented queries (Cluster Edition) (QMpH)
Berlin SPARQL Benchmark

           | 10 Billion Triples | 50 Billion Triples | 150 Billion Triples
Virtuoso 7 | 2,360.210          | 4,253.157          | 2,090.574
Store Comparisons Summary: Business Intelligence oriented queries (Cluster Edition) (QMpH)
Berlin SPARQL Benchmark

           | 10 Billion Triples | 50 Billion Triples | 150 Billion Triples
Virtuoso 7 | 13.078             | 0.964              | 0.285
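
QMpH in the Berlin SPARQL Benchmark is the number of query mixes completed per hour, so dividing 3600 by it gives the average time per mix. A short sketch applying that to the cluster Business Intelligence figures above; the function name is illustrative.

```python
# Cluster Edition, Business Intelligence use case, QMpH from the table above.
CLUSTER_BI_QMPH = {
    "10 Billion Triples": 13.078,
    "50 Billion Triples": 0.964,
    "150 Billion Triples": 0.285,
}


def seconds_per_query_mix(qmph: float) -> float:
    """Average wall-clock seconds for one query mix."""
    return 3600.0 / qmph


for scale, qmph in CLUSTER_BI_QMPH.items():
    seconds = seconds_per_query_mix(qmph)
    print(f"{scale}: about {seconds / 60:.1f} minutes per query mix")
```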
Future Work
n  Complete deep sampling: enhanced query
optimization plans
n  Run TPC-H and TPC-DS in SQL and their 1:1
translation in SPARQL, demonstrating SPARQL
performance as near to SQL as possible
Additional Information
n  OpenLink Software
¡  OpenLink Software - www.openlinksw.com
¡  OpenLink Virtuoso - virtuoso.openlinksw.com
¡  Universal Data Access - uda.openlinksw.com
n  Social Media Data spaces
¡  http://virtuoso.openlinksw.com/blog/ (weblog)
¡  https://plus.google.com/112399767740508618350/
posts (Google+)
¡  https://twitter.com/OpenLink (Twitter)
¡  http://www.linkedin.com/company/openlink-software
(LinkedIn)
¡  Hashtag: #LinkedData (Anywhere)
EU-FP7 LOD2 WP6 – 25.-26.03.2013. Page 47 http://lod2.eu
Creating Knowledge out of Interlinked Data
LOD2 Stack Usability Survey 2013
w.surveygizmo.com/s3/1188229/LOD2-Stack-Usability-Survey-2013

More Related Content

What's hot

OpenLink Virtuoso - Management & Decision Makers Overview
OpenLink Virtuoso - Management & Decision Makers OverviewOpenLink Virtuoso - Management & Decision Makers Overview
OpenLink Virtuoso - Management & Decision Makers OverviewKingsley Uyi Idehen
 
Exploiting Linked (Open) Data via Microsoft Access
Exploiting Linked (Open) Data via Microsoft AccessExploiting Linked (Open) Data via Microsoft Access
Exploiting Linked (Open) Data via Microsoft AccessKingsley Uyi Idehen
 
Using SAP Crystal Reports as a Linked (Open) Data Front-End via ODBC
Using SAP Crystal Reports as a Linked (Open) Data Front-End via ODBCUsing SAP Crystal Reports as a Linked (Open) Data Front-End via ODBC
Using SAP Crystal Reports as a Linked (Open) Data Front-End via ODBCKingsley Uyi Idehen
 
Understanding Linked Data via EAV Model based Structured Descriptions
Understanding Linked Data via EAV Model based Structured DescriptionsUnderstanding Linked Data via EAV Model based Structured Descriptions
Understanding Linked Data via EAV Model based Structured DescriptionsKingsley Uyi Idehen
 
HTML5 based PivotViewer for Visualizing LInked Data
HTML5 based PivotViewer for Visualizing LInked Data HTML5 based PivotViewer for Visualizing LInked Data
HTML5 based PivotViewer for Visualizing LInked Data Kingsley Uyi Idehen
 
Making the Conceptual Layer Real via HTTP based Linked Data
Making the Conceptual Layer Real via HTTP based Linked DataMaking the Conceptual Layer Real via HTTP based Linked Data
Making the Conceptual Layer Real via HTTP based Linked DataKingsley Uyi Idehen
 
Virtuoso, The Prometheus of RDF -- Sematics 2014 Conference Keynote
 Virtuoso, The Prometheus of RDF -- Sematics 2014 Conference Keynote Virtuoso, The Prometheus of RDF -- Sematics 2014 Conference Keynote
Virtuoso, The Prometheus of RDF -- Sematics 2014 Conference KeynoteKingsley Uyi Idehen
 
Microsoft 365 Delve profile integration with Connections
Microsoft 365 Delve profile integration with ConnectionsMicrosoft 365 Delve profile integration with Connections
Microsoft 365 Delve profile integration with ConnectionsMartin Schmidt
 
Virtuoso ODBC Driver Configuration & Usage (Windows)
Virtuoso ODBC Driver Configuration & Usage (Windows)Virtuoso ODBC Driver Configuration & Usage (Windows)
Virtuoso ODBC Driver Configuration & Usage (Windows)Kingsley Uyi Idehen
 
Setting up Dataverse repository for research data
Setting up Dataverse repository for research dataSetting up Dataverse repository for research data
Setting up Dataverse repository for research datavty
 
Clariah Tech Day: Controlled Vocabularies and Ontologies in Dataverse
Clariah Tech Day: Controlled Vocabularies and Ontologies in DataverseClariah Tech Day: Controlled Vocabularies and Ontologies in Dataverse
Clariah Tech Day: Controlled Vocabularies and Ontologies in Dataversevty
 
External controlled vocabularies support in Dataverse
External controlled vocabularies support in DataverseExternal controlled vocabularies support in Dataverse
External controlled vocabularies support in Dataversevty
 
Building COVID-19 Museum as Open Science Project
Building COVID-19 Museum as Open Science ProjectBuilding COVID-19 Museum as Open Science Project
Building COVID-19 Museum as Open Science Projectvty
 
5 years of Dataverse evolution
5 years of Dataverse evolution 5 years of Dataverse evolution
5 years of Dataverse evolution vty
 

What's hot (20)

OpenLink Virtuoso - Management & Decision Makers Overview
OpenLink Virtuoso - Management & Decision Makers OverviewOpenLink Virtuoso - Management & Decision Makers Overview
OpenLink Virtuoso - Management & Decision Makers Overview
 
Exploiting Linked (Open) Data via Microsoft Access
Exploiting Linked (Open) Data via Microsoft AccessExploiting Linked (Open) Data via Microsoft Access
Exploiting Linked (Open) Data via Microsoft Access
 
Understanding Data
Understanding Data Understanding Data
Understanding Data
 
Using SAP Crystal Reports as a Linked (Open) Data Front-End via ODBC
Using SAP Crystal Reports as a Linked (Open) Data Front-End via ODBCUsing SAP Crystal Reports as a Linked (Open) Data Front-End via ODBC
Using SAP Crystal Reports as a Linked (Open) Data Front-End via ODBC
 
Integrating Semantic Systems
Integrating Semantic SystemsIntegrating Semantic Systems
Integrating Semantic Systems
 
Understanding Linked Data via EAV Model based Structured Descriptions
Understanding Linked Data via EAV Model based Structured DescriptionsUnderstanding Linked Data via EAV Model based Structured Descriptions
Understanding Linked Data via EAV Model based Structured Descriptions
 
HTML5 based PivotViewer for Visualizing LInked Data
HTML5 based PivotViewer for Visualizing LInked Data HTML5 based PivotViewer for Visualizing LInked Data
HTML5 based PivotViewer for Visualizing LInked Data
 
Virtuoso Platform Overview
Virtuoso Platform OverviewVirtuoso Platform Overview
Virtuoso Platform Overview
 
Data Residency: Challenges and the Need for Standards
Data Residency: Challenges and the Need for StandardsData Residency: Challenges and the Need for Standards
Data Residency: Challenges and the Need for Standards
 
Making the Conceptual Layer Real via HTTP based Linked Data
Making the Conceptual Layer Real via HTTP based Linked DataMaking the Conceptual Layer Real via HTTP based Linked Data
Making the Conceptual Layer Real via HTTP based Linked Data
 
Virtuoso, The Prometheus of RDF -- Sematics 2014 Conference Keynote
 Virtuoso, The Prometheus of RDF -- Sematics 2014 Conference Keynote Virtuoso, The Prometheus of RDF -- Sematics 2014 Conference Keynote
Virtuoso, The Prometheus of RDF -- Sematics 2014 Conference Keynote
 
Microsoft 365 Delve profile integration with Connections
Microsoft 365 Delve profile integration with ConnectionsMicrosoft 365 Delve profile integration with Connections
Microsoft 365 Delve profile integration with Connections
 
Virtuoso ODBC Driver Configuration & Usage (Windows)
Virtuoso ODBC Driver Configuration & Usage (Windows)Virtuoso ODBC Driver Configuration & Usage (Windows)
Virtuoso ODBC Driver Configuration & Usage (Windows)
 
LOD2 webinar series: Virtuoso by OpenLink Software
LOD2 webinar series: Virtuoso by OpenLink SoftwareLOD2 webinar series: Virtuoso by OpenLink Software
LOD2 webinar series: Virtuoso by OpenLink Software
 
Setting up Dataverse repository for research data
Setting up Dataverse repository for research dataSetting up Dataverse repository for research data
Setting up Dataverse repository for research data
 
Clariah Tech Day: Controlled Vocabularies and Ontologies in Dataverse
Clariah Tech Day: Controlled Vocabularies and Ontologies in DataverseClariah Tech Day: Controlled Vocabularies and Ontologies in Dataverse
Clariah Tech Day: Controlled Vocabularies and Ontologies in Dataverse
 
External controlled vocabularies support in Dataverse
External controlled vocabularies support in DataverseExternal controlled vocabularies support in Dataverse
External controlled vocabularies support in Dataverse
 
Building COVID-19 Museum as Open Science Project
Building COVID-19 Museum as Open Science ProjectBuilding COVID-19 Museum as Open Science Project
Building COVID-19 Museum as Open Science Project
 
5 years of Dataverse evolution
5 years of Dataverse evolution 5 years of Dataverse evolution
5 years of Dataverse evolution
 
LOD2 Webinar Series: SILK
LOD2 Webinar Series: SILKLOD2 Webinar Series: SILK
LOD2 Webinar Series: SILK
 

Similar to LOD2 Webinar Series: Virtuoso 7

Hybrid Data Pipeline for SQL and REST
Hybrid Data Pipeline for SQL and RESTHybrid Data Pipeline for SQL and REST
Hybrid Data Pipeline for SQL and RESTSumit Sarkar
 
Dublinked tech workshop_15_dec2011
Dublinked tech workshop_15_dec2011Dublinked tech workshop_15_dec2011
Dublinked tech workshop_15_dec2011Dublinked .
 
Extending the Value of Content in Enterprise Systems with Web Content Management
Extending the Value of Content in Enterprise Systems with Web Content ManagementExtending the Value of Content in Enterprise Systems with Web Content Management
Extending the Value of Content in Enterprise Systems with Web Content ManagementScott Abel
 
OOW13: Next Generation Optimized Directory (CON9024)
OOW13: Next Generation Optimized Directory (CON9024)OOW13: Next Generation Optimized Directory (CON9024)
OOW13: Next Generation Optimized Directory (CON9024)GregOracle
 
All Things Open SDN, NFV and Open Daylight
All Things Open SDN, NFV and Open Daylight All Things Open SDN, NFV and Open Daylight
All Things Open SDN, NFV and Open Daylight Mark Hinkle
 
IP PROJECT E-GOVERNMENTAL HELPLINE
IP PROJECT E-GOVERNMENTAL HELPLINEIP PROJECT E-GOVERNMENTAL HELPLINE
IP PROJECT E-GOVERNMENTAL HELPLINEMathesh T
 
Modern Data Management for Federal Modernization
Modern Data Management for Federal ModernizationModern Data Management for Federal Modernization
Modern Data Management for Federal ModernizationDenodo
 
Putting the L in front: from Open Data to Linked Open Data
Putting the L in front: from Open Data to Linked Open DataPutting the L in front: from Open Data to Linked Open Data
Putting the L in front: from Open Data to Linked Open DataMartin Kaltenböck
 
IBM Private Cloud Platform - Setting Foundation for Hybrid (JUKE, 2015)
IBM Private Cloud Platform - Setting Foundation for Hybrid (JUKE, 2015)IBM Private Cloud Platform - Setting Foundation for Hybrid (JUKE, 2015)
IBM Private Cloud Platform - Setting Foundation for Hybrid (JUKE, 2015)Denny Muktar
 
How We Do DevOps at Walmart: OneOps OSS Application Lifecycle Management Plat...
How We Do DevOps at Walmart: OneOps OSS Application Lifecycle Management Plat...How We Do DevOps at Walmart: OneOps OSS Application Lifecycle Management Plat...
How We Do DevOps at Walmart: OneOps OSS Application Lifecycle Management Plat...WalmartLabs
 
Präsentation share point
Präsentation share pointPräsentation share point
Präsentation share pointcoda-efurt
 
Sharepoint Architecture
Sharepoint Architecture Sharepoint Architecture
Sharepoint Architecture arun kumar
 
Interior Designs
Interior DesignsInterior Designs
Interior Designsarun kumar
 
Microsoft PPT_Sharepoint_introduction
Microsoft PPT_Sharepoint_introductionMicrosoft PPT_Sharepoint_introduction
Microsoft PPT_Sharepoint_introductionDipti Bohra
 
Pistoia Alliance USA Conference 2016
Pistoia Alliance USA Conference 2016Pistoia Alliance USA Conference 2016
Pistoia Alliance USA Conference 2016Pistoia Alliance
 
Delivering Network Innovation with SDN - Tom Nadeau
Delivering Network Innovation with SDN - Tom Nadeau Delivering Network Innovation with SDN - Tom Nadeau
Delivering Network Innovation with SDN - Tom Nadeau scoopnewsgroup
 
Con9024 next generation optimized directory - oracle unified directory - final
Con9024 next generation optimized directory - oracle unified directory - finalCon9024 next generation optimized directory - oracle unified directory - final
Con9024 next generation optimized directory - oracle unified directory - finalOracleIDM
 
Cloud computing Introductory Session
Cloud computing Introductory SessionCloud computing Introductory Session
Cloud computing Introductory SessionAbhinav Parmar
 
Driving Network and Marketing Investments at O2 by Focusing on Improving the ...
Driving Network and Marketing Investments at O2 by Focusing on Improving the ...Driving Network and Marketing Investments at O2 by Focusing on Improving the ...
Driving Network and Marketing Investments at O2 by Focusing on Improving the ...DataWorks Summit
 

Similar to LOD2 Webinar Series: Virtuoso 7 (20)

Hybrid Data Pipeline for SQL and REST
Hybrid Data Pipeline for SQL and RESTHybrid Data Pipeline for SQL and REST
Hybrid Data Pipeline for SQL and REST
 
Dublinked tech workshop_15_dec2011
Dublinked tech workshop_15_dec2011Dublinked tech workshop_15_dec2011
Dublinked tech workshop_15_dec2011
 
Extending the Value of Content in Enterprise Systems with Web Content Management
Extending the Value of Content in Enterprise Systems with Web Content ManagementExtending the Value of Content in Enterprise Systems with Web Content Management
Extending the Value of Content in Enterprise Systems with Web Content Management
 
OOW13: Next Generation Optimized Directory (CON9024)
OOW13: Next Generation Optimized Directory (CON9024)OOW13: Next Generation Optimized Directory (CON9024)
OOW13: Next Generation Optimized Directory (CON9024)
 
All Things Open SDN, NFV and Open Daylight
All Things Open SDN, NFV and Open Daylight All Things Open SDN, NFV and Open Daylight
All Things Open SDN, NFV and Open Daylight
 
IP PROJECT E-GOVERNMENTAL HELPLINE
IP PROJECT E-GOVERNMENTAL HELPLINEIP PROJECT E-GOVERNMENTAL HELPLINE
IP PROJECT E-GOVERNMENTAL HELPLINE
 
Modern Data Management for Federal Modernization
Modern Data Management for Federal ModernizationModern Data Management for Federal Modernization
Modern Data Management for Federal Modernization
 
Putting the L in front: from Open Data to Linked Open Data
LOD2 Webinar Series: Virtuoso 7

  • 1. LOD2 Webinar . 29.11.2011 . Page 1 http://lod2.eu Creating Knowledge out of Interlinked Data
  • 2. http://lod2.eu LOD2 is a large-scale integrating project co-funded by the European Commission within the FP7 Information and Communication Technologies Work Programme. This 4-year project comprises leading Linked Open Data technology researchers, companies, and service providers. Coming from across 12 countries the partners are coordinated by the Agile Knowledge Engineering and Semantic Web Research Group at the University of Leipzig, Germany. LOD2 will integrate and syndicate Linked Data with existing large-scale applications. The project shows the benefits in the scenarios of Media and Publishing, Corporate Data intranets and eGovernment.
  • 3. http://lod2.eu Once per month, the LOD2 webinar series offers a free webinar about tools and services along the Linked Open Data Life Cycle. Stay with us and learn more about acquisition, editing, composing, connected applications, and finally publishing Linked Open Data.
  • 4. © 2012 OpenLink Software, All rights reserved. Virtuoso 7.0 Enabling Massively Scalable Big Data Analytics for RDF & SQL Data Management By Orri Erling, Virtuoso Program Manager & Hugh Williams, Professional Services Manager Making Technology Work For You
  • 5. Company Overview
  • 6. OpenLink Company Overview
      • OpenLink Software is a privately held company founded in 1992 by its President & CEO, Kingsley Idehen. The company is an industry-acclaimed technology innovator in the following areas:
          • ODBC, JDBC, ADO.NET, and OLE-DB compliant Data Access Drivers for Oracle, SQL Server, Informix, Ingres, Sybase, Progress, MySQL, and PostgreSQL
          • High-Performance & Scalable Multi-Model (Relational & Graph) Database Technology
          • Data Integration Middleware (Data Virtualization Technology across a wide variety of Protocols & Formats)
          • Web Application Server Technology
          • Linked Data Deployment & Management
          • Socially-enhanced Distributed Collaborative Applications Platforms (Weblogs, Wikis, Feed Aggregation and Syndication, Web File Systems, Discussion Forums, etc.)
          • Identity Management
  • 7. Products & Services: Software Products
      • OpenLink Universal Data Access Drivers (UDA): high-performance data access drivers for ODBC, JDBC, ADO.NET, and OLE DB that provide transparent access to enterprise databases.
      • OpenLink Virtuoso: available in single server and cluster editions that are deployed in cloud and/or enterprise modes.
      • OpenLink Data Spaces Platform and Applications
      • OpenLink Ajax Toolkit
      • OpenLink Data Explorer
      • An Open Source Data Access SDK for ODBC
      All OpenLink products are delivered by download from the Internet (http, ftp, etc.). Temporary licenses are issued upon download and may be extended as needed, on a case-by-case basis. Permanent licenses are issued once payment is received.
  • 8. Products & Services: Professional and Support Services
      • OpenLink Product Support provides front-line email and phone support, web-based online support, and a variety of premium services such as phone, emergency, and onsite support.
      • Our Support staff consists of individuals with extensive knowledge of data access, data migration, database administration, programming APIs, and other relevant skills.
      • Services are sold in either Standard "Bronze" or Premium "Platinum" Support packages, with varying hours of availability, response times, etc.
      • We also offer Custom Development, Training, and other Consultancy services. These services can be offered on- or off-site. Expenses for travel, accommodations, food, etc., associated with on-site services are charged separately.
  • 9. Customers
      OpenLink's installed base is in excess of 10,000 customers worldwide. Examples include: Data.Gov (U.S. Govt. Open Linked Data initiative), Verizon, Raytheon, Bank of America, CGI Federal, Elsevier, French National Library, Globo, Scottish Government, St Jude's Medical, Barclays Bank, Wells Fargo, and many more.
  • 10. Office Locations
      • USA: OpenLink Software, Inc, 10 Burlington Mall Road, Suite 265, Burlington, MA 01803. Tel.: +1 781 273 0900. Fax: +1 781 229 8030.
      • UK: OpenLink Software Ltd., Airport House, Purley Way, Croydon, Surrey CR0 0XZ. Tel.: +44 (0)20 8681 7701. Fax: +44 (0)20 8681 7702.
  • 11. Virtuoso Universal Server Overview
  • 12. Situation Analysis
      Data is growing exponentially along the following dimensions: Volume, Velocity, and Variety. All of this happens while a day still has only 24 hours.
  • 13. Product Value Proposition
      Enterprise and Individual Agility via Data Access, Integration, and Management, without compromising performance, scalability, security, and platform independence.
      "Virtuoso locks you into an experience (openness, performance, and scale) not the platform itself." -- Kingsley Idehen, Founder & CEO, OpenLink Software
  • 14. Product Architecture
      A high-performance, scalable, secure, and operating-system-independent server designed to handle contemporary challenges associated with standards-compliant data access, data integration, and data management.
  • 15. Data Virtualization Middleware
      An in-built middleware layer (“Sponger”) for creating Transient & Persistent Views over Heterogeneous Data Sources.
  • 16. Sophisticated Content Crawler
      A DBMS-hosted Content Crawler that leverages a loosely coupled binding to the Sponger Middleware component for transformation of unstructured and semi-structured data into Linked Data.
  • 17. Core Platform behind LOD Cloud
      Core Platform (Graph DBMS and Linked Data Deployment) behind DBpedia, many bubbles in the LOD Cloud, and the LOD Cloud cache itself.
  • 18. Virtuoso Linked Data projects
      • DBpedia: public SPARQL endpoint over the DBpedia data (and international Chapters)
      • LOD Cloud Cache: public server hosting LOD cloud datasets
      • URIBurner: Linked Data generation & transformation service
      • Linked Geo Data: OpenStreetMap Spatial data as Linked Data
      • Sindice: SPARQL endpoint behind its Semantic Web Index
      • Data.gov: US Government Linked Data
      • Health.data.gov: Clinical Quality Linked Data on health.data.gov
      • Seevl: Linked Data music discovery service
      • Bio2RDF: Life science data mapped to Linked Data
      • Neurocommons: Life science data mapped to Linked Data
      • Musicbrainz: MusicBrainz database published as Linked Data
      • Open PHACTS: DBpedia-like Linked Data Space for Pharma
      • Others: many others
  • 19. Powerful Standards Support
      ODBC compliance enables use of client applications (e.g. Microsoft Access) as front-ends for Virtuoso, 3rd party RDBMS engines, and the World Wide Web hosted Linked Open Data Cloud.
  • 20. Powerful Standards Support Cont’d
      ODBC & HTML5 compliance enables development of rich client apps that leverage the WebDB-ODBC bridge for accessing data across Virtuoso, 3rd party RDBMS engines, and the World Wide Web hosted Linked Open Data Cloud.
  • 21. Insight Discovery & Exploration
      Native Faceted Browsing that enables multi-dimensional drill-downs via any browser.
  • 22. Insight Discovery & Exploration
      Microsoft Silverlight or HTML5 based PivotViewer Front-End for SPARQL and SPARQL-FED Queries.
  • 23. Powerful SPARQL Query Service
      Basic SPARQL Endpoint for Creating Query Definitions & Sharing Query Results. Example: health.data.gov data directly from a Web Browser.
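Because a SPARQL endpoint is addressed over plain HTTP, a query can be shared as an ordinary URL. The following is a minimal sketch of how such a URL might be built. The endpoint address is the LOD cache endpoint listed on the LOD Cache slide; the sample query, the helper name `build_query_url`, and the use of a `format` parameter to request JSON results are illustrative assumptions, not something shown in the slides.

```python
from urllib.parse import urlencode

# Endpoint taken from the LOD Cache slide of this deck.
ENDPOINT = "http://lod.openlinksw.com/sparql"

def build_query_url(query: str,
                    result_format: str = "application/sparql-results+json") -> str:
    """Return a shareable GET URL carrying a SPARQL query.

    'query' is the standard SPARQL protocol parameter; 'format' is assumed
    here as the way to ask the endpoint for a specific result serialization.
    """
    params = {"query": query, "format": result_format}
    return f"{ENDPOINT}?{urlencode(params)}"

# Hypothetical query: list ten classes that have at least one instance.
QUERY = "SELECT DISTINCT ?type WHERE { ?s a ?type } LIMIT 10"

url = build_query_url(QUERY)
print(url)
```

Pasting the printed URL into a browser, or fetching it with any HTTP client, would run the query against the endpoint, which is what makes query definitions easy to share.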
  • 24. Powerful SPARQL Query Builder
      Use Query By Example (QBE) Patterns to Construct & Share Query Results.
  • 25. How Do I Get Going?
      • Download, install, and experience the power of coherent integration of disparate data sources, data access protocols, and data representation formats.
      • In a nutshell, commence exploitation of powerful business intelligence, socially enhanced collaboration, data virtualization, and entity analytics without writing a line of code!
      • Turn "Big Data" into exploitable "Smart Data" without compromise!
      • Will be integrated into the next release of the LOD2 Stack
  • 26. Virtuoso 7.0
  • 27. Flexible Big Data Challenge
      • Data Agility is challenged by Volume, Velocity, and Variety
      • “Schema Last” is great, if the price is right
      • RDF and graphs promise powerful querying with the flexibility and scale of NoSQL key-value stores
      • Inference may be good for integration, if it can express the right things, beyond OWL
      • RDF data management technology must learn the lessons of SQL RDBMS; everything applies
  • 28. Virtuoso 7.0 Mission Statement
      Destruction of the following items as impediments to Big (Open) Linked Data exploitation:
      • Performance
      • Scalability
      • Platform Independence
      • Security & Privacy
      • Price
  • 29. Virtuoso 7.0 & Big Data Myths
      Myths put to rest:
      • Scalable Open Ended SPARQL Endpoints
      • Scalable Open Ended Read-Write SPARQL Endpoints
      • Fine-grained Access Controls underlying Read-Only or Read-Write endpoints
  • 30. Virtuoso Column Store Features
      • Supports SQL and SPARQL query languages
      • Compact column-wise storage
      • Vectored execution of commands
      • Shared-nothing scale-out for clusters
      • Powerful procedure language with parallel, distributed control structures
      • Full-text and geospatial indexes
  • 31. Storage Engine
      • Freely mix column-wise and row-wise indices
      • All SQL and RDF data types natively supported, with a single execution engine for SQL/SPARQL
      • Column compression is 3x more space efficient than row-wise compression for RDF
      • Column stores are not only for big scans: random access outperforms row-wise storage as soon as there is some locality
      • 9 B/quad with DBpedia, 7 B/quad with BSBM or RDF-H, 14 B/quad with web crawls (PSOG, POSG, SP, OP, GS, excluding literals)
  • 32. Execution Engine
      • Vectoring is not only for column stores
      • Vectoring turns a random access into a linear merge join if there is any locality: always a win, though mileage depends on run time factors
      • Vectoring eliminates interpretation overhead and makes CPU-friendly code possible
      • Even with run time data typing, vectoring allows use of type-specific operators on homogeneous data, e.g. arithmetic
      • Dynamically adjust vector size: a larger vector may not fit in cache but will get better locality for random access
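The claim that vectoring turns random access into a linear merge join can be illustrated with a toy example. This is only a sketch of the general idea, not Virtuoso's implementation: the index is modeled as a sorted Python list of `(key, value)` pairs, and the function name `vectored_lookup` is hypothetical. Instead of probing the index once per key, the whole batch of keys is sorted and resolved with a single cursor that only moves forward.

```python
def vectored_lookup(sorted_index, keys):
    """Resolve a batch (vector) of keys in one forward pass over a sorted index.

    sorted_index: list of (key, value) pairs ordered by key.
    keys: lookup keys in arbitrary order, possibly with duplicates.
    Returns a dict holding only the keys that were found.
    """
    results = {}
    pos = 0  # single cursor into the index; it never moves backwards
    for key in sorted(set(keys)):
        # Skip index entries smaller than the current key.
        while pos < len(sorted_index) and sorted_index[pos][0] < key:
            pos += 1
        if pos < len(sorted_index) and sorted_index[pos][0] == key:
            results[key] = sorted_index[pos][1]
    return results

# Toy index with keys 0, 5, 10, ..., 95.
index = [(k, f"row-{k}") for k in range(0, 100, 5)]
batch = [40, 10, 45, 10, 7]
print(vectored_lookup(index, batch))
```

When the keys in a batch fall close together in the index, the cursor advances only a few positions between matches, which is the locality effect the slide refers to. A larger batch raises the chance of such neighbors, at the cost of a larger working set.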
  • 33. Graph operations
      • Run time computation plus caching instead of materialization
      • SPARQL/SQL extension for arbitrary transitive subqueries:
          • Flexible options for returning shortest paths, all paths, all / distinct reachable, attributes of steps on paths etc.
          • Efficient execution, searching the graph from both ends if looking for a path with ends given
      • Query operators for RDF hierarchy traversal
      • Special query operator for OWL sameAs and IFP based identity
      • Taking OWL sameAs / IFP identity into account for DISTINCT / GROUP BY
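Searching the graph from both ends when both endpoints of a path are given is commonly done with a bidirectional breadth-first search. The sketch below shows that general technique in Python under stated assumptions: the graph is an undirected adjacency dictionary held in memory, and the function name `bidirectional_path` is hypothetical. It does not reproduce Virtuoso's transitive subquery operator.

```python
def bidirectional_path(graph, start, goal):
    """Return one shortest path between start and goal, or None.

    graph: dict mapping each node to a list of neighbors (assumed symmetric).
    The search grows one full BFS level at a time from whichever end
    currently has the smaller frontier and stops when the two sides meet.
    """
    if start == goal:
        return [start]
    # Parent maps double as the visited set of each search direction.
    parents_fwd, parents_bwd = {start: None}, {goal: None}
    frontier_fwd, frontier_bwd = [start], [goal]

    def walk(parents, node):
        # Follow parent links from node back to the root of that search.
        path = []
        while node is not None:
            path.append(node)
            node = parents[node]
        return path

    while frontier_fwd and frontier_bwd:
        forward = len(frontier_fwd) <= len(frontier_bwd)
        frontier = frontier_fwd if forward else frontier_bwd
        parents = parents_fwd if forward else parents_bwd
        other = parents_bwd if forward else parents_fwd
        next_frontier = []
        for node in frontier:
            for neighbor in graph.get(node, ()):
                if neighbor in parents:
                    continue
                parents[neighbor] = node
                if neighbor in other:
                    # The two searches met: stitch both halves together.
                    left = walk(parents_fwd, neighbor)[::-1]
                    right = walk(parents_bwd, neighbor)[1:]
                    return left + right
                next_frontier.append(neighbor)
        if forward:
            frontier_fwd = next_frontier
        else:
            frontier_bwd = next_frontier
    return None

graph = {
    "a": ["b", "e"], "b": ["a", "c"], "c": ["b", "d"],
    "d": ["c", "e"], "e": ["a", "d"], "z": [],
}
print(bidirectional_path(graph, "a", "d"))
```

Each side only has to explore about half the path length, so on graphs with high fanout far fewer nodes are visited than in a search from one end.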
  • 34. Query Optimization Challenges
      • Typical SQL stats do not help
      • Need to measure data cardinalities starting from constants in the query
      • Need to sample fanout predicate by predicate, as needed
      • Predicate and class hierarchies are easy to handle in sampling
      • sameAs or IFP inference voids all guesses
      • Is hash join worthwhile? High setup cost means that one must be sure of cardinalities first
  • 35. Deep Sampling
      • Everything is a join, so sampling must also do joins
      • As the candidate plan grows, the cost model executes all the ops on a sample of the data
      • Actual cardinality and locality are known, also when search conditions are correlated
      • Having high confidence in the cost model, hash join plans become safe and attractive
      • Even though there is an indexed access path for all, a scan can be better because it produces results in order. Need to be sure of selectivity before taking the risk
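The idea of executing a join on a sample to learn its real cardinality can be sketched in a few lines. This is a simplified illustration, not Virtuoso's cost model: the function name `estimate_join_cardinality`, the in-memory dictionary standing in for an index, and uniform random sampling are all assumptions made for the example.

```python
import random

def estimate_join_cardinality(left_keys, right_index, sample_size, seed=0):
    """Estimate the number of rows produced by joining left_keys to right_index.

    left_keys: join-key values from the left input, one per row.
    right_index: dict mapping a join key to the list of matching right rows.
    The join is actually executed for a random sample of the left rows and
    the observed match count is scaled up to the full input.
    """
    n = len(left_keys)
    if n == 0:
        return 0.0
    rng = random.Random(seed)
    sample = left_keys if n <= sample_size else rng.sample(left_keys, sample_size)
    matches = sum(len(right_index.get(key, ())) for key in sample)
    return matches * n / len(sample)

left = [1, 1, 2, 3]
right = {1: ["a", "b"], 2: ["c"]}
# With a sample at least as large as the input, the estimate is exact.
print(estimate_join_cardinality(left, right, sample_size=10))
```

Because the estimate comes from running the join rather than from per-column statistics, it reflects correlation between conditions, which is the property the slide highlights.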
  • 36. Elastic Cluster
      • Data is partitioned by key, different indices may have different partition keys
      • Partitions may split and migrate between servers
      • Partitions may be kept in duplicate for fault tolerance/load balancing
      • Actual access stats drive partition split and placement
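Partitioning by key, with a different partition key per index, can be pictured as follows. This is a hypothetical sketch: the index names come from the Storage Engine slide, but the choice of which column each index is partitioned on, the CRC32 hash, and the helper names are assumptions for illustration only.

```python
import zlib

def partition_of(key: str, n_partitions: int) -> int:
    """Map a key to a partition number with a stable hash."""
    return zlib.crc32(key.encode("utf-8")) % n_partitions

# Assumed partitioning column per index (not taken from the slides).
PARTITION_COLUMN = {"PSOG": "s", "POSG": "o"}

def place(quad: dict, n_partitions: int) -> dict:
    """Return, for each index, the partition that stores this quad's entry."""
    return {
        index: partition_of(quad[column], n_partitions)
        for index, column in PARTITION_COLUMN.items()
    }

quad = {
    "s": "http://example.org/alice",
    "p": "http://xmlns.com/foaf/0.1/knows",
    "o": "http://example.org/bob",
    "g": "http://example.org/graph",
}
print(place(quad, n_partitions=8))
```

Since placement depends only on the key, any node can compute where an index entry lives. Splitting or migrating a partition then amounts to changing the mapping from partition number to server.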
  • 37. Optimizing for Cluster
      • Vectored execution is natural in a cluster since single-tuple messages are not an option
      • Keep max ops in flight at all times, always send long messages
      • Fully distributed query coordination:
          • Any node can service a client request. Correlated subqueries, stored procedures may execute anywhere, arbitrary parallelism and recursion between partitions
          • On single shared memory box, cluster is approximately even with single process multithreading, low overhead
          • 1.8x more throughput in BSBM BI when going from 1 to 2 machines
          • Distributed stored procedures, send the proc to the data, as in map-reduce, except that there are no limits on cross partition calling/recursion
          • Choice of transactional and auto-commit update semantics, can have atomic ops without global transaction
  • 38. Cluster Architecture Diagrams
  • 39. LOD Cache
      • 55 billion triples in LOD cache, only 384 GB of RAM, 2TB disk
      • 2 x 384 GB of RAM, 4TB SSD
      • Most of Linked Open Data and Web Crawls
      • http://lod.openlinksw.com
      • http://lod.openlinksw.com/sparql
  • 40. Independent Benchmark Report from CWI: Berlin SPARQL Benchmark
      #Triples    | Source File Size | Compressed Source File Size | Source Data Files Per Loader Node | Final Database File Size | Load Time
      50 Billion  | 2.8 TB           | 240 GB                      | 30 GB                             | 1.8 TB                   | 10h 54s
      150 Billion | 8.5 TB           | 728 GB                      | 91 GB                             | 5.6 TB                   | n/a
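The load figures above can be turned into a per-triple storage cost with simple arithmetic. The sketch below assumes decimal terabytes (1 TB = 10^12 bytes), which the slide does not specify, and uses a hypothetical helper name. These on-disk figures cover the whole database file, so they are not directly comparable to the per-quad index sizes on the Storage Engine slide, which exclude literals.

```python
TB = 10**12  # assumption: decimal terabytes

# (triples, source size in bytes, final database size in bytes) from the table.
RUNS = [
    (50e9, 2.8 * TB, 1.8 * TB),
    (150e9, 8.5 * TB, 5.6 * TB),
]

def bytes_per_triple(triples: float, db_bytes: float) -> float:
    """Average on-disk bytes used per loaded triple."""
    return db_bytes / triples

for triples, source_bytes, db_bytes in RUNS:
    print(
        f"{triples:.0e} triples: "
        f"{bytes_per_triple(triples, db_bytes):.1f} B/triple on disk, "
        f"database is {db_bytes / source_bytes:.0%} of the raw source size"
    )
```

Under that assumption both runs come out at roughly 36 to 37 bytes per triple, with the database occupying about two thirds of the uncompressed source size.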
  • 41. Store Comparisons Summary: Exploration oriented queries (QMpH), Berlin SPARQL Benchmark
      Dataset sizes: 100 Million Triples | 200 Million Triples | 1 Billion Triples
      Virtuoso 6: 37,678.319 | 32,969.006 | 8,984.789
      Virtuoso 7: 47,178.820 | 27,933.682
  • 42. Store Comparisons Summary: Business Intelligence oriented queries (QMpH), Berlin SPARQL Benchmark
      Dataset sizes: 10 Million Triples | 100 Million Triples | 1 Billion Triples
      Virtuoso 6: 431.465 | 35.342 | 2.383
      Virtuoso 7: 996.795 | 75.236
  • 43. Store Comparisons Summary: Exploration oriented queries (Cluster Edition) (QMpH), Berlin SPARQL Benchmark
      Dataset sizes: 10 Billion Triples | 50 Billion Triples | 150 Billion Triples
      Virtuoso 7: 2,360.210 | 4,253.157 | 2,090.574
  • 44. Store Comparisons Summary: Business Intelligence oriented queries (Cluster Edition) (QMpH), Berlin SPARQL Benchmark
      Dataset sizes: 10 Billion Triples | 50 Billion Triples | 150 Billion Triples
      Virtuoso 7: 13.078 | 0.964 | 0.285
  • 45. Future Work
      • Complete deep sampling: enhanced query optimization plans
      • Run TPC-H and TPC-DS in SQL and their 1:1 translation in SPARQL, demonstrating SPARQL performance as near to SQL as possible
  • 46. Additional Information
      • OpenLink Software
          • OpenLink Software: www.openlinksw.com
          • OpenLink Virtuoso: virtuoso.openlinksw.com
          • Universal Data Access: uda.openlinksw.com
      • Social Media Data spaces
          • http://virtuoso.openlinksw.com/blog/ (weblog)
          • https://plus.google.com/112399767740508618350/posts (Google+)
          • https://twitter.com/OpenLink (Twitter)
          • http://www.linkedin.com/company/openlink-software (LinkedIn)
          • Hashtag: #LinkedData (Anywhere)
  • 47. EU-FP7 LOD2 WP6 – 25.-26.03.2013. Page 47 http://lod2.eu Creating Knowledge out of Interlinked Data LOD2 Stack Usability Survey 2013 w.surveygizmo.com/s3/1188229/LOD2-Stack-Usability-Survey-2013