This webinar in the course of the LOD2 webinar series will present Zemanta and its LODRefine, a LOD-enabled version of OpenRefine (previously Google Refine) that is part of the LOD2 stack. LODRefine extends the cleansing and linking functionality of OpenRefine by providing means to reconcile and augment your data with DBpedia or any other SPARQL endpoint, extract named entities using the Zemanta API, export data in one of the RDF formats, and, more recently, exploit available crowdsourcing services. In the webinar we will walk through several tasks that demonstrate the ease of use and versatility of LODRefine.
If you are interested in Linked (Open) Data principles and mechanisms, LOD tools & services and concrete use cases that can be realised using LOD then join us in the free LOD2 webinar series: http://lod2.eu/BlogPost/webinar-series
This webinar in the course of the LOD2 webinar series will present use cases and live demos of D2R (Free University Berlin) and Sparqlify (University of Leipzig).
D2R Server is a tool for publishing relational databases on the Semantic Web. It enables RDF and HTML browsers to navigate the content of the database, and allows applications to query the database using the SPARQL query language.
Sparqlify is a tool enabling one to define expressive RDF views on relational databases and query them with a subset of the SPARQL query language. By featuring a novel RDF view definition syntax, it aims at simplifying the RDB-RDF mapping process.
more to be found at:
UnifiedViews is a joint project currently maintained by Semantic Web Company (SWC) and Semantica.cz. It has been mainly developed by Charles University in Prague as a student project called ODCleanStore (version 2). It is based on the experience SWC obtained with the LOD Management Suite (LODMS) used in WP7 and on ODCleanStore (version 1), developed by Charles University in Prague for the WP9a use case of the LOD2 FP7 project. In the next release of the LOD2 stack, UnifiedViews will replace LODMS as the ETL tool, and it has already been adopted in other projects.
In the webinar we will give a brief overview of the UnifiedViews project (Helmut Nagy). The main part will be a presentation of the tool and its capabilities (Tomas Knap).
http://lod2.eu/BlogPost/webinar-series
This webinar in the course of the LOD2 webinar series will present release 3.0 of the LOD2 stack, which contains updates to:
*) Virtuoso 7 [Openlink]: the original row store of the Virtuoso 6 universal server has been replaced by a column store, which significantly increases the performance of SPARQL queries; the store is now up to three times as fast as the previous major version.
*) Linked Open Data Manager Suite [SWC]: the 'lodms' application allows the user to quickly set up pipelines for transforming linked data through the use of its many extensions. It also allows operations for extracting RDF from other types of data.
*) dbpedia-spotlight-ui [ULEI]: a graphical user interface component that allows the user to use a remote DBpedia spotlight instance to annotate a text with DBpedia concepts.
*) sparqlify [ULEI]: a scalable SPARQL-SQL rewriter, allowing you to query an SQL database as if it were a triple store.
*) SIREn [DERI]: a Lucene plugin that allows you to efficiently index and query RDF, as well as any textual document with an arbitrary amount of metadata fields.
*) CubeViz [ULEI]: CubeViz allows visualization of the Data Cube linked data representation of statistical data. It has support for the more advanced DataCube features, such as slices. It also allows the selection of a remote SPARQL endpoint and export of a modified cube.
*) R2R [UMA]: the R2R mapping API is now included directly into the lod2 demonstrator application, allowing users to experience the full effect of the R2R semantic mapping language through a graphical user interface.
*) ontowiki-csvimport [ULEI]: an OntoWiki extension that transforms CSV files to RDF. The extension can create Data Cubes that can be visualized by CubeViz.
If you are interested in Linked (Open) Data principles and mechanisms, LOD tools & services and concrete use cases that can be realised using LOD then join us in the free LOD2 webinar series!
In this webinar Lorenz Bühmann presents the ontology repair and enrichment tool ORE and the DL-Learner, a machine learning tool for solving supervised learning tasks and supporting knowledge engineers in constructing knowledge. These two closely related tools in the LOD2 Stack are used for classification and the subsequent quality analysis of Linked Data.
(http://lod2.eu/BlogPost/webinar-series) In this webinar Michael Martin presents CubeViz, a faceted browser for statistical data that uses the RDF Data Cube vocabulary, the state of the art in representing statistical data in RDF. This vocabulary is compatible with SDMX and is increasingly being adopted. Based on the vocabulary and the encoded Data Cube, CubeViz generates a faceted browsing widget that can be used to interactively filter the observations to be visualized in charts. Based on the selected structure, CubeViz offers suitable chart types and options from which users can choose.
If you are interested in Linked (Open) Data principles and mechanisms, LOD tools & services and concrete use cases that can be realised using LOD then join us in the free LOD2 webinar series!
Slides of the presentation by Robert Isele of Free University of Berlin, Germany in the course of the LOD2 webinar: SILK on 21.02.2012 - for more information please see: http://lod2.eu/BlogPost/webinar-series
LOD2 plenary meeting in Paris: presentation of WP5: State of Play: Linked Data Visualization, Browsing and Authoring, by Renaud Delbru (National University of Ireland, Galway).
The W3C Linked Data Platform (LDP) specification describes a set of best practices and a simple approach for a read-write Linked Data architecture, based on HTTP access to web resources that describe their state using the RDF data model. This presentation provides a set of simple examples that illustrate how an LDP client can interact with an LDP server in the context of a read-write Linked Data application, i.e. how to use the LDP protocol for retrieving, updating, creating and deleting Linked Data resources.
This paper proposes a mapping of the Linked Data Platform (LDP) specification to the Constrained Application Protocol (CoAP). The main motivation stems from the fact that the LDP W3C Recommendation presents resource management primitives for HTTP only. A general translation of LDP-HTTP requests and responses is provided, as well as a framework for HTTP-to-CoAP proxying. Experiments have been carried out using the LDP W3C Test Suite.
A set of slides that provides a high-level overview of the W3C Linked Data Platform specification presented at the 4th Linked Data in Architecture and Construction Workshop.
For a more detailed and technical version of the presentation, please refer to
http://www.slideshare.net/nandana/learning-w3c-linked-data-platform-with-examples
LDAC 2016 programme
http://smartcity.linkeddata.es/LDAC2016/#programme
This presentation gives a brief overview on achievements and challenges of the Data Web and describes different aspects of using the Semantic Data Wiki OntoWiki for Linked Data management.
How to Access Your Library Book Collections Using Solr | lucenerevolution
Presented by Engy Ali | The Library of Alexandria. See conference video: http://www.lucidimagination.com/devzone/events/conferences/lucene-revolution-2012
Do you have a large collection of text content that you want to search? Facing challenges on how to facet after performing a full text search across metadata and content? Do you want to use Solr with personalization? Bibliotheca Alexandrina provides public access to digitized book collections that exceed 220,000 books, through a web-based search and browsing facility. The facility is completely built on Solr in five different languages. The website provides full text morphological search within the books’ metadata and content with result highlighting. Different personalization features like annotation tools and tagging are also implemented using Solr. This presentation will cover how Bibliotheca Alexandrina uses Solr to implement full text indexing and searching across the entire collection, faceting, search within the content of a book and result highlighting and techniques used for personalization.
This presentation is based on the second chapter of my textbook Fundamentals of Web Development. The book is published by Addison-Wesley. It can be purchased via http://www.amazon.com/Fundamentals-Web-Development-Randy-Connolly/dp/0133407152.
This book is intended to be used as a textbook on web development suitable for intermediate to upper-level computing students. It may also be of interest to a non-student reader wanting a single book that encompasses the entire breadth of contemporary web development.
This book will be the first in what will hopefully be a textbook series. Each book in the series will have the same topics and coverage but each will use a different web development environment. The first book in the series will use PHP.
To learn more about the book, visit http://www.funwebdev.com.
Born from the wish to make linking tractable, the Link Discovery Framework for Metric Spaces (LIMES) is tailored towards the time-efficient and lossless discovery of links across knowledge bases. LIMES is an extensible declarative framework that encapsulates manifold algorithms dedicated to the processing of structured data of any sort. Built with extensibility and easy integration in mind, LIMES allows implementing applications that integrate, consume and/or generate Linked Data. Within LOD2, it will be used for discovering links between knowledge bases.
This webinar will be presented by the LOD2 Partner: University of Leipzig (ULEI), Germany.
This webinar in the course of the LOD2 webinar series will present the implications of Linked Open Data and Semantic Web Technologies in the information and publishing industry.
The publishing industry is struggling with too much information on the one hand and too few resources to bring meaning to this information on the other. As an industrial use case partner in LOD2, Wolters Kluwer Deutschland GmbH investigates in detail how LOD and the Semantic Web have the potential to solve this critical issue for its business. The presentation will show which parts of the LOD2 stack are used within the use case and which challenges had to be addressed in the last two years. Interesting future areas like natural language processing will also be mentioned. The topics covered are relevant for any industry that deals with a lot of data and documents, not only publishing.
This series will provide a monthly webinar about Linked (Open) Data tools and services around the LOD2 project, the LOD2 Stack and the Linked Open Data Life Cycle, also in the form of 3rd party tools. Please find continuously updated information here: http://lod2.eu/BlogPost/webinar-series
Redefining the ways in which we understand and use business information, even reshaping the boundaries of what constitutes "business information" today. What this means is Exalead CloudView breaks new ground in the efficacy with which it can hide the scale, complexity and obscurity of today's information landscape in order to deliver to real people in real time exactly the information they need in their unique context. To understand the evolutions in Exalead CloudView behind these new capabilities, and by extension to better understand the broader revolution underway in the world of data management, we invite you to join Nigel MOTTE & Amar-Djalil MEZAOUR for an informative look under the hood of Exalead CloudView.
Slides of the presentation by Hugh Williams of OpenLink Software in the course of the LOD2 webinar: Virtuoso Universal Server on 20.12.2011 - for more information please see: http://lod2.eu/BlogPost/webinar-series
DBpedia Spotlight is a tool employed in the Extraction stage of the LOD Life Cycle, performing Entity Recognition and Linking. Although the tool currently specializes in the English language, support for other languages is being tested, and demos for German, Dutch and others are available or underway. The tool can be used to enable faceted browsing and semantic search, among other applications. In this webinar we will describe what DBpedia Spotlight is, how it works and how you can benefit from it in your application.
If you are interested in Linked (Open) Data principles and mechanisms, LOD tools & services and concrete use cases that can be realised using LOD then join us in the free LOD2 webinar series!
http://lod2.eu/BlogPost/webinar-series
Introduction to the Data Web, DBpedia and the Life-cycle of Linked Data | Sören Auer
Over the past 4 years, the Semantic Web activity has gained momentum with the widespread publishing of structured data as RDF. The Linked Data paradigm has therefore evolved from a practical research idea into a very promising candidate for addressing one of the biggest challenges of computer science: the exploitation of the Web as a platform for data and information integration. To translate this initial success into a world-scale reality, a number of research challenges need to be addressed: the performance gap between relational and RDF data management has to be closed, coherence and quality of data published on the Web have to be improved, provenance and trust on the Linked Data Web must be established and generally the entrance barrier for data publishers and users has to be lowered. This tutorial will discuss approaches for tackling these challenges. As an example of a successful Linked Data project we will present DBpedia, which leverages Wikipedia by extracting structured information and by making this information freely accessible on the Web. The tutorial will also outline some recent advances in DBpedia, such as the mappings Wiki, DBpedia Live as well as the recently launched DBpedia benchmark.
This webinar in the course of the LOD2 webinar series will present Virtuoso 7. Virtuoso Column Store, Adaptive Techniques for RDF Graph Databases. In this webinar we shall discuss the application of column store techniques to both graph (RDF) and relational data for mixed work-loads ranging from lookup to analytics.
Virtuoso is an innovative enterprise grade multi-model data server for agile enterprises & individuals. It delivers an unrivaled platform agnostic solution for data management, access, and integration. The unique hybrid server architecture of Virtuoso enables it to offer traditionally distinct server functionality within a single product.
If you are interested in Linked (Open) Data principles and mechanisms, LOD tools & services and concrete use cases that can be realised using LOD then join us in the free LOD2 webinar series!
http://lod2.eu/BlogPost/webinar-series
Europeana in a nutshell - Breandán Knowlton | Europeana
A presentation given by Breandán Knowlton of the Europeana project to the Linked Heritage "Aggregation and Semantic Web Workshop" on 22 May 2012 in Stockholm, Sweden.
Outlines the primary structures and goals of the Europeana project.
Freme at feisgiltt 2015 freme & linked data & localisers | Felix Sasaki
This presentation is a complement to
http://slideshare.net/atcfsenzoku/freme-at-feisgiltt-2015-freme-use-cases
It provides more details on processing of linked data in FREME, using NIF and with the FREME services e-Entity and e-Link
Talk at 3rd Keystone Training School - Keyword Search in Big Linked Data - Institute for Software Technology and Interactive Systems, TU Wien, Austria, 2017
Similar to LOD2 Webinar Series: Zemanta / Open refine (20)
PublicData.eu is striving to become the Pan European one-stop-shop, providing access to open, freely reusable datasets from numerous local, regional and national public bodies across Europe.
After the first release of the PublicData.eu website (the Alpha release was in January 2011 and the Beta release in June 2011) and its subsequent upgrades (a significant upgrade was effected in March 2012), OKFN worked towards the deployment of various personalization features, meant to improve the user experience on PublicData.eu and spur more interest and interaction around the official data-sets.
This webinar in the course of the LOD2 webinar series will present use cases and live demos of PoolParty (by Semantic Web Company).
Knowledge organization systems like taxonomies or thesauri can benefit from linked data approaches and vice versa. In recent years SKOS has become very popular in various industries; due to its simplicity, it has turned out to be the entry point to the Semantic Web. Learn more about the possibilities to link your enterprise metadata with the web of data and about PoolParty as a means for linked data management!
If you are interested in Linked (Open) Data principles and mechanisms, LOD tools & services and concrete use cases that can be realised using LOD then join us in the LOD2 webinar series!
http://lod2.eu/BlogPost/webinar-series
State of Play presentation at the LOD2 Plenary Vienna 2012: WP10 - Training, Dissemination, Community Building, Fertilization by Martin Kaltenböck, Semantic Web Company (SWC)
State of Play presentation at the LOD2 Plenary Vienna 2012: WP9A - LOD for a Distributed Marketplace for Public Sector Contracts by Vojtěch Svátek (UEP)
State of Play presentation at the LOD2 Plenary Vienna 2012: WP9 publicdata.eu – Publishing Governmental Information as Linked Data by Irena Irina Bolychevsky, OKFN
State of Play presentation at the LOD2 Plenary Vienna 2012: WP8: Linked Open Data for Enterprise Data Web by Amar-Djalil MEZAOUR, Dassault Systèmes Exalead.
State of Play presentation at the LOD2 Plenary Vienna 2012: WP7 - Linked Open Data for Media and Publishing by Christian Dirschl, Wolters Kluwer Germany
State of Play presentation at the LOD2 Plenary Vienna 2012: WP5 - Linked Data Browsing, Visualization and Authoring Interfaces by Sean Policarpio of DERI / NUIG.
Slides of the presentation by Philipp Frischmuth of University of Leipzig, Germany in the course of the LOD2 webinar: OntoWiki on 24.01.2012 - for more information please see: http://lod2.eu/BlogPost/webinar-series
Slides of Institute Mihajlo Pupin (Serbia) for their partner introduction as a new LOD2 partner in the course of the LOD2 project enlargement - presented at the LOD2 plenary meeting in Leuven, Belgium in September 2011
More from LOD2 Creating Knowledge out of Interlinked Data (14)
LOD2 Plenary Meeting 2011: Institute Mihajlo Pupin – Partner Introduction
LOD2 Webinar Series: Zemanta / Open refine
1. Creating Knowledge out of Interlinked Data
LOD2 Webinar . 29.11.2011 . Page 1 http://lod2.eu
2. Creating Knowledge out of Interlinked Data
LOD2 is a large-scale integrating project co-funded by the European
Commission within the FP7 Information and Communication Technologies
Work Programme. This 4-year project comprises leading Linked Open
Data technology researchers, companies, and service providers. Coming
from across 12 countries the partners are coordinated by the Agile
Knowledge Engineering and Semantic Web Research Group at the
University of Leipzig, Germany.
LOD2 will integrate and syndicate Linked Data with existing large-scale
applications. The project shows the benefits in the scenarios of Media and
Publishing, Corporate Data intranets and eGovernment.
http://lod2.eu
3. Creating Knowledge out of Interlinked Data
Once per month the LOD2 webinar series offer a free webinar about tools and services along the Linked Open Data Life Cycle. Stay with us and learn more about acquisition, editing, composing, connected applications, and finally publishing Linked Open Data.
4. Creating Knowledge out of Interlinked Data
LODRefine – LOD-enabled OpenRefine
The tool for cleansing, linking and augmenting data by Mateja Verlic, Zemanta
5. Creating Knowledge out of Interlinked Data
Company
Zemanta brings useful content to bloggers, and connects authors to their peers and publishers to marketers.
• Content research services
• Content enrichment tools
Our role in LOD2
• Web scale link & text mining from unstructured data
• Tools for cleansing data and crowdsourcing of cleansing
Dr. Mateja Verlič
6. Creating Knowledge out of Interlinked Data
Presentation outline
• Terminology briefing
• Introduction to LODRefine
• The core: OpenRefine
• LOD-friendly extensions
• Demonstration
• Q&A
7. Creating Knowledge out of Interlinked Data
Reconciling
Def: to reconcile
• To reestablish a close relationship between.
• To make compatible or consistent.
(The Free Dictionary)
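In the data-cleansing sense used in this webinar, reconciling means matching the values in a column against entities in a reference dataset. As a rough illustration only, the sketch below matches a cell value against a small table of candidate labels using string similarity. The function name, the threshold and the candidate table are assumptions made for this example; it does not show how LODRefine or OpenRefine actually score candidates.

```python
from difflib import SequenceMatcher


def reconcile(value, candidates, threshold=0.8):
    """Return (uri, score) for the best-matching candidate label, or None.

    `candidates` maps an entity URI to its label. The similarity measure and
    the threshold are illustrative choices, not LODRefine's scoring.
    """
    cleaned = value.strip().lower()
    best = None
    for uri, label in candidates.items():
        score = SequenceMatcher(None, cleaned, label.lower()).ratio()
        if score >= threshold and (best is None or score > best[1]):
            best = (uri, score)
    return best


# Illustrative candidate table; a real service would look these up in DBpedia.
CANDIDATES = {
    "http://dbpedia.org/resource/Ljubljana": "Ljubljana",
    "http://dbpedia.org/resource/Leipzig": "Leipzig",
}

print(reconcile(" ljubljana ", CANDIDATES))
print(reconcile("Vienna", CANDIDATES))
```

A value that matches no label closely enough stays unreconciled (the function returns None), which mirrors the idea that uncertain cells are left for manual review.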
8. Creating Knowledge out of Interlinked Data
Augmenting / extending
Def: to augment
• To make (something already developed or well under way) greater, as in size, extent, or quantity
(The Free Dictionary)
9. Creating Knowledge out of Interlinked Data
Crowdsourcing
Def: crowdsourcing
• is the act of outsourcing tasks, traditionally performed by an employee or contractor, to an undefined, large group of people or community (a crowd), through an open call.
10. Creating Knowledge out of Interlinked Data
Introduction to LODRefine
LOD-enabled OpenRefine
Google Refine ==> OpenRefine
LODGrefine ==> LODRefine
• Supporting DBpedia (and Freebase)
• Supporting crowdsourcing
• Exporting RDF
• Extracting named entities
11. Creating Knowledge out of Interlinked Data
LODRefine’s place in LOD life cycle
12. Creating Knowledge out of Interlinked Data
OpenRefine
Cross-platform server-client application
• Runs locally
• No dataset
Supports:
• Faceted browsing
• Regular expressions
• GREL expressions
• Extensions
value.split(",")[0].strip()
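The GREL expression above keeps the text before the first comma and strips surrounding whitespace. For readers more familiar with Python, a rough equivalent is sketched below. The function name `first_field` and the sample cells are made up for illustration.

```python
def first_field(value: str) -> str:
    """Python equivalent of the GREL expression value.split(",")[0].strip()."""
    # Split on commas, keep the first piece, and drop leading/trailing whitespace.
    return value.split(",")[0].strip()


# Made-up cell values, for illustration only.
cells = ["Ljubljana, Slovenia", "  Berlin ,Germany", "Prague"]
cleaned = [first_field(cell) for cell in cells]
```

A cell without any comma is returned unchanged apart from the whitespace stripping, as the last sample value shows.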
13. Creating Knowledge out of Interlinked Data
OpenRefine
14. Creating Knowledge out of Interlinked Data
The Extensions
Extend functionalities of OpenRefine
Developed by
• Zemanta: DBpedia extension, Crowdsourcing
• DERI: RDF Refine
• Free Your Metadata Group: Named Entity Extraction extension
15. Creating Knowledge out of Interlinked Data
RDF Refine extension
Reconciliation and interlinking
• DBpedia
• Any SPARQL Endpoint or RDF dump
• Support for Apache Stanbol
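Reconciling against a SPARQL endpoint comes down to asking the endpoint which resources of a given type carry a given label. The sketch below only builds such a query as a string and sends nothing over the network. It assumes exact label matching on `rdfs:label`, which is a simplification. The function names are hypothetical and this is not the query the RDF Refine extension actually generates.

```python
def escape_literal(text: str) -> str:
    """Escape backslashes and double quotes for use inside a SPARQL string literal."""
    return text.replace("\\", "\\\\").replace('"', '\\"')


def build_reconcile_query(label: str, type_uri: str, lang: str = "en", limit: int = 5) -> str:
    """Build a SPARQL query listing resources of `type_uri` whose label equals `label`."""
    return (
        "PREFIX rdfs: <http://www.w3.org/2000/01/rdf-schema#>\n"
        "SELECT DISTINCT ?candidate WHERE {\n"
        f"  ?candidate a <{type_uri}> ;\n"
        f'             rdfs:label "{escape_literal(label)}"@{lang} .\n'
        "}\n"
        f"LIMIT {limit}"
    )


# Illustrative input: a book title and the DBpedia ontology class for books.
query = build_reconcile_query("The Hobbit", "http://dbpedia.org/ontology/Book")
```

The resulting string could be pasted into any SPARQL endpoint's query form. Fuzzy matching and ranking of candidates are left out of this sketch.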
Exporting RDF
• Defining graph shape before exporting
• Using custom vocabularies or importing existing ones
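Defining the graph shape amounts to deciding which column supplies the subject URI and which predicate each remaining column maps to. The following minimal sketch writes such a mapping out as N-Triples with plain string literals. The function name `to_ntriples`, the sample row, and the column names are assumptions for illustration. The extension itself offers a richer, interactive schema editor.

```python
def to_ntriples(rows, subject_key, predicates):
    """Serialize rows as N-Triples: one triple per non-empty mapped column."""
    lines = []
    for row in rows:
        subject = f"<{row[subject_key]}>"
        for column, predicate_uri in predicates.items():
            value = row.get(column)
            if value is None or value == "":
                continue  # skip empty cells instead of emitting empty literals
            # Escape characters that are special inside an N-Triples literal.
            literal = (
                str(value)
                .replace("\\", "\\\\")
                .replace('"', '\\"')
                .replace("\n", "\\n")
            )
            lines.append(f'{subject} <{predicate_uri}> "{literal}" .')
    return "\n".join(lines)


# Made-up sample row; example.org is a placeholder namespace.
rows = [{"uri": "http://example.org/book/1", "title": "The Hobbit", "year": ""}]
mapping = {
    "title": "http://purl.org/dc/terms/title",
    "year": "http://purl.org/dc/terms/date",
}
output = to_ntriples(rows, "uri", mapping)
```

Here the empty `year` cell produces no triple, so the output contains a single line for the title.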
Webpage: http://refine.deri.ie/
Github: https://github.com/fadmaa/grefine-rdf-extension
16. Creating Knowledge out of Interlinked Data
RDF Refine extension - reconciling
17. Creating Knowledge out of Interlinked Data
DBpedia extension
Extending reconciled data with columns from DBpedia
• RDF extension recommended
Extracting Named Entities using Zemanta API
• API key required
Webpage: http://code.zemanta.com/sparkica
Github: https://github.com/sparkica/dbpedia-extension
18. Creating Knowledge out of Interlinked Data
DBpedia extension – extending data
19. Creating Knowledge out of Interlinked Data
DBpedia extension – extracting entities
20. Creating Knowledge out of Interlinked Data
NER extension
Extracts named entities from unstructured text
Currently supports
• Alchemy API
• DBpedia Lookup
• Zemanta API
API keys required
Webpage: http://freeyourmetadata.org/named-entity-extraction/
Github: https://github.com/RubenVerborgh/Refine-NER-Extension
21. Creating Knowledge out of Interlinked Data
NER extension – extracting entities
22. Creating Knowledge out of Interlinked Data
Crowdsourcing extension
Support for
• Creating new crowdsourcing jobs
• Publishing data on CrowdFlower service
• Multiple labor channels (Amazon MT)
• CrowdFlower API key required
Job templates
• Evaluating reconciliation results
• Finding information (e.g. URLs)
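A job that evaluates reconciliation results needs one unit of work per reconciled cell, pairing the original text with the proposed link so a worker can judge the match. The sketch below prepares such units as CSV using only the standard library. The column names `name` and `candidate_uri` and the sample rows are hypothetical, and no CrowdFlower API call is shown.

```python
import csv
import io


def reconciliation_review_csv(rows):
    """Write one CSV line per row that has a reconciliation candidate to review."""
    buffer = io.StringIO()
    writer = csv.DictWriter(
        buffer, fieldnames=["name", "candidate_uri"], lineterminator="\n"
    )
    writer.writeheader()
    for row in rows:
        # Rows without a candidate have nothing for the crowd to evaluate.
        if row.get("candidate_uri"):
            writer.writerow(
                {"name": row["name"], "candidate_uri": row["candidate_uri"]}
            )
    return buffer.getvalue()


# Made-up sample rows, for illustration only.
sample = [
    {"name": "Wayne Gretzky", "candidate_uri": "http://dbpedia.org/resource/Wayne_Gretzky"},
    {"name": "Unknown Player", "candidate_uri": None},
]
units = reconciliation_review_csv(sample)
```

The resulting text could be saved to a file and uploaded as job data. Unreconciled rows are filtered out beforehand so that no paid work is spent on them.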
Webpage: http://code.zemanta.com/sparkica/
Github: https://github.com/sparkica/crowdsourcing
23. Creating Knowledge out of Interlinked Data
Crowdsourcing extension – create job from template
24. Creating Knowledge out of Interlinked Data
Crowdsourcing extension – upload data
25. Creating Knowledge out of Interlinked Data
Availability of LODRefine & extensions
26. Creating Knowledge out of Interlinked Data
Availability of LODRefine & extensions
27. Creating Knowledge out of Interlinked Data
Demonstration
Top 50 summer books by Forbes
• Creating project
• Preparing data
• Reconciling, extending data with DBpedia
Reconciliation evaluation for NHL players (links extracted from blogs)
• Create crowdsourcing job from template
• Upload data to CrowdFlower
28. Creating Knowledge out of Interlinked Data
Contact
Zemanta
Celovska 32, SI-1000 Ljubljana, Slovenia
Presenter
Mateja Verlic
• Email: mateja.verlic@zemanta.com
• Twitter: @sparkica
• Skype: mverlic
LODRefine and extensions – resources
LODRefine
• Webpage: http://code.zemanta.com/sparkica
• Github: https://github.com/sparkica/OpenRefine/tree/lodrefine
Extensions
• DBpedia extension: https://github.com/sparkica/dbpedia-extension
• Crowdsourcing extension: https://github.com/sparkica/crowdsourcing
• Refine-stats extension: https://github.com/sparkica/refine-stats
• Utilities extension: https://github.com/sparkica/utilities
Other extensions – resources
RDF extension
• Webpage: http://refine.deri.ie/
• Github: https://github.com/fadmaa/grefine-rdf-extension
NER extension
• Webpage: http://freeyourmetadata.org/named-entity-extraction/
• Github: https://github.com/RubenVerborgh/Refine-NER-Extension
LOD2 project & Webinars
• LOD2 project: http://lod2.eu
• Webinar series: http://lod2.eu/BlogPost/webinar-series
OpenRefine Resources
• Google Group: https://groups.google.com/forum/#!forum/openrefine
• Github: https://github.com/OpenRefine/OpenRefine/
• Wiki: https://github.com/OpenRefine/OpenRefine/wiki
Thanks for your attention!
29. Creating Knowledge out of Interlinked Data
Credits
• Jingle: R.E.M., Martin Kaltenböck, Florian Kondert
• Coordination: Thomas Thurner, Martin Kaltenböck
• Moderation: Martin Kaltenböck
• Presented by: Mateja Verlič
30. Creating Knowledge out of Interlinked Data
Hope you enjoyed staying with us. If you need more detailed information, visit us at www.lod2.eu and let us know how we can improve to meet your expectations!
Don’t forget to register for our next webinars:
• 26.02.2013: DBpedia Spotlight (University of Mannheim)
• 27.03.2013: CKAN and publicdata.eu (Open Knowledge Foundation)
Have a great day and don’t forget ...
http://lod2.eu
31. Creating Knowledge out of Interlinked Data