LOD2 Webinar Series: Zemanta / Open refine
 


This webinar, part of the LOD2 webinar series, presents Zemanta and its LODRefine, a LOD-enabled version of OpenRefine (previously Google Refine) that is part of the LOD2 stack. LODRefine extends the cleansing and linking functionality of OpenRefine: it can reconcile and augment your data with DBpedia or any other SPARQL endpoint, extract named entities using the Zemanta API, export data in one of the RDF formats, and, more recently, make use of available crowdsourcing services. In the webinar we will walk through several tasks that demonstrate the ease of use and versatility of LODRefine.




If you are interested in Linked (Open) Data principles and mechanisms, LOD tools & services, and concrete use cases that can be realised using LOD, then join us in the free LOD2 webinar series: http://lod2.eu/BlogPost/webinar-series


Usage Rights

CC Attribution License


    LOD2 Webinar Series: Zemanta / Open refine Presentation Transcript

    • Creating Knowledge out of Interlinked Data LOD2 Webinar . 29.11.2011 . Page 1 http://lod2.eu
    • LOD2 is a large-scale integrating project co-funded by the European Commission within the FP7 Information and Communication Technologies Work Programme. This 4-year project comprises leading Linked Open Data technology researchers, companies, and service providers. Coming from across 12 countries, the partners are coordinated by the Agile Knowledge Engineering and Semantic Web Research Group at the University of Leipzig, Germany. LOD2 will integrate and syndicate Linked Data with existing large-scale applications. The project shows the benefits in the scenarios of Media and Publishing, Corporate Data intranets and eGovernment. http://lod2.eu
    • Once per month the LOD2 webinar series offers a free webinar about tools and services along the Linked Open Data Life Cycle. Stay with us and learn more about acquisition, editing, composing, connected applications, and finally publishing Linked Open Data. http://lod2.eu
    • LODRefine – LOD-enabled OpenRefine
      The tool for cleansing, linking and augmenting data
      by Mateja Verlic, Zemanta
    • Company
      Zemanta brings useful content to bloggers, connects authors to their peers and publishers to marketers.
        • Content research services
        • Content enrichment tools
      Our role in LOD2
        • Web scale link & text mining from unstructured data
        • Tools for cleansing data and crowdsourcing of cleansing
      Dr. Mateja Verlič
    • Presentation outline
        • Terminology briefing
        • Introduction to LODRefine
        • The core: OpenRefine
        • LOD-friendly extensions
        • Demonstration
        • Q&A
    • Reconciling
      Def: to reconcile
        • To reestablish a close relationship between.
        • To make compatible or consistent.
      (The Free Dictionary)
    • Augmenting / extending
      Def: to augment
        • To make (something already developed or well under way) greater, as in size, extent, or quantity
      (The Free Dictionary)
    • Crowdsourcing
      Def: crowdsourcing
        • The act of outsourcing tasks, traditionally performed by an employee or contractor, to an undefined, large group of people or community (a crowd), through an open call.
    • Introduction to LODRefine
      LOD-enabled OpenRefine
      Google Refine ==> OpenRefine
      LODGrefine ==> LODRefine
        • Supporting DBpedia (and Freebase)
        • Supporting crowdsourcing
        • Exporting RDF
        • Extracting named entities
    • LODRefine’s place in LOD life cycle
    • OpenRefine
      Cross-platform server-client application
        • Runs locally
        • No dataset
      Supports:
        • Faceted browsing
        • Regular expressions
        • GREL expressions
        • Extensions
      value.split(",")[0].strip()
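The expression on the OpenRefine slide, value.split(",")[0].strip(), keeps the part of a cell before the first comma and trims surrounding whitespace. As a minimal sketch of that behaviour outside OpenRefine, the same chain can be written in plain Python. The function name first_field and the sample values are hypothetical and are not taken from the webinar.

```python
def first_field(value: str) -> str:
    """Return the text before the first comma, with surrounding whitespace removed."""
    # split(",") always returns at least one element, so [0] is safe
    # even when the cell contains no comma at all.
    return value.split(",")[0].strip()


# Made-up sample cells, standing in for one column of a dataset.
cells = ["Ljubljana, Slovenia", "  Leipzig , Germany", "Vienna"]
cleaned = [first_field(cell) for cell in cells]
print(cleaned)
```

A cell without a comma passes through unchanged apart from trimming, which is why this kind of transform is safe to apply to a whole column.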
    • OpenRefine
    • The Extensions
      Extend functionalities of OpenRefine
      Developed by
        • Zemanta: DBpedia extension, Crowdsourcing
        • DERI: RDF Refine
        • Free Your Metadata Group: Named Entity Extraction extension
    • RDF Refine extension
      Reconciliation and interlinking
        • DBpedia
        • Any SPARQL endpoint or RDF dump
        • Support for Apache Stanbol
      Exporting RDF
        • Defining graph shape before exporting
        • Using custom vocabularies or importing existing ones
      Webpage: http://refine.deri.ie/
      Github: https://github.com/fadmaa/grefine-rdf-extension
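To make reconciliation against a SPARQL endpoint more concrete, here is a minimal sketch of the kind of query a reconciliation step could send for one cell value. It is an assumption for illustration only: the slides do not show the queries RDF Refine actually issues, and the helper name build_label_query, the use of rdfs:label, and exact label matching are all choices made here. The sketch only builds the query text and performs no network request.

```python
def build_label_query(label: str, lang: str = "en", limit: int = 5) -> str:
    """Build a SPARQL query that looks up entities by exact rdfs:label match.

    Hypothetical helper: real reconciliation services usually add fuzzy
    matching and type filters, which are left out of this sketch.
    """
    # Escape backslashes and double quotes so the label is a valid
    # SPARQL string literal.
    escaped = label.replace("\\", "\\\\").replace('"', '\\"')
    return (
        "PREFIX rdfs: <http://www.w3.org/2000/01/rdf-schema#>\n"
        "SELECT DISTINCT ?entity WHERE {\n"
        f'  ?entity rdfs:label "{escaped}"@{lang} .\n'
        "}\n"
        f"LIMIT {int(limit)}"
    )


query = build_label_query("Ljubljana")
print(query)
```

The resulting text could then be sent to whichever endpoint is configured; each returned ?entity is a reconciliation candidate for the cell.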
    • RDF Refine extension - reconciling
    • DBpedia extension
      Extending reconciled data with columns from DBpedia
        • RDF extension recommended
      Extracting Named Entities using Zemanta API
        • API key required
      Webpage: http://code.zemanta.com/sparkica
      Github: https://github.com/sparkica/dbpedia-extension
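Extending reconciled data means fetching one more property for rows that already carry an entity URI. The following sketch shows one way such a lookup could be phrased as a single SPARQL query using a VALUES clause. It is not the DBpedia extension's implementation. The helper name build_extension_query is hypothetical, and the example URIs are illustrative. No request is sent.

```python
from typing import Iterable


def build_extension_query(entity_uris: Iterable[str], property_uri: str) -> str:
    """Build a SPARQL query returning property values for several entities.

    Hypothetical helper: the URIs are assumed to be already validated,
    as they would be after a successful reconciliation step.
    """
    # VALUES restricts ?entity to the reconciled URIs, so one query
    # can fill a new column for many rows at once.
    values = " ".join(f"<{uri}>" for uri in entity_uris)
    return (
        "SELECT ?entity ?value WHERE {\n"
        f"  VALUES ?entity {{ {values} }}\n"
        f"  ?entity <{property_uri}> ?value .\n"
        "}"
    )


# Illustrative inputs: two reconciled cells and one property to add as a column.
query = build_extension_query(
    ["http://dbpedia.org/resource/Ljubljana", "http://dbpedia.org/resource/Leipzig"],
    "http://dbpedia.org/ontology/populationTotal",
)
print(query)
```

Rows whose entity has no value for the property simply produce no result binding, so the new column would stay empty for them.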
    • DBpedia extension – extending data
    • DBpedia extension – extracting entities
    • NER extension
      Extracts named entities from unstructured text
      Currently supports
        • Alchemy API
        • DBpedia Lookup
        • Zemanta API
      API keys required
      Webpage: http://freeyourmetadata.org/named-entity-extraction/
      Github: https://github.com/RubenVerborgh/Refine-NER-Extension
    • NER extension – extracting entities
    • Crowdsourcing extension
      Support for
        • Creating new crowdsourcing jobs
        • Publishing data on CrowdFlower service
        • Multiple labor channels (Amazon MT)
        • CrowdFlower API key required
      Job templates
        • Evaluating reconciliation results
        • Finding information (e.g. URLs)
      Webpage: http://code.zemanta.com/sparkica/
      Github: https://github.com/sparkica/crowdsourcing
    • Crowdsourcing extension – create job from template
    • Crowdsourcing extension – upload data
    • Availability of LODRefine & extensions
    • Demonstration
      Top 50 summer books by Forbes
        • Creating project
        • Preparing data
        • Reconciling, extending data with DBpedia
      Reconciliation evaluation for NHL players (links extracted from blogs)
        • Create crowdsourcing job from template
        • Upload data to CrowdFlower
    • Contact
      Zemanta
        Celovska 32, SI-1000 Ljubljana, Slovenia
      Presenter
        Mateja Verlic
        Email: mateja.verlic@zemanta.com
        Twitter: @sparkica
        Skype: mverlic
      LODRefine and extensions – resources
        LODRefine
          Webpage: http://code.zemanta.com/sparkica
          Github: https://github.com/sparkica/OpenRefine/tree/lodrefine
        Extensions
          DBpedia extension: https://github.com/sparkica/dbpedia-extension
          Crowdsourcing extension: https://github.com/sparkica/crowdsourcing
          Refine-stats extension: https://github.com/sparkica/refine-stats
          Utilities extension: https://github.com/sparkica/utilities
      Other extensions – resources
        RDF extension
          Webpage: http://refine.deri.ie/
          Github: https://github.com/fadmaa/grefine-rdf-extension
        NER extension
          Webpage: http://freeyourmetadata.org/named-entity-extraction/
          Github: https://github.com/RubenVerborgh/Refine-NER-Extension
      LOD2 project & Webinars
        LOD2 project: http://lod2.eu
        Webinar series: http://lod2.eu/BlogPost/webinar-series
      OpenRefine Resources
        Google Group: https://groups.google.com/forum/#!forum/openrefine
        Github: https://github.com/OpenRefine/OpenRefine/
        Wiki: https://github.com/OpenRefine/OpenRefine/wiki
      Thanks for your attention!
    • Credits
      Jingle: R.E.M., Martin Kaltenböck, Florian Kondert
      Coordination: Thomas Thurner, Martin Kaltenböck
      Moderation: Martin Kaltenböck
      Presented by: Mateja Verlič
    • Hope you enjoyed staying with us. If you need more detailed information, visit us at www.lod2.eu and let us know how we can improve to meet your expectations!
      Don’t forget to register for our next webinar
        26.02.2013 – DBpedia Spotlight (University of Mannheim)
        27.03.2013 – CKAN and publicdata.eu (Open Knowledge Foundation)
      Have a great day and don’t forget ... http://lod2.eu