Your SlideShare is downloading. ×
LOD2 - Creating Knowledge out of Interlinked Data - General Presentation
Upcoming SlideShare
Loading in...5
×

Thanks for flagging this SlideShare!

Oops! An error has occurred.

×

Introducing the official SlideShare app

Stunning, full-screen experience for iPhone and Android

Text the download link to your phone

Standard text messaging rates apply

LOD2 - Creating Knowledge out of Interlinked Data - General Presentation

2,173
views

Published on

General presentation for the FP7, EU funded project: LOD2 - Creating Knowledge out of Interlinked Data.

General presentation for the FP7, EU funded project: LOD2 - Creating Knowledge out of Interlinked Data.

Published in: Education, Technology

0 Comments
3 Likes
Statistics
Notes
  • Be the first to comment

No Downloads
Views
Total Views
2,173
On Slideshare
0
From Embeds
0
Number of Embeds
0
Actions
Shares
0
Downloads
0
Comments
0
Likes
3
Embeds 0
No embeds

Report content
Flagged as inappropriate Flag as inappropriate
Flag as inappropriate

Select your reason for flagging this presentation as inappropriate.

Cancel
No notes for slide
  • The LOD2 consortium partners bring the essential knowledge and tools to build the LOD2 stack …
  • Transcript

    • 1. Creating Knowledge out of Interlinked Data Intelligent Information Management Collaborative Project 2010-2014 in Information and Communication Technologies Project No. 257943 Start Date 01/09/2010 http://lod2.euEU-FP7 LOD2 Project Overview . 02.09.2010 . Page 1 http://lod2.eu
    • 2. Creating Knowledge out of Interlinked Data The emerging Web of Data achievements and challenges• Web - a global, distributed platform for data, information and knowledge integration• exposing, sharing, and connecting pieces of data, information, and knowledge on the Semantic Web using URIs and RDF Achievements  Challenges 1. Extension of the Web with 1. Coherence: Relatively few, a data commons expensively maintained links (currently amounting 25 2. Quality: partly low qualityJuly 2007 April 2008 September 2008 Billion facts) data and inconsistencies 2. vibrant, global RTD 3. Performance: Still community substantial penalties 3. Industrial uptake begins compared to relational (e.g. BBC, Thomson 4. Data consumption: large- Reuters, Eli Lilly) scale processing, schema mapping and data fusion still 4. Emerging governmental in its infancy adoption in sight 5. Usability: Missing direct end- 5. Establishing Linked Data user tools and network effect as a deployment path for the Semantic Web. These issues are closely related July 2009 and should ultimately lead to an ecosystem of interlinked EU-FP7 LOD2 Project Overview . 02.09.2010 . Page 2 knowledge! http://lod2.eu
    • 3. Creating Knowledge out of Interlinked Data LOD2 in a NutshellResearch focus• Very large RDF data management• Knowledge Enrichment & Interlinking• Fusion & Information Quality• Adaptive, semantic user interfacesUse Cases• Media & Publishing• Enterprise Data Webs• Open Gov DataMain Result• Integrated LOD2-Stack for Linked Data lifecycle managementPartnerUni Leipzig, CWI, DERIGalway, FU Berlin,Semantic Web Company,OpenLink, Tenforce,Exalead, Wolters Kluwer,OKFN 3 EU-FP7 LOD2 Project Overview . 02.09.2010 . Page 3 http://lod2.eu
    • 4. Creating Knowledge out of Interlinked Data LOD2 EC-funded collabarotive project that aims to utilize the Web as an integration platform for data and information Linked Data Linked Data provides the necessary basic technologies and standards to realize the goal of LOD2. Linked Open Data publicly accessible data which is to be integrated into the web and linked among one another and with non-public contents such as enterprise intranets Project Highlights Open Government Linked Data Initiative Common European platform publicdata.eu Leading Web 3.0 technologies are combined in the project in to the coherent LOD2 stack (e.g. DBpedia, Virtuoso, Sindice, Silk)EU-FP7 LOD2 Project Overview . 02.09.2010 . Page 4 http://lod2.eu
    • 5. Creating Knowledge out of Interlinked DataWP1: Requirements, Design & LOD2 Stack Prototype Use Case High-Level AbstractionEU-FP7 LOD2 Project Overview . 02.09.2010 . Page 5 http://lod2.eu
    • 6. Creating Knowledge out of Interlinked Data WP1: Use Case Objectives Objective of WP8: Applying Linked Data technologies in an enterprise stack to support Human Resources-related issues. ENTERPRISE APPLICATIONS (Exalead)Objective of WP7: MEDIA OPEN Objective of WP9:Supporting content-related & GOVERNMENT Improving accessibility,production workflows in the PUBLISHING DATA findability and reusability ofmedia & publishing industry. Open Government Data. (Wolters (Open Kluwer Knowledge Germany) Foundation) EU-FP7 LOD2 Project Overview . 02.09.2010 . Page 6 http://lod2.eu
    • 7. Creating Knowledge out of Interlinked DataWP2: Storing & Querying Very Large Knowledge BasesGoal: Enabling large-scale, feature-rich & enterprise-ready Linked Data management solutionsDatabase Partners in LOD2: CWI - Leading open source analytics RDBMS OpenLink - Leading Linked data deployment platformTechnological Excellence: Creating and publishing metrics for choosing RDF solutions Bringing Column Store Technology for Business Intelligence on RDF Ground-breaking database innovations for RDF stores (Dynamic Query optimization, Adaptive Caching of Joins, Optimized Graph Processing, Cluster/Cloud scalability)EU-FP7 LOD2 Project Overview . 02.09.2010 . Page 7 http://lod2.eu
    • 8. Creating Knowledge out of Interlinked DataWP2: Linked Open Data For Real In Your AppsBusiness Advantages: Enrich your application with (free & rich) Linked Open Data RDF store technology has 10x lower deployment costs than relational for ragged dataTechnological Flexibility: Deliver Schema-Last Flexibility and Inference at Relational Data Warehouse Cost and Performance Grow as you go: the LOD2 platform dynamically adapts to your usage patterns and structure of your data Integrate, resolve, align anything: Schema, instance identityRich Features for complex Applications: Advanced SPARQL and SQL query processing SPARQL and SQL Federation Full Text, Geospatial, Text Search Scale-Out on Clusters, ReplicationEU-FP7 LOD2 Project Overview . 02.09.2010 . Page 8 http://lod2.eu
    • 9. Creating Knowledge out of Interlinked Data WP3: GoalsGeneral Goal: Creation, improvement, repair of knowledge basesFocus: Very large knowledge bases, diverse knowledge, web data Refine existing (Virtuoso Sponger, RDF Views, Triplify, D2R) triplification approaches Improve schema of knowledge based on data Fix problems in knowledge bases e.g. inconsistenciesTechniques: semi-automatic machine learning, ontology debugging, NLP, shallow parsing etc. EU-FP7 LOD2 Project Overview . 02.09.2010 . Page 9 http://lod2.eu
    • 10. Creating Knowledge out of Interlinked DataWP3: Knowledge Base Improvement CycleMutual Refinement Cycle(with optional Extractionphase) Modelling Problems Repair Performance Problems Disjoint- Semi- ness Structured Definitions structured Extraction Enrichment Un- Linkage structured ValidationEU-FP7 LOD2 Project Overview . 02.09.2010 . Page 10 http://lod2.eu
    • 11. Creating Knowledge out of Interlinked DataWP3: Task OverviewProvenance-Aware Extraction of Linked Data from Existing Structured Formats Relational databases, spreadsheets, CMS, logs, XML documents Development of D2R Triplify, Virtuoso SpongerProvenance-Aware Extraction of Linked Data from Unstructured and Semi-Structured Sources HTML, PDF and Office documents with meta-data, wiki code, plain text Development of NLP2RDF and DBpediaKnowledge Base Schema Enrichment Learn axioms in knowledge bases, e.g. disjointness, definitions, super-classes Development of ORE and DL-LearnerKnowledge Base Repair Fix inconsistencies, modeling problems, reasoning performance problems Development of OREWeb Linkage Validator Reports whether knowledge base is suitable to be interlinked with othersEU-FP7 LOD2 Project Overview . 02.09.2010 . Page 11 http://lod2.eu
    • 12. Creating Knowledge out of Interlinked DataWP4: Reuse, Interlinking and Knowledge Fusion (1)Goal: Provide open-source software components for link generation, schemamapping, data quality assessment and knowledge fusion.EU-FP7 LOD2 Project Overview . 02.09.2010 . Page 12 http://lod2.eu
    • 13. Creating Knowledge out of Interlinked DataWP4: Reuse, Interlinking and Knowledge Fusion (2)Technological Excellence: Ease the creation of RDF links by using machine-learning as well as link quality assessment workbench Provide for the flexible integration of Web data based on mappings discovered on the Web Provide for assessing the quality of Web data and fusing high-quality data.Expected Outcomes: Link discovery tools, linking assist and workbench Framework for publishing and discovering expressive mappings on the Web Data quality assessment framework providing for a wide range of different quality assessment policies Data fusion components providing various conflict resolution strategiesEU-FP7 LOD2 Project Overview . 02.09.2010 . Page 13 http://lod2.eu
    • 14. Creating Knowledge out of Interlinked DataWP 5: Linked Data Visualization, Browsing and Authoring (1)WP5 aims to build on and go beyond existing approaches for realizing adaptive Webuser interfaces by: Automatic content adaptationA subset of the domain knowledge is identified, possibly through some reasoningmechanism, as relevant to the current user and context. Adaptive BrowsingFaceted spatial semantic browsing: reusable component for browsing spatial contentin a faceted way (also for mobiles).EU-FP7 LOD2 Project Overview . 02.09.2010 . Page 14 http://lod2.eu
    • 15. Creating Knowledge out of Interlinked DataWP5: Linked Data Visualization, Browsing and Authoring (2)Adaptive Semantic Authoring semantic widget interface: allows the creation of small reusable interface components for domain- specific user interfaces (also for mobiles). adaptive widget choreography: enables the automatic generation of user interfaces. social networking interfaces: enable users to subscribe to arbitrary information adhering to certain semantically defined filter criteria.EU-FP7 LOD2 Project Overview . 02.09.2010 . Page 15 http://lod2.eu
    • 16. Creating Knowledge out of Interlinked DataWP 5: Technologies & Methods (1) Semantic pipes:An engine and graphicalenvironment for general WebData transformations and Mashup. Sig.ma:A service and an end-userapplication to access theWeb of Data as an integratedinformation space.EU-FP7 LOD2 Project Overview . 02.09.2010 . Page 16 http://lod2.eu
    • 17. Creating Knowledge out of Interlinked DataWP 5: Technologies & Methods (2)Site Services: Site Search and Site Widgets Widgets (right) provide relevant information, from Sindice, about the topic of the site. Site search (below) provides a rich faceted-browsing functionality of the site’s widgets.EU-FP7 LOD2 Project Overview . 02.09.2010 . Page 17 http://lod2.eu
    • 18. Creating Knowledge out of Interlinked DataWP6: Interfaces, Integration & LOD2 Stack (1)This work package deploys the LOD2 stack, based on the requirements andprerequisitesdefined in work package 1.The LOD2 stack will be made available as downloadable packages.While leading work package 6, the following will be delivered: Integrated user-interface components Integrated LOD2 Stack API components Evaluation, Documentation, TutorialsEU-FP7 LOD2 Project Overview . 02.09.2010 . Page 18 http://lod2.eu
    • 19. Creating Knowledge out of Interlinked DataWP6: Interfaces, Integration & LOD2 Stack (2)Output WP1Use Case WP7: Media & Media &Publishing Publishing Yearly releases Requirements/ Applied on PrerequisitesUse Case Generates Open WP8: Requirements/ AppliedEnterprise Prerequisites packages with Source on Media & integrated toolsData web Package Publishing SAF LOD2 stack (Software Assembly Factory) Requirements/ Released on Starts on 09/2011 • 09/2012 Prerequisites Applied on • 09/2013 • 09/2014 Use Case WP9:Government Media & Data PublishingEU-FP7 LOD2 Project Overview . 02.09.2010 . Page 19 http://lod2.eu
    • 20. Creating Knowledge out of Interlinked DataWP7: LOD2 for PublishersWolters Kluwer Deutschland (WKD):“Semantic Technologies and Standards are an enabler for the media and publishingindustry to create added-value for their customers with reasonable costs.“ WKD Legal & RegulatoryCompanies/Brands Products (Examples)- Carl Heymanns Verlag - IP, Administrative Law WKD is part of Wolters Kluwer B.V.- Luchterhand - Civil, Family, Labor Law- Werner Verlag - Construction Law Customer orientation Worldwide reach- Carl Link - Publications for Schools/KiTas - Lawyers - Europe- CW Haarfeld - Public Health Insurance - Tax Accountants - North America- Deutscher - Magazin „Personalwirtschaft“ - Corporations and SMEs - Asia/PacificWirtschaftsdienst (HR Management) - Fincancial institutions- AnNoText - SW for Lawyers and Notaries - Health Providers Economic success- Trigon Data - Public Sector - Revenue 2009 EUR 3,4 bln. - 18.000 Employees WKD Tax & Accounting - Listed Amsterdam SECompanies/Brands Products (Examples)- Akademische Arbeits- - Tax SW for Consumersgemeinschaft Verlag- Addison Group - SW for Tax Accountants- Schleupen Tax - SW for SMEs with focus- Wago Curadata Controlling and AccountingEU-FP7 LOD2 Project Overview . 02.09.2010 . Page 20 http://lod2.eu
    • 21. Creating Knowledge out of Interlinked DataWP7: WKD as a Consumer of LOD Data Content Supply Chain of Content Composing Publishing Customer Wolters Kluwer Editing Sales Customer Acquisition Bundling Interfacing Service Deutschland (WKD) Content Acquisition Content Enrichment Enterprise ApplicationsAcquisition of LOD governmental data Enrichment of WKD data Data integration in Enterprise and other Costumer Applications- Laws & Regulations - Enrichment with additional metadata from the LOD cloud - Integration of customer and WKD data- Court cases with data from the LOD cloud - Automatic Interlinking within WKD data,- Administrative Rulings but also into the LOD cloud - Development of new services, e.g.- Statistical information around metadata economics Based on:Based on: Based on: - Adequate delivery format- Adequate delivery format - Adequate functionality - Adequate metadata- Adequate metadata - Adequate APIs - Adequate functionality- Adequate Licensing and IPR - Adequate Licensing and IPR - Adequate Licensing and IPREU-FP7 LOD2 Project Overview . 02.09.2010 . Page 21 http://lod2.eu
    • 22. Creating Knowledge out of Interlinked DataWP7: WKD as a Publisher of LOD Data Content Supply Chain of Content Composing Publishing Customer Wolters Kluwer Editing Sales Customer Acquisition Bundling Interfacing Service Deutschland (WKD) Cloud - Publishing Marketing measures Development of WKpedia Integration in overall marketing strategy of WKD - Publishing of enriched governmental information - Dissemination of LOD2 in media and publishing sector - Publishing of legal domain thesauri - Launching surveys - Motivating contextualisation in LOD cloud - Permanent information of customers Based on: - Sponsoring of conferences - Adequate functionality Based on: -Adequate APIs - Clear scope of LOD2 project to support future publishing paradigms - Adequate Licensing and IPREU-FP7 LOD2 Project Overview . 02.09.2010 . Page 22 http://lod2.eu
    • 23. Creating Knowledge out of Interlinked DataWP8: Towards Linked Enterprise Data Webs (1)Linked Enterprise Intra Data Webs can fill the gap between Intra-/Extranets and ERPsystemsFacilitates data integration along value-chains within and across enterprisesThe pragmatic, incremental, vocabulary based Linked Data approach reduces dataintegration costs significantlyObjectives: Web publishing Promote openness and standards in enterprise data workflows and applicationsEU-FP7 LOD2 Project Overview . 02.09.2010 . Page 23 http://lod2.eu
    • 24. Creating Knowledge out of Interlinked DataWP8: Linked Enterprise Data Use Case Scenario (2)Wage policy EBI: Build an application for surveying wage policy in a company, domain, sector, region, etc.Scenarios: A company wants to know if its wage policy is consistent with the market (in similar and related companies and sectors). A job applicant would like to have an idea about his wage expectations according to his expertise, profile and education background A governmental agency would like to survey the salaries in a particular region according to an economic branch and other parametersEU-FP7 LOD2 Project Overview . 02.09.2010 . Page 24 http://lod2.eu
    • 25. Creating Knowledge out of Interlinked DataWP8: Linked Enterprise Data (3)Targeted service: A Saas service with different levels of subscription The service is a mashup of payroll and HR data of enterprises subscribing to the service to build an index store of data facts about wages. Different consolidation parameters and key performance indicators (KPI) will be studied to provide relevant reports and visualisation interfaces. Integration of external datasets in a particular survey: public datasets in the web cloud or private datasets of participating companies. Privacy issues management: make private and nominative data anonymous.EU-FP7 LOD2 Project Overview . 02.09.2010 . Page 25 http://lod2.eu
    • 26. Creating Knowledge out of Interlinked Data WP8: Linked Enterprise Data (4) Preliminary overview: Search and EBI Employees interface Full and HR text databaseindex Indexer Data Data Data Data Data LODification Taxonomies consolidatio enrichment, Cleaning and crawler and n annotation uniformisatio anonymisation nRDFstore SPARQL Payroll software endpoint EU-FP7 LOD2 Project Overview . 02.09.2010 . Page 26 http://lod2.eu
    • 27. Creating Knowledge out of Interlinked DataWP9: Open Government Data Use Casepublicdata.eu - find and reuse datasets from local, regional and national publicbodies across Europe from a single placeEU-FP7 LOD2 Project Overview . 02.09.2010 . Page 27 http://lod2.eu
    • 28. Creating Knowledge out of Interlinked DataWP9: Who is this for? Data literate citizenry Data journalists Policy experts Decision makers Mobile and web developers Academics / researchers Public bodies Companies Civic society / NGOs And so on...EU-FP7 LOD2 Project Overview . 02.09.2010 . Page 28 http://lod2.eu
    • 29. Creating Knowledge out of Interlinked DataWP9: What will it involve?  Enabling exchange of metadata between different data catalogues  Aggregating datasets from existing data catalogues  Creating a European community of reusers to improve metadata  Creating mechanisms for capturing derived / related datasets  Bridge language and topical gaps to associate related information from all Member StatesEU-FP7 LOD2 Project Overview . 02.09.2010 . Page 29 http://lod2.eu
    • 30. Creating Knowledge out of Interlinked DataWP10: Training, Dissemination, Community Building & FertilizationThe general aim of this work package is to establish a worldwide focal point foracademic and industry parties interested in contributing to or taking advantage ofthe novel Linked Data methodologies and components, which will emerge in theproject.In particular, our activities will be targeted at: informing the community of the state-of-the-art developments taking place in the field, disseminating the project results in order to foster community building and to create an impact on industry and research in Europe and worldwide, providing training to interested audiences in the technologies developed throughout the projectEU-FP7 LOD2 Project Overview . 02.09.2010 . Page 30 http://lod2.eu
    • 31. Creating Knowledge out of Interlinked DataWP10: Tasks & TimelineTask 10.1 Training(M 14 – M37) Internal face-to-face training External training PhD programmeTask 10.2 Dissemination, Community Building & Cross-Fertilization(M1 – 48) Scientific dissemination Industrial dissemination Online marketing activities across all identified target groupsEU-FP7 LOD2 Project Overview . 02.09.2010 . Page 31 http://lod2.eu
    • 32. Creating Knowledge out of Interlinked Data WP10: LOD2 Dissemination Resources Website: http://lod2.eu Weblog: http://lod2.eu/BlogPost Twitter: http://twitter.com/lod2project SlideShare: http://www.slideshare.net/lod2project PUBLINK: http://lod2.eu/Article/Publink.htmlRemark: please use #lod2 on twitter for your posts & connect with account: lod2project many thanks in advance!! EU-FP7 LOD2 Project Overview . 02.09.2010 . Page 32 http://lod2.eu
    • 33. Creating Knowledge out of Interlinked DataPubLink – LOD2’s Linked Open Data Starter Service• PubLink helps selected organizations with a focused consulting effort of 10-15 days to publish and make use out of Linked Data• PubLink helps to evaluate the LOD2 technologies and to increase the wealth of Linked Data• Yearly application deadline in Winter• 2011 PubLink participants include: 1. Umweltbundesamt GmbH, Austria 2. Greater London Authority 3. Deutsch Bibliographie, Historische Kommission 4. The Parliament of Finland 5. City of Vienna 6. Instituto Canario de Estadística (ISTAC)• See: http://lod2.eu/Article/Publink.htmlEU-FP7 LOD2 Project Overview . 02.09.2010 . Page 33 http://lod2.eu
    • 34. Creating Knowledge out of Interlinked DataWP11: Exploitation and Standardization (1)Objectives: Realizing the vision of the LOD2 project and use case studies Standardisation of LOD2 architecture Exploitation of knowledge and technical resultsExploitation: Use case studies and the industrial and end-user community partners will drive the exploitation. Tracking important technical and commercial in information retrieval, data management including news and media. Publish exploitation plan identifying opportunities, benefits and impact of LOD2 consortium.EU-FP7 LOD2 Project Overview . 02.09.2010 . Page 34 http://lod2.eu
    • 35. Creating Knowledge out of Interlinked DataWP11: Exploitation and Standardization (2)Interlectual Property Rights (IPR): Core component of the LOD2 stack will be published under open-source license. Domain adoptions of LOD2 stack considered on case-by-case basis to protect IPR. Strategy ensures that all components of LOD2 are royalty-free.Standardization: Actively participating in appropriate standards bodies. Establishing a W3C Linked data interest group.Orchestration with other projects: Encourage take-up of LOD2 technologies by other projects. Foster input from other EU projects relevant for the development of LOD2EU-FP7 LOD2 Project Overview . 02.09.2010 . Page 35 http://lod2.eu
    • 36. Creating Knowledge out of Interlinked DataWP12: Fact SheetProject Means Instrument: Large-scale Integrating Project Objective: Intelligent Information Management  Total Budget: 8,58 M€ Call: FP7-ICT-2009-5  Total Funding: 6,45 M€ Duration: 09/2010 – 08/2014  Total Resources: 844 PMConsortium Universität Leipzig (Coordinator) Centrum Wiskunde & Informatica National University of Ireland in Galway Freie Universität Berlin OpenLink Software Semantic Web Company TenForce Exalead Wolters Kluwer Deutschland 10 Partners from 7 European Open Knowledge Foundation CountriesEU-FP7 LOD2 Project Overview . 02.09.2010 . Page 36 http://lod2.eu
    • 37. Creating Knowledge out of Interlinked DataContactAddress CoordinatorUniversity of Leipzig Dr Sören AuerFaculty of Mathematics and Computer Science Scientific Project LeaderInstitute of Computer Science Phone:+49 (341) 97-32367Department of Business Information Systems Fax: +49 (341) 97-32329Postfach 100920 Email: auer@uni-leipzig.de04009 Leipzig http://www.informatik.uni-leipzig.de/~auerGermany Nadine Jänicke Project Manager Phone:+49 (341) 97-32310 Fax: +49 (341) 97-32329 Email: jaenicke@uni-leipzig.de http://bis.informatik.uni-leipzig.de/NadineJaenickeThanks for your attention! http://lod2.euLOD2 Title . 02.09.2010 . Page 37 http://lod2.eu