SlideShare a Scribd company logo
1 of 32
the
                             Large Knowledge Collider


                                           Frank van Harmelen
Creative Commons License:
allowed to share & remix,
but must attribute & non-commercial
                                      Vrije Universiteit Amsterdam
•   The vision
•   The project
•   The consortium
•   The plan




                      Oh
                      Yes!
                     Shit…
The Vision
 “a configurable platform for
    infinitely scalable semantic web reasoning”
Why we need
The Large Knowledge Collider

   Gartner (May 2007):
                                  "By 2012,
       70% of public Web pages will have some level of semantic markup,
         20% will use more extensive Semantic Web-based ontologies”


   • Semantic Technologies at Web Scale?
      – 20% of 30 billion pages @ 1000 triples per page =
        6 trillion triples
      – 30 billion and 1000 are underestimates,
        imagine in 6 years from now…
      – data-integration and semantic search at web-scale?




                                                                  27-June-07
1 triple:




Denny Vrandečić – AIFB, Universität Karlsruhe (TH)     5         http://www.aifb.uni-karlsruhe.de/WBS
Denny Vrandečić – AIFB, Universität Karlsruhe (TH)   6   http://www.aifb.uni-karlsruhe.de/WBS
Denny Vrandečić – AIFB, Universität Karlsruhe (TH)   7   http://www.aifb.uni-karlsruhe.de/WBS
Denny Vrandečić – AIFB, Universität Karlsruhe (TH)   8   http://www.aifb.uni-karlsruhe.de/WBS
Suez Canal                                               107 Triples
                                                          [OWLIM]




Denny Vrandečić – AIFB, Universität Karlsruhe (TH)   9   http://www.aifb.uni-karlsruhe.de/WBS
Moon                                                 RDF Store subsecond querying
                                                                       108 Triples
                                                                         [Ingenta]




Denny Vrandečić – AIFB, Universität Karlsruhe (TH)          10           http://www.aifb.uni-karlsruhe.de/WBS
Earth                                                     ~109 Triples




Denny Vrandečić – AIFB, Universität Karlsruhe (TH)   11     http://www.aifb.uni-karlsruhe.de/WBS
[LarKC proposal]
Jupiter                                              ~1010 Triples ≈ 1 triple per web-page




                                                      ≈ 1 triple per web-page




Denny Vrandečić – AIFB, Universität Karlsruhe (TH)             12               http://www.aifb.uni-karlsruhe.de/WBS
~1011 Triples




Denny Vrandečić – AIFB, Universität Karlsruhe (TH)   13      http://www.aifb.uni-karlsruhe.de/WBS
Distance Sun – Pluto
                                                          ~1014 Triples




 Fensel / Harmelen estimate
 1014 Triples
Denny Vrandečić – AIFB, Universität Karlsruhe (TH)   14      http://www.aifb.uni-karlsruhe.de/WBS
Infinitely scalable (1/2)

• by giving up 100% correctness:
    • trading quality for size
    • often completeness is not needed
    • sometimes even correctness is not needed


                          precision (soundness)                    logic
 A logician’s nightmare

 (Dieter Fensel)                                              Semantic Web




                                                              IR
                                      recall (completeness)
Infinitely scalable (2/2)

• by parallelisation:
   • cluster computing


  • wide area distribution
    “Thinking@home”,
    “self-computing semantic Web”


  • cloud computing?
    (Amazon now, Google soon?)
“Configurable platform”
“a configurable platform for
infinitely scalable semantic web reasoning”
Why “LarKC” ?

• The Large Knowledge Collider

A configurable platform
for experimentation
by others
Why “LarKC” ?

But also:




and also:
1.   a merry, carefree adventure.
2.   innocent or good-natured mischief; a prank.
3.   something extremely easy to accomplish
•   The vision
•   The consortium
•   The project
•   The plan
The consortium




      50 people present
The Consortium



 • Combining consortium competence
    – IR, Cognition
    – ML, Ontologies
    – Statistics, ML,
      Cognition,DB
    – Logic,DB,
      Probabilistic Inference
    – Economics,
      Decision Theory
Use Case 2
                    Use Case 1
                     Database
                    Technology
                          RDF
                    technology
                   Probabilistic
                     Inference
                      Machine
                      Learning
                        human
                 problemsolving
                    Information
                       Retrieval
The Consortium




                    Distributed
                    Computing
                          Logic
                 Semantic Web




                                                                                                                             WHO-IARC
                                                                                                         CEFRIEL
                                                                                    Siemens
                                                                                              Ontotext
                                                CycEur




                                                                                                                   Saltlux
                                                                USFD
                                                         HLRS
                                   UIBK




                                                                       MPG
                                                                             WICI
                                          VUA
•   The vision
•   The consortium
•   The project
•   The plan




                      Oh
                     Shit…
The project


 •   10M€ budget
 •   3.5 years
 •   80 person years
 •   3 case studies
 •   14 partners
 •   obtained in FP7 Call1:
     – overall < 10% funding rate
     – LarKC has highest funding, longest runtime
Project Workpackages
& timeline



     Exploitation and          WP1 – Conceptual Framework & Evaluation
       standards




                                                                                      WP 10: Project Management
          WP 9:




                        WP 2: Retrieval     WP3: Abstraction       WP4: Reasoning
                        and Selection        and Learning           and Deciding


                                          WP5: Collider Platform
     WP 8: Training,
     dissemination,
       community
        building




                        WP 6: Use case:     WP 7a: Use case:       WP 7b: Use case:
                        Real Time City       Early Clinical        Carcinogenesis
                                             Development              Reference
                                                                     Production
Use case:white paper Discovery
      FDA
          Drug Innovation or Stagnation (March 2004):
             “developers have no choice but to use the tools of the last century
• Problem: pharmaceutical R&D in early clinical
        to assess this century's candidate solutions.”
  development is stagnating
             “industry scientists often lack cross-cutting information about an
             entire product area, or information about techniques that may be
             used in areas other than theirs”

   “Show me any potential liver toxicity associated with the
   compound’s drug class, target, structure and disease.”

                                (Q1∩Q2∩Q3)
     Q1                         Q2                           Q3
    Show me all liver toxicity “Show me all liver toxicity   “Show me all liver toxicity
    associated with the target associated with compounds     from the public literature and
                               with similar structure”
    or the pathway.                                          internal reports that are related
                                                             to the drug class, disease and
                                                             patient population”
    Genetics                    Chemistry                    LITERATURE
                                         Current NCBI: linking but no inference
Use Case: City on-line

   • Our cities face many challenges
   • Urban Computing
     is the ICT way to
     address them        • How can we redevelop existing neighborhoods
Is public transportation where the people are?improve the quality of
                            and business districts to
                              life?
                Which       • How can we create more choices in
                        landmarks attract more people? housing,
                              accommodating diverse lifestyles and all
                              income levels?
                            Where are people concentrating?
                            • How can we reduce traffic congestion yet stay
                              connected?
                                              Where is traffic moving?
                            • How can we include citizens in planning their
                              communities rather than limiting input to only
                              those affected by the next project?
                            • How can we fund schools, bridges, roads, and
                              clean water while meeting short-term costs of
                              increased security?
•   The vision
•   The consortium
•   The project
•   The plan




                      Oh
                     Shit…
Project Timeline



• Surveys (plugins, platform)
• Requirements (use cases)

               Prototype        Internal Release    Public Release   Final Release




0          6      10                18                         33           42

                   Use Cases                       Use Cases           Use Cases
                      V1                              V2                  V3
Communication


 • Early Access Group

 • Usage Competition
   – “we will win if we start to loose”

 • We deliver:
   – software
   – publications
   – not “deliverables”
And Finally….

 • People are already looking at us:
     – “Damn... the EU is where all the cool semweb work is
       happening these days”
     – “This kind of infrastructure is exactly the kind of rocket fuel
       that is needed at this stage of semweb maturity.”
     – “The LarKC-inspired workshop on new formstiareasoning a”
                                                        of l
                                                  ten this
       the semantic web was a conference highlight for me” a       re for
                                              po i Web, LarKC
                                        the possible will quickly
     – “With the current growth rates of RDF on then
       which started out as technologically ork
                                   has le w
                             ectit alleop
       become operationally necessary”
     – “this project really jhas
                      pro y p (potentially) in terms of both
       science his impact” a
                and
             “T the w
 •   “projectsnge
         ch a already seeking collaboration:
     OKKAM, MUSING
     to

More Related Content

Similar to LarKC: the large knowledge collider

Stream Reasoning: State of the Art and Beyond
Stream Reasoning: State of the Art and BeyondStream Reasoning: State of the Art and Beyond
Stream Reasoning: State of the Art and BeyondEmanuele Della Valle
 
Efficient implementations of machine vision algorithms using a dynamically ty...
Efficient implementations of machine vision algorithms using a dynamically ty...Efficient implementations of machine vision algorithms using a dynamically ty...
Efficient implementations of machine vision algorithms using a dynamically ty...Jan Wedekind
 
Adopting a Situated Learning framework for (Big) Data Projects - Martin Dougl...
Adopting a Situated Learning framework for (Big) Data Projects - Martin Dougl...Adopting a Situated Learning framework for (Big) Data Projects - Martin Dougl...
Adopting a Situated Learning framework for (Big) Data Projects - Martin Dougl...BCS Data Management Specialist Group
 
Carpenter - Wolfram Data Summit ResourceSync
Carpenter - Wolfram Data Summit ResourceSyncCarpenter - Wolfram Data Summit ResourceSync
Carpenter - Wolfram Data Summit ResourceSyncnisohq
 
Adoption of Cloud Computing in Scientific Research
Adoption of Cloud Computing in Scientific ResearchAdoption of Cloud Computing in Scientific Research
Adoption of Cloud Computing in Scientific ResearchYehia El-khatib
 
20120411 travelalliancemcguinnessfinal
20120411 travelalliancemcguinnessfinal20120411 travelalliancemcguinnessfinal
20120411 travelalliancemcguinnessfinalDeborah McGuinness
 
The Audioverse In Your Pocket - Invited Talk at ABC Radio National - Harries ...
The Audioverse In Your Pocket - Invited Talk at ABC Radio National - Harries ...The Audioverse In Your Pocket - Invited Talk at ABC Radio National - Harries ...
The Audioverse In Your Pocket - Invited Talk at ABC Radio National - Harries ...Michael Harries
 
Oscon 2011 Practicing Open Science
Oscon 2011 Practicing Open ScienceOscon 2011 Practicing Open Science
Oscon 2011 Practicing Open ScienceMarcus Hanwell
 
OW2con'14 - Weblab in the land of Big Data
OW2con'14 - Weblab in the land of Big DataOW2con'14 - Weblab in the land of Big Data
OW2con'14 - Weblab in the land of Big DataOW2
 
Mark Hughes Annual Seminar Presentation on Open Source
Mark Hughes Annual Seminar Presentation on Open Source Mark Hughes Annual Seminar Presentation on Open Source
Mark Hughes Annual Seminar Presentation on Open Source Tracy Kent
 
Information Visualization for Knowledge Discovery: An Introduction
Information Visualization for Knowledge Discovery: An IntroductionInformation Visualization for Knowledge Discovery: An Introduction
Information Visualization for Knowledge Discovery: An IntroductionKrist Wongsuphasawat
 
Tragedy of the Data Commons (ODSC-East, 2021)
Tragedy of the Data Commons (ODSC-East, 2021)Tragedy of the Data Commons (ODSC-East, 2021)
Tragedy of the Data Commons (ODSC-East, 2021)James Hendler
 
A Justification-based Semantic Framework for Representing, Evaluating and Uti...
A Justification-based Semantic Framework for Representing, Evaluating and Uti...A Justification-based Semantic Framework for Representing, Evaluating and Uti...
A Justification-based Semantic Framework for Representing, Evaluating and Uti...Kerstin Forsberg
 
Cloud Standards in the Real World: Cloud Standards Testing for Developers
Cloud Standards in the Real World: Cloud Standards Testing for DevelopersCloud Standards in the Real World: Cloud Standards Testing for Developers
Cloud Standards in the Real World: Cloud Standards Testing for DevelopersAlan Sill
 
The Open Science Data Cloud: Empowering the Long Tail of Science
The Open Science Data Cloud: Empowering the Long Tail of ScienceThe Open Science Data Cloud: Empowering the Long Tail of Science
The Open Science Data Cloud: Empowering the Long Tail of ScienceRobert Grossman
 

Similar to LarKC: the large knowledge collider (20)

Stream Reasoning: State of the Art and Beyond
Stream Reasoning: State of the Art and BeyondStream Reasoning: State of the Art and Beyond
Stream Reasoning: State of the Art and Beyond
 
Efficient implementations of machine vision algorithms using a dynamically ty...
Efficient implementations of machine vision algorithms using a dynamically ty...Efficient implementations of machine vision algorithms using a dynamically ty...
Efficient implementations of machine vision algorithms using a dynamically ty...
 
Adopting a Situated Learning framework for (Big) Data Projects - Martin Dougl...
Adopting a Situated Learning framework for (Big) Data Projects - Martin Dougl...Adopting a Situated Learning framework for (Big) Data Projects - Martin Dougl...
Adopting a Situated Learning framework for (Big) Data Projects - Martin Dougl...
 
Hak intis2013
Hak intis2013Hak intis2013
Hak intis2013
 
A RESTful WfXML
A RESTful WfXMLA RESTful WfXML
A RESTful WfXML
 
Carpenter - Wolfram Data Summit ResourceSync
Carpenter - Wolfram Data Summit ResourceSyncCarpenter - Wolfram Data Summit ResourceSync
Carpenter - Wolfram Data Summit ResourceSync
 
Resource Sync - Introduction
Resource Sync - IntroductionResource Sync - Introduction
Resource Sync - Introduction
 
Adoption of Cloud Computing in Scientific Research
Adoption of Cloud Computing in Scientific ResearchAdoption of Cloud Computing in Scientific Research
Adoption of Cloud Computing in Scientific Research
 
20120411 travelalliancemcguinnessfinal
20120411 travelalliancemcguinnessfinal20120411 travelalliancemcguinnessfinal
20120411 travelalliancemcguinnessfinal
 
The Audioverse In Your Pocket - Invited Talk at ABC Radio National - Harries ...
The Audioverse In Your Pocket - Invited Talk at ABC Radio National - Harries ...The Audioverse In Your Pocket - Invited Talk at ABC Radio National - Harries ...
The Audioverse In Your Pocket - Invited Talk at ABC Radio National - Harries ...
 
Oscon 2011 Practicing Open Science
Oscon 2011 Practicing Open ScienceOscon 2011 Practicing Open Science
Oscon 2011 Practicing Open Science
 
OW2con'14 - Weblab in the land of Big Data
OW2con'14 - Weblab in the land of Big DataOW2con'14 - Weblab in the land of Big Data
OW2con'14 - Weblab in the land of Big Data
 
Mark Hughes Annual Seminar Presentation on Open Source
Mark Hughes Annual Seminar Presentation on Open Source Mark Hughes Annual Seminar Presentation on Open Source
Mark Hughes Annual Seminar Presentation on Open Source
 
Information Visualization for Knowledge Discovery: An Introduction
Information Visualization for Knowledge Discovery: An IntroductionInformation Visualization for Knowledge Discovery: An Introduction
Information Visualization for Knowledge Discovery: An Introduction
 
Tragedy of the Data Commons (ODSC-East, 2021)
Tragedy of the Data Commons (ODSC-East, 2021)Tragedy of the Data Commons (ODSC-East, 2021)
Tragedy of the Data Commons (ODSC-East, 2021)
 
A Justification-based Semantic Framework for Representing, Evaluating and Uti...
A Justification-based Semantic Framework for Representing, Evaluating and Uti...A Justification-based Semantic Framework for Representing, Evaluating and Uti...
A Justification-based Semantic Framework for Representing, Evaluating and Uti...
 
ITWS Capstone Lecture (Spring 2013)
ITWS Capstone Lecture (Spring 2013)ITWS Capstone Lecture (Spring 2013)
ITWS Capstone Lecture (Spring 2013)
 
ResourceSync - An Introduction
ResourceSync - An IntroductionResourceSync - An Introduction
ResourceSync - An Introduction
 
Cloud Standards in the Real World: Cloud Standards Testing for Developers
Cloud Standards in the Real World: Cloud Standards Testing for DevelopersCloud Standards in the Real World: Cloud Standards Testing for Developers
Cloud Standards in the Real World: Cloud Standards Testing for Developers
 
The Open Science Data Cloud: Empowering the Long Tail of Science
The Open Science Data Cloud: Empowering the Long Tail of ScienceThe Open Science Data Cloud: Empowering the Long Tail of Science
The Open Science Data Cloud: Empowering the Long Tail of Science
 

More from Frank van Harmelen

The K in "neuro-symbolic" stands for "knowledge"
The K in "neuro-symbolic" stands for "knowledge"The K in "neuro-symbolic" stands for "knowledge"
The K in "neuro-symbolic" stands for "knowledge"Frank van Harmelen
 
Adoption of Knowledge Graphs, mid 2022 (incomplete)
Adoption of Knowledge Graphs, mid 2022 (incomplete)Adoption of Knowledge Graphs, mid 2022 (incomplete)
Adoption of Knowledge Graphs, mid 2022 (incomplete)Frank van Harmelen
 
Modular design patterns for systems that learn and reason: a boxology
Modular design patterns for systems that learn and reason: a boxologyModular design patterns for systems that learn and reason: a boxology
Modular design patterns for systems that learn and reason: a boxologyFrank van Harmelen
 
Adoption of Knowledge Graphs, late 2019
Adoption of Knowledge Graphs, late 2019Adoption of Knowledge Graphs, late 2019
Adoption of Knowledge Graphs, late 2019Frank van Harmelen
 
Adoption of Knowledge Graphs, mid 2019
Adoption of Knowledge Graphs, mid 2019Adoption of Knowledge Graphs, mid 2019
Adoption of Knowledge Graphs, mid 2019Frank van Harmelen
 
The Empirical Turn in Knowledge Representation
The Empirical Turn in Knowledge RepresentationThe Empirical Turn in Knowledge Representation
The Empirical Turn in Knowledge RepresentationFrank van Harmelen
 
The end of the scientific paper as we know it (or not...)
The end of the scientific paper as we know it (or not...)The end of the scientific paper as we know it (or not...)
The end of the scientific paper as we know it (or not...)Frank van Harmelen
 
On the nature of AI, and the relation between symbolic and statistical approa...
On the nature of AI, and the relation between symbolic and statistical approa...On the nature of AI, and the relation between symbolic and statistical approa...
On the nature of AI, and the relation between symbolic and statistical approa...Frank van Harmelen
 
The end of the scientific paper as we know it (in 4 easy steps)
The end of the scientific paper as we know it (in 4 easy steps)The end of the scientific paper as we know it (in 4 easy steps)
The end of the scientific paper as we know it (in 4 easy steps)Frank van Harmelen
 
Linked Open Data for Medical Guidelines Interactions
Linked Open Data for Medical  Guidelines InteractionsLinked Open Data for Medical  Guidelines Interactions
Linked Open Data for Medical Guidelines InteractionsFrank van Harmelen
 
The Web of Data: do we actually understand what we built?
The Web of Data: do we actually understand what we built?The Web of Data: do we actually understand what we built?
The Web of Data: do we actually understand what we built?Frank van Harmelen
 
Semantic Web questions we couldn't ask 10 years ago
Semantic Web questions we couldn't ask 10 years agoSemantic Web questions we couldn't ask 10 years ago
Semantic Web questions we couldn't ask 10 years agoFrank van Harmelen
 
Knowledge Engineering rediscovered, Towards Reasoning Patterns for the Semant...
Knowledge Engineering rediscovered, Towards Reasoning Patterns for the Semant...Knowledge Engineering rediscovered, Towards Reasoning Patterns for the Semant...
Knowledge Engineering rediscovered, Towards Reasoning Patterns for the Semant...Frank van Harmelen
 
Informatics is a natural science
Informatics is a natural scienceInformatics is a natural science
Informatics is a natural scienceFrank van Harmelen
 
How the Web can change social science research (including yours)
How the Web can change social science research (including yours)How the Web can change social science research (including yours)
How the Web can change social science research (including yours)Frank van Harmelen
 
4 Popular Fallacies about the Semantic Web
4 Popular Fallacies about the Semantic Web4 Popular Fallacies about the Semantic Web
4 Popular Fallacies about the Semantic WebFrank van Harmelen
 

More from Frank van Harmelen (20)

The K in "neuro-symbolic" stands for "knowledge"
The K in "neuro-symbolic" stands for "knowledge"The K in "neuro-symbolic" stands for "knowledge"
The K in "neuro-symbolic" stands for "knowledge"
 
Adoption of Knowledge Graphs, mid 2022 (incomplete)
Adoption of Knowledge Graphs, mid 2022 (incomplete)Adoption of Knowledge Graphs, mid 2022 (incomplete)
Adoption of Knowledge Graphs, mid 2022 (incomplete)
 
Modular design patterns for systems that learn and reason: a boxology
Modular design patterns for systems that learn and reason: a boxologyModular design patterns for systems that learn and reason: a boxology
Modular design patterns for systems that learn and reason: a boxology
 
Adoption of Knowledge Graphs, late 2019
Adoption of Knowledge Graphs, late 2019Adoption of Knowledge Graphs, late 2019
Adoption of Knowledge Graphs, late 2019
 
Adoption of Knowledge Graphs, mid 2019
Adoption of Knowledge Graphs, mid 2019Adoption of Knowledge Graphs, mid 2019
Adoption of Knowledge Graphs, mid 2019
 
Empirical Semantics
Empirical SemanticsEmpirical Semantics
Empirical Semantics
 
The Empirical Turn in Knowledge Representation
The Empirical Turn in Knowledge RepresentationThe Empirical Turn in Knowledge Representation
The Empirical Turn in Knowledge Representation
 
The end of the scientific paper as we know it (or not...)
The end of the scientific paper as we know it (or not...)The end of the scientific paper as we know it (or not...)
The end of the scientific paper as we know it (or not...)
 
On the nature of AI, and the relation between symbolic and statistical approa...
On the nature of AI, and the relation between symbolic and statistical approa...On the nature of AI, and the relation between symbolic and statistical approa...
On the nature of AI, and the relation between symbolic and statistical approa...
 
The end of the scientific paper as we know it (in 4 easy steps)
The end of the scientific paper as we know it (in 4 easy steps)The end of the scientific paper as we know it (in 4 easy steps)
The end of the scientific paper as we know it (in 4 easy steps)
 
Linked Open Data for Medical Guidelines Interactions
Linked Open Data for Medical  Guidelines InteractionsLinked Open Data for Medical  Guidelines Interactions
Linked Open Data for Medical Guidelines Interactions
 
The Web of Data: do we actually understand what we built?
The Web of Data: do we actually understand what we built?The Web of Data: do we actually understand what we built?
The Web of Data: do we actually understand what we built?
 
Semantic Web questions we couldn't ask 10 years ago
Semantic Web questions we couldn't ask 10 years agoSemantic Web questions we couldn't ask 10 years ago
Semantic Web questions we couldn't ask 10 years ago
 
Knowledge Engineering rediscovered, Towards Reasoning Patterns for the Semant...
Knowledge Engineering rediscovered, Towards Reasoning Patterns for the Semant...Knowledge Engineering rediscovered, Towards Reasoning Patterns for the Semant...
Knowledge Engineering rediscovered, Towards Reasoning Patterns for the Semant...
 
Informatics is a natural science
Informatics is a natural scienceInformatics is a natural science
Informatics is a natural science
 
How the Web can change social science research (including yours)
How the Web can change social science research (including yours)How the Web can change social science research (including yours)
How the Web can change social science research (including yours)
 
4 Popular Fallacies about the Semantic Web
4 Popular Fallacies about the Semantic Web4 Popular Fallacies about the Semantic Web
4 Popular Fallacies about the Semantic Web
 
WCIT2010
WCIT2010WCIT2010
WCIT2010
 
Het slimme Web 3.0
Het slimme Web 3.0Het slimme Web 3.0
Het slimme Web 3.0
 
OWL briefing
OWL briefingOWL briefing
OWL briefing
 

Recently uploaded

Transcript: New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024BookNet Canada
 
WordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your BrandWordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your Brandgvaughan
 
costume and set research powerpoint presentation
costume and set research powerpoint presentationcostume and set research powerpoint presentation
costume and set research powerpoint presentationphoebematthew05
 
Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Commit University
 
Science&tech:THE INFORMATION AGE STS.pdf
Science&tech:THE INFORMATION AGE STS.pdfScience&tech:THE INFORMATION AGE STS.pdf
Science&tech:THE INFORMATION AGE STS.pdfjimielynbastida
 
APIForce Zurich 5 April Automation LPDG
APIForce Zurich 5 April  Automation LPDGAPIForce Zurich 5 April  Automation LPDG
APIForce Zurich 5 April Automation LPDGMarianaLemus7
 
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 3652toLead Limited
 
New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024BookNet Canada
 
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...shyamraj55
 
CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):comworks
 
My INSURER PTE LTD - Insurtech Innovation Award 2024
My INSURER PTE LTD - Insurtech Innovation Award 2024My INSURER PTE LTD - Insurtech Innovation Award 2024
My INSURER PTE LTD - Insurtech Innovation Award 2024The Digital Insurer
 
SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024Scott Keck-Warren
 
Vertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering TipsVertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering TipsMiki Katsuragi
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsMark Billinghurst
 
Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebUiPathCommunity
 
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Patryk Bandurski
 
Unblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesUnblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesSinan KOZAK
 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsMemoori
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Mark Simos
 

Recently uploaded (20)

Transcript: New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
 
WordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your BrandWordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your Brand
 
costume and set research powerpoint presentation
costume and set research powerpoint presentationcostume and set research powerpoint presentation
costume and set research powerpoint presentation
 
Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!
 
Science&tech:THE INFORMATION AGE STS.pdf
Science&tech:THE INFORMATION AGE STS.pdfScience&tech:THE INFORMATION AGE STS.pdf
Science&tech:THE INFORMATION AGE STS.pdf
 
DMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special EditionDMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special Edition
 
APIForce Zurich 5 April Automation LPDG
APIForce Zurich 5 April  Automation LPDGAPIForce Zurich 5 April  Automation LPDG
APIForce Zurich 5 April Automation LPDG
 
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
 
New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
 
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
 
CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):
 
My INSURER PTE LTD - Insurtech Innovation Award 2024
My INSURER PTE LTD - Insurtech Innovation Award 2024My INSURER PTE LTD - Insurtech Innovation Award 2024
My INSURER PTE LTD - Insurtech Innovation Award 2024
 
SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024
 
Vertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering TipsVertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering Tips
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR Systems
 
Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio Web
 
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
 
Unblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesUnblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen Frames
 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial Buildings
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
 

LarKC: the large knowledge collider

  • 1. the Large Knowledge Collider Frank van Harmelen Creative Commons License: allowed to share & remix, but must attribute & non-commercial Vrije Universiteit Amsterdam
  • 2. The vision • The project • The consortium • The plan Oh Yes! Shit…
  • 3. The Vision “a configurable platform for infinitely scalable semantic web reasoning”
  • 4. Why we need The Large Knowledge Collider Gartner (May 2007): "By 2012, 70% of public Web pages will have some level of semantic markup, 20% will use more extensive Semantic Web-based ontologies” • Semantic Technologies at Web Scale? – 20% of 30 billion pages @ 1000 triples per page = 6 trillion triples – 30 billion and 1000 are underestimates, imagine in 6 years from now… – data-integration and semantic search at web-scale? 27-June-07
  • 5. 1 triple: Denny Vrandečić – AIFB, Universität Karlsruhe (TH) 5 http://www.aifb.uni-karlsruhe.de/WBS
  • 6. Denny Vrandečić – AIFB, Universität Karlsruhe (TH) 6 http://www.aifb.uni-karlsruhe.de/WBS
  • 7. Denny Vrandečić – AIFB, Universität Karlsruhe (TH) 7 http://www.aifb.uni-karlsruhe.de/WBS
  • 8. Denny Vrandečić – AIFB, Universität Karlsruhe (TH) 8 http://www.aifb.uni-karlsruhe.de/WBS
  • 9. Suez Canal 107 Triples [OWLIM] Denny Vrandečić – AIFB, Universität Karlsruhe (TH) 9 http://www.aifb.uni-karlsruhe.de/WBS
  • 10. Moon RDF Store subsecond querying 108 Triples [Ingenta] Denny Vrandečić – AIFB, Universität Karlsruhe (TH) 10 http://www.aifb.uni-karlsruhe.de/WBS
  • 11. Earth ~109 Triples Denny Vrandečić – AIFB, Universität Karlsruhe (TH) 11 http://www.aifb.uni-karlsruhe.de/WBS
  • 12. [LarKC proposal] Jupiter ~1010 Triples ≈ 1 triple per web-page ≈ 1 triple per web-page Denny Vrandečić – AIFB, Universität Karlsruhe (TH) 12 http://www.aifb.uni-karlsruhe.de/WBS
  • 13. ~1011 Triples Denny Vrandečić – AIFB, Universität Karlsruhe (TH) 13 http://www.aifb.uni-karlsruhe.de/WBS
  • 14. Distance Sun – Pluto ~1014 Triples Fensel / Harmelen estimate 1014 Triples Denny Vrandečić – AIFB, Universität Karlsruhe (TH) 14 http://www.aifb.uni-karlsruhe.de/WBS
  • 15. Infinitely scalable (1/2) • by giving up 100% correctness: • trading quality for size • often completeness is not needed • sometimes even correctness is not needed precision (soundness) logic A logician’s nightmare (Dieter Fensel) Semantic Web IR recall (completeness)
  • 16. Infinitely scalable (2/2) • by parallelisation: • cluster computing • wide area distribution “Thinking@home”, “self-computing semantic Web” • cloud computing? (Amazon now, Google soon?)
  • 17. “Configurable platform” “a configurable platform for infinitely scalable semantic web reasoning”
  • 18. Why “LarKC” ? • The Large Knowledge Collider A configurable platform for experimentation by others
  • 19. Why “LarKC” ? But also: and also: 1. a merry, carefree adventure. 2. innocent or good-natured mischief; a prank. 3. something extremely easy to accomplish
  • 20. The vision • The consortium • The project • The plan
  • 21. The consortium 50 people present
  • 22. The Consortium • Combining consortium competence – IR, Cognition – ML, Ontologies – Statistics, ML, Cognition,DB – Logic,DB, Probabilistic Inference – Economics, Decision Theory
  • 23. Use Case 2 Use Case 1 Database Technology RDF technology Probabilistic Inference Machine Learning human problemsolving Information Retrieval The Consortium Distributed Computing Logic Semantic Web WHO-IARC CEFRIEL Siemens Ontotext CycEur Saltlux USFD HLRS UIBK MPG WICI VUA
  • 24. The vision • The consortium • The project • The plan Oh Shit…
  • 25. The project • 10M€ budget • 3.5 years • 80 person years • 3 case studies • 14 partners • obtained in FP7 Call1: – overall < 10% funding rate – LarKC has highest funding, longest runtime
  • 26. Project Workpackages & timeline Exploitation and WP1 – Conceptual Framework & Evaluation standards WP 10: Project Management WP 9: WP 2: Retrieval WP3: Abstraction WP4: Reasoning and Selection and Learning and Deciding WP5: Collider Platform WP 8: Training, dissemination, community building WP 6: Use case: WP 7a: Use case: WP 7b: Use case: Real Time City Early Clinical Carcinogenesis Development Reference Production
  • 27. Use case:white paper Discovery FDA Drug Innovation or Stagnation (March 2004): “developers have no choice but to use the tools of the last century • Problem: pharmaceutical R&D in early clinical to assess this century's candidate solutions.” development is stagnating “industry scientists often lack cross-cutting information about an entire product area, or information about techniques that may be used in areas other than theirs” “Show me any potential liver toxicity associated with the compound’s drug class, target, structure and disease.” (Q1∩Q2∩Q3) Q1 Q2 Q3 Show me all liver toxicity “Show me all liver toxicity “Show me all liver toxicity associated with the target associated with compounds from the public literature and with similar structure” or the pathway. internal reports that are related to the drug class, disease and patient population” Genetics Chemistry LITERATURE Current NCBI: linking but no inference
  • 28. Use Case: City on-line • Our cities face many challenges • Urban Computing is the ICT way to address them • How can we redevelop existing neighborhoods Is public transportation where the people are?improve the quality of and business districts to life? Which • How can we create more choices in landmarks attract more people? housing, accommodating diverse lifestyles and all income levels? Where are people concentrating? • How can we reduce traffic congestion yet stay connected? Where is traffic moving? • How can we include citizens in planning their communities rather than limiting input to only those affected by the next project? • How can we fund schools, bridges, roads, and clean water while meeting short-term costs of increased security?
  • 29. The vision • The consortium • The project • The plan Oh Shit…
  • 30. Project Timeline • Surveys (plugins, platform) • Requirements (use cases) Prototype Internal Release Public Release Final Release 0 6 10 18 33 42 Use Cases Use Cases Use Cases V1 V2 V3
  • 31. Communication • Early Access Group • Usage Competition – “we will win if we start to loose” • We deliver: – software – publications – not “deliverables”
  • 32. And Finally…. • People are already looking at us: – “Damn... the EU is where all the cool semweb work is happening these days” – “This kind of infrastructure is exactly the kind of rocket fuel that is needed at this stage of semweb maturity.” – “The LarKC-inspired workshop on new formstiareasoning a” of l ten this the semantic web was a conference highlight for me” a re for po i Web, LarKC the possible will quickly – “With the current growth rates of RDF on then which started out as technologically ork has le w ectit alleop become operationally necessary” – “this project really jhas pro y p (potentially) in terms of both science his impact” a and “T the w • “projectsnge ch a already seeking collaboration: OKKAM, MUSING to