SlideShare a Scribd company logo
1 of 23
Download to read offline
Linked  Data-­‐‑based  
Concept  Recommendation:  
Comparison  of  Different  Methods  in  Open  
         Innovation  Scenario	
     Danica Damljanovic, Milan Stankovic,
              Philippe Laublet
Innovation
Innovation  Platforms	




Challenge:  Promote  innovation  problems  to  an  audience  of  solvers  who  
can  propose  relevant  innovative  solutions
Finding	
  Meaningful	
  
                            Connec0ons	
  
                                                                                            Kaolinite	
  
               Clay	
  mining	
                                                          extrac0on	
  from	
  
                       …	
                                                                    rocks	
  
                                                                                                …	
  


Different	
  communi-es	
  use	
  different	
  terms	
  and	
  concepts	
  to	
  speak	
  about	
  seman-cally	
  related	
  	
  
   things.	
  Such	
  “language”	
  defines	
  communi-es	
  and	
  separates	
  them.	
  Being	
  able	
  to	
  find	
  
meaningful	
  connec-ons	
  between	
  concepts	
  would	
  enable	
  us	
  to	
  build	
  bridges	
  between	
  people	
  
                                                 and	
  content.	
  




    h;p://bit.ly/hyProximity	
  
Concept	
  recommenda0on	
  
•  Concepts	
  you	
  might	
  not	
  know	
  but	
  might	
  want	
  to	
  use:	
  to	
  annotate	
  
   your	
  content,	
  to	
  search	
  for	
  content,	
  to	
  search	
  for	
  people…	
  
•  Help	
  problem	
  promoters	
  discover	
  relevant	
  concepts	
  (problem	
  
   promoters	
  some0mes	
  not	
  field	
  experts)	
  
•  Discovery	
  =	
  relevance	
  +	
  unexpectedness	
  

  h;p://bit.ly/hyProximity	
  
Discovering  Direct  and  
     Lateral  Concepts  
            	
•  HyProximity, a structure-based similarity
•  Structure-based Statistical Semantics Similarity
   Random Indexing, a well-known statistical semantics
   from Information Retrieval to RDF
Linked	
  Data-­‐based	
  Concept	
  
          Recommenda0on 	
  	
  

                                     DBPedia  
Textual                              Concepts        DBPedia  
                         Zemanta	
                                  suggestions	
 Input	
                             found  in      Exploration	
                                      the  text	




    h;p://bit.ly/hyProximity	
  
hyProximity	
  




•  We	
  start	
  from	
  several	
  seed	
  concepts	
  found	
  directly	
  in	
  the	
  text,	
  and	
  search	
  
   the	
  DBPedia	
  graph	
  
•  The	
  concepts	
  found	
  in	
  the	
  proximity	
  of	
  several	
  seed	
  concepts	
  are	
  considered	
  
   more	
  “in	
  context”	
  for	
  the	
  given	
  input	
  
•  Concepts	
  found	
  at	
  a	
  shorter	
  distance	
  from	
  the	
  seed	
  concepts	
  have	
  higher	
  
   hyProximity	
  
Different	
  Distance	
  Func0ons	
  
                                          Things in France
                                                                                  skos:broader	
  
                                                                                  other	
  property	
  
  Rivers in France                                    Products of France    Car Industry
                                Cities in France




   2	
           2	
                                            2	
     2+1	
  
Marne         Seine                       Paris              Chanel     Peugeot               BMW


  •  Hierarchical:	
  exploring	
  skos:broader	
  rela9ons	
  
  •  Transversal:	
  exploring	
  transversal	
  links	
  
  •  mixed:	
  a	
  linear	
  combina0on	
  of	
  hierarchical	
  and	
  transversal	
  	
  
    research.hypios.com/hyproximity	
  
Different	
  Distance	
  Func0ons	
  
                                          Things in France
                                                                                          skos:broader	
  
                                                                                          other	
  property	
  
  Rivers in France                                      Products of France           Car Industry
                                Cities in France



                                                   famous for
                         flows through               “fashion”	
                         competitor
                 1	
                                                    1	
      1	
  
Marne         Seine                       Paris                      Chanel     Peugeot               BMW


  •  Hierarchical:	
  exploring	
  skos:broader	
  rela0ons	
  
  •  Transversal:	
  exploring	
  transversal	
  links	
  
  •  Mixed:	
  a	
  linear	
  combina0on	
  of	
  hierarchical	
  and	
  transversal	
  	
  
    research.hypios.com/hyproximity	
  
Random  Indexing	
•  Words which appear in the similar context - with the
   same set of other words - are contextually related
   e.g. synonyms.
•  Synonyms tend not to co-occur with one another
   directly, so indirect inference is required to draw
   associations between words used to express the
   same idea
Two  steps  to  Random  
           Indexing	
•  Indexing
    o  For an RDF graph, generate virtual documents
    o  Prepare the corpus (pre-processing)
    o  Generate semantic index
•  Search - given a term X calculate a cosine similarity
   between the vector of that term and other vectors
   in the semantic space
Building  context  
                         vectors	
 Seed  length	
     d1	
 d2	
 ..	
 dp	

                                                                    =	
                             d1	
 0	
 0	
 -­‐‑1	
 1	
 -­‐‑1	
 1	

                           X	
t1	
 1	
 2	
 ..	
 0	
t2	
 3	
 0	
 ..	
 0	
        d2	
 -­‐‑1	
 1	
 0	
 0	
 1	
 -­‐‑1	
..	
 ..	
 ..	
 ..	
 ..	
tq	
 0	
 1	
        10	
                             …	
                                                                    D	
                             dp	
 0	
 1	
 0	
 -­‐‑1	
 -­‐‑1	
 1	

M	
                                    Dimensionality  =  n	

                             t1	
                             t2	
                                    T
                             …	
                             tq
Indexing:  virtual  documents	

                                                   lexicalise	
    S	
 P1	
 L1	
                                    L8	
                           S	
 P2	
 L2	
                                              L7	
            L1	
                       P10	
                       S	
 P3	
 L3	
                                             P9	
                                                                   S	
 P4	
 O1	
                   P1	
         P7	
O2	
      P8	
                                                         L6	
      S	
 P7	
 O2	
            P2	
          S	
                                          P4	
 O 	
 P 	
 L 	
     L2	
                                                          S	
       1    5    4
              P3	
              P4	
                               S	
 P4	
 O1	
 P6	
 L 	
                                                  L5	
                                 5
                                           P6	
                    S	
 P7	
 O 	
             L3	
               O1	
      P5	
                               2   P8	
 L6	
                                                  L4	
             S	
 P7	
 O2	
 P 	
 L7	
                                                                                  9
                                                                   S	
 P7	
 O2	
 P10	
L8	
Representative  subgraph  for  URI=S	
                            Virtual  document  for  
                                                                  URI=S	
                                                                                             14
Experiments	
•  26 real innovation problems from Hypios
•  Measure of success: the suggested concepts
   appear in the actual solutions (precision, recall, f-
   measure)
(+) reasonable list of concepts from real scenarios
(-) not complete:
    o  User study: measure discovery = relevance
       +unexpectedness
DBpedia  Dataset	
•  Select a number of properties relevant to the Open
   Innovation-related scenario
•  dbo:product, dbp:pruducts, dbo:industry,
   dbo:service, dbo:genre, and properties serving to
   establish a hierarchical categorization of con-
   cepts, namely dc:subject and skos:broader
Evaluation	
•  “Gold standard”
    o  Extract problem URIs
    o  Extract solution URIs
•  Baseline:
    o  Google Adwords Keyword Tool: finds similar
       topics based on their distribution in textual
       corpora and the corpora of search queries.
    o  Suggesting up to 600 concepts which are then
       used for Web crawling for finding experts.
Evaluation:  Results	

                             !
           !




                         !
           !
User  Study	
•  Suggestions being both relevant and unexpected
    o  the most valuable discoveries for the user
•  12 users
•  34 problem evaluations
   o  3060 suggested concepts/keywords.

•  For the chosen innovation problem, the evaluators
   were presented with the lists of 30 top-ranked
   suggestions generated by adWords, hyProximity
   (mixed approach) and Random Indexing.
Example
User  Study:  Results
Conclusion	
•  Linked Data valuable source of knowledge for
   concept recommendation
•  Our two methods complementary
   o  hyProximity better for precision
   o  Random Indexing better for recall

•  User study: unexpectedness higher with our
   methods than with baseline
•  Subjective user comment:
    o  Random Indexing: generic
    o  hyProximity: granular
    o  adWords: redundant
Thank  You!	
•  Find out more:
•  http://research.hypios.com/?page_id=165

Contact us:
•  Danica Damljanovic @dancheeee
•  Milan Stankovic: @milstan

More Related Content

Recently uploaded

Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...Jeffrey Haguewood
 
AXA XL - Insurer Innovation Award Americas 2024
AXA XL - Insurer Innovation Award Americas 2024AXA XL - Insurer Innovation Award Americas 2024
AXA XL - Insurer Innovation Award Americas 2024The Digital Insurer
 
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodPolkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodJuan lago vázquez
 
ICT role in 21st century education and its challenges
ICT role in 21st century education and its challengesICT role in 21st century education and its challenges
ICT role in 21st century education and its challengesrafiqahmad00786416
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes
 
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...Zilliz
 
Ransomware_Q4_2023. The report. [EN].pdf
Ransomware_Q4_2023. The report. [EN].pdfRansomware_Q4_2023. The report. [EN].pdf
Ransomware_Q4_2023. The report. [EN].pdfOverkill Security
 
Spring Boot vs Quarkus the ultimate battle - DevoxxUK
Spring Boot vs Quarkus the ultimate battle - DevoxxUKSpring Boot vs Quarkus the ultimate battle - DevoxxUK
Spring Boot vs Quarkus the ultimate battle - DevoxxUKJago de Vreede
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native ApplicationsWSO2
 
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...Angeliki Cooney
 
Manulife - Insurer Transformation Award 2024
Manulife - Insurer Transformation Award 2024Manulife - Insurer Transformation Award 2024
Manulife - Insurer Transformation Award 2024The Digital Insurer
 
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ..."I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...Zilliz
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProduct Anonymous
 
FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024The Digital Insurer
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobeapidays
 
CNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In PakistanCNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In Pakistandanishmna97
 
Cyberprint. Dark Pink Apt Group [EN].pdf
Cyberprint. Dark Pink Apt Group [EN].pdfCyberprint. Dark Pink Apt Group [EN].pdf
Cyberprint. Dark Pink Apt Group [EN].pdfOverkill Security
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherRemote DBA Services
 
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...apidays
 
Exploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with MilvusExploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with MilvusZilliz
 

Recently uploaded (20)

Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
 
AXA XL - Insurer Innovation Award Americas 2024
AXA XL - Insurer Innovation Award Americas 2024AXA XL - Insurer Innovation Award Americas 2024
AXA XL - Insurer Innovation Award Americas 2024
 
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodPolkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
 
ICT role in 21st century education and its challenges
ICT role in 21st century education and its challengesICT role in 21st century education and its challenges
ICT role in 21st century education and its challenges
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
 
Ransomware_Q4_2023. The report. [EN].pdf
Ransomware_Q4_2023. The report. [EN].pdfRansomware_Q4_2023. The report. [EN].pdf
Ransomware_Q4_2023. The report. [EN].pdf
 
Spring Boot vs Quarkus the ultimate battle - DevoxxUK
Spring Boot vs Quarkus the ultimate battle - DevoxxUKSpring Boot vs Quarkus the ultimate battle - DevoxxUK
Spring Boot vs Quarkus the ultimate battle - DevoxxUK
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
 
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
 
Manulife - Insurer Transformation Award 2024
Manulife - Insurer Transformation Award 2024Manulife - Insurer Transformation Award 2024
Manulife - Insurer Transformation Award 2024
 
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ..."I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
 
FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
 
CNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In PakistanCNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In Pakistan
 
Cyberprint. Dark Pink Apt Group [EN].pdf
Cyberprint. Dark Pink Apt Group [EN].pdfCyberprint. Dark Pink Apt Group [EN].pdf
Cyberprint. Dark Pink Apt Group [EN].pdf
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a Fresher
 
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
 
Exploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with MilvusExploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with Milvus
 

Featured

2024 State of Marketing Report – by Hubspot
2024 State of Marketing Report – by Hubspot2024 State of Marketing Report – by Hubspot
2024 State of Marketing Report – by HubspotMarius Sescu
 
Everything You Need To Know About ChatGPT
Everything You Need To Know About ChatGPTEverything You Need To Know About ChatGPT
Everything You Need To Know About ChatGPTExpeed Software
 
Product Design Trends in 2024 | Teenage Engineerings
Product Design Trends in 2024 | Teenage EngineeringsProduct Design Trends in 2024 | Teenage Engineerings
Product Design Trends in 2024 | Teenage EngineeringsPixeldarts
 
How Race, Age and Gender Shape Attitudes Towards Mental Health
How Race, Age and Gender Shape Attitudes Towards Mental HealthHow Race, Age and Gender Shape Attitudes Towards Mental Health
How Race, Age and Gender Shape Attitudes Towards Mental HealthThinkNow
 
AI Trends in Creative Operations 2024 by Artwork Flow.pdf
AI Trends in Creative Operations 2024 by Artwork Flow.pdfAI Trends in Creative Operations 2024 by Artwork Flow.pdf
AI Trends in Creative Operations 2024 by Artwork Flow.pdfmarketingartwork
 
PEPSICO Presentation to CAGNY Conference Feb 2024
PEPSICO Presentation to CAGNY Conference Feb 2024PEPSICO Presentation to CAGNY Conference Feb 2024
PEPSICO Presentation to CAGNY Conference Feb 2024Neil Kimberley
 
Content Methodology: A Best Practices Report (Webinar)
Content Methodology: A Best Practices Report (Webinar)Content Methodology: A Best Practices Report (Webinar)
Content Methodology: A Best Practices Report (Webinar)contently
 
How to Prepare For a Successful Job Search for 2024
How to Prepare For a Successful Job Search for 2024How to Prepare For a Successful Job Search for 2024
How to Prepare For a Successful Job Search for 2024Albert Qian
 
Social Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie InsightsSocial Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie InsightsKurio // The Social Media Age(ncy)
 
Trends In Paid Search: Navigating The Digital Landscape In 2024
Trends In Paid Search: Navigating The Digital Landscape In 2024Trends In Paid Search: Navigating The Digital Landscape In 2024
Trends In Paid Search: Navigating The Digital Landscape In 2024Search Engine Journal
 
5 Public speaking tips from TED - Visualized summary
5 Public speaking tips from TED - Visualized summary5 Public speaking tips from TED - Visualized summary
5 Public speaking tips from TED - Visualized summarySpeakerHub
 
ChatGPT and the Future of Work - Clark Boyd
ChatGPT and the Future of Work - Clark Boyd ChatGPT and the Future of Work - Clark Boyd
ChatGPT and the Future of Work - Clark Boyd Clark Boyd
 
Getting into the tech field. what next
Getting into the tech field. what next Getting into the tech field. what next
Getting into the tech field. what next Tessa Mero
 
Google's Just Not That Into You: Understanding Core Updates & Search Intent
Google's Just Not That Into You: Understanding Core Updates & Search IntentGoogle's Just Not That Into You: Understanding Core Updates & Search Intent
Google's Just Not That Into You: Understanding Core Updates & Search IntentLily Ray
 
Time Management & Productivity - Best Practices
Time Management & Productivity -  Best PracticesTime Management & Productivity -  Best Practices
Time Management & Productivity - Best PracticesVit Horky
 
The six step guide to practical project management
The six step guide to practical project managementThe six step guide to practical project management
The six step guide to practical project managementMindGenius
 
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...RachelPearson36
 

Featured (20)

2024 State of Marketing Report – by Hubspot
2024 State of Marketing Report – by Hubspot2024 State of Marketing Report – by Hubspot
2024 State of Marketing Report – by Hubspot
 
Everything You Need To Know About ChatGPT
Everything You Need To Know About ChatGPTEverything You Need To Know About ChatGPT
Everything You Need To Know About ChatGPT
 
Product Design Trends in 2024 | Teenage Engineerings
Product Design Trends in 2024 | Teenage EngineeringsProduct Design Trends in 2024 | Teenage Engineerings
Product Design Trends in 2024 | Teenage Engineerings
 
How Race, Age and Gender Shape Attitudes Towards Mental Health
How Race, Age and Gender Shape Attitudes Towards Mental HealthHow Race, Age and Gender Shape Attitudes Towards Mental Health
How Race, Age and Gender Shape Attitudes Towards Mental Health
 
AI Trends in Creative Operations 2024 by Artwork Flow.pdf
AI Trends in Creative Operations 2024 by Artwork Flow.pdfAI Trends in Creative Operations 2024 by Artwork Flow.pdf
AI Trends in Creative Operations 2024 by Artwork Flow.pdf
 
Skeleton Culture Code
Skeleton Culture CodeSkeleton Culture Code
Skeleton Culture Code
 
PEPSICO Presentation to CAGNY Conference Feb 2024
PEPSICO Presentation to CAGNY Conference Feb 2024PEPSICO Presentation to CAGNY Conference Feb 2024
PEPSICO Presentation to CAGNY Conference Feb 2024
 
Content Methodology: A Best Practices Report (Webinar)
Content Methodology: A Best Practices Report (Webinar)Content Methodology: A Best Practices Report (Webinar)
Content Methodology: A Best Practices Report (Webinar)
 
How to Prepare For a Successful Job Search for 2024
How to Prepare For a Successful Job Search for 2024How to Prepare For a Successful Job Search for 2024
How to Prepare For a Successful Job Search for 2024
 
Social Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie InsightsSocial Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie Insights
 
Trends In Paid Search: Navigating The Digital Landscape In 2024
Trends In Paid Search: Navigating The Digital Landscape In 2024Trends In Paid Search: Navigating The Digital Landscape In 2024
Trends In Paid Search: Navigating The Digital Landscape In 2024
 
5 Public speaking tips from TED - Visualized summary
5 Public speaking tips from TED - Visualized summary5 Public speaking tips from TED - Visualized summary
5 Public speaking tips from TED - Visualized summary
 
ChatGPT and the Future of Work - Clark Boyd
ChatGPT and the Future of Work - Clark Boyd ChatGPT and the Future of Work - Clark Boyd
ChatGPT and the Future of Work - Clark Boyd
 
Getting into the tech field. what next
Getting into the tech field. what next Getting into the tech field. what next
Getting into the tech field. what next
 
Google's Just Not That Into You: Understanding Core Updates & Search Intent
Google's Just Not That Into You: Understanding Core Updates & Search IntentGoogle's Just Not That Into You: Understanding Core Updates & Search Intent
Google's Just Not That Into You: Understanding Core Updates & Search Intent
 
How to have difficult conversations
How to have difficult conversations How to have difficult conversations
How to have difficult conversations
 
Introduction to Data Science
Introduction to Data ScienceIntroduction to Data Science
Introduction to Data Science
 
Time Management & Productivity - Best Practices
Time Management & Productivity -  Best PracticesTime Management & Productivity -  Best Practices
Time Management & Productivity - Best Practices
 
The six step guide to practical project management
The six step guide to practical project managementThe six step guide to practical project management
The six step guide to practical project management
 
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
 

Linked Data-based Concept Recommendation: Comparison of Different Methods in Open Innovation Scenario

  • 1. Linked  Data-­‐‑based   Concept  Recommendation:   Comparison  of  Different  Methods  in  Open   Innovation  Scenario Danica Damljanovic, Milan Stankovic, Philippe Laublet
  • 3. Innovation  Platforms Challenge:  Promote  innovation  problems  to  an  audience  of  solvers  who   can  propose  relevant  innovative  solutions
  • 4. Finding  Meaningful   Connec0ons   Kaolinite   Clay  mining   extrac0on  from   …   rocks   …   Different  communi-es  use  different  terms  and  concepts  to  speak  about  seman-cally  related     things.  Such  “language”  defines  communi-es  and  separates  them.  Being  able  to  find   meaningful  connec-ons  between  concepts  would  enable  us  to  build  bridges  between  people   and  content.   h;p://bit.ly/hyProximity  
  • 5. Concept  recommenda0on   •  Concepts  you  might  not  know  but  might  want  to  use:  to  annotate   your  content,  to  search  for  content,  to  search  for  people…   •  Help  problem  promoters  discover  relevant  concepts  (problem   promoters  some0mes  not  field  experts)   •  Discovery  =  relevance  +  unexpectedness   h;p://bit.ly/hyProximity  
  • 6. Discovering  Direct  and   Lateral  Concepts   •  HyProximity, a structure-based similarity •  Structure-based Statistical Semantics Similarity Random Indexing, a well-known statistical semantics from Information Retrieval to RDF
  • 7. Linked  Data-­‐based  Concept   Recommenda0on     DBPedia   Textual   Concepts   DBPedia   Zemanta suggestions Input found  in   Exploration the  text h;p://bit.ly/hyProximity  
  • 8. hyProximity   •  We  start  from  several  seed  concepts  found  directly  in  the  text,  and  search   the  DBPedia  graph   •  The  concepts  found  in  the  proximity  of  several  seed  concepts  are  considered   more  “in  context”  for  the  given  input   •  Concepts  found  at  a  shorter  distance  from  the  seed  concepts  have  higher   hyProximity  
  • 9. Different  Distance  Func0ons   Things in France skos:broader   other  property   Rivers in France Products of France Car Industry Cities in France 2   2   2   2+1   Marne Seine Paris Chanel Peugeot BMW •  Hierarchical:  exploring  skos:broader  rela9ons   •  Transversal:  exploring  transversal  links   •  mixed:  a  linear  combina0on  of  hierarchical  and  transversal     research.hypios.com/hyproximity  
  • 10. Different  Distance  Func0ons   Things in France skos:broader   other  property   Rivers in France Products of France Car Industry Cities in France famous for flows through “fashion”   competitor 1   1   1   Marne Seine Paris Chanel Peugeot BMW •  Hierarchical:  exploring  skos:broader  rela0ons   •  Transversal:  exploring  transversal  links   •  Mixed:  a  linear  combina0on  of  hierarchical  and  transversal     research.hypios.com/hyproximity  
  • 11. Random  Indexing •  Words which appear in the similar context - with the same set of other words - are contextually related e.g. synonyms. •  Synonyms tend not to co-occur with one another directly, so indirect inference is required to draw associations between words used to express the same idea
  • 12. Two  steps  to  Random   Indexing •  Indexing o  For an RDF graph, generate virtual documents o  Prepare the corpus (pre-processing) o  Generate semantic index •  Search - given a term X calculate a cosine similarity between the vector of that term and other vectors in the semantic space
  • 13. Building  context    vectors Seed  length d1 d2 .. dp = d1 0 0 -­‐‑1 1 -­‐‑1 1 X t1 1 2 .. 0 t2 3 0 .. 0 d2 -­‐‑1 1 0 0 1 -­‐‑1 .. .. .. .. .. tq 0 1 10 … D dp 0 1 0 -­‐‑1 -­‐‑1 1 M Dimensionality  =  n t1 t2 T … tq
  • 14. Indexing:  virtual  documents lexicalise S P1 L1 L8 S P2 L2 L7 L1 P10 S P3 L3 P9 S P4 O1 P1 P7 O2 P8 L6 S P7 O2 P2 S P4 O P L L2 S 1 5 4 P3 P4 S P4 O1 P6 L L5 5 P6 S P7 O L3 O1 P5 2 P8 L6 L4 S P7 O2 P L7 9 S P7 O2 P10 L8 Representative  subgraph  for  URI=S Virtual  document  for   URI=S 14
  • 15. Experiments •  26 real innovation problems from Hypios •  Measure of success: the suggested concepts appear in the actual solutions (precision, recall, f- measure) (+) reasonable list of concepts from real scenarios (-) not complete: o  User study: measure discovery = relevance +unexpectedness
  • 16. DBpedia  Dataset •  Select a number of properties relevant to the Open Innovation-related scenario •  dbo:product, dbp:pruducts, dbo:industry, dbo:service, dbo:genre, and properties serving to establish a hierarchical categorization of con- cepts, namely dc:subject and skos:broader
  • 17. Evaluation •  “Gold standard” o  Extract problem URIs o  Extract solution URIs •  Baseline: o  Google Adwords Keyword Tool: finds similar topics based on their distribution in textual corpora and the corpora of search queries. o  Suggesting up to 600 concepts which are then used for Web crawling for finding experts.
  • 19. User  Study •  Suggestions being both relevant and unexpected o  the most valuable discoveries for the user •  12 users •  34 problem evaluations o  3060 suggested concepts/keywords. •  For the chosen innovation problem, the evaluators were presented with the lists of 30 top-ranked suggestions generated by adWords, hyProximity (mixed approach) and Random Indexing.
  • 22. Conclusion •  Linked Data valuable source of knowledge for concept recommendation •  Our two methods complementary o  hyProximity better for precision o  Random Indexing better for recall •  User study: unexpectedness higher with our methods than with baseline •  Subjective user comment: o  Random Indexing: generic o  hyProximity: granular o  adWords: redundant
  • 23. Thank  You! •  Find out more: •  http://research.hypios.com/?page_id=165 Contact us: •  Danica Damljanovic @dancheeee •  Milan Stankovic: @milstan