SlideShare a Scribd company logo
Linked  Data-­‐‑based  
Concept  Recommendation:  
Comparison  of  Different  Methods  in  Open  
         Innovation  Scenario	
     Danica Damljanovic, Milan Stankovic,
              Philippe Laublet
Innovation
Innovation  Platforms	




Challenge:  Promote  innovation  problems  to  an  audience  of  solvers  who  
can  propose  relevant  innovative  solutions
Finding	
  Meaningful	
  
                            Connec0ons	
  
                                                                                            Kaolinite	
  
               Clay	
  mining	
                                                          extrac0on	
  from	
  
                       …	
                                                                    rocks	
  
                                                                                                …	
  


Different	
  communi-es	
  use	
  different	
  terms	
  and	
  concepts	
  to	
  speak	
  about	
  seman-cally	
  related	
  	
  
   things.	
  Such	
  “language”	
  defines	
  communi-es	
  and	
  separates	
  them.	
  Being	
  able	
  to	
  find	
  
meaningful	
  connec-ons	
  between	
  concepts	
  would	
  enable	
  us	
  to	
  build	
  bridges	
  between	
  people	
  
                                                 and	
  content.	
  




    h;p://bit.ly/hyProximity	
  
Concept	
  recommenda0on	
  
•  Concepts	
  you	
  might	
  not	
  know	
  but	
  might	
  want	
  to	
  use:	
  to	
  annotate	
  
   your	
  content,	
  to	
  search	
  for	
  content,	
  to	
  search	
  for	
  people…	
  
•  Help	
  problem	
  promoters	
  discover	
  relevant	
  concepts	
  (problem	
  
   promoters	
  some0mes	
  not	
  field	
  experts)	
  
•  Discovery	
  =	
  relevance	
  +	
  unexpectedness	
  

  h;p://bit.ly/hyProximity	
  
Discovering  Direct  and  
     Lateral  Concepts  
            	
•  HyProximity, a structure-based similarity
•  Structure-based Statistical Semantics Similarity
   Random Indexing, a well-known statistical semantics
   from Information Retrieval to RDF
Linked	
  Data-­‐based	
  Concept	
  
          Recommenda0on 	
  	
  

                                     DBPedia  
Textual                              Concepts        DBPedia  
                         Zemanta	
                                  suggestions	
 Input	
                             found  in      Exploration	
                                      the  text	




    h;p://bit.ly/hyProximity	
  
hyProximity	
  




•  We	
  start	
  from	
  several	
  seed	
  concepts	
  found	
  directly	
  in	
  the	
  text,	
  and	
  search	
  
   the	
  DBPedia	
  graph	
  
•  The	
  concepts	
  found	
  in	
  the	
  proximity	
  of	
  several	
  seed	
  concepts	
  are	
  considered	
  
   more	
  “in	
  context”	
  for	
  the	
  given	
  input	
  
•  Concepts	
  found	
  at	
  a	
  shorter	
  distance	
  from	
  the	
  seed	
  concepts	
  have	
  higher	
  
   hyProximity	
  
Different	
  Distance	
  Func0ons	
  
                                          Things in France
                                                                                  skos:broader	
  
                                                                                  other	
  property	
  
  Rivers in France                                    Products of France    Car Industry
                                Cities in France




   2	
           2	
                                            2	
     2+1	
  
Marne         Seine                       Paris              Chanel     Peugeot               BMW


  •  Hierarchical:	
  exploring	
  skos:broader	
  rela9ons	
  
  •  Transversal:	
  exploring	
  transversal	
  links	
  
  •  mixed:	
  a	
  linear	
  combina0on	
  of	
  hierarchical	
  and	
  transversal	
  	
  
    research.hypios.com/hyproximity	
  
Different	
  Distance	
  Func0ons	
  
                                          Things in France
                                                                                          skos:broader	
  
                                                                                          other	
  property	
  
  Rivers in France                                      Products of France           Car Industry
                                Cities in France



                                                   famous for
                         flows through               “fashion”	
                         competitor
                 1	
                                                    1	
      1	
  
Marne         Seine                       Paris                      Chanel     Peugeot               BMW


  •  Hierarchical:	
  exploring	
  skos:broader	
  rela0ons	
  
  •  Transversal:	
  exploring	
  transversal	
  links	
  
  •  Mixed:	
  a	
  linear	
  combina0on	
  of	
  hierarchical	
  and	
  transversal	
  	
  
    research.hypios.com/hyproximity	
  
Random  Indexing	
•  Words which appear in the similar context - with the
   same set of other words - are contextually related
   e.g. synonyms.
•  Synonyms tend not to co-occur with one another
   directly, so indirect inference is required to draw
   associations between words used to express the
   same idea
Two  steps  to  Random  
           Indexing	
•  Indexing
    o  For an RDF graph, generate virtual documents
    o  Prepare the corpus (pre-processing)
    o  Generate semantic index
•  Search - given a term X calculate a cosine similarity
   between the vector of that term and other vectors
   in the semantic space
Building  context  
                         vectors	
 Seed  length	
     d1	
 d2	
 ..	
 dp	

                                                                    =	
                             d1	
 0	
 0	
 -­‐‑1	
 1	
 -­‐‑1	
 1	

                           X	
t1	
 1	
 2	
 ..	
 0	
t2	
 3	
 0	
 ..	
 0	
        d2	
 -­‐‑1	
 1	
 0	
 0	
 1	
 -­‐‑1	
..	
 ..	
 ..	
 ..	
 ..	
tq	
 0	
 1	
        10	
                             …	
                                                                    D	
                             dp	
 0	
 1	
 0	
 -­‐‑1	
 -­‐‑1	
 1	

M	
                                    Dimensionality  =  n	

                             t1	
                             t2	
                                    T
                             …	
                             tq
Indexing:  virtual  documents	

                                                   lexicalise	
    S	
 P1	
 L1	
                                    L8	
                           S	
 P2	
 L2	
                                              L7	
            L1	
                       P10	
                       S	
 P3	
 L3	
                                             P9	
                                                                   S	
 P4	
 O1	
                   P1	
         P7	
O2	
      P8	
                                                         L6	
      S	
 P7	
 O2	
            P2	
          S	
                                          P4	
 O 	
 P 	
 L 	
     L2	
                                                          S	
       1    5    4
              P3	
              P4	
                               S	
 P4	
 O1	
 P6	
 L 	
                                                  L5	
                                 5
                                           P6	
                    S	
 P7	
 O 	
             L3	
               O1	
      P5	
                               2   P8	
 L6	
                                                  L4	
             S	
 P7	
 O2	
 P 	
 L7	
                                                                                  9
                                                                   S	
 P7	
 O2	
 P10	
L8	
Representative  subgraph  for  URI=S	
                            Virtual  document  for  
                                                                  URI=S	
                                                                                             14
Experiments	
•  26 real innovation problems from Hypios
•  Measure of success: the suggested concepts
   appear in the actual solutions (precision, recall, f-
   measure)
(+) reasonable list of concepts from real scenarios
(-) not complete:
    o  User study: measure discovery = relevance
       +unexpectedness
DBpedia  Dataset	
•  Select a number of properties relevant to the Open
   Innovation-related scenario
•  dbo:product, dbp:pruducts, dbo:industry,
   dbo:service, dbo:genre, and properties serving to
   establish a hierarchical categorization of con-
   cepts, namely dc:subject and skos:broader
Evaluation	
•  “Gold standard”
    o  Extract problem URIs
    o  Extract solution URIs
•  Baseline:
    o  Google Adwords Keyword Tool: finds similar
       topics based on their distribution in textual
       corpora and the corpora of search queries.
    o  Suggesting up to 600 concepts which are then
       used for Web crawling for finding experts.
Evaluation:  Results	

                             !
           !




                         !
           !
User  Study	
•  Suggestions being both relevant and unexpected
    o  the most valuable discoveries for the user
•  12 users
•  34 problem evaluations
   o  3060 suggested concepts/keywords.

•  For the chosen innovation problem, the evaluators
   were presented with the lists of 30 top-ranked
   suggestions generated by adWords, hyProximity
   (mixed approach) and Random Indexing.
Example
User  Study:  Results
Conclusion	
•  Linked Data valuable source of knowledge for
   concept recommendation
•  Our two methods complementary
   o  hyProximity better for precision
   o  Random Indexing better for recall

•  User study: unexpectedness higher with our
   methods than with baseline
•  Subjective user comment:
    o  Random Indexing: generic
    o  hyProximity: granular
    o  adWords: redundant
Thank  You!	
•  Find out more:
•  http://research.hypios.com/?page_id=165

Contact us:
•  Danica Damljanovic @dancheeee
•  Milan Stankovic: @milstan

More Related Content

Recently uploaded

AI in the Workplace Reskilling, Upskilling, and Future Work.pptx
AI in the Workplace Reskilling, Upskilling, and Future Work.pptxAI in the Workplace Reskilling, Upskilling, and Future Work.pptx
AI in the Workplace Reskilling, Upskilling, and Future Work.pptx
Sunil Jagani
 
Session 1 - Intro to Robotic Process Automation.pdf
Session 1 - Intro to Robotic Process Automation.pdfSession 1 - Intro to Robotic Process Automation.pdf
Session 1 - Intro to Robotic Process Automation.pdf
UiPathCommunity
 
[OReilly Superstream] Occupy the Space: A grassroots guide to engineering (an...
[OReilly Superstream] Occupy the Space: A grassroots guide to engineering (an...[OReilly Superstream] Occupy the Space: A grassroots guide to engineering (an...
[OReilly Superstream] Occupy the Space: A grassroots guide to engineering (an...
Jason Yip
 
QA or the Highway - Component Testing: Bridging the gap between frontend appl...
QA or the Highway - Component Testing: Bridging the gap between frontend appl...QA or the Highway - Component Testing: Bridging the gap between frontend appl...
QA or the Highway - Component Testing: Bridging the gap between frontend appl...
zjhamm304
 
The Microsoft 365 Migration Tutorial For Beginner.pptx
The Microsoft 365 Migration Tutorial For Beginner.pptxThe Microsoft 365 Migration Tutorial For Beginner.pptx
The Microsoft 365 Migration Tutorial For Beginner.pptx
operationspcvita
 
Getting the Most Out of ScyllaDB Monitoring: ShareChat's Tips
Getting the Most Out of ScyllaDB Monitoring: ShareChat's TipsGetting the Most Out of ScyllaDB Monitoring: ShareChat's Tips
Getting the Most Out of ScyllaDB Monitoring: ShareChat's Tips
ScyllaDB
 
"Scaling RAG Applications to serve millions of users", Kevin Goedecke
"Scaling RAG Applications to serve millions of users",  Kevin Goedecke"Scaling RAG Applications to serve millions of users",  Kevin Goedecke
"Scaling RAG Applications to serve millions of users", Kevin Goedecke
Fwdays
 
ScyllaDB Tablets: Rethinking Replication
ScyllaDB Tablets: Rethinking ReplicationScyllaDB Tablets: Rethinking Replication
ScyllaDB Tablets: Rethinking Replication
ScyllaDB
 
"$10 thousand per minute of downtime: architecture, queues, streaming and fin...
"$10 thousand per minute of downtime: architecture, queues, streaming and fin..."$10 thousand per minute of downtime: architecture, queues, streaming and fin...
"$10 thousand per minute of downtime: architecture, queues, streaming and fin...
Fwdays
 
What is an RPA CoE? Session 2 – CoE Roles
What is an RPA CoE?  Session 2 – CoE RolesWhat is an RPA CoE?  Session 2 – CoE Roles
What is an RPA CoE? Session 2 – CoE Roles
DianaGray10
 
Northern Engraving | Nameplate Manufacturing Process - 2024
Northern Engraving | Nameplate Manufacturing Process - 2024Northern Engraving | Nameplate Manufacturing Process - 2024
Northern Engraving | Nameplate Manufacturing Process - 2024
Northern Engraving
 
Astute Business Solutions | Oracle Cloud Partner |
Astute Business Solutions | Oracle Cloud Partner |Astute Business Solutions | Oracle Cloud Partner |
Astute Business Solutions | Oracle Cloud Partner |
AstuteBusiness
 
Y-Combinator seed pitch deck template PP
Y-Combinator seed pitch deck template PPY-Combinator seed pitch deck template PP
Y-Combinator seed pitch deck template PP
c5vrf27qcz
 
Containers & AI - Beauty and the Beast!?!
Containers & AI - Beauty and the Beast!?!Containers & AI - Beauty and the Beast!?!
Containers & AI - Beauty and the Beast!?!
Tobias Schneck
 
LF Energy Webinar: Carbon Data Specifications: Mechanisms to Improve Data Acc...
LF Energy Webinar: Carbon Data Specifications: Mechanisms to Improve Data Acc...LF Energy Webinar: Carbon Data Specifications: Mechanisms to Improve Data Acc...
LF Energy Webinar: Carbon Data Specifications: Mechanisms to Improve Data Acc...
DanBrown980551
 
AppSec PNW: Android and iOS Application Security with MobSF
AppSec PNW: Android and iOS Application Security with MobSFAppSec PNW: Android and iOS Application Security with MobSF
AppSec PNW: Android and iOS Application Security with MobSF
Ajin Abraham
 
Lee Barnes - Path to Becoming an Effective Test Automation Engineer.pdf
Lee Barnes - Path to Becoming an Effective Test Automation Engineer.pdfLee Barnes - Path to Becoming an Effective Test Automation Engineer.pdf
Lee Barnes - Path to Becoming an Effective Test Automation Engineer.pdf
leebarnesutopia
 
Christine's Supplier Sourcing Presentaion.pptx
Christine's Supplier Sourcing Presentaion.pptxChristine's Supplier Sourcing Presentaion.pptx
Christine's Supplier Sourcing Presentaion.pptx
christinelarrosa
 
Essentials of Automations: Exploring Attributes & Automation Parameters
Essentials of Automations: Exploring Attributes & Automation ParametersEssentials of Automations: Exploring Attributes & Automation Parameters
Essentials of Automations: Exploring Attributes & Automation Parameters
Safe Software
 
Introducing BoxLang : A new JVM language for productivity and modularity!
Introducing BoxLang : A new JVM language for productivity and modularity!Introducing BoxLang : A new JVM language for productivity and modularity!
Introducing BoxLang : A new JVM language for productivity and modularity!
Ortus Solutions, Corp
 

Recently uploaded (20)

AI in the Workplace Reskilling, Upskilling, and Future Work.pptx
AI in the Workplace Reskilling, Upskilling, and Future Work.pptxAI in the Workplace Reskilling, Upskilling, and Future Work.pptx
AI in the Workplace Reskilling, Upskilling, and Future Work.pptx
 
Session 1 - Intro to Robotic Process Automation.pdf
Session 1 - Intro to Robotic Process Automation.pdfSession 1 - Intro to Robotic Process Automation.pdf
Session 1 - Intro to Robotic Process Automation.pdf
 
[OReilly Superstream] Occupy the Space: A grassroots guide to engineering (an...
[OReilly Superstream] Occupy the Space: A grassroots guide to engineering (an...[OReilly Superstream] Occupy the Space: A grassroots guide to engineering (an...
[OReilly Superstream] Occupy the Space: A grassroots guide to engineering (an...
 
QA or the Highway - Component Testing: Bridging the gap between frontend appl...
QA or the Highway - Component Testing: Bridging the gap between frontend appl...QA or the Highway - Component Testing: Bridging the gap between frontend appl...
QA or the Highway - Component Testing: Bridging the gap between frontend appl...
 
The Microsoft 365 Migration Tutorial For Beginner.pptx
The Microsoft 365 Migration Tutorial For Beginner.pptxThe Microsoft 365 Migration Tutorial For Beginner.pptx
The Microsoft 365 Migration Tutorial For Beginner.pptx
 
Getting the Most Out of ScyllaDB Monitoring: ShareChat's Tips
Getting the Most Out of ScyllaDB Monitoring: ShareChat's TipsGetting the Most Out of ScyllaDB Monitoring: ShareChat's Tips
Getting the Most Out of ScyllaDB Monitoring: ShareChat's Tips
 
"Scaling RAG Applications to serve millions of users", Kevin Goedecke
"Scaling RAG Applications to serve millions of users",  Kevin Goedecke"Scaling RAG Applications to serve millions of users",  Kevin Goedecke
"Scaling RAG Applications to serve millions of users", Kevin Goedecke
 
ScyllaDB Tablets: Rethinking Replication
ScyllaDB Tablets: Rethinking ReplicationScyllaDB Tablets: Rethinking Replication
ScyllaDB Tablets: Rethinking Replication
 
"$10 thousand per minute of downtime: architecture, queues, streaming and fin...
"$10 thousand per minute of downtime: architecture, queues, streaming and fin..."$10 thousand per minute of downtime: architecture, queues, streaming and fin...
"$10 thousand per minute of downtime: architecture, queues, streaming and fin...
 
What is an RPA CoE? Session 2 – CoE Roles
What is an RPA CoE?  Session 2 – CoE RolesWhat is an RPA CoE?  Session 2 – CoE Roles
What is an RPA CoE? Session 2 – CoE Roles
 
Northern Engraving | Nameplate Manufacturing Process - 2024
Northern Engraving | Nameplate Manufacturing Process - 2024Northern Engraving | Nameplate Manufacturing Process - 2024
Northern Engraving | Nameplate Manufacturing Process - 2024
 
Astute Business Solutions | Oracle Cloud Partner |
Astute Business Solutions | Oracle Cloud Partner |Astute Business Solutions | Oracle Cloud Partner |
Astute Business Solutions | Oracle Cloud Partner |
 
Y-Combinator seed pitch deck template PP
Y-Combinator seed pitch deck template PPY-Combinator seed pitch deck template PP
Y-Combinator seed pitch deck template PP
 
Containers & AI - Beauty and the Beast!?!
Containers & AI - Beauty and the Beast!?!Containers & AI - Beauty and the Beast!?!
Containers & AI - Beauty and the Beast!?!
 
LF Energy Webinar: Carbon Data Specifications: Mechanisms to Improve Data Acc...
LF Energy Webinar: Carbon Data Specifications: Mechanisms to Improve Data Acc...LF Energy Webinar: Carbon Data Specifications: Mechanisms to Improve Data Acc...
LF Energy Webinar: Carbon Data Specifications: Mechanisms to Improve Data Acc...
 
AppSec PNW: Android and iOS Application Security with MobSF
AppSec PNW: Android and iOS Application Security with MobSFAppSec PNW: Android and iOS Application Security with MobSF
AppSec PNW: Android and iOS Application Security with MobSF
 
Lee Barnes - Path to Becoming an Effective Test Automation Engineer.pdf
Lee Barnes - Path to Becoming an Effective Test Automation Engineer.pdfLee Barnes - Path to Becoming an Effective Test Automation Engineer.pdf
Lee Barnes - Path to Becoming an Effective Test Automation Engineer.pdf
 
Christine's Supplier Sourcing Presentaion.pptx
Christine's Supplier Sourcing Presentaion.pptxChristine's Supplier Sourcing Presentaion.pptx
Christine's Supplier Sourcing Presentaion.pptx
 
Essentials of Automations: Exploring Attributes & Automation Parameters
Essentials of Automations: Exploring Attributes & Automation ParametersEssentials of Automations: Exploring Attributes & Automation Parameters
Essentials of Automations: Exploring Attributes & Automation Parameters
 
Introducing BoxLang : A new JVM language for productivity and modularity!
Introducing BoxLang : A new JVM language for productivity and modularity!Introducing BoxLang : A new JVM language for productivity and modularity!
Introducing BoxLang : A new JVM language for productivity and modularity!
 

Featured

2024 State of Marketing Report – by Hubspot
2024 State of Marketing Report – by Hubspot2024 State of Marketing Report – by Hubspot
2024 State of Marketing Report – by Hubspot
Marius Sescu
 
Everything You Need To Know About ChatGPT
Everything You Need To Know About ChatGPTEverything You Need To Know About ChatGPT
Everything You Need To Know About ChatGPT
Expeed Software
 
Product Design Trends in 2024 | Teenage Engineerings
Product Design Trends in 2024 | Teenage EngineeringsProduct Design Trends in 2024 | Teenage Engineerings
Product Design Trends in 2024 | Teenage Engineerings
Pixeldarts
 
How Race, Age and Gender Shape Attitudes Towards Mental Health
How Race, Age and Gender Shape Attitudes Towards Mental HealthHow Race, Age and Gender Shape Attitudes Towards Mental Health
How Race, Age and Gender Shape Attitudes Towards Mental Health
ThinkNow
 
AI Trends in Creative Operations 2024 by Artwork Flow.pdf
AI Trends in Creative Operations 2024 by Artwork Flow.pdfAI Trends in Creative Operations 2024 by Artwork Flow.pdf
AI Trends in Creative Operations 2024 by Artwork Flow.pdf
marketingartwork
 
Skeleton Culture Code
Skeleton Culture CodeSkeleton Culture Code
Skeleton Culture Code
Skeleton Technologies
 
PEPSICO Presentation to CAGNY Conference Feb 2024
PEPSICO Presentation to CAGNY Conference Feb 2024PEPSICO Presentation to CAGNY Conference Feb 2024
PEPSICO Presentation to CAGNY Conference Feb 2024
Neil Kimberley
 
Content Methodology: A Best Practices Report (Webinar)
Content Methodology: A Best Practices Report (Webinar)Content Methodology: A Best Practices Report (Webinar)
Content Methodology: A Best Practices Report (Webinar)
contently
 
How to Prepare For a Successful Job Search for 2024
How to Prepare For a Successful Job Search for 2024How to Prepare For a Successful Job Search for 2024
How to Prepare For a Successful Job Search for 2024
Albert Qian
 
Social Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie InsightsSocial Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie Insights
Kurio // The Social Media Age(ncy)
 
Trends In Paid Search: Navigating The Digital Landscape In 2024
Trends In Paid Search: Navigating The Digital Landscape In 2024Trends In Paid Search: Navigating The Digital Landscape In 2024
Trends In Paid Search: Navigating The Digital Landscape In 2024
Search Engine Journal
 
5 Public speaking tips from TED - Visualized summary
5 Public speaking tips from TED - Visualized summary5 Public speaking tips from TED - Visualized summary
5 Public speaking tips from TED - Visualized summary
SpeakerHub
 
ChatGPT and the Future of Work - Clark Boyd
ChatGPT and the Future of Work - Clark Boyd ChatGPT and the Future of Work - Clark Boyd
ChatGPT and the Future of Work - Clark Boyd
Clark Boyd
 
Getting into the tech field. what next
Getting into the tech field. what next Getting into the tech field. what next
Getting into the tech field. what next
Tessa Mero
 
Google's Just Not That Into You: Understanding Core Updates & Search Intent
Google's Just Not That Into You: Understanding Core Updates & Search IntentGoogle's Just Not That Into You: Understanding Core Updates & Search Intent
Google's Just Not That Into You: Understanding Core Updates & Search Intent
Lily Ray
 
How to have difficult conversations
How to have difficult conversations How to have difficult conversations
How to have difficult conversations
Rajiv Jayarajah, MAppComm, ACC
 
Introduction to Data Science
Introduction to Data ScienceIntroduction to Data Science
Introduction to Data Science
Christy Abraham Joy
 
Time Management & Productivity - Best Practices
Time Management & Productivity -  Best PracticesTime Management & Productivity -  Best Practices
Time Management & Productivity - Best Practices
Vit Horky
 
The six step guide to practical project management
The six step guide to practical project managementThe six step guide to practical project management
The six step guide to practical project management
MindGenius
 
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
RachelPearson36
 

Featured (20)

2024 State of Marketing Report – by Hubspot
2024 State of Marketing Report – by Hubspot2024 State of Marketing Report – by Hubspot
2024 State of Marketing Report – by Hubspot
 
Everything You Need To Know About ChatGPT
Everything You Need To Know About ChatGPTEverything You Need To Know About ChatGPT
Everything You Need To Know About ChatGPT
 
Product Design Trends in 2024 | Teenage Engineerings
Product Design Trends in 2024 | Teenage EngineeringsProduct Design Trends in 2024 | Teenage Engineerings
Product Design Trends in 2024 | Teenage Engineerings
 
How Race, Age and Gender Shape Attitudes Towards Mental Health
How Race, Age and Gender Shape Attitudes Towards Mental HealthHow Race, Age and Gender Shape Attitudes Towards Mental Health
How Race, Age and Gender Shape Attitudes Towards Mental Health
 
AI Trends in Creative Operations 2024 by Artwork Flow.pdf
AI Trends in Creative Operations 2024 by Artwork Flow.pdfAI Trends in Creative Operations 2024 by Artwork Flow.pdf
AI Trends in Creative Operations 2024 by Artwork Flow.pdf
 
Skeleton Culture Code
Skeleton Culture CodeSkeleton Culture Code
Skeleton Culture Code
 
PEPSICO Presentation to CAGNY Conference Feb 2024
PEPSICO Presentation to CAGNY Conference Feb 2024PEPSICO Presentation to CAGNY Conference Feb 2024
PEPSICO Presentation to CAGNY Conference Feb 2024
 
Content Methodology: A Best Practices Report (Webinar)
Content Methodology: A Best Practices Report (Webinar)Content Methodology: A Best Practices Report (Webinar)
Content Methodology: A Best Practices Report (Webinar)
 
How to Prepare For a Successful Job Search for 2024
How to Prepare For a Successful Job Search for 2024How to Prepare For a Successful Job Search for 2024
How to Prepare For a Successful Job Search for 2024
 
Social Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie InsightsSocial Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie Insights
 
Trends In Paid Search: Navigating The Digital Landscape In 2024
Trends In Paid Search: Navigating The Digital Landscape In 2024Trends In Paid Search: Navigating The Digital Landscape In 2024
Trends In Paid Search: Navigating The Digital Landscape In 2024
 
5 Public speaking tips from TED - Visualized summary
5 Public speaking tips from TED - Visualized summary5 Public speaking tips from TED - Visualized summary
5 Public speaking tips from TED - Visualized summary
 
ChatGPT and the Future of Work - Clark Boyd
ChatGPT and the Future of Work - Clark Boyd ChatGPT and the Future of Work - Clark Boyd
ChatGPT and the Future of Work - Clark Boyd
 
Getting into the tech field. what next
Getting into the tech field. what next Getting into the tech field. what next
Getting into the tech field. what next
 
Google's Just Not That Into You: Understanding Core Updates & Search Intent
Google's Just Not That Into You: Understanding Core Updates & Search IntentGoogle's Just Not That Into You: Understanding Core Updates & Search Intent
Google's Just Not That Into You: Understanding Core Updates & Search Intent
 
How to have difficult conversations
How to have difficult conversations How to have difficult conversations
How to have difficult conversations
 
Introduction to Data Science
Introduction to Data ScienceIntroduction to Data Science
Introduction to Data Science
 
Time Management & Productivity - Best Practices
Time Management & Productivity -  Best PracticesTime Management & Productivity -  Best Practices
Time Management & Productivity - Best Practices
 
The six step guide to practical project management
The six step guide to practical project managementThe six step guide to practical project management
The six step guide to practical project management
 
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
 

Linked Data-based Concept Recommendation: Comparison of Different Methods in Open Innovation Scenario

  • 1. Linked  Data-­‐‑based   Concept  Recommendation:   Comparison  of  Different  Methods  in  Open   Innovation  Scenario Danica Damljanovic, Milan Stankovic, Philippe Laublet
  • 3. Innovation  Platforms Challenge:  Promote  innovation  problems  to  an  audience  of  solvers  who   can  propose  relevant  innovative  solutions
  • 4. Finding  Meaningful   Connec0ons   Kaolinite   Clay  mining   extrac0on  from   …   rocks   …   Different  communi-es  use  different  terms  and  concepts  to  speak  about  seman-cally  related     things.  Such  “language”  defines  communi-es  and  separates  them.  Being  able  to  find   meaningful  connec-ons  between  concepts  would  enable  us  to  build  bridges  between  people   and  content.   h;p://bit.ly/hyProximity  
  • 5. Concept  recommenda0on   •  Concepts  you  might  not  know  but  might  want  to  use:  to  annotate   your  content,  to  search  for  content,  to  search  for  people…   •  Help  problem  promoters  discover  relevant  concepts  (problem   promoters  some0mes  not  field  experts)   •  Discovery  =  relevance  +  unexpectedness   h;p://bit.ly/hyProximity  
  • 6. Discovering  Direct  and   Lateral  Concepts   •  HyProximity, a structure-based similarity •  Structure-based Statistical Semantics Similarity Random Indexing, a well-known statistical semantics from Information Retrieval to RDF
  • 7. Linked  Data-­‐based  Concept   Recommenda0on     DBPedia   Textual   Concepts   DBPedia   Zemanta suggestions Input found  in   Exploration the  text h;p://bit.ly/hyProximity  
  • 8. hyProximity   •  We  start  from  several  seed  concepts  found  directly  in  the  text,  and  search   the  DBPedia  graph   •  The  concepts  found  in  the  proximity  of  several  seed  concepts  are  considered   more  “in  context”  for  the  given  input   •  Concepts  found  at  a  shorter  distance  from  the  seed  concepts  have  higher   hyProximity  
  • 9. Different  Distance  Func0ons   Things in France skos:broader   other  property   Rivers in France Products of France Car Industry Cities in France 2   2   2   2+1   Marne Seine Paris Chanel Peugeot BMW •  Hierarchical:  exploring  skos:broader  rela9ons   •  Transversal:  exploring  transversal  links   •  mixed:  a  linear  combina0on  of  hierarchical  and  transversal     research.hypios.com/hyproximity  
  • 10. Different  Distance  Func0ons   Things in France skos:broader   other  property   Rivers in France Products of France Car Industry Cities in France famous for flows through “fashion”   competitor 1   1   1   Marne Seine Paris Chanel Peugeot BMW •  Hierarchical:  exploring  skos:broader  rela0ons   •  Transversal:  exploring  transversal  links   •  Mixed:  a  linear  combina0on  of  hierarchical  and  transversal     research.hypios.com/hyproximity  
  • 11. Random  Indexing •  Words which appear in the similar context - with the same set of other words - are contextually related e.g. synonyms. •  Synonyms tend not to co-occur with one another directly, so indirect inference is required to draw associations between words used to express the same idea
  • 12. Two  steps  to  Random   Indexing •  Indexing o  For an RDF graph, generate virtual documents o  Prepare the corpus (pre-processing) o  Generate semantic index •  Search - given a term X calculate a cosine similarity between the vector of that term and other vectors in the semantic space
  • 13. Building  context    vectors Seed  length d1 d2 .. dp = d1 0 0 -­‐‑1 1 -­‐‑1 1 X t1 1 2 .. 0 t2 3 0 .. 0 d2 -­‐‑1 1 0 0 1 -­‐‑1 .. .. .. .. .. tq 0 1 10 … D dp 0 1 0 -­‐‑1 -­‐‑1 1 M Dimensionality  =  n t1 t2 T … tq
  • 14. Indexing:  virtual  documents lexicalise S P1 L1 L8 S P2 L2 L7 L1 P10 S P3 L3 P9 S P4 O1 P1 P7 O2 P8 L6 S P7 O2 P2 S P4 O P L L2 S 1 5 4 P3 P4 S P4 O1 P6 L L5 5 P6 S P7 O L3 O1 P5 2 P8 L6 L4 S P7 O2 P L7 9 S P7 O2 P10 L8 Representative  subgraph  for  URI=S Virtual  document  for   URI=S 14
  • 15. Experiments •  26 real innovation problems from Hypios •  Measure of success: the suggested concepts appear in the actual solutions (precision, recall, f- measure) (+) reasonable list of concepts from real scenarios (-) not complete: o  User study: measure discovery = relevance +unexpectedness
  • 16. DBpedia  Dataset •  Select a number of properties relevant to the Open Innovation-related scenario •  dbo:product, dbp:pruducts, dbo:industry, dbo:service, dbo:genre, and properties serving to establish a hierarchical categorization of con- cepts, namely dc:subject and skos:broader
  • 17. Evaluation •  “Gold standard” o  Extract problem URIs o  Extract solution URIs •  Baseline: o  Google Adwords Keyword Tool: finds similar topics based on their distribution in textual corpora and the corpora of search queries. o  Suggesting up to 600 concepts which are then used for Web crawling for finding experts.
  • 19. User  Study •  Suggestions being both relevant and unexpected o  the most valuable discoveries for the user •  12 users •  34 problem evaluations o  3060 suggested concepts/keywords. •  For the chosen innovation problem, the evaluators were presented with the lists of 30 top-ranked suggestions generated by adWords, hyProximity (mixed approach) and Random Indexing.
  • 22. Conclusion •  Linked Data valuable source of knowledge for concept recommendation •  Our two methods complementary o  hyProximity better for precision o  Random Indexing better for recall •  User study: unexpectedness higher with our methods than with baseline •  Subjective user comment: o  Random Indexing: generic o  hyProximity: granular o  adWords: redundant
  • 23. Thank  You! •  Find out more: •  http://research.hypios.com/?page_id=165 Contact us: •  Danica Damljanovic @dancheeee •  Milan Stankovic: @milstan