When modeling Linked Open Data (LOD), reusing appropriate vo- cabulary terms to represent the data is difficult, because there are many vocabularies to choose from. Vocabulary term recommendations could alleviate this situation. We present a user study evaluating a vocabulary term recommendation service that is based on how other data providers have used RDF classes and properties in the LOD cloud. Our study compares the machine learning technique Learning to Rank (L2R), the classical data mining approach Association Rule mining (AR), and a baseline that does not provide any recommendations. Results show that utilizing AR, participants needed less time and less effort to model the data, which in the end resulted in models of better quality.
3. 3
§ Popularity of a candidate (i.e. vocabulary term)
} Number of LOD sources using a candidate
} Number of LOD sources using candidate’s vocabulary
} Number of total occurrences of a candidate
§ Candidate from an already used vocabulary
§ Collabora5ve filtering:
} How did others use a recommenda5on candidate
Term Recommenda7ons
based on…
Example of Schema-Level PaRerns (SLPs):
slp = ({swrc:Publication}, {dc:creator}, {foaf:Person})
Resources of type swrc:PublicaCon are connected to
resources of type foaf:Person via the property dc:creator
9. § The par5cipants’ effort
} Task Comple5on 5me (max. 6 min. to avoid 5redness)
} Recommenda5on acceptance rate
(number of terms chosen from recommenda5ons)
§ The quality of the resul5ng data
} Number of vocabulary terms that were also used by
five different data modeling experts
§ Level of saCsfacCon with both recommenders
} 5-point Likert scale ra5ng AR and L2R
} Ranking of L2R, AR, and using no recommenda5ons
9
User Study - Measurements
10. § 20 par5cipants (5 female)
} 18 in academia, 2 in both academia and industry
} 2 master students, 14 research associates , 3 post docs,
1 professor
} 8 recruited from USC, 12 recruited from GESIS
§ Knowledge and experience
} Karma: 7 high to expert knowledge, 13 none at all
} LOD: average experience of 3 years
} Self-rated experience (5-point Likert): M = 2.8, SD = 1.6
} Task knowledge (5-point Likert): M = 2.1, SD = 1.1
10
Results (1/3) - Par7cipants