Horizontal decomposition of freebase
Upcoming SlideShare
Loading in...5
×
 

Horizontal decomposition of freebase

on

  • 459 views

Roughly 1/3 of the facts in Freebase contribute most of the value; by extracting from the Freebase dump, the data can be more rapidly processed by RDF tools such as triple stores as well as ...

Roughly 1/3 of the facts in Freebase contribute most of the value; by extracting from the Freebase dump, the data can be more rapidly processed by RDF tools such as triple stores as well as Hadoop-based toolkits such as the infovore framework

Statistics

Views

Total Views
459
Views on SlideShare
459
Embed Views
0

Actions

Likes
2
Downloads
2
Comments
0

0 Embeds 0

No embeds

Accessibility

Categories

Upload Details

Uploaded via as Microsoft PowerPoint

Usage Rights

© All Rights Reserved

Report content

Flagged as inappropriate Flag as inappropriate
Flag as inappropriate

Select your reason for flagging this presentation as inappropriate.

Cancel
  • Full Name Full Name Comment goes here.
    Are you sure you want to
    Your message goes here
    Processing…
Post Comment
Edit your comment

Horizontal decomposition of freebase Horizontal decomposition of freebase Presentation Transcript

  • Horizontal Decomposition of Freebase based on data sampled in July
  • Summary • 29-41% of accepted triples (links + other) hold most of the value • “a” is numerous but doesn’t provide the evidence other predicates do • “descriptions” are bulky • “names” are not machine readable • “keys” are duplicated, nonstandard, optional
  • a 5% description 18% key 11% keyNs 13%label 6% name 6%notability 0% nfp 0% text 8% web 6% links 20% other 7% percentage of gz compressed size
  • a 16% description 1% key 9% keyNs 11% label 6% name 6% notability 2% nfp 2% text 0% web 5% links 32% other 10% percentage of facts
  • a 15% description 7% key 8% keyNs 9% label 4% name 4% notability 2% nfp 1% text 3% web 6% links 30% other 11% percentage of uncompressed size
  • :BaseKB and Infovore Data processed with Infovore software https://github.com/paulhoule/infovore/ Get segmented Freebase data at http://basekb.com/