Be the first to like this
My 3-min talk for Beyond The PDF 2 (Amsterdam) #btpdf2 pointing out that scholarly publishers don't tend to embed very rich metadata in their products e.g. PDFs.
Sure, arguably this data is inside the PDF, but I don't want to look inside the PDF. I want my machines to be able to grep what the PDF is in a nanosecond from structured, standardised metadata. I believe the standards exist, there's XMP and it's been around for a long time. It's just not being implemented much, or fully. In the current mixed world of Open Access and subscription access content it's *particularly* important to indicate the license details clearly in content, in a machine-readable way.
Can we please put this information into all 'professionally' published content? Surely not a difficult task?