A	Vocabulary	for	Persistence	
John	Kunze,	Jeremy	DeBarry,	Ma<hew	
Hanlon,	Calvert	Scout,	Sandra	Sweat
2	
Two	themes	
1.	Proposed	metadata	for	“persistence	statements”	
•  What	you	mean	by	persistence	
•  Informing	user	linking	choices	
2.	Metadata	hardened	in	open	yamz.net	dicNonary	
•  Crowdsourced,	but	with	reputaNon-based	voNng	
•  Every	term	has	a	unique	persistent	idenNfier	(PID)	
	2
•  Open	idenNfiers	
deserve	their	own	
fesNval	
•  9	and	10	November	
in	Reykjavik,	Iceland!	
•  If	you’re	doing	
something	
interesNng	with	PIDs	
(or	you	want	to!)	
come	and	share	
your	ideas	with	a	
crowd	of	like-
minded	innovators
4	
“Persistence”	used,	but	not	defined	
•  Persistence	is	not	binary	
•  Persistence	is	usually	a	forecast	about	
stubbornness	or	sNckiness	
•  Persistence	is	nuanced	and	dimensional	
	4
5	
IdenNfier	strings	don’t	help	much	
	
	
																																															4CF3-57AB-2481-651D-D53D-Q	
	5	
h<p://doi.org/10.5072/4CF3-57AB-2481-651D-D53D-Q		h<p://doi.org/10.5240/4CF3-57AB-2481-651D-D53D-Q	
Persistence	is	not	conferred	by	a	string	or	a	syntax.
6	
Use	cases	and	goals	
•  Classic	case:	reproducible	science	
•  Overlooked	cases:	data	papers,	sohware	releases	
•  Goal:	propose	persistence	metadata		
•  Goal:	whatever	link	you	find,	be	able	to	query	the	
provider	for	its	“persistence	statement”	
	6
7	
Sejng	user	expectaNons	
Terms	for	content	variance	
•  frozen	–	unchanging	bitstream	
•  keeping	–	unchanging	content	
•  fixing	–	subject	to	correcNon	
•  rising	–	subject	to	acNve	enhancement	
•  moul7ng	–	unchanging	theme	
	7
8	
Sejng	user	expectaNons	
Terms	for	object	availability	
•  finite	–	ends	at	known	date	or	event	
•  indefinite	–	no	special	commitment	
•  life7me	–	as	long	as	the	provider	exists		
•  subinfinite	–	beyond	provider’s	lifeNme	
	8
9	
Sejng	user	expectaNons	
A	term	for	objects	that	grow	in	a	certain	way	
•  waxing	–	non-disrupNve	growth	
Examples	
•  live	sensor	data	feeds	
•  	serial	publicaNons	
	9
10	
Why	should	we	believe	you?	
Terms	specifying	the	nature	of	the	provider	
•  name	–	of	organizaNon	
•  iden7fier	–	unique	organizaNonal	idenNfier	
•  mission	–	is	preservaNon	in	your	mission?	
•  succession	policy	
	10
11	
Referencing	in	presence	of	versions	
Terms	for	content	referencing	
•  extraversioned	–	“10.2345/67,	Version	4”	
•  intraversioned	–	“10.2345/67.V4”	
•  introversioned	–	“10.2345/6789”	
	11
12	
The	great	landing	page	debate	
Terms	to	request	either	
•  landing	–	for	human	consumpNon	
•  plunging	–	for	machine	consumpNon	
	12	
mikebaird	on	flickr	
or
13	
Naming	and	remediaNon	policy	
If	there’s	a	problem,	what	repair	priority?	
•  high	–	high	priority	
•  standard	–	not	high	priority	
Forming	idenNfier	strings	
•  NR	–	non-reassignment	
•  OP	–	opaque	idenNfiers	
•  CC	–	check	character	added	
	13
14	
Concept	ids	(naming	and	remediaNon)	
If	there’s	a	problem,	what	repair	priority?	
•  high	–	n2t.net/ark:/99152/h1207	
•  standard	–	n2t.net/ark:/99152/h1208	
Forming	idenNfier	strings	
•  NR	–	n2t.net/ark:/99152/h1215	
•  OP	–	n2t.net/ark:/99152/h1218	
•  CC	–	n2t.net/ark:/99152/h1219	
	14
15	
yamz.net	(yet	another	metadata	zoo)	
	15
16	
Problem:		tradiNonal	standardizaNon	
•  Change	by	commi<ee	is	ugly,	costly,	and	slow	
•  Example:		Dublin	Core,	15	cross-domain	terms	
•  same	terms	aher	5	years	as	aher	11	months	
•  new	terms	banned	in	fear	of	fragile	consensus	
	16	
European	Parliament	Technology	-	DG	ITEC	@	flickr
The	Metadata	Universe	
Jenn	Riley,	IU
The	Metadata	Universe	
Jenn	Riley,	IU
The	Metadata	Universe	
Jenn	Riley,	IU
The	Metadata	Universe	
Jenn	Riley,	IU
The	Metadata	Universe	
Jenn	Riley,	IU
22	
An	alternate	metadata	universe	
•  Vision:	one	dicNonary,	one	namespace	
•  All	research	domains,	any	part	of	“metadata	speech”	
•  Names,	values,	units,	relaNonships,	...		
	
	22	
SimonRobertson@flickr
23	
Crowdsourced,	but	with	voNng	
	23	
vernacular	
canonical	
deprecated	
3	classes	
of	term	
ç		all	terms	are	born	here	
ç		these	don’t	evolve	
ç		so	terms	never	go	away	
Each	term	gets	a	unique	persistent	id.		Example:	
					idenNfier:				hBp://n2t.net/ark:/99152/h1193	
						term:											oba	
						definiNon:		other	(origin:	from	Tagalog)
24	
ReputaNon-based	voNng	resists	“gaming”	
•  Meritocracy:	strong	terms	rise,	weak	terms	decline	
•  Lessons	from	StackOverflow,	Internet	standards,	
and	Wikipedia	processes	
	24	
Karunakar	Rayker	@flickr
25	
YAMZ	usage	pa<erns	
	25	
Search	for	
terms	
(words	and	
definiNons)	
find	a	term	you	love	
great	–	use	it	
find	a	term	you	kind	of	love	 try	it	out,	comment,	
engage	with	author	
no	workable	term	found	 instantly	enter	own	term	
and	watch	for	comments	
find	a	word	you	love	 “I	want	that	word!”,	so	
enter	a	compeNng	term	but	a	defini7on	you	hate
26	
Term	tag	in	YAMZ	
	26
27	
In	conclusion	
•  Choosing	which	objects	to	cite	is	hard	
•  Few	well-defined	terms	to	express	persistence	
•  …	or	to	set	user	expectaNons	of	change	
•  Thanks	to	yamz.net	for	be<er,	cheaper,	faster	
vocabulary-building;	projects	include	
•  CiNzen	Science;	DesignSafe;	Persistence	statements	
	27

A Vocabulary for Persistence