Welcome.	
This	presentation	will	be	done	by	Willem	Melder	(ICT	expert	at	B&G,	developer	of	the	
collection	registration	system	and	the	media	suite),	Eva	Baaren	(media	and	digital	
humanities	specialist	at	B&G,	responsible	for	community	outreach	in	WP5),	and	
Liliana	Melgar	(researcher	at	the	UvA,	responsible	for	user	studies	in	WP5).
We	will	present	our	collection	registration	system	in	three	parts:	(read).	The	most	
important	part	is	the	discussion,	in	which	we	hope	to	count	with	your	questions,	
insights,	suggestions...
2
We	start	with	the	introduction	to	the	media	suite
3
In	the	media	studies	focus	(WP5)	we	began	with	five	tools:	AVResearcherXL,	Trove,	
Comerda,	Dive,	and	Verteld Verleden.	These	tools	were	developed	in	different	
projects,	and	offered	access	to	different	collections	(oral	history	interviews,	
newspapers,	television	and	radio,	museum	collections,	among	others).
In	the	initial	meetings	of	WP5,	the	challenge	that	scholars	posed	to	the	ICT	experts	
was	the	need	to	use	those	different	collections	with	the	different	tools,	since	the	
tools	presented	the	possibility	of	achieving	different	goals.	Some	of	them,	though,	
had	common	functionalities,	for	instance	word	clouds.
The	solution	proposed	by	the	ICT	experts	was	to	separate	the	collections	from	the	
tools,	and	map	the	functionalities	in	the	tools,	reconstructing	them	in	a	modular	
approach.
4
The	current	Media	Suite	consists	thus	of	collections	on	the	one	hand,	and	
functionalities	on	the	other	hand.
Each	collection	is	“pluged”	to	the	available	functionalities.	For	instance,	the	Sound	
and	Vision	catalog	can	be	accessed	via	a	faceted	search	functionality,	or	via	the	linked	
data	browser.
Other	components	include:	(read	headers)
7
In	the	same	way,	the	collections’	metadata	can	be	visualized,	for	instance,	in	a	
timeline
These	components	can be	“recombined”	into	the	previous	tools,	but	now	made	
compatible,	and	ready	to	be	used	with	all	collections.
For	example,	on	the	left,	you	see	the	“recipe”	for	Trove…
LEFT:	TROVeXL
RIGHT:	DIVE
FUTURE:	CANVAS	OPTION	(DIY)
This is the current placeholder/draft home page for the media suite. It directs the user
to available recipes, tools, available functionalities, APIs and datasets.
Search API and Annotation API are currently the APIs the media suite functionalities
are built upon. The search API is connected to the index that contain all data imported
from CKAN. The annotation API enables one to store & access all (W3C Web
Annotation compliant) annotations
10
The	datasets	link to CKAN, which Willem will explain later. The registration system is
currently the only way to provide collection data to the system.
11
For	example:	the	collection	analyzer	is	a	component	(functionality).	All	collections	
registered	in	CKAN	can	appear	in	this	drop	down	menu	and	be	selected.
12
If	a	collection/dataset	is	used	via	the	collection	analyzer,	the	functionality	enables	the	
automatic	visualization	of	different	aspects	in	the	data,	for	example,	the	amount	of	
missing	dates	in	a	collection,	or	the	possible	wrong	dates	(for	example,	some	
programs	may	have	a	wrong	broadcast	date	such	as	2050)
13
Finally,	the	most	important	aspect	of	this	modular	approach	is	that	now	all	the	
different	collections	can	be	used	by	the	different	tools,	which	are	now	called	
“recipes”	since	they	are	made	of	different	functionalities	or	modules,	and	with	
different	ingredients:	the	collections.
STARTING	POINT:	TOOLS
CURRENT	APPROACH:	MODULAR	
(losse functionaliteiten als ingrediënten)
14
Now	Willem	will	explain	CKAN	in	detail
15
This is CKAN, the collection registry for CLARIAH WP5 media studies and currently
the only way to provide collection data to the system. This instance of CKAN is
reserved for ‘official institutional’ datasets. In a later phase of the project we will think
about providing proper means to upload/share custom datasets, such as scraped
twitter data or youtube collections.
DATA	REGISTRATION	VIA	CKAN	(DEMO)
16
And	now	we	would	like	to	open	the	discussion...
17
And	now	we	would	like	to	open	the	discussion...
18
19
20
21
22

Collection registration for the CLARIAH Media Suite.