3. terminology
●
●
●
Dublin Core – metadata dictionary for the description of a wide range of
resources (DCIM 2005)
OAI-PMH – Open Archive Initiative / Protocol for Metadata Harvesting –
open standard for metadata distribution
CoinS – ContextObject in Spans – bibliographical metadata in html
biblioteka universitare shkencore
4. requirements
image format
●
quality / size ratio
●
open
platform
●
support for dublin core, OAI-PHM, CoinS
●
web based ( cross platform )
●
support for integrated search engines ( lucene, sphinx, ... )
●
cloud enabled
●
open source
●
minimal hw / sw requirements
biblioteka universitare shkencore
5. image format - djvu
●
AT&T 1988
●
Created for scanned documents
●
open standard
●
OCR text layer ( eases search/copy-paste )
●
cross platform ( win / *nix / mac )
●
support for personalized encryption schemes
tiff
5000 kb
jpeg
572 kb
pdf
301 kb
djvu
70 kb
biblioteka universitare shkencore
6. platform
●
based on the OMEKA project from the “George Mason University”
●
web based
●
open source
●
support for Dublic Core, OAI-PMH, CoinS
●
modular architecture
●
support for lucene
●
queue management ( gearman / rabbitmq )
●
cloud support
●
import / export option in open formats ( xml / json )
biblioteka universitare shkencore
8. the actual situation
●
Books – 46, from 1537 - 1930
●
magazines
–
Cirka, complete collection
–
Leka, complete collection
–
Posta e Shqypnies, complete collection
–
Agimi, complete collection
–
Perpjekja, complete collection
●
total of 15 000 scanned pages
●
around 300 scanned materials
biblioteka universitare shkencore
9. process automation
●
the usage of open source technologies enables us to have a full process
automation
document
scanning
conversion
upload
ready for metadada
biblioteka universitare shkencore
10. future
●
OCR ( tesseract ) in process
●
partner module
●
map geo referencing in process
●
copyright management ( ? ) in process
●
installation in other libraries / museums
●
search engine for future installation ( based on OAI - PMH )
●
better scanner got it :)
●
better server ( cloud hosting ? )
biblioteka universitare shkencore
11. future 2
june – july 2013 release under open source license of the following code
●
scanning process automation
●
Djvu applet
●
modules and changes in the OMEKA core
for more adsh.unishk.edu.al
biblioteka universitare shkencore