
C7 ido ivri_noamcassel_the_open_press_archive


2014 EVA/Minerva Jerusalem International Conference on Digitisation of Cultural Heritage



  1. The Open Press Archive: Opening up NLI and TAU’s Historic Jewish Press Collection
  2. A few words about HaSadna
  3. Open Data Principles (source)
       • Access. The work shall be available as a whole and at no more than a reasonable reproduction cost, preferably downloading via the Internet without charge. The work must also be available in a convenient and modifiable form.
       • Redistribution. The license shall not restrict any party from selling or giving away the work either on its own or as part of a package made from works from many different sources.
       • Reuse. The license must allow for modifications and derivative works and must allow them to be distributed under the terms of the original work.
       • Absence of Technological Restriction. The work must be provided in such a form that there are no technological obstacles to the performance of the above activities. This can be achieved by the provision of the work in an open data format.
       • Attribution. The license may require as a condition for redistribution and re-use the attribution of the contributors and creators to the work.
       • Integrity. The license may require as a condition for the work being distributed in modified form that the resulting work carry a different name or version number from the original work.
       • No Discrimination Against Persons or Groups. The license must not discriminate against any person or group of persons.
       • No Discrimination Against Fields of Endeavor. The license must not restrict anyone from making use of the work in a specific field of endeavor.
       • Distribution of License. The rights attached to the work must apply to all to whom the program is redistributed without the need for execution of an additional license by those parties.
  4. Why the Open Press Archive?
       • A truly wonderful collection
       • Relevant to a wide range of users
       • JPRESS is mostly concealed in “the hidden web”
       • JPRESS allows for only one use case: search and browse
       • Can be easily used by Digital Humanities researchers
       • From HaSadna’s perspective: a good project for novice and advanced coders
  5. What has been done to date?
  6. What has been done to date?
       Initial Developers’ API:
         • 4_15_Ar03104
         • %94%D7%95%D7%93%D7%94
       Exposure to Google:
         • Search for בן יהודה (Ben Yehuda) on Google
  7. What can be achieved in the future?
       • Hackathons: new frontend, new apps
       • Tagging and linking to other repositories
       • OCR’d text improvement with CAPTCHA
       • Full text analysis
       • Topic analysis, auto annotation
  8. Thank you. You are all welcome to the Open Hub!
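
The second API example on slide 6 survives only as a percent-encoded fragment. A minimal sketch of how Hebrew text ends up in that form, assuming the API takes UTF-8 query terms in its URLs (the transcript does not confirm this, and the API's base URL and parameters are not reconstructed here). The search term is the one from the Google example on the same slide; the variable names are illustrative only.

```python
from urllib.parse import quote, unquote

# Search term from slide 6, "Ben Yehuda" in Hebrew, written with escapes
# so the right-to-left text cannot be reordered by an editor.
term = "\u05d1\u05df \u05d9\u05d4\u05d5\u05d3\u05d4"

# quote() percent-encodes the UTF-8 bytes of every non-ASCII character;
# the space becomes %20.
encoded = quote(term)
print(encoded)

# unquote() reverses the encoding and recovers the original text.
print(unquote(encoded) == term)

# The fragment shown on slide 6 (truncated in the transcript, kept as-is).
fragment = "%94%D7%95%D7%93%D7%94"

# Assumption: the fragment is the tail of an encoded Hebrew word.
# The encoding of the last word of the search term ends with the same bytes.
last_word = term.split(" ")[1]
print(quote(last_word).endswith(fragment))
```

This only shows that the fragment is consistent with URL-encoded Hebrew; what the full request looked like cannot be recovered from the transcript.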