Successfully reported this slideshow.
We use your LinkedIn profile and activity data to personalize ads and to show you more relevant ads. You can change your ad preferences anytime.

Miltenova - Ontotext


Published on

  • Be the first to comment

  • Be the first to like this

Miltenova - Ontotext

  1. 1. Electronic Corpus of SlavicManuscripts and Texts with Relevant Metadata Prof. Anissava Miltenova Department of Old Bulgarian Literature Institute for Literature Bulgarian Academy of Sciences Sofia Vienna, November 30, 2012
  2. 2. Cultural context: medieval literature of Slavia Orthodoxa• More than 90% of texts are translated from Greek;• Byzantine texts were not only translated, but transformed;• Old Church Slavonic texts were very often modified, they have many versions;• The prototype of text often remains relative, or even unknown; Slavic scribes created a new recension based primarily on divinatio, or often under the influence of multiple sources (contaminatio);• Very often one manuscript is scattered in 2-5 libraries/ collections /countries. Vienna, November 30, 2012
  3. 3. E-corpus of Medieval Slavic Manuscripts• Cataloguing of manuscripts, analytical description• Facsimiles in the form of computerized image files, linked to the catalogue database• Sets of text files linked both to the relevant catalogue database entries and facsimile image files• Corpuses of Old Church Slavonic texts and parallel Greek sources (CPG, BHG, others)• Dictionaries• Ontology of terminology• Workstation for medievalists: real or illusive? Vienna, November 30, 2012
  4. 4. Principles of processing• TEI P5 guidelines;• The encoding of Old Church Slavonic is based on Unicode 5.1;• Standardization of document file format;• Multiple use (ensured by the separation of data from processing);• Portability of electronic texts (independence of local platforms);• Orientation towards well-structured divisions of data in XML;• User friendly and fast access. Vienna, November 30, 2012
  5. 5. Repertory of Old Bulgarian Literature and Letters: Content and Metadata• To make possible access to medieval manuscripts for all purposes of research in medieval studies - textology - codicology - paleography - identification of text and analysis of context of every text, etc.)• As much as bigger is the corpus, so important is the help of computer tools for searching and extraction of data.• Access: Vienna, November 30, 2012
  6. 6. Browser view Vienna, November 30, 2012
  7. 7. Encoding in oXygen Vienna, November 30, 2012
  8. 8. Outcomes1. Model for highly structured description of manuscript materials based on XML format (designed joint with Andrej Boyadzhiev, Sofia University) TEI P5;2. Corpus of over 350 Slavic manuscripts (11th--18th centuries);3. Model for electronic editions of Old Slavonic texts in XML format and approximately 370 pages electronic texts;4. Model for comparison of the content of miscellanies and visual presentation in SVG format;5. Glossaries of terms, authority files and and thesaurus of terminology (8 languages –Bulgarian, Serbian, Russian, English, German, French, Old Church Slavonic and Greek), over 750 articles in Moodle software: Vienna, November 30, 2012
  9. 9. Repertorium Workstation Vienna, November 30, 2012
  10. 10. SVG tool: David Birnbaum, University of Pittsburgh Vienna, November 30, 2012
  11. 11. Terminology project “Metadata and Electronic catalogues”: from Thesaurus to Ontology Vienna, November 30, 2012
  12. 12. Vienna, November 30, 2012
  13. 13. Cooperation✔ Project “Electronic Description and Edition of Slavic Sources”, 2002- 2003 (UNESCO)✔ Project “Slovo” (2007-2008): Transfer of knowledge among working teams of five countries (Austria, Bulgaria, FYR of Macedonia, Serbia, and Slovenia) and joint research work. • Digital collections of manuscripts of monasteries: St Catherines on Mount Sinai, Dečani monastery, Markov monastery, Rila monastery • Organization of a workshop on digital libraries of Slavic Manuscripts in Sofia in February 2008 and a round table on the Balkan monastic culture. • Production of a specialized portal on medieval Slavic culture ”SLOVO”:✔ The project with Slavic department in the University of Göteborg (2003-2011) and Uppsala (2012-2014)✔ Project with British Library (2004-2011): “Machine-readable description and electronic catalogue of Slavic Manuscripts”✔ Project “Digital Scholarly Editions and Developments in the Slovenian and Bulgarian Literary Studies” with ZRC Slovenia (2010- 2013) Vienna, November 30, 2012
  14. 14. Project “Encyclopedia Slavica Sanctorum”• Encyclopaedia Slavica Sanctorum is a joint project of St. Kliment Ohridski University of Sofia and Institute of Literature, Old Bulgarian Literature Department.• Main goal is to build a repertory of medieval and early modern Bulgarian texts for saints in combination with ethnological data and some visual sources.• During the project IMI-BAS team produces a digital repository for on-line Web access to the Bulgarian information on saints, i.e. Encyclopaedia Slavica Sanctorum Calendar (ESS)• Online access: Vienna, November 30, 2012
  15. 15. Achievments• Repertorium database are very powerful research instruments, more accurate and more comfortable for the users;• Scholars changed their point of view on the effectiveness of the applications of modern software tools to manuscripts and medieval texts;• Strong international cooperation and exchange of information in the field of computational medieval studies and computational humanities in general;• Bridge over the gap between West Europe and East Europe is also essential today and even more for the future. Vienna, November 30, 2012
  16. 16. A digital edition of Codex Suprasliensis (2010-2012)• To unite digital images of all three parts of the Codex Suprasliensis;• To create electronic version of the OCS text in Unicode; (a prior electronic version was transliterated in 7-bit ASCII distributed by the Corpus Cyrillo-Methodianum Helsingiense and the TITUS project);• To create electronic version of Greek texts;• To develop searching system;• To add translation in English;• Edition is freely available under a Creative Commons BY- NC-SA license ( sa/2.0/ ) Vienna, November 30, 2012
  17. 17. What is the Codex Suprasliensis?• Cyrillic• Late tenth century• 37 quires• 285 folia, 1о• Leaf: 31 X 23 cm• Text box: 23 X 15 cm• One column Vienna, November 30, 2012
  18. 18. DecorationVienna, November 30, 2012
  19. 19. OrthographyColophon of Retko in the Vienna, November 30, 2012margin
  20. 20. Preservation of the Codex Supasliensis• University Library in Ljubljana: 118 folia, Kopitar collection;• Russian National Library in St. Petersburg: 16 folia, A. F. Byčkov collection;• National Library in Warsaw, 151 folia, counts Zamoyski collection, lost at the time of the World War II, reemerged in the USA and was returned to Poland in 1968;• listed in the UNESCO’s Memory of the World Register since 2007;• The project is supported by UNESCO.• Main page of the project: Vienna, November 30, 2012
  21. 21. Contents• Primarily vitae of martyrs from the 3rd-4th centuries – Paul and Juliana – St Basiliscus – Forty Martyrs of Sebaste• 1st-2nd centuries – Conon of Isauria• 6th-7th centuries – St Gregory the Great, Pope of Rome• 9th century – Forty-two Martyrs of Amorium• Anchorites – Paul the Simple Vienna, November 30, 2012
  22. 22. Vienna, November 30, 2012
  23. 23. Browse the text: main view• Paragraphed OCS – Helsinki ASCII transcription converted to Unicode and corrected manually against facsimile in Zaimov/Capaldo edition – Paragraphing introduced by editors• Linked to – Parallel Greek (from Zaimov/Capaldo edition) – Parallel English (new translation by Thomas Lysaght) – Folio views (see below)• Linguistic commentary – Word-for-word, scrolls and highlights – Back-linked• Web fonts Vienna, November 30, 2012
  24. 24. Browse the text: folio view• Interlinear OCS and Greek on left – Mouse-over linguistic commentary• Facsimile on right – New photographs, contributed by repositories – Magnifying glass• Links to other folios• Links to main view and searches• Same OCS and Greek text and linguistic annotation as main view – Single source = multipurposing 2012 Vienna, November 30,
  25. 25. Vienna, November 30, 2012
  26. 26. Incorporation into PROIEL Vienna, November 30, 2012
  27. 27. Thank you!Pitagor award, 2009 Acknowledgements: Working team: Department of Old Bulgarian Literature, Institute of Literature BAS: Methods: Commission to the ICS: t/commission/index.html Anissava Miltenova, Andrej Boyadzhiev David J. Birnbaum, Hanne Martin Ekhoff Vienna, November 30, 2012