Electronic Corpora for Ancient Languages
Prague, November 16th-17th 2007

Linking Text References to Relevant Digital Resources Over The Web

Matteo Romanello matteo.romanello@yahoo.it
University “Ca' Foscari” of Venice

A Microformat for Canonical Texts References

                   • Topic: how to link secondary sources to corpora
                     of ancient-language texts?

                   • Goal: to give scholars reading the Digital Library's
                     primary and secondary sources more powerful
                     research tools and a richer reading experience

                   • Focus: references to Canonical Texts in XHTML

                   • Examples' Scope: Classical (Greek and Latin)
                     literature




Matteo Romanello     Electronic Corpora for Ancient Languages - Prague, November 16th -17th 2007   2/32
Digital Library on Classics: the State of the Art

                   • A few online secondary sources (journal
                     articles and monographs) available as (X)HTML
                   • A few authoritative, born-digital online
                     journals, e.g. Classics@, published by
                     Harvard's Center for Hellenic Studies
                   • Some online text corpora (Perseus and other
                     minor, scattered collections)
                   • Some resources and reviews of electronic
                     resources for humanists, book reviews...
                   • Research blogs


Current e-scholarship scenarios (1)

                   Scenario 1

                   John is a scholar of Greek literature and
                      wants to find all online articles or
                      electronic resources related to the verse
                      he is focusing on (Hom. Il. 20.249).

                   He submits a query like 'Hom. Il. 20.249' to
                      Google, and what Google retrieves is
                      neither pertinent nor interesting.
                      Ordinary search engines are purely
                      text-based (no semantics, language-dependent,
                      etc.).

                   John would like to have a more precise or
                      specialized search engine available,
                      perhaps one capable of understanding the
                      semantics of the reference he typed in as
                      the query string.

                   [Figure: the author of the Iliad and Odyssey under
                   language-dependent name forms: Homer, Homère, Omero, ...]

Current e-scholarship scenarios (2)

                   Scenario 2




                     John's colleague points out to him that Gregory Nagy, in a
                     passage of the 2nd chapter, mentions the passage John is
                     interested in. John finds an online version of the book and
                     opens it in his browser...




Current e-scholarship scenarios (3)




             For a meaningful e-reading experience, John should be able to
             read the cited verse in its context, to compare the text of that verse as
             recorded in different manuscripts, and to read the same passage in a given
             translation or read a commentary on it.
New e-scholarship scenarios (1)




                   • Semantic understanding of text references by
                     the web browser
                   • Search for resources pertinent to the author,
                     the work or the precise text passage referred to

New e-scholarship scenarios (2)

              •    Value-added services (VAS) for scholars
                   – Reference linking
                   – Related resources
                   – Targeted and semantic-oriented search
                   – Different exemplars of a work
              •    Problems:
                   1) To build a distributed library
                   2) To provide VAS linking secondary to primary sources
From printed to digital libraries

                   • Find new constructive paradigms to take
                     advantage of the network's properties
                   • In a network environment:
                       – Library universally distributed and with higher
                         granularity
                       – Provide reference linking
                   • Reference linking to primary sources (from
                     references in secondary sources):
                       – E.g. move from the citation Hom. Il. 1.1 to all available
                         translations, compare critical editions and find
                         related resources



The evolution of ancient-language corpora

                   • TLG (1970s) -> mass storage and rapid retrieval
                   • Perseus (1980s) -> richer media and higher level
                     data structures
                   • DLs + web protocols -> convergence of
                       – XML-related technologies:
                             • TEI (encoding)
                             • XML databases (storage of structured data)
                             • Query capabilities over HTTP
                       – Web services communicating in a REST style
                       – Success of distributed architectures (cf. OAI-PMH)
                   Which protocol? The Canonical Text Services (CTS) protocol

A new paradigm for building on-line corpora: the CTS protocol (1)

                   • CTS web protocol:
                      –   a new paradigm for building electronic corpora
                      –   gives hierarchical access to works as XML-TEI files
                      –   is based on the model described by FRBR
                      –   developed by Neel Smith et al. at Harvard's CHS
                      –   built on the Registry Services Protocol (v. 1.0.rc1) ->
                          authority lists
                   • Some CTS related projects:
                      – Perseus' CTS interface
                      – Multitext Homer




A new paradigm for building on-line corpora: the CTS protocol (2)
               • CTS-compliant text server
               • Texts: XML TEI
               • Textgroups and works are identified by URNs
               • Collections described by authority lists



Reference Linking in the Digital Library




Linking primary to secondary sources on-line: state of the art

                   • Two very loosely coupled systems
                   • No born-digital equivalent to printed references
                   • Most projects use an internal linking system:
                      – A good degree of hypertextuality
                      – Fairly closed systems of hard-linked resources
                   • Digital references == strings
                      – No semantic information
                      – No semantics-aware information processing
                      – Disambiguation of abbreviations and implicit
                        statements is left to the reader




A digital companion to printed canonical texts references
                   • Problem: provide a digital companion to printed
                     references
                       – to express references in a simple and semantic way
                             • exploiting the opportunities given by the digital medium
                             • separating semantics from presentational matters

                   • Solution:
                       – mapping references to requests compliant with the
                         protocol used to build a distributed library (CTS)
                       – embedding chunks of semantic information within
                         XHTML docs
                   • Implementation: Microformats (from Web 2.0)
                   • Goal: to design a Microformat for Canonical Text
                     references
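
The mapping from a reference to a CTS request can be sketched as follows. This is a minimal illustration, not the author's implementation: the endpoint `http://example.org/cts` is a placeholder (the slides name no concrete server), and the `request` and `urn` query parameters are assumed to follow the GetPassage request of the CTS protocol. The URN combines the identifier scheme shown later in the deck with the verse from Scenario 1 (Hom. Il. 20.249).

```python
from urllib.parse import urlencode

# Placeholder endpoint: an assumption, the slides do not name a CTS server.
CTS_ENDPOINT = "http://example.org/cts"


def get_passage_url(urn: str, endpoint: str = CTS_ENDPOINT) -> str:
    """Build a GetPassage request URL for a CTS URN.

    The parameter names ("request", "urn") are assumed from the CTS protocol.
    """
    # safe=":" keeps the colons of the URN readable in the query string.
    query = urlencode({"request": "GetPassage", "urn": urn}, safe=":")
    return f"{endpoint}?{query}"


url = get_passage_url("urn:cts:greekLit:tlg0012:tlg001:20.249")
print(url)
```

Because the request is a plain URL, the same reference can be resolved against any CTS-compliant server by swapping the endpoint.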
Microformats or RDF?



               • MFs = a bottom-up path to the Semantic Web (real
                 world semantics, or the lower-case semantic web)
               • Used within blogs for friendships, geographical
                 data, reviews...
               • Firefox 3 -> native support for Microformats
                 (display of microformatted content integrated in the
                 UI)
               • Not the only way to embed metadata inside
                 common tag elements
                    – RDFa <http://www.w3.org/TR/xhtml-rdfa-primer/>
                      proposed by W3C
Microformats: definition

                   • Microformats are:
                      – XHTML (POSH) compounds
                      – A set of design principles for formats
                      – A set of simple open data formats built upon
                        existing and widely adopted standards
                   • Microformats are not:
                      – A new language
                      – An attempt to change everyone's current behavior
                   • Goals:
                      – Make data reusable and interoperable among
                        web services and mashup applications




Texts references: different use cases

                   1. Politics
                   2. like Aristotle claims
                   3. Politics of Aristotle
                   4. Arist. Pol. 1304B
                   5. Line 1 of the first book of Homer's Iliad
                   6. Hom. Il. I 1
                   7. Α 1 (== upper-case Alpha 1, Hellenistic book
                      notation)
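
Only the abbreviated forms in the list above (such as case 6) can be resolved mechanically. The sketch below is a hypothetical illustration of that step: the lookup tables stand in for a real authority list, the regular expression and function names are invented for this example, and only the Homer and Iliad identifiers (`tlg0012`, `tlg001`) and the `greekLit` namespace are taken from the URN shown in the deck. Forms that need context, such as "Politics" alone, are deliberately left unresolved.

```python
import re
from typing import Optional

# Hypothetical stand-in for an authority list; only these two identifiers
# appear in the slides (urn:cts:greekLit:tlg0012:tlg001:...).
AUTHORS = {"Hom.": "tlg0012"}
WORKS = {("tlg0012", "Il."): "tlg001"}

# author abbreviation, work abbreviation, book (arabic or roman), line
CITATION = re.compile(r"^(\S+\.)\s+(\S+\.)\s+(\d+|[IVX]+)[ .](\d+)$")


def roman_to_int(numeral: str) -> int:
    """Convert a roman numeral built from I, V and X (enough for 24 books)."""
    values = {"I": 1, "V": 5, "X": 10}
    total = 0
    for i, char in enumerate(numeral):
        value = values[char]
        # Subtractive notation: a smaller value before a larger one.
        if i + 1 < len(numeral) and values[numeral[i + 1]] > value:
            total -= value
        else:
            total += value
    return total


def resolve(citation: str) -> Optional[str]:
    """Return a CTS URN for an abbreviated citation, or None if unresolved."""
    match = CITATION.match(citation.strip())
    if match is None:
        return None
    author_abbr, work_abbr, book, line = match.groups()
    author = AUTHORS.get(author_abbr)
    work = WORKS.get((author, work_abbr))
    if author is None or work is None:
        return None
    book_number = int(book) if book.isdigit() else roman_to_int(book)
    # The namespace is hard-coded because the table only covers Greek texts.
    return f"urn:cts:greekLit:{author}:{work}:{book_number}.{line}"


print(resolve("Hom. Il. I 1"))
print(resolve("Politics"))
```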


Designing a MF for Canonical Texts References (1)
                   • Start from a specific problem (principle #1)
                       – Problem: link secondary to primary sources on the web
                   • Reuse building blocks from widely adopted
                     standards (princ. #4)
                       – Canonical text citation scheme widely used among
                         scholars of Classical literature
                       – Canon of Greek Literature provided as an authority list
                         compliant with the Registry Services Protocol
                   • “Paving the Cowpaths”
                       – keep references looking the same as they do now
                       – in addition, add semantics to references
                       – also allow internal linking systems
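
One way to keep the printed form while adding semantics is the microformats abbr pattern, where the visible text stays unchanged and the machine-readable value goes in the `title` attribute. The sketch below assumes that pattern; the class name `ctsreference` and the function name are illustrative guesses, not the markup actually specified by the author.

```python
from html import escape


def markup_reference(display_text: str, urn: str) -> str:
    """Wrap a reference in an XHTML element that carries its CTS URN.

    The class name "ctsreference" is an assumption made for this sketch.
    """
    return '<abbr class="ctsreference" title="{}">{}</abbr>'.format(
        escape(urn, quote=True), escape(display_text)
    )


snippet = markup_reference(
    "Hom. Il. 20.249", "urn:cts:greekLit:tlg0012:tlg001:20.249"
)
print(snippet)
```

A reader still sees "Hom. Il. 20.249", while software reading the page gets an unambiguous identifier.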
Designing a MF for Canonical Texts References (2)
                   • Modularity and embeddability (princ. #5)
                       1. MF for author
                       2. MF for works
                       3. MF for text references




Designing a MF for Canonical Texts References (3)

                   [Figure: a reference as it appears on the page, next to
                   its underlying microformatted content]

                   urn:cts:greekLit:tlg0012:tlg001:20.131-20.137
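
The URN above can be split into its components with a few lines of code. This is a minimal sketch that assumes the colon-separated form shown on the slide (namespace, textgroup, work, optional passage or passage range); the field names are chosen for this example, and other versions of the CTS URN syntax may separate textgroup and work differently.

```python
from typing import NamedTuple, Optional


class CtsUrn(NamedTuple):
    namespace: str
    textgroup: str
    work: str
    start: Optional[str]  # first cited passage, e.g. "20.131"
    end: Optional[str]    # last cited passage; equals start for a single passage


def parse_cts_urn(urn: str) -> CtsUrn:
    """Split a CTS URN in the slide's colon-separated form into its parts."""
    parts = urn.split(":")
    if len(parts) not in (5, 6) or parts[0] != "urn" or parts[1] != "cts":
        raise ValueError(f"not a CTS URN in the expected form: {urn!r}")
    namespace, textgroup, work = parts[2], parts[3], parts[4]
    start = end = None
    if len(parts) == 6:
        # A passage is a single citation ("20.249") or a range ("20.131-20.137").
        start, _, end = parts[5].partition("-")
        end = end or start
    return CtsUrn(namespace, textgroup, work, start, end)


ref = parse_cts_urn("urn:cts:greekLit:tlg0012:tlg001:20.131-20.137")
print(ref)
```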

The Microformat in action

                   • Get some valid microformatted references
                   • Tag resources from a popular review with URNs
                     instead of simple tags
                   • Make the browser aware of microformatted
                     content by adding support for the CTSreference MF
                     to the Operator extension for Firefox
                   • Add example actions to perform upon each
                     MF:
                       – find pertinent bookmarks on del.icio.us
                       – search for pertinent research articles on CiteULike
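
Before any action can be offered, the client has to recognize the microformatted references in a page. The sketch below shows that recognition step outside the browser, using Python's standard `html.parser`. It assumes, purely for illustration, that a reference is an element with class `ctsreference` whose `title` attribute holds the URN; the actual markup expected by the Operator script may differ.

```python
from html.parser import HTMLParser


class CtsReferenceExtractor(HTMLParser):
    """Collect URNs from elements marked with the assumed class name."""

    def __init__(self):
        super().__init__()
        self.urns = []

    def handle_starttag(self, tag, attrs):
        attributes = dict(attrs)
        classes = (attributes.get("class") or "").split()
        # "ctsreference" is a hypothetical class name for this sketch.
        if "ctsreference" in classes and attributes.get("title"):
            self.urns.append(attributes["title"])


page = (
    '<p>See <abbr class="ctsreference" '
    'title="urn:cts:greekLit:tlg0012:tlg001:20.249">Hom. Il. 20.249</abbr> '
    'and <abbr title="Homer">Hom.</abbr></p>'
)
extractor = CtsReferenceExtractor()
extractor.feed(page)
print(extractor.urns)
```

Each extracted URN can then serve as the key for an action, for example a tag lookup on a bookmarking service.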



                   [Screenshot with callouts: green icons mean that Operator
                   is working; recognized microformats; available actions;
                   some microformatted references]
Benefits for scholarship on Ancient Languages
               • Citations encoded with a MF express references
                 in a form:
                    – Cross-language
                    – Fully semantic, interoperable
                    – reusable
               • The reference linking system produced is:
                    – Open (client-side based)
                    – Independent from specific solutions
               • Microformatted references allow:
                    – targeted search -> more precise Information Retrieval
                      tools (Pingerati: microformats search engine provided by
                      developers at Technorati)
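
The cross-language and targeted-search benefits follow from using the URN, rather than the printed string, as the search key. The sketch below illustrates this with an in-memory index; the document identifiers and language codes are invented for the example.

```python
from collections import defaultdict

# Invented sample data: three articles in different languages.
documents = [
    {"id": "article-en", "lang": "en",
     "urns": ["urn:cts:greekLit:tlg0012:tlg001:20.249"]},
    {"id": "article-it", "lang": "it",
     "urns": ["urn:cts:greekLit:tlg0012:tlg001:20.249"]},
    {"id": "article-fr", "lang": "fr",
     "urns": ["urn:cts:greekLit:tlg0012:tlg001:1.1"]},
]


def build_index(docs):
    """Map each cited URN to the identifiers of the documents citing it."""
    index = defaultdict(list)
    for doc in docs:
        for urn in doc["urns"]:
            index[urn].append(doc["id"])
    return index


index = build_index(documents)
hits = index["urn:cts:greekLit:tlg0012:tlg001:20.249"]
print(hits)
```

The English and Italian articles are both retrieved by one query, whatever abbreviation or author name form each of them prints.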


TODOs

                   • Discussion on Microformats' mailing lists and wiki
                   • Advocacy and support by real projects
                   • Support of a digital library built upon CTS protocol
                   • URNs as semantic tags and keywords in metadata
                     descriptions
                   • Tools for easy authoring
                   • Webservices taking advantage of such a MF:
                         – An application that manages and exports references with
                           several output formats to desktop applications
                         – harvester of CTS repositories




References

               •   John Allsopp, Microformats: Empowering Your Markup for
                   Web 2.0, Berkeley, CA: friends of ed.; New York:
                   distributed to the book trade by Springer Verlag, 2007.

               •   Neel Smith, “TextServer: Toward a Protocol for Describing
                   Libraries”, Classics@ vol. 2, edition of April 3, 2004.

               •   G. Crane et al., “Beyond digital incunabula: Modeling the
                   next generation of digital libraries”, Proceedings of the 10th
                   European Conference on Research and Advanced
                   Technology for Digital Libraries (ECDL 2006), vol. 4172.

               •   The Canonical Text Services (CTS) Protocol, current version:
                   1.1 <http://katoptron.holycross.edu/cocoon/diginc/specs/cts>

               •   The Registry Services Protocol, current version: 1.0.rc1
                   <http://katoptron.holycross.edu/cocoon/diginc/specs/registry>
Matteo Romanello   Electronic Corpora for Ancient Languages - Prague, November 16th -17th 2007   32/32

More Related Content

Similar to M.Romanello Ecal Presentation

Electronic literature and its place in digital library
Electronic literature and its place in digital libraryElectronic literature and its place in digital library
Electronic literature and its place in digital libraryAlexandr Belov
 
Europeana. A Digital Library for the Humanities?
Europeana. A Digital Library for the Humanities?Europeana. A Digital Library for the Humanities?
Europeana. A Digital Library for the Humanities?AubreyMcFato
 
Building Heterogeneous Networks of Digital Libraries on the Semantic Web
Building Heterogeneous Networks of Digital Libraries on the Semantic WebBuilding Heterogeneous Networks of Digital Libraries on the Semantic Web
Building Heterogeneous Networks of Digital Libraries on the Semantic WebSebastian Ryszard Kruk
 
Structured and Unstructured:Extracting Information From Classics Scholarly Texts
Structured and Unstructured:Extracting Information From Classics Scholarly TextsStructured and Unstructured:Extracting Information From Classics Scholarly Texts
Structured and Unstructured:Extracting Information From Classics Scholarly TextsMatteo Romanello
 
EuropeanaConnect - Enhancing User Access to European Digital Heritage
EuropeanaConnect - Enhancing User Access to European Digital HeritageEuropeanaConnect - Enhancing User Access to European Digital Heritage
EuropeanaConnect - Enhancing User Access to European Digital HeritageMax Kaiser
 
Esad 12may2010
Esad 12may2010Esad 12may2010
Esad 12may2010Anna Ashton
 
JeromeDL - the Semantic Digital Library
JeromeDL - the Semantic Digital LibraryJeromeDL - the Semantic Digital Library
JeromeDL - the Semantic Digital LibrarySebastian Ryszard Kruk
 
The Online-Life of Media Art-Archives
The Online-Life of Media Art-ArchivesThe Online-Life of Media Art-Archives
The Online-Life of Media Art-Archivesfwiencek
 
week6 translating media
week6 translating mediaweek6 translating media
week6 translating mediaeennyyaa
 
Dimensions of Media Object Comprehensibility
Dimensions of Media Object ComprehensibilityDimensions of Media Object Comprehensibility
Dimensions of Media Object ComprehensibilityLawrie Hunter
 
Bohairic Thematic Lexicon
Bohairic Thematic Lexicon Bohairic Thematic Lexicon
Bohairic Thematic Lexicon HalelyHarel
 
Olaf Janssen on the principles of large-scale digital libraries and their app...
Olaf Janssen on the principles of large-scale digital libraries and their app...Olaf Janssen on the principles of large-scale digital libraries and their app...
Olaf Janssen on the principles of large-scale digital libraries and their app...Olaf Janssen
 
Tutorial on Semantic Digital Libraries (ESWC'2007)
Tutorial on Semantic Digital Libraries (ESWC'2007)Tutorial on Semantic Digital Libraries (ESWC'2007)
Tutorial on Semantic Digital Libraries (ESWC'2007)Sebastian Ryszard Kruk
 
Evaluating SharedCanvas in CATCHPlus
Evaluating SharedCanvas in CATCHPlusEvaluating SharedCanvas in CATCHPlus
Evaluating SharedCanvas in CATCHPlusRobert Sanderson
 
Semantic Web and Linked Data for cultural heritage materials - Approaches in ...
Semantic Web and Linked Data for cultural heritage materials - Approaches in ...Semantic Web and Linked Data for cultural heritage materials - Approaches in ...
Semantic Web and Linked Data for cultural heritage materials - Approaches in ...Antoine Isaac
 
Europeana and the relevance of the DM2E results
Europeana and the relevance of the DM2E resultsEuropeana and the relevance of the DM2E results
Europeana and the relevance of the DM2E resultsAntoine Isaac
 

Similar to M.Romanello Ecal Presentation (20)

Electronic literature and its place in digital library
Electronic literature and its place in digital libraryElectronic literature and its place in digital library
Electronic literature and its place in digital library
 
Christiane Paul - Arquivando o contexto: estratégias de preservação para a ne...
Christiane Paul - Arquivando o contexto: estratégias de preservação para a ne...Christiane Paul - Arquivando o contexto: estratégias de preservação para a ne...
Christiane Paul - Arquivando o contexto: estratégias de preservação para a ne...
 
Europeana. A Digital Library for the Humanities?
Europeana. A Digital Library for the Humanities?Europeana. A Digital Library for the Humanities?
Europeana. A Digital Library for the Humanities?
 
Building Heterogeneous Networks of Digital Libraries on the Semantic Web
Building Heterogeneous Networks of Digital Libraries on the Semantic WebBuilding Heterogeneous Networks of Digital Libraries on the Semantic Web
Building Heterogeneous Networks of Digital Libraries on the Semantic Web
 
Structured and Unstructured:Extracting Information From Classics Scholarly Texts
Structured and Unstructured:Extracting Information From Classics Scholarly TextsStructured and Unstructured:Extracting Information From Classics Scholarly Texts
Structured and Unstructured:Extracting Information From Classics Scholarly Texts
 
EuropeanaConnect - Enhancing User Access to European Digital Heritage
EuropeanaConnect - Enhancing User Access to European Digital HeritageEuropeanaConnect - Enhancing User Access to European Digital Heritage
EuropeanaConnect - Enhancing User Access to European Digital Heritage
 
Esad 12may2010
Esad 12may2010Esad 12may2010
Esad 12may2010
 
JeromeDL - the Semantic Digital Library
JeromeDL - the Semantic Digital LibraryJeromeDL - the Semantic Digital Library
JeromeDL - the Semantic Digital Library
 
The Online-Life of Media Art-Archives
The Online-Life of Media Art-ArchivesThe Online-Life of Media Art-Archives
The Online-Life of Media Art-Archives
 
week6 translating media
week6 translating mediaweek6 translating media
week6 translating media
 
Irish Digital Libraries Summit
Irish Digital Libraries SummitIrish Digital Libraries Summit
Irish Digital Libraries Summit
 
Dimensions of Media Object Comprehensibility
Dimensions of Media Object ComprehensibilityDimensions of Media Object Comprehensibility
Dimensions of Media Object Comprehensibility
 
Bohairic Thematic Lexicon
Bohairic Thematic Lexicon Bohairic Thematic Lexicon
Bohairic Thematic Lexicon
 
Olaf Janssen on the principles of large-scale digital libraries and their app...
Olaf Janssen on the principles of large-scale digital libraries and their app...Olaf Janssen on the principles of large-scale digital libraries and their app...
Olaf Janssen on the principles of large-scale digital libraries and their app...
 
Tutorial on Semantic Digital Libraries (ESWC'2007)
Tutorial on Semantic Digital Libraries (ESWC'2007)Tutorial on Semantic Digital Libraries (ESWC'2007)
Tutorial on Semantic Digital Libraries (ESWC'2007)
 
Genre12
Genre12Genre12
Genre12
 
Evaluating SharedCanvas in CATCHPlus
Evaluating SharedCanvas in CATCHPlusEvaluating SharedCanvas in CATCHPlus
Evaluating SharedCanvas in CATCHPlus
 
Semantic Web and Linked Data for cultural heritage materials - Approaches in ...
Semantic Web and Linked Data for cultural heritage materials - Approaches in ...Semantic Web and Linked Data for cultural heritage materials - Approaches in ...
Semantic Web and Linked Data for cultural heritage materials - Approaches in ...
 
Europeana and the relevance of the DM2E results
Europeana and the relevance of the DM2E resultsEuropeana and the relevance of the DM2E results
Europeana and the relevance of the DM2E results
 
03 isaac dm2-e14-full
03 isaac dm2-e14-full03 isaac dm2-e14-full
03 isaac dm2-e14-full
 

More from Matteo Romanello

Towards the Automatic Retrieval of Cited Parallel Passages from Secondary Lit...
Towards the Automatic Retrieval of Cited Parallel Passages from Secondary Lit...Towards the Automatic Retrieval of Cited Parallel Passages from Secondary Lit...
Towards the Automatic Retrieval of Cited Parallel Passages from Secondary Lit...Matteo Romanello
 
Scaling up the Extraction of Canonical Citations in Classics
Scaling up the Extraction of Canonical Citations in ClassicsScaling up the Extraction of Canonical Citations in Classics
Scaling up the Extraction of Canonical Citations in ClassicsMatteo Romanello
 
Transforming Indexes Locorum into Citation Networks
Transforming Indexes Locorum into Citation NetworksTransforming Indexes Locorum into Citation Networks
Transforming Indexes Locorum into Citation NetworksMatteo Romanello
 
Enhancing and Extending the Digital Study of Intertextuality (pt. 2): Reveali...
Enhancing and Extending the Digital Study of Intertextuality (pt. 2): Reveali...Enhancing and Extending the Digital Study of Intertextuality (pt. 2): Reveali...
Enhancing and Extending the Digital Study of Intertextuality (pt. 2): Reveali...Matteo Romanello
 
Introduction to the Text Reuse panel at DH 2014
Introduction to the Text Reuse panel at DH 2014Introduction to the Text Reuse panel at DH 2014
Introduction to the Text Reuse panel at DH 2014Matteo Romanello
 
Exploring Citation Networks to Study Intertextuality in Classics
Exploring Citation Networks to Study Intertextuality in ClassicsExploring Citation Networks to Study Intertextuality in Classics
Exploring Citation Networks to Study Intertextuality in ClassicsMatteo Romanello
 
DARIAH Geo-browser: Exploring Data through Time and Space
DARIAH Geo-browser: Exploring Data through Time and SpaceDARIAH Geo-browser: Exploring Data through Time and Space
DARIAH Geo-browser: Exploring Data through Time and SpaceMatteo Romanello
 
Greedy Enough for the Grid?
Greedy Enough for the Grid?Greedy Enough for the Grid?
Greedy Enough for the Grid?Matteo Romanello
 
Stuctured Vs Unstructured: Extracting Information from Classics Scholarly Texts
Stuctured Vs Unstructured: Extracting Information from Classics Scholarly TextsStuctured Vs Unstructured: Extracting Information from Classics Scholarly Texts
Stuctured Vs Unstructured: Extracting Information from Classics Scholarly TextsMatteo Romanello
 
[poster] Extracting Information From Classics Scholarly Texts
[poster] Extracting Information From Classics Scholarly Texts[poster] Extracting Information From Classics Scholarly Texts
[poster] Extracting Information From Classics Scholarly TextsMatteo Romanello
 
DIGITAL HUMANITIES E FILOLOGIA Un'introduzione
DIGITAL HUMANITIES   E FILOLOGIA   Un'introduzioneDIGITAL HUMANITIES   E FILOLOGIA   Un'introduzione
DIGITAL HUMANITIES E FILOLOGIA Un'introduzioneMatteo Romanello
 
Rethinking Critical Editions of Fragments by Ontologies
Rethinking Critical Editions of Fragments by OntologiesRethinking Critical Editions of Fragments by Ontologies
Rethinking Critical Editions of Fragments by OntologiesMatteo Romanello
 
Presentatio @ ELPUB 2008, Toronto
Presentatio @ ELPUB 2008, TorontoPresentatio @ ELPUB 2008, Toronto
Presentatio @ ELPUB 2008, TorontoMatteo Romanello
 
M. Romanello, E-scholia: scenari digitali per la comunicazione scientifica in...
M. Romanello, E-scholia: scenari digitali per la comunicazione scientifica in...M. Romanello, E-scholia: scenari digitali per la comunicazione scientifica in...
M. Romanello, E-scholia: scenari digitali per la comunicazione scientifica in...Matteo Romanello
 


M.Romanello Ecal Presentation

  • 1. Linking Text References to Relevant Digital Resources Over The Web. Matteo Romanello (matteo.romanello@yahoo.it), University “Ca' Foscari” of Venice. Electronic Corpora for Ancient Languages, Prague, November 16th-17th 2007
  • 2. A Microformat for Canonical Texts References • Topic: how to link secondary sources to corpora of ancient-language texts? • Goal: to give scholars reading the Digital Library's primary and secondary sources more powerful research tools and a richer reading experience • Focus: references to Canonical Texts in XHTML • Scope of the examples: Classical (Greek and Latin) literature (Matteo Romanello, Electronic Corpora for Ancient Languages - Prague, November 16th-17th 2007)
  • 3. Digital Library on Classics: the State of the Art • A few on-line secondary sources (journal articles and monographs) available as (X)HTML • A few authoritative, born-digital on-line journals, e.g. Classics@, published by Harvard's Center for Hellenic Studies • Some on-line text corpora (Perseus and other minor, scattered collections) • Some resources for humanists, reviews of electronic resources, reviews of books... • Research blogs
  • 4. Current e-scholarship scenarios (1) Scenario 1: John is a scholar of Greek literature and wants to find all on-line articles or electronic resources related to the verse he is focusing on (Hom. Il. 20.249). He submits to Google a query like 'Hom. Il. 20.249', and what Google retrieves is not pertinent or interesting. Ordinary search engines are purely text-based (no semantics, language-dependent, etc.). John would like to have a more precise or specialized search engine available, perhaps one capable of understanding the semantics of the reference he typed in as the query string. [Figure: the author of the Iliad and the Odyssey under different names: Homer, Homère, Omero, ...]
  • 5. Current e-scholarship scenarios (2) Scenario 2: John's colleague points out to him that Gregory Nagy, in a passage of the 2nd chapter, mentions the passage John is interested in. John finds an on-line version of the book and opens it in his browser...
  • 6. Current e-scholarship scenarios (3) To have a meaningful e-reading experience, John should be able to read the cited verse in its context, compare the text of that verse as recorded in different manuscripts, read the same passage in a given translation, or read a commentary on it.
  • 7. New e-scholarship scenarios (1) • Semantic understanding of text references by the web browser • Search for resources pertinent to the author, the work or the precise text passage referred to
  • 8. New e-scholarship scenarios (2) • Value-added services (VAS) for scholars – Reference linking – Related resources – Targeted and semantic-oriented search – Different exemplars of a work • Problems: 1) To build a distributed library 2) To provide VAS linking secondary to primary sources
  • 9. From printed to digital libraries • Find new constructive paradigms to take advantage of the net's properties • In a network environment: – A library that is universally distributed and has higher granularity – Provide reference linking • Reference linking to primary sources (from references in secondary sources): – E.g. move from the citation Hom. Il. 1.1 to all available translations, compare critical editions and find related resources
  • 10. The evolution of ancient languages corpora • TLG (1970s) -> mass storage and rapid retrieval • Perseus (1980s) -> richer media and higher-level data structures • DLs + web protocol -> convergence of – XML-related technologies: • TEI (encoding) • XML DB (storage of structured data) • Query capabilities over the HTTP protocol – Web services communication over REST – Success of a distributed architecture (cf. OAI-PMH) Which protocol? The Canonical Text Services protocol
  • 11. A new paradigm for building on-line corpora: the CTS protocol (1) • CTS web protocol: – a new paradigm for building electronic corpora – gives hierarchical access to works as XML-TEI files – relies on the model described by FRBR – developed by Neel Smith et al. at Harvard's CHS – built on the Registry Services Protocol (v. 1.0.rc1) -> authority lists • Some CTS-related projects: – Perseus' CTS interface – Multitext Homer
  • 12. A new paradigm for building on-line corpora: the CTS protocol (2) • CTS-compliant text server • Texts: XML TEI • Textgroups and works are identified by URNs • Collections are described by authority lists
  • 13. Reference Linking in the Digital Library
  • 14. Linking primary to secondary sources on-line: state of the art • Two very loosely coupled systems • No born-digital equivalent to printed references • Most projects use an internal linking system: – A worthy degree of hypertextuality – Fairly closed systems of hard-linked resources • Digital references == strings – No semantic information – No semantics-aware information processing – Disambiguation of abbreviations and implicit statements is left to the reader
  • 15. A digital companion to printed canonical texts references • Problem: provide a digital companion to printed references – express references in a simple and semantic way • exploiting the opportunities given by the digital medium • separating semantics from presentational matters • Solution: – map references to requests compliant with the protocol used to build a distributed library (CTS) – embed chunks of semantic information within XHTML docs • Implementation: Microformats (from Web 2.0) • Goal: to design a Microformat for Canonical Text references
  • 16. Microformats or RDF? • MFs = a bottom-up way to the Semantic Web (real-world semantics, or lower-case semantic web) • Used within blogs for friendships, geographical data, reviews... • Firefox 3 -> native support for Microformats (microformatted content display integrated in the UI) • Not the only way to embed metadata inside common tag elements – RDFa <http://www.w3.org/TR/xhtml-rdfa-primer/> proposed by W3C
  • 17. Microformats vs RDF
  • 18. Microformats or RDF?
  • 19. Microformats or RDF?
  • 20. Microformats: definition • Microformats are: – XHTML (POSH) compounds – A set of design principles for formats – A set of simple open data formats built upon existing and widely adopted standards • Microformats are not: – A new language – An attempt to change everyone's current behavior • Goals: – Make data reusable and interoperable among web services and mashup applications
  • 21. Text references: different use cases 1. Politics 2. as Aristotle claims 3. Politics of Aristotle 4. Arist. Pol. 1304B 5. Line 1 of the first book of Homer's Iliad 6. Hom. Il. I 1 7. Α 1 (== upper-case Alpha 1, Hellenistic book notation)
  • 22. Designing a MF for Canonical Texts References (1) • Start from a specific problem (principle #1) – Problem: link secondary to primary sources on the web • Reuse building blocks from widely adopted standards (princ. #4) – Canonical text citation scheme widely used among scholars of Classical literature – Canon of Greek Literature provided as an authority list compliant with the Registry Services Protocol • “Paving the Cowpaths” – keep references looking the same as they do now – in addition, add semantics to references – also allow internal linking systems
  • 23. Designing a MF for Canonical Texts References (2) • Modularity and embeddability (princ. #5) 1. MF for authors 2. MF for works 3. MF for text references
  • 24. Designing a MF for Canonical Texts References (3) • Reference appearance • Reference's underlying microformatted content: urn:cts:greekLit:tlg0012:tlg001:20.131-20.137
  • 25. The Microformat in action • Get some valid microformatted references • Tag resources from a popular review with URNs instead of simple tags • Make the browser aware of microformatted content by adding support for the CTSreference MF to the Operator extension for Firefox • Add example actions to perform upon each MF: – find pertinent bookmarks on del.icio.us – search for pertinent research articles on CiteULike
  • 26. The Microformat in action [Screenshot callouts: green icons mean that Operator is working; recognized microformats; available actions; some microformatted references]
  • 27. The Microformat in action
  • 28. The Microformat in action
  • 29. The Microformat in action
  • 30. Benefits for scholarship on Ancient Languages • Citations encoded with a MF express references in a form that is: – Cross-language – Fully semantic, interoperable – Reusable • The reference linking system produced is: – Open (client-side based) – Independent of specific solutions • Microformatted references allow: – targeted search -> more precise Information Retrieval tools (Pingerati: a microformats search engine provided by developers at Technorati)
  • 31. TODOs • Discussion on the Microformats mailing lists and wiki • Advocacy and support by real projects • Support of a digital library built upon the CTS protocol • URNs as semantic tags and keywords in metadata descriptions • Tools for easy authoring • Web services taking advantage of such a MF: – An application that manages references and exports them in several output formats to desktop applications – A harvester of CTS repositories
  • 32. References • John Allsopp, Microformats: Empowering Your Markup for Web 2.0, Berkeley, CA: friends of ED; New York: distributed to the book trade by Springer Verlag, 2007 • Neel Smith, “TextServer: Toward a Protocol for Describing Libraries”, Classics@ vol. 2, edition of April 3, 2004 • G. Crane et al., “Beyond digital incunabula: Modeling the next generation of digital libraries”, Proceedings of the 10th European Conference on Research and Advanced Technology for Digital Libraries (ECDL 2006), vol. 4172 • The Canonical Text Services (CTS) Protocol, current version: 1.1 <http://katoptron.holycross.edu/cocoon/diginc/specs/cts> • The Registry Services Protocol, current version: 1.0.rc1 <http://katoptron.holycross.edu/cocoon/diginc/specs/registry>
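
Slide 15 proposes mapping references to requests compliant with the CTS protocol, and slide 24 shows the URN that underlies a displayed reference. The Python sketch below illustrates that mapping under stated assumptions and is not the author's implementation. The URN layout is the colon-separated one shown on slide 24. The `CtsReference` type and both function names are hypothetical. The base URL `http://example.org/cts` is a placeholder. The `request=GetPassage` and `urn` query parameters are assumed from published versions of CTS and may differ from the version 1.1 draft listed in the references.

```python
from dataclasses import dataclass
from urllib.parse import urlencode


@dataclass(frozen=True)
class CtsReference:
    """Hypothetical container for the parts of a CTS URN (names are illustrative)."""
    namespace: str  # e.g. "greekLit"
    textgroup: str  # e.g. "tlg0012"
    work: str       # e.g. "tlg001"
    start: str      # first cited passage, e.g. "20.131"
    end: str        # last cited passage; equals start for a single passage


def parse_cts_urn(urn: str) -> CtsReference:
    """Split a URN laid out as on slide 24: urn:cts:namespace:textgroup:work:passage."""
    parts = urn.split(":")
    if len(parts) != 6 or parts[0] != "urn" or parts[1] != "cts":
        raise ValueError(f"not a CTS URN in the expected layout: {urn!r}")
    namespace, textgroup, work, passage = parts[2:]
    # A range is written "start-end"; a single passage has no hyphen.
    start, _, end = passage.partition("-")
    return CtsReference(namespace, textgroup, work, start, end or start)


def passage_request_url(base_url: str, urn: str) -> str:
    """Build a request URL for the cited passage (parameter names are assumptions)."""
    parse_cts_urn(urn)  # reject strings that are not URNs before building the URL
    query = urlencode({"request": "GetPassage", "urn": urn}, safe=":")
    return f"{base_url}?{query}"


if __name__ == "__main__":
    urn = "urn:cts:greekLit:tlg0012:tlg001:20.131-20.137"
    print(parse_cts_urn(urn))
    print(passage_request_url("http://example.org/cts", urn))
```

Keeping the parser separate from the URL builder reflects the split between slides 24 and 15: the URN carries the semantics of the reference, while the request is only one of several actions a client could perform on it.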