Florentina Armaselu – DHLab, Centre virtuel de la connaissance sur l’Europe (CVCE),
Luxembourg
florentina.armaselu@cvce.eu
www.cvce.eu
From a Small-Scale Digital Edition to a TEI Publication Framework in Modern European History
Text Encoding Initiative (TEI) Conference and Members’ Meeting: Connect, Animate, Innovate. 28 to 31 October 2015, Université Lumière Lyon 2.
Summary
1. The WEU-DIPLO pilot project
2. Transviewer, towards a TEI publication framework
3. Discussion
4. References
Part I
The WEU-DIPLO pilot project
Overview of the WEU-DIPLO project
1. Goal: XML-TEI encoding, corpus analysis and Web publication of institutional documents of the W.E.U. (Western European Union):
• Topics: armament production, standardization, control in the period from 1954 to 1982;
• Source: Archives nationales de Luxembourg, W.E.U. collection.
2. Initial format:
• digitized versions (JPG) of typewritten materials (one file per page).
3. Size:
                  Number of documents              Number of pages
Category          Total   EN    FR    FR proc.*    Total   EN    FR    FR proc.*
Note              89      43    46    37           395     191   204   155
Minutes           30      15    15    15           256     138   118   118
Memorandum        3       1     2     2            16      7     9     9
Study             2       0     2     1            12      0     12    8
Discourse         1       0     1     0            4       0     4     0
Draft protocol    2       1     1     0            4       2     2     0
Total             127     60    67    55           687     338   349   290
*proc. = processed
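The figures in the size table can be cross-checked with a few lines of Python. This is only a sanity-check sketch: the variable and function names are illustrative, and the numbers are copied from the table as given.

```python
# Columns: documents, EN, FR, FR processed, pages, EN, FR, FR processed
ROWS = {
    "Note":           (89, 43, 46, 37, 395, 191, 204, 155),
    "Minutes":        (30, 15, 15, 15, 256, 138, 118, 118),
    "Memorandum":     (3, 1, 2, 2, 16, 7, 9, 9),
    "Study":          (2, 0, 2, 1, 12, 0, 12, 8),
    "Discourse":      (1, 0, 1, 0, 4, 0, 4, 0),
    "Draft protocol": (2, 1, 1, 0, 4, 2, 2, 0),
}
TOTAL = (127, 60, 67, 55, 687, 338, 349, 290)


def column_sums(rows):
    """Sum each column across all document categories."""
    return tuple(sum(col) for col in zip(*rows.values()))


def processed_share(processed, available):
    """Fraction of the French material that was processed."""
    return processed / available


sums = column_sums(ROWS)
doc_share = processed_share(TOTAL[3], TOTAL[2])    # 55 of 67 French documents
page_share = processed_share(TOTAL[7], TOTAL[6])   # 290 of 349 French pages
print(sums == TOTAL, f"{doc_share:.1%}", f"{page_share:.1%}")
```

On these figures the category rows add up to the stated totals, and roughly 82% of the French documents and 83% of the French pages were processed.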
Overview of the WEU-DIPLO project: workflow
Overview of the WEU-DIPLO project: page structure. ©WEU-UEO
Figure labels: Header, Content, Footer
Microsoft Word Styling – WEU-DIPLO
Figure labels: Headers, footers; Headings, line breaks, paragraphs
Conversion and enrichment (XSLT, manual, NER)
OxGarage (DOCX to TEI P5)
oXygen XML Editor
• XSLT transformation (metadata, structure);
• manual enrichment (semantics – discourse of country/institutional representatives)
GATE (Named Entity Recognition)
• training phase (Gazetteer List Collector)
• annotation phase (names of persons, organisations, places, functions, events, products; dates)
oXygen XML Editor
• XSLT (GATE XML to TEI P5 transformation)
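The last step, turning named-entity annotations into TEI markup, was done with XSLT in the project. As a rough illustration of the same renaming idea, the Python sketch below rewrites inline annotation elements as TEI name elements. The input shape (inline `Person`, `Organization`, `Location` and `Date` elements with a `gateId` attribute), the mapping table and the sample sentence are all assumptions made for the example, not the project's actual data or stylesheet; the TEI namespace is omitted for brevity.

```python
import xml.etree.ElementTree as ET

# Assumed mapping from annotation type to TEI element (illustrative only).
TYPE_TO_TEI = {
    "Person": "persName",
    "Organization": "orgName",
    "Location": "placeName",
    "Date": "date",
}


def annotations_to_tei(xml_text):
    """Rename inline annotation elements to their assumed TEI counterparts."""
    root = ET.fromstring(xml_text)
    for elem in root.iter():
        if elem.tag in TYPE_TO_TEI:
            elem.tag = TYPE_TO_TEI[elem.tag]
            elem.attrib.clear()  # drop tool-specific attributes such as gateId
    return ET.tostring(root, encoding="unicode")


# Invented sample sentence, not taken from the W.E.U. corpus.
sample = (
    '<p>Note by <Person gateId="1">Mr Smith</Person> of the '
    '<Organization gateId="2">Western European Union</Organization>, '
    '<Date gateId="3">12 May 1955</Date>.</p>'
)
print(annotations_to_tei(sample))
```

Entity types listed on the slide but absent from the mapping (functions, events, products) would need their own target elements, which this sketch deliberately leaves open.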
XML-TEI Encoding: WEU-DIPLO – metadata; layout (header). ©WEU-UEO
Figure labels: @@hAuthor, @@hArchNum, @@hStampConfid, @@hDocRef, @@hOrigDate, @@hOrigLang, @@hVersion
XML-TEI Encoding: WEU-DIPLO – structure (headings, paragraphs, line breaks); semantics (named entities, discourse). ©WEU-UEO
Figure labels: @@Heading2, @@Paragraph, @@LineBreak, @@Names, @@Discourse
XML-TEI Encoding: WEU-DIPLO – transcription features (Pierazzo, 2011)
Part II
Transviewer, towards a TEI
publication framework
Context: The CVCE’s ePublications
• Treaties; official declarations and meeting reports; letters; notes; press articles; images, video and audio archives related to European integration history
Transviewer overview
1. Transviewer concept:
• XML-TEI transformation/visualisation on the fly, in the browser;
• flexible framework for the publication of XML-TEI documents in European integration history.
2. Technologies:
• XML, HTML, XSLT, CSS and JavaScript
3. Tested platforms:
• EVT (Edition Visualization Technology): http://sourceforge.net/projects/evt-project/
• KILN: http://kiln.readthedocs.org/en/latest/#
• TEIBoilerplate: http://dcl.ils.indiana.edu/teibp/
• Versioning Machine: http://v-machine.org/
• XTF (eXtensible Text Framework): http://xtf.cdlib.org/about/
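The "on the fly" idea in the Transviewer concept means the TEI source is rendered to HTML at viewing time, with no stored intermediate file. The prototype does this in the browser with XSLT; the Python sketch below only stands in for that step to show the principle. The element-to-HTML mapping and the sample document are assumptions made for illustration.

```python
import xml.etree.ElementTree as ET
from html import escape

TEI_NS = "{http://www.tei-c.org/ns/1.0}"


def render(elem):
    """Recursively render one TEI element as an HTML string."""
    tag = elem.tag.replace(TEI_NS, "")
    inner = escape(elem.text or "")
    for child in elem:
        # Text following a child element belongs to the parent's content.
        inner += render(child) + escape(child.tail or "")
    if tag == "lb":
        return "<br/>"
    if tag == "head":
        return f"<h2>{inner}</h2>"
    if tag == "p":
        return f"<p>{inner}</p>"
    if tag in ("persName", "orgName", "placeName"):
        return f'<span class="{tag}">{inner}</span>'
    return inner  # unknown elements: keep their rendered content only


def tei_to_html(tei_xml):
    """Render the body of a TEI document (or the whole tree if no body)."""
    root = ET.fromstring(tei_xml)
    body = root.find(f".//{TEI_NS}body")
    return render(body if body is not None else root)


# Invented sample, not taken from the corpus.
sample = (
    '<TEI xmlns="http://www.tei-c.org/ns/1.0"><text><body>'
    "<head>Note</head>"
    "<p>Statement by <orgName>W.E.U.</orgName><lb/>on standardization.</p>"
    "</body></text></TEI>"
)
print(tei_to_html(sample))
```

Because the HTML is derived from the XML each time it is requested, a correction made in the TEI file shows up in the rendered view without any separate export step.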
Transviewer prototype
Implementation (adaptation and in-house development):
• side-by-side view of digital facsimile and transcription (EVT model)
• third-party libraries:
o BookReader: tool designed to provide online access to scanned books
o Saxon-CE: support for XSLT 2.0 transformation in the browser
• in-house development (configuration, frame and button layout/actions, transcription rendering, calls to third-party libraries)
Transviewer experiments – digital facsimile/transcription side-by-side view. ©WEU-UEO
Transviewer experiments – digital facsimile/transcription side-by-side view. Werner – handwritten notes
Transviewer experiments (simulation) – video/audio and transcription synchronisation. Werner – interviews
Transviewer features – panel layouts
Transviewer features – transcription format
Transviewer features – panel interlinking
Part III
Discussion
“By teaching an edition how to swim, I mean endowing an edition not only with a
store of factual knowledge concerning the work presented, but also with the
capability of dealing gracefully with the mutability of the electronic medium, by
exploiting the possibilities for reader-controlled changes to the edition’s
presentation and by adapting successfully to rapid changes in the hardware and
software environment.” (Sperberg-McQueen, 2009)
1. Transviewer prototype questions:
• flexible enough to support different types of documents in European integration history and different user requirements;
• modular architecture to allow gradual development and customisation according to the needs of the projects;
• balance between manual intervention and automatic processing (XSLT, NER);
• XML transformation on the fly (no need for intermediary formats/steps; changes to the XML are already part of the publication).
3. Issues:
• BookReader – use of an older version of the jQuery library;
• non-uniform browser support for Saxon-CE XSLT 2.0 transformation;
• need for batch conversion to XML-TEI (potential adaptation of OxGarage for batch processing).
4. Ongoing/future work for further development:
• evaluation (technology – technical experts; usability tests – experts in European integration studies);
• development of new modules (multi-panels, audio/video transcription, etc.) and tests with more project samples;
• integration into the existing CVCE website architecture:
o Back End;
o Front End.
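The batch-conversion need listed under Issues could be approached with a small driver loop around whatever DOCX-to-TEI converter is available. The sketch below is hypothetical: `batch_convert` and `placeholder_convert` are names invented here, and the converter is passed in as a plain function, so no particular OxGarage interface is assumed.

```python
from pathlib import Path
from typing import Callable


def batch_convert(src_dir: Path, dst_dir: Path,
                  convert: Callable[[bytes], str]) -> list[Path]:
    """Convert every .docx file in src_dir, writing one .xml file per input."""
    dst_dir.mkdir(parents=True, exist_ok=True)
    written = []
    for docx in sorted(src_dir.glob("*.docx")):
        tei = convert(docx.read_bytes())      # hypothetical converter call
        out = dst_dir / (docx.stem + ".xml")
        out.write_text(tei, encoding="utf-8")
        written.append(out)
    return written


def placeholder_convert(data: bytes) -> str:
    """Stand-in for a real DOCX-to-TEI conversion; returns a stub document."""
    return f"<TEI><!-- {len(data)} bytes converted --></TEI>"
```

Calling `batch_convert(Path("docx"), Path("tei"), placeholder_convert)` would write one stub XML file per DOCX file; a real converter function would replace the placeholder.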
Thank you!
Scaling in a publication framework would imply not only
teaching your editions “how to swim” but also how to swim
together.
References
• BookReader: https://openlibrary.org/dev/docs/bookreader
• EVT (Edition Visualization Technology): http://sourceforge.net/projects/evt-project/
• GATE: https://gate.ac.uk/
• KILN: http://kiln.readthedocs.org/en/latest/#
• OxGarage: http://www.tei-c.org/oxgarage/
• Pierazzo, Elena. (2011). A rationale of digital documentary editions. In LLC. The Journal of Digital Scholarship in the Humanities, Vol. 26, No. 4, December 2011, pp. 463-477.
• http://www.scholarlyediting.org/2014/essays/essay.pierazzo.html
• Saxon-CE: http://www.saxonica.com/ce/user-doc/1.1/index.html
• Sperberg-McQueen, C.M. (2009). “How to teach your edition how to swim”. In LLC. The Journal of Digital Scholarship in the Humanities, Vol. 24, No. 1, April 2009. Oxford Journals.
• TEI (Text Encoding Initiative): http://www.tei-c.org
• TEIBoilerplate: http://dcl.ils.indiana.edu/teibp/
• Versioning Machine: http://v-machine.org/
• XTF (eXtensible Text Framework): http://xtf.cdlib.org/about/

More Related Content

Similar to TEI Conference - CVCE

12_N.Smolenski, M.Kostic, A.Sofronijevic
12_N.Smolenski, M.Kostic, A.Sofronijevic12_N.Smolenski, M.Kostic, A.Sofronijevic
12_N.Smolenski, M.Kostic, A.Sofronijevic
Nikola Smolenski
 
Crowd wales, Building a crowdsourcing platform for Wales by Paul McCann - Eur...
Crowd wales, Building a crowdsourcing platform for Wales by Paul McCann - Eur...Crowd wales, Building a crowdsourcing platform for Wales by Paul McCann - Eur...
Crowd wales, Building a crowdsourcing platform for Wales by Paul McCann - Eur...
Europeana
 
F/LOSS in Norwegian libraries
F/LOSS in Norwegian librariesF/LOSS in Norwegian libraries
F/LOSS in Norwegian libraries
Libriotech
 
Semtech web-protege-tutorial
Semtech web-protege-tutorialSemtech web-protege-tutorial
Semtech web-protege-tutorial
matthewhorridge
 
Enabling accessible multimedia for Moodle: iMoot 2010
Enabling accessible multimedia for Moodle: iMoot 2010Enabling accessible multimedia for Moodle: iMoot 2010
Enabling accessible multimedia for Moodle: iMoot 2010
Nick Freear
 
Getty Presentation of IMA/AIC OSCI tool
Getty Presentation of IMA/AIC OSCI toolGetty Presentation of IMA/AIC OSCI tool
Getty Presentation of IMA/AIC OSCI tool
Robert J. Stein
 
Presentation of the AIC-IMA publishing tool for OSCI
Presentation of the AIC-IMA publishing tool for OSCIPresentation of the AIC-IMA publishing tool for OSCI
Presentation of the AIC-IMA publishing tool for OSCI
Robert J. Stein
 
Squeak
SqueakSqueak
Science Demonstrator Session: Social and Earth Sciences
Science Demonstrator Session: Social and Earth SciencesScience Demonstrator Session: Social and Earth Sciences
Science Demonstrator Session: Social and Earth Sciences
EOSCpilot .eu
 
BL Demo Day - July2011 - (9) IMPACT Interoperability and Evaluation Framework
BL Demo Day - July2011 - (9) IMPACT Interoperability and Evaluation FrameworkBL Demo Day - July2011 - (9) IMPACT Interoperability and Evaluation Framework
BL Demo Day - July2011 - (9) IMPACT Interoperability and Evaluation Framework
IMPACT Centre of Competence
 
Occiglot - Open Language Models by and for Europe
Occiglot - Open Language Models by and for EuropeOcciglot - Open Language Models by and for Europe
Occiglot - Open Language Models by and for Europe
Zilliz
 
Open Access Week 2017: Life Sciences and Open Sciences - worfkflows and tools
Open Access Week 2017: Life Sciences and Open Sciences - worfkflows and toolsOpen Access Week 2017: Life Sciences and Open Sciences - worfkflows and tools
Open Access Week 2017: Life Sciences and Open Sciences - worfkflows and tools
OpenAIRE
 
An overview of The European Library. Olaf Janssen presenting during DRH 2005,...
An overview of The European Library. Olaf Janssen presenting during DRH 2005,...An overview of The European Library. Olaf Janssen presenting during DRH 2005,...
An overview of The European Library. Olaf Janssen presenting during DRH 2005,...
Olaf Janssen
 
Reducing Infrastructure and Service Fragmentation
Reducing Infrastructure and Service Fragmentation Reducing Infrastructure and Service Fragmentation
Reducing Infrastructure and Service Fragmentation
EOSCpilot .eu
 
Bne impact iif
Bne impact iifBne impact iif
WP3 Further specification of Functionality and Interoperability - Gradmann / ...
WP3 Further specification of Functionality and Interoperability - Gradmann / ...WP3 Further specification of Functionality and Interoperability - Gradmann / ...
WP3 Further specification of Functionality and Interoperability - Gradmann / ...
Europeana
 
#T3UXW14 : workspace Team Work
#T3UXW14 : workspace Team Work#T3UXW14 : workspace Team Work
#T3UXW14 : workspace Team Work
Paul Blondiaux
 
XML London 2013 - Architecture of xproc.xq an XProc processor
XML London 2013 - Architecture of xproc.xq an XProc processorXML London 2013 - Architecture of xproc.xq an XProc processor
XML London 2013 - Architecture of xproc.xq an XProc processor
jimfuller2009
 
Europeana Cloud Aggregator Forum 2014
Europeana Cloud Aggregator Forum 2014Europeana Cloud Aggregator Forum 2014
Europeana Cloud Aggregator Forum 2014
Europeana
 
ECLAP White paper, social network for Cultural Heritage on Peforming arts
ECLAP White paper, social network for Cultural Heritage on Peforming artsECLAP White paper, social network for Cultural Heritage on Peforming arts
ECLAP White paper, social network for Cultural Heritage on Peforming arts
Paolo Nesi
 

Similar to TEI Conference - CVCE (20)

12_N.Smolenski, M.Kostic, A.Sofronijevic
12_N.Smolenski, M.Kostic, A.Sofronijevic12_N.Smolenski, M.Kostic, A.Sofronijevic
12_N.Smolenski, M.Kostic, A.Sofronijevic
 
Crowd wales, Building a crowdsourcing platform for Wales by Paul McCann - Eur...
Crowd wales, Building a crowdsourcing platform for Wales by Paul McCann - Eur...Crowd wales, Building a crowdsourcing platform for Wales by Paul McCann - Eur...
Crowd wales, Building a crowdsourcing platform for Wales by Paul McCann - Eur...
 
F/LOSS in Norwegian libraries
F/LOSS in Norwegian librariesF/LOSS in Norwegian libraries
F/LOSS in Norwegian libraries
 
Semtech web-protege-tutorial
Semtech web-protege-tutorialSemtech web-protege-tutorial
Semtech web-protege-tutorial
 
Enabling accessible multimedia for Moodle: iMoot 2010
Enabling accessible multimedia for Moodle: iMoot 2010Enabling accessible multimedia for Moodle: iMoot 2010
Enabling accessible multimedia for Moodle: iMoot 2010
 
Getty Presentation of IMA/AIC OSCI tool
Getty Presentation of IMA/AIC OSCI toolGetty Presentation of IMA/AIC OSCI tool
Getty Presentation of IMA/AIC OSCI tool
 
Presentation of the AIC-IMA publishing tool for OSCI
Presentation of the AIC-IMA publishing tool for OSCIPresentation of the AIC-IMA publishing tool for OSCI
Presentation of the AIC-IMA publishing tool for OSCI
 
Squeak
SqueakSqueak
Squeak
 
Science Demonstrator Session: Social and Earth Sciences
Science Demonstrator Session: Social and Earth SciencesScience Demonstrator Session: Social and Earth Sciences
Science Demonstrator Session: Social and Earth Sciences
 
BL Demo Day - July2011 - (9) IMPACT Interoperability and Evaluation Framework
BL Demo Day - July2011 - (9) IMPACT Interoperability and Evaluation FrameworkBL Demo Day - July2011 - (9) IMPACT Interoperability and Evaluation Framework
BL Demo Day - July2011 - (9) IMPACT Interoperability and Evaluation Framework
 
Occiglot - Open Language Models by and for Europe
Occiglot - Open Language Models by and for EuropeOcciglot - Open Language Models by and for Europe
Occiglot - Open Language Models by and for Europe
 
Open Access Week 2017: Life Sciences and Open Sciences - worfkflows and tools
Open Access Week 2017: Life Sciences and Open Sciences - worfkflows and toolsOpen Access Week 2017: Life Sciences and Open Sciences - worfkflows and tools
Open Access Week 2017: Life Sciences and Open Sciences - worfkflows and tools
 
An overview of The European Library. Olaf Janssen presenting during DRH 2005,...
An overview of The European Library. Olaf Janssen presenting during DRH 2005,...An overview of The European Library. Olaf Janssen presenting during DRH 2005,...
An overview of The European Library. Olaf Janssen presenting during DRH 2005,...
 
Reducing Infrastructure and Service Fragmentation
Reducing Infrastructure and Service Fragmentation Reducing Infrastructure and Service Fragmentation
Reducing Infrastructure and Service Fragmentation
 
Bne impact iif
Bne impact iifBne impact iif
Bne impact iif
 
WP3 Further specification of Functionality and Interoperability - Gradmann / ...
WP3 Further specification of Functionality and Interoperability - Gradmann / ...WP3 Further specification of Functionality and Interoperability - Gradmann / ...
WP3 Further specification of Functionality and Interoperability - Gradmann / ...
 
#T3UXW14 : workspace Team Work
#T3UXW14 : workspace Team Work#T3UXW14 : workspace Team Work
#T3UXW14 : workspace Team Work
 
XML London 2013 - Architecture of xproc.xq an XProc processor
XML London 2013 - Architecture of xproc.xq an XProc processorXML London 2013 - Architecture of xproc.xq an XProc processor
XML London 2013 - Architecture of xproc.xq an XProc processor
 
Europeana Cloud Aggregator Forum 2014
Europeana Cloud Aggregator Forum 2014Europeana Cloud Aggregator Forum 2014
Europeana Cloud Aggregator Forum 2014
 
ECLAP White paper, social network for Cultural Heritage on Peforming arts
ECLAP White paper, social network for Cultural Heritage on Peforming artsECLAP White paper, social network for Cultural Heritage on Peforming arts
ECLAP White paper, social network for Cultural Heritage on Peforming arts
 

More from dhlab

MyPublications: Enabling personal authoring and narrative making
MyPublications: Enabling personal authoring and narrative makingMyPublications: Enabling personal authoring and narrative making
MyPublications: Enabling personal authoring and narrative making
dhlab
 
Text Encoding and Enrichment for Linguistic Analysis: Archives on the policy ...
Text Encoding and Enrichment for Linguistic Analysis: Archives on the policy ...Text Encoding and Enrichment for Linguistic Analysis: Archives on the policy ...
Text Encoding and Enrichment for Linguistic Analysis: Archives on the policy ...
dhlab
 
Humanist machine interaction for the digital humanities
Humanist machine interaction for the digital humanitiesHumanist machine interaction for the digital humanities
Humanist machine interaction for the digital humanities
dhlab
 
History of Europe demo at IEEE MMSP 2013
History of Europe demo at IEEE MMSP 2013History of Europe demo at IEEE MMSP 2013
History of Europe demo at IEEE MMSP 2013
dhlab
 
CUbRIK Summer School RHodes histoGraph
CUbRIK Summer School RHodes histoGraphCUbRIK Summer School RHodes histoGraph
CUbRIK Summer School RHodes histoGraph
dhlab
 
HistoGraph presentation Insa de Lyon
HistoGraph presentation Insa de LyonHistoGraph presentation Insa de Lyon
HistoGraph presentation Insa de Lyon
dhlab
 
DH2013: Stuart Dunn - An emerging field(?): defining the fundamentals of huma...
DH2013: Stuart Dunn - An emerging field(?): defining the fundamentals of huma...DH2013: Stuart Dunn - An emerging field(?): defining the fundamentals of huma...
DH2013: Stuart Dunn - An emerging field(?): defining the fundamentals of huma...
dhlab
 
DH2013: Roei Amit – Engage the exhibitions audience with the use of photograp...
DH2013: Roei Amit – Engage the exhibitions audience with the use of photograp...DH2013: Roei Amit – Engage the exhibitions audience with the use of photograp...
DH2013: Roei Amit – Engage the exhibitions audience with the use of photograp...
dhlab
 
DH2013: Ad Pollé – Europeana 1914-18 & Europeana 1989
DH2013: Ad Pollé – Europeana 1914-18 & Europeana 1989DH2013: Ad Pollé – Europeana 1914-18 & Europeana 1989
DH2013: Ad Pollé – Europeana 1914-18 & Europeana 1989
dhlab
 
DH2013: Christine Sauter – Results of the task force
DH2013: Christine Sauter – Results of the task forceDH2013: Christine Sauter – Results of the task force
DH2013: Christine Sauter – Results of the task force
dhlab
 
DH2013: Julia Fallon – Legal aspects of UGC
DH2013: Julia Fallon – Legal aspects of UGCDH2013: Julia Fallon – Legal aspects of UGC
DH2013: Julia Fallon – Legal aspects of UGC
dhlab
 
DH2013: Marion Dupeyrat – Interacting with audiences: overview of participato...
DH2013: Marion Dupeyrat – Interacting with audiences: overview of participato...DH2013: Marion Dupeyrat – Interacting with audiences: overview of participato...
DH2013: Marion Dupeyrat – Interacting with audiences: overview of participato...
dhlab
 
DH2013: Lars Wieneke – Workshop introduction
DH2013: Lars Wieneke – Workshop introduction DH2013: Lars Wieneke – Workshop introduction
DH2013: Lars Wieneke – Workshop introduction
dhlab
 

More from dhlab (13)

MyPublications: Enabling personal authoring and narrative making
MyPublications: Enabling personal authoring and narrative makingMyPublications: Enabling personal authoring and narrative making
MyPublications: Enabling personal authoring and narrative making
 
Text Encoding and Enrichment for Linguistic Analysis: Archives on the policy ...
Text Encoding and Enrichment for Linguistic Analysis: Archives on the policy ...Text Encoding and Enrichment for Linguistic Analysis: Archives on the policy ...
Text Encoding and Enrichment for Linguistic Analysis: Archives on the policy ...
 
Humanist machine interaction for the digital humanities
Humanist machine interaction for the digital humanitiesHumanist machine interaction for the digital humanities
Humanist machine interaction for the digital humanities
 
History of Europe demo at IEEE MMSP 2013
History of Europe demo at IEEE MMSP 2013History of Europe demo at IEEE MMSP 2013
History of Europe demo at IEEE MMSP 2013
 
CUbRIK Summer School RHodes histoGraph
CUbRIK Summer School RHodes histoGraphCUbRIK Summer School RHodes histoGraph
CUbRIK Summer School RHodes histoGraph
 
HistoGraph presentation Insa de Lyon
HistoGraph presentation Insa de LyonHistoGraph presentation Insa de Lyon
HistoGraph presentation Insa de Lyon
 
DH2013: Stuart Dunn - An emerging field(?): defining the fundamentals of huma...
DH2013: Stuart Dunn - An emerging field(?): defining the fundamentals of huma...DH2013: Stuart Dunn - An emerging field(?): defining the fundamentals of huma...
DH2013: Stuart Dunn - An emerging field(?): defining the fundamentals of huma...
 
DH2013: Roei Amit – Engage the exhibitions audience with the use of photograp...
DH2013: Roei Amit – Engage the exhibitions audience with the use of photograp...DH2013: Roei Amit – Engage the exhibitions audience with the use of photograp...
DH2013: Roei Amit – Engage the exhibitions audience with the use of photograp...
 
DH2013: Ad Pollé – Europeana 1914-18 & Europeana 1989
DH2013: Ad Pollé – Europeana 1914-18 & Europeana 1989DH2013: Ad Pollé – Europeana 1914-18 & Europeana 1989
DH2013: Ad Pollé – Europeana 1914-18 & Europeana 1989
 
DH2013: Christine Sauter – Results of the task force
DH2013: Christine Sauter – Results of the task forceDH2013: Christine Sauter – Results of the task force
DH2013: Christine Sauter – Results of the task force
 
DH2013: Julia Fallon – Legal aspects of UGC
DH2013: Julia Fallon – Legal aspects of UGCDH2013: Julia Fallon – Legal aspects of UGC
DH2013: Julia Fallon – Legal aspects of UGC
 
DH2013: Marion Dupeyrat – Interacting with audiences: overview of participato...
DH2013: Marion Dupeyrat – Interacting with audiences: overview of participato...DH2013: Marion Dupeyrat – Interacting with audiences: overview of participato...
DH2013: Marion Dupeyrat – Interacting with audiences: overview of participato...
 
DH2013: Lars Wieneke – Workshop introduction
DH2013: Lars Wieneke – Workshop introduction DH2013: Lars Wieneke – Workshop introduction
DH2013: Lars Wieneke – Workshop introduction
 

Recently uploaded

UMiami biyezheng degree offer diploma Transcript
UMiami biyezheng degree offer diploma TranscriptUMiami biyezheng degree offer diploma Transcript
UMiami biyezheng degree offer diploma Transcript
xmevus
 
stackconf 2024 | Generative AI Security — A Practical Guide to Securing Your ...
stackconf 2024 | Generative AI Security — A Practical Guide to Securing Your ...stackconf 2024 | Generative AI Security — A Practical Guide to Securing Your ...
stackconf 2024 | Generative AI Security — A Practical Guide to Securing Your ...
NETWAYS
 
GT biyezheng degree offer diploma Transcript
GT biyezheng degree offer diploma TranscriptGT biyezheng degree offer diploma Transcript
GT biyezheng degree offer diploma Transcript
xmevus
 
CULTURE-The way of life for entire society.
CULTURE-The way of life for entire society.CULTURE-The way of life for entire society.
CULTURE-The way of life for entire society.
RIYAPAWASHE
 
Colorfcul Presentation - Public Relations
Colorfcul Presentation - Public RelationsColorfcul Presentation - Public Relations
Colorfcul Presentation - Public Relations
StephanieFeliciano8
 
VIP Ahmedabad Girls Call Ahmedabad 0X0000000X Doorstep High-Profile Girl Serv...
VIP Ahmedabad Girls Call Ahmedabad 0X0000000X Doorstep High-Profile Girl Serv...VIP Ahmedabad Girls Call Ahmedabad 0X0000000X Doorstep High-Profile Girl Serv...
VIP Ahmedabad Girls Call Ahmedabad 0X0000000X Doorstep High-Profile Girl Serv...
satpalsheravatmumbai
 
Cornell biyezheng degree offer diploma Transcript
Cornell biyezheng degree offer diploma TranscriptCornell biyezheng degree offer diploma Transcript
Cornell biyezheng degree offer diploma Transcript
xmevus
 
Lucknow Girls Call Fazullaganj 08630512678 Provide Best And Top Girl Service ...
Lucknow Girls Call Fazullaganj 08630512678 Provide Best And Top Girl Service ...Lucknow Girls Call Fazullaganj 08630512678 Provide Best And Top Girl Service ...
Lucknow Girls Call Fazullaganj 08630512678 Provide Best And Top Girl Service ...
bangaloreakshitakaus
 
Call India - AmanTel on the App Store.ppt
Call India - AmanTel on the App Store.pptCall India - AmanTel on the App Store.ppt
Call India - AmanTel on the App Store.ppt
Best International calling app on the market
 
VIP Shimla Girls Call Shimla 0X0000000X Doorstep High-Profile Girl Service Ca...
VIP Shimla Girls Call Shimla 0X0000000X Doorstep High-Profile Girl Service Ca...VIP Shimla Girls Call Shimla 0X0000000X Doorstep High-Profile Girl Service Ca...
VIP Shimla Girls Call Shimla 0X0000000X Doorstep High-Profile Girl Service Ca...
sukaniyasunnu
 
Mysore Girls Call Mysore 0X0000000X Payment On Delevery Cash Hot Premium Genu...
Mysore Girls Call Mysore 0X0000000X Payment On Delevery Cash Hot Premium Genu...Mysore Girls Call Mysore 0X0000000X Payment On Delevery Cash Hot Premium Genu...
Mysore Girls Call Mysore 0X0000000X Payment On Delevery Cash Hot Premium Genu...
seenaoberoi
 
Chandigarh Girls Call Chandigarh 0X0000000X Provide Best And Top Girl Service...
Chandigarh Girls Call Chandigarh 0X0000000X Provide Best And Top Girl Service...Chandigarh Girls Call Chandigarh 0X0000000X Provide Best And Top Girl Service...
Chandigarh Girls Call Chandigarh 0X0000000X Provide Best And Top Girl Service...
kishanaaani
 
ANALYSIS OF LIVELIHOOD DIVERSIFICATION STRATEGIES AMONG WOMEN CROP FARMERS IN...
ANALYSIS OF LIVELIHOOD DIVERSIFICATION STRATEGIES AMONG WOMEN CROP FARMERS IN...ANALYSIS OF LIVELIHOOD DIVERSIFICATION STRATEGIES AMONG WOMEN CROP FARMERS IN...
ANALYSIS OF LIVELIHOOD DIVERSIFICATION STRATEGIES AMONG WOMEN CROP FARMERS IN...
DrAdoGarba
 
Hyderabad Girls Call Hyderabad 0X0000000X Unlimited Short Providing Girls Ser...
Hyderabad Girls Call Hyderabad 0X0000000X Unlimited Short Providing Girls Ser...Hyderabad Girls Call Hyderabad 0X0000000X Unlimited Short Providing Girls Ser...
Hyderabad Girls Call Hyderabad 0X0000000X Unlimited Short Providing Girls Ser...
rashmikasinghdelhiro
 
@ℂall Lucknow @Girls Chinhat 08630512678
@ℂall Lucknow  @Girls Chinhat 08630512678 @ℂall Lucknow  @Girls Chinhat 08630512678
@ℂall Lucknow @Girls Chinhat 08630512678
veenita788
 
Girls Call Mysore 000XX00000 Provide Best And Top Girl Service And No1 in City
Girls Call Mysore 000XX00000 Provide Best And Top Girl Service And No1 in CityGirls Call Mysore 000XX00000 Provide Best And Top Girl Service And No1 in City
Girls Call Mysore 000XX00000 Provide Best And Top Girl Service And No1 in City
rawankhanlove256
 
Strategies for Adoption of SDGs in organizations
Strategies for Adoption of SDGs in organizationsStrategies for Adoption of SDGs in organizations
Strategies for Adoption of SDGs in organizations
Amgad Morgan
 
Lucknow Girls Call Aliganj 08630512678 Provide Best And Top Girl Service And ...
Lucknow Girls Call Aliganj 08630512678 Provide Best And Top Girl Service And ...Lucknow Girls Call Aliganj 08630512678 Provide Best And Top Girl Service And ...
Lucknow Girls Call Aliganj 08630512678 Provide Best And Top Girl Service And ...
arnavkumar9870
 
UW biyezheng degree offer diploma Transcript
UW biyezheng degree offer diploma TranscriptUW biyezheng degree offer diploma Transcript
UW biyezheng degree offer diploma Transcript
xmevus
 
UCI biyezheng degree offer diploma Transcript
UCI biyezheng degree offer diploma TranscriptUCI biyezheng degree offer diploma Transcript
UCI biyezheng degree offer diploma Transcript
xmevus
 

Recently uploaded (20)

UMiami biyezheng degree offer diploma Transcript
UMiami biyezheng degree offer diploma TranscriptUMiami biyezheng degree offer diploma Transcript
UMiami biyezheng degree offer diploma Transcript
 
stackconf 2024 | Generative AI Security — A Practical Guide to Securing Your ...
stackconf 2024 | Generative AI Security — A Practical Guide to Securing Your ...stackconf 2024 | Generative AI Security — A Practical Guide to Securing Your ...
stackconf 2024 | Generative AI Security — A Practical Guide to Securing Your ...
 
GT biyezheng degree offer diploma Transcript
GT biyezheng degree offer diploma TranscriptGT biyezheng degree offer diploma Transcript
GT biyezheng degree offer diploma Transcript
 
CULTURE-The way of life for entire society.
CULTURE-The way of life for entire society.CULTURE-The way of life for entire society.
CULTURE-The way of life for entire society.
 
Colorfcul Presentation - Public Relations
Colorfcul Presentation - Public RelationsColorfcul Presentation - Public Relations
Colorfcul Presentation - Public Relations
 
VIP Ahmedabad Girls Call Ahmedabad 0X0000000X Doorstep High-Profile Girl Serv...
VIP Ahmedabad Girls Call Ahmedabad 0X0000000X Doorstep High-Profile Girl Serv...VIP Ahmedabad Girls Call Ahmedabad 0X0000000X Doorstep High-Profile Girl Serv...
VIP Ahmedabad Girls Call Ahmedabad 0X0000000X Doorstep High-Profile Girl Serv...
 
Cornell biyezheng degree offer diploma Transcript
Cornell biyezheng degree offer diploma TranscriptCornell biyezheng degree offer diploma Transcript
Cornell biyezheng degree offer diploma Transcript
 
Lucknow Girls Call Fazullaganj 08630512678 Provide Best And Top Girl Service ...
Lucknow Girls Call Fazullaganj 08630512678 Provide Best And Top Girl Service ...Lucknow Girls Call Fazullaganj 08630512678 Provide Best And Top Girl Service ...
Lucknow Girls Call Fazullaganj 08630512678 Provide Best And Top Girl Service ...
 
Call India - AmanTel on the App Store.ppt
Call India - AmanTel on the App Store.pptCall India - AmanTel on the App Store.ppt
Call India - AmanTel on the App Store.ppt
 
VIP Shimla Girls Call Shimla 0X0000000X Doorstep High-Profile Girl Service Ca...
VIP Shimla Girls Call Shimla 0X0000000X Doorstep High-Profile Girl Service Ca...VIP Shimla Girls Call Shimla 0X0000000X Doorstep High-Profile Girl Service Ca...
VIP Shimla Girls Call Shimla 0X0000000X Doorstep High-Profile Girl Service Ca...
 
Mysore Girls Call Mysore 0X0000000X Payment On Delevery Cash Hot Premium Genu...

TEI Conference - CVCE

  • 1. From a Small-Scale Digital Edition to a TEI Publication Framework in Modern European History. Florentina Armaselu – DHLab, Centre virtuel de la connaissance sur l’Europe (CVCE), Luxembourg (florentina.armaselu@cvce.eu, www.cvce.eu). Text Encoding Initiative (TEI) Conference and Members’ Meeting: Connect, Animate, Innovate. 28 to 31 October 2015, Université Lumière Lyon 2.
  • 2. Summary: 1. The WEU-DIPLO pilot project; 2. Transviewer, towards a TEI publication framework; 3. Discussion; 4. References.
  • 3. Part I: The WEU-DIPLO pilot project
  • 4. Overview of the WEU-DIPLO project
      1. Goal: XML-TEI encoding, corpus analysis and Web publication of institutional documents of the W.E.U. (Western European Union):
         o Topics: armament production, standardization, control in the period from 1954 to 1982;
         o Source: Archives nationales de Luxembourg, W.E.U. collection.
      2. Initial format:
         o digitized versions (JPG) of typewritten materials (one file per page).
      3. Size (proc. = processed):

         Category         Documents                      Pages
                          Total   EN   FR   FR proc.     Total   EN    FR    FR proc.
         Note                89   43   46         37       395   191   204        155
         Minutes             30   15   15         15       256   138   118        118
         Memorandum           3    1    2          2        16     7     9          9
         Study                2    0    2          1        12     0    12          8
         Discourse            1    0    1          0         4     0     4          0
         Draft protocol       2    1    1          0         4     2     2          0
         Total              127   60   67         55       687   338   349        290
  • 5. Overview of the WEU-DIPLO project: workflow
  • 6. Overview of the WEU-DIPLO project: page structure (header, content, footer). ©WEU-UEO
  • 7. Microsoft Word styling – WEU-DIPLO: headers, footers; headings, line breaks, paragraphs
  • 8. Conversion and enrichment (XSLT, manual, NER)
      o OxGarage (DOCX to TEI P5)
      o oXygen XML Editor: XSLT transformation (metadata, structure); manual enrichment (semantics – discourse of country/institutional representatives)
      o GATE (Named Entity Recognition): training phase (Gazetteer List Collector); annotation phase (names of persons, organisations, places, functions, events, products; dates)
      o oXygen XML Editor: XSLT (GATE XML to TEI P5 transformation)
  • 9. XML-TEI Encoding: WEU-DIPLO – metadata; layout (header). ©WEU-UEO. @@hAuthor, @@hArchNum, @@hStampConfid, @@hDocRef, @@hOrigDate, @@hOrigLang, @@hVersion
  • 10. XML-TEI Encoding: WEU-DIPLO – structure (headings, paragraphs, line breaks); semantics (named entities, discourse). ©WEU-UEO. @@Heading2, @@Paragraph, @@LineBreak, @@Names, @@Discourse
  • 11. XML-TEI Encoding: WEU-DIPLO – transcription features (Pierazzo, 2011)
  • 12. Part II: Transviewer, towards a TEI publication framework
  • 13. Context: The CVCE’s ePublications. Treaties; official declarations and meeting reports; letters; notes; press articles; images, video and audio archives related to European integration history.
  • 14. Transviewer overview
      1. Transviewer concept:
         o XML-TEI transformation/visualisation on the fly, in the browser;
         o flexible framework for the publication of XML-TEI documents in European integration history.
      2. Technologies: XML, HTML, XSLT, CSS and JavaScript.
      3. Tested platforms:
         o EVT (Edition Visualization Technology): http://sourceforge.net/projects/evt-project/
         o KILN: http://kiln.readthedocs.org/en/latest/#
         o TEIBoilerplate: http://dcl.ils.indiana.edu/teibp/
         o Versioning Machine: http://v-machine.org/
         o XTF (eXtensible Text Framework): http://xtf.cdlib.org/about/
  • 15. Transviewer prototype. Implementation (adaptation and in-house development):
      o side-by-side view of digital facsimile and transcription (EVT model);
      o third-party libraries: BookReader (tool designed to provide online access to scanned books) and Saxon-CE (support for XSLT 2.0 transformation in the browser);
      o in-house development (configuration, frames and buttons layout/actions, transcription rendering, third-party library calls).
  • 16. Transviewer experiments – digital facsimile/transcription side-by-side view. ©WEU-UEO
  • 17. Transviewer experiments – digital facsimile/transcription side-by-side view. Werner – handwritten notes
  • 18. Transviewer experiments (simulation) – video/audio and transcription synchronisation. Werner – interviews
  • 19. Transviewer features – panel layouts
  • 20. Transviewer features – transcription format
  • 21. Transviewer features – panel interlinking
  • 23. Discussion. “By teaching an edition how to swim, I mean endowing an edition not only with a store of factual knowledge concerning the work presented, but also with the capability of dealing gracefully with the mutability of the electronic medium, by exploiting the possibilities for reader-controlled changes to the edition’s presentation and by adapting successfully to rapid changes in the hardware and software environment.” (Sperberg-McQueen, 2009)
      1. Transviewer prototype questions:
         o flexible enough to support different types of documents in European integration history and different user requirements;
         o modular architecture to allow gradual development and customisation according to the needs of the projects;
         o balance manual interventions/automatic processing (XSLT, NER);
         o XML transformation on the fly (no need for intermediary formats/steps, changes to the XML already part of the publication).
  • 24. Discussion
      3. Issues:
         o BookReader – use of an older version of the jQuery library;
         o non-uniform support for Saxon-CE XSLT 2.0 transformation across browsers;
         o need for batch conversion to XML-TEI (potential adaptation of OxGarage for batch processing).
      4. Ongoing/future work for further development:
         o evaluation (technology – technical experts; usability tests – experts in European integration studies);
         o development of new modules (multi-panels, audio/video transcription, etc.) and tests with more project samples;
         o integration into the CVCE’s existing Website architecture (Back End and Front End).
  • 25. Discussion. Scaling in a publication framework would imply not only teaching your editions “how to swim” but also how to swim together. Thank you!
  • 26. References
      o BookReader: https://openlibrary.org/dev/docs/bookreader
      o EVT (Edition Visualization Technology): http://sourceforge.net/projects/evt-project/
      o GATE: https://gate.ac.uk/
      o KILN: http://kiln.readthedocs.org/en/latest/#
      o OxGarage: http://www.tei-c.org/oxgarage/
      o Pierazzo, Elena. 2011. “A rationale of digital documentary editions”. In LLC. The Journal of Digital Scholarship in the Humanities, Vol. 26, No. 4, December 2011, pp. 463-477.
      o http://www.scholarlyediting.org/2014/essays/essay.pierazzo.html
      o TEIBoilerplate: http://dcl.ils.indiana.edu/teibp/
      o TEI (Text Encoding Initiative): http://www.tei-c.org
      o Versioning Machine: http://v-machine.org/
      o Saxon-CE: http://www.saxonica.com/ce/user-doc/1.1/index.html
      o Sperberg-McQueen, C.M. 2009. “How to teach your edition how to swim”. In LLC. The Journal of Digital Scholarship in the Humanities, Vol. 24, No. 1, April 2009. Oxford Journals.
      o XTF (eXtensible Text Framework): http://xtf.cdlib.org/about/