Possibilities of Digital Analysis of Charter corpora
Georg Vogeler
IMC Leeds, 9.7.2009

Charter Corpora on the Web
■ Württembergisches Urkundenbuch (http://maja.bsz-bw.de/wubonline/)
■ CDLM (http://cdlm.unipv.it)
■ DEEDS (http://www.utoronto.ca/deeds/)
■ Monasterium.net (http://www.monasterium.net)
■ Ut per litteras apostolicas … (http://www.brepolis.net)
■ Diplomatico Firenze (http://www.archiviodistato.firenze.it/diplomatico)

What’s their advantage?
■ Images
■ Reconstructed archives
  • Virtuelles Archiv Salzburg
  • Archive of the Stift Ardagger
■ Fast search
■ => take the charter heritage as it is, not as defined by organisational reasons

Online Corpus abolishes borders …
■ between repositories
■ and between forms of representation

Research on set phrases
■ Vernacular dating clauses
  • Latin model (Ulm, 29 March 1275):
    dirre dinge iſt gezivch herre Marquart von Bleichen herre hartman von ſahſenhvſen vn herre tecke von annenhoven. Datum · IIIIo · kl · aprilis · anno dni · Mo · CCo · LXXVo.
  • German model, almost free of Latin:
    Diz geſchach zehahberch an deme Ciſtage in der phingeſtwochen / do von gotteſ geb̓vrte waren zwelfhundert Sibenzig vn f̓vnf Jar

Dating Clauses
■ 13th century:
  • Germany (de Boor 1975)
    - South-western model:
      • dis geschach do man zalte von gotes gebúrte zwelf hundert und niun und niunzig jar.
    - South-eastern model:
      • ditz ist geschehen, do es waren von christes geburt tousent zwaihundert und darnach in dem niun unde niunzegisten jare.
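
In monasterium.net
(: Select German-language charters with a non-empty tenor and a sort date
   between 13000000 and 14000001, ordered by date. For each, return the
   b_name attribute, the place of issue and roughly the last 200 characters
   of the tenor, presumably where the dating clause stands. :)
for $u in //tenor[not(. = '')]/ancestor::text[.//lang_MOM = 'Deutsch']
let $dat := substring($u//tenor, string-length($u//tenor) - 200)
where number($u//date_sort) lt 14000001
  and number($u//date_sort) gt 13000000
order by $u//date_sort
return
  <dat>
    <wo>{ $u/@b_name } { $u//issued/placeName/text() }</wo>
    <was>{ $dat }</was>
  </dat>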
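
For readers without an XQuery processor at hand, the same selection can be sketched with Python's standard library. The element and attribute names (text, tenor, lang_MOM, date_sort, issued/placeName, b_name) are taken from the query; the sample records, the corpus wrapper element and the function name tail_clauses are invented for illustration and are not monasterium.net data.

```python
import xml.etree.ElementTree as ET

# Invented sample records; only the element and attribute names follow the query.
SAMPLE = """
<corpus>
  <text b_name="Archive A">
    <lang_MOM>Deutsch</lang_MOM>
    <date_sort>13250615</date_sort>
    <issued><placeName>Ulm</placeName></issued>
    <tenor>(invented text) do man zalte von gotes geburte</tenor>
  </text>
  <text b_name="Archive B">
    <lang_MOM>Latein</lang_MOM>
    <date_sort>13300101</date_sort>
    <issued><placeName>Passau</placeName></issued>
    <tenor>(invented text) Datum anno domini</tenor>
  </text>
  <text b_name="Archive C">
    <lang_MOM>Deutsch</lang_MOM>
    <date_sort>12990310</date_sort>
    <issued><placeName>Salzburg</placeName></issued>
    <tenor>(invented text) do es waren von christes geburt</tenor>
  </text>
</corpus>
"""


def tail_clauses(xml_text, low=13000000, high=14000001, tail=200):
    """Return (date_sort, archive, place, end of tenor) for German charters in range."""
    rows = []
    for text in ET.fromstring(xml_text).iter("text"):
        tenor = (text.findtext(".//tenor") or "").strip()
        # Skip charters without a tenor or not marked as German.
        if not tenor or text.findtext(".//lang_MOM") != "Deutsch":
            continue
        date_sort = int(text.findtext(".//date_sort") or 0)
        if low < date_sort < high:
            place = text.findtext(".//issued/placeName") or ""
            rows.append((date_sort, text.get("b_name", ""), place, tenor[-tail:]))
    return sorted(rows)  # tuples sort by date_sort first, like "order by"


for row in tail_clauses(SAMPLE):
    print(row)
```

Only the end of each tenor is kept because that is the part the query extracts; widening or narrowing the tail parameter is a matter of how long the closing formulae are in a given corpus.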
In monasterium.net
[Dd][aov]
([uv][ao]n|nach)
(([Gg]ot{1,2}[ei]{0,1}[sz])|
(([Cc]h|[cCkK])rist[eisz]*?)|
([uv]n{1,2}s[ei]{0,1}r[ei]{0,1}[sz]
[hH]er{1,2}[ei]{0,1}n))
([Gge]*?[PBpb][uv][oe°]{0,1}ri{0,1}[td][hte]*?)
(w[ao]^{0,1}[rzs][ien]*?)
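
How such a pattern separates the two dating models can be illustrated with a small Python sketch. The two regular expressions below are deliberately simplified assumptions, much cruder than the expression on the slide, and the function name classify is hypothetical; the test sentences are the two 13th-century examples quoted on the "Dating Clauses" slide.

```python
import re

# Simplified, assumed patterns; NOT the expression used on monasterium.net.
# "zalt" model: "do man zalte ..." (the year is counted).
ZALT = re.compile(r"\bd[ao]\s+man\s+z[ae]lte?\b", re.IGNORECASE)
# "waren" model: "do ... waren ..." with at most 40 characters in between.
WAREN = re.compile(r"\bd[ao]\b.{0,40}?\bw[ao]ren\b", re.IGNORECASE)


def classify(clause: str) -> str:
    """Label a dating clause as 'zalt', 'waren' or 'other'."""
    if ZALT.search(clause):
        return "zalt"
    if WAREN.search(clause):
        return "waren"
    return "other"


examples = [
    "dis geschach do man zalte von gotes gebúrte zwelf hundert und niun und niunzig jar.",
    "ditz ist geschehen, do es waren von christes geburt tousent zwaihundert",
    "Datum IIIIo kl aprilis anno dni",
]
for clause in examples:
    print(classify(clause), "<-", clause)
```

A real pattern has to absorb far more spelling variation (u/v, single and double consonants, optional vowels), which is what the character classes and quantifiers in the slide's expression are for.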

Results
■ 13th century:
  • 433 texts
  • “zalt”-model: 24, all but 5 from the Chartularium Sangallense
  • “waren”-model: 137, all but 15 from the south-eastern regions
■ 14th century:
  • 8354 texts
  • “zalt”-model: 2478, 964 not from St. Gallen
  • “waren”-model: 350, only 13 from St. Gallen
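
To put these counts in proportion, the share of each model among the texts of a century can be computed directly. This is a minimal sketch that uses only the figures from the slide; the dictionary layout and the function name share are illustrative choices.

```python
# Counts copied from the "Results" slide; the dictionary layout is only illustrative.
RESULTS = {
    "13th century": {"texts": 433, "zalt": 24, "waren": 137},
    "14th century": {"texts": 8354, "zalt": 2478, "waren": 350},
}


def share(count: int, total: int) -> float:
    """Return count as a percentage of total, rounded to one decimal place."""
    return round(100 * count / total, 1)


SHARES = {
    century: {model: share(row[model], row["texts"]) for model in ("zalt", "waren")}
    for century, row in RESULTS.items()
}

for century, row in SHARES.items():
    print(f"{century}: zalt {row['zalt']} %, waren {row['waren']} %")
```

On these figures the “zalt” model rises from about 5.5 % to about 29.7 % of the texts, while the “waren” model falls from about 31.6 % to about 4.2 %.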

Methods of Investigation
■ Already in use
  • Simple word selection/word count (Tock, Brousseau, Parisse)
  • Phrase statistics (Gervers/Margolin)
  • Graphetic detail analysis (Fiebig)
  • Hand identification by pattern analysis (Schomaker/Burgers)
  • Named entity recognition (Stoyan/Schmidt)

Possible Programming
■ Testing/adapting existing algorithms
  • Author identification tools
  • Graphical variation tools
  • Named Entity Recognition methods for clauses
■ to find the connections between charters that aren’t kept in the same archive/aren’t printed in the same edition:
  • e.g.: Influence of recipient on the charters
  • Spread of formulae, regions of legal culture

Early medieval diplomatics
■ Add charters to the online corpora
■ Add information to the online charter corpora
■ Take text analytic software into consideration
■ Ask your local computer scientist how they could help you
Thank you for your attention
g.vogeler@lmu.de

Possibilities of Digital Analysis of Charter corpora

  • 1. Possibilities of Digital Analysis of Charter corpora Georg Vogeler
  • 2. IMC Leeds, 9.7.2009Georg Vogeler 2 Charter Corpora on the Web  Württembergisches Urkundenbuch (http://maja.bsz- bw.de/wubonline/  CDLM (http://cdlm.unipv.it)  DEEDS (http://www.utoronto.ca/deeds/)  Monasterium.net (http://www.monasterium.net)  Ut per litteras apostolicas … (http://www.brepolis.net)  Diplomatico Firenze (http://www.archiviodistato.firenze.it/diplomatico)
  • 3. IMC Leeds, 9.7.2009Georg Vogeler 3 What’s their advantage?  Images  Reconstructed archives • Virtuelles Archiv Salzburg • Archive of the Stift Ardagger  Fast search  => take the charter heritage as is not as defined by organisational reasons
  • 4. IMC Leeds, 9.7.2009Georg Vogeler 4 Online Corpus abolishes borders …  between repositories  between forms of representation and
  • 5. IMC Leeds, 9.7.2009Georg Vogeler 5 Research on set phrases  Vernacular dating clauses • Latin model: (Ulm 1275 März 29) dirre dinge iſt gezivch herre Marquart von Bleichen herre hartman von ſahſenhvſen vn herre tecke von annenhoven. Datum · IIIIo · kl · aprilis · anno dni · Mo · CCo · IXXVo. • German model almost free from it: Diz geſchach zehahberch an deme Ciſtage in der phingeſtwochen / do von gotteſ geb̓vrte waren zwelfhundert Sibenzig vn f̓vnf Jar
  • 6. IMC Leeds, 9.7.2009Georg Vogeler 6 Dating Clauses  13th century: • Germany (de Boor 1975) - South-western model: • dis geschach do man zalte von gotes gebúrte zwelf hundert und niun und niunzig jar. - South-eastern model: • ditz ist geschehen, do es waren von christes geburt tousent zwaihundert und darnach in dem niun unde niunzegisten jare.
  • 7. IMC Leeds, 9.7.2009Georg Vogeler 7 In monasterium.net for $u in //tenor[not(.='')]/ancestor::text[.//lang_MOM='Deutsch'] let $dat := substring($u//tenor, (string-length($u//tenor) - 200)) where number($u//date_sort) lt 14000001 and number($u//date_sort) gt 13000000 order by $u//date_sort return <dat><wo>{ $u/@b_name } { $u//issued/placeName/text() }</wo> <was> { $dat }</was></dat>
  • 8. IMC Leeds, 9.7.2009Georg Vogeler 8 In monasterium.net [Dd][aov] ([uv][ao]n|nach) (([Gg]ot{1,2}[ei]{0,1}[sz])| (([Cc]h|[cCkK])rist[eisz]*?)| ([uv]n{1,2}s[ei]{0,1}r[ei]{0,1}[sz] [hH]er{1,2}[ei]{0,1}n)) ([Gge]*?[PBpb][uv][oe°]{0,1}ri{0,1}[td][hte]*?) (w[ao]^{0,1}[rzs][ien]*?)
  • 9. IMC Leeds, 9.7.2009Georg Vogeler 9 In monasterium.net
  • 10. IMC Leeds, 9.7.2009Georg Vogeler 10 Results  13th century: • 433 texts • “zalt”-model: 24, all but 5 from the Chartularium Sangallense • “waren”-model: 137, all but 15 from the south-eastern regions  14th century: • 8354 texts • “zalt”-model: 2478, 964 not from St. Gallen • “waren”-model: 350, only 13 from St. Gallen
  • 11. Methods of Investigation. Already in use: simple word selection/word count (Tock, Brousseau, Parisse); phrase statistics (Gervers/Margolin); graphetic detail analysis (Fiebig); hand identification by pattern analysis (Schomaker/Burgers); named entity recognition (Stoyan/Schmidt).
  • 12. Possible Programming. Testing/adapting existing algorithms: author identification tools; graphical variation tools; Named Entity Recognition methods for clauses. To find the connections between charters that aren’t kept in the same archive or printed in the same edition, e.g. influence of the recipient on the charters; spread of formulae, regions of legal culture.
  • 13. Early medieval diplomatics. Add charters to the online corpora. Add information to the online charter corpora. Take text analytic software into consideration. Ask your local computer scientist how he could help you.
  • 14. Thank you for your attention g.vogeler@lmu.de
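
The search described on slides 7 to 10 (take the end of each German text, match the dating clause, count the models) can be sketched in Python rather than XQuery. This is only an illustration under stated assumptions: the two patterns are strongly simplified stand-ins, not the regular expression from slide 8; the names `ZALT`, `WAREN`, `tail`, `classify` and `tally` are hypothetical; and the sample clauses are the ones quoted on slides 5 and 6, not a real corpus.

```python
import re
from collections import Counter

# Simplified, hypothetical patterns (NOT the expression shown on slide 8).
# "zalt" model, e.g. "do man zalte von gotes gebúrte ..."
ZALT = re.compile(r"\bd[aou] man z[ae]lte? [uv][ao]n\b", re.IGNORECASE)
# "waren" model, e.g. "do es waren von christes geburt ..."
WAREN = re.compile(r"\bd[aou] (?:e[sz] )?w[ao]ren [uv][ao]n\b", re.IGNORECASE)


def tail(text: str, length: int = 200) -> str:
    """Return the last `length` characters, mirroring the substring() step of the query."""
    return text[-length:]


def classify(text: str) -> str:
    """Label a charter text as 'zalt', 'waren' or 'other' by the clause at its end."""
    end = tail(text)
    if ZALT.search(end):
        return "zalt"
    if WAREN.search(end):
        return "waren"
    return "other"


def tally(texts) -> Counter:
    """Count how many texts fall under each model."""
    return Counter(classify(t) for t in texts)


# Clauses quoted on slide 6, plus the opening of the Latin clause from slide 5.
SAMPLES = [
    "dis geschach do man zalte von gotes gebúrte zwelf hundert und niun und niunzig jar.",
    "ditz ist geschehen, do es waren von christes geburt tousent zwaihundert "
    "und darnach in dem niun unde niunzegisten jare.",
    "Datum · IIIIo · kl · aprilis",
]

if __name__ == "__main__":
    for label, count in sorted(tally(SAMPLES).items()):
        print(label, count)
```

Patterns this narrow would miss most of the spelling variation the slide-8 expression is built to catch; they only show how matching and counting fit together.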

Editor's Notes

  1. CDLM Württembergisches Urkundenbuch DEEDS Monasterium.net Ut per litteras apostolicas … Diplomatico Firenze
  2. The monasterium project thus gives an insight into the possibilities of a Virtual European Charter Archive: the charters form just one corpus. You will find the documents in the Archives of the Archbishop of Salzburg although the Habsburgs transferred them all to Vienna; you will find all documents dealing with the diocese of Passau, to which the capital of Austria belonged before 1469. You will find documents concerning Bratislava, the capital of Slovakia, from the times when it was the capital of Hungary … Borders between forms: I explained this last year with the Online Kemble for the Anglo-Saxon charters. This year I want to give an example of how a corpus like monasterium.net can be used for diplomatic research, supported by the computer.
  3. Some of you might know that I had my own conference on Codicology and Palaeography in the Digital Age only a week ago, so I did not have much time to prepare this example. From the vast variety of possible questions (paired formulae (Paarformeln), corroboration formulae; whether the reasons given for issuing a charter depend on the issuer („Notturft“ in the case of women?); #consent formulae and „an schaden“#; the relationship between vernacular and Latin formulae; the function of the witnesses of the seal; the seller taking responsibility for the correctness of ##property rights#; the correlation between issuer, recipient and writing notary, etc.) I chose the dating clause: #Latin example#. Several observations have been made on 13th-century material: for Switzerland, Peter Rück observed the introduction of the „modern“ dating style, counting the days of the month, spreading from west to east, with continuing #reluctance in the diocese of Konstanz#; Helmut de Boor observed
  4–6. I prepared a selection of the full texts in mom-ca that are German and from the 13th century. They come from the archives indicated on this map (Google Maps): St. Gallen is in the Alemannic area and thus should prefer „zalt“, while the rest should prefer „waren“. I then ran a regular-expression search over the selection to identify the clauses in all their variety.
  7. The 13th century confirms de Boor's analysis. The 14th century shows a significant change: the „zalt“ model is no longer restricted to the Alemannic region of southern Germany, and the „waren“ model is much less widespread than before. That fits de Boor's general observation that the „zalt“ model is the more modern one and is already spreading from west to east in the 13th century. If I were not occupied at the moment with research on the use of the documents of Frederick II, I would be very much inclined to continue this research. But I have to be careful: there are lots of other techniques that can be applied to digital charter corpora:
  8. Let me give you some examples
  9. Author identification (Leeds 2008): the problem of short, formulaic texts, which are difficult to identify in general and thus of great interest to computational linguists. Graphical variation: edit distance, developing a Soundex. NER: hidden Markov model, training.
  10. What could be the result of that for early medieval diplomatists? You traditionally don’t deal with large corpora. But you could consider this: the CDLM provides a huge amount of data, and I haven’t read any study using the corpus. Unfortunately the ARTEM databases aren’t online, but I would be very interested to see research done with them. The corpora accessible online can be improved. Add charters to the online corpora: by retro-digitization and by digital edition. Add information to the online charter corpora: in the online editor of www.mom-ca.uni-koeln.de there are at the moment 636 charters from before 1150, 171 of them without full texts. mom-ca provides the possibility to add text online, simply by registering yourself on the site. Why not enhance the corpus? Take text analytic software into consideration: wherever your material comes from, bear in mind that there are already text analytic tools that could be useful for you. And if you imagine a tool but don’t find it or don’t know how to use it: ask your local computer scientist how he could help you, and don’t be frustrated if he doesn’t understand you. There are lots of computer scientists supporting the work of historians!
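
The note on graphical variation mentions edit distance. As a minimal sketch of that idea (not the author's implementation), the Levenshtein distance between two spellings can be computed with the standard dynamic-programming recurrence. The names `levenshtein`, `close_variants` and `VARIANTS` are illustrative, the word forms are taken from the clauses quoted on slide 6, and the threshold of 2 is an arbitrary assumption.

```python
def levenshtein(a: str, b: str) -> int:
    """Minimum number of single-character insertions, deletions and
    substitutions needed to turn `a` into `b`."""
    previous = list(range(len(b) + 1))  # distances from "" to each prefix of b
    for i, ca in enumerate(a, start=1):
        current = [i]  # distance from a[:i] to ""
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            current.append(min(
                previous[j] + 1,         # delete ca
                current[j - 1] + 1,      # insert cb
                previous[j - 1] + cost,  # substitute (or keep) the character
            ))
        previous = current
    return previous[-1]


def close_variants(word, candidates, max_distance=2):
    """Return the candidates within `max_distance` edits of `word` (excluding `word` itself)."""
    return [c for c in candidates
            if c != word and levenshtein(word, c) <= max_distance]


# Word forms from the dating clauses quoted on slide 6.
VARIANTS = ["geburt", "gebúrte", "zalte", "waren"]

if __name__ == "__main__":
    print(close_variants("geburt", VARIANTS))
```

A plain edit distance treats every character change alike; a measure tuned to medieval orthography would presumably weight common interchanges such as u/v lower, which is the sort of adaptation the note hints at with "developing a Soundex".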