SlideShare a Scribd company logo
1 of 28
The Provenance and History
of the Manuscripts formerly
in the Phillipps Collection
Department of Digital HumanitiesToby Burrows
The Phillipps manuscript collection
• Phillipps’ own printed catalogue
(1837-1871) goes up to no. 23,837
• Thomas Fitzroy Fenwick (grandson,
d. 1938) spent fifty years
reorganizing and renumbering: up to
no. 38,628
• Fenwick’s estimate of the total was
close to 60,000 volumes and
individual documents
• Phillipps also owned 50,000 books,
as well as many prints, photographs,
drawings and paintings
Sir Thomas Phillipps (1792-1872)
Assembling the collection
Meerman
1824
Meerman
1824
Celotti
1825
Celotti
1825
Craven
Ord
1829-32
Craven
Ord
1829-32Page-
Turner
1824
Page-
Turner
1824
Lang
1828
Lang
1828
Drury
1827
Drury
1827
Guilford
(North)
1830
Guilford
(North)
1830
Heber
1836
Heber
1836
Van Ess
1823-6
Van Ess
1823-6
PHILLIPPSPHILLIPPS
Libri
1859-62
Libri
1859-62
Re-creation of Phillipps’ shelves, Grolier Club
Dispersal of the collection
Fenwick family (1886-1945):
• Sales to interested libraries and governments (Germany, Belgium,
Netherlands, France, Ireland, Wales) – more than 2,500 items
• Auctions at Sotheby’s, 1886 to 1938 – 22 auctions, more than 22,000
lots, raised £97,000 (over £30 million)
• Residue (12,000 items) sold to the Robinson brothers in 1945 for
£100,000 (£11-12 million)
W.H. Robinson Ltd (1945-1958):
•Series of sale catalogues, 1945-1954
•Donation to the Bodleian Library of the remaining materials, 1958
Sotheby’s (1946-1950, 1965-1977):
•Series of sale catalogues
Data sources
Source Format Comments
Schoenberg Database of
Manuscripts
Relational database Incorporates other sources, esp. sales catalogues
6,000 Phillipps MSS; 20,000 Phillipps events
Library catalogues (BL, KB etc.) Relational databases
Generally MARC records
Provenance in notes
Export can be awkward
Union catalogues
Relational databases
Printed bibliographies
Formats vary
Coverage varies
Export can be awkward
Sale catalogues
Printed books (some digitized)
Online sources (PDFs, Web sites)
Many included in Schoenberg
MSS in ABE, eBay etc.
Phillipps catalogues and lists
Printed book; Partly digitized
Supplemented by handwritten notes
Partly included in Schoenberg
Handwritten notes not digitized
Phillipps provenance indexes (BL,
IRHT)
Handwritten; Not digitized
Arranged by Phillipps number
No longer updated
Annotated sales catalogues &
printed catalogues
Handwritten; Not digitized
Researchers (Munby), owners (Phillipps), auctioneers
(Sotheby’s)
Held in Cambridge UL, Bodleian, BL
Project summary
• Two main research questions:
– The history and significant characteristics of the transmission
of a major group of European manuscripts between collections
and collectors over the centuries (provenance)
– The applicability and value of Linked Data technologies as a
methodology for the large-scale analysis of the history of
cultural objects and collections (“network archaeology”)
• Project plan
– Ingest the data; transform them to a common Data Model;
represent them computationally; analyse and visualize them;
make them available to other researchers
• Tools
– Excel, OpenRefine, Neo4j, Nodegoat, visualization tools
In 1862, Sir Thomas Phillipps bought Phillipps MS 16402 in London
as part of the Sotheby’s sale of the collection of Guglielmo Libri.
London
1862
MS16402
Libri
Phillipps
Sotheby’s
Neo4j: graph database
• Nodes and relationships (each
with properties)
• Various tools for data import
• Cypher query language for
creating nodes, relationships
and properties
• Cypher is also used to run
queries, analyse paths, count
and list
• No schema as such – develop
and define as you go
• Own visualization interface, but
also works with others
• Data export – JSON
Neo4j Data Model – nodes (entities)
Node (entity: label) Type Properties
AGENT Person
Organization
name
OBJECT Manuscript id
title
foliation
layout
binding
illustration
WORK Text
Description
Exhibition
title
incipit
language
PUBLICATION Catalogue
Book
Article
title
Neo4j Data Model – relationships
Relationship Properties
GAVE
SOLD
CONSIGNED
OWNS
ACQUIRED
date
id
certitude
price
PRODUCED date
certitude
CONTAINS locus
SAME_AS certitude
Relationship Properties
COMPOSED
TRANSLATED
COMPILED
date
certitude
ANNOTATED
INSCRIBED
date
locus
certitude
DESCRIBED_IN date
item no.
DESCRIBED_AS date
item no.
Neo4j Data Model – relationship statements
Node Relationship Node
AGENT: Person GAVE OBJECT: Manuscript
AGENT: Organization SOLD OBJECT: Manuscript
OBJECT: Manuscript CONTAINS WORK: Text
AGENT: Person COMPOSED WORK: Text
PUBLICATION: Catalogue CONTAINS WORK: Description
OBJECT: Manuscript DESCRIBED_AS WORK: Description
AGENT: Organization PRODUCED PUBLICATION: Catalogue
WORK: Exhibition DESCRIBED_IN PUBLICATION: Catalogue
DATA MODEL – Nodegoat
Object Sub-objects Related to:
PERSON Nationality (country) Manuscript
Text
Catalogue
ORGANIZATION Location (city; country) Manuscript
Text
Catalogue
MANUSCRIPT Sold
Donated
Owned
Described In
Produced
Contents
Person/Organization: Agent,
Owner, Buyer, Donor,
Recipient, Scribe, Artist,
Producer
Location (city; country)
Catalogue
Text
TEXT Person: Author
Manuscript
CATALOGUE Organization: Publisher
Person: Compiler
Manuscript
Current status of the project
• Data imported: selections from Schoenberg data, other sample
data
• Data Model: theoretical work + working versions
• Demo versions of Neo4j and Nodegoat databases
• Tested and documented queries, analyses and visualizations
• To come:
– Adding much more data in a production environment
(Nodegoat)
– Carrying out more extensive visualizations and analyses
• Across the whole collection
• In relation to specific “use cases”
– Exporting data for reuse by other researchers
Dr Toby Burrows
Marie Curie Fellow
Department of Digital Humanities
King’s College London
26-29 Drury Lane
London WC2B 5RL
toby.burrows@kcl.ac.uk
@tobyburrows
tobyburrows.wordpress.com

More Related Content

What's hot

Nomisma.org. What's in a namespace?
Nomisma.org. What's in a namespace?Nomisma.org. What's in a namespace?
Nomisma.org. What's in a namespace?Menetys
 
Itinera Nova. The road travelled so far
Itinera Nova. The road travelled so farItinera Nova. The road travelled so far
Itinera Nova. The road travelled so farItinera Nova
 
University of Pennsylvania Librarians' Assembly
University of Pennsylvania Librarians' AssemblyUniversity of Pennsylvania Librarians' Assembly
University of Pennsylvania Librarians' AssemblyHolly Mengel
 
Biblissima: Medieval Manuscripts and the Semantic Web
Biblissima: Medieval Manuscripts and the Semantic WebBiblissima: Medieval Manuscripts and the Semantic Web
Biblissima: Medieval Manuscripts and the Semantic WebEquipex Biblissima
 
Dutch culture link
Dutch culture linkDutch culture link
Dutch culture linkLukas Koster
 
"Il n´y a pas de hors-texte": challenges for Archival Linked Data. Adrian Ste...
"Il n´y a pas de hors-texte": challenges for Archival Linked Data. Adrian Ste..."Il n´y a pas de hors-texte": challenges for Archival Linked Data. Adrian Ste...
"Il n´y a pas de hors-texte": challenges for Archival Linked Data. Adrian Ste...Biblioteca Nacional de España
 
Introduction of the project "Books Discovered Once Again"
Introduction of the project "Books Discovered Once Again" Introduction of the project "Books Discovered Once Again"
Introduction of the project "Books Discovered Once Again" Books Discovered Once Again
 
Publication Web Semantic Biblissima - DH2016
Publication Web Semantic Biblissima - DH2016Publication Web Semantic Biblissima - DH2016
Publication Web Semantic Biblissima - DH2016Equipex Biblissima
 
Dutch culture link - version 2
Dutch culture link - version 2Dutch culture link - version 2
Dutch culture link - version 2Lukas Koster
 
Short resume, Sebastian Wilke
Short resume, Sebastian WilkeShort resume, Sebastian Wilke
Short resume, Sebastian WilkeSebastian Wilke
 
VRA 2012, Archival Collections Case Studies, The Artamonoff Business
VRA 2012, Archival Collections Case Studies, The Artamonoff BusinessVRA 2012, Archival Collections Case Studies, The Artamonoff Business
VRA 2012, Archival Collections Case Studies, The Artamonoff BusinessVisual Resources Association
 
Providing the On-Ramp to the Digital Public Library of America
Providing the On-Ramp to the Digital Public Library of AmericaProviding the On-Ramp to the Digital Public Library of America
Providing the On-Ramp to the Digital Public Library of AmericaRebekah Cummings
 

What's hot (14)

Nomisma.org. What's in a namespace?
Nomisma.org. What's in a namespace?Nomisma.org. What's in a namespace?
Nomisma.org. What's in a namespace?
 
The Big Bang Theory and the project
The Big Bang Theory and the projectThe Big Bang Theory and the project
The Big Bang Theory and the project
 
Itinera Nova. The road travelled so far
Itinera Nova. The road travelled so farItinera Nova. The road travelled so far
Itinera Nova. The road travelled so far
 
University of Pennsylvania Librarians' Assembly
University of Pennsylvania Librarians' AssemblyUniversity of Pennsylvania Librarians' Assembly
University of Pennsylvania Librarians' Assembly
 
Biblissima: Medieval Manuscripts and the Semantic Web
Biblissima: Medieval Manuscripts and the Semantic WebBiblissima: Medieval Manuscripts and the Semantic Web
Biblissima: Medieval Manuscripts and the Semantic Web
 
Dutch culture link
Dutch culture linkDutch culture link
Dutch culture link
 
"Il n´y a pas de hors-texte": challenges for Archival Linked Data. Adrian Ste...
"Il n´y a pas de hors-texte": challenges for Archival Linked Data. Adrian Ste..."Il n´y a pas de hors-texte": challenges for Archival Linked Data. Adrian Ste...
"Il n´y a pas de hors-texte": challenges for Archival Linked Data. Adrian Ste...
 
Introduction of the project "Books Discovered Once Again"
Introduction of the project "Books Discovered Once Again" Introduction of the project "Books Discovered Once Again"
Introduction of the project "Books Discovered Once Again"
 
Publication Web Semantic Biblissima - DH2016
Publication Web Semantic Biblissima - DH2016Publication Web Semantic Biblissima - DH2016
Publication Web Semantic Biblissima - DH2016
 
Pitch 4 Seriearchieven | Nico Vriend
Pitch 4 Seriearchieven | Nico VriendPitch 4 Seriearchieven | Nico Vriend
Pitch 4 Seriearchieven | Nico Vriend
 
Dutch culture link - version 2
Dutch culture link - version 2Dutch culture link - version 2
Dutch culture link - version 2
 
Short resume, Sebastian Wilke
Short resume, Sebastian WilkeShort resume, Sebastian Wilke
Short resume, Sebastian Wilke
 
VRA 2012, Archival Collections Case Studies, The Artamonoff Business
VRA 2012, Archival Collections Case Studies, The Artamonoff BusinessVRA 2012, Archival Collections Case Studies, The Artamonoff Business
VRA 2012, Archival Collections Case Studies, The Artamonoff Business
 
Providing the On-Ramp to the Digital Public Library of America
Providing the On-Ramp to the Digital Public Library of AmericaProviding the On-Ramp to the Digital Public Library of America
Providing the On-Ramp to the Digital Public Library of America
 

Viewers also liked

International children's digital library by ann weeks
International children's digital library by ann weeksInternational children's digital library by ann weeks
International children's digital library by ann weeksŚląska Biblioteka Cyfrowa
 
Akoszowska chmura komorka_tablet
Akoszowska chmura komorka_tabletAkoszowska chmura komorka_tablet
Akoszowska chmura komorka_tabletAgnieszka Koszowska
 
[DCSB] Amiz Zeldes (HU, Berlin) "Towards Digital Coptic: Searching and Visual...
[DCSB] Amiz Zeldes (HU, Berlin) "Towards Digital Coptic: Searching and Visual...[DCSB] Amiz Zeldes (HU, Berlin) "Towards Digital Coptic: Searching and Visual...
[DCSB] Amiz Zeldes (HU, Berlin) "Towards Digital Coptic: Searching and Visual...Digital Classicist Seminar Berlin
 
Parker Keio 2011: Interoperable Manuscript Framework
Parker Keio 2011: Interoperable Manuscript FrameworkParker Keio 2011: Interoperable Manuscript Framework
Parker Keio 2011: Interoperable Manuscript FrameworkRobert Sanderson
 
Defragmenting Digitized Manuscripts Sources
Defragmenting Digitized Manuscripts SourcesDefragmenting Digitized Manuscripts Sources
Defragmenting Digitized Manuscripts SourcesDH Benelux
 
Digital Manuscripts Toolkit, using IIIF and JavaScript. Monica Messaggi Kaya
Digital Manuscripts Toolkit, using IIIF and JavaScript. Monica Messaggi KayaDigital Manuscripts Toolkit, using IIIF and JavaScript. Monica Messaggi Kaya
Digital Manuscripts Toolkit, using IIIF and JavaScript. Monica Messaggi KayaFuture Insights
 
Europeana Regia presentation at eChallenges 2011 conference
Europeana Regia presentation at eChallenges 2011 conferenceEuropeana Regia presentation at eChallenges 2011 conference
Europeana Regia presentation at eChallenges 2011 conferenceEuropeana Regia
 
Preservación digital en la BNE: necesidad de un panorama global. Isabel Borde...
Preservación digital en la BNE: necesidad de un panorama global. Isabel Borde...Preservación digital en la BNE: necesidad de un panorama global. Isabel Borde...
Preservación digital en la BNE: necesidad de un panorama global. Isabel Borde...Biblioteca Nacional de España
 
Presentation of Europeana Regia at "The Message of the Old Book in the New En...
Presentation of Europeana Regia at "The Message of the Old Book in the New En...Presentation of Europeana Regia at "The Message of the Old Book in the New En...
Presentation of Europeana Regia at "The Message of the Old Book in the New En...Europeana Regia
 
20120309 manuscript digitalheritage_firenze
20120309 manuscript digitalheritage_firenze20120309 manuscript digitalheritage_firenze
20120309 manuscript digitalheritage_firenzeStefan Gradmann
 
Digitization of Documentary Heritage Collections in Indic Language Comparativ...
Digitization of Documentary Heritage Collections in Indic LanguageComparativ...Digitization of Documentary Heritage Collections in Indic LanguageComparativ...
Digitization of Documentary Heritage Collections in Indic Language Comparativ...Anup Kumar Das
 
Ukad forum 2 march_2011_iams
Ukad forum 2 march_2011_iamsUkad forum 2 march_2011_iams
Ukad forum 2 march_2011_iamsWilliam Stockting
 
The Library as a Digital Research infrastructure: Digital Initiatives and Dig...
The Library as a Digital Research infrastructure: Digital Initiatives and Dig...The Library as a Digital Research infrastructure: Digital Initiatives and Dig...
The Library as a Digital Research infrastructure: Digital Initiatives and Dig...lorna_hughes
 
Manuscript digitisation
Manuscript digitisationManuscript digitisation
Manuscript digitisationSanjay Goel
 
MANUSCRIPT ACQUISITION
MANUSCRIPT ACQUISITIONMANUSCRIPT ACQUISITION
MANUSCRIPT ACQUISITIONMaude1
 
Culture Untapped: inspirational content & fresh ideas for your games
Culture Untapped: inspirational content & fresh ideas for your gamesCulture Untapped: inspirational content & fresh ideas for your games
Culture Untapped: inspirational content & fresh ideas for your gamesMilena Popova
 
Biblioteca Digital Hispánica: todas las opciones y funcionalidades para encon...
Biblioteca Digital Hispánica: todas las opciones y funcionalidades para encon...Biblioteca Digital Hispánica: todas las opciones y funcionalidades para encon...
Biblioteca Digital Hispánica: todas las opciones y funcionalidades para encon...Biblioteca Nacional de España
 
Shared Canvas presentation at the LIBER conference
Shared Canvas presentation at the LIBER conferenceShared Canvas presentation at the LIBER conference
Shared Canvas presentation at the LIBER conferenceMatthieu Bonicel
 

Viewers also liked (20)

International children's digital library by ann weeks
International children's digital library by ann weeksInternational children's digital library by ann weeks
International children's digital library by ann weeks
 
Akoszowska chmura komorka_tablet
Akoszowska chmura komorka_tabletAkoszowska chmura komorka_tablet
Akoszowska chmura komorka_tablet
 
[DCSB] Amiz Zeldes (HU, Berlin) "Towards Digital Coptic: Searching and Visual...
[DCSB] Amiz Zeldes (HU, Berlin) "Towards Digital Coptic: Searching and Visual...[DCSB] Amiz Zeldes (HU, Berlin) "Towards Digital Coptic: Searching and Visual...
[DCSB] Amiz Zeldes (HU, Berlin) "Towards Digital Coptic: Searching and Visual...
 
Expanding Horizons - Ideas into Practice
Expanding Horizons - Ideas into PracticeExpanding Horizons - Ideas into Practice
Expanding Horizons - Ideas into Practice
 
Parker Keio 2011: Interoperable Manuscript Framework
Parker Keio 2011: Interoperable Manuscript FrameworkParker Keio 2011: Interoperable Manuscript Framework
Parker Keio 2011: Interoperable Manuscript Framework
 
Defragmenting Digitized Manuscripts Sources
Defragmenting Digitized Manuscripts SourcesDefragmenting Digitized Manuscripts Sources
Defragmenting Digitized Manuscripts Sources
 
Digital Manuscripts Toolkit, using IIIF and JavaScript. Monica Messaggi Kaya
Digital Manuscripts Toolkit, using IIIF and JavaScript. Monica Messaggi KayaDigital Manuscripts Toolkit, using IIIF and JavaScript. Monica Messaggi Kaya
Digital Manuscripts Toolkit, using IIIF and JavaScript. Monica Messaggi Kaya
 
Europeana Regia presentation at eChallenges 2011 conference
Europeana Regia presentation at eChallenges 2011 conferenceEuropeana Regia presentation at eChallenges 2011 conference
Europeana Regia presentation at eChallenges 2011 conference
 
Preservación digital en la BNE: necesidad de un panorama global. Isabel Borde...
Preservación digital en la BNE: necesidad de un panorama global. Isabel Borde...Preservación digital en la BNE: necesidad de un panorama global. Isabel Borde...
Preservación digital en la BNE: necesidad de un panorama global. Isabel Borde...
 
Presentation of Europeana Regia at "The Message of the Old Book in the New En...
Presentation of Europeana Regia at "The Message of the Old Book in the New En...Presentation of Europeana Regia at "The Message of the Old Book in the New En...
Presentation of Europeana Regia at "The Message of the Old Book in the New En...
 
Maa
MaaMaa
Maa
 
20120309 manuscript digitalheritage_firenze
20120309 manuscript digitalheritage_firenze20120309 manuscript digitalheritage_firenze
20120309 manuscript digitalheritage_firenze
 
Digitization of Documentary Heritage Collections in Indic Language Comparativ...
Digitization of Documentary Heritage Collections in Indic LanguageComparativ...Digitization of Documentary Heritage Collections in Indic LanguageComparativ...
Digitization of Documentary Heritage Collections in Indic Language Comparativ...
 
Ukad forum 2 march_2011_iams
Ukad forum 2 march_2011_iamsUkad forum 2 march_2011_iams
Ukad forum 2 march_2011_iams
 
The Library as a Digital Research infrastructure: Digital Initiatives and Dig...
The Library as a Digital Research infrastructure: Digital Initiatives and Dig...The Library as a Digital Research infrastructure: Digital Initiatives and Dig...
The Library as a Digital Research infrastructure: Digital Initiatives and Dig...
 
Manuscript digitisation
Manuscript digitisationManuscript digitisation
Manuscript digitisation
 
MANUSCRIPT ACQUISITION
MANUSCRIPT ACQUISITIONMANUSCRIPT ACQUISITION
MANUSCRIPT ACQUISITION
 
Culture Untapped: inspirational content & fresh ideas for your games
Culture Untapped: inspirational content & fresh ideas for your gamesCulture Untapped: inspirational content & fresh ideas for your games
Culture Untapped: inspirational content & fresh ideas for your games
 
Biblioteca Digital Hispánica: todas las opciones y funcionalidades para encon...
Biblioteca Digital Hispánica: todas las opciones y funcionalidades para encon...Biblioteca Digital Hispánica: todas las opciones y funcionalidades para encon...
Biblioteca Digital Hispánica: todas las opciones y funcionalidades para encon...
 
Shared Canvas presentation at the LIBER conference
Shared Canvas presentation at the LIBER conferenceShared Canvas presentation at the LIBER conference
Shared Canvas presentation at the LIBER conference
 

Similar to Icms 2015 burrows

Maja Žumer: Library catalogues of the future: realising the old vision with n...
Maja Žumer: Library catalogues of the future: realising the old vision with n...Maja Žumer: Library catalogues of the future: realising the old vision with n...
Maja Žumer: Library catalogues of the future: realising the old vision with n...ÚISK FF UK
 
Publishers' Bindings Online and The Artistic, Cultural, and Historical Signif...
Publishers' Bindings Online and The Artistic, Cultural, and Historical Signif...Publishers' Bindings Online and The Artistic, Cultural, and Historical Signif...
Publishers' Bindings Online and The Artistic, Cultural, and Historical Signif...jessica666
 
Publishers' Bindings Online and The Artistic, Cultural, and Historical Signif...
Publishers' Bindings Online and The Artistic, Cultural, and Historical Signif...Publishers' Bindings Online and The Artistic, Cultural, and Historical Signif...
Publishers' Bindings Online and The Artistic, Cultural, and Historical Signif...jessica666
 
270102019_King_Baudouin_Foundation_Public_domain_day_BE_2019
270102019_King_Baudouin_Foundation_Public_domain_day_BE_2019270102019_King_Baudouin_Foundation_Public_domain_day_BE_2019
270102019_King_Baudouin_Foundation_Public_domain_day_BE_2019PACKED vzw
 
Tango on a Tightrope: Providing Access to Collections Through Symbiotic Partn...
Tango on a Tightrope: Providing Access to Collections Through Symbiotic Partn...Tango on a Tightrope: Providing Access to Collections Through Symbiotic Partn...
Tango on a Tightrope: Providing Access to Collections Through Symbiotic Partn...The Frick Collection
 
LIS698 Practicum Presentation Bieck
LIS698 Practicum Presentation BieckLIS698 Practicum Presentation Bieck
LIS698 Practicum Presentation Bieckbbieck
 
Digital Humanities and Linked Data
Digital Humanities and Linked DataDigital Humanities and Linked Data
Digital Humanities and Linked DataLeon Wessels
 
Discovering libraries's gold through collection-level descriptions
Discovering libraries's gold through collection-level descriptionsDiscovering libraries's gold through collection-level descriptions
Discovering libraries's gold through collection-level descriptionsValentine Charles
 
Enhancing undergraduate research and combating fake news
Enhancing undergraduate research and combating fake newsEnhancing undergraduate research and combating fake news
Enhancing undergraduate research and combating fake newsdoberhelman
 
Being Practical. Electronic editions of Flemish literary texts and documents ...
Being Practical. Electronic editions of Flemish literary texts and documents ...Being Practical. Electronic editions of Flemish literary texts and documents ...
Being Practical. Electronic editions of Flemish literary texts and documents ...Edward Vanhoutte
 
Links and Entities: The Library Data Revolution
Links and Entities: The Library Data RevolutionLinks and Entities: The Library Data Revolution
Links and Entities: The Library Data RevolutionOCLC
 
Ideas for how volunteers at cultural heritage institutions can help, using Tr...
Ideas for how volunteers at cultural heritage institutions can help, using Tr...Ideas for how volunteers at cultural heritage institutions can help, using Tr...
Ideas for how volunteers at cultural heritage institutions can help, using Tr...Rose Holley
 
Prodigious Histories - Stephen Brooks
Prodigious Histories - Stephen BrooksProdigious Histories - Stephen Brooks
Prodigious Histories - Stephen BrooksIncisive_Events
 
Patterns in scholarly publications online: Erdős and beyond
Patterns in scholarly publications online: Erdős and beyondPatterns in scholarly publications online: Erdős and beyond
Patterns in scholarly publications online: Erdős and beyondJonathan Bowen
 
The Rise of Data Publishing in the Digital World (and how Dataverse and DataT...
The Rise of Data Publishing in the Digital World (and how Dataverse and DataT...The Rise of Data Publishing in the Digital World (and how Dataverse and DataT...
The Rise of Data Publishing in the Digital World (and how Dataverse and DataT...Merce Crosas
 
Tanya Szrajber, The British Museum Collection Database
Tanya Szrajber, The British Museum Collection DatabaseTanya Szrajber, The British Museum Collection Database
Tanya Szrajber, The British Museum Collection DatabaseAndrew Prescott
 
Art discovery group catalogue: Usage, content and new horizons
Art discovery group catalogue:  Usage, content and new horizonsArt discovery group catalogue:  Usage, content and new horizons
Art discovery group catalogue: Usage, content and new horizonsJanifer Gatenby
 
Explore the hidden life of your objects ceramics and silver
Explore the hidden life of your objects   ceramics and silverExplore the hidden life of your objects   ceramics and silver
Explore the hidden life of your objects ceramics and silversarl2007
 

Similar to Icms 2015 burrows (20)

Maja Žumer: Library catalogues of the future: realising the old vision with n...
Maja Žumer: Library catalogues of the future: realising the old vision with n...Maja Žumer: Library catalogues of the future: realising the old vision with n...
Maja Žumer: Library catalogues of the future: realising the old vision with n...
 
Bibliotheca Digitalis Summer School: Bibliographic data – Definition, Structu...
Bibliotheca Digitalis Summer School: Bibliographic data – Definition, Structu...Bibliotheca Digitalis Summer School: Bibliographic data – Definition, Structu...
Bibliotheca Digitalis Summer School: Bibliographic data – Definition, Structu...
 
Hidden Collections
Hidden CollectionsHidden Collections
Hidden Collections
 
Publishers' Bindings Online and The Artistic, Cultural, and Historical Signif...
Publishers' Bindings Online and The Artistic, Cultural, and Historical Signif...Publishers' Bindings Online and The Artistic, Cultural, and Historical Signif...
Publishers' Bindings Online and The Artistic, Cultural, and Historical Signif...
 
Publishers' Bindings Online and The Artistic, Cultural, and Historical Signif...
Publishers' Bindings Online and The Artistic, Cultural, and Historical Signif...Publishers' Bindings Online and The Artistic, Cultural, and Historical Signif...
Publishers' Bindings Online and The Artistic, Cultural, and Historical Signif...
 
270102019_King_Baudouin_Foundation_Public_domain_day_BE_2019
270102019_King_Baudouin_Foundation_Public_domain_day_BE_2019270102019_King_Baudouin_Foundation_Public_domain_day_BE_2019
270102019_King_Baudouin_Foundation_Public_domain_day_BE_2019
 
Tango on a Tightrope: Providing Access to Collections Through Symbiotic Partn...
Tango on a Tightrope: Providing Access to Collections Through Symbiotic Partn...Tango on a Tightrope: Providing Access to Collections Through Symbiotic Partn...
Tango on a Tightrope: Providing Access to Collections Through Symbiotic Partn...
 
LIS698 Practicum Presentation Bieck
LIS698 Practicum Presentation BieckLIS698 Practicum Presentation Bieck
LIS698 Practicum Presentation Bieck
 
Digital Humanities and Linked Data
Digital Humanities and Linked DataDigital Humanities and Linked Data
Digital Humanities and Linked Data
 
Discovering libraries's gold through collection-level descriptions
Discovering libraries's gold through collection-level descriptionsDiscovering libraries's gold through collection-level descriptions
Discovering libraries's gold through collection-level descriptions
 
Enhancing undergraduate research and combating fake news
Enhancing undergraduate research and combating fake newsEnhancing undergraduate research and combating fake news
Enhancing undergraduate research and combating fake news
 
Being Practical. Electronic editions of Flemish literary texts and documents ...
Being Practical. Electronic editions of Flemish literary texts and documents ...Being Practical. Electronic editions of Flemish literary texts and documents ...
Being Practical. Electronic editions of Flemish literary texts and documents ...
 
Links and Entities: The Library Data Revolution
Links and Entities: The Library Data RevolutionLinks and Entities: The Library Data Revolution
Links and Entities: The Library Data Revolution
 
Ideas for how volunteers at cultural heritage institutions can help, using Tr...
Ideas for how volunteers at cultural heritage institutions can help, using Tr...Ideas for how volunteers at cultural heritage institutions can help, using Tr...
Ideas for how volunteers at cultural heritage institutions can help, using Tr...
 
Prodigious Histories - Stephen Brooks
Prodigious Histories - Stephen BrooksProdigious Histories - Stephen Brooks
Prodigious Histories - Stephen Brooks
 
Patterns in scholarly publications online: Erdős and beyond
Patterns in scholarly publications online: Erdős and beyondPatterns in scholarly publications online: Erdős and beyond
Patterns in scholarly publications online: Erdős and beyond
 
The Rise of Data Publishing in the Digital World (and how Dataverse and DataT...
The Rise of Data Publishing in the Digital World (and how Dataverse and DataT...The Rise of Data Publishing in the Digital World (and how Dataverse and DataT...
The Rise of Data Publishing in the Digital World (and how Dataverse and DataT...
 
Tanya Szrajber, The British Museum Collection Database
Tanya Szrajber, The British Museum Collection DatabaseTanya Szrajber, The British Museum Collection Database
Tanya Szrajber, The British Museum Collection Database
 
Art discovery group catalogue: Usage, content and new horizons
Art discovery group catalogue:  Usage, content and new horizonsArt discovery group catalogue:  Usage, content and new horizons
Art discovery group catalogue: Usage, content and new horizons
 
Explore the hidden life of your objects ceramics and silver
Explore the hidden life of your objects   ceramics and silverExplore the hidden life of your objects   ceramics and silver
Explore the hidden life of your objects ceramics and silver
 

Icms 2015 burrows

  • 1. The Provenance and History of the Manuscripts formerly in the Phillipps Collection Department of Digital HumanitiesToby Burrows
  • 2. The Phillipps manuscript collection • Phillipps’ own printed catalogue (1837-1871) goes up to no. 23,837 • Thomas Fitzroy Fenwick (grandson, d. 1938) spent fifty years reorganizing and renumbering: up to no. 38,628 • Fenwick’s estimate of the total was close to 60,000 volumes and individual documents • Phillipps also owned 50,000 books, as well as many prints, photographs, drawings and paintings Sir Thomas Phillipps (1792-1872)
  • 4. Re-creation of Phillipps’ shelves, Grolier Club
  • 5. Dispersal of the collection Fenwick family (1886-1945): • Sales to interested libraries and governments (Germany, Belgium, Netherlands, France, Ireland, Wales) – more than 2,500 items • Auctions at Sotheby’s, 1886 to 1938 – 22 auctions, more than 22,000 lots, raised £97,000 (over £30 million) • Residue (12,000 items) sold to the Robinson brothers in 1945 for £100,000 (£11-12 million) W.H. Robinson Ltd (1945-1958): •Series of sale catalogues, 1945-1954 •Donation to the Bodleian Library of the remaining materials, 1958 Sotheby’s (1946-1950, 1965-1977): •Series of sale catalogues
  • 6. Data sources Source Format Comments Schoenberg Database of Manuscripts Relational database Incorporates other sources, esp. sales catalogues 6,000 Phillipps MSS; 20,000 Phillipps events Library catalogues (BL, KB etc.) Relational databases Generally MARC records Provenance in notes Export can be awkward Union catalogues Relational databases Printed bibliographies Formats vary Coverage varies Export can be awkward Sale catalogues Printed books (some digitized) Online sources (PDFs, Web sites) Many included in Schoenberg MSS in ABE, eBay etc. Phillipps catalogues and lists Printed book; Partly digitized Supplemented by handwritten notes Partly included in Schoenberg Handwritten notes not digitized Phillipps provenance indexes (BL, IRHT) Handwritten; Not digitized Arranged by Phillipps number No longer updated Annotated sales catalogues & printed catalogues Handwritten; Not digitized Researchers (Munby), owners (Phillipps), auctioneers (Sotheby’s) Held in Cambridge UL, Bodleian, BL
  • 7.
  • 8.
  • 9. Project summary • Two main research questions: – The history and significant characteristics of the transmission of a major group of European manuscripts between collections and collectors over the centuries (provenance) – The applicability and value of Linked Data technologies as a methodology for the large-scale analysis of the history of cultural objects and collections (“network archaeology”) • Project plan – Ingest the data; transform them to a common Data Model; represent them computationally; analyse and visualize them; make them available to other researchers • Tools – Excel, OpenRefine, Neo4j, Nodegoat, visualization tools
  • 10. In 1862, Sir Thomas Phillipps bought Phillipps MS 16402 in London as part of the Sotheby’s sale of the collection of Guglielmo Libri. London 1862 MS16402 Libri Phillipps Sotheby’s
  • 11. Neo4j: graph database • Nodes and relationships (each with properties) • Various tools for data import • Cypher query language for creating nodes, relationships and properties • Cypher is also used to run queries, analyse paths, count and list • No schema as such – develop and define as you go • Own visualization interface, but also works with others • Data export – JSON
  • 12. Neo4j Data Model – nodes (entities) Node (entity: label) Type Properties AGENT Person Organization name OBJECT Manuscript id title foliation layout binding illustration WORK Text Description Exhibition title incipit language PUBLICATION Catalogue Book Article title
  • 13. Neo4j Data Model – relationships Relationship Properties GAVE SOLD CONSIGNED OWNS ACQUIRED date id certitude price PRODUCED date certitude CONTAINS locus SAME_AS certitude Relationship Properties COMPOSED TRANSLATED COMPILED date certitude ANNOTATED INSCRIBED date locus certitude DESCRIBED_IN date item no. DESCRIBED_AS date item no.
  • 14. Neo4j Data Model – relationship statements Node Relationship Node AGENT: Person GAVE OBJECT: Manuscript AGENT: Organization SOLD OBJECT: Manuscript OBJECT: Manuscript CONTAINS WORK: Text AGENT: Person COMPOSED WORK: Text PUBLICATION: Catalogue CONTAINS WORK: Description OBJECT: Manuscript DESCRIBED_AS WORK: Description AGENT: Organization PRODUCED PUBLICATION: Catalogue WORK: Exhibition DESCRIBED_IN PUBLICATION: Catalogue
  • 15.
  • 16.
  • 17.
  • 18.
  • 19. DATA MODEL – Nodegoat Object Sub-objects Related to: PERSON Nationality (country) Manuscript Text Catalogue ORGANIZATION Location (city; country) Manuscript Text Catalogue MANUSCRIPT Sold Donated Owned Described In Produced Contents Person/Organization: Agent, Owner, Buyer, Donor, Recipient, Scribe, Artist, Producer Location (city; country) Catalogue Text TEXT Person: Author Manuscript CATALOGUE Organization: Publisher Person: Compiler Manuscript
  • 20.
  • 21.
  • 22.
  • 23.
  • 24.
  • 25.
  • 26.
  • 27. Current status of the project • Data imported: selections from Schoenberg data, other sample data • Data Model: theoretical work + working versions • Demo versions of Neo4j and Nodegoat databases • Tested and documented queries, analyses and visualizations • To come: – Adding much more data in a production environment (Nodegoat) – Carrying out more extensive visualizations and analyses • Across the whole collection • In relation to specific “use cases” – Exporting data for reuse by other researchers
  • 28. Dr Toby Burrows Marie Curie Fellow Department of Digital Humanities King’s College London 26-29 Drury Lane London WC2B 5RL toby.burrows@kcl.ac.uk @tobyburrows tobyburrows.wordpress.com

Editor's Notes

  1. I’m going to talk about the manuscript collection of Sir Thomas Phillipps, one of the great 19th-century collectors I’ll be reporting on a European Union project aimed at reconstructing the Phillipps collection, and about the ways in which I’m using new technologies to achieve this
  2. The size of the collection – almost certainly the biggest private collection ever assembled; bigger than most (all?) public collections Phillipps was not just a collector of manuscripts
  3. Phillipps was buying at a good time – many private libraries came on the market in 1820s and 1830s especially A period when prices for manuscripts rose quite sharply in Britain – Phillipps played a significant part in this price rise
  4. Phillipps was the illegitimate son and sole heir of a wealthy Manchester mill owner After filling his stately home at Middle Hill (Gloucestershire) with his collection, he then moved it all to Thirlestaine House in Cheltenham in 1864 This gives some idea of the profusion of the collection Grolier Club, New York – these are actual documents from the Phillipps Collection
  5. The collection was inherited by one of Phillipps’ daughters and her husband Most of the dispersal was managed by Phillipps’ grandson Thomas Fitzroy Fenwick (died in 1938) Robinson brothers’ sales were followed by a series of sales by other antiquarian dealers, especially Sotheby’s, through to the mid-1970s Still documents advertised for sale on sites like ABE Books today
  6. Today, a wide variety of sources of information about the Phillipps manuscripts Produced for different purposes, in very varied formats – some in digital form, others not digitized No comprehensive list of Phillipps manuscripts; no consolidated source of information about their history
  7. SDM includes almost 20,000 transactions relating to Phillipps manuscripts – about 6,000 of the manuscripts are covered Assembling the data for my project – started with a CSV export from the Schoenberg Database, filtered for Phillipps transactions
  8. Other data sources are more difficult to make use of This is a page from a list of some of the Phillipps MSS, made for probate purposes – note the alterations and revisions, and the short descriptions There are two different hand-written versions of this list, with slightly different coverage This one is in the Grolier Club Library in New York (the other is in the Bodleian Library)
  9. Two aspects of the project – firstly, the history of the Phillipps Collection: study of the provenance of the manuscripts on a large scale Looking particularly at the patterns of relationships between the manuscripts and the people and organizations involved in their history – both individually and collectively Secondly, the use of digital tools and data modelling methodologies to represent these patterns, and to serve as a basis for visualization and analysis
  10. Look at data modelling first There are a variety of ways of representing provenance in a computational setting – none is entirely satisfactory I went back to a basic conceptual model Here is a typical provenance event statement Here is a simple conceptual model of this event showing entities (nouns in blue) linked by their actions or roles (verbs in red) + properties or attributes of these actions (orange) – places and dates when they occurred
  11. I then looked for software which managed data in a way which was similar to that kind of conceptual model A graph database like Neo4j can show provenance events as a series of nodes and relationships which correspond to the nouns and verbs in that model Neo4j enables path analysis and pattern matching – not just quantitatively, but also by looking for specific chains of relationships
  12. I then had to try and develop a detailed Data Model using the Neo4j notation These are the basic entities (nodes/labels) + types + some key properties “Object” is primarily a physical entity: the manuscript volume itself “Work” is a conceptual entity – the text carried by a manuscript, or the description of a manuscript contained in a catalogue (like FRBR’s Work)
  13. These are the basic relationships (verbs) + some key properties The properties relate to the action, not to the entities involved in the action Not just ownership-related transactions – also want to include description-related transactions
  14. Here are some sample relationship statements expressed in Neo4j notation
  15. This shows Phillipps MSS now owned by Columbia University, the Morgan Library and some other US libraries, together with their donors: George Plimpton, William S. Glazier and others Data from the Schoenberg Database
  16. And this shows some manuscripts which were once owned by both Phillipps and Chester Beatty, and sold by Sotheby’s I’ve expanded the network for Phillipps 12283, so it also shows the works contained in this manuscript, and their authors, as well as a catalogue description for it (Data from the Schoenberg Database)
  17. A few screen shots from my small Neo4j graph database using Neo4j’s own visualization interface This shows the Phillipps MSS now owned by the Royal Library in The Hague, with their Phillipps numbers and the titles of the works they contain (data from the KB catalogue)
  18. But I ran into significant limitations with Neo4j – both in the way it handles data modelling, and in its capacity for visualization and analysis So I’ve also been testing an alternative approach using software called Nodegoat Developed in the Netherlands at the University of Amsterdam
  19. Nodegoat uses a structure based on types of objects, which can each have sub-objects The sub-objects can serve as event clusters, as you can see from my data model “Manuscript” is the central object; its sub-objects are mostly event types The sub-objects can include links to related objects – especially people and organizations, who play different roles depending on the type of event
  20. I currently have a test data collection in Nodegoat, involving 100 Phillipps manuscripts and about 250 provenance transactions (data from the Schoenberg Database) Here is an example of a manuscript object – its description is the top half Its associated sub-objects are summarized in the lower half This MS has three “Sold” sub-objects, as well as “Owned”, “Produced” and “Contents” (a link to the text it contains) Produced in 1580 in the UK, then sold three times between 1815 and 1967, owned by Yale University in 2010
  21. Nodegoat has interesting visualization interfaces – geographical, social networks, and chronological This is the geographical visualization interface, showing how manuscripts have moved over the centuries, both around Europe and to the United States This is for the whole of the sample dataset (200 MSS) – you can also limit the visualization to specific manuscripts or groups of manuscripts
  22. A closer look at this map - you can see the cities where manuscripts were produced (in purple) The lines reflect movements due to sales (in blue) or donations (in orange) or other changes in ownership (red)
  23. You can also use the time slider to see how the pattern of movement changed over time This is the picture up to 1937 – only a couple of MSS in the dataset had moved to the United States before that date
  24. Here is the chronological visualization – showing the relative numbers of sales since 1750 (in blue/purple)
  25. And finally, the social visualization showing the network of connections in the sample dataset of 100 Phillipps manuscripts Shows the centrality of Sir Thomas Phillipps (big red circle), Sotheby’s (green) and France (white) as a place of origin Also a time slider to view the changes in the network over time
  26. Can zoom in to inspect each node and each relationship Here’s a magnification of part of the graph The three major nodes are Sotheby’s (green: 54 sales transactions), Phillipps (red: 47 sales transactions) and France (white: where 32 of the manuscripts were originally produced)
  27. Conclude with a summary of the project to date My aim is to ingest as much data as possible during the project, and to make the platform available for addition of further data in the future by other researchers Effectively building an information system about the Phillipps manuscripts, while also developing generic models for the provenance of cultural heritage objects