• It has been calculated that the books in the Library of
Congress represent 235 terabytes of data
• In 1994, at a conference on data management at ULCC, I
heard the word ‘petabyte’ for the first time: ‘the problems
which confront the meteorologist today will worry the arts
and humanities in ten years’ time’
• A petabyte equals approximately four Libraries of Congress
• The Large Hadron Collider generates one petabyte of data
every second (which would cost $1 trillion a year to store)
• The Square Kilometre Array will generate 4 petabytes of
data a second
• A new word for me in 2013: zettabyte (the equivalent of 250
billion DVDs)
• By 2020, the digital universe will be 35 zettabytes (44 times as
big as in 2009)
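
The figures above can be cross-checked with a little arithmetic. The sketch below assumes decimal units (1 TB = 10^12 bytes, and so on), which the slides do not state; the function and variable names are illustrative only.

```python
# Decimal storage units (an assumption; the slides do not say decimal or binary).
TB = 10**12
PB = 10**15
ZB = 10**21

# Figure quoted on the slide: books in the Library of Congress.
LIBRARY_OF_CONGRESS = 235 * TB


def libraries_of_congress(n_bytes):
    """Express a data volume as a multiple of the Library of Congress figure."""
    return n_bytes / LIBRARY_OF_CONGRESS


def implied_2009_size_zb(size_2020_zb=35, growth_factor=44):
    """Size of the digital universe in 2009 implied by the 2020 projection."""
    return size_2020_zb / growth_factor


if __name__ == "__main__":
    print(round(libraries_of_congress(PB), 2))   # about four per petabyte
    print(round(implied_2009_size_zb(), 2))      # zettabytes in 2009
```

On these assumptions a petabyte comes to a little over four Libraries of Congress, and the 2020 projection implies a 2009 digital universe of a little under one zettabyte.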
Is there big data in the arts and
humanities?
Letter of Gladstone to
Disraeli, 1878: British Library, Add.
MS. 44457, f. 166
The political and literary
papers of Gladstone
preserved in the British
Library comprise 762
volumes containing
approx. 160,000
documents.
George W. Bush Presidential Library:
200 million e-mails
4 million photographs
• FamilySearch
  - Genealogy information from around the world
  - 2.8 billion names; 300 million added every year
  - 560 million digital images
  - 10 million hits per day; 7,000 transactions per minute
  - Digitising microfilm to JPEG2000
  - Digitised in Utah, preserved in Maryland
  - Initial storage: 7 PB
  - Ingest rate: 20 TB per day, rising to 50 TB per day
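
To give a rough sense of scale for the FamilySearch figures, the sketch below works out how long the quoted ingest rates would take to fill the initial storage, and what the per-minute transaction rate amounts to over a day. It assumes decimal units (1 PB = 1,000 TB) and a constant rate, neither of which the slide states; the function names are illustrative.

```python
TB_PER_PB = 1000  # decimal units assumed


def days_to_fill(storage_pb, ingest_tb_per_day):
    """Days needed to fill the given storage at a constant ingest rate."""
    return storage_pb * TB_PER_PB / ingest_tb_per_day


def transactions_per_day(per_minute):
    """Scale a per-minute transaction rate up to a full day."""
    return per_minute * 60 * 24


if __name__ == "__main__":
    print(days_to_fill(7, 20))         # days at the initial ingest rate
    print(days_to_fill(7, 50))         # days at the higher ingest rate
    print(transactions_per_day(7000))  # transactions in 24 hours
```

At 20 TB per day the initial 7 PB would fill in just under a year; at 50 TB per day, in under five months. The 7,000 transactions per minute work out at about 10 million a day, in line with the hits figure quoted.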
Anglo-American Legal Tradition (http://aalt.law.uh.edu) contains
over 8,500,000 images of the major series of legal records for
medieval and early modern England, but how do we navigate,
annotate and share these materials?
Credit: Mark Basham, Diamond,
Raw images
(2,600 X 4,000 px each image/projection)
(Total: 6,000 images/projections)
Sinograms
from the projections
(Total: 4,000 sinograms
Each sinogram: 2,600 x 4,000 x 1 px)
A sinogram
Reconstructed slices
in a 3d volume
(Total: 4,000 slices)
2,600 px
4,000 px
6,000 projections
A slice
Tomographic Reconstruction
~100 GB per 3D image - ~40 mins on a 16-GPU cluster
~10 TB per experiment - ~3 days on site
~1 PB per year (per beamline)
Working on using the Emerald (376 GPUs)
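
The tomography volumes quoted above can be roughly reproduced from the image dimensions. This is a back-of-envelope sketch, not Diamond's own calculation: the 2 bytes per pixel (16-bit detector output) and the decimal units are assumptions, and the names are illustrative.

```python
GB = 10**9
TB = 10**12
PB = 10**15


def raw_scan_bytes(projections, width_px, height_px, bytes_per_pixel):
    """Uncompressed size of one full set of projections."""
    return projections * width_px * height_px * bytes_per_pixel


def images_per_experiment(experiment_bytes=10 * TB, image_bytes=100 * GB):
    """Number of 3D images implied by the per-experiment volume."""
    return experiment_bytes / image_bytes


def reconstruction_days(n_images, minutes_per_image=40):
    """Total reconstruction time if images are processed one after another."""
    return n_images * minutes_per_image / (60 * 24)


if __name__ == "__main__":
    # 6,000 projections of 2,600 x 4,000 px; 2 bytes per pixel is an assumption.
    raw = raw_scan_bytes(6000, 2600, 4000, bytes_per_pixel=2)
    print(raw / GB)                                # GB per raw scan
    n = images_per_experiment()
    print(n)                                       # 3D images per experiment
    print(round(reconstruction_days(n), 2))        # days of reconstruction
    print(PB / (10 * TB))                          # experiments per PB per year
```

Under these assumptions one raw scan is about 125 GB, the same order as the ~100 GB quoted per 3D image. An experiment of ~10 TB then corresponds to roughly 100 such images, which at ~40 minutes each is just under three days of reconstruction time.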
Use of Diamond Light Source to image
fragile manuscripts
Use of Diamond Light Source to investigate
silver degradation in altarpieces
Laser Scans of St Kilda and Rosslyn Chapel, produced by Digital Design Studio, Glasgow
School of Art, as part of the ‘Scottish Ten’ project. Scans such as these produce data sets
containing billions of points, amounting to many terabytes. Such data can have great
cultural and economic value.
http://worldservice.prototyping.bbc.co.uk/
Big Data in the Humanities
• Frequently (but not always) heterogeneous, comprising different
layers of information: comparable to climate and environmental
data
• Extremely varied in format, from linguistic corpora to images and
video. Navigation and processing are major issues, even when the
material is in digital format
• Nevertheless strong common characteristics with big data issues
in other disciplines and scope for shared dialogue
• One characteristic of ‘big data’ hype is the use of predictive analytics.
Can predictive analytics work with some arts and humanities
data (e.g. sound)?
• But data-driven research (as opposed to hypothesis-driven
research) is potentially important in arts and humanities
• Arts and humanities can potentially help in dealing with big data
by new interpretative approaches
Lise Autogena and Joshua Portway, Most Blue Skies, 2010: http://www.autogena.org/mbs.html
Fabio Lattanzi Antinori, The Obelisk (2012): Open Data Institute
Scanning online news websites in real-time, the artwork changes from
opaque to transparent, according to the quantity of references to the four
main crimes against peace (genocide, crimes against humanity, crimes of
aggression and crimes of war)
Available at: http://mith.umd.edu/sharedhorizons/resources/
Parametric Modeling Quantitatively Maps Single Cell Protein
Levels to Individual Qualitative Components
Big Data in the Arts and Humanities
Big Data in the Arts and Humanities

More Related Content

What's hot

AHRC CDP Digital Humanities 101
AHRC CDP Digital Humanities 101  AHRC CDP Digital Humanities 101
AHRC CDP Digital Humanities 101
Digital Research and Curator Team @ British Library
 
Introduction for skills seminar on Search and Data Mining, Master of European...
Introduction for skills seminar on Search and Data Mining, Master of European...Introduction for skills seminar on Search and Data Mining, Master of European...
Introduction for skills seminar on Search and Data Mining, Master of European...
Gerben Zaagsma
 
Bridging Digital Humanities Research and Big Data Repositories of Digital Text
Bridging Digital Humanities Research and Big Data Repositories of Digital TextBridging Digital Humanities Research and Big Data Repositories of Digital Text
Bridging Digital Humanities Research and Big Data Repositories of Digital Text
Beth Plale
 
Research Data Management for the Humanities and Social Sciences
Research Data Management for the Humanities and Social SciencesResearch Data Management for the Humanities and Social Sciences
Research Data Management for the Humanities and Social Sciences
Martin Donnelly
 
Plale HathiTrust El Colegio de Mexico May2014
Plale HathiTrust El Colegio de Mexico May2014Plale HathiTrust El Colegio de Mexico May2014
Plale HathiTrust El Colegio de Mexico May2014
Beth Plale
 
The Secret Life of Data
The Secret Life of DataThe Secret Life of Data
The HathiTrust Research Center: Big Data Analytics in a Secure Data Framework
The HathiTrust Research Center: Big Data Analytics in a Secure Data FrameworkThe HathiTrust Research Center: Big Data Analytics in a Secure Data Framework
The HathiTrust Research Center: Big Data Analytics in a Secure Data Framework
Robert H. McDonald
 
Dh presentation 2018
Dh presentation 2018Dh presentation 2018
Dh presentation 2018
University of Cape Town
 
Ebi
EbiEbi
Digital Humanities, Big Data, and New Research Methods
Digital Humanities, Big Data, and New Research MethodsDigital Humanities, Big Data, and New Research Methods
Digital Humanities, Big Data, and New Research Methods
lorna_hughes
 
Making Small Data BIG (UT Austin, March 2016)
Making Small Data BIG (UT Austin, March 2016)Making Small Data BIG (UT Austin, March 2016)
Making Small Data BIG (UT Austin, March 2016)
Kerstin Lehnert
 
Translating Databased Meaning
Translating Databased MeaningTranslating Databased Meaning
International Collaboration Networks in the Emerging (Big) Data Science
International Collaboration Networks in the Emerging (Big) Data ScienceInternational Collaboration Networks in the Emerging (Big) Data Science
International Collaboration Networks in the Emerging (Big) Data Science
datasciencekorea
 
Materiality and the digital archive
Materiality and the digital archiveMateriality and the digital archive
Materiality and the digital archive
Jisc
 
Dh presentation 2019
Dh presentation 2019Dh presentation 2019
Dh presentation 2019
University of Cape Town
 
Rethink research, illuminate history with the British Library
Rethink research, illuminate history with the British LibraryRethink research, illuminate history with the British Library
Rethink research, illuminate history with the British Library
Mia
 
Digital Research in the Arts and Humanities: some thoughts on what, why, and ...
Digital Research in the Arts and Humanities: some thoughts on what, why, and ...Digital Research in the Arts and Humanities: some thoughts on what, why, and ...
Digital Research in the Arts and Humanities: some thoughts on what, why, and ...
James Baker
 
Digital Research in the Arts and Humanities: some thoughts on what, why, and ...
Digital Research in the Arts and Humanities: some thoughts on what, why, and ...Digital Research in the Arts and Humanities: some thoughts on what, why, and ...
Digital Research in the Arts and Humanities: some thoughts on what, why, and ...
James Baker
 
Managing and Sharing Research Data: Good practices for an ideal world...in th...
Managing and Sharing Research Data: Good practices for an ideal world...in th...Managing and Sharing Research Data: Good practices for an ideal world...in th...
Managing and Sharing Research Data: Good practices for an ideal world...in th...
Martin Donnelly
 
Content Mining for Machines and Humans
Content Mining for Machines and HumansContent Mining for Machines and Humans
Content Mining for Machines and Humans
petermurrayrust
 

What's hot (20)

AHRC CDP Digital Humanities 101
AHRC CDP Digital Humanities 101  AHRC CDP Digital Humanities 101
AHRC CDP Digital Humanities 101
 
Introduction for skills seminar on Search and Data Mining, Master of European...
Introduction for skills seminar on Search and Data Mining, Master of European...Introduction for skills seminar on Search and Data Mining, Master of European...
Introduction for skills seminar on Search and Data Mining, Master of European...
 
Bridging Digital Humanities Research and Big Data Repositories of Digital Text
Bridging Digital Humanities Research and Big Data Repositories of Digital TextBridging Digital Humanities Research and Big Data Repositories of Digital Text
Bridging Digital Humanities Research and Big Data Repositories of Digital Text
 
Research Data Management for the Humanities and Social Sciences
Research Data Management for the Humanities and Social SciencesResearch Data Management for the Humanities and Social Sciences
Research Data Management for the Humanities and Social Sciences
 
Plale HathiTrust El Colegio de Mexico May2014
Plale HathiTrust El Colegio de Mexico May2014Plale HathiTrust El Colegio de Mexico May2014
Plale HathiTrust El Colegio de Mexico May2014
 
The Secret Life of Data
The Secret Life of DataThe Secret Life of Data
The Secret Life of Data
 
The HathiTrust Research Center: Big Data Analytics in a Secure Data Framework
The HathiTrust Research Center: Big Data Analytics in a Secure Data FrameworkThe HathiTrust Research Center: Big Data Analytics in a Secure Data Framework
The HathiTrust Research Center: Big Data Analytics in a Secure Data Framework
 
Dh presentation 2018
Dh presentation 2018Dh presentation 2018
Dh presentation 2018
 
Ebi
EbiEbi
Ebi
 
Digital Humanities, Big Data, and New Research Methods
Digital Humanities, Big Data, and New Research MethodsDigital Humanities, Big Data, and New Research Methods
Digital Humanities, Big Data, and New Research Methods
 
Making Small Data BIG (UT Austin, March 2016)
Making Small Data BIG (UT Austin, March 2016)Making Small Data BIG (UT Austin, March 2016)
Making Small Data BIG (UT Austin, March 2016)
 
Translating Databased Meaning
Translating Databased MeaningTranslating Databased Meaning
Translating Databased Meaning
 
International Collaboration Networks in the Emerging (Big) Data Science
International Collaboration Networks in the Emerging (Big) Data ScienceInternational Collaboration Networks in the Emerging (Big) Data Science
International Collaboration Networks in the Emerging (Big) Data Science
 
Materiality and the digital archive
Materiality and the digital archiveMateriality and the digital archive
Materiality and the digital archive
 
Dh presentation 2019
Dh presentation 2019Dh presentation 2019
Dh presentation 2019
 
Rethink research, illuminate history with the British Library
Rethink research, illuminate history with the British LibraryRethink research, illuminate history with the British Library
Rethink research, illuminate history with the British Library
 
Digital Research in the Arts and Humanities: some thoughts on what, why, and ...
Digital Research in the Arts and Humanities: some thoughts on what, why, and ...Digital Research in the Arts and Humanities: some thoughts on what, why, and ...
Digital Research in the Arts and Humanities: some thoughts on what, why, and ...
 
Digital Research in the Arts and Humanities: some thoughts on what, why, and ...
Digital Research in the Arts and Humanities: some thoughts on what, why, and ...Digital Research in the Arts and Humanities: some thoughts on what, why, and ...
Digital Research in the Arts and Humanities: some thoughts on what, why, and ...
 
Managing and Sharing Research Data: Good practices for an ideal world...in th...
Managing and Sharing Research Data: Good practices for an ideal world...in th...Managing and Sharing Research Data: Good practices for an ideal world...in th...
Managing and Sharing Research Data: Good practices for an ideal world...in th...
 
Content Mining for Machines and Humans
Content Mining for Machines and HumansContent Mining for Machines and Humans
Content Mining for Machines and Humans
 

Viewers also liked

From DARPA to Shakespeare: All the Data we Can Handle
From DARPA to Shakespeare: All the Data we Can Handle From DARPA to Shakespeare: All the Data we Can Handle
From DARPA to Shakespeare: All the Data we Can Handle
Kimberly Hoffman
 
Interdisciplinarity
InterdisciplinarityInterdisciplinarity
Interdisciplinarity
Andrew Prescott
 
Does My Project Look Big in This?
Does My Project Look Big in This?Does My Project Look Big in This?
Does My Project Look Big in This?
Andrew Prescott
 
Prescottyork
PrescottyorkPrescottyork
Prescottyork
Andrew Prescott
 
AHRC Digital Transformations
AHRC Digital TransformationsAHRC Digital Transformations
AHRC Digital Transformations
Andrew Prescott
 
Being Transformed
Being TransformedBeing Transformed
Being Transformed
Andrew Prescott
 
Alternative Postgraduate Careers
Alternative Postgraduate CareersAlternative Postgraduate Careers
Alternative Postgraduate Careers
Andrew Prescott
 
Prescottleicesterquad
PrescottleicesterquadPrescottleicesterquad
Prescottleicesterquad
Andrew Prescott
 

Viewers also liked (8)

From DARPA to Shakespeare: All the Data we Can Handle
From DARPA to Shakespeare: All the Data we Can Handle From DARPA to Shakespeare: All the Data we Can Handle
From DARPA to Shakespeare: All the Data we Can Handle
 
Interdisciplinarity
InterdisciplinarityInterdisciplinarity
Interdisciplinarity
 
Does My Project Look Big in This?
Does My Project Look Big in This?Does My Project Look Big in This?
Does My Project Look Big in This?
 
Prescottyork
PrescottyorkPrescottyork
Prescottyork
 
AHRC Digital Transformations
AHRC Digital TransformationsAHRC Digital Transformations
AHRC Digital Transformations
 
Being Transformed
Being TransformedBeing Transformed
Being Transformed
 
Alternative Postgraduate Careers
Alternative Postgraduate CareersAlternative Postgraduate Careers
Alternative Postgraduate Careers
 
Prescottleicesterquad
PrescottleicesterquadPrescottleicesterquad
Prescottleicesterquad
 

Similar to Big Data in the Arts and Humanities

Big Data: Some Initial Reflectons
Big Data: Some Initial ReflectonsBig Data: Some Initial Reflectons
Big Data: Some Initial Reflectons
Andrew Prescott
 
Prescottbigdata
PrescottbigdataPrescottbigdata
Prescottbigdata
Andrew Prescott
 
nternational Biodiversity Projects and Natural History Museums: Current stat...
nternational Biodiversity Projects and Natural History Museums:  Current stat...nternational Biodiversity Projects and Natural History Museums:  Current stat...
nternational Biodiversity Projects and Natural History Museums: Current stat...
Klaus Riede
 
Introduction
IntroductionIntroduction
Introduction
Kinnudj Amee
 
Leslie Johnston: Big Data at Libraries, Georgetown University Law School Symp...
Leslie Johnston: Big Data at Libraries, Georgetown University Law School Symp...Leslie Johnston: Big Data at Libraries, Georgetown University Law School Symp...
Leslie Johnston: Big Data at Libraries, Georgetown University Law School Symp...
lljohnston
 
Why Big Data Will Survive the Hype - and Change the Way We Work
Why Big Data Will Survive the Hype - and Change the Way We WorkWhy Big Data Will Survive the Hype - and Change the Way We Work
Why Big Data Will Survive the Hype - and Change the Way We Work
Blair Reeves
 
Massive Scale Librarianship
Massive Scale LibrarianshipMassive Scale Librarianship
Massive Scale Librarianship
rdlankes
 
digital Preservation
digital Preservationdigital Preservation
digital Preservation
budhisantoso14
 
"Toward Sustainability: "Margin" and "Mission" in the Natural History Setting...
"Toward Sustainability: "Margin" and "Mission" in the Natural History Setting..."Toward Sustainability: "Margin" and "Mission" in the Natural History Setting...
"Toward Sustainability: "Margin" and "Mission" in the Natural History Setting...
Tom Moritz
 
Converging on the Universal Library: From Memex to Googolplex
Converging on the Universal Library: From Memex to GoogolplexConverging on the Universal Library: From Memex to Googolplex
Converging on the Universal Library: From Memex to Googolplex
Martin Kalfatovic
 
World Archaeology Congress paper
World Archaeology Congress paperWorld Archaeology Congress paper
World Archaeology Congress paper
dejp3
 
Redefining Libraries
Redefining LibrariesRedefining Libraries
Redefining Libraries
Peter Brantley
 
Digital Research at the British Library, by Stella Wisdom
Digital Research at the British Library, by Stella WisdomDigital Research at the British Library, by Stella Wisdom
Digital Research at the British Library, by Stella Wisdom
Digital Research and Curator Team @ British Library
 
iSGTW - What it is
iSGTW - What it isiSGTW - What it is
iSGTW - What it is
iSGTW
 
I am Library: an ode to self-discovery and collective creativity in Second Li...
I am Library: an ode to self-discovery and collective creativity in Second Li...I am Library: an ode to self-discovery and collective creativity in Second Li...
I am Library: an ode to self-discovery and collective creativity in Second Li...
Bernadette Daly Swanson
 
Digital librarianship - BIALL/CLSIG/SLA Europe Open Day
Digital librarianship - BIALL/CLSIG/SLA Europe Open DayDigital librarianship - BIALL/CLSIG/SLA Europe Open Day
Digital librarianship - BIALL/CLSIG/SLA Europe Open Day
Simon Bowie
 
April 2012 Wolverine Caucus Event Featuring Paul N. Courant
April 2012 Wolverine Caucus Event Featuring Paul N. CourantApril 2012 Wolverine Caucus Event Featuring Paul N. Courant
April 2012 Wolverine Caucus Event Featuring Paul N. Courant
University of Michigan, Government Relations
 
Artistic Practice and The Archive
Artistic Practice and The ArchiveArtistic Practice and The Archive
Artistic Practice and The Archive
Andrew Prescott
 
Following Google: Don’t Follow the Followers, Follow the Leaders
Following Google: Don’t Follow the Followers, Follow the LeadersFollowing Google: Don’t Follow the Followers, Follow the Leaders
Following Google: Don’t Follow the Followers, Follow the Leaders
C4Media
 
Dutch Book Trade 1660-1750: using the STCN to gain insight in publishers’ str...
Dutch Book Trade 1660-1750: using the STCN to gain insight in publishers’ str...Dutch Book Trade 1660-1750: using the STCN to gain insight in publishers’ str...
Dutch Book Trade 1660-1750: using the STCN to gain insight in publishers’ str...
Wouter Beek
 

Similar to Big Data in the Arts and Humanities (20)

Big Data: Some Initial Reflectons
Big Data: Some Initial ReflectonsBig Data: Some Initial Reflectons
Big Data: Some Initial Reflectons
 
Prescottbigdata
PrescottbigdataPrescottbigdata
Prescottbigdata
 
nternational Biodiversity Projects and Natural History Museums: Current stat...
nternational Biodiversity Projects and Natural History Museums:  Current stat...nternational Biodiversity Projects and Natural History Museums:  Current stat...
nternational Biodiversity Projects and Natural History Museums: Current stat...
 
Introduction
IntroductionIntroduction
Introduction
 
Leslie Johnston: Big Data at Libraries, Georgetown University Law School Symp...
Leslie Johnston: Big Data at Libraries, Georgetown University Law School Symp...Leslie Johnston: Big Data at Libraries, Georgetown University Law School Symp...
Leslie Johnston: Big Data at Libraries, Georgetown University Law School Symp...
 
Why Big Data Will Survive the Hype - and Change the Way We Work
Why Big Data Will Survive the Hype - and Change the Way We WorkWhy Big Data Will Survive the Hype - and Change the Way We Work
Why Big Data Will Survive the Hype - and Change the Way We Work
 
Massive Scale Librarianship
Massive Scale LibrarianshipMassive Scale Librarianship
Massive Scale Librarianship
 
digital Preservation
digital Preservationdigital Preservation
digital Preservation
 
"Toward Sustainability: "Margin" and "Mission" in the Natural History Setting...
"Toward Sustainability: "Margin" and "Mission" in the Natural History Setting..."Toward Sustainability: "Margin" and "Mission" in the Natural History Setting...
"Toward Sustainability: "Margin" and "Mission" in the Natural History Setting...
 
Converging on the Universal Library: From Memex to Googolplex
Converging on the Universal Library: From Memex to GoogolplexConverging on the Universal Library: From Memex to Googolplex
Converging on the Universal Library: From Memex to Googolplex
 
World Archaeology Congress paper
World Archaeology Congress paperWorld Archaeology Congress paper
World Archaeology Congress paper
 
Redefining Libraries
Redefining LibrariesRedefining Libraries
Redefining Libraries
 
Digital Research at the British Library, by Stella Wisdom
Digital Research at the British Library, by Stella WisdomDigital Research at the British Library, by Stella Wisdom
Digital Research at the British Library, by Stella Wisdom
 
iSGTW - What it is
iSGTW - What it isiSGTW - What it is
iSGTW - What it is
 
I am Library: an ode to self-discovery and collective creativity in Second Li...
I am Library: an ode to self-discovery and collective creativity in Second Li...I am Library: an ode to self-discovery and collective creativity in Second Li...
I am Library: an ode to self-discovery and collective creativity in Second Li...
 
Digital librarianship - BIALL/CLSIG/SLA Europe Open Day
Digital librarianship - BIALL/CLSIG/SLA Europe Open DayDigital librarianship - BIALL/CLSIG/SLA Europe Open Day
Digital librarianship - BIALL/CLSIG/SLA Europe Open Day
 
April 2012 Wolverine Caucus Event Featuring Paul N. Courant
April 2012 Wolverine Caucus Event Featuring Paul N. CourantApril 2012 Wolverine Caucus Event Featuring Paul N. Courant
April 2012 Wolverine Caucus Event Featuring Paul N. Courant
 
Artistic Practice and The Archive
Artistic Practice and The ArchiveArtistic Practice and The Archive
Artistic Practice and The Archive
 
Following Google: Don’t Follow the Followers, Follow the Leaders
Following Google: Don’t Follow the Followers, Follow the LeadersFollowing Google: Don’t Follow the Followers, Follow the Leaders
Following Google: Don’t Follow the Followers, Follow the Leaders
 
Dutch Book Trade 1660-1750: using the STCN to gain insight in publishers’ str...
Dutch Book Trade 1660-1750: using the STCN to gain insight in publishers’ str...Dutch Book Trade 1660-1750: using the STCN to gain insight in publishers’ str...
Dutch Book Trade 1660-1750: using the STCN to gain insight in publishers’ str...
 

More from Andrew Prescott

Researching Freemasonry in a Time of Coronavirus: Resources and Opportunities
Researching Freemasonry in a Time of Coronavirus: Resources and OpportunitiesResearching Freemasonry in a Time of Coronavirus: Resources and Opportunities
Researching Freemasonry in a Time of Coronavirus: Resources and Opportunities
Andrew Prescott
 
Working with Archives
Working with ArchivesWorking with Archives
Working with Archives
Andrew Prescott
 
Is Search the Right Way?
Is Search the Right Way?Is Search the Right Way?
Is Search the Right Way?
Andrew Prescott
 
Medieval Studies: Some Hopes and Fears for the Future
Medieval Studies: Some Hopes and Fears for the FutureMedieval Studies: Some Hopes and Fears for the Future
Medieval Studies: Some Hopes and Fears for the Future
Andrew Prescott
 
New Modernist Editing meeting
New Modernist Editing meetingNew Modernist Editing meeting
New Modernist Editing meeting
Andrew Prescott
 
What Happens When the Internet of Things Meets the Middle Ages?
What Happens When the Internet of Things Meets the Middle Ages?What Happens When the Internet of Things Meets the Middle Ages?
What Happens When the Internet of Things Meets the Middle Ages?
Andrew Prescott
 
New Materialities of the Book
New Materialities of the BookNew Materialities of the Book
New Materialities of the Book
Andrew Prescott
 
Avoiding the Rear View Mirror
Avoiding the Rear View MirrorAvoiding the Rear View Mirror
Avoiding the Rear View Mirror
Andrew Prescott
 
Challenges in the Digital Humanities
Challenges in the Digital HumanitiesChallenges in the Digital Humanities
Challenges in the Digital Humanities
Andrew Prescott
 
Prescott Emda2015
Prescott Emda2015Prescott Emda2015
Prescott Emda2015
Andrew Prescott
 
Sustainability
SustainabilitySustainability
Sustainability
Andrew Prescott
 
Doing the Digital: How Scholars Learned to Stop Worrying and Love the Computer
Doing the Digital: How Scholars Learned to Stop Worrying and Love the ComputerDoing the Digital: How Scholars Learned to Stop Worrying and Love the Computer
Doing the Digital: How Scholars Learned to Stop Worrying and Love the Computer
Andrew Prescott
 
New Materialities
New MaterialitiesNew Materialities
New Materialities
Andrew Prescott
 
What are the Digital Humanities and what use are they to me?
What are the Digital Humanities and what use are they to me?What are the Digital Humanities and what use are they to me?
What are the Digital Humanities and what use are they to me?
Andrew Prescott
 
The Arts and Humanities in a Digital Age: Disruptions and Continuities
The Arts and Humanities in a Digital Age: Disruptions and ContinuitiesThe Arts and Humanities in a Digital Age: Disruptions and Continuities
The Arts and Humanities in a Digital Age: Disruptions and Continuities
Andrew Prescott
 
Digital Transformations: keynote talk to Listening Experience Database Sympos...
Digital Transformations: keynote talk to Listening Experience Database Sympos...Digital Transformations: keynote talk to Listening Experience Database Sympos...
Digital Transformations: keynote talk to Listening Experience Database Sympos...
Andrew Prescott
 
AHRC Digital Transformations theme: the Story So Far
AHRC Digital Transformations theme: the Story So FarAHRC Digital Transformations theme: the Story So Far
AHRC Digital Transformations theme: the Story So Far
Andrew Prescott
 
Beyond the REF: the Role of Repositories
Beyond the REF: the Role of RepositoriesBeyond the REF: the Role of Repositories
Beyond the REF: the Role of Repositories
Andrew Prescott
 
Electronic Beowulf @ 21
Electronic Beowulf @ 21Electronic Beowulf @ 21
Electronic Beowulf @ 21
Andrew Prescott
 
The Digitisation of Chaucer
The Digitisation of Chaucer The Digitisation of Chaucer
The Digitisation of Chaucer
Andrew Prescott
 

More from Andrew Prescott (20)

Researching Freemasonry in a Time of Coronavirus: Resources and Opportunities
Researching Freemasonry in a Time of Coronavirus: Resources and OpportunitiesResearching Freemasonry in a Time of Coronavirus: Resources and Opportunities
Researching Freemasonry in a Time of Coronavirus: Resources and Opportunities
 
Working with Archives
Working with ArchivesWorking with Archives
Working with Archives
 
Is Search the Right Way?
Is Search the Right Way?Is Search the Right Way?
Is Search the Right Way?
 
Medieval Studies: Some Hopes and Fears for the Future
Medieval Studies: Some Hopes and Fears for the FutureMedieval Studies: Some Hopes and Fears for the Future
Medieval Studies: Some Hopes and Fears for the Future
 
New Modernist Editing meeting
New Modernist Editing meetingNew Modernist Editing meeting
New Modernist Editing meeting
 
What Happens When the Internet of Things Meets the Middle Ages?
What Happens When the Internet of Things Meets the Middle Ages?What Happens When the Internet of Things Meets the Middle Ages?
What Happens When the Internet of Things Meets the Middle Ages?
 
New Materialities of the Book
New Materialities of the BookNew Materialities of the Book
New Materialities of the Book
 
Avoiding the Rear View Mirror
Avoiding the Rear View MirrorAvoiding the Rear View Mirror
Avoiding the Rear View Mirror
 
Challenges in the Digital Humanities
Challenges in the Digital HumanitiesChallenges in the Digital Humanities
Challenges in the Digital Humanities
 
Prescott Emda2015
Prescott Emda2015Prescott Emda2015
Prescott Emda2015
 
Sustainability
SustainabilitySustainability
Sustainability
 
Doing the Digital: How Scholars Learned to Stop Worrying and Love the Computer
Doing the Digital: How Scholars Learned to Stop Worrying and Love the ComputerDoing the Digital: How Scholars Learned to Stop Worrying and Love the Computer
Doing the Digital: How Scholars Learned to Stop Worrying and Love the Computer
 
New Materialities
New MaterialitiesNew Materialities
New Materialities
 
What are the Digital Humanities and what use are they to me?
What are the Digital Humanities and what use are they to me?What are the Digital Humanities and what use are they to me?
What are the Digital Humanities and what use are they to me?
 
The Arts and Humanities in a Digital Age: Disruptions and Continuities
The Arts and Humanities in a Digital Age: Disruptions and ContinuitiesThe Arts and Humanities in a Digital Age: Disruptions and Continuities
The Arts and Humanities in a Digital Age: Disruptions and Continuities
 
Digital Transformations: keynote talk to Listening Experience Database Sympos...
Digital Transformations: keynote talk to Listening Experience Database Sympos...Digital Transformations: keynote talk to Listening Experience Database Sympos...
Digital Transformations: keynote talk to Listening Experience Database Sympos...
 
AHRC Digital Transformations theme: the Story So Far
AHRC Digital Transformations theme: the Story So FarAHRC Digital Transformations theme: the Story So Far
AHRC Digital Transformations theme: the Story So Far
 
Beyond the REF: the Role of Repositories
Beyond the REF: the Role of RepositoriesBeyond the REF: the Role of Repositories
Beyond the REF: the Role of Repositories
 
Electronic Beowulf @ 21
Electronic Beowulf @ 21Electronic Beowulf @ 21
Electronic Beowulf @ 21
 
The Digitisation of Chaucer
The Digitisation of Chaucer The Digitisation of Chaucer
The Digitisation of Chaucer
 


Big Data in the Arts and Humanities

  • 1.
      • It has been calculated that the books in the Library of Congress represent 235 terabytes of data
      • In 1994, at a conference on data management at ULCC, I heard the word ‘petabyte’ for the first time: ‘the problems which confront the meteorologist today will worry the arts and humanities in ten years’ time’
      • A petabyte equals approximately four Libraries of Congress
      • The Large Hadron Collider generates one petabyte of data every second (which would cost $1 trillion a year to store)
      • The Square Kilometre Array will generate 4 petabytes of data a second
      • A new word for me in 2013: zettabyte (equivalent of 250 billion DVDs)
      • By 2020, the digital universe will be 35 zettabytes (44 times as big as in 2009)
  • 2. Is there big data in the arts and humanities?
      Letter of Gladstone to Disraeli, 1878: British Library, Add. MS. 44457, f. 166.
      The political and literary papers of Gladstone preserved in the British Library comprise 762 volumes containing approx. 160,000 documents.
  • 3. George W. Bush Presidential Library: 200 million e-mails; 4 million photographs
  • 6. FamilySearch
      • Genealogy information from around the world
      • 2.8 billion names; 300 million added every year
      • 560 million digital images
      • 10 million hits per day; 7,000 transactions per minute
      • Digitising microfilm to JPEG2000
      • Digitised in Utah, preserved in Maryland
      • Initial storage: 7 PB
      • Ingest rate: 20 TB per day, rising to 50 TB per day
  • 7. Anglo-American Legal Tradition (http://aalt.law.uh.edu) contains over 8,500,000 images of the major series of legal records for medieval and early modern England, but how do we navigate, annotate and share these materials?
  • 8. Tomographic Reconstruction (credit: Mark Basham, Diamond)
      • Raw images: 6,000 images/projections in total, each 2,600 x 4,000 px
      • Sinograms from the projections: 4,000 sinograms in total, each 2,600 x 4,000 x 1 px
      • Reconstructed slices in a 3D volume: 4,000 slices in total
      • ~100 GB per 3D image: ~40 mins on a 16-GPU cluster
      • ~10 TB per experiment: ~3 days on site
      • ~1 PB per year (per beamline)
      • Working on using the Emerald (376 GPUs)
  • 9. Use of Diamond Light Source to image fragile manuscripts; use of Diamond Light Source to investigate silver degradation in altarpieces
  • 10. Laser Scans of St Kilda and Rosslyn Chapel, produced by Digital Design Studio, Glasgow School of Art, as part of the ‘Scottish Ten’ project. Scans such as these produce data sets containing billions of points, amounting to many terabytes. Such data can have great cultural and economic value.
  • 14. Big Data in the Humanities
      • Frequently (but not always) heterogeneous, comprising different layers of information: comparable to climate and environmental data
      • Extremely varied in format, from linguistic corpora to images and video. Navigation and processing are a major issue, even when the material is in digital format
      • Nevertheless, strong common characteristics with big data issues in other disciplines, and scope for shared dialogue
      • One characteristic of ‘big data’ hype is the use of predictive analytics. Can predictive analytics work with some arts and humanities data (e.g. sound)?
      • But data-driven research (as opposed to hypothesis-driven research) is potentially important in arts and humanities
      • Arts and humanities can potentially help in dealing with big data through new interpretative approaches
  • 16. Lise Autogena and Joshua Portway, Most Blue Skies, 2010: http://www.autogena.org/mbs.html
  • 17. Fabio Lattanzi Antinori, The Obelisk (2012): Open Data Institute. Scanning online news websites in real time, the artwork changes from opaque to transparent, according to the quantity of references to the four main crimes against peace (genocide, crimes against humanity, crimes of aggression and crimes of war)
  • 19. Parametric Modeling Quantitatively Maps Single Cell Protein Levels to Individual Qualitative Components
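
Several of the figures quoted in the slides can be cross-checked with simple arithmetic. The sketch below is only a back-of-envelope illustration, not part of the original deck. It assumes decimal units (1 PB = 1,000 TB), which the slides do not specify, and it assumes 2 bytes per pixel for the tomography detector, which is not stated anywhere in the source. The function names are hypothetical.

```python
# Back-of-envelope checks on storage figures quoted in the slides.
# Assumption: decimal units (1 PB = 1000 TB); the slides do not say which
# convention they use.
TB_PER_PB = 1000


def libraries_of_congress_per_petabyte(loc_tb: float = 235) -> float:
    """How many 235 TB 'Libraries of Congress' fit in one petabyte."""
    return TB_PER_PB / loc_tb


def days_to_ingest(total_pb: float, rate_tb_per_day: float) -> float:
    """Days needed to ingest total_pb petabytes at a constant daily rate."""
    return total_pb * TB_PER_PB / rate_tb_per_day


def raw_projection_bytes(n_images: int = 6000, width: int = 2600,
                         height: int = 4000, bytes_per_pixel: int = 2) -> int:
    """Size of the raw tomography projections in bytes.

    bytes_per_pixel=2 (a 16-bit detector) is an assumption, not a figure
    from the slides.
    """
    return n_images * width * height * bytes_per_pixel


if __name__ == "__main__":
    # Slide 1: "a petabyte equals approximately four Library of Congresses"
    print(round(libraries_of_congress_per_petabyte(), 2))  # 4.26
    # Slide 6: time to fill the initial 7 PB at 20 TB/day and at 50 TB/day
    print(days_to_ingest(7, 20))  # 350.0
    print(days_to_ingest(7, 50))  # 140.0
    # Slide 8: raw projections in GB under the 16-bit assumption
    print(raw_projection_bytes() / 1e9)  # 124.8
```

Under these assumptions the raw projections come to roughly 125 GB, the same order of magnitude as the "~100 GB per 3D image" quoted on slide 8, and FamilySearch's initial 7 PB corresponds to a little under a year of ingest at the starting rate.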