SlideShare a Scribd company logo
Breakout #3:
Historical Research
Ian Milligan (in the hot seat today), Stephen Robertson,
Thomas Risse, Kris Carpenter Negulescu, Pamela
Graham, Niels Brügger, Ivana Marenzi, Katie Kang (thanks
for the notes!), Ed Fox, Meghan Dougherty, Matt Connolley
What we did? (I think)
• Tackled some big problems/questions
• Explored what historians and others might want to
do with this material
• Thought about community building..
• And came up with a tangible path forward!
First big problem we
discussed about
historians..
Building Interest?
• Eric Meyer: rooms were “thin on the ground”
• Internet Archive needs to be viewed from many
different perspectives
• CS doing their way, historians their own, Internet
researchers their own, etc.
• Historians are interested, but get FRUSTRATED
quickly when they realize how hard their tasks are
Building Interest?
• Problem is that historians trained in a particular
way, but this offers a new way (the shift from
scarcity to abundance)
• And really important to emphasize the
QUESTIONS that we have. Can’t just study the
web, have to have reasons to do so.
First big question
• Should we create new research corpuses, or use
existing ones?!
• Should we make research corpuses ourselves?
• What moment are we at now?
Second problem
• Technical ones
• Historians are interested, but too hard to answer
questions because we don’t have enough
technical ability, resources, etc., to do it this way.
• How to effect collaboration? (apparently being
in a room together helps)!
• How can we contextualize documents? How do
we avoid viewing them as decontextualized?
Distant/Close
• Ideal is a way to move between distant and close,
but not always there (though some awesome
examples today!)
• Discussion about whether historians will even care
about domain-level studies
• So much emphasis on network studies, etc., which
may be of limited utility to humanists like historians
(or at the very least, need to be complemented by
more qualitative forms of research)
Too much information, so
what can we do?
Lots of other problems
• How do we deal with the big bang of electronic
records (starting in the 1970s)?
• How much data is there (we could compare Google
index/Altavista etc) to come here?
• What about emulation when it comes to digital
preservation - using old browsers, etc.? Should we
use virtual machines?
Community Building
• IIPC has no overlap with AoIR
• Live web scholars don’t deal with archived web
scholars, no community!
• How could we build a better community?
• Are people seeing themselves reflected in these
communities?
How quickly can this
happen?
• Will it take generations?
• Or will it come quicker?
Final Outcome
• Could we have a history of the 1990s without using
web archives?!
• How could we harness web archives to tell the story
of 1995 through 2005? First decade of material?
• Or just 1995-2000 due to manageability of this dataset.!
• Sort of a sweet spot - manageable data - before the
rise of closed off social media, etc. A good way to sell
the utility of these archives!
Idea
• Reach out to other datasets and archives
• Pick data from 1995-2000 from Internet Archive
(~20TB) and build tools, ask questions, etc.!
• Look at different strands (.gov data), GeoCities,
news data, e-mails, digital records, etc.
• Cool factoid: Herman Miller invented the idea of
‘cube farm,’ if you do term search you can see it
rise with the early web.
Idea
• Some name ideas:
• History 2.0
• History in the Digital Age
• Polarization
• End of Millennium (1995-2000) Historical Studies
• End of Millennium (1995-2000) Studies
NIST (National Institute of Standards and
Technology) TREC (Text Retrieval
Conference) TRAK Idea
• Sources for the following:
• IA collections
• BYU, Corpus of Contemporary American English,
wide range of digitized resources
• Related collections from UK, Germany
NIST TREC Track for End of
Millennium (1995-2000) Studies
• To be proposed and led by Jimmy Lin
• IA and others provide <30TB of content
• Historians and provide an initial set of questions and
tasks (e.g., spread of polarization), for which they have
ground truth, from other sources
• Researchers materialize systems that can handle those
• Final competition: historians pose new questions and
tasks and research groups submit results for analysis

More Related Content

What's hot

The Technology perspective- Staffan Truvé, Recorded Future
The Technology perspective- Staffan Truvé, Recorded Future The Technology perspective- Staffan Truvé, Recorded Future
The Technology perspective- Staffan Truvé, Recorded Future
LIBER Europe
 
The World of Digital Humanities : Digital Humanities in the World
The World of Digital Humanities : Digital Humanities in the WorldThe World of Digital Humanities : Digital Humanities in the World
The World of Digital Humanities : Digital Humanities in the World
Edward Vanhoutte
 
Analytic Journalism: Digital Evolution in the Datasphere
Analytic Journalism: Digital Evolution in the DatasphereAnalytic Journalism: Digital Evolution in the Datasphere
Analytic Journalism: Digital Evolution in the Datasphere
J T "Tom" Johnson
 
COMPLETE GUIDE ON WRITING A PROFICIENT ESSAY ON THE HISTORY OF THE INTERNET
COMPLETE GUIDE ON WRITING A PROFICIENT ESSAY ON  THE HISTORY OF THE INTERNETCOMPLETE GUIDE ON WRITING A PROFICIENT ESSAY ON  THE HISTORY OF THE INTERNET
COMPLETE GUIDE ON WRITING A PROFICIENT ESSAY ON THE HISTORY OF THE INTERNET
Lauren Bradshaw
 
Timeline history of internet
Timeline history of internetTimeline history of internet
Timeline history of internet
Neil Bulaong
 
Digital Humanities Workshop
Digital Humanities WorkshopDigital Humanities Workshop
Evolution of educational technology
Evolution of educational technologyEvolution of educational technology
Evolution of educational technology
sajanazoya
 
2011 U of Indiana - Ethics and Collecting Social Media: Twitter and the Libra...
2011 U of Indiana - Ethics and Collecting Social Media: Twitter and the Libra...2011 U of Indiana - Ethics and Collecting Social Media: Twitter and the Libra...
2011 U of Indiana - Ethics and Collecting Social Media: Twitter and the Libra...
Todd Suomela
 
The History of the Internet
The History of the InternetThe History of the Internet
The History of the Internet
Natashagregory1
 
Technology & communication
Technology & communicationTechnology & communication
Technology & communication
Josh Koodray
 
Tim Berners Lee
Tim Berners LeeTim Berners Lee
Tim Berners Lee
Burcu Durmuşoğlu
 
International Image Interoperability Framework panel at #CIDOC2017 conference
International Image Interoperability Framework panel at #CIDOC2017 conferenceInternational Image Interoperability Framework panel at #CIDOC2017 conference
International Image Interoperability Framework panel at #CIDOC2017 conference
Emmanuelle Delmas-Glass
 
History of Internet
History of InternetHistory of Internet
History of Internet
izelmay
 
History of-educational-technology-timeline
History of-educational-technology-timelineHistory of-educational-technology-timeline
History of-educational-technology-timeline
Tics Umg
 
History of educational technology
History of educational technologyHistory of educational technology
History of educational technology
Meesha Ahansal
 
A history of education technology
A history of education technologyA history of education technology
A history of education technology
April Gealene Alera
 
Developing Digital Literacies
Developing Digital LiteraciesDeveloping Digital Literacies
Developing Digital Literacies
JamesDiffin
 
Cte216 slides
Cte216 slidesCte216 slides
Cte216 slides
Idris Abdulhameed
 
373.1standard
373.1standard373.1standard
373.1standard
eheseman
 
Introduction to digital scholarship tools
Introduction to digital scholarship toolsIntroduction to digital scholarship tools
Introduction to digital scholarship tools
librarianrafia
 

What's hot (20)

The Technology perspective- Staffan Truvé, Recorded Future
The Technology perspective- Staffan Truvé, Recorded Future The Technology perspective- Staffan Truvé, Recorded Future
The Technology perspective- Staffan Truvé, Recorded Future
 
The World of Digital Humanities : Digital Humanities in the World
The World of Digital Humanities : Digital Humanities in the WorldThe World of Digital Humanities : Digital Humanities in the World
The World of Digital Humanities : Digital Humanities in the World
 
Analytic Journalism: Digital Evolution in the Datasphere
Analytic Journalism: Digital Evolution in the DatasphereAnalytic Journalism: Digital Evolution in the Datasphere
Analytic Journalism: Digital Evolution in the Datasphere
 
COMPLETE GUIDE ON WRITING A PROFICIENT ESSAY ON THE HISTORY OF THE INTERNET
COMPLETE GUIDE ON WRITING A PROFICIENT ESSAY ON  THE HISTORY OF THE INTERNETCOMPLETE GUIDE ON WRITING A PROFICIENT ESSAY ON  THE HISTORY OF THE INTERNET
COMPLETE GUIDE ON WRITING A PROFICIENT ESSAY ON THE HISTORY OF THE INTERNET
 
Timeline history of internet
Timeline history of internetTimeline history of internet
Timeline history of internet
 
Digital Humanities Workshop
Digital Humanities WorkshopDigital Humanities Workshop
Digital Humanities Workshop
 
Evolution of educational technology
Evolution of educational technologyEvolution of educational technology
Evolution of educational technology
 
2011 U of Indiana - Ethics and Collecting Social Media: Twitter and the Libra...
2011 U of Indiana - Ethics and Collecting Social Media: Twitter and the Libra...2011 U of Indiana - Ethics and Collecting Social Media: Twitter and the Libra...
2011 U of Indiana - Ethics and Collecting Social Media: Twitter and the Libra...
 
The History of the Internet
The History of the InternetThe History of the Internet
The History of the Internet
 
Technology & communication
Technology & communicationTechnology & communication
Technology & communication
 
Tim Berners Lee
Tim Berners LeeTim Berners Lee
Tim Berners Lee
 
International Image Interoperability Framework panel at #CIDOC2017 conference
International Image Interoperability Framework panel at #CIDOC2017 conferenceInternational Image Interoperability Framework panel at #CIDOC2017 conference
International Image Interoperability Framework panel at #CIDOC2017 conference
 
History of Internet
History of InternetHistory of Internet
History of Internet
 
History of-educational-technology-timeline
History of-educational-technology-timelineHistory of-educational-technology-timeline
History of-educational-technology-timeline
 
History of educational technology
History of educational technologyHistory of educational technology
History of educational technology
 
A history of education technology
A history of education technologyA history of education technology
A history of education technology
 
Developing Digital Literacies
Developing Digital LiteraciesDeveloping Digital Literacies
Developing Digital Literacies
 
Cte216 slides
Cte216 slidesCte216 slides
Cte216 slides
 
373.1standard
373.1standard373.1standard
373.1standard
 
Introduction to digital scholarship tools
Introduction to digital scholarship toolsIntroduction to digital scholarship tools
Introduction to digital scholarship tools
 

Viewers also liked

How To Stop Nervousness In 3 Simple Steps
How To Stop Nervousness In 3 Simple StepsHow To Stop Nervousness In 3 Simple Steps
How To Stop Nervousness In 3 Simple Steps
Michael Lee
 
motivation
motivationmotivation
motivation
simran wadhwani
 
Overcoming and managing nervousness by Dr.Shazia Zamir
Overcoming and managing nervousness by Dr.Shazia ZamirOvercoming and managing nervousness by Dr.Shazia Zamir
Overcoming and managing nervousness by Dr.Shazia Zamir
Dr.Shazia Zamir
 
Interviewing Skills PowerPoint
Interviewing Skills PowerPointInterviewing Skills PowerPoint
Interviewing Skills PowerPoint
emurfield
 
Interview skills Presentation
Interview skills PresentationInterview skills Presentation
Interview skills Presentation
Vikram Kerkar
 
Interviewing PPT
Interviewing PPTInterviewing PPT
Interviewing PPT
Eric Machan Howd
 

Viewers also liked (6)

How To Stop Nervousness In 3 Simple Steps
How To Stop Nervousness In 3 Simple StepsHow To Stop Nervousness In 3 Simple Steps
How To Stop Nervousness In 3 Simple Steps
 
motivation
motivationmotivation
motivation
 
Overcoming and managing nervousness by Dr.Shazia Zamir
Overcoming and managing nervousness by Dr.Shazia ZamirOvercoming and managing nervousness by Dr.Shazia Zamir
Overcoming and managing nervousness by Dr.Shazia Zamir
 
Interviewing Skills PowerPoint
Interviewing Skills PowerPointInterviewing Skills PowerPoint
Interviewing Skills PowerPoint
 
Interview skills Presentation
Interview skills PresentationInterview skills Presentation
Interview skills Presentation
 
Interviewing PPT
Interviewing PPTInterviewing PPT
Interviewing PPT
 

Similar to Historical Research Breakout Session Notes, WIRE 2014

The Challenge of Digital Sources in the Web Age: Common Tensions Across Three...
The Challenge of Digital Sources in the Web Age: Common Tensions Across Three...The Challenge of Digital Sources in the Web Age: Common Tensions Across Three...
The Challenge of Digital Sources in the Web Age: Common Tensions Across Three...
Digital History
 
Beyond document retrieval using semantic annotations
Beyond document retrieval using semantic annotations Beyond document retrieval using semantic annotations
Beyond document retrieval using semantic annotations
Roi Blanco
 
Miscellaneous Info: The Digital Past, Present, Future
Miscellaneous Info: The Digital Past, Present, FutureMiscellaneous Info: The Digital Past, Present, Future
Miscellaneous Info: The Digital Past, Present, Future
Lee Cafferata
 
UVA MDST 3703 Thematic Research Collections 2012-09-18
UVA MDST 3703 Thematic Research Collections 2012-09-18UVA MDST 3703 Thematic Research Collections 2012-09-18
UVA MDST 3703 Thematic Research Collections 2012-09-18
Rafael Alvarado
 
Prescottimperialbigdata
PrescottimperialbigdataPrescottimperialbigdata
Prescottimperialbigdata
Andrew Prescott
 
Balbi_Keynote_AarhusWARCnet.pptx
Balbi_Keynote_AarhusWARCnet.pptxBalbi_Keynote_AarhusWARCnet.pptx
Balbi_Keynote_AarhusWARCnet.pptx
WARCnet
 
Chapter_1_Gift_of_Fire(6).ppt
Chapter_1_Gift_of_Fire(6).pptChapter_1_Gift_of_Fire(6).ppt
Chapter_1_Gift_of_Fire(6).ppt
daniloalbay1
 
Education2.0 or Economy 3.0
Education2.0 or Economy 3.0 Education2.0 or Economy 3.0
Education2.0 or Economy 3.0
London Knowledge Lab
 
Digital contemporary history: sources, tools, methods, issues
Digital contemporary history: sources, tools, methods, issuesDigital contemporary history: sources, tools, methods, issues
Digital contemporary history: sources, tools, methods, issues
Peter Webster
 
Digital contemporary history: sources, tools, methods, issues
Digital contemporary history: sources, tools, methods, issuesDigital contemporary history: sources, tools, methods, issues
Digital contemporary history: sources, tools, methods, issues
Peter Webster
 
Digital pedagogy in the Humanities: Models, Keywords, Prototypes
Digital pedagogy in the Humanities: Models, Keywords, PrototypesDigital pedagogy in the Humanities: Models, Keywords, Prototypes
Digital pedagogy in the Humanities: Models, Keywords, Prototypes
Rebecca Davis
 
How databases learn - iconference 2014
How databases learn - iconference 2014How databases learn - iconference 2014
How databases learn - iconference 2014
andrea thomer
 
Ecdl2004
Ecdl2004Ecdl2004
Ecdl2004
madhuvardhan
 
ChangePresentationDL2006
ChangePresentationDL2006ChangePresentationDL2006
ChangePresentationDL2006
Greg Thweatt
 
Whose Archives? Reflections on ethics and the cultural significance of web ar...
Whose Archives? Reflections on ethics and the cultural significance of web ar...Whose Archives? Reflections on ethics and the cultural significance of web ar...
Whose Archives? Reflections on ethics and the cultural significance of web ar...
WARCnet
 
Research Transformed by Cyberinfrastructure: Two Possible Scenarios for the...
Research Transformed by Cyberinfrastructure:   Two Possible Scenarios for the...Research Transformed by Cyberinfrastructure:   Two Possible Scenarios for the...
Research Transformed by Cyberinfrastructure: Two Possible Scenarios for the...
Cybera Inc.
 
Intro to Digital Humanities Workshop
Intro to Digital Humanities WorkshopIntro to Digital Humanities Workshop
Intro to Digital Humanities Workshop
Deb Boyer
 
Anchorage public focus group web version
Anchorage public focus group   web versionAnchorage public focus group   web version
Anchorage public focus group web version
Carson Block
 
Summary Day 2
Summary Day 2Summary Day 2
Summary Day 2
Anita de Waard
 
Information symposium
Information symposiumInformation symposium
Information symposium
Chandra Altoff
 

Similar to Historical Research Breakout Session Notes, WIRE 2014 (20)

The Challenge of Digital Sources in the Web Age: Common Tensions Across Three...
The Challenge of Digital Sources in the Web Age: Common Tensions Across Three...The Challenge of Digital Sources in the Web Age: Common Tensions Across Three...
The Challenge of Digital Sources in the Web Age: Common Tensions Across Three...
 
Beyond document retrieval using semantic annotations
Beyond document retrieval using semantic annotations Beyond document retrieval using semantic annotations
Beyond document retrieval using semantic annotations
 
Miscellaneous Info: The Digital Past, Present, Future
Miscellaneous Info: The Digital Past, Present, FutureMiscellaneous Info: The Digital Past, Present, Future
Miscellaneous Info: The Digital Past, Present, Future
 
UVA MDST 3703 Thematic Research Collections 2012-09-18
UVA MDST 3703 Thematic Research Collections 2012-09-18UVA MDST 3703 Thematic Research Collections 2012-09-18
UVA MDST 3703 Thematic Research Collections 2012-09-18
 
Prescottimperialbigdata
PrescottimperialbigdataPrescottimperialbigdata
Prescottimperialbigdata
 
Balbi_Keynote_AarhusWARCnet.pptx
Balbi_Keynote_AarhusWARCnet.pptxBalbi_Keynote_AarhusWARCnet.pptx
Balbi_Keynote_AarhusWARCnet.pptx
 
Chapter_1_Gift_of_Fire(6).ppt
Chapter_1_Gift_of_Fire(6).pptChapter_1_Gift_of_Fire(6).ppt
Chapter_1_Gift_of_Fire(6).ppt
 
Education2.0 or Economy 3.0
Education2.0 or Economy 3.0 Education2.0 or Economy 3.0
Education2.0 or Economy 3.0
 
Digital contemporary history: sources, tools, methods, issues
Digital contemporary history: sources, tools, methods, issuesDigital contemporary history: sources, tools, methods, issues
Digital contemporary history: sources, tools, methods, issues
 
Digital contemporary history: sources, tools, methods, issues
Digital contemporary history: sources, tools, methods, issuesDigital contemporary history: sources, tools, methods, issues
Digital contemporary history: sources, tools, methods, issues
 
Digital pedagogy in the Humanities: Models, Keywords, Prototypes
Digital pedagogy in the Humanities: Models, Keywords, PrototypesDigital pedagogy in the Humanities: Models, Keywords, Prototypes
Digital pedagogy in the Humanities: Models, Keywords, Prototypes
 
How databases learn - iconference 2014
How databases learn - iconference 2014How databases learn - iconference 2014
How databases learn - iconference 2014
 
Ecdl2004
Ecdl2004Ecdl2004
Ecdl2004
 
ChangePresentationDL2006
ChangePresentationDL2006ChangePresentationDL2006
ChangePresentationDL2006
 
Whose Archives? Reflections on ethics and the cultural significance of web ar...
Whose Archives? Reflections on ethics and the cultural significance of web ar...Whose Archives? Reflections on ethics and the cultural significance of web ar...
Whose Archives? Reflections on ethics and the cultural significance of web ar...
 
Research Transformed by Cyberinfrastructure: Two Possible Scenarios for the...
Research Transformed by Cyberinfrastructure:   Two Possible Scenarios for the...Research Transformed by Cyberinfrastructure:   Two Possible Scenarios for the...
Research Transformed by Cyberinfrastructure: Two Possible Scenarios for the...
 
Intro to Digital Humanities Workshop
Intro to Digital Humanities WorkshopIntro to Digital Humanities Workshop
Intro to Digital Humanities Workshop
 
Anchorage public focus group web version
Anchorage public focus group   web versionAnchorage public focus group   web version
Anchorage public focus group web version
 
Summary Day 2
Summary Day 2Summary Day 2
Summary Day 2
 
Information symposium
Information symposiumInformation symposium
Information symposium
 

More from Ian Milligan

Welcome to the GeoHood: Using the GeoCities Web Archive to Explore Virtual Co...
Welcome to the GeoHood: Using the GeoCities Web Archive to Explore Virtual Co...Welcome to the GeoHood: Using the GeoCities Web Archive to Explore Virtual Co...
Welcome to the GeoHood: Using the GeoCities Web Archive to Explore Virtual Co...
Ian Milligan
 
Making Sense of Abundance: Opportunity and Challenges Across Three Web Archiv...
Making Sense of Abundance: Opportunity and Challenges Across Three Web Archiv...Making Sense of Abundance: Opportunity and Challenges Across Three Web Archiv...
Making Sense of Abundance: Opportunity and Challenges Across Three Web Archiv...
Ian Milligan
 
Warcbase Building a Scalable Platform on HBase and Hadoop - Part Two: Histori...
Warcbase Building a Scalable Platform on HBase and Hadoop - Part Two: Histori...Warcbase Building a Scalable Platform on HBase and Hadoop - Part Two: Histori...
Warcbase Building a Scalable Platform on HBase and Hadoop - Part Two: Histori...
Ian Milligan
 
Warcbase: Building a Scalable Platform on HBase and Hadoop - Part Two, Histor...
Warcbase: Building a Scalable Platform on HBase and Hadoop - Part Two, Histor...Warcbase: Building a Scalable Platform on HBase and Hadoop - Part Two, Histor...
Warcbase: Building a Scalable Platform on HBase and Hadoop - Part Two, Histor...
Ian Milligan
 
Congress text-mining-event
Congress text-mining-eventCongress text-mining-event
Congress text-mining-event
Ian Milligan
 
WARCs, WATs, and wgets: Opportunity and Challenge for a Historian Amongst Thr...
WARCs, WATs, and wgets: Opportunity and Challenge for a Historian Amongst Thr...WARCs, WATs, and wgets: Opportunity and Challenge for a Historian Amongst Thr...
WARCs, WATs, and wgets: Opportunity and Challenge for a Historian Amongst Thr...
Ian Milligan
 
Clustering Search to Navigate A Case Study of the Canadian World Wide Web as ...
Clustering Search to Navigate A Case Study of the Canadian World Wide Web as ...Clustering Search to Navigate A Case Study of the Canadian World Wide Web as ...
Clustering Search to Navigate A Case Study of the Canadian World Wide Web as ...
Ian Milligan
 
Ruest and Milligan - The Great WARC Adventure
Ruest and Milligan - The Great WARC AdventureRuest and Milligan - The Great WARC Adventure
Ruest and Milligan - The Great WARC Adventure
Ian Milligan
 
International Internet Preservation Consortium Research Slides from Ian Milligan
International Internet Preservation Consortium Research Slides from Ian MilliganInternational Internet Preservation Consortium Research Slides from Ian Milligan
International Internet Preservation Consortium Research Slides from Ian Milligan
Ian Milligan
 

More from Ian Milligan (9)

Welcome to the GeoHood: Using the GeoCities Web Archive to Explore Virtual Co...
Welcome to the GeoHood: Using the GeoCities Web Archive to Explore Virtual Co...Welcome to the GeoHood: Using the GeoCities Web Archive to Explore Virtual Co...
Welcome to the GeoHood: Using the GeoCities Web Archive to Explore Virtual Co...
 
Making Sense of Abundance: Opportunity and Challenges Across Three Web Archiv...
Making Sense of Abundance: Opportunity and Challenges Across Three Web Archiv...Making Sense of Abundance: Opportunity and Challenges Across Three Web Archiv...
Making Sense of Abundance: Opportunity and Challenges Across Three Web Archiv...
 
Warcbase Building a Scalable Platform on HBase and Hadoop - Part Two: Histori...
Warcbase Building a Scalable Platform on HBase and Hadoop - Part Two: Histori...Warcbase Building a Scalable Platform on HBase and Hadoop - Part Two: Histori...
Warcbase Building a Scalable Platform on HBase and Hadoop - Part Two: Histori...
 
Warcbase: Building a Scalable Platform on HBase and Hadoop - Part Two, Histor...
Warcbase: Building a Scalable Platform on HBase and Hadoop - Part Two, Histor...Warcbase: Building a Scalable Platform on HBase and Hadoop - Part Two, Histor...
Warcbase: Building a Scalable Platform on HBase and Hadoop - Part Two, Histor...
 
Congress text-mining-event
Congress text-mining-eventCongress text-mining-event
Congress text-mining-event
 
WARCs, WATs, and wgets: Opportunity and Challenge for a Historian Amongst Thr...
WARCs, WATs, and wgets: Opportunity and Challenge for a Historian Amongst Thr...WARCs, WATs, and wgets: Opportunity and Challenge for a Historian Amongst Thr...
WARCs, WATs, and wgets: Opportunity and Challenge for a Historian Amongst Thr...
 
Clustering Search to Navigate A Case Study of the Canadian World Wide Web as ...
Clustering Search to Navigate A Case Study of the Canadian World Wide Web as ...Clustering Search to Navigate A Case Study of the Canadian World Wide Web as ...
Clustering Search to Navigate A Case Study of the Canadian World Wide Web as ...
 
Ruest and Milligan - The Great WARC Adventure
Ruest and Milligan - The Great WARC AdventureRuest and Milligan - The Great WARC Adventure
Ruest and Milligan - The Great WARC Adventure
 
International Internet Preservation Consortium Research Slides from Ian Milligan
International Internet Preservation Consortium Research Slides from Ian MilliganInternational Internet Preservation Consortium Research Slides from Ian Milligan
International Internet Preservation Consortium Research Slides from Ian Milligan
 

Recently uploaded

Understanding User Behavior with Google Analytics.pdf
Understanding User Behavior with Google Analytics.pdfUnderstanding User Behavior with Google Analytics.pdf
Understanding User Behavior with Google Analytics.pdf
SEO Article Boost
 
Internet of Things in Manufacturing: Revolutionizing Efficiency & Quality | C...
Internet of Things in Manufacturing: Revolutionizing Efficiency & Quality | C...Internet of Things in Manufacturing: Revolutionizing Efficiency & Quality | C...
Internet of Things in Manufacturing: Revolutionizing Efficiency & Quality | C...
CIOWomenMagazine
 
留学挂科(UofM毕业证)明尼苏达大学毕业证成绩单复刻办理
留学挂科(UofM毕业证)明尼苏达大学毕业证成绩单复刻办理留学挂科(UofM毕业证)明尼苏达大学毕业证成绩单复刻办理
留学挂科(UofM毕业证)明尼苏达大学毕业证成绩单复刻办理
uehowe
 
一比一原版(CSU毕业证)加利福尼亚州立大学毕业证成绩单专业办理
一比一原版(CSU毕业证)加利福尼亚州立大学毕业证成绩单专业办理一比一原版(CSU毕业证)加利福尼亚州立大学毕业证成绩单专业办理
一比一原版(CSU毕业证)加利福尼亚州立大学毕业证成绩单专业办理
ufdana
 
Discover the benefits of outsourcing SEO to India
Discover the benefits of outsourcing SEO to IndiaDiscover the benefits of outsourcing SEO to India
Discover the benefits of outsourcing SEO to India
davidjhones387
 
办理新西兰奥克兰大学毕业证学位证书范本原版一模一样
办理新西兰奥克兰大学毕业证学位证书范本原版一模一样办理新西兰奥克兰大学毕业证学位证书范本原版一模一样
办理新西兰奥克兰大学毕业证学位证书范本原版一模一样
xjq03c34
 
制作毕业证书(ANU毕业证)莫纳什大学毕业证成绩单官方原版办理
制作毕业证书(ANU毕业证)莫纳什大学毕业证成绩单官方原版办理制作毕业证书(ANU毕业证)莫纳什大学毕业证成绩单官方原版办理
制作毕业证书(ANU毕业证)莫纳什大学毕业证成绩单官方原版办理
cuobya
 
成绩单ps(UST毕业证)圣托马斯大学毕业证成绩单快速办理
成绩单ps(UST毕业证)圣托马斯大学毕业证成绩单快速办理成绩单ps(UST毕业证)圣托马斯大学毕业证成绩单快速办理
成绩单ps(UST毕业证)圣托马斯大学毕业证成绩单快速办理
ysasp1
 
一比一原版(LBS毕业证)伦敦商学院毕业证成绩单专业办理
一比一原版(LBS毕业证)伦敦商学院毕业证成绩单专业办理一比一原版(LBS毕业证)伦敦商学院毕业证成绩单专业办理
一比一原版(LBS毕业证)伦敦商学院毕业证成绩单专业办理
eutxy
 
Gen Z and the marketplaces - let's translate their needs
Gen Z and the marketplaces - let's translate their needsGen Z and the marketplaces - let's translate their needs
Gen Z and the marketplaces - let's translate their needs
Laura Szabó
 
一比一原版(SLU毕业证)圣路易斯大学毕业证成绩单专业办理
一比一原版(SLU毕业证)圣路易斯大学毕业证成绩单专业办理一比一原版(SLU毕业证)圣路易斯大学毕业证成绩单专业办理
一比一原版(SLU毕业证)圣路易斯大学毕业证成绩单专业办理
keoku
 
7 Best Cloud Hosting Services to Try Out in 2024
7 Best Cloud Hosting Services to Try Out in 20247 Best Cloud Hosting Services to Try Out in 2024
7 Best Cloud Hosting Services to Try Out in 2024
Danica Gill
 
存档可查的(USC毕业证)南加利福尼亚大学毕业证成绩单制做办理
存档可查的(USC毕业证)南加利福尼亚大学毕业证成绩单制做办理存档可查的(USC毕业证)南加利福尼亚大学毕业证成绩单制做办理
存档可查的(USC毕业证)南加利福尼亚大学毕业证成绩单制做办理
fovkoyb
 
可查真实(Monash毕业证)西澳大学毕业证成绩单退学买
可查真实(Monash毕业证)西澳大学毕业证成绩单退学买可查真实(Monash毕业证)西澳大学毕业证成绩单退学买
可查真实(Monash毕业证)西澳大学毕业证成绩单退学买
cuobya
 
办理毕业证(UPenn毕业证)宾夕法尼亚大学毕业证成绩单快速办理
办理毕业证(UPenn毕业证)宾夕法尼亚大学毕业证成绩单快速办理办理毕业证(UPenn毕业证)宾夕法尼亚大学毕业证成绩单快速办理
办理毕业证(UPenn毕业证)宾夕法尼亚大学毕业证成绩单快速办理
uehowe
 
Ready to Unlock the Power of Blockchain!
Ready to Unlock the Power of Blockchain!Ready to Unlock the Power of Blockchain!
Ready to Unlock the Power of Blockchain!
Toptal Tech
 
manuaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaal
manuaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaalmanuaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaal
manuaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaal
wolfsoftcompanyco
 
Should Repositories Participate in the Fediverse?
Should Repositories Participate in the Fediverse?Should Repositories Participate in the Fediverse?
Should Repositories Participate in the Fediverse?
Paul Walk
 
学位认证网(DU毕业证)迪肯大学毕业证成绩单一比一原版制作
学位认证网(DU毕业证)迪肯大学毕业证成绩单一比一原版制作学位认证网(DU毕业证)迪肯大学毕业证成绩单一比一原版制作
学位认证网(DU毕业证)迪肯大学毕业证成绩单一比一原版制作
zyfovom
 
Bridging the Digital Gap Brad Spiegel Macon, GA Initiative.pptx
Bridging the Digital Gap Brad Spiegel Macon, GA Initiative.pptxBridging the Digital Gap Brad Spiegel Macon, GA Initiative.pptx
Bridging the Digital Gap Brad Spiegel Macon, GA Initiative.pptx
Brad Spiegel Macon GA
 

Recently uploaded (20)

Understanding User Behavior with Google Analytics.pdf
Understanding User Behavior with Google Analytics.pdfUnderstanding User Behavior with Google Analytics.pdf
Understanding User Behavior with Google Analytics.pdf
 
Internet of Things in Manufacturing: Revolutionizing Efficiency & Quality | C...
Internet of Things in Manufacturing: Revolutionizing Efficiency & Quality | C...Internet of Things in Manufacturing: Revolutionizing Efficiency & Quality | C...
Internet of Things in Manufacturing: Revolutionizing Efficiency & Quality | C...
 
留学挂科(UofM毕业证)明尼苏达大学毕业证成绩单复刻办理
留学挂科(UofM毕业证)明尼苏达大学毕业证成绩单复刻办理留学挂科(UofM毕业证)明尼苏达大学毕业证成绩单复刻办理
留学挂科(UofM毕业证)明尼苏达大学毕业证成绩单复刻办理
 
一比一原版(CSU毕业证)加利福尼亚州立大学毕业证成绩单专业办理
一比一原版(CSU毕业证)加利福尼亚州立大学毕业证成绩单专业办理一比一原版(CSU毕业证)加利福尼亚州立大学毕业证成绩单专业办理
一比一原版(CSU毕业证)加利福尼亚州立大学毕业证成绩单专业办理
 
Discover the benefits of outsourcing SEO to India
Discover the benefits of outsourcing SEO to IndiaDiscover the benefits of outsourcing SEO to India
Discover the benefits of outsourcing SEO to India
 
办理新西兰奥克兰大学毕业证学位证书范本原版一模一样
办理新西兰奥克兰大学毕业证学位证书范本原版一模一样办理新西兰奥克兰大学毕业证学位证书范本原版一模一样
办理新西兰奥克兰大学毕业证学位证书范本原版一模一样
 
制作毕业证书(ANU毕业证)莫纳什大学毕业证成绩单官方原版办理
制作毕业证书(ANU毕业证)莫纳什大学毕业证成绩单官方原版办理制作毕业证书(ANU毕业证)莫纳什大学毕业证成绩单官方原版办理
制作毕业证书(ANU毕业证)莫纳什大学毕业证成绩单官方原版办理
 
成绩单ps(UST毕业证)圣托马斯大学毕业证成绩单快速办理
成绩单ps(UST毕业证)圣托马斯大学毕业证成绩单快速办理成绩单ps(UST毕业证)圣托马斯大学毕业证成绩单快速办理
成绩单ps(UST毕业证)圣托马斯大学毕业证成绩单快速办理
 
一比一原版(LBS毕业证)伦敦商学院毕业证成绩单专业办理
一比一原版(LBS毕业证)伦敦商学院毕业证成绩单专业办理一比一原版(LBS毕业证)伦敦商学院毕业证成绩单专业办理
一比一原版(LBS毕业证)伦敦商学院毕业证成绩单专业办理
 
Gen Z and the marketplaces - let's translate their needs
Gen Z and the marketplaces - let's translate their needsGen Z and the marketplaces - let's translate their needs
Gen Z and the marketplaces - let's translate their needs
 
一比一原版(SLU毕业证)圣路易斯大学毕业证成绩单专业办理
一比一原版(SLU毕业证)圣路易斯大学毕业证成绩单专业办理一比一原版(SLU毕业证)圣路易斯大学毕业证成绩单专业办理
一比一原版(SLU毕业证)圣路易斯大学毕业证成绩单专业办理
 
7 Best Cloud Hosting Services to Try Out in 2024
7 Best Cloud Hosting Services to Try Out in 20247 Best Cloud Hosting Services to Try Out in 2024
7 Best Cloud Hosting Services to Try Out in 2024
 
存档可查的(USC毕业证)南加利福尼亚大学毕业证成绩单制做办理
存档可查的(USC毕业证)南加利福尼亚大学毕业证成绩单制做办理存档可查的(USC毕业证)南加利福尼亚大学毕业证成绩单制做办理
存档可查的(USC毕业证)南加利福尼亚大学毕业证成绩单制做办理
 
可查真实(Monash毕业证)西澳大学毕业证成绩单退学买
可查真实(Monash毕业证)西澳大学毕业证成绩单退学买可查真实(Monash毕业证)西澳大学毕业证成绩单退学买
可查真实(Monash毕业证)西澳大学毕业证成绩单退学买
 
办理毕业证(UPenn毕业证)宾夕法尼亚大学毕业证成绩单快速办理
办理毕业证(UPenn毕业证)宾夕法尼亚大学毕业证成绩单快速办理办理毕业证(UPenn毕业证)宾夕法尼亚大学毕业证成绩单快速办理
办理毕业证(UPenn毕业证)宾夕法尼亚大学毕业证成绩单快速办理
 
Ready to Unlock the Power of Blockchain!
Ready to Unlock the Power of Blockchain!Ready to Unlock the Power of Blockchain!
Ready to Unlock the Power of Blockchain!
 
manuaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaal
manuaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaalmanuaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaal
manuaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaal
 
Should Repositories Participate in the Fediverse?
Should Repositories Participate in the Fediverse?Should Repositories Participate in the Fediverse?
Should Repositories Participate in the Fediverse?
 
学位认证网(DU毕业证)迪肯大学毕业证成绩单一比一原版制作
学位认证网(DU毕业证)迪肯大学毕业证成绩单一比一原版制作学位认证网(DU毕业证)迪肯大学毕业证成绩单一比一原版制作
学位认证网(DU毕业证)迪肯大学毕业证成绩单一比一原版制作
 
Bridging the Digital Gap Brad Spiegel Macon, GA Initiative.pptx
Bridging the Digital Gap Brad Spiegel Macon, GA Initiative.pptxBridging the Digital Gap Brad Spiegel Macon, GA Initiative.pptx
Bridging the Digital Gap Brad Spiegel Macon, GA Initiative.pptx
 

Historical Research Breakout Session Notes, WIRE 2014

  • 1. Breakout #3: Historical Research Ian Milligan (in the hot seat today), Stephen Robertson, Thomas Risse, Kris Carpenter Negulescu, Pamela Graham, Niels Brügger, Ivana Marenzi, Katie Kang (thanks for the notes!), Ed Fox, Meghan Dougherty, Matt Connolley
  • 2. What we did? (I think) • Tackled some big problems/questions • Explored what historians and others might want to do with this material • Thought about community building.. • And came up with a tangible path forward!
  • 3. First big problem we discussed about historians..
  • 4.
  • 5. Building Interest? • Eric Meyer: rooms were “thin on the ground” • Internet Archive needs to be viewed from many different perspectives • CS doing their way, historians their own, Internet researchers their own, etc. • Historians are interested, but get FRUSTRATED quickly when they realize how hard their tasks are
  • 6. Building Interest? • Problem is that historians trained in a particular way, but this offers a new way (the shift from scarcity to abundance) • And really important to emphasize the QUESTIONS that we have. Can’t just study the web, have to have reasons to do so.
  • 7. First big question • Should we create new research corpuses, or use existing ones?! • Should we make research corpuses ourselves? • What moment are we at now?
  • 8. Second problem • Technical ones • Historians are interested, but too hard to answer questions because we don’t have enough technical ability, resources, etc., to do it this way. • How to effect collaboration? (apparently being in a room together helps)! • How can we contextualize documents? How do we avoid viewing them as decontextualized?
  • 9. Distant/Close • Ideal is a way to move between distant and close, but not always there (though some awesome examples today!) • Discussion about whether historians will even care about domain-level studies • So much emphasis on network studies, etc., which may be of limited utility to humanists like historians (or at the very least, need to be complemented by more qualitative forms of research)
  • 10. Too much information, so what can we do?
  • 11. Lots of other problems • How do we deal with the big bang of electronic records (starting in the 1970s)? • How much data is there (we could compare Google index/Altavista etc) to come here? • What about emulation when it comes to digital preservation - using old browsers, etc.? Should we use virtual machines?
  • 12. Community Building • IIPC has no overlap with AoIR • Live web scholars don’t deal with archived web scholars, no community! • How could we build a better community? • Are people seeing themselves reflected in these communities?
  • 13. How quickly can this happen? • Will it take generations? • Or will it come quicker?
  • 14. Final Outcome • Could we have a history of the 1990s without using web archives?! • How could we harness web archives to tell the story of 1995 through 2005? First decade of material? • Or just 1995-2000 due to manageability of this dataset.! • Sort of a sweet spot - manageable data - before the rise of closed off social media, etc. A good way to sell the utility of these archives!
  • 15. Idea • Reach out to other datasets and archives • Pick data from 1995-2000 from Internet Archive (~20TB) and build tools, ask questions, etc.! • Look at different strands (.gov data), GeoCities, news data, e-mails, digital records, etc. • Cool factoid: Herman Miller invented the idea of ‘cube farm,’ if you do term search you can see it rise with the early web.
  • 16. Idea • Some name ideas: • History 2.0 • History in the Digital Age • Polarization • End of Millennium (1995-2000) Historical Studies • End of Millennium (1995-2000) Studies
  • 17. NIST (National Institute of Standards and Technology) TREC (Text Retrieval Conference) TRAK Idea • Sources for the following: • IA collections • BYU, Corpus of Contemporary American English, wide range of digitized resources • Related collections from UK, Germany
  • 18. NIST TREC Track for End of Millennium (1995-2000) Studies • To be proposed and led by Jimmy Lin • IA and others provide <30TB of content • Historians and provide an initial set of questions and tasks (e.g., spread of polarization), for which they have ground truth, from other sources • Researchers materialize systems that can handle those • Final competition: historians pose new questions and tasks and research groups submit results for analysis