Scholarly Requirements
for Large Scale Text
Analysis
A USER NEEDS ASSESSMENT FOR THE HATHITRUST RESEARCH CENTER
Harriett Green, Eleanor Dickson, and Sayan Bhattacharyya
DH 2016, July 15, 2016
What is the HathiTrust Research Center?
• Jointly led by the University of Illinois at Urbana-Champaign and
Indiana University Bloomington
• Facilitates text analysis of HathiTrust Digital Library (HTDL) content → Focus on
large-scale, computational research
• Research & Development
• Finding technical solutions
• Building tools and services
• Conducting user studies
http://www.hathitrust.org/htrc
Scholarly Practices with Digital
Collections and Tools
How humanities scholars use digital collections: Brockman et al., 2001;
Palmer and Neumann, 2002; Babeu, 2011; Rutner and Schonfeld, 2011; Green
and Courtney, 2015
How humanities scholars use digital tools: Frischer et al., 2006; Warwick,
2008; Toms and O’Brien, 2008; Gibbs and Owen, 2012
Tools and resources for textual analysis: ARTFL and Philologic (Argamon et
al., 2009; Horton et al., 2009), MONK (Unsworth, 2011), Wordseer
(Muralidharan and Hearst, 2013), Voyant and TaPOR (Rockwell et al., 2010),
and Lexos (LeBlanc et al., 2013)
Workset Creation for Scholarly Analysis
GOAL: Find out how researchers collect together digital materials and build
textual corpora for research purposes.
Findings (Green et al. 2014, Fenlon et al. 2014):
• Ability to create and manipulate collections as reusable datasets
and research products
• Ability to work at different units of analysis
• Access to highly enriched metadata
HTRC User Requirements Study:
Research Goals
• Learn how researchers use digitized textual corpora, apply relevant
methods and approaches, and seek needed tools
• Develop illustrative use cases of text analysis research that will help
shape the development and expansion of HTRC research services and
training curricula for scholars
• Obtain information that can inform development of text analysis data
providers and research services
HTRC User Requirements Study:
Methods
• Recruited interviewees from 2015 professional conferences and
meetings on digital libraries and digital humanities
• Semi-structured interviews with 15 scholars
• All interviews coded by hand and in ATLAS.ti by HTRC Scholarly
Commons members
• Currently conducting in-depth qualitative content analysis
Preliminary Findings
What are scholars’ needs and practices for conducting textual analysis
with large text archives?
Data Acquisition and Management
Negotiating Results and Findings
Research Collaborations
Teaching and Training
Analysis: Data Acquisition and
Management
“I think the biggest challenge is data, getting good data to work with. I think people
underestimate the problems and difficulties in doing that.”
“My general like philosophical approach to these things is I like to do things small. I
build my corpora. I like to read them myself. I’m a little weary of like big distant
reading approaches, especially with stuff as far away from the present as my stuff.
So I’m still trying to perfect the stuff that I’m currently doing.”
“The bad thing is that you can get a negative result in a way you can’t get a negative
result in other methods...I might get garbage data. I might get stuff that doesn’t
make sense. I might get no findings at all.”
Analysis: Generating and Negotiating
Findings
“I yearn, I think, for workflows where we can actually—I don’t know what this would
look like in an interface particularly, but so that the scholar could actually set their own
tokenization rules. I think that would be really valuable. It would be a way that we could
create less language specific or actually, I should say control the language specificity of
the algorithm. I think that is the real need.”
“I wish more people were archiving their data and their algorithms from the source
code, as you see CS papers that will benchmark results against a dataset that’s no
longer valid, available. Then how do you try to replicate or beat those results? It
becomes impossible to evaluate your own methods against theirs and really slows
down the pace of research, because if one could surpass state of the art, then that’s an
application and [a] step forward.”
Analysis: Research Collaborations
“I’m not worried about publishing venues, I’m not worried about
reproducibility, I’m not worried about statistics. My own knowledge of
that is pretty good. But the collaborative work style is really hard.”
“I do think that at some point in the not too distant future we need to
form a test bed, which would be a subset of the HathiTrust corpus that
meets certain characteristics. So rather than a random sample I could see
one step would be a corpus around which multiple people could work
and do different kinds of machine tasks.”
Analysis: Teaching and Training
“I once imagined teaching a class in which students learn to script and
actually run analyses against data, but I was told, basically, that that class
isn’t a humanities class anymore— that belongs in computer science.”
“There is however, I will say, a demand amongst faculty to learn this stuff.
I’ve been asked to think about teaching a faculty course, or like a short
course to tell faculty members what is out there.”
“Because the technology moves so quickly, smart people will move with it.
There’s no escape from the fact that this is a self-educational problem. So,
the real challenge is the data itself and getting the data to talk.”
Findings: User Personas
Credits: Alex Kinnaman, Peter Organisciak, Eleanor Dickson
Digital Project Librarian
• Wants flexible, transparent tools
• Role: Research Support staff
• Challenges: Inaccessible data, matching tool to researcher
Faculty Member
• Wants computational resources
• Role: Experienced Researcher
• Challenges: Collaboration, Finding texts
Graduate Student
• Wants examples
• Role: New Researcher
• Challenges: Understanding stats, choosing areas of interest
Looking Forward
• IMLS-funded “Digging Deeper, Reaching Further: Libraries
Empowering Users to Mine the HathiTrust Digital Library”:
http://teach.htrc.Illinois.edu
• Data Capsule development (WCSA II Mellon grant)
• Revision to HTRC Portal and Workset Builder
• Release of extracted features from in-copyright works
Interested in working with HTRC?
HTRC Announcements: htrc-announce-l @ list.indiana.edu
HTRC User Group: htrc-usergroup-l @ list.indiana.edu
Questions? htrc-help@hathitrust.org
Advanced Collaborative Support program: htrc.acs.awards@gmail.com
http://www.hathitrust.org/htrc
Acknowledgements
University of Illinois:
Beth Sandore Namachchivaya
Stephen Downie
Megan Senseney
Peter Organisciak, UX Specialist
Alex Kinnaman, Graduate Assistant
Indiana University:
Angela Courtney
Nicholae Cline
Leanne Mobley
Robert McDonald
Thank you!
Harriett Green
green19@Illinois.edu | @greenharr
Eleanor Dickson
dicksone@Illinois.edu
Sayan Bhattacharyya
sayan@Illinois.edu

More Related Content

What's hot

Collaborative Research Relationships in Digital Humanities (HASTAC 2015 prese...
Collaborative Research Relationships in Digital Humanities (HASTAC 2015 prese...Collaborative Research Relationships in Digital Humanities (HASTAC 2015 prese...
Collaborative Research Relationships in Digital Humanities (HASTAC 2015 prese...alixk
 
User Engagement with Digital Archives: A Case Study of Emblematica Online
User Engagement with Digital Archives: A Case Study of Emblematica Online User Engagement with Digital Archives: A Case Study of Emblematica Online
User Engagement with Digital Archives: A Case Study of Emblematica Online Harriett Green
 
Digital Visitors and Residents: Project Feedback
Digital Visitors and Residents: Project FeedbackDigital Visitors and Residents: Project Feedback
Digital Visitors and Residents: Project Feedbackjisc-elearning
 
Collaborative Digital Pedagogy for Digital Literacies in Humanities Classrooms
Collaborative Digital Pedagogy for Digital Literacies in Humanities ClassroomsCollaborative Digital Pedagogy for Digital Literacies in Humanities Classrooms
Collaborative Digital Pedagogy for Digital Literacies in Humanities ClassroomsHarriett Green
 
Digital Libraries on International Campuses
Digital Libraries on International CampusesDigital Libraries on International Campuses
Digital Libraries on International CampusesHarriett Green
 
User Engagement with Digital Archives: A Case Study of Emblematica Online
User Engagement with Digital Archives: A Case Study of Emblematica OnlineUser Engagement with Digital Archives: A Case Study of Emblematica Online
User Engagement with Digital Archives: A Case Study of Emblematica OnlineHarriett Green
 
Library Of The Future – An Academic Librarian
Library Of The Future – An Academic LibrarianLibrary Of The Future – An Academic Librarian
Library Of The Future – An Academic LibrarianKara Jones
 
Beyond the Scanned Image: A Needs Assessment of Faculty Users of Digital Coll...
Beyond the Scanned Image: A Needs Assessment of Faculty Users of Digital Coll...Beyond the Scanned Image: A Needs Assessment of Faculty Users of Digital Coll...
Beyond the Scanned Image: A Needs Assessment of Faculty Users of Digital Coll...Harriett Green
 
In Context: Case Studies in Integrated Physical and Virtual Library Service D...
In Context: Case Studies in Integrated Physical and Virtual Library Service D...In Context: Case Studies in Integrated Physical and Virtual Library Service D...
In Context: Case Studies in Integrated Physical and Virtual Library Service D...Jason Casden
 
Building and Managing Social Media Collections
Building and Managing Social Media CollectionsBuilding and Managing Social Media Collections
Building and Managing Social Media CollectionsJason Casden
 
LSC Glasgow 061609
LSC Glasgow 061609LSC Glasgow 061609
LSC Glasgow 061609John MacColl
 
Blending in-person and online library services by utilizing mobile technology
Blending in-person and online library services by utilizing mobile technologyBlending in-person and online library services by utilizing mobile technology
Blending in-person and online library services by utilizing mobile technologyJason Casden
 
The Future of Libraries (for beginners)
The Future of Libraries (for beginners)The Future of Libraries (for beginners)
The Future of Libraries (for beginners)Jenna Kammer
 
Potential future of reference presentation keough 2016
Potential future of reference presentation keough 2016Potential future of reference presentation keough 2016
Potential future of reference presentation keough 2016Cathay Keough (she, her, hers)
 
Digital Libraries, Digital Archives, Digital Humanities, Digital Scholarship:...
Digital Libraries, Digital Archives, Digital Humanities, Digital Scholarship:...Digital Libraries, Digital Archives, Digital Humanities, Digital Scholarship:...
Digital Libraries, Digital Archives, Digital Humanities, Digital Scholarship:...Jenn Riley
 
Who do they think we are? Addressing library identity perception in the academy
Who do they think we are? Addressing library identity perception in the academyWho do they think we are? Addressing library identity perception in the academy
Who do they think we are? Addressing library identity perception in the academyAnnis Lee Adams
 
December 2, 2015: NISO/NFAIS Virtual Conference: Semantic Web: What's New and...
December 2, 2015: NISO/NFAIS Virtual Conference: Semantic Web: What's New and...December 2, 2015: NISO/NFAIS Virtual Conference: Semantic Web: What's New and...
December 2, 2015: NISO/NFAIS Virtual Conference: Semantic Web: What's New and...DeVonne Parks, CEM
 
Exploring a world of networked information built from free-text metadata
Exploring a world of networked information built from free-text metadataExploring a world of networked information built from free-text metadata
Exploring a world of networked information built from free-text metadataShenghui Wang
 

What's hot (20)

Collaborative Research Relationships in Digital Humanities (HASTAC 2015 prese...
Collaborative Research Relationships in Digital Humanities (HASTAC 2015 prese...Collaborative Research Relationships in Digital Humanities (HASTAC 2015 prese...
Collaborative Research Relationships in Digital Humanities (HASTAC 2015 prese...
 
User Engagement with Digital Archives: A Case Study of Emblematica Online
User Engagement with Digital Archives: A Case Study of Emblematica Online User Engagement with Digital Archives: A Case Study of Emblematica Online
User Engagement with Digital Archives: A Case Study of Emblematica Online
 
Digital Visitors and Residents: Project Feedback
Digital Visitors and Residents: Project FeedbackDigital Visitors and Residents: Project Feedback
Digital Visitors and Residents: Project Feedback
 
Collaborative Digital Pedagogy for Digital Literacies in Humanities Classrooms
Collaborative Digital Pedagogy for Digital Literacies in Humanities ClassroomsCollaborative Digital Pedagogy for Digital Literacies in Humanities Classrooms
Collaborative Digital Pedagogy for Digital Literacies in Humanities Classrooms
 
Digital Libraries on International Campuses
Digital Libraries on International CampusesDigital Libraries on International Campuses
Digital Libraries on International Campuses
 
User Engagement with Digital Archives: A Case Study of Emblematica Online
User Engagement with Digital Archives: A Case Study of Emblematica OnlineUser Engagement with Digital Archives: A Case Study of Emblematica Online
User Engagement with Digital Archives: A Case Study of Emblematica Online
 
Library Of The Future – An Academic Librarian
Library Of The Future – An Academic LibrarianLibrary Of The Future – An Academic Librarian
Library Of The Future – An Academic Librarian
 
Beyond the Scanned Image: A Needs Assessment of Faculty Users of Digital Coll...
Beyond the Scanned Image: A Needs Assessment of Faculty Users of Digital Coll...Beyond the Scanned Image: A Needs Assessment of Faculty Users of Digital Coll...
Beyond the Scanned Image: A Needs Assessment of Faculty Users of Digital Coll...
 
Supporting Open Access Publishing via Open Journal Systems – One Library’s ex...
Supporting Open Access Publishing via Open Journal Systems – One Library’s ex...Supporting Open Access Publishing via Open Journal Systems – One Library’s ex...
Supporting Open Access Publishing via Open Journal Systems – One Library’s ex...
 
In Context: Case Studies in Integrated Physical and Virtual Library Service D...
In Context: Case Studies in Integrated Physical and Virtual Library Service D...In Context: Case Studies in Integrated Physical and Virtual Library Service D...
In Context: Case Studies in Integrated Physical and Virtual Library Service D...
 
Building and Managing Social Media Collections
Building and Managing Social Media CollectionsBuilding and Managing Social Media Collections
Building and Managing Social Media Collections
 
LSC Glasgow 061609
LSC Glasgow 061609LSC Glasgow 061609
LSC Glasgow 061609
 
Blending in-person and online library services by utilizing mobile technology
Blending in-person and online library services by utilizing mobile technologyBlending in-person and online library services by utilizing mobile technology
Blending in-person and online library services by utilizing mobile technology
 
The Future of Libraries (for beginners)
The Future of Libraries (for beginners)The Future of Libraries (for beginners)
The Future of Libraries (for beginners)
 
Potential future of reference presentation keough 2016
Potential future of reference presentation keough 2016Potential future of reference presentation keough 2016
Potential future of reference presentation keough 2016
 
What Libraries Still Need from Discovery Layers
What Libraries Still Need from Discovery LayersWhat Libraries Still Need from Discovery Layers
What Libraries Still Need from Discovery Layers
 
Digital Libraries, Digital Archives, Digital Humanities, Digital Scholarship:...
Digital Libraries, Digital Archives, Digital Humanities, Digital Scholarship:...Digital Libraries, Digital Archives, Digital Humanities, Digital Scholarship:...
Digital Libraries, Digital Archives, Digital Humanities, Digital Scholarship:...
 
Who do they think we are? Addressing library identity perception in the academy
Who do they think we are? Addressing library identity perception in the academyWho do they think we are? Addressing library identity perception in the academy
Who do they think we are? Addressing library identity perception in the academy
 
December 2, 2015: NISO/NFAIS Virtual Conference: Semantic Web: What's New and...
December 2, 2015: NISO/NFAIS Virtual Conference: Semantic Web: What's New and...December 2, 2015: NISO/NFAIS Virtual Conference: Semantic Web: What's New and...
December 2, 2015: NISO/NFAIS Virtual Conference: Semantic Web: What's New and...
 
Exploring a world of networked information built from free-text metadata
Exploring a world of networked information built from free-text metadataExploring a world of networked information built from free-text metadata
Exploring a world of networked information built from free-text metadata
 

Viewers also liked

Open text analysis tools & techniques for the eap practitioner
Open text analysis tools & techniques for the eap practitionerOpen text analysis tools & techniques for the eap practitioner
Open text analysis tools & techniques for the eap practitionerAlannah Fitzgerald
 
Movie Review Apothecary
Movie Review ApothecaryMovie Review Apothecary
Movie Review ApothecaryJunaid Rahsain
 
Concept mapping and text analysis (WRAB3 poster)
Concept mapping and text analysis (WRAB3 poster)Concept mapping and text analysis (WRAB3 poster)
Concept mapping and text analysis (WRAB3 poster)Lawrie Hunter
 
bahubali movie review
bahubali movie reviewbahubali movie review
bahubali movie reviewsudheer kumar
 
Haapus Movie Review
Haapus Movie ReviewHaapus Movie Review
Haapus Movie ReviewPrem Raut
 
Character sketch
Character sketch Character sketch
Character sketch wasayrao
 
Character Traits
Character TraitsCharacter Traits
Character TraitsH Seefeldt!
 
Movie review of LIFE OF PI
Movie review of LIFE OF PIMovie review of LIFE OF PI
Movie review of LIFE OF PIMohit Soni
 
PLOTCON NYC: Text is data! Analysis and Visualization Methods
PLOTCON NYC: Text is data! Analysis and Visualization MethodsPLOTCON NYC: Text is data! Analysis and Visualization Methods
PLOTCON NYC: Text is data! Analysis and Visualization MethodsPlotly
 
Textual & Sentiment Analysis of Movie Reviews
Textual & Sentiment Analysis of Movie ReviewsTextual & Sentiment Analysis of Movie Reviews
Textual & Sentiment Analysis of Movie ReviewsYousef Fadila
 
How to Write a Movie Review: Full Guide
How to Write a Movie Review: Full GuideHow to Write a Movie Review: Full Guide
How to Write a Movie Review: Full GuideReview Essay
 
Introduction to Text Analysis
Introduction to Text AnalysisIntroduction to Text Analysis
Introduction to Text AnalysisLauren Klein
 
Text analysis presentation ppt
Text analysis presentation pptText analysis presentation ppt
Text analysis presentation pptMs A
 
Understanding text-structure-powerpoint
Understanding text-structure-powerpointUnderstanding text-structure-powerpoint
Understanding text-structure-powerpointaelowans
 

Viewers also liked (20)

Open text analysis tools & techniques for the eap practitioner
Open text analysis tools & techniques for the eap practitionerOpen text analysis tools & techniques for the eap practitioner
Open text analysis tools & techniques for the eap practitioner
 
Movie Review Apothecary
Movie Review ApothecaryMovie Review Apothecary
Movie Review Apothecary
 
Breakfast at tiffany's Movie review
Breakfast at tiffany's  Movie reviewBreakfast at tiffany's  Movie review
Breakfast at tiffany's Movie review
 
Concept mapping and text analysis (WRAB3 poster)
Concept mapping and text analysis (WRAB3 poster)Concept mapping and text analysis (WRAB3 poster)
Concept mapping and text analysis (WRAB3 poster)
 
bahubali movie review
bahubali movie reviewbahubali movie review
bahubali movie review
 
Haapus Movie Review
Haapus Movie ReviewHaapus Movie Review
Haapus Movie Review
 
Review movie
Review movieReview movie
Review movie
 
Character sketch
Character sketch Character sketch
Character sketch
 
Character Traits
Character TraitsCharacter Traits
Character Traits
 
Text analysis using python
Text analysis using pythonText analysis using python
Text analysis using python
 
Movie review of LIFE OF PI
Movie review of LIFE OF PIMovie review of LIFE OF PI
Movie review of LIFE OF PI
 
PLOTCON NYC: Text is data! Analysis and Visualization Methods
PLOTCON NYC: Text is data! Analysis and Visualization MethodsPLOTCON NYC: Text is data! Analysis and Visualization Methods
PLOTCON NYC: Text is data! Analysis and Visualization Methods
 
Titanic Movie Review
Titanic Movie ReviewTitanic Movie Review
Titanic Movie Review
 
Textual & Sentiment Analysis of Movie Reviews
Textual & Sentiment Analysis of Movie ReviewsTextual & Sentiment Analysis of Movie Reviews
Textual & Sentiment Analysis of Movie Reviews
 
How to Write a Movie Review: Full Guide
How to Write a Movie Review: Full GuideHow to Write a Movie Review: Full Guide
How to Write a Movie Review: Full Guide
 
Character Sketch
Character SketchCharacter Sketch
Character Sketch
 
Character.Sketch
Character.SketchCharacter.Sketch
Character.Sketch
 
Introduction to Text Analysis
Introduction to Text AnalysisIntroduction to Text Analysis
Introduction to Text Analysis
 
Text analysis presentation ppt
Text analysis presentation pptText analysis presentation ppt
Text analysis presentation ppt
 
Understanding text-structure-powerpoint
Understanding text-structure-powerpointUnderstanding text-structure-powerpoint
Understanding text-structure-powerpoint
 

Similar to Scholarly Requirements for Large Scale Text Analysis

Carter ACSPRI July2016
Carter ACSPRI July2016Carter ACSPRI July2016
Carter ACSPRI July2016Jackie Carter
 
Managing Ireland's Research Data - 3 Research Methods
Managing Ireland's Research Data - 3 Research MethodsManaging Ireland's Research Data - 3 Research Methods
Managing Ireland's Research Data - 3 Research MethodsRebecca Grant
 
"UX for the win!" at #CityMash: how we did grounded theory coding of qualitat...
"UX for the win!" at #CityMash: how we did grounded theory coding of qualitat..."UX for the win!" at #CityMash: how we did grounded theory coding of qualitat...
"UX for the win!" at #CityMash: how we did grounded theory coding of qualitat...Andrew Preater
 
The Hidden Data of Social Media Rearch_CSS-winter-symposium
The Hidden Data of Social Media Rearch_CSS-winter-symposiumThe Hidden Data of Social Media Rearch_CSS-winter-symposium
The Hidden Data of Social Media Rearch_CSS-winter-symposiumKatrin Weller
 
UCLALibrary_DResSUP.pptx
UCLALibrary_DResSUP.pptxUCLALibrary_DResSUP.pptx
UCLALibrary_DResSUP.pptxrobhill123
 
Aligning Learning Analytics with Classroom Practices & Needs
Aligning Learning Analytics with Classroom Practices & NeedsAligning Learning Analytics with Classroom Practices & Needs
Aligning Learning Analytics with Classroom Practices & NeedsSimon Knight
 
Requirements for Learning Analytics
Requirements for Learning AnalyticsRequirements for Learning Analytics
Requirements for Learning AnalyticsTore Hoel
 
Poster: Perspectives on Increasing Competency in Using Digital Practices and ...
Poster: Perspectives on Increasing Competency in Using Digital Practices and ...Poster: Perspectives on Increasing Competency in Using Digital Practices and ...
Poster: Perspectives on Increasing Competency in Using Digital Practices and ...Katja Reuter, PhD
 
Digital Frontiers 2014: Developing Library Services for Digital Humanities & ...
Digital Frontiers 2014: Developing Library Services for Digital Humanities & ...Digital Frontiers 2014: Developing Library Services for Digital Humanities & ...
Digital Frontiers 2014: Developing Library Services for Digital Humanities & ...librarianrafia
 
User experience at Imperial: a case study of qualitative approaches to Primo ...
User experience at Imperial: a case study of qualitative approaches to Primo ...User experience at Imperial: a case study of qualitative approaches to Primo ...
User experience at Imperial: a case study of qualitative approaches to Primo ...Andrew Preater
 
Slides | Targeting the librarian’s role in research services
Slides | Targeting the librarian’s role in research servicesSlides | Targeting the librarian’s role in research services
Slides | Targeting the librarian’s role in research servicesLibrary_Connect
 
Asa integrating data 2 19-2014 with cites
Asa integrating data 2 19-2014 with citesAsa integrating data 2 19-2014 with cites
Asa integrating data 2 19-2014 with citesICPSR
 
Enhancing Learning & Participation: Critical Thinking Strategies & Practice
Enhancing Learning & Participation: Critical Thinking Strategies & PracticeEnhancing Learning & Participation: Critical Thinking Strategies & Practice
Enhancing Learning & Participation: Critical Thinking Strategies & PracticeSt. Petersburg College
 
Rscd 2017 bo f data lifecycle data skills for libs
Rscd 2017 bo f data lifecycle data skills for libsRscd 2017 bo f data lifecycle data skills for libs
Rscd 2017 bo f data lifecycle data skills for libsSusanMRob
 
Reviewing the role of teaching librarians in supporting student's digital cap...
Reviewing the role of teaching librarians in supporting student's digital cap...Reviewing the role of teaching librarians in supporting student's digital cap...
Reviewing the role of teaching librarians in supporting student's digital cap...IL Group (CILIP Information Literacy Group)
 

Similar to Scholarly Requirements for Large Scale Text Analysis (20)

Carter ACSPRI July2016
Carter ACSPRI July2016Carter ACSPRI July2016
Carter ACSPRI July2016
 
Managing Ireland's Research Data - 3 Research Methods
Managing Ireland's Research Data - 3 Research MethodsManaging Ireland's Research Data - 3 Research Methods
Managing Ireland's Research Data - 3 Research Methods
 
"UX for the win!" at #CityMash: how we did grounded theory coding of qualitat...
"UX for the win!" at #CityMash: how we did grounded theory coding of qualitat..."UX for the win!" at #CityMash: how we did grounded theory coding of qualitat...
"UX for the win!" at #CityMash: how we did grounded theory coding of qualitat...
 
The Hidden Data of Social Media Rearch_CSS-winter-symposium
The Hidden Data of Social Media Rearch_CSS-winter-symposiumThe Hidden Data of Social Media Rearch_CSS-winter-symposium
The Hidden Data of Social Media Rearch_CSS-winter-symposium
 
UCLALibrary_DResSUP.pptx
UCLALibrary_DResSUP.pptxUCLALibrary_DResSUP.pptx
UCLALibrary_DResSUP.pptx
 
Ps rwebinar january2019final
Ps rwebinar january2019finalPs rwebinar january2019final
Ps rwebinar january2019final
 
Aligning Learning Analytics with Classroom Practices & Needs
Aligning Learning Analytics with Classroom Practices & NeedsAligning Learning Analytics with Classroom Practices & Needs
Aligning Learning Analytics with Classroom Practices & Needs
 
Requirements for Learning Analytics
Requirements for Learning AnalyticsRequirements for Learning Analytics
Requirements for Learning Analytics
 
Poster: Perspectives on Increasing Competency in Using Digital Practices and ...
Poster: Perspectives on Increasing Competency in Using Digital Practices and ...Poster: Perspectives on Increasing Competency in Using Digital Practices and ...
Poster: Perspectives on Increasing Competency in Using Digital Practices and ...
 
Websci 2018
Websci 2018Websci 2018
Websci 2018
 
Digital Frontiers 2014: Developing Library Services for Digital Humanities & ...
Digital Frontiers 2014: Developing Library Services for Digital Humanities & ...Digital Frontiers 2014: Developing Library Services for Digital Humanities & ...
Digital Frontiers 2014: Developing Library Services for Digital Humanities & ...
 
Qs1 group a
Qs1 group a Qs1 group a
Qs1 group a
 
User experience at Imperial: a case study of qualitative approaches to Primo ...
User experience at Imperial: a case study of qualitative approaches to Primo ...User experience at Imperial: a case study of qualitative approaches to Primo ...
User experience at Imperial: a case study of qualitative approaches to Primo ...
 
El suport bibliotecari a la recerca en un context de ciència oberta
El suport bibliotecari a la recerca en un context de ciència obertaEl suport bibliotecari a la recerca en un context de ciència oberta
El suport bibliotecari a la recerca en un context de ciència oberta
 
Slides | Targeting the librarian’s role in research services
Slides | Targeting the librarian’s role in research servicesSlides | Targeting the librarian’s role in research services
Slides | Targeting the librarian’s role in research services
 
Asa integrating data 2 19-2014 with cites
Asa integrating data 2 19-2014 with citesAsa integrating data 2 19-2014 with cites
Asa integrating data 2 19-2014 with cites
 
Enhancing Learning & Participation: Critical Thinking Strategies & Practice
Enhancing Learning & Participation: Critical Thinking Strategies & PracticeEnhancing Learning & Participation: Critical Thinking Strategies & Practice
Enhancing Learning & Participation: Critical Thinking Strategies & Practice
 
Hahn "Student perspectives on personalized account-based recommender systems ...
Hahn "Student perspectives on personalized account-based recommender systems ...Hahn "Student perspectives on personalized account-based recommender systems ...
Hahn "Student perspectives on personalized account-based recommender systems ...
 
Rscd 2017 bo f data lifecycle data skills for libs
Rscd 2017 bo f data lifecycle data skills for libsRscd 2017 bo f data lifecycle data skills for libs
Rscd 2017 bo f data lifecycle data skills for libs
 
Reviewing the role of teaching librarians in supporting student's digital cap...
Reviewing the role of teaching librarians in supporting student's digital cap...Reviewing the role of teaching librarians in supporting student's digital cap...
Reviewing the role of teaching librarians in supporting student's digital cap...
 

Scholarly Requirements for Large Scale Text Analysis

  • 1. Scholarly Requirements for Large Scale Text Analysis A USER NEEDS ASSESSMENT FOR THE HATHITRUST RESEARCH CENTER Harriett Green, Eleanor Dickson, and Sayan Bhattacharyya DH 2016, July 15, 2016
  • 2. What is the HathiTrust Research Center? • Jointly led by the University of Illinois at Urbana-Champaign and Indiana University Bloomington • Facilitates text analysis of HTDL content → Focus on large-scale, computational research • Research & Development • Finding technical solutions • Building tools and services • Conducting user studies http://www.hathitrust.org/htrc
  • 3. Scholarly Practices with Digital Collections and Tools How humanities scholars use digital collections: Brockman et al., 2001; Palmer and Neumann, 2002; Babeu 2011; Rutner and Schonfeld, 2011; Green and Courtney, 2015 How humanities scholars use digital tools: Frischer et al., 2006; Warwick 2008; Toms and O’Brien, 2008; Gibbs and Owen, 2012 Tools and resources for textual analysis: ARTFL and Philologic (Argamon et al., 2009; Horton et al., 2009), MONK (Unsworth, 2011), Wordseer (Muralidharan and Hearst, 2013), Voyant and TaPOR (Rockwell et al., 2010), and Lexos (LeBlanc et al., 2013)
  • 4. Workset Creation for Scholarly Analysis GOAL: Find out how researchers collect together digital materials and build textual corpora for research purposes. Findings (Green et al. 2014, Fenlon et al. 2014): • Need the ability to create and manipulate collections as reusable datasets and research products • The ability to work at different units of analysis • Access to highly enriched metadata
  • 5. HTRC User Requirements Study: Research Goals • Learn how researchers use digitized textual corpora, apply relevant methods and approaches, and seek needed tools • Develop illustrative use cases of text analysis research that will help shape the development and expansion of HTRC research services and training curricula for scholars • Obtain information that can inform development of text analysis data providers and research services
  • 6. HTRC User Requirements Study: Methods • Recruited interviewees from 2015 professional conferences and meetings on digital libraries and digital humanities • Semi-structured interviews with 15 scholars • All interviews coded by hand and in ATLAS.ti by HTRC Scholarly Commons members • Currently conducting in-depth qualitative content analysis
  • 7. Preliminary Findings What are scholars’ needs and practices for conducting textual analysis with large text archives? Data Acquisition and Management Negotiating Results and Findings Research Collaborations Teaching and Training
  • 8. Analysis: Data Acquisition and Management “I think the biggest challenge is data, getting good data to work with. I think people underestimate the problems and difficulties in doing that.” “My general like philosophical approach to these things is I like to do things small. I build my corpora. I like to read them myself. I’m a little weary of like big distant reading approaches, especially with stuff as far away from the present as my stuff. So I’m still trying to perfect the stuff that I’m currently doing.” “The bad thing is that you can get a negative result in a way you can’t get a negative result in other methods...I might get garbage data. I might get stuff that doesn’t make sense. I might get no findings at all.”
  • 9. Analysis: Generating and Negotiating Findings “I yearn, I think, for workflows where we can actually—I don’t know what this would look like in an interface particularly, but so that the scholar could actually set their own tokenization rules. I think that would be really valuable. It would be a way that we could create less language specific or actually, I should say control the language specificity of the algorithm. I think that is the real need.” “I wish more people were archiving their data and their algorithms from the source code, as you see CS papers that will benchmark results against a dataset that’s no longer valid, available. Then how do you try to replicate or beat those results? It becomes impossible to evaluate your own methods against theirs and really slows down the pace of research, because if one could surpass state of the art, then that’s an application and [a] step forward.”
  • 10. Analysis: Research Collaborations “I’m not worried about publishing venues, I’m not worried about reproducibility, I’m not worried about statistics. My own knowledge of that is pretty good. But the collaborative work style is really hard.” “I do think that at some point in the not too distant future we need to form a test bed, which would be a subset of the HathiTrust corpus that meets certain characteristics. So rather than a random sample I could see one step would be a corpus around which multiple people could work and do different kinds of machine tasks.”
  • 11. Analysis: Teaching and Training “I once imagined teaching a class in which students learn to script and actually run analyses against data, but I was told, basically, that that class isn’t a humanities class anymore— that belongs in computer science.” “There is however, I will say, a demand amongst faculty to learn this stuff. I’ve been asked to think about teaching a faculty course, or like a short course to tell faculty members what is out there.” “Because the technology moves so quickly, smart people will move with it. There’s no escape from the fact that this is a self-educational problem. So, the real challenge is the data itself and getting the data to talk.”
  • 12. Findings: User Personas Credits: Alex Kinnaman, Peter Organisciak, Eleanor Dickson Digital Project Librarian • Wants flexible, transparent tools • Role: Research Support staff • Challenges: Inaccessible data, matching tool to researcher Faculty Member • Wants computational resources • Role: Experienced Researcher • Challenges: Collaboration, Finding texts Graduate Student • Wants examples • Role: New Researcher • Challenges: Understanding stats, choosing areas of interest
  • 13. Looking Forward • IMLS-funded “Digging Deeper, Reaching Further: Libraries Empowering Users to Mine the HathiTrust Digital Library”: http://teach.htrc.Illinois.edu • Data Capsule development (WCSA II Mellon grant) • Revision to HTRC Portal and Workset Builder • Release of extracted features from in-copyright works
  • 14. Interested in working with HTRC? HTRC Announcements: htrc-announce-l @ list.indiana.edu HTRC User Group: htrc-usergroup-l @ list.indiana.edu Questions? htrc-help@hathitrust.org Advanced Collaborative Support program: htrc.acs.awards@gmail.com http://www.hathitrust.org/htrc
  • 15. Acknowledgements University of Illinois: Beth Sandore Namachchivaya, Stephen Downie, Megan Senseney, Peter Organisciak (UX Specialist), Alex Kinnaman (Graduate Assistant). Indiana University: Angela Courtney, Nicholae Cline, Leanne Mobley, Robert McDonald
  • 16. Thank you! Harriett Green green19@Illinois.edu | @greenharr Eleanor Dickson dicksone@Illinois.edu Sayan Bhattacharyya sayan@Illinois.edu