SlideShare a Scribd company logo
HATHITRUST
                A Shared Digital Repository




HathiTrust: Aspiring to Build
   the Universal Library
             UKSG Annual Conference
                March 26-28, 2012
      Jeremy York, Project Librarian, HathiTrust
Partnership
Arizona State University     North Carolina State        University of Connecticut
Baylor University                 University             University of Florida
Boston College               Northwestern University     University of Illinois
Boston University            The Ohio State University   University of Illinois at Chicago
California Digital Library   The Pennsylvania State      The University of Iowa
Columbia University               University
                             Princeton University        University of Maryland
Cornell University
                             Purdue University           University of Miami
Dartmouth College
Duke University              Stanford University         University of Michigan
Emory University             Texas A&M University        University of Minnesota
Florida State University     Universidad Complutense     University of Missouri
Getty Research Institute          de Madrid              University of Nebraska-Lincoln
Harvard University Library   University of Arizona       The University of North
Indiana University           University of Calgary             Carolina at Chapel Hill
Johns Hopkins University     University of California    University of Notre Dame
Lafayette College                 Berkeley
                                  Davis                  University of Pennsylvania
Library of Congress
                                  Irvine                 University of Pittsburgh
Massachusetts Institute of
     Technology                   Los Angeles            University of Utah
McGill University`                Merced                 University of Virginia
Michigan State University         Riverside              University of Washington
New York Public Library           San Diego              University of Wisconsin-
New York University               San Francisco                Madison
North Carolina Central            Santa Barbara          Utah State University
     University                   Santa Cruz
                                                         Washington University
                             The University of Chicago
                                                         Yale University Library
Digital Repository
• Launched 2008
• Initial focus on digitized book and journal
  content
  – 10,109,919 total volumes
  – 5,372,755 book titles
  – 266,540 serial titles
  – 2,802,347 public domain (~28%)
The Name
• The meaning behind the name
  – Hathi (hah-tee)--Hindi for elephant
  – Big, strong
  – Never forgets, wise
  – Secure
  – Trustworthy
Mission
• To contribute to the common good by collecting,
  organizing, preserving, communicating, and
  sharing the record of human knowledge
HathiTrust

     Universal Library

      Common Goal

Single Entity, Many Partners
Collections and Collaboration
• Comprehensive collection
  - Preservation…with Access
• Shared strategies
  –   Copyright
  –   Collection management, development
  –   Preservation
  –   Discovery / Use
  –   Bibliographic Indeterminacy
  –   Efficient user services
• Public Good
Content Distribution
                                                  U.S. Federal
                                                  Government
                                                  Documents
                                                  (worldwide)
                                                       4%
                                                 Public
                                                Domain
72%   "Public Domain"   Public Domain             (US)
            28%          (worldwide)              10%
                             14%

                                                     Open Access
                                                         .1%
                                        Creative Commons
                                               .01%
Content Sources
                       LC     Minnesota
                      1%          1%
                                              Yale UNC-Chapel Hill
                  Harvard    Madrid                        0%
                                     Virginia 0% Utah State
      Indiana       1%        1%
                                       0%             0%       Chicago
         2%                                              NCSU    0%
                     Columbia       NorthwesternDuke
                        1%                         0%     0%
        Princeton                         0% Illinois
                                                        Purdue     Penn State
            3%                                  0%
                NYPL                                      0%           0%
        Cornell 3%
Wisconsin 4%
   5%                                                    Michigan
                                                           45%


                California
                  33%
Dates
                                                  1500-1599
                                   1600-1699
                      1800-1849                      0%
          1900-1909                   0%
                          3%
             4%                                0-1500   2000-2009
                                1700-1799
                      1850-1899                  0%        10%
                         8%        1%
       1910-1919
          4%                                                  1990-1999
1920-1929
   4%                                                            14%
       1930-1939
          4%                                            1980-1989
                                                           15%
       1940-1949            1960-1969       1970-1979
          4%                   11%             13%
         1950-1959
            6%
Language Distribution (1)
                                           The top 10 languages make up
                      Remaining            ~86% of all content
                      Languages
       Arabic   Latin    14%
Italian 2%       1%
  3% Japanese
          3%                                      English
Russian                                            48%
  4%

   Chinese
                                  German
     4%
                                    9%
      Spanish
        5%
             French
               7%
Language Distribution (2)
       Bulgarian ArmenianAncient-Greek
                                     Panjabi Catalan      Malayalam
          1%        1%
                              1%       1%      1%            1%
                                                                  Multiple        The next 40
 Sanskrit                                                           1%
   2% Ukrainian Serbian Marathi          Malay                Undetermined
                                                                                  languages make
             1%       1%Romanian Telugu 1%
                             1%                     Finnish        7%             up ~13% of total
                                                       Slovak
      Vietnamese Greek 1%            1%               1%1%            Polish
   Hungarian          1%                                               7%
          1%
      2%                                                                      Portuguese
        Norwegian                                                          Dutch 7%
            2%                                                               5%
     Music
       2%

Bengali    Tamil
  2%                                                                     Hebrew
            2%
                                                                           5%
  Persian                                                           Hindi
    2%                                                               5%
         Unknown Czech
                                                                     Indonesian
           3%        3%     Thai                             Korean
                                   Turkish Urdu                           4%
                Danish       3%                      Swedish   4%
Croatian                             3%     3%
                 3%                                    3%
  2%
Preservation with Access
• Cost effective preservation and access services
• Preservation
  – TRAC-certified
  – Robust infrastructure
  – Long-term commitments on digital content
    facilitate planning, decision-making
Executive Committee
                                                 Strategic Advisory Board
Budget/Finances Decision-making
                                           Guidance on Policy, Planning
                        Collective Work: Working
                        Groups and Committees

           Operational
          Operational                           Strategic
          •• Communications
             Communications                     • Collections
          •• User Support
             User Support                       • Discovery Interface
          •• User Experience
             User Experience                    • Full-text Search


                             Distributed work

       • Driven by needs of institutions
       • Leverage across the partnership
       • Projects, Grant Work, Ingest
         Specifications, PageTurner, Bibliographic Data Management



                                  HathiTrust
Bibliographic
                          Enterprise           Repository               Repository               Rights                                     Collection
  Governance                                                                                                             Data
                         Management           Administration           Administration          Management                                  Development
                                                                                                                    Management
                            Communication                                Data management                                                    Digital
    Budget, Finances                                 Hardware                                       Copyright         Entity description
                           and Coordination                                   (content                                                      • Expansion beyond
                                                  configuration and                               determination         (record-level)
                             with partner                                storage, backup, in                                                  books and journals
                              institutions          maintenance                                                                               (born-
    Decision-making                                                            tegrity                                                        digital, images and
                                                                          checks, deletion)                                 Object            maps, audio)
                               Project                                                           Copyright review       identification      • Selection of
         Policy              management               Web and                                                            (item-level)         content (for non-
                                                  application server                                                                          Google volume
                                                  configuration and      Hardware selection                                                   ingest and pilots
                                                                                                    Copyright                                 projects)
                                                    maintenance           and replacement         information          Data availability
        Planning
                                                                                                  management                                Print
                                                                                                   (database)                               • Cloud Library (effect
                                                       Security                                                                               of digital on print)
                                                                             Content and
                                                                              Metadata
                                                                            specifications         Rightsholder
                                                                                                   permissions
                                                     Permissions
                                                                          Disaster Recovery

                                                       Logging
                                                                           Processes for
                                                                          ensuring content
                                                                              integrity



                                                                          Quality
e-Commerce             Content Ingest         Content Access                                   User Services           Outreach                       Legal
                                                                         Assurance

                           Transformation          PageTurner             Quality Review                                                            Risk management
  Print on Demand                                                                                    Usability           Project website            (use of materials)


                             Validation         Collection Builder            Content              User support                                         Partner
                                                                            Certification                                   Monthly                   agreements
                                                                                                    (helpdesk)
                                                                                                                           newsletter


                                                Large-scale Search                                                                                      Advocacy
                                                                                                                           Papers and
     Financial                                                                                                            presentations

     contributions                               Research Center
                                                                        HathiTrust Functional                            Communication
     of partners
                                                   Bibliographic
                                                                            Framework                                     with potential
                                                                                                                             partners
                                                      Catalog
                                                                                                                        Surveys, general
                                                       APIs                                                                 inquiries


                                                                                                                           Repository
                                                                                                                         evaluation and
                                                                                                                               audit
                                                                                                                       (e.g., DRAMBORA,
                                                                                                                               TRAC)
Constitutional Convention
•   October 2011
•   52 partners
•   3-year review overseen by SAB
•   Ballot Proposals
    – Print monograph storage
    – Approval Process for development initiatives
    – U.S. Government Documents
    – Fee-for-service content deposit
    – Governance
Emerging Governance
• 12-member Board of Governors
  – 3-member Executive Committee
  – Executive Director
• 6 seats to founding institutions
  – 2 California, 2 CIC (minus Indiana and Michigan)
  – 1 Indiana, 1 Michigan
• Voting (March 1 – March 15)
• Announcement of Results March 30
• Begin work April 16, 2012
Preservation with Access
• Cost effective preservation and access services
• Preservation
  – TRAC-certified
  – Robust infrastructure
  – Long-term commitments on digital content
    facilitate planning, decision-making
Preservation with Access (2)
• Discovery
  – Bibliographic and full-text search of all materials
  – Extended discovery (ProQuest, EBSCO, OCLC, Ex
    Libris)
  – Mechanisms for local loading of records
Preservation with Access (3)
• Access and Use
  – Public domain and open access works
  – Full download of materials where possible*
  – Print on demand
  – Collections and APIs
  – Research Center*
  – Lawful uses of in-copyright works*
Lawful uses
• Access to users who have print disabilities
• Section 108 uses of materials
• Access to orphan works
Terms of Access
• Available to students, faculty, staff of
  partnering institutions
  – On library premises or authenticated into
    HathiTrust
• Partner libraries own a print copy
  – One simultaneous user per print copy owned
• Users must be on U.S. soil
• One page at a time download
How do we facilitate uses?
• Fundamental issues of
  – Identification
  – Description
  – Rights
Approach
• Collective problems as collective
• Web of relationships                   Rights


       Records                     Digital
                                   Volumes




                 Libraries            Print Volumes
Bibliographic Data
• Normalization of bibliographic data
  – University of Michigan
     • Efficiency
  – California Digital Library
Copyright
• Bibliographic metadata
• Automatic and manual rights determination
Automatic Rights Determination
• Conducted on all works at time of ingest and
  when records are modified
  – Public domain worldwide
     • US works published before 1923, US federal
       government publications, non-US works published prior
       to 1872
  – Public domain in the United States
     • Non-US works published prior to 1923
Manual Rights Determination
• IMLS-funded CRMS project
  – US-published works 1923-1963
  – Conformance with formalities
  – Expanding to non-US works
  – Double-blind review with expert review for conflicts
  – Staff at 4 HathiTrust partner institutions (15 will take
    part in non-US)
  – As of February 2012 ~190,000 reviewed, more than
    100,000 opened
• Rights Holder Permissions
Breakdown of HathiTrust book corpus by publication date




Bibliographic Indeterminacy and the Scale of Problems and Opportunities of "Rights" in Digital Collection Building – 2/2011
Breakdown of HathiTrust book corpus by publication date
Copyright status of books published pre-1923 and US works
published 1923-1963
Copyright status of books published pre-1923 and US works
published 1923-1963




                                               Pre-1872 ~ 5%
Copyright status of books published pre-1923 and US works
published 1923-1963




                                               Pre-1872 ~ 5%
                           Public Domain in
                           the US
Copyright status of books published pre-1923 and US works
    published 1923-1963




?




                                                   Pre-1872 ~ 5%
                               Public Domain in
                               the US
Copyright status of books published pre-1923 and US works
published 1923-1963
Copyright status of books published pre-1923 and US works
             published 1923-1963




In Print ?
Collection Management, Development
• Overlap
A global change in the library environment
        60%


                                       Academic print book collection already substantially
        50%
                                       duplicated in mass digitized book corpus
                                                                                         June 2010
% of Titles in Local Collection




        40%                                                                              Median duplication: 31%


        30%




        20%




        10%                                                                              June 2009
                                                                                         Median duplication: 19%

                    0%
                                  0        20        40             60              80         100           120

                                                     Rank in 2008 ARL Investment Index
Digitized Books in Shared Repositories
                                                                                                                      ~3.5M titles

                3,500,000
                               ~75% of mass digitized corpus is ‘backed up’ in one
                               or more shared print repositories
                3,000,000                                                                                                         ~2.5M

                2,500,000
Unique Titles




                2,000,000



                1,500,000



                1,000,000



                 500,000



                       0
                            Sep-09     Oct-09     Nov-09     Dec-09      Jan-10   Feb-10     Mar-10     Apr-10    May-10      Jun-10

                      Mass digitized books in Hathi digital repository        Mass digitized books in shared print repositories
Collection Management, Development
• Overlap
  – More than 50% median overlap with ARL
    institutions; higher for small liberal arts colleges
• Pricing model based on Print holdings
  – Requires print holdings database
  – Also support expansion of legal uses, efforts in de-
    duplication
  – Facilitate individual and collaborative collection
    development and management operations
• Print monographs archiving
Collection Management, Development
• Discovery (OCLC)
• Collections Committee
Comprehensive Picture
• “Definitional Issues”
   – Identification, Description, Rights
• Discovery and Use
   – Finding
   – Relating (APIs and integration)
   – Using (Reading, Computational activities)
• Collection management, development
• Preservation infrastructure
   – Digital and Print
   – Relationships
Work going forward
• Definitional elements
• Print archiving, management
• Discovery and use
    – Lawful uses
•   Research Center
•   Quality
•   Government documents
•   Beyond books and journals
•   Publishing
•   Transitioning to next phase of partnership
How to find out more
• Web site “About” section
   • http://www.hathitrust.org/about
• HathiTrust Research Center
   • http://www.hathitrust.org/htrc
• Twitter
   • http://twitter.com/hathitrust
• Monthly newsletter
   • http://www.hathitrust.org/updates
   • RSS: http://www.hathitrust.org/updates_rss
• Contact us: feedback@issues.hathitrust.org
• Blogs: http://www.hathitrust.org/blogs
   • Large-scale search
   • Perspectives from HathiTrust
Thank you very much!

More Related Content

Similar to 1330 mon katrine york

Key note Joanna Motion
Key note Joanna MotionKey note Joanna Motion
Key note Joanna Motion
Hans Hoornstra
 
Laredo retail real estate
Laredo retail real estateLaredo retail real estate
Laredo retail real estateLaredoCVB
 
Public libraries pulling rank - statistics on the policy maker's agenda
Public libraries pulling rank - statistics on the policy maker's agendaPublic libraries pulling rank - statistics on the policy maker's agenda
Public libraries pulling rank - statistics on the policy maker's agenda
Kristīne Pabērza
 
Institutional Uses of HathiTrust
Institutional Uses of HathiTrustInstitutional Uses of HathiTrust
Institutional Uses of HathiTrust
Maine_SharedCollections
 
Institutional Uses of HathiTrust
Institutional Uses of HathiTrustInstitutional Uses of HathiTrust
Institutional Uses of HathiTrust
Maine_SharedCollections
 
Reaching Your Audience in the Digital Age: Key Research Trends to Watch
Reaching Your Audience in the Digital Age: Key Research Trends to WatchReaching Your Audience in the Digital Age: Key Research Trends to Watch
Reaching Your Audience in the Digital Age: Key Research Trends to Watch
Pew Research Center's Internet & American Life Project
 
ALA 2010 -- Jeremy York
ALA 2010 -- Jeremy YorkALA 2010 -- Jeremy York
ALA 2010 -- Jeremy Yorkbisg
 
RDAP13 DPN Keynote Presentation by Steve Morales
RDAP13 DPN Keynote Presentation by Steve MoralesRDAP13 DPN Keynote Presentation by Steve Morales
RDAP13 DPN Keynote Presentation by Steve Morales
ASIS&T
 
IARSLC Presentation Bonner & CIRCLE
IARSLC Presentation Bonner & CIRCLEIARSLC Presentation Bonner & CIRCLE
IARSLC Presentation Bonner & CIRCLE
Bonner Foundation
 
IMPACT Final Conference - Paul Fogel
IMPACT Final Conference - Paul FogelIMPACT Final Conference - Paul Fogel
IMPACT Final Conference - Paul Fogel
IMPACT Centre of Competence
 
Affordable Learning $olutions Fair, San Jose State University
Affordable Learning $olutions Fair, San Jose State UniversityAffordable Learning $olutions Fair, San Jose State University
Affordable Learning $olutions Fair, San Jose State University
Emily Puckett Rodgers
 
Audience Analysis:Role of Journalism & Social Media in the Consumption of New...
Audience Analysis:Role of Journalism & Social Media in the Consumption of New...Audience Analysis:Role of Journalism & Social Media in the Consumption of New...
Audience Analysis:Role of Journalism & Social Media in the Consumption of New...
Hayder Hamzoz
 
Integrating Social Media with Civic Engagement (Bonner Foundation & CIRCLE)
Integrating Social Media with Civic Engagement (Bonner Foundation & CIRCLE)Integrating Social Media with Civic Engagement (Bonner Foundation & CIRCLE)
Integrating Social Media with Civic Engagement (Bonner Foundation & CIRCLE)
Bonner Foundation
 

Similar to 1330 mon katrine york (14)

NISO Webinar: Understanding Critical Elements of E-books: Part 2: Heritage Lo...
NISO Webinar: Understanding Critical Elements of E-books: Part 2: Heritage Lo...NISO Webinar: Understanding Critical Elements of E-books: Part 2: Heritage Lo...
NISO Webinar: Understanding Critical Elements of E-books: Part 2: Heritage Lo...
 
Key note Joanna Motion
Key note Joanna MotionKey note Joanna Motion
Key note Joanna Motion
 
Laredo retail real estate
Laredo retail real estateLaredo retail real estate
Laredo retail real estate
 
Public libraries pulling rank - statistics on the policy maker's agenda
Public libraries pulling rank - statistics on the policy maker's agendaPublic libraries pulling rank - statistics on the policy maker's agenda
Public libraries pulling rank - statistics on the policy maker's agenda
 
Institutional Uses of HathiTrust
Institutional Uses of HathiTrustInstitutional Uses of HathiTrust
Institutional Uses of HathiTrust
 
Institutional Uses of HathiTrust
Institutional Uses of HathiTrustInstitutional Uses of HathiTrust
Institutional Uses of HathiTrust
 
Reaching Your Audience in the Digital Age: Key Research Trends to Watch
Reaching Your Audience in the Digital Age: Key Research Trends to WatchReaching Your Audience in the Digital Age: Key Research Trends to Watch
Reaching Your Audience in the Digital Age: Key Research Trends to Watch
 
ALA 2010 -- Jeremy York
ALA 2010 -- Jeremy YorkALA 2010 -- Jeremy York
ALA 2010 -- Jeremy York
 
RDAP13 DPN Keynote Presentation by Steve Morales
RDAP13 DPN Keynote Presentation by Steve MoralesRDAP13 DPN Keynote Presentation by Steve Morales
RDAP13 DPN Keynote Presentation by Steve Morales
 
IARSLC Presentation Bonner & CIRCLE
IARSLC Presentation Bonner & CIRCLEIARSLC Presentation Bonner & CIRCLE
IARSLC Presentation Bonner & CIRCLE
 
IMPACT Final Conference - Paul Fogel
IMPACT Final Conference - Paul FogelIMPACT Final Conference - Paul Fogel
IMPACT Final Conference - Paul Fogel
 
Affordable Learning $olutions Fair, San Jose State University
Affordable Learning $olutions Fair, San Jose State UniversityAffordable Learning $olutions Fair, San Jose State University
Affordable Learning $olutions Fair, San Jose State University
 
Audience Analysis:Role of Journalism & Social Media in the Consumption of New...
Audience Analysis:Role of Journalism & Social Media in the Consumption of New...Audience Analysis:Role of Journalism & Social Media in the Consumption of New...
Audience Analysis:Role of Journalism & Social Media in the Consumption of New...
 
Integrating Social Media with Civic Engagement (Bonner Foundation & CIRCLE)
Integrating Social Media with Civic Engagement (Bonner Foundation & CIRCLE)Integrating Social Media with Civic Engagement (Bonner Foundation & CIRCLE)
Integrating Social Media with Civic Engagement (Bonner Foundation & CIRCLE)
 

More from UKSG: connecting the knowledge community

UKSG 2024 - Demystifying AI - Evaluating future uses and limits in library co...
UKSG 2024 - Demystifying AI - Evaluating future uses and limits in library co...UKSG 2024 - Demystifying AI - Evaluating future uses and limits in library co...
UKSG 2024 - Demystifying AI - Evaluating future uses and limits in library co...
UKSG: connecting the knowledge community
 
UKSG 2024 Plenary Session 3 - There is No List: (How) Can We Combat “Predator...
UKSG 2024 Plenary Session 3 - There is No List: (How) Can We Combat “Predator...UKSG 2024 Plenary Session 3 - There is No List: (How) Can We Combat “Predator...
UKSG 2024 Plenary Session 3 - There is No List: (How) Can We Combat “Predator...
UKSG: connecting the knowledge community
 
UKSG 2024 From algorithms to empowerment by Christina Dinh Nguyen.pdf
UKSG 2024 From algorithms to empowerment by Christina Dinh Nguyen.pdfUKSG 2024 From algorithms to empowerment by Christina Dinh Nguyen.pdf
UKSG 2024 From algorithms to empowerment by Christina Dinh Nguyen.pdf
UKSG: connecting the knowledge community
 
UKSG 2024 Plenary 4 - Combining Open Access research and large language model...
UKSG 2024 Plenary 4 - Combining Open Access research and large language model...UKSG 2024 Plenary 4 - Combining Open Access research and large language model...
UKSG 2024 Plenary 4 - Combining Open Access research and large language model...
UKSG: connecting the knowledge community
 
UKSG 2024 Plenary 3 - There is No List: (How) Can We Combat “Predatory” Publi...
UKSG 2024 Plenary 3 - There is No List: (How) Can We Combat “Predatory” Publi...UKSG 2024 Plenary 3 - There is No List: (How) Can We Combat “Predatory” Publi...
UKSG 2024 Plenary 3 - There is No List: (How) Can We Combat “Predatory” Publi...
UKSG: connecting the knowledge community
 
UKSG 2024 Plenary 2 - Let's Talk About Green
UKSG 2024 Plenary 2 - Let's Talk About GreenUKSG 2024 Plenary 2 - Let's Talk About Green
UKSG 2024 Plenary 2 - Let's Talk About Green
UKSG: connecting the knowledge community
 
UKSG 2024 Plenary 2 - Are we there yet? A review of transitional agreements i...
UKSG 2024 Plenary 2 - Are we there yet? A review of transitional agreements i...UKSG 2024 Plenary 2 - Are we there yet? A review of transitional agreements i...
UKSG 2024 Plenary 2 - Are we there yet? A review of transitional agreements i...
UKSG: connecting the knowledge community
 
UKSG 2024 Plenary 2 - What did we Read, What did we Publish: Distilling the d...
UKSG 2024 Plenary 2 - What did we Read, What did we Publish: Distilling the d...UKSG 2024 Plenary 2 - What did we Read, What did we Publish: Distilling the d...
UKSG 2024 Plenary 2 - What did we Read, What did we Publish: Distilling the d...
UKSG: connecting the knowledge community
 
UKSG 2024 Lightning 2 - How GetFTR Supports Discovery and Access of OA Content
UKSG 2024 Lightning 2 - How GetFTR Supports Discovery and Access of OA ContentUKSG 2024 Lightning 2 - How GetFTR Supports Discovery and Access of OA Content
UKSG 2024 Lightning 2 - How GetFTR Supports Discovery and Access of OA Content
UKSG: connecting the knowledge community
 
UKSG 2024 Lightning 2 - Advocating for data sharing: messaging frameworks for...
UKSG 2024 Lightning 2 - Advocating for data sharing: messaging frameworks for...UKSG 2024 Lightning 2 - Advocating for data sharing: messaging frameworks for...
UKSG 2024 Lightning 2 - Advocating for data sharing: messaging frameworks for...
UKSG: connecting the knowledge community
 
UKSG 2024 Lightning 2 - All Watched Over By Machines That Love Open Research
UKSG 2024 Lightning 2 - All Watched Over By Machines That Love Open ResearchUKSG 2024 Lightning 2 - All Watched Over By Machines That Love Open Research
UKSG 2024 Lightning 2 - All Watched Over By Machines That Love Open Research
UKSG: connecting the knowledge community
 
UKSG 2024 Lightning 1 - Responding to the UN SDG Publishers Compact – Bristol...
UKSG 2024 Lightning 1 - Responding to the UN SDG Publishers Compact – Bristol...UKSG 2024 Lightning 1 - Responding to the UN SDG Publishers Compact – Bristol...
UKSG 2024 Lightning 1 - Responding to the UN SDG Publishers Compact – Bristol...
UKSG: connecting the knowledge community
 
UKSG 2024 Lightning 1 - Practical steps towards an open research culture: Bui...
UKSG 2024 Lightning 1 - Practical steps towards an open research culture: Bui...UKSG 2024 Lightning 1 - Practical steps towards an open research culture: Bui...
UKSG 2024 Lightning 1 - Practical steps towards an open research culture: Bui...
UKSG: connecting the knowledge community
 
UKSG 2024 - Open infrastructure and standards: small bodies, big impact
UKSG 2024 - Open infrastructure and standards: small bodies, big impactUKSG 2024 - Open infrastructure and standards: small bodies, big impact
UKSG 2024 - Open infrastructure and standards: small bodies, big impact
UKSG: connecting the knowledge community
 
UKSG 2024 - Reckoning or Retreat? A Longitudinal Look at DEIA in Scholarly Co...
UKSG 2024 - Reckoning or Retreat? A Longitudinal Look at DEIA in Scholarly Co...UKSG 2024 - Reckoning or Retreat? A Longitudinal Look at DEIA in Scholarly Co...
UKSG 2024 - Reckoning or Retreat? A Longitudinal Look at DEIA in Scholarly Co...
UKSG: connecting the knowledge community
 
UKSG 2024 - You don't know what you've got till it's gone: Future directions ...
UKSG 2024 - You don't know what you've got till it's gone: Future directions ...UKSG 2024 - You don't know what you've got till it's gone: Future directions ...
UKSG 2024 - You don't know what you've got till it's gone: Future directions ...
UKSG: connecting the knowledge community
 
UKSG 2024 - Vision, mission, passion: how UK University Presses collaborate t...
UKSG 2024 - Vision, mission, passion: how UK University Presses collaborate t...UKSG 2024 - Vision, mission, passion: how UK University Presses collaborate t...
UKSG 2024 - Vision, mission, passion: how UK University Presses collaborate t...
UKSG: connecting the knowledge community
 
UKSG - 2024 - Fostering an Open Research culture: ARU's Graduate Trainee Seco...
UKSG - 2024 - Fostering an Open Research culture: ARU's Graduate Trainee Seco...UKSG - 2024 - Fostering an Open Research culture: ARU's Graduate Trainee Seco...
UKSG - 2024 - Fostering an Open Research culture: ARU's Graduate Trainee Seco...
UKSG: connecting the knowledge community
 
UKSG 2024 - Creating credibility through community: Encouraging high quality ...
UKSG 2024 - Creating credibility through community: Encouraging high quality ...UKSG 2024 - Creating credibility through community: Encouraging high quality ...
UKSG 2024 - Creating credibility through community: Encouraging high quality ...
UKSG: connecting the knowledge community
 
UKSG 2024 - Author Identity Metadata: Why a Small Publisher Can Address a Maj...
UKSG 2024 - Author Identity Metadata: Why a Small Publisher Can Address a Maj...UKSG 2024 - Author Identity Metadata: Why a Small Publisher Can Address a Maj...
UKSG 2024 - Author Identity Metadata: Why a Small Publisher Can Address a Maj...
UKSG: connecting the knowledge community
 

More from UKSG: connecting the knowledge community (20)

UKSG 2024 - Demystifying AI - Evaluating future uses and limits in library co...
UKSG 2024 - Demystifying AI - Evaluating future uses and limits in library co...UKSG 2024 - Demystifying AI - Evaluating future uses and limits in library co...
UKSG 2024 - Demystifying AI - Evaluating future uses and limits in library co...
 
UKSG 2024 Plenary Session 3 - There is No List: (How) Can We Combat “Predator...
UKSG 2024 Plenary Session 3 - There is No List: (How) Can We Combat “Predator...UKSG 2024 Plenary Session 3 - There is No List: (How) Can We Combat “Predator...
UKSG 2024 Plenary Session 3 - There is No List: (How) Can We Combat “Predator...
 
UKSG 2024 From algorithms to empowerment by Christina Dinh Nguyen.pdf
UKSG 2024 From algorithms to empowerment by Christina Dinh Nguyen.pdfUKSG 2024 From algorithms to empowerment by Christina Dinh Nguyen.pdf
UKSG 2024 From algorithms to empowerment by Christina Dinh Nguyen.pdf
 
UKSG 2024 Plenary 4 - Combining Open Access research and large language model...
UKSG 2024 Plenary 4 - Combining Open Access research and large language model...UKSG 2024 Plenary 4 - Combining Open Access research and large language model...
UKSG 2024 Plenary 4 - Combining Open Access research and large language model...
 
UKSG 2024 Plenary 3 - There is No List: (How) Can We Combat “Predatory” Publi...
UKSG 2024 Plenary 3 - There is No List: (How) Can We Combat “Predatory” Publi...UKSG 2024 Plenary 3 - There is No List: (How) Can We Combat “Predatory” Publi...
UKSG 2024 Plenary 3 - There is No List: (How) Can We Combat “Predatory” Publi...
 
UKSG 2024 Plenary 2 - Let's Talk About Green
UKSG 2024 Plenary 2 - Let's Talk About GreenUKSG 2024 Plenary 2 - Let's Talk About Green
UKSG 2024 Plenary 2 - Let's Talk About Green
 
UKSG 2024 Plenary 2 - Are we there yet? A review of transitional agreements i...
UKSG 2024 Plenary 2 - Are we there yet? A review of transitional agreements i...UKSG 2024 Plenary 2 - Are we there yet? A review of transitional agreements i...
UKSG 2024 Plenary 2 - Are we there yet? A review of transitional agreements i...
 
UKSG 2024 Plenary 2 - What did we Read, What did we Publish: Distilling the d...
UKSG 2024 Plenary 2 - What did we Read, What did we Publish: Distilling the d...UKSG 2024 Plenary 2 - What did we Read, What did we Publish: Distilling the d...
UKSG 2024 Plenary 2 - What did we Read, What did we Publish: Distilling the d...
 
UKSG 2024 Lightning 2 - How GetFTR Supports Discovery and Access of OA Content
UKSG 2024 Lightning 2 - How GetFTR Supports Discovery and Access of OA ContentUKSG 2024 Lightning 2 - How GetFTR Supports Discovery and Access of OA Content
UKSG 2024 Lightning 2 - How GetFTR Supports Discovery and Access of OA Content
 
UKSG 2024 Lightning 2 - Advocating for data sharing: messaging frameworks for...
UKSG 2024 Lightning 2 - Advocating for data sharing: messaging frameworks for...UKSG 2024 Lightning 2 - Advocating for data sharing: messaging frameworks for...
UKSG 2024 Lightning 2 - Advocating for data sharing: messaging frameworks for...
 
UKSG 2024 Lightning 2 - All Watched Over By Machines That Love Open Research
UKSG 2024 Lightning 2 - All Watched Over By Machines That Love Open ResearchUKSG 2024 Lightning 2 - All Watched Over By Machines That Love Open Research
UKSG 2024 Lightning 2 - All Watched Over By Machines That Love Open Research
 
UKSG 2024 Lightning 1 - Responding to the UN SDG Publishers Compact – Bristol...
UKSG 2024 Lightning 1 - Responding to the UN SDG Publishers Compact – Bristol...UKSG 2024 Lightning 1 - Responding to the UN SDG Publishers Compact – Bristol...
UKSG 2024 Lightning 1 - Responding to the UN SDG Publishers Compact – Bristol...
 
UKSG 2024 Lightning 1 - Practical steps towards an open research culture: Bui...
UKSG 2024 Lightning 1 - Practical steps towards an open research culture: Bui...UKSG 2024 Lightning 1 - Practical steps towards an open research culture: Bui...
UKSG 2024 Lightning 1 - Practical steps towards an open research culture: Bui...
 
UKSG 2024 - Open infrastructure and standards: small bodies, big impact
UKSG 2024 - Open infrastructure and standards: small bodies, big impactUKSG 2024 - Open infrastructure and standards: small bodies, big impact
UKSG 2024 - Open infrastructure and standards: small bodies, big impact
 
UKSG 2024 - Reckoning or Retreat? A Longitudinal Look at DEIA in Scholarly Co...
UKSG 2024 - Reckoning or Retreat? A Longitudinal Look at DEIA in Scholarly Co...UKSG 2024 - Reckoning or Retreat? A Longitudinal Look at DEIA in Scholarly Co...
UKSG 2024 - Reckoning or Retreat? A Longitudinal Look at DEIA in Scholarly Co...
 
UKSG 2024 - You don't know what you've got till it's gone: Future directions ...
UKSG 2024 - You don't know what you've got till it's gone: Future directions ...UKSG 2024 - You don't know what you've got till it's gone: Future directions ...
UKSG 2024 - You don't know what you've got till it's gone: Future directions ...
 
UKSG 2024 - Vision, mission, passion: how UK University Presses collaborate t...
UKSG 2024 - Vision, mission, passion: how UK University Presses collaborate t...UKSG 2024 - Vision, mission, passion: how UK University Presses collaborate t...
UKSG 2024 - Vision, mission, passion: how UK University Presses collaborate t...
 
UKSG - 2024 - Fostering an Open Research culture: ARU's Graduate Trainee Seco...
UKSG - 2024 - Fostering an Open Research culture: ARU's Graduate Trainee Seco...UKSG - 2024 - Fostering an Open Research culture: ARU's Graduate Trainee Seco...
UKSG - 2024 - Fostering an Open Research culture: ARU's Graduate Trainee Seco...
 
UKSG 2024 - Creating credibility through community: Encouraging high quality ...
UKSG 2024 - Creating credibility through community: Encouraging high quality ...UKSG 2024 - Creating credibility through community: Encouraging high quality ...
UKSG 2024 - Creating credibility through community: Encouraging high quality ...
 
UKSG 2024 - Author Identity Metadata: Why a Small Publisher Can Address a Maj...
UKSG 2024 - Author Identity Metadata: Why a Small Publisher Can Address a Maj...UKSG 2024 - Author Identity Metadata: Why a Small Publisher Can Address a Maj...
UKSG 2024 - Author Identity Metadata: Why a Small Publisher Can Address a Maj...
 

Recently uploaded

Lapbook sobre os Regimes Totalitários.pdf
Lapbook sobre os Regimes Totalitários.pdfLapbook sobre os Regimes Totalitários.pdf
Lapbook sobre os Regimes Totalitários.pdf
Jean Carlos Nunes Paixão
 
Biological Screening of Herbal Drugs in detailed.
Biological Screening of Herbal Drugs in detailed.Biological Screening of Herbal Drugs in detailed.
Biological Screening of Herbal Drugs in detailed.
Ashokrao Mane college of Pharmacy Peth-Vadgaon
 
How libraries can support authors with open access requirements for UKRI fund...
How libraries can support authors with open access requirements for UKRI fund...How libraries can support authors with open access requirements for UKRI fund...
How libraries can support authors with open access requirements for UKRI fund...
Jisc
 
Additional Benefits for Employee Website.pdf
Additional Benefits for Employee Website.pdfAdditional Benefits for Employee Website.pdf
Additional Benefits for Employee Website.pdf
joachimlavalley1
 
The Challenger.pdf DNHS Official Publication
The Challenger.pdf DNHS Official PublicationThe Challenger.pdf DNHS Official Publication
The Challenger.pdf DNHS Official Publication
Delapenabediema
 
The geography of Taylor Swift - some ideas
The geography of Taylor Swift - some ideasThe geography of Taylor Swift - some ideas
The geography of Taylor Swift - some ideas
GeoBlogs
 
Guidance_and_Counselling.pdf B.Ed. 4th Semester
Guidance_and_Counselling.pdf B.Ed. 4th SemesterGuidance_and_Counselling.pdf B.Ed. 4th Semester
Guidance_and_Counselling.pdf B.Ed. 4th Semester
Atul Kumar Singh
 
Unit 8 - Information and Communication Technology (Paper I).pdf
Unit 8 - Information and Communication Technology (Paper I).pdfUnit 8 - Information and Communication Technology (Paper I).pdf
Unit 8 - Information and Communication Technology (Paper I).pdf
Thiyagu K
 
TESDA TM1 REVIEWER FOR NATIONAL ASSESSMENT WRITTEN AND ORAL QUESTIONS WITH A...
TESDA TM1 REVIEWER  FOR NATIONAL ASSESSMENT WRITTEN AND ORAL QUESTIONS WITH A...TESDA TM1 REVIEWER  FOR NATIONAL ASSESSMENT WRITTEN AND ORAL QUESTIONS WITH A...
TESDA TM1 REVIEWER FOR NATIONAL ASSESSMENT WRITTEN AND ORAL QUESTIONS WITH A...
EugeneSaldivar
 
678020731-Sumas-y-Restas-Para-Colorear.pdf
678020731-Sumas-y-Restas-Para-Colorear.pdf678020731-Sumas-y-Restas-Para-Colorear.pdf
678020731-Sumas-y-Restas-Para-Colorear.pdf
CarlosHernanMontoyab2
 
BÀI TẬP BỔ TRỢ TIẾNG ANH GLOBAL SUCCESS LỚP 3 - CẢ NĂM (CÓ FILE NGHE VÀ ĐÁP Á...
BÀI TẬP BỔ TRỢ TIẾNG ANH GLOBAL SUCCESS LỚP 3 - CẢ NĂM (CÓ FILE NGHE VÀ ĐÁP Á...BÀI TẬP BỔ TRỢ TIẾNG ANH GLOBAL SUCCESS LỚP 3 - CẢ NĂM (CÓ FILE NGHE VÀ ĐÁP Á...
BÀI TẬP BỔ TRỢ TIẾNG ANH GLOBAL SUCCESS LỚP 3 - CẢ NĂM (CÓ FILE NGHE VÀ ĐÁP Á...
Nguyen Thanh Tu Collection
 
Synthetic Fiber Construction in lab .pptx
Synthetic Fiber Construction in lab .pptxSynthetic Fiber Construction in lab .pptx
Synthetic Fiber Construction in lab .pptx
Pavel ( NSTU)
 
2024.06.01 Introducing a competency framework for languag learning materials ...
2024.06.01 Introducing a competency framework for languag learning materials ...2024.06.01 Introducing a competency framework for languag learning materials ...
2024.06.01 Introducing a competency framework for languag learning materials ...
Sandy Millin
 
special B.ed 2nd year old paper_20240531.pdf
special B.ed 2nd year old paper_20240531.pdfspecial B.ed 2nd year old paper_20240531.pdf
special B.ed 2nd year old paper_20240531.pdf
Special education needs
 
Phrasal Verbs.XXXXXXXXXXXXXXXXXXXXXXXXXX
Phrasal Verbs.XXXXXXXXXXXXXXXXXXXXXXXXXXPhrasal Verbs.XXXXXXXXXXXXXXXXXXXXXXXXXX
Phrasal Verbs.XXXXXXXXXXXXXXXXXXXXXXXXXX
MIRIAMSALINAS13
 
Polish students' mobility in the Czech Republic
Polish students' mobility in the Czech RepublicPolish students' mobility in the Czech Republic
Polish students' mobility in the Czech Republic
Anna Sz.
 
The approach at University of Liverpool.pptx
The approach at University of Liverpool.pptxThe approach at University of Liverpool.pptx
The approach at University of Liverpool.pptx
Jisc
 
The basics of sentences session 5pptx.pptx
The basics of sentences session 5pptx.pptxThe basics of sentences session 5pptx.pptx
The basics of sentences session 5pptx.pptx
heathfieldcps1
 
CLASS 11 CBSE B.St Project AIDS TO TRADE - INSURANCE
CLASS 11 CBSE B.St Project AIDS TO TRADE - INSURANCECLASS 11 CBSE B.St Project AIDS TO TRADE - INSURANCE
CLASS 11 CBSE B.St Project AIDS TO TRADE - INSURANCE
BhavyaRajput3
 
Operation Blue Star - Saka Neela Tara
Operation Blue Star   -  Saka Neela TaraOperation Blue Star   -  Saka Neela Tara
Operation Blue Star - Saka Neela Tara
Balvir Singh
 

Recently uploaded (20)

Lapbook sobre os Regimes Totalitários.pdf
Lapbook sobre os Regimes Totalitários.pdfLapbook sobre os Regimes Totalitários.pdf
Lapbook sobre os Regimes Totalitários.pdf
 
Biological Screening of Herbal Drugs in detailed.
Biological Screening of Herbal Drugs in detailed.Biological Screening of Herbal Drugs in detailed.
Biological Screening of Herbal Drugs in detailed.
 
How libraries can support authors with open access requirements for UKRI fund...
How libraries can support authors with open access requirements for UKRI fund...How libraries can support authors with open access requirements for UKRI fund...
How libraries can support authors with open access requirements for UKRI fund...
 
Additional Benefits for Employee Website.pdf
Additional Benefits for Employee Website.pdfAdditional Benefits for Employee Website.pdf
Additional Benefits for Employee Website.pdf
 
The Challenger.pdf DNHS Official Publication
The Challenger.pdf DNHS Official PublicationThe Challenger.pdf DNHS Official Publication
The Challenger.pdf DNHS Official Publication
 
The geography of Taylor Swift - some ideas
The geography of Taylor Swift - some ideasThe geography of Taylor Swift - some ideas
The geography of Taylor Swift - some ideas
 
Guidance_and_Counselling.pdf B.Ed. 4th Semester
Guidance_and_Counselling.pdf B.Ed. 4th SemesterGuidance_and_Counselling.pdf B.Ed. 4th Semester
Guidance_and_Counselling.pdf B.Ed. 4th Semester
 
Unit 8 - Information and Communication Technology (Paper I).pdf
Unit 8 - Information and Communication Technology (Paper I).pdfUnit 8 - Information and Communication Technology (Paper I).pdf
Unit 8 - Information and Communication Technology (Paper I).pdf
 
TESDA TM1 REVIEWER FOR NATIONAL ASSESSMENT WRITTEN AND ORAL QUESTIONS WITH A...
TESDA TM1 REVIEWER  FOR NATIONAL ASSESSMENT WRITTEN AND ORAL QUESTIONS WITH A...TESDA TM1 REVIEWER  FOR NATIONAL ASSESSMENT WRITTEN AND ORAL QUESTIONS WITH A...
TESDA TM1 REVIEWER FOR NATIONAL ASSESSMENT WRITTEN AND ORAL QUESTIONS WITH A...
 
678020731-Sumas-y-Restas-Para-Colorear.pdf
678020731-Sumas-y-Restas-Para-Colorear.pdf678020731-Sumas-y-Restas-Para-Colorear.pdf
678020731-Sumas-y-Restas-Para-Colorear.pdf
 
BÀI TẬP BỔ TRỢ TIẾNG ANH GLOBAL SUCCESS LỚP 3 - CẢ NĂM (CÓ FILE NGHE VÀ ĐÁP Á...
BÀI TẬP BỔ TRỢ TIẾNG ANH GLOBAL SUCCESS LỚP 3 - CẢ NĂM (CÓ FILE NGHE VÀ ĐÁP Á...BÀI TẬP BỔ TRỢ TIẾNG ANH GLOBAL SUCCESS LỚP 3 - CẢ NĂM (CÓ FILE NGHE VÀ ĐÁP Á...
BÀI TẬP BỔ TRỢ TIẾNG ANH GLOBAL SUCCESS LỚP 3 - CẢ NĂM (CÓ FILE NGHE VÀ ĐÁP Á...
 
Synthetic Fiber Construction in lab .pptx
Synthetic Fiber Construction in lab .pptxSynthetic Fiber Construction in lab .pptx
Synthetic Fiber Construction in lab .pptx
 
2024.06.01 Introducing a competency framework for languag learning materials ...
2024.06.01 Introducing a competency framework for languag learning materials ...2024.06.01 Introducing a competency framework for languag learning materials ...
2024.06.01 Introducing a competency framework for languag learning materials ...
 
special B.ed 2nd year old paper_20240531.pdf
special B.ed 2nd year old paper_20240531.pdfspecial B.ed 2nd year old paper_20240531.pdf
special B.ed 2nd year old paper_20240531.pdf
 
Phrasal Verbs.XXXXXXXXXXXXXXXXXXXXXXXXXX
Phrasal Verbs.XXXXXXXXXXXXXXXXXXXXXXXXXXPhrasal Verbs.XXXXXXXXXXXXXXXXXXXXXXXXXX
Phrasal Verbs.XXXXXXXXXXXXXXXXXXXXXXXXXX
 
Polish students' mobility in the Czech Republic
Polish students' mobility in the Czech RepublicPolish students' mobility in the Czech Republic
Polish students' mobility in the Czech Republic
 
The approach at University of Liverpool.pptx
The approach at University of Liverpool.pptxThe approach at University of Liverpool.pptx
The approach at University of Liverpool.pptx
 
The basics of sentences session 5pptx.pptx
The basics of sentences session 5pptx.pptxThe basics of sentences session 5pptx.pptx
The basics of sentences session 5pptx.pptx
 
CLASS 11 CBSE B.St Project AIDS TO TRADE - INSURANCE
CLASS 11 CBSE B.St Project AIDS TO TRADE - INSURANCECLASS 11 CBSE B.St Project AIDS TO TRADE - INSURANCE
CLASS 11 CBSE B.St Project AIDS TO TRADE - INSURANCE
 
Operation Blue Star - Saka Neela Tara
Operation Blue Star   -  Saka Neela TaraOperation Blue Star   -  Saka Neela Tara
Operation Blue Star - Saka Neela Tara
 

1330 mon katrine york

  • 1. HATHITRUST A Shared Digital Repository HathiTrust: Aspiring to Build the Universal Library UKSG Annual Conference March 26-28, 2012 Jeremy York, Project Librarian, HathiTrust
  • 2. Partnership Arizona State University North Carolina State University of Connecticut Baylor University University University of Florida Boston College Northwestern University University of Illinois Boston University The Ohio State University University of Illinois at Chicago California Digital Library The Pennsylvania State The University of Iowa Columbia University University Princeton University University of Maryland Cornell University Purdue University University of Miami Dartmouth College Duke University Stanford University University of Michigan Emory University Texas A&M University University of Minnesota Florida State University Universidad Complutense University of Missouri Getty Research Institute de Madrid University of Nebraska-Lincoln Harvard University Library University of Arizona The University of North Indiana University University of Calgary Carolina at Chapel Hill Johns Hopkins University University of California University of Notre Dame Lafayette College Berkeley Davis University of Pennsylvania Library of Congress Irvine University of Pittsburgh Massachusetts Institute of Technology Los Angeles University of Utah McGill University` Merced University of Virginia Michigan State University Riverside University of Washington New York Public Library San Diego University of Wisconsin- New York University San Francisco Madison North Carolina Central Santa Barbara Utah State University University Santa Cruz Washington University The University of Chicago Yale University Library
  • 3. Digital Repository • Launched 2008 • Initial focus on digitized book and journal content – 10,109,919 total volumes – 5,372,755 book titles – 266,540 serial titles – 2,802,347 public domain (~28%)
  • 4. The Name • The meaning behind the name – Hathi (hah-tee)--Hindi for elephant – Big, strong – Never forgets, wise – Secure – Trustworthy
  • 5. Mission • To contribute to the common good by collecting, organizing, preserving, communicating, and sharing the record of human knowledge
  • 6. HathiTrust Universal Library Common Goal Single Entity, Many Partners
  • 7. Collections and Collaboration • Comprehensive collection - Preservation…with Access • Shared strategies – Copyright – Collection management, development – Preservation – Discovery / Use – Bibliographic Indeterminacy – Efficient user services • Public Good
  • 8. Content Distribution U.S. Federal Government Documents (worldwide) 4% Public Domain 72% "Public Domain" Public Domain (US) 28% (worldwide) 10% 14% Open Access .1% Creative Commons .01%
  • 9. Content Sources LC Minnesota 1% 1% Yale UNC-Chapel Hill Harvard Madrid 0% Virginia 0% Utah State Indiana 1% 1% 0% 0% Chicago 2% NCSU 0% Columbia NorthwesternDuke 1% 0% 0% Princeton 0% Illinois Purdue Penn State 3% 0% NYPL 0% 0% Cornell 3% Wisconsin 4% 5% Michigan 45% California 33%
  • 10.
  • 11. Dates 1500-1599 1600-1699 1800-1849 0% 1900-1909 0% 3% 4% 0-1500 2000-2009 1700-1799 1850-1899 0% 10% 8% 1% 1910-1919 4% 1990-1999 1920-1929 4% 14% 1930-1939 4% 1980-1989 15% 1940-1949 1960-1969 1970-1979 4% 11% 13% 1950-1959 6%
  • 12. Language Distribution (1) The top 10 languages make up Remaining ~86% of all content Languages Arabic Latin 14% Italian 2% 1% 3% Japanese 3% English Russian 48% 4% Chinese German 4% 9% Spanish 5% French 7%
  • 13. Language Distribution (2) Bulgarian ArmenianAncient-Greek Panjabi Catalan Malayalam 1% 1% 1% 1% 1% 1% Multiple The next 40 Sanskrit 1% 2% Ukrainian Serbian Marathi Malay Undetermined languages make 1% 1%Romanian Telugu 1% 1% Finnish 7% up ~13% of total Slovak Vietnamese Greek 1% 1% 1%1% Polish Hungarian 1% 7% 1% 2% Portuguese Norwegian Dutch 7% 2% 5% Music 2% Bengali Tamil 2% Hebrew 2% 5% Persian Hindi 2% 5% Unknown Czech Indonesian 3% 3% Thai Korean Turkish Urdu 4% Danish 3% Swedish 4% Croatian 3% 3% 3% 3% 2%
  • 14. Preservation with Access • Cost effective preservation and access services • Preservation – TRAC-certified – Robust infrastructure – Long-term commitments on digital content facilitate planning, decision-making
  • 15. Executive Committee Strategic Advisory Board Budget/Finances Decision-making Guidance on Policy, Planning Collective Work: Working Groups and Committees Operational Operational Strategic •• Communications Communications • Collections •• User Support User Support • Discovery Interface •• User Experience User Experience • Full-text Search Distributed work • Driven by needs of institutions • Leverage across the partnership • Projects, Grant Work, Ingest Specifications, PageTurner, Bibliographic Data Management HathiTrust
  • 16. Bibliographic Enterprise Repository Repository Rights Collection Governance Data Management Administration Administration Management Development Management Communication Data management Digital Budget, Finances Hardware Copyright Entity description and Coordination (content • Expansion beyond configuration and determination (record-level) with partner storage, backup, in books and journals institutions maintenance (born- Decision-making tegrity digital, images and checks, deletion) Object maps, audio) Project Copyright review identification • Selection of Policy management Web and (item-level) content (for non- application server Google volume configuration and Hardware selection ingest and pilots Copyright projects) maintenance and replacement information Data availability Planning management Print (database) • Cloud Library (effect Security of digital on print) Content and Metadata specifications Rightsholder permissions Permissions Disaster Recovery Logging Processes for ensuring content integrity Quality e-Commerce Content Ingest Content Access User Services Outreach Legal Assurance Transformation PageTurner Quality Review Risk management Print on Demand Usability Project website (use of materials) Validation Collection Builder Content User support Partner Certification Monthly agreements (helpdesk) newsletter Large-scale Search Advocacy Papers and Financial presentations contributions Research Center HathiTrust Functional Communication of partners Bibliographic Framework with potential partners Catalog Surveys, general APIs inquiries Repository evaluation and audit (e.g., DRAMBORA, TRAC)
  • 17. Constitutional Convention • October 2011 • 52 partners • 3-year review overseen by SAB • Ballot Proposals – Print monograph storage – Approval Process for development initiatives – U.S. Government Documents – Fee-for-service content deposit – Governance
  • 18. Emerging Governance • 12-member Board of Governors – 3-member Executive Committee – Executive Director • 6 seats to founding institutions – 2 California, 2 CIC (minus Indiana and Michigan) – 1 Indiana, 1 Michigan • Voting (March 1 – March 15) • Announcement of Results March 30 • Begin work April 16, 2012
  • 19. Preservation with Access • Cost effective preservation and access services • Preservation – TRAC-certified – Robust infrastructure – Long-term commitments on digital content facilitate planning, decision-making
  • 20. Preservation with Access (2) • Discovery – Bibliographic and full-text search of all materials – Extended discovery (ProQuest, EBSCO, OCLC, Ex Libris) – Mechanisms for local loading of records
  • 21.
  • 22.
  • 23.
  • 24. Preservation with Access (3) • Access and Use – Public domain and open access works – Full download of materials where possible* – Print on demand – Collections and APIs – Research Center* – Lawful uses of in-copyright works*
  • 25. Lawful uses • Access to users who have print disabilities • Section 108 uses of materials • Access to orphan works
  • 26. Terms of Access • Available to students, faculty, staff of partnering institutions – On library premises or authenticated into HathiTrust • Partner libraries own a print copy – One simultaneous user per print copy owned • Users must be on U.S. soil • One page at a time download
  • 27. How do we facilitate uses? • Fundamental issues of – Identification – Description – Rights
  • 28. Approach • Collective problems as collective • Web of relationships Rights Records Digital Volumes Libraries Print Volumes
  • 29. Bibliographic Data • Normalization of bibliographic data – University of Michigan • Efficiency – California Digital Library
  • 30. Copyright • Bibliographic metadata • Automatic and manual rights determination
  • 31. Automatic Rights Determination • Conducted on all works at time of ingest and when records are modified – Public domain worldwide • US works published before 1923, US federal government publications, non-US works published prior to 1872 – Public domain in the United States • Non-US works published prior to 1923
  • 32. Manual Rights Determination • IMLS-funded CRMS project – US-published works 1923-1963 – Conformance with formalities – Expanding to non-US works – Double-blind review with expert review for conflicts – Staff at 4 HathiTrust partner institutions (15 will take part in non-US) – As of February 2012 ~190,000 reviewed, more than 100,000 opened • Rights Holder Permissions
  • 33. Breakdown of HathiTrust book corpus by publication date Bibliographic Indeterminacy and the Scale of Problems and Opportunities of "Rights" in Digital Collection Building – 2/2011
  • 34. Breakdown of HathiTrust book corpus by publication date
  • 35. Copyright status of books published pre-1923 and US works published 1923-1963
  • 36. Copyright status of books published pre-1923 and US works published 1923-1963 Pre-1872 ~ 5%
  • 37. Copyright status of books published pre-1923 and US works published 1923-1963 Pre-1872 ~ 5% Public Domain in the US
  • 38. Copyright status of books published pre-1923 and US works published 1923-1963 ? Pre-1872 ~ 5% Public Domain in the US
  • 39. Copyright status of books published pre-1923 and US works published 1923-1963
  • 40. Copyright status of books published pre-1923 and US works published 1923-1963 In Print ?
  • 42. A global change in the library environment 60% Academic print book collection already substantially 50% duplicated in mass digitized book corpus June 2010 % of Titles in Local Collection 40% Median duplication: 31% 30% 20% 10% June 2009 Median duplication: 19% 0% 0 20 40 60 80 100 120 Rank in 2008 ARL Investment Index
  • 43. Digitized Books in Shared Repositories ~3.5M titles 3,500,000 ~75% of mass digitized corpus is ‘backed up’ in one or more shared print repositories 3,000,000 ~2.5M 2,500,000 Unique Titles 2,000,000 1,500,000 1,000,000 500,000 0 Sep-09 Oct-09 Nov-09 Dec-09 Jan-10 Feb-10 Mar-10 Apr-10 May-10 Jun-10 Mass digitized books in Hathi digital repository Mass digitized books in shared print repositories
  • 44. Collection Management, Development • Overlap – More than 50% median overlap with ARL institutions; higher for small liberal arts colleges • Pricing model based on Print holdings – Requires print holdings database – Also support expansion of legal uses, efforts in de- duplication – Facilitate individual and collaborative collection development and management operations • Print monographs archiving
  • 45.
  • 46.
  • 47.
  • 48.
  • 49.
  • 50.
  • 51.
  • 52.
  • 53.
  • 54.
  • 55.
  • 56.
  • 57.
  • 58.
  • 59.
  • 60.
  • 61.
  • 62.
  • 63. Collection Management, Development • Discovery (OCLC) • Collections Committee
  • 64.
  • 65. Comprehensive Picture • “Definitional Issues” – Identification, Description, Rights • Discovery and Use – Finding – Relating (APIs and integration) – Using (Reading, Computational activities) • Collection management, development • Preservation infrastructure – Digital and Print – Relationships
  • 66. Work going forward • Definitional elements • Print archiving, management • Discovery and use – Lawful uses • Research Center • Quality • Government documents • Beyond books and journals • Publishing • Transitioning to next phase of partnership
  • 67. How to find out more • Web site “About” section • http://www.hathitrust.org/about • HathiTrust Research Center • http://www.hathitrust.org/htrc • Twitter • http://twitter.com/hathitrust • Monthly newsletter • http://www.hathitrust.org/updates • RSS: http://www.hathitrust.org/updates_rss • Contact us: feedback@issues.hathitrust.org • Blogs: http://www.hathitrust.org/blogs • Large-scale search • Perspectives from HathiTrust
  • 68. Thank you very much!