Using digitized historical newspapers for genealogical research
Brian Geiger, California Digital Newspaper Collection
Frederick Zarndt, IFLA Governing Board
1. Introductory remarks: Who we are. Focus on freely available collections, especially those that allow researchers to create accounts. There are numerous sites researchers can pay to access, but we won’t spend much time on them
2. Only a small percentage of surviving newspapers has been digitized
3. How newspapers are digitized. Focus especially on OCR (optical character recognition): if a page is not OCR’ed well, it is not discoverable
4. How Coronado newspapers were digitized. CDNC’s work with the public library, Coronado Public Library’s work with the publisher, the process of scanning the film and processing the images, etc.
5. Free vs. Pay. Two kinds of digitized newspaper archives: 1) publicly funded and available for free, 2) commercial sites you pay to access. There are dozens or even hundreds of public sites, from small institutional to national.
6. Google won’t always get you what you want
7. Basic search using Elephind: What Elephind is. Search “Abraham Lincoln” and explain what they see. Describe “facets”
8. CDNC advanced search
9. Collecting What You Find: Right-click features in the CDNC
10. Collecting What You Find: CDNC user accounts
11. Interacting with Content: CDNC
12. Interacting with Content: Tagging and commenting in CDNC
2008-present: Local Institutions
§ Produced over 400,000 pages
§ Partnering institutions include:
  § Sausalito Public Library
  § Coronado Public Library
  § Santa Monica Community College
  § Palm Springs Public Library
  § Tehama Public Library
  § Occidental College
  § Healdsburg Museum & Historical Society
  § San Bernardino County Historical Archives
  § Port of Los Angeles
  § Barbro Osher Pro Suecia Foundation
  § Madera Public Library
  § Woodside High School
  § Los Medanos Community College
Partnerships
Ancestry.com/newspapers.com
Overview
§ Ancestry digitizes film at their cost
§ Adheres to CDNC digitization standards
§ Provides copies of all data to CDNC
§ Available immediately at newspapers.com for a fee
§ After 3-year embargo, available for free to everyone at CDNC
§ By end of 2015, embargoed data searchable but not viewable in CDNC
§ Interested primarily in long runs, late 19th to early 21st century
§ To date, has digitized San Bernardino Sun, Santa Cruz Sentinel, and Oakland Tribune
Local Institution
§ Clears copyright by researching or working with copyright holder
§ Gets free access to titles at newspapers.com during embargo period