Converging on the Universal Library: From Memex to Googolplex Martin R. Kalfatovic Smithsonian Institution Libraries
Vast, But Not Infinite
Vast, But Not Infinite <ul><li>100 characters (Western European languages, plus spaces and some punctuation) </li></ul><ul...
Vast, But Not Infinite dsoydrebnefdsellnhvaaipevls oadisaedhoholnarmea dlhecrositnsoritln gdustoaeaeb ossibrseltlmlie ey f...
Vast, But Not Infinite “ I assumed you packed the library in 1,000 volume boxes, each box having a capacity of precisely o...
Vast, But Not Infinite The sum of our collections, libraries, archives, and museums is Vast, but by most practical – and e...
But Is It Google-able? <ul><li>“ Each book … contains a progressively smaller fraction of all that is known”  – Alvin Toff...
But Is It Google-able? <ul><li>32 million published books </li></ul><ul><li>750 million articles and essays </li></ul><ul>...
But Is It Google-able? <ul><li>Compressed (at today’s standards) this would be about 50 petabytes (about the size of a sma...
The Memex In 1945, Vannavar Bush, Director of the Office of Scientific Research and Development, outlined the ultimate too...
The Memex
The Memex Memex images from  MouseSite (Stanford University)
The Memex
Filling the Memex: Digitization Projects “ As  an  agent of change, printing altered methods of data collections, storage ...
<ul><li>Million Book Project </li></ul><ul><li>Amazon: Search Inside the Book </li></ul><ul><li>Google Book Search </li></...
Filling the Memex: Digitization Projects Million Book Project
Filling the Memex: Digitization Projects Amazon: Search Inside the Book
Filling the Memex: Digitization Projects Google Book Search
Filling the Memex: Digitization Projects Open Content Alliance
Filling the Memex: Digitization Projects Biodiversity Heritage Library
<ul><ul><li>Fringe works will receive use (the “long tail”) </li></ul></ul><ul><ul><li>Create a deepened sense of history ...
Future Shock? Or Death by Snippets! “ This is … a pretty grisly scenario … Books traditionally have edges: some are rough-...
Future Shock? Or Death by Snippets! “ Defenders of the book often stress the ‘pleasure of handling books’ as a reason for ...
Future Shock? Or Death by Snippets! “ All books are divisible into two classes, the books of the hour, and the books of al...
<ul><li>Index Animalium: Sherborn’s  Index Animalium  is a compendium of zoological taxonomic species nomenclature from 17...
Future Shock? Or Death by Snippets!
Memory Institutions
Memory Institutions Each object in the Museum …  would have been associated with a book (or several books) in the Library....
Memory Institutions The fact that these books greatly outnumber those whose function is to catalogue the exhibits next doo...
Memory Institutions: Archives Unlike libraries, where the objects, books, can have their content transformed to other medi...
Memory Institutions: Archives …  a manuscript letter, a mimeographed memo, a diary page, loses some of its being when tran...
Memory Institutions: Archives At the same time, digital project allow for the creation of digital spaces where various arc...
Memory Institutions: Archives
Memory Institutions: Archives
<ul><li>Art works: Stephen Weil calculated (in 1990) that in the U.S. alone there were over 8 million art works created: <...
Memory Institutions: Museums
Memory Institutions: Museums
Memory Institutions: Museums
Memory Institutions: Museums 26,000,000 21,000,000 5,000,000 Harvard Univ. Herbarium/Mus. Comp. Zoo. 30,000,000 21,000,000...
Memory Institutions: Museums Plant Collection Institution 5,219,216 Missouri Botanical Garden 5,500,000 Conservatoire et J...
Total Specimens in top 17 collecting museums and botanical gardens world-wide: 384,166,516 Memory Institutions: Museums Bi...
What Do You Do With a Million …? <ul><li>Books? </li></ul><ul><li>Linear feet of archival material? </li></ul><ul><li>Phot...
What Do You Do With a Million …? <ul><li>Visited a museum in the last year? </li></ul><ul><li>Visited their public library...
What Do You Do With a Million …? One of the key things we need to remember is that whatever we digitize and make available...
What Do You Do With a Million …?
 
Factors: Wetware
<ul><li>Decreased storage costs: floppy discs (c. 1985) vs. USB drive (2006) </li></ul>Factors: Hardware
<ul><li>Cheap Scanning </li></ul><ul><ul><li>Internet Archive Scribe </li></ul></ul><ul><ul><li>Kirtas APT 2400 Scanner </...
<ul><li>Cheap Print on Demand: Internet Archive Bookmobile </li></ul>Factors: Hardware
<ul><li>One Laptop Per Child </li></ul>Factors: Hardware
Factors: Networks <ul><li>Net Neutrality: “The most potent force shaping the future of the Internet is neither Mountain Vi...
Factors: Networks <ul><li>Real Broadband </li></ul><ul><ul><li>The U.S. ranked 12th among industrialized nations, with 16....
Factors: Networks Broadband over Power Line (BPL) is a new technology that offers a potential competitor to the cable and ...
Factors: Networks <ul><li>Transparency of the Network  </li></ul><ul><ul><li>Everything REALLY is on the net </li></ul></u...
Factors: Digital Preservation <ul><li>Save the Bytes! </li></ul><ul><li>Save the Content! </li></ul><ul><li>Call me when y...
Factors: Intellectual Property U.S. Constitution Article I, Section 8, Clause 8: To promote the Progress of Science and us...
Factors: Intellectual Property “ History belongs to everybody. It shouldn’t be locked away in dark rooms,” says Michael Ed...
“ The vindication lies in the finding ... Now every volume lies instantly within our grasp, and we possess a far greater u...
Conclusion? “ The world has arrived at an age of cheap complex devices of great reliability; and something is bound to com...
10 Words/Phrases I Did Not Use In This Presentation <ul><li>Cyber- </li></ul><ul><li>“ Ceci tuera cela” </li></ul><ul><li>...
Converging on the Universal Library: From Memex to Googolplex Martin R. Kalfatovic Smithsonian Institution Libraries [emai...
Upcoming SlideShare
Loading in …5
×

Converging on the Universal Library: From Memex to Googolplex

1,166 views

Published on

Presentation at the 40th COLT conference, Washington, DC (June 21, 2007)

0 Comments
2 Likes
Statistics
Notes
  • Be the first to comment

No Downloads
Views
Total views
1,166
On SlideShare
0
From Embeds
0
Number of Embeds
34
Actions
Shares
0
Downloads
29
Comments
0
Likes
2
Embeds 0
No embeds

No notes for slide
  • Converging on the Universal Library: From Memex to Googolplex

    1. 1. Converging on the Universal Library: From Memex to Googolplex Martin R. Kalfatovic Smithsonian Institution Libraries
    2. 2. Vast, But Not Infinite
    3. 3. Vast, But Not Infinite <ul><li>100 characters (Western European languages, plus spaces and some punctuation) </li></ul><ul><li>Each line has 50 spaces </li></ul><ul><li>Each page is 40 lines long </li></ul><ul><li>Each book is 500 pages long </li></ul><ul><li>Total Books: 100 1,000,000 </li></ul><ul><li>Googolplex: 1 followed by a googol (10 100 ) zeros </li></ul>
    4. 4. Vast, But Not Infinite dsoydrebnefdsellnhvaaipevls oadisaedhoholnarmea dlhecrositnsoritln gdustoaeaeb ossibrseltlmlie ey f tkstmdd eoiicai ytjtymniemruincrerlrc kym sdgklemy ecc poslr nnpz lrmpdcgeebf tetlealnm ceekctn sa oseesese erlsede c rotoire nhndlidd irb nwsrpeedsrc tey a lbneeu epon e cv mm tjn tarreohaaosimnicuse lvyubsu gurec egrll gnruveielrreuie w izbegii tckrne aseieoogrmia hmiuhtydaesdat uketwe te mnha egs foueirr tirstnnyriytuodnar inaon itaonlaai osn ral fd nurroysceo rteepcrn nrswrdstlr dt beeod sracpope Oh tiempo tus pirámides irhesrksesan ueada cte osamnec u gn bri seo aaircdgir kgielddaltneasfsdi n rwtlira etgtu erdoi nnte poycd rmogn t rdaebodpc ha tcl asrlceao usc tiosdrbaesci retiigvnooneeoe e m a b y fcoiihlha dcguo cdiem o pnz aitfehn sigtssipc isa seemue edk aerbgelgninbe n d aboaarlsdpd ele felr
    5. 5. Vast, But Not Infinite “ I assumed you packed the library in 1,000 volume boxes, each box having a capacity of precisely one cubic meter. All space to the farthest known spiral galaxies would not hold the Universal Library. In fact, you would need this volume of space so often that the number of packed universes would be a figure with only some 60 zeros less than the figure for the number of volumes… The figure is not infinite, it is a finite figure.” - Kurd Lasswitz, “The Universal Library.” 1901
    6. 6. Vast, But Not Infinite The sum of our collections, libraries, archives, and museums is Vast, but by most practical – and even impractical counting methodologies – it is finite. Vast, but Finite!
    7. 7. But Is It Google-able? <ul><li>“ Each book … contains a progressively smaller fraction of all that is known” – Alvin Toffler, 1970 </li></ul><ul><li>125,000 new blogs appear DAILY </li></ul>
    8. 8. But Is It Google-able? <ul><li>32 million published books </li></ul><ul><li>750 million articles and essays </li></ul><ul><li>25 million songs </li></ul><ul><li>500 million images </li></ul><ul><li>500,000 movies </li></ul><ul><li>3 million videos, TV shows and short films </li></ul><ul><li>100 billion web pages </li></ul>
    9. 9. But Is It Google-able? <ul><li>Compressed (at today’s standards) this would be about 50 petabytes (about the size of a small-town library building) </li></ul>
    10. 10. The Memex In 1945, Vannavar Bush, Director of the Office of Scientific Research and Development, outlined the ultimate tool of the near term future, the Memex, in the article “As We May Think”
    11. 11. The Memex
    12. 12. The Memex Memex images from MouseSite (Stanford University)
    13. 13. The Memex
    14. 14. Filling the Memex: Digitization Projects “ As an agent of change, printing altered methods of data collections, storage and retrieval systems, and communications networks used by learned communities throughout Europe” <ul><li>Elizabeth L. Eisenstein </li></ul><ul><li>The Printing Revolution in Early Modern Europe (1983) </li></ul>
    15. 15. <ul><li>Million Book Project </li></ul><ul><li>Amazon: Search Inside the Book </li></ul><ul><li>Google Book Search </li></ul><ul><li>Open Content Alliance </li></ul><ul><li>Biodiversity Heritage Library </li></ul>Filling the Memex: Digitization Projects
    16. 16. Filling the Memex: Digitization Projects Million Book Project
    17. 17. Filling the Memex: Digitization Projects Amazon: Search Inside the Book
    18. 18. Filling the Memex: Digitization Projects Google Book Search
    19. 19. Filling the Memex: Digitization Projects Open Content Alliance
    20. 20. Filling the Memex: Digitization Projects Biodiversity Heritage Library
    21. 21. <ul><ul><li>Fringe works will receive use (the “long tail”) </li></ul></ul><ul><ul><li>Create a deepened sense of history </li></ul></ul><ul><ul><li>A comprehensive topical library will let people know both what they do know and what they don’t know </li></ul></ul><ul><ul><li>A new culture of interaction and participation </li></ul></ul>Future Shock? Or Death by Snippets!
    22. 22. Future Shock? Or Death by Snippets! “ This is … a pretty grisly scenario … Books traditionally have edges: some are rough-cut, some are smooth-cut, and a few, at least at my extravagant publishing house, are even top-stained. In the electronic anthill, where are the edges? The book revolution, which, from the Renaissance on, taught men and women to cherish and cultivate their individuality, threatens to end in a sparkling cloud of snippets.” - John Updike, 2006
    23. 23. Future Shock? Or Death by Snippets! “ Defenders of the book often stress the ‘pleasure of handling books’ as a reason for the continued use. Of course this argument sometimes amounts to little more than an appeal to the bibliophile’s pleasure in handling his possessions” - Geoffrey Nunberg (1993)
    24. 24. Future Shock? Or Death by Snippets! “ All books are divisible into two classes, the books of the hour, and the books of all time.” ~ John Ruskin
    25. 25. <ul><li>Index Animalium: Sherborn’s Index Animalium is a compendium of zoological taxonomic species nomenclature from 1758 to 1850. For each species described in this period it clearly lists the genus name, author, publication, pages, and date </li></ul><ul><li>Lists over 400,000 taxonomic names </li></ul>Future Shock? Or Death by Snippets!
    26. 26. Future Shock? Or Death by Snippets!
    27. 27. Memory Institutions
    28. 28. Memory Institutions Each object in the Museum … would have been associated with a book (or several books) in the Library. However, there would also be many books which could not correspond with any exhibit (the natural history of unicorns, for example, or the geometry of round squares) …
    29. 29. Memory Institutions The fact that these books greatly outnumber those whose function is to catalogue the exhibits next door means that the overall size of the Library (despite the density of its shelving) is equal to that of its neighbour …. One had then … a perfectly balanced edifice, in which everything which the human mind is capable of inventing or understanding has its place.” - Andrew Crumey, Pfitz (1995)
    30. 30. Memory Institutions: Archives Unlike libraries, where the objects, books, can have their content transformed to other media without too much loss of original intent…
    31. 31. Memory Institutions: Archives … a manuscript letter, a mimeographed memo, a diary page, loses some of its being when translated to a printed page or a computer screen.
    32. 32. Memory Institutions: Archives At the same time, digital project allow for the creation of digital spaces where various archival resources – journals, photographs, contextual essays, video – can be easily brought together in one space
    33. 33. Memory Institutions: Archives
    34. 34. Memory Institutions: Archives
    35. 35. <ul><li>Art works: Stephen Weil calculated (in 1990) that in the U.S. alone there were over 8 million art works created: </li></ul><ul><ul><li>200,000 working artists </li></ul></ul><ul><ul><li>40 works per artist </li></ul></ul><ul><ul><li>8 million total works per year! </li></ul></ul>Memory Institutions: Museums
    36. 36. Memory Institutions: Museums
    37. 37. Memory Institutions: Museums
    38. 38. Memory Institutions: Museums
    39. 39. Memory Institutions: Museums 26,000,000 21,000,000 5,000,000 Harvard Univ. Herbarium/Mus. Comp. Zoo. 30,000,000 21,000,000 9,000,000 American Museum of Natural History, New York 58,877,300 50,000,000 8,877,300 Musee National d'Histoire Naturelle, Paris 60,200,000 55,000,000 5,200,000 Natural History Museum, London 83,000,000 78,500,000 4,500,000 National Museum of Natural History, Washington, DC Total Animal Specimens Plant Specimens Institution
    40. 40. Memory Institutions: Museums Plant Collection Institution 5,219,216 Missouri Botanical Garden 5,500,000 Conservatoire et Jardin Botaniques, Geneva 5,600,000 Swedish Museum of Natural History, Stockholm 5,770,000 Komarov Botanical Institute 7,000,000 New York Botanical Garden 7,000,000 Royal Botanical Garden, Kew
    41. 41. Total Specimens in top 17 collecting museums and botanical gardens world-wide: 384,166,516 Memory Institutions: Museums Bill Gates' Flower Fly. Eristalis gatesi Thompson . Found in the high montane cloud forests of Costa Rica June 8, 2007 Bill Gates’ net worth: $26.98 Billion
    42. 42. What Do You Do With a Million …? <ul><li>Books? </li></ul><ul><li>Linear feet of archival material? </li></ul><ul><li>Photographs? </li></ul><ul><li>Cultural objects? </li></ul><ul><li>Mosquito specimens? </li></ul><ul><li>Web pages? </li></ul>
    43. 43. What Do You Do With a Million …? <ul><li>Visited a museum in the last year? </li></ul><ul><li>Visited their public library in the last six months? </li></ul><ul><li>Watched public television in the last week? </li></ul><ul><li>Listened to public radio in the last few days? </li></ul>
    44. 44. What Do You Do With a Million …? One of the key things we need to remember is that whatever we digitize and make available, people will use it in ways we can’t or won’t imagine. People are going to slice, dice, reformat, reuse, repurpose and recreate whatever is out there. This is one of the disturbing things John Updike points out in his essay
    45. 45. What Do You Do With a Million …?
    46. 47. Factors: Wetware
    47. 48. <ul><li>Decreased storage costs: floppy discs (c. 1985) vs. USB drive (2006) </li></ul>Factors: Hardware
    48. 49. <ul><li>Cheap Scanning </li></ul><ul><ul><li>Internet Archive Scribe </li></ul></ul><ul><ul><li>Kirtas APT 2400 Scanner </li></ul></ul>Factors: Hardware
    49. 50. <ul><li>Cheap Print on Demand: Internet Archive Bookmobile </li></ul>Factors: Hardware
    50. 51. <ul><li>One Laptop Per Child </li></ul>Factors: Hardware
    51. 52. Factors: Networks <ul><li>Net Neutrality: “The most potent force shaping the future of the Internet is neither Mountain View’s Googleplex nor the Microsoft campus in Redmond. It’s rather a small army of Gucci-shod lobbyists on Washington’s K Street and the powerful legislators whose favor they curry.” </li></ul>Underwater cable for Bering Strait, 1866 National Museum of American History, from Isabelle Field Judson
    52. 53. Factors: Networks <ul><li>Real Broadband </li></ul><ul><ul><li>The U.S. ranked 12th among industrialized nations, with 16.8 broadband subscribers per 100 inhabitants, as of December, the OECD said. Iceland overtook longtime leader South Korea for the top spot. Countries in Northern Europe filled seven of the top 10 spots, underscoring how the region is leading the way in taking up this pillar of modern infrastructure. </li></ul></ul>Sample of Atlantic cable, 1866 National Museum of American History, from Western Union
    53. 54. Factors: Networks Broadband over Power Line (BPL) is a new technology that offers a potential competitor to the cable and telecommunications lobbies
    54. 55. Factors: Networks <ul><li>Transparency of the Network </li></ul><ul><ul><li>Everything REALLY is on the net </li></ul></ul><ul><ul><li>Remember the Internet Coke Machine at Carnegie-Mellon University (circa 1992 through today)? </li></ul></ul><ul><ul><li>How about the clothes washer? In July 2006, The Internet Home Alliance was formed with Whirlpool, Hewlett-Packard, Microsoft, and others and are developing “Laundry Time” software that will connect “smart” washers and dryers to the ‘Net and send you IM or text messages as they move through their cycles! ) </li></ul></ul>
    55. 56. Factors: Digital Preservation <ul><li>Save the Bytes! </li></ul><ul><li>Save the Content! </li></ul><ul><li>Call me when you have the answer! </li></ul>
    56. 57. Factors: Intellectual Property U.S. Constitution Article I, Section 8, Clause 8: To promote the Progress of Science and useful Arts, by securing for limited Times to Authors and Inventors the exclusive Right to their respective Writings and Discoveries
    57. 58. Factors: Intellectual Property “ History belongs to everybody. It shouldn’t be locked away in dark rooms,” says Michael Edmonds, deputy administrator of the Wisconsin Historical Society’s library archives division. “It should be on everybody’s laptops at Starbucks.”
    58. 59. “ The vindication lies in the finding ... Now every volume lies instantly within our grasp, and we possess a far greater understanding of our identical impotence. I would that I lived in the old days.” Daniel Langford “ The Net of Babel by J*rg* L**s B*rg*s” (1995)
    59. 60. Conclusion? “ The world has arrived at an age of cheap complex devices of great reliability; and something is bound to come of it” - Vannevar Bush (1945)
    60. 61. 10 Words/Phrases I Did Not Use In This Presentation <ul><li>Cyber- </li></ul><ul><li>“ Ceci tuera cela” </li></ul><ul><li>Flickr </li></ul><ul><li>FaceBook </li></ul><ul><li>Blogosphere </li></ul><ul><li>Wikipedia </li></ul><ul><li>Paradigm </li></ul><ul><li>My Space </li></ul><ul><li>Podcast </li></ul><ul><li>Flash Mob </li></ul>
    61. 62. Converging on the Universal Library: From Memex to Googolplex Martin R. Kalfatovic Smithsonian Institution Libraries [email_address]

    ×