First, Firster, FirstestThree lessons fromhistory on informationoverload and technologyStrata ConferenceSeptember, 2011Mar...
Sivowitch’s Law of Firsts   “Whenever you prove who was first, the harder   you look you will find someone else who was   ...
"Those who cannot remember                   the past are condemned to                   repeat it.”                      ...
The future of data is the relational database
You keep using that word.I do not think it meanswhat you think it means.
Good conceptual model, bad implementationThe relational database is the franchise technology for storing and retrieving da...
Big Data: The SQL vs noSQL argument
There’s a difference between having no past and actively rejecting it.
“There is nothing new under the sun but there are lots of old things we dont know.”                      Ambrose Bierce
The fundamental data storage device for a thousand years
The Elizabethan EraAutomated printing. Information explosion:   ▪ 8M books in 1500  ▪ 200M by 1600  ▪ CommoditizationData ...
The Elizabethan Era: Storage and Retrieval
The Elizabethan Era: Storage and Retrieval
The Elizabethan Era: Storage and Retrieval
The Georgian Era: The Explosion of Natural Philosophy
Buffon         Bottom up orientation         Flexible structure         Explanatory, descriptive         Faceted classific...
Linnaeus           Top down orientation           Static structure           Descriptive rather than            explanator...
The Theory of American Degeneracy  vs                vs
The Theory of American Degeneracy
The Theory of American Degeneracy
vs     vs
The Victorian Era
Charles Ammi Cutter                      Cutter Expansive                       Classification System                     ...
Melvil Dewey               Dewey Decimal System               Top down orientation               Static structure         ...
vs
Every technology is a tradeoff between somethingHistory is always the same:  ▪ Top down vs. bottom up  ▪ Authority vs. ana...
So why did Linnaeus and Dewey win?          Good enough           wins the day          It wasn’t solving           the pr...
What lesson might we apply from this?                            Ok, it’s not   You write a   Did youSo how do     It’s no...
Dealing with data in the industrial era                              Paul Otlet at his desk
19th Century Data Loading
Writing to the Database, Note Multi‐processing
Large Scale Information Storage
Information Retrieval
The Computer & Internet Were Invented in 1934                      Otlet’s future vision:                        ▪ Technol...
The Mundaneum Worked, For a While                       Two primary flaws of                        the Mundaneum:        ...
Information Management Through Human History         New technology development                    creates             New...
Big Data
You keep using that word.I do not think it meanswhat you think it means.
Big data?      Unstructured data isn’t       really unstructured.      The problem is that this       data is unmodeled.
The future of data is the relational database     SQL                      noSQL
The future of data is the relational database     SQL                      noSQL
The false dichotomy can be removed by technologyCode defines what’s possible now - maybe it’s time to recode
Conclusion
CC Image AttributionsThanks to the people who supplied the creative commons licensed images used in this presentation:manu...
About the Presenter                      Mark Madsen is president of Third                      Nature, a technology resea...
About Third NatureThird Nature is a research and consulting firm focused on new andemerging technology and practices in bu...
Upcoming SlideShare
Loading in...5
×

First, Firster, Firstest: Three lessons from history on information overload

1,690

Published on

Keynote from the 2011 Strata New York conference.

The first person to conceive of something is usually not the first. They're the first to re-conceive at a point where the current technology caught up to someone else's idea. We're at a point today where many old ideas are being reinvented. Hear why looking to the past, beyond your core field of interest, is worthwhile.

Video can be found at http://www.youtube.com/watch?v=Qv0yF47L8WE

Published in: Technology, Education
0 Comments
1 Like
Statistics
Notes
  • Be the first to comment

No Downloads
Views
Total Views
1,690
On Slideshare
0
From Embeds
0
Number of Embeds
1
Actions
Shares
0
Downloads
24
Comments
0
Likes
1
Embeds 0
No embeds

No notes for slide

First, Firster, Firstest: Three lessons from history on information overload

  1. 1. First, Firster, FirstestThree lessons fromhistory on informationoverload and technologyStrata ConferenceSeptember, 2011Mark R. Madsenhttp://ThirdNature.net
  2. 2. Sivowitch’s Law of Firsts “Whenever you prove who was first, the harder you look you will find someone else who was more first. And if you persist in your efforts you find that the person whom you thought was first was third.” - Eliot Sivowitch Page 2
  3. 3. "Those who cannot remember  the past are condemned to  repeat it.”  George SantayanaIf there’s one lesson we can take from history, It’sthat nobody learns any lessons from history.
  4. 4. The future of data is the relational database
  5. 5. You keep using that word.I do not think it meanswhat you think it means.
  6. 6. Good conceptual model, bad implementationThe relational database is the franchise technology for storing and retrieving data, but… 1. Single, static schema model 2. No rich typing system 3. Limited API in atomic SQL statement syntax
  7. 7. Big Data: The SQL vs noSQL argument
  8. 8. There’s a difference between having no past and actively rejecting it.
  9. 9. “There is nothing new under the sun but there are lots of old things we dont know.” Ambrose Bierce
  10. 10. The fundamental data storage device for a thousand years
  11. 11. The Elizabethan EraAutomated printing. Information explosion:  ▪ 8M books in 1500 ▪ 200M by 1600 ▪ CommoditizationData management tech: ▪ Perfect copies ▪ Indices ▪ Topical catalogs ▪ First real encyclopedia ▪ Font standardization
  12. 12. The Elizabethan Era: Storage and Retrieval
  13. 13. The Elizabethan Era: Storage and Retrieval
  14. 14. The Elizabethan Era: Storage and Retrieval
  15. 15. The Georgian Era: The Explosion of Natural Philosophy
  16. 16. Buffon Bottom up orientation Flexible structure Explanatory, descriptive Faceted classification
  17. 17. Linnaeus Top down orientation Static structure Descriptive rather than  explanatory Taxonomic classification
  18. 18. The Theory of American Degeneracy vs vs
  19. 19. The Theory of American Degeneracy
  20. 20. The Theory of American Degeneracy
  21. 21. vs vs
  22. 22. The Victorian Era
  23. 23. Charles Ammi Cutter Cutter Expansive  Classification System  (~1882) Bottom up orientation More flexible structure Explanatory, descriptive
  24. 24. Melvil Dewey Dewey Decimal System Top down orientation Static structure Descriptive rather than  explanatory
  25. 25. vs
  26. 26. Every technology is a tradeoff between somethingHistory is always the same: ▪ Top down vs. bottom up ▪ Authority vs. anarchy ▪ Bureaucracy vs. autonomy ▪ Control vs. creativity ▪ Hierarchy vs. network ▪ Power vs. ease ▪ Dynamic vs. staticIn every choice, something is lost when something is gained.
  27. 27. So why did Linnaeus and Dewey win? Good enough  wins the day It wasn’t solving  the problem you  thought it was.
  28. 28. What lesson might we apply from this? Ok, it’s not You write a Did youSo how do It’s not a a database distributed just tell meI query the database, How do I mapreduce to go todatabase? it’s a key- query it? function in hell? I believe I value erlang. did, Bob. store! Perhaps you should think about pragmatism a little bit.
  29. 29. Dealing with data in the industrial era Paul Otlet at his desk
  30. 30. 19th Century Data Loading
  31. 31. Writing to the Database, Note Multi‐processing
  32. 32. Large Scale Information Storage
  33. 33. Information Retrieval
  34. 34. The Computer & Internet Were Invented in 1934 Otlet’s future vision: ▪ Technological  developments will  improve the ability to  manage information ▪ Current technologies  can be integrated to  provide individual  discovery, access and  collaboration
  35. 35. The Mundaneum Worked, For a While Two primary flaws of  the Mundaneum: ▪ Static, top‐down  classification system ▪ Loading could not  keep up with data  production rates Sounds familiar
  36. 36. Information Management Through Human History New technology development creates New methods to cope creates New information scale and availability creates…
  37. 37. Big Data
  38. 38. You keep using that word.I do not think it meanswhat you think it means.
  39. 39. Big data? Unstructured data isn’t  really unstructured. The problem is that this  data is unmodeled.
  40. 40. The future of data is the relational database SQL noSQL
  41. 41. The future of data is the relational database SQL noSQL
  42. 42. The false dichotomy can be removed by technologyCode defines what’s possible now - maybe it’s time to recode
  43. 43. Conclusion
  44. 44. CC Image AttributionsThanks to the people who supplied the creative commons licensed images used in this presentation:manuscript_page.jpg ‐ http://www.flickr.com/photos/calliope/306564541/manuscript_illum.jpg ‐ http://www.flickr.com/photos/diorama_sky/2975796332bookshelf by spectrum.jpg ‐ http://flickr.com/photos/santos/1704875109/moose.jpg ‐ http://www.flickr.com/photos/stephenandjes/4286949510/Vatican library ‐ http://www.flickr.com/photos/paullew/1550844955Copyright or unknownLittle girl and fire – Dave RothProcrastinate – http://www.cracked.comFault tolerance ‐ http://browsertoolkit.com/fault‐tolerance.png
  45. 45. About the Presenter Mark Madsen is president of Third Nature, a technology research and consulting firm focused on business intelligence, analytics and information management. Mark is an award-winning author, architect and former CTO whose work has been featured in numerous industry publications. During his career Mark received awards from the American Productivity & Quality Center, TDWI, Computerworld and the Smithsonian Institute. He is an international speaker, contributing editor at Intelligent Enterprise, and manages the open source channel at the Business Intelligence Network. For more information or to contact Mark, visit http://ThirdNature.net.
  46. 46. About Third NatureThird Nature is a research and consulting firm focused on new andemerging technology and practices in business intelligence, dataintegration and information management. If your question is related to BI,open source, web 2.0 or data integration then you‘re at the right place.Our goal is to help companies take advantage of information-drivenmanagement practices and applications. We offer education, consultingand research services to support business and IT organizations as well astechnology vendors.We fill the gap between what the industry analyst firms cover and what ITneeds. We specialize in product and technology analysis, so we look atemerging technologies and markets, evaluating the products rather thanvendor market positions.
  1. A particular slide catching your eye?

    Clipping is a handy way to collect important slides you want to go back to later.

×