History of the Info: Part II Nick Ducoff CEO and Co-Founder, Infochimps
Early 2000s
Mid 2000s
Present Day
3000 BC Recording
3000 BC 1200 BC Recording Aggregating
3000 BC 1200 BC 300 BC Recording Aggregating Storing  at Scale
300s AD – Random Access 3000 BC 1200 BC 300 BC 300 AD Recording Aggregating Storing  at Scale Random  Access
3000 BC 1200 BC 300 BC 300 AD 1400 AD Recording Aggregating Storing  at Scale Random  Access Mass Distribution
3000 BC 1200 BC 300 BC 300 AD 1400 AD 1700 AD Recording Aggregating Storing  at Scale Random  Access Mass Distribution Infographics
1930s – Computation theory (Turing)
1940s – Information theory (Shannon)
1950s – Computer languages (1GL, 2GL, 3GL)
1960s – Standardized metadata (Avram)
1970s – Relational databases (IBM)
1980s – WWW (Al Gore)
1990s – Internet archive (Kahle)
3000 BC 1200 BC 300 BC 300 AD 1400 AD 1700 AD Recording Aggregating Storing at Scale Random Access Mass Distribution Infographics
 
 
 
 
 
 
Tables on web pages, Open APIs, Commercial data sources

Augmentation
Name            ZIP    Average Rent
Walter Cureton  78701  $400-$599
Ivy Caldwell    94103  >$1500
Regina Wootton  10027  $1000-$1499

Completion
Name           Address           City           ZIP
Brian James    901 Red River     Austin         78701
Terri Becraft  262 7th St.       San Francisco  94103
Paz Brummit    603 W. 114th St.  New York       10027

Normalization
Name         Address                     Normalized Address
Cecil Bartz  901 red river austin texas  901 Red River, Austin, TX 78701
Genaro Luz   702 w. 32nd st austin       702 W. 32nd St., Austin, TX 78705
Ruth Brown   114th + broadway, nyc       W. 114th St. & Broadway, New York, NY 10027
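The three operations named on this slide (augmentation, completion, normalization) can be illustrated with a minimal Python sketch. Everything in it is an assumption made for illustration: the function names and lookup tables are hypothetical, the tables hold only values that appear on the slide, and the slide does not say which field was filled in during completion (the sketch assumes it was the city). Normalizing a real address also needs a reference address database; the sketch only tidies capitalization and the "st" abbreviation of the street part.

```python
# Hypothetical lookup tables, populated only with values shown on the slide.
RENT_BY_ZIP = {"78701": "$400-$599", "94103": ">$1500", "10027": "$1000-$1499"}
CITY_BY_ZIP = {"78701": "Austin", "94103": "San Francisco", "10027": "New York"}
STREET_ABBREVIATIONS = {"st": "St.", "st.": "St."}


def augment(record):
    """Augmentation: attach a new attribute (average rent) keyed on ZIP."""
    return {**record, "average_rent": RENT_BY_ZIP.get(record["zip"])}


def complete(record):
    """Completion: fill in a missing city from the ZIP (assumed missing field)."""
    if record.get("city"):
        return dict(record)
    return {**record, "city": CITY_BY_ZIP.get(record["zip"])}


def normalize_street(street):
    """Normalization: consistent capitalization and street-type abbreviation."""
    tokens = street.split()
    return " ".join(
        STREET_ABBREVIATIONS.get(token.lower(), token.capitalize())
        for token in tokens
    )


print(augment({"name": "Walter Cureton", "zip": "78701"}))
print(complete({"name": "Brian James", "address": "901 Red River", "zip": "78701"}))
print(normalize_street("702 w. 32nd st"))
```

Each function returns a new record instead of mutating its input, so the same source row can be passed through any combination of the three steps.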
 
 
 
[email_address]

History of Data


Editor's Notes

  1. Internet brought offline businesses online
  2. Social networks created massive amounts of data
  3. Social networks created massive amounts of data
  4. Babylon was the first society to systematically record knowledge, including the first census, which counted and recorded people and commodities for taxation and other purposes
  5. The library at Thebes was the first known effort to gather many sources of knowledge and make them available in one place
  6. Charged with collecting all the world's knowledge, the Library of Alexandria collected what is thought to have been nearly a half million objects
  7. Codex replaces scrolls, enabling random access of information, or browsing.
  8. Gutenberg’s printing press enables mass production and distribution of information
  9. William Playfair invents the line, bar and pie charts, paving the way for Charles Minard’s famous graphical representation of Napoleon’s March
  10. Alan Turing showed that any reasonable computation could be done by programming a machine. Claude Shannon solved the engineering problem of transmitting information over a noisy channel. Computer languages advanced quickly from first-generation languages to third-generation languages such as COBOL. Henriette Avram created the machine-readable cataloging (MARC) system to metatag books. Relational databases enabled storage and lookup of data at scale. Tim Berners-Lee created the WWW, which led to mass adoption of the internet; it quickly grew to billions of pages, prompting Brewster Kahle to begin systematically capturing and storing the information.
  11. 1.8 ZB of data but still hard to find the pieces you want
  12. Aggregated, organized, accessible. When you can easily identify, understand and access the pieces, you can build anything.
  13. Map by Charles Joseph Minard portrays the losses suffered by Napoleon's army in the Russian campaign of 1812
  14. Better BI decisions and data-driven apps