All Data Big and Small - By Stephen O'Grady

See conference video - http://www.lucidimagination.com/devzone/events/conferences/revolution/2011

In 2009 The Guardian launched The Open Platform, a suite of services and tools that enable
content partners and developers to build applications with The Guardian’s rich content. The content
API, hosted on Solr instances on EC2, contains JSON representations of all Guardian articles back
to 1999 - over 1 million articles, and is an increasingly complete representation of the output of the
organization. The DataStore contains curated data sets for use in applications and visualizations.
This talk will cover how The Guardian opened up their business, enriched it, and reached new
markets with its Open Platform strategy. Stephen will cover the technical architecture,
implementation of Solr (the key technology powering the platform), and how The Guardian has
used it to embrace disruption in the media space, while finding new sources of revenue and
innovation.

Published in: Technology

All Data Big and Small - By Stephen O'Grady

  1. All Data Big and Small. May 2011. 10.20.2005
  2. http://redmonk.com/public/lucene.pdf
  3. In the beginning, there was the database...
  4. 1979 1983 1989
  5. When you have a hammer and so on
  6. Source: http://www.flickr.com/photos/pagedooley/2234031789/
  7. December 29, 2004
  8. The Cambrian Non-relational Explosion
  9. Source: http://www.pnas.org/content/97/9/4426/F1.expansion.html
  10. (no text)
  11. Why?
  12. Different tools for different jobs
  13. Or, rather, different data
  14. A lot of different data
  15. (no text)
  16. Most of the attention goes to Big Data
  17. In spite of the fact that comparatively few have it
  18. Less heralded is unstructured data
  19. (no text)
  20. Between the size and (un)structure, it's amazing anything gets found
  21. Source: http://www.flickr.com/photos/28705377@N04/4142872268/
  22. It's hard to ask the right question
  23. To make matters worse, you may only get one chance
  24. The most important answer is the next question
  25. Some questions
  26. (no text)
  27. (no text)
  28. (no text)
  29. (no text)
  30. (no text)
  31. OTHER QUESTIONS
