Big Data and Wee Data by Avery Rosen

680 views
554 views

Published on

Big Data and Wee Data - We all know MongoDB is great for Big Data, but it's also great for work on the other end of the scale -- call it "Wee Data". In this talk, MongoDB expert an Principal at Bringing Fire Consulting, Avery Rosen, talks on how this type of data is far more common than Big Data scenarios. Avery discusses how just about every project starts with it. In this domain, we don't care about disk access and indices; instead, we care about skipping past the wheel inventing and getting right down to playing with the data. MongoDB lets you persist your prototype or small-working-set data without making you deal with freeze-drying and reconstitution, provides structure well beyond csv, gets out of your way as you evolve your schemas, and provides simple tools for introspecting data and crunching numbers.

Published in: Technology
0 Comments
1 Like
Statistics
Notes
  • Be the first to comment

No Downloads
Views
Total views
680
On SlideShare
0
From Embeds
0
Number of Embeds
139
Actions
Shares
0
Downloads
5
Comments
0
Likes
1
Embeds 0
No embeds

No notes for slide

Big Data and Wee Data by Avery Rosen

  1. 1. MongoDB and Wee Data hacking a workflow
  2. 2. what it’s all about • MongoDB lets you persist and investigate your application data while getting out of your way • p.s. BDD
  3. 3. me • http://bringingfire.com/ • @4BringingFire • expertise at the intersection of technology, product formulation, and business strategy
  4. 4. stuff I’m into • products > tech • UX testing • iterative, analytics-driven dev • agility in production
  5. 5. http://www.shopwiki.com
  6. 6. © wired.com these guys
  7. 7. http://penrosejudaica.com
  8. 8. 
 N T A AC T projects 1% < ? O LF= total projects big data N A U T C A
  9. 9. what is wee data? • query performance is not an issue • in fact it might fit in RAM • in fact it might fit in 2MB
  10. 10. where mongo fits • scales down nicely • access programmatically without SQL • persists for free • query later, too
  11. 11. ORM puh-lease.
  12. 12. http://penrosejudaica.com
  13. 13. Behavior Driven Development • “Outside-in” approach • Everything is behavior • Acceptance-test-driven planning • Ubiquitous language
  14. 14. Cucumber
  15. 15. Cucumber
  16. 16. Cucumber
  17. 17. Cucumber
  18. 18. rspec • specification • object doubles (mocks, fakes, stubs, spies) • method stubbing • expectations
  19. 19. Behavior Driven Development • http://cukes.info/ • http://rspec.info/
  20. 20. the Penrose ETL pipeline Name Materials Price fancy menorah gold, silver 600 plain menorah pewter Description 150 Blah blah blah exquisite craftsmanship .csv .csv .csv Product Spreadsheets CSV files Buy AdWords? db.products.aggregate( { $match : { product_type : “menorah” } }, { $group : { _id: “$material”, count: Product Descriptions { $sum : 1 } } )
  21. 21. Q &A

×