What are NoSQL databases and when do
you need them?
About Us: Blackhawk Enterprise Inc.

Hardware

Software
Visualization
Analysis
Big data experience
Big Data Platforms

Technologies

Google BigQuery

Hadoop MapReduce/HDFS

Amazon EC2, EMR
Army Cloud
D...
Symptoms of a big data problem
o If what you are doing works for you, don’t change it!
o Storage space
o Data throughput
o...
Real world problem example

o Customer receives daily “deliveries” of textual data
o Couldn’t get all the data loaded into...
Original data load
New data load
Original Algorithm
New Algorithm
Original Query
New Query
Demo setup

o
o
o
o
o

Google BigQuery
CDC Natality data set: live births in US (1969- 2008)
~138 Million rows
30 columns
...
Upcoming SlideShare
Loading in …5
×

What are NoSQL databases and when do you need them?

501 views

Published on

Published in: Technology
0 Comments
0 Likes
Statistics
Notes
  • Be the first to comment

  • Be the first to like this

No Downloads
Views
Total views
501
On SlideShare
0
From Embeds
0
Number of Embeds
1
Actions
Shares
0
Downloads
3
Comments
0
Likes
0
Embeds 0
No embeds

No notes for slide

What are NoSQL databases and when do you need them?

  1. 1. What are NoSQL databases and when do you need them?
  2. 2. About Us: Blackhawk Enterprise Inc. Hardware Software Visualization Analysis
  3. 3. Big data experience Big Data Platforms Technologies Google BigQuery Hadoop MapReduce/HDFS Amazon EC2, EMR Army Cloud DIA Cloud NoSQL: Hbase, Accumulo, MongoDB Apache Storm: data stream processing DIA Cloud
  4. 4. Symptoms of a big data problem o If what you are doing works for you, don’t change it! o Storage space o Data throughput o Computations take too long o Queries take too long o You have lots of disparate data
  5. 5. Real world problem example o Customer receives daily “deliveries” of textual data o Couldn’t get all the data loaded into the server. Wouldn’t fit and it was taking a long time o Unable to run algorithms on in a timely manner o User interfaces were sluggish because of slow query times
  6. 6. Original data load
  7. 7. New data load
  8. 8. Original Algorithm
  9. 9. New Algorithm
  10. 10. Original Query
  11. 11. New Query
  12. 12. Demo setup o o o o o Google BigQuery CDC Natality data set: live births in US (1969- 2008) ~138 Million rows 30 columns 21.9 GB of data Run Queries…

×