How To Tell if Your Business Needs NoSQL



Expert IT analyst groups like Wikibon forecast that NoSQL database usage will grow at a compound rate of 60% each year for the next five years, and Gartner says NoSQL databases are one of the top trends impacting information management in 2013. But is NoSQL right for your business? How do you know which business applications will benefit from NoSQL and which won't? What questions do you need to ask in order to make such decisions?
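
To put that forecast in perspective, a 60% compound annual rate sustained for five years multiplies the starting figure roughly tenfold. A minimal sketch of the arithmetic follows; the function name is illustrative and not taken from the webinar.

```python
def growth_multiple(annual_rate: float, years: int) -> float:
    """Total growth factor after compounding annual_rate once per year."""
    return (1.0 + annual_rate) ** years


if __name__ == "__main__":
    # 60% per year for five years, as quoted in the forecast above.
    multiple = growth_multiple(0.60, 5)
    print(f"Growth over 5 years at 60% CAGR: {multiple:.2f}x")
```

The same function can be used to compare other rates and horizons quoted by analysts.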

If you're wondering what NoSQL is and whether your business can benefit from NoSQL technology, join DataStax for the webinar "How to Tell if Your Business Needs NoSQL". This to-the-point presentation provides practical litmus tests to help you understand whether NoSQL is right for your use case, and supplies examples of leading businesses using NoSQL technology that demonstrate how and where NoSQL databases can have the greatest impact.

Speaker: Robin Schumacher, Vice President of Products at DataStax

Robin Schumacher has spent the last 20 years working with databases and big data. He comes to DataStax from EnterpriseDB, where he built and led a market-driven product management group. Previously, Robin started and led the product management team at MySQL for three years before it was acquired by Sun (the largest open source acquisition in history), and then by Oracle. He also started and led the product management team at Embarcadero Technologies, which was the #1 IPO in 2000. Robin is the author of three database performance books and a frequent speaker at industry events. Robin holds BS, MA, and Ph.D. degrees from various universities.

Published in: Technology

  • How To Tell if Your Business Needs NoSQL

    1. How to Tell if Your Business Needs NoSQL
        Robin Schumacher, VP Products
    2. Overview of DataStax
        • Founded in April 2010
        • The Apache Cassandra™ company
        • Home to Apache Cassandra Chair & most committers
        • Cassandra is a massively scalable NoSQL database
        • Provide enterprise-class big data platform based on Cassandra
        • 270+ customers
        • Headquartered in San Francisco Bay area
        • Funded by prominent venture firms
    3. Serving Every Industry
    4. Leading in Performance
        Cloud Benchmark…
        “In terms of scalability, there is a clear winner throughout our experiments. Cassandra achieves the highest throughput for the maximum number of nodes in all experiments with a linear increasing throughput.”
        Solving Big Data Challenges for Enterprise Application Performance Management, Tilmann Rabl, et al., August 2013, p. 10. Benchmark paper presented at the Very Large Database Conference, 2013.
        Point Independent NoSQL Benchmark
        Highest in throughput… Lowest in latency…
    5. NoSQL Momentum
        “According to analysis by Wikibon’s David Floyer (and highlighted in the Wall Street Journal), the NoSQL database market is expected to grow at a compound annual growth rate of nearly 60% between 2011 and 2017. The SQL slice of the Big Data market, in contrast, will grow at just a 26% CAGR during that same time period.”
    6. NoSQL Momentum
        “NoSQL is the stuff of the Internet Age.” - Andrew Oliver, InfoWorld
    7. But Does My Business Need NoSQL…?
        Just because a technology appears to be having strong adoption in the market, that doesn’t mean it’s right for your business…
    8. What is NoSQL…?
        • Progressive data management engines
        • Go beyond legacy relational databases
        • Flexible data model
        • Horizontal scalability
        • Distributed architectures
        • Use of languages and interfaces that are “not only” SQL
    9. NoSQL Example – Apache Cassandra
        Apache Cassandra is a massively scalable NoSQL database that offers continuous availability and easy data distribution.
    10. NoSQL Example – Apache Cassandra
        “Cassandra stands at the front of the NoSQL pack when it comes to supporting real-time, big data applications.” – Wikibon
    11. How Can I Tell if NoSQL Can Help Me Run My Business and Reduce Costs?
        ©2013 DataStax Confidential. Do not distribute without consent.
    12. NoSQL Business Considerations
        • Need scale-out (vs. scale-up)?
        • Manage different types of data like social media?
        • Lots of data coming in (and fast)?
        • Have non-RDBMS, non-ACID transactions?
        • Must keep large data volumes online?
        • Continuous uptime necessary?
        • Wide-scale data distribution needed?
        • Need to integrate different systems?
        • Cost a factor?
    13. Need Scale-Out (vs. Scale-Up)?
        No
        • Application does not require multiple machines
        • Can scale-up and meet the application’s current and future needs
        Yes
        • Application demands divide-and-conquer
        • Capacity expansion is best/can only be handled via new machines
        Key takeaway: If your applications can easily run on one machine, fit all your data in RAM or can easily expand via new cores/more drives to fulfill current and future requirements, you may not need NoSQL…
    14. NoSQL Case Study
        Ooyala distributes and analyzes video content for companies like ESPN, Rolling Stone and others. They track about one quarter of all online video viewers each day and generate 1-2 billion events that are streaming in real-time through their system.
    15. Manage Different Types of Data?
        No
        • No non-structured data (all or mostly rigid formats)
        • E.g. no social media data
        Yes
        • All types of data (structured, semi, and unstructured)
        • Social media data
        Key takeaway: If all your data systems deal with standard RDBMS structured data and that won’t be changing, then you may not need NoSQL…
    16. NoSQL Case Study
        HealthCare Anytime needs to analyze doctor’s notes and other types of difficult data to properly bill back Medicare / Medicaid.
    17. NoSQL Case Study
        “Cassandra’s NoSQL data model allows us to insert and query data much more naturally than what we had previously. The analysts who routinely use this data were impressed with the flexibility and speed at which the queries came back.” – CSC/NASA
    18. Lots of Data Coming In (and Fast)?
        No
        • No high velocity data (e.g. device, sensors, web streaming, etc.)
        • No multiple locations
        • Little/no concern about write speed
        Yes
        • High velocity, write intensive
        • Multiple locations sending data
        • Must consume data as quickly as possible
        Key takeaway: Business applications involving rapid time series data, device ‘exhaust’, web or financial streaming data make good use cases for NoSQL…
    19. NoSQL Case Study
        Gnip takes in huge volumes of social media data at high rates of speed (e.g. 20,000 Tweets per second).
    20. Non-RDBMS, Non-ACID Transactions?
        No
        • Standard RDBMS, nested, ACID transactions required
        • Complex, requiring rollbacks, savepoints, etc., needed
        Yes
        • “Big Data” transactions OK or are necessary
        • Atomic, Isolated, Durable (AID), but eventual or tunable consistency allowed
        Key takeaway: NoSQL databases do transactions, but since they don’t support joins or foreign keys, consistency conforms to the CAP theorem vs. RDBMS ACID-styled consistency…
    21. NoSQL Case Study
        eBay does transactions, but does not want overhead of RDBMS ACID-type transactions.
    22. Cassandra and Transactions
        Individual or batch transactions with AID and tunable consistency.
    23. Must Keep Large Data Volumes Online?
        No
        • No application requirement to keep large volumes of data
        • System typically purges data older than certain time period
        Yes
        • Must keep large volumes of data online and available to customers
        • Retain both hot and cold data
        Key takeaway: Some NoSQL databases like Cassandra can excel over typical RDBMSs when it comes to maintaining large volumes of data online and meeting stringent performance SLAs…
    24. NoSQL Case Study
        Easou is the #1 mobile search firm in China. One of their Cassandra applications stores online video images for retrieval / viewing and is 300TB in size.
    25. Continuous Uptime Necessary?
        No
        • Applications have no need for constant uptime
        • Unplanned downtime can be handled via traditional failover
        Yes
        • Applications cannot tolerate any downtime
        • Standard log shipping, failover, hot backups won’t do
        Key takeaway: Some NoSQL databases like Cassandra are able to guarantee no downtime because of their architectures…
    26. NoSQL Case Study
        Netflix systems are run in the cloud across multiple availability zones with Cassandra and sport constant uptime.
    27. NoSQL Case Study
        Commenting on Amazon outage in Oct 2012: “We configure all our clusters to use a replication factor of three, with each replica located in a different Availability Zone. This allowed Cassandra to handle the outage remarkably well. When a single zone became unavailable, we didn’t need to do anything. Cassandra routed requests around the unavailable zone and when it recovered, the ring was repaired.” - Netflix Tech Blog
    28. Wide-Scale Data Distribution Needed?
        No
        • Application’s data needs are single site only
        • No need to distribute data in other locales for any reason
        Yes
        • Application serves customers in multiple locations
        • Data is distributed across multiple data centers / cloud zones for latency/performance or disaster recovery reasons
        Key takeaway: Cassandra is the gold standard among NoSQL databases for multi-data center, data distribution use cases…
    29. NoSQL Case Study
        Rightscale keeps its customers in contact with each other all over the world via Cassandra clusters in 5+ global data centers.
    30. Need to Integrate Different Systems?
        No
        • Applications use siloed databases
        • No need for different data systems to interact with each other
        Yes
        • Application has different database workloads
        • Multiple data domains serve single application
        Key takeaway: ETL and simple connectors oftentimes do not do the job. Instead, what’s needed is something like DataStax Enterprise, which provides one database that serves multiple database workloads…
    31. NoSQL Case Study
        Datafiniti, which is a search engine for data, needs to consume lots of data in real time and provide fast search on top of the same data.
    32. Cost a Factor?
        No
        • Application is small and not cost intensive to operate
        • Software license costs not a factor
        Yes
        • Large scale business applications
        • Traditional RDBMS software costs a significant concern
        Key takeaway: NoSQL database costs can oftentimes be 70-80% less than legacy RDBMS software. Further, large operations staffs are not required to manage NoSQL systems.
    33. NoSQL Case Study
        Constant Contact found that scaling out with NoSQL vs. an RDBMS saved them 90% in software costs, and was implemented in 1/3 the time...
    34. What Strategies Can I Use To Implement NoSQL in my Business?
    35. NoSQL Implementation Strategies
        New
        • New big data applications
        • Legacy systems keep old databases
        Hybrid
        • NoSQL database used for heavy lifting / big data management
        • Legacy RDBMS maintains smaller parts of database
        Replacement
        • Legacy RDBMS cannot meet demands of new or evolving big data system
        • Data models and data are migrated
    36. DataStax Enterprise – NoSQL for the Enterprise
        DataStax Enterprise is a complete big data platform, built on Cassandra, that is architected to manage real-time, analytic, and enterprise search data all in the same database cluster.
    37. What You Get With DataStax Enterprise
        • DataStax Enterprise Database Server
        • OpsCenter Enterprise Management solution
        • Expert 24x7 support
    38. Use Cases Handled By DataStax Enterprise
        Managed by Cassandra
        • Time series data
        • Device/Sensor/Data “exhaust” systems
        • Distributed applications
        • Media streaming
        • Online Web retail (transactional, shopping carts, etc.)
        • Real-time data analytics
        • Social media capture and analysis
        • Web click-stream analysis
        • Write-intensive transactional systems
        Managed by Hadoop
        • Buyer behavior analytics
        • Compliance/regulatory analysis
        • Customer recommendation output
        • Fraud detection
        • Risk analysis
        • Sales program campaign analysis
        • Supply chain analytics
        • Batch Web clickstream analysis
        Managed by Solr
        • General Web search
        • Web retail faceted (categorization) search
        • Search/hit prioritization and highlighting
        • Application log search and analysis
        • Document (PDF, MS Word, etc.) search and analysis
        • Geospatial search
        • Real estate location and property search
        • Social media match ups
    39. Next Steps
        Download DataStax Enterprise and try it in your own environment.
        • Go to
        • Download a copy of DataStax Enterprise
        • Installs and configures in minutes
        • Completely free for development use; subscription required for production deployments
    40. For More Information
    41. Thank You
        We power the big data applications that transform business.
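
The nine questions on the "NoSQL Business Considerations" slide can be worked through as a simple checklist. The sketch below is only an illustration: the function name and the yes/no encoding are assumptions, and the slides do not define any score or threshold, so the code simply lists which considerations apply to a given application.

```python
# Questions taken from the "NoSQL Business Considerations" slide.
QUESTIONS = [
    "Need scale-out (vs. scale-up)?",
    "Manage different types of data like social media?",
    "Lots of data coming in (and fast)?",
    "Have non-RDBMS, non-ACID transactions?",
    "Must keep large data volumes online?",
    "Continuous uptime necessary?",
    "Wide-scale data distribution needed?",
    "Need to integrate different systems?",
    "Cost a factor?",
]


def yes_answers(answers: dict[str, bool]) -> list[str]:
    """Return the checklist questions answered 'yes', in slide order.

    Hypothetical helper: unanswered questions are treated as 'no',
    and questions not on the slide are rejected.
    """
    unknown = set(answers) - set(QUESTIONS)
    if unknown:
        raise ValueError(f"Not a checklist question: {sorted(unknown)}")
    return [q for q in QUESTIONS if answers.get(q, False)]


if __name__ == "__main__":
    # Example answers for an imaginary application.
    example = {
        "Need scale-out (vs. scale-up)?": True,
        "Continuous uptime necessary?": True,
        "Cost a factor?": False,
    }
    for question in yes_answers(example):
        print("Applies:", question)
```

Each "yes" points to the corresponding slide's key takeaway; how much weight to give each consideration remains a business judgment.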