DataStax & 451 Group Webinar - Real NoSQL Applications in the Enterprise Today
Dec 7, 2011. Real NoSQL Applications in the Enterprise Today. Apache Cassandra. Jonathan Ellis, CTO, DataStax; Matt Aslett, 451 Group
Welcome and Housekeeping We will email the presentation after the webinar. Please ask questions using the Q&A panel. I will ask the panelists at the end of the presentation. You can contact me at email@example.com
Our presenters
Matt Aslett, Senior Analyst, 451 Group: Matthew covers data management software for The 451 Group's Information Management practice, including relational and non-relational databases, data warehousing and data caching. Matthew is also an expert in open source software and contributes regularly to reports produced through the 451 Commercial Adoption of Open Source (CAOS) Research Service, as well as to the 451 CAOS Theory blog.
Jonathan Ellis, CTO, DataStax: Jonathan is CTO and co-founder at DataStax. Prior to DataStax, Jonathan worked extensively with Apache Cassandra while employed at Rackspace. Prior to Rackspace, Jonathan built a multi-petabyte, scalable storage system based on Reed-Solomon encoding for backup provider Mozy. In addition to his work with DataStax, Jonathan is project chair of Apache Cassandra.
Navigating the NoSQL waters Distributed Horizontally scalable Eventually consistent Non-relational Column store Document stores Key-value Graph … and more
Cassandra: the best for “big data” Elegant architecture Operational flexibility Industry-leading performance You should be using Cassandra for applications requiring: high-performance, realtime queries; scalability past one machine; bulletproof reliability
Cassandra Highlights Multi-master, multi-DC Linearly scalable Larger-than-memory datasets High performance Full durability Integrated caching Tuneable consistency
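The “tuneable consistency” highlight can be illustrated with the usual quorum arithmetic: with N replicas, a read that contacts R replicas is guaranteed to overlap a write acknowledged by W replicas when R + W > N. The sketch below is a minimal illustration of that rule only; the function names are hypothetical and are not part of any Cassandra API.

```python
def quorum(replication_factor: int) -> int:
    """Smallest majority of replicas: floor(N / 2) + 1."""
    return replication_factor // 2 + 1


def reads_see_latest_write(n: int, r: int, w: int) -> bool:
    """True when any r replicas read must include one of the w replicas written."""
    return r + w > n


if __name__ == "__main__":
    n = 3                      # replicas per row (assumed value for illustration)
    q = quorum(n)              # 2 of 3
    print(q, reads_see_latest_write(n, q, q))   # quorum reads + quorum writes overlap
    print(reads_see_latest_write(n, 1, 1))      # single-replica reads and writes may not
```

Lower values of R and W trade that overlap guarantee for lower latency, which is the sense in which consistency is tuned per operation.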
Performance A single four-core machine; one million inserts + one million updates
The Cassandra Difference
                  Scalable Performance   Operational Ease   Cost Effective
Cassandra*                 ✔                    ✔                 ✔
Oracle Exadata             ✔                    ✔                 ✖
MySQL                      ✖                    ✔                 ✔
MongoDB                    ✖                    ✔                 ✔
Sharding                   ✔                    ✖                 ✔
HBase                      ✔                    ✖                 ✔
*And when it comes to Performance, we’re unmatched.
Why Businesses Choose Cassandra
Columns: Vertical | Big-Data Scale | Never Down | Very Fast | Easy to Operate | Non-Structured Data | Flexible Schema | Multi-DC / Cloud | Cost Effective
Media / Advertising ✔ ✔ ✔ ✔ ✔ ✔ ✔
Telecom ✔ ✔ ✔ ✔ ✔ ✔ ✔
Financial ✔ ✔ ✔ ✔ ✔ ✔
Social ✔ ✔ ✔ ✔ ✔ ✔ ✔ ✔
IT (DaaS) ✔ ✔ ✔ ✔ ✔ ✔ ✔ ✔
Healthcare ✔ ✔ ✔ ✔ ✔
Online Retail ✔ ✔ ✔ ✔ ✔ ✔
The most popular types of applications that use Cassandra are those that…
• Are web/SaaS-based, and/or
• Collect high volumes of “Data Exhaust” from machine-generated sources
“With Cassandra, we get better business agility, and we don’t have to plan capacity in advance, we don’t need to ask permission of other people to build things for us, and we don’t worry about running out of space or power.” Adrian Cockcroft, Cloud Architect
Netflix’s problems Could not build datacenters fast enough. Made decision to go to cloud (AWS). Cassandra on AWS is a key infrastructure component of its globally distributed streaming product. Applications include Netflix’s subscriber system, A/B testing, and viewing history service (including positions at which members stopped watching a streaming program).
Netflix on Cassandra Fast Cheap Scalable Flexible No SPOF
“Without Cassandra, our engineers would’ve had to create something that could scale to our needs, that would’ve prevented us from focusing on building product and solving problems for Backupify’s users, which are far more important tasks.” Matt Conway, VP Engineering
Backupify’s problem Cloud-based utility that enables businesses and consumers to back up, search and restore the content of popular online applications such as Google Apps, Gmail, Facebook, Twitter, and Blogger. Needs: horizontal scaling; ability to handle high write loads; elasticity with no manual sharding
Backupify on Cassandra Ease of scale enabled engineers to focus on building great applications. DataStax OpsCenter made it easy to monitor the health and performance of their cluster. Reliable, redundant and scalable low-balance data storage helped eliminate downtime. Ability to offer not only backup and storage but, eventually, analysis of data as well
“You can seamlessly add new nodes and expand your total capacity without deteriorating the performance of the data store. Cassandra has allowed us to scale very effectively.” Harry Robertson, Tech Lead
Ooyala’s problem Ooyala provides a suite of technologies and services that support content owners in managing, analyzing and monetizing the digital video they publish online. Needs: elasticity, to respond to spikes in data scale; ability to respond to increasingly sophisticated analytic needs of customers
Ooyala on Cassandra Classic “Big Data” problem did not require re-architecting. Application agility was enabled – developers spend time building cool apps, not figuring out how to scale. Enabled more powerful and granular analytics for their customers
“Cassandra has allowed us to build bigger features faster and more reliably, while using less money and without needing to expand our staff.” Kyle Ambroff, Sr. Engineer
Formspring’s problem Users of Formspring engage with and learn more about each other by asking and responding to questions. With close to 4B responses in the system and 30M unique users, they needed: to support explosive growth; to seamlessly syndicate user content; to avoid sharding; application flexibility
Formspring on Cassandra No sharding needed – just add nodes to scale Performance – the popular users with many followers saw no speed reduction. No more memcached! Flexibility of a schema-optional architecture is very developer friendly
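The “no sharding needed, just add nodes” point can be made concrete with a consistent-hash ring, the general technique behind this style of scaling. The sketch below is a toy illustration under stated assumptions, not Cassandra's implementation: the `Ring` class, the node names, and the choice of 64 virtual positions per node are all hypothetical.

```python
import bisect
import hashlib


def token(value: str) -> int:
    """Map a string to a position on the ring (MD5 used only to spread keys)."""
    return int(hashlib.md5(value.encode("utf-8")).hexdigest(), 16)


class Ring:
    """Toy consistent-hash ring; each node owns several positions (virtual nodes)."""

    def __init__(self, nodes, vnodes=64):
        self.vnodes = vnodes
        self._entries = []  # sorted list of (token, node_name)
        for node in nodes:
            self.add_node(node)

    def add_node(self, node: str) -> None:
        # Insert this node's positions while keeping the list sorted.
        for i in range(self.vnodes):
            bisect.insort(self._entries, (token(f"{node}#{i}"), node))

    def owner(self, key: str) -> str:
        # First ring position at or after the key's token, wrapping at the end.
        idx = bisect.bisect_left(self._entries, (token(key),))
        return self._entries[idx % len(self._entries)][1]


if __name__ == "__main__":
    keys = [f"user:{i}" for i in range(10_000)]
    ring = Ring(["node-a", "node-b", "node-c", "node-d"])
    before = {k: ring.owner(k) for k in keys}
    ring.add_node("node-e")
    moved = [k for k in keys if ring.owner(k) != before[k]]
    print(f"{len(moved) / len(keys):.0%} of keys moved; all went to the new node:",
          all(ring.owner(k) == "node-e" for k in moved))
```

Because a key only changes owner when one of the new node's positions lands just ahead of it on the ring, adding a node relocates a fraction of the data rather than forcing a full re-shard by the application.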
Why DataStax? DataStax delivers database products and services based on Apache Cassandra from experts who are at the forefront of today’s data revolution.
Database Software & Tools: DataStax Enterprise; DataStax Community; DataStax OpsCenter; Drivers & Connectors
Support & Services: Production Support; Consultative Help; Professional Training; Online Documentation
DataStax Overview Founded in April 2010 Commercial leader in Apache Cassandra™, the popular open-source “big data” database Headquartered in San Francisco Bay area 100+ customers 35+ employees (split between San Fran and Austin) Home to Apache Cassandra Chair & most committers Secured $11M in Series B funding in Sep 2011
DataStax Value The simplest way to get started with Apache Cassandra: DataStax Community Edition A smart, integrated platform that provides Analytics and Real-Time capabilities in the same database, without any resource contention: DataStax Enterprise The backing of the Cassandra Experts
DataStax Enterprise 1. DataStax Enterprise Database Server 2. OpsCenter Enterprise Management solution 3. Expert production support & consultative services
Enterprise Database Server Enterprise-class database built to handle today’s big-data needs in a cost-effective, easy, and reliable way. Leverages resources on-premise or in the cloud. Guarantees uptime with a master-less distributed architecture. Allows for fast application changes via flexible schemas. Handles structured, semi-structured, and unstructured data. Provides advanced security. Eliminates the need for a separate analytics system. [Diagram: Real-Time, Replication, Analytics]
OpsCenter Enterprise OpsCenter Enterprise supplies management, monitoring, and control over DataStax Enterprise. Visual, browser-based user interface. Administration tasks carried out in point-and-click fashion. Allows for visual rebalance of data across a cluster when new nodes are added. Proactive alerts that warn of impending issues. Built-in external notification abilities
Expert Production Support DataStax Enterprise includes production support and consultative services from the Cassandra experts. Support service level agreements that range from business hours to 24x7x365. Consultative support for assistance on architecture, design, and tuning. Certified quarterly service packs. Hot-fix support
DataStax – Your One-Stop Shop DataStax Enterprise and Community Editions. Professional Training, Expert Consulting. Documentation and Dev Center: http://www.datastax.com/docs http://www.datastax.com/dev Whitepapers, Case Studies, FAQs and more: http://www.datastax.com/resources/whitepapers http://www.datastax.com/resources/casestudies Thank you!