Your SlideShare is downloading. ×
0
Announcing Spark Driver for Cassandra
Announcing Spark Driver for Cassandra
Announcing Spark Driver for Cassandra
Announcing Spark Driver for Cassandra
Announcing Spark Driver for Cassandra
Announcing Spark Driver for Cassandra
Announcing Spark Driver for Cassandra
Announcing Spark Driver for Cassandra
Announcing Spark Driver for Cassandra
Announcing Spark Driver for Cassandra
Announcing Spark Driver for Cassandra
Announcing Spark Driver for Cassandra
Announcing Spark Driver for Cassandra
Announcing Spark Driver for Cassandra
Announcing Spark Driver for Cassandra
Announcing Spark Driver for Cassandra
Announcing Spark Driver for Cassandra
Announcing Spark Driver for Cassandra
Announcing Spark Driver for Cassandra
Upcoming SlideShare
Loading in...5
×

Thanks for flagging this SlideShare!

Oops! An error has occurred.

×
Saving this for later? Get the SlideShare app to save on your phone or tablet. Read anywhere, anytime – even offline.
Text the download link to your phone
Standard text messaging rates apply

Announcing Spark Driver for Cassandra

3,862

Published on

Spark Driver v 1.0 is now available with free Apache 2.0 license with DataStax. DataStax and Databricks have partnered together to make it easier than ever before to integrate both open source …

Spark Driver v 1.0 is now available with free Apache 2.0 license with DataStax. DataStax and Databricks have partnered together to make it easier than ever before to integrate both open source technologies and help the most progressive data-driven businesses succeed today. Cassandra and Spark together arm users with analysis to power online recommendations, personalized customer experiences, sensor-data detection, fraud detection and more.

Published in: Data & Analytics, Technology
0 Comments
6 Likes
Statistics
Notes
  • Be the first to comment

No Downloads
Views
Total Views
3,862
On Slideshare
0
From Embeds
0
Number of Embeds
16
Actions
Shares
0
Downloads
70
Comments
0
Likes
6
Embeds 0
No embeds

Report content
Flagged as inappropriate Flag as inappropriate
Flag as inappropriate

Select your reason for flagging this presentation as inappropriate.

Cancel
No notes for slide
  • So now that we covered what Cassandra is and why it is so popular…. I am excited to announce that version one of the integration is done and available as of today
  • Transcript

    • 1. Spark and Cassandra Martin Van Ryswyk, EVP Engineering
    • 2. Partnership DataStax 2014 2 Jonathan Ellis Chairman, Apache Cassandra + Matei Zaharia Creator, Apache Spark Announced in May 2014 Based on clear feedback from both communities Committed to an integration between Spark and Cassandra
    • 3. 3 What is Apache Cassandra?
    • 4. Apache Cassandra Massively scalable database Always on Fully distributed Linear scale performance Flexible NoSQL data model Operationally simple SQL-Like language Free tools and drivers San Francisco New York Dubai ©2014 DataStax
    • 5. Apache Cassandra
    • 6. Netflix Ensures Constant Uptime with DataStax Enterprise World’s leading streaming media provider with digital revenue $1.5BN+ 95% of all Netflix data stored in DSE Introduction of ‘Profiles’ drove throughput to over 10M transactions per second Does 1 trillion transactions/day with DSE Replaced Oracle in six data centers, worldwide, 100% in the cloud 6
    • 7. Delivers 150+ Billion Content Recommendations Per Month Serves content for largest media brands in the world: Reuters, Wall St Journal, USA Today Needed a massively scalable data store High velocity of data with 58,000 links to content per second Always-on data architecture Lost a data center during Hurricane Sandy but never went offline 7
    • 8. The Weather Channel While I have years of experience using Cassandra, my team was mostly new to it; CQL made their transition essentially painless. But where Cassandra really shines is in speed and operational simplicity, and I would say those two points were critical.” ROBBIE STRICKLAND Software Dev Manager 8
    • 9. Faster Feedback Loops 9 AnalyticalTransactional C*
    • 10. 10 Announcing cassandra-driver-spark
    • 11. cassandra-driver-spark v1.0 • Developed by DataStax with support and review by Databricks • Free with Apache 2.0 license from DataStax https://github.com/datastax/cassandra-driver-spark • Question to Apache Spark User List • user@spark.apache.org • Offering driver code to Apache Spark community ©2014 DataStax
    • 12. • Exposes Cassandra tables as RDD • Map table rows to CassandraRow objects • Data type conversions between Cassandra and Scala • Save RDDs back to Cassandra by implicit saveToCassandra call • Filter rows on server via CQL WHERE clause (CassandraRDD#where method) • Select subset of columns (CassandraRDD#select method) • Optimizations for Cassandra vnodes ©2014 DataStax cassandra-driver-spark v1.0
    • 13. Example ©2014 DataStax
    • 14. Performance ©2014 DataStax 0 10 20 30 40 50 60 Extreme Simple seconds Hadoop M/R Spark Extreme: data not in memory and fetched from multiple nodes Simple: data set fits in memory 2-30x faster than highly optimized “Hadoop on C*” implementation in DataStax Enterprise
    • 15. 15 What is DataStax Enterprise?
    • 16. Security Analytics Search Visual Monitoring Management Services In-Memory Delivering Apache Cassandra to the Enterprise Dev.IDE& Drivers Professional Services Support& Training Strong Data Protection In-Memory OLTP/Analytics Point-and-Click/Automated Mgmt Certified Production Cassandra Multi-Workload/Use Case Capable Integrated OLTP, Analytics, Search ©2014 DataStax
    • 17. 17 Announcing Certified Spark Distribution
    • 18. cassandra-driver-spark Roadmap • Integrate into Apache Spark • Update for Spark 1.0 • Support for Additional Spark components • Spark SQL • Spark Streaming • GraphX • Listen to both communities…
    • 19. Next Steps datastax.com/cassandrasummit14 Today: Attend Tupshin and Al’s talk at 1pm Visit DataStax booth – meet experts Tonight: Download the integration github.com/datastax/cassandra-driver-spark Learn about Cassandra planetcassandra.org Download DataStax Enterprise www.datastax.com/downloads September:

    ×