Datastax Expedia


Published on

Expedia Use Case for DataStax Enterprise

Published in: Technology
  • Be the first to comment

No Downloads
Total views
On SlideShare
From Embeds
Number of Embeds
Embeds 0
No embeds

No notes for slide

Datastax Expedia

  1. 1. Cassandra Evolution at Expedia
  2. 2. Adoption timeline 2011 • Q2 Cassandra 0.6 live for First Pricing index • Q3 Cassandra 0.7 live for Session state cache • Q3 Cassandra 0.7 for high speed transaction logging • Q4 Cassandra Splunk integration complete for put/get • Q4 Cassandra Web Portal and RESTful interface available 2012 • Q2 DataStax upgrade for first pricing index & enable SOLR • Q2 DataStax live for Second Pricing index & enable SOLR • Q2 DataStax live for high speed transaction logging • Q2 Onboarding new markets to indexes • Q2 DataStax PoC for Content cache • Q3 Upgrade Web Portal to use SOLR and Hadoop functionality
  3. 3. Pricing Index Use Case
  4. 4. Pricing Index Ring SolutionCurrent Solution• 9 nodes split across 3 racks running Cassandra 1.0• There is a separate 2 node App tier written in Java that access data in CFS• On top of that is a 4 node SOLR tier for searchNext Gen 2Q12 solution• DataStax upgrade and elimination of app and SOLR nodes• Given this change we can onboard two new indexes and all major markets• New ring is 27 nodes in one physical datacenter and will expand to 54 nodes in June split across two datacenters• Expanding from initial small market set to more markets, price elements and options• Using DataStax solution is showing positive results in the lab and shaving seconds off of user search times
  5. 5. High Speed Logging Use Case
  6. 6. Logging Ring SolutionCurrent Solution• 40 MSSQL database high end servers using 160TB Tier 1 SAN and 14 days of online information and no redundancy or replication• 5 Reporting archive MSSQL databases that keep 30 day roll ups of the 40 databases for trending• Need to know SQL to pull any data or reports, have multiple db developers working on teamNext Gen 2Q12 solution• DataStax deployment of 72 commodity nodes across 2 physical datacenter with replication• 1/3 of nodes will have Hadoop enabled for reporting and will be available via web service for non-SQL business personnel• 90 day retention window of raw data• Cost Savings $1.5M in 2012
  7. 7. Logging Ring Architecture Data Data Center 1 Center 2
  8. 8. Access Architecture
  9. 9. Summary• Current footprint 132 nodes production, 27 nodes dev, 24 nodes PoC• Expedia is investing heavily in Cassandra and specifically DataStax solution to optimize user experience• Heavily leveraging Hadoop and SOLR integrations to get rapid value from solutions• Using DataStax is shaving seconds off of search times which equates to significantly better customer experience• Continuing to look at new use cases and expect to have 240 nodes of DataStax in production by Q412
  10. 10. Questions ?