
Oracle no sql overview brief


  1. Oracle NoSQL Database
     Dave Rubin, Director – NoSQL Database Development
  2. The following is intended to outline our general product direction. It is intended for information purposes only, and may not be incorporated into any contract. It is not a commitment to deliver any material, code, or functionality, and should not be relied upon in making purchasing decisions. The development, release, and timing of any features or functionality described for Oracle’s products remains at the sole discretion of Oracle.
  3. Agenda
     • NoSQL Use Case
     • Oracle NoSQL Database
       • Architecture
       • Integration with the RDBMS
       • Benchmark Results
  4. Use Case – Online Display Advertising
     • Problem
       • Very low latency requirements – Publishers require 50 – 60 ms response time from the ad serving platform
       • Extreme data velocity – Multi-millions of requests per second
       • Highly available – 24/7 sites
       • Revenue maximization – Deliver the most relevant ad to maximize revenue
     • Solution – Where to use a NoSQL Database?
       • Cookie store – NoSQL database used to store cookies and associated behavioral segments
       • Track behavioral data – Beacons utilized during browsing to store timestamp, frequency, and behavioral segments by cookie
       • Optimize ad delivery – Recency, frequency, and behavioral segments used to determine optimal ad to deliver to user
  5. Online Display Advertising Overall Solution
     [Diagram components: Real Time Reporting and Campaign Management, RDBMS, Hadoop Cluster, Ad Server, Multi Dimensional Reporting]
  6. Online Display Advertising – Usage Characteristics
     • NoSQL Database
       • Low latency, high volume
         • Millions of ad serving requests per minute or second
         • Stringent latency requirements from publishers
       • Loose consistency
         • Cookie data used for ad targeting – Increase probability that user will click on ad
     • Relational Database
       • Campaign booking information – hundreds of users
       • Real time business metrics for publishers and advertisers
       • Business financials for ad serving company
         • Year to date revenue, quarter over quarter, etc.
       • Billing
       • SOX reporting for public companies
     • Hadoop
       • Unique visits (select count(distinct)) over many terabytes of data
       • Inventory forecasting across behavioral segments
  7. Agenda
     • NoSQL Use Case
     • Oracle NoSQL Database
       • Architecture
       • Integration with the RDBMS
       • Benchmark Results
  8. A Distributed, Scalable Key-Value Database
     • Simple Data Model
       • Key-value pair with major+minor-key paradigm
       • CRUD + range scans
     • Scalability
       • Dynamic data partitioning and distribution
       • Optimized data access via intelligent driver
     • High availability
       • One or more replicas
       • Resilient to partition failures
       • Disaster recovery through location of replicas
       • No single point of failure
     • Transparent load balancing
       • Reads from master or replicas
       • Driver is network topology & latency aware
     • Elastic Expansion
       • Online addition/removal of storage nodes and automatic data redistribution
     [Diagram: two Applications, each with a NoSQL DB Driver, in front of Storage Nodes in Data Center A and Data Center B]
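The major+minor key model and the "CRUD + range scans" bullet can be illustrated with a small in-memory sketch. This is not the Oracle NoSQL Database API: `ToyKVStore`, its methods, and the cookie keys are hypothetical names, and the sketch assumes that only the major key is hashed to pick a shard, so that all records sharing a major key are stored together.

```python
from bisect import bisect_left, insort
from zlib import crc32


class ToyKVStore:
    """Hypothetical in-memory model of a major+minor key store (not the Oracle API)."""

    def __init__(self, num_shards):
        # Each shard keeps a sorted list of (major, minor) keys plus a value dict.
        self.shards = [{"keys": [], "data": {}} for _ in range(num_shards)]

    def _shard(self, major):
        # Assumption: only the major key is hashed, so records that share a
        # major key always land on the same shard.
        return self.shards[crc32(major.encode("utf-8")) % len(self.shards)]

    def put(self, major, minor, value):
        shard = self._shard(major)
        key = (major, minor)
        if key not in shard["data"]:
            insort(shard["keys"], key)  # keep keys sorted for range scans
        shard["data"][key] = value

    def get(self, major, minor):
        return self._shard(major)["data"].get((major, minor))

    def delete(self, major, minor):
        shard = self._shard(major)
        key = (major, minor)
        if key in shard["data"]:
            del shard["data"][key]
            shard["keys"].remove(key)

    def scan(self, major, start="", end=None):
        """Return (minor, value) pairs under one major key, start <= minor < end."""
        shard = self._shard(major)
        keys = shard["keys"]
        i = bisect_left(keys, (major, start))
        results = []
        while i < len(keys) and keys[i][0] == major:
            minor = keys[i][1]
            if end is not None and minor >= end:
                break
            results.append((minor, shard["data"][keys[i]]))
            i += 1
        return results


# Illustrative usage: one cookie (major key) with several attributes (minor keys).
store = ToyKVStore(num_shards=4)
store.put("cookie42", "seg/auto", "a")
store.put("cookie42", "seg/travel", "b")
store.put("cookie42", "ts/last", "c")
store.put("cookie7", "seg/auto", "d")
segments = store.scan("cookie42", start="seg/", end="seg0")
```

Because `"/"` sorts immediately before `"0"`, the half-open range `["seg/", "seg0")` selects exactly the minor keys that start with `seg/`.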
  9. Architecture – The Application’s Perspective
     [Diagram: Application with NoSQL DB Driver, connected to Shard 1, Shard 2, ..., Shard N; each shard has a Master and Replicas]
  10. Transactions
      • ACID transactions at shard granularity
      • Transaction Scope
        • Single API call
        • All records must have the same major key
        • Multiple operations within a transaction via collections
      • Can be relaxed for increased performance on a per-operation basis
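The transaction scope rule (one call, one major key, several operations applied together) can be sketched as follows. This is a minimal illustration over a plain dictionary, not the product's API: `execute_atomically`, `MajorKeyMismatch`, and the operation tuples are hypothetical.

```python
class MajorKeyMismatch(ValueError):
    """Raised when one batch mixes records with different major keys."""


def execute_atomically(store, operations):
    """Apply every operation or none of them.

    store: dict keyed by (major, minor).
    operations: list of ("put", major, minor, value) or ("delete", major, minor).
    """
    majors = {op[1] for op in operations}
    if len(majors) > 1:
        # Mirrors the slide's rule: all records must share the same major key.
        raise MajorKeyMismatch(f"batch spans major keys: {sorted(majors)}")

    staged = dict(store)  # work on a copy so a failure leaves the store untouched
    for op in operations:
        kind, major, minor = op[0], op[1], op[2]
        if kind == "put":
            staged[(major, minor)] = op[3]
        elif kind == "delete":
            if (major, minor) not in staged:
                raise KeyError((major, minor))
            del staged[(major, minor)]
        else:
            raise ValueError(f"unknown operation: {kind}")

    # Commit: replace the store's contents only after every step succeeded.
    store.clear()
    store.update(staged)
    return len(operations)


records = {}
applied = execute_atomically(
    records,
    [("put", "user1", "name", "x"), ("put", "user1", "email", "y")],
)
```

Staging on a copy is the simplest way to get all-or-nothing behavior in a toy example; a real store would rely on its own logging and locking instead.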
  11. Simple Data Model ACID Transactions – Configurability
      • Configurable Durability Policy
      • Configurable Consistency Policy
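One part of a durability policy is how many replicas must acknowledge a write before it is reported as successful. The sketch below shows that arithmetic only. The policy names follow the 'Majority' and 'NONE' labels used on the benchmark slides plus an assumed 'ALL'; the function name is hypothetical, and the sketch assumes the master itself counts toward the majority.

```python
def replica_acks_required(policy, replication_factor):
    """Number of replicas (excluding the master) that must acknowledge a write."""
    replicas = replication_factor - 1
    if policy == "NONE":
        return 0
    if policy == "ALL":
        return replicas
    if policy == "MAJORITY":
        # Assumption: the master counts as one member of the majority.
        majority = replication_factor // 2 + 1
        return majority - 1
    raise ValueError(f"unknown ack policy: {policy}")


# With the benchmark's replication factor of 3 (master + 2 replicas):
acks_by_policy = {p: replica_acks_required(p, 3) for p in ("NONE", "MAJORITY", "ALL")}
```

Under these assumptions a 'MAJORITY' write with replication factor 3 waits for one replica, which is why relaxing the policy to 'NONE' can raise throughput at the cost of weaker durability guarantees.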
  12. Integration with the RDBMS and Other Products
      • Oracle External Tables
        • Export data directly from NoSQL database and create Oracle External Table
        • Pre-packaged utility
      • Oracle Loader for Hadoop
        • Parallel map reduce job
        • Utilizes InputFormat
      • Oracle Event Processing
        • NoSQL data available through OEP query language (CQL)
  13. Benchmarks – General Configuration
      • YCSB-based QA/benchmarking
        • Key ~= 10 bytes, Data = 1108 bytes
      • Configurations of 6-30 nodes
        • Typical Replication Factor of 3 (master + 2 replicas)
        • 200M records per shard, 2 billion records in total
        • 2 replication nodes per storage node
        • Used SSDs, two per host
      • Minimal I/O overhead
        • B+Tree fits in memory => one I/O per record read
        • Writes are buffered + log structured storage system => fast write throughput
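The configuration figures can be cross-checked with a few lines of arithmetic. This is only a back-of-the-envelope sketch: it treats the approximate 10-byte key as exact and ignores any storage or index overhead, which the slides do not quantify.

```python
RECORDS_TOTAL = 2_000_000_000      # "2 billion records in total"
RECORDS_PER_SHARD = 200_000_000    # "200M records per shard"
REPLICATION_FACTOR = 3             # master + 2 replicas
KEY_BYTES = 10                     # slide says "~= 10 bytes"; treated as exact here
VALUE_BYTES = 1108

# Number of shards needed to hold all records at the stated per-shard load.
shards = RECORDS_TOTAL // RECORDS_PER_SHARD

# Each shard has one replication node per copy of the data.
replication_nodes = shards * REPLICATION_FACTOR

# Raw payload size, ignoring storage and index overhead (an assumption).
bytes_per_copy = RECORDS_TOTAL * (KEY_BYTES + VALUE_BYTES)
bytes_replicated = bytes_per_copy * REPLICATION_FACTOR

print(f"shards: {shards}")
print(f"replication nodes: {replication_nodes}")
print(f"raw data per copy: {bytes_per_copy / 1e12:.3f} TB")
print(f"raw data with replication: {bytes_replicated / 1e12:.3f} TB")
```

The result of 10 shards and 30 replication nodes agrees with the largest cluster label used in the benchmark charts, 30 (10x3).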
  14. Benchmark Results
      • 2 billion records
      • 226K ops/sec
      • HA ack. policy = ‘Majority’
      • Low latency
      • Highly Scalable
      [Chart: Insert Throughput. X axis: Cluster Size, 6 (2x3), 12 (4x3), 24 (8x3), 30 (10x3). Left axis: Throughput (ops/sec), 0 to 250,000. Right axis: Average Latency (ms), 0 to 4. Series: Throughput (insert/sec), Write Latency (ms)]
  15. Benchmark Results (cont.)
      • 95% read, 5% update
      • 2 billion records
      • 1.25M ops/sec
      • HA ack. policy = ‘Majority’
      • Low read/write latency
      • Highly Scalable
      [Chart: Mixed Throughput. X axis: Cluster Size, 6 (2x3), 12 (4x3), 24 (8x3), 30 (10x3). Left axis: Throughput (ops/sec), 0 to 1,400,000. Right axis: Average Latency (ms), 0 to 4. Series: Throughput (ops/sec), Write Latency (ms), Read Latency (ms)]
  16. Benchmark Results (cont.)
      • Changed ack-policy from ‘MAJORITY’ to ‘NONE’
      • Throughput increased from 226K to 407K ops/sec
      • 80% improvement
      [Chart: Insert Throughput at cluster size 30 (10x3). Axis: Throughput (ops/sec), 0 to 500,000. Series: Majority, None]
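The 80% figure follows directly from the two throughput numbers on the slide. A minimal check, with `relative_improvement` as a hypothetical helper name:

```python
def relative_improvement(before, after):
    """Fractional gain of `after` over `before`."""
    return (after - before) / before


# Insert throughput at cluster size 30 (10x3), in ops/sec, as stated on the slide.
majority_ops = 226_000
none_ops = 407_000

gain = relative_improvement(majority_ops, none_ops)
print(f"improvement: {gain:.1%}")
```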
  17. Questions
