Bleeding Edge Databases
Upcoming SlideShare
Loading in...5
×
 

Bleeding Edge Databases

on

  • 494 views

On Aerospike, AlgebraixData and Google BigQuery for BigDataCampLA

On Aerospike, AlgebraixData and Google BigQuery for BigDataCampLA

Statistics

Views

Total Views
494
Views on SlideShare
484
Embed Views
10

Actions

Likes
1
Downloads
9
Comments
1

3 Embeds 10

https://twitter.com 8
https://www.linkedin.com 1
http://www.slideee.com 1

Accessibility

Categories

Upload Details

Uploaded via as Microsoft PowerPoint

Usage Rights

© All Rights Reserved

Report content

Flagged as inappropriate Flag as inappropriate
Flag as inappropriate

Select your reason for flagging this presentation as inappropriate.

Cancel
  • Full Name Full Name Comment goes here.
    Are you sure you want to
    Your message goes here
    Processing…
Post Comment
Edit your comment
  • http://db-engines.com/en/ranking_trend
  • http://documentary.net/the-art-of-data-visualization/
  • http://www.aerospike.com/blog/aerospike-doubles-in-memory-nosql-database-performance/ <br /> <br /> 8 CPU & 32 GB RAM
  • Results by Thumbtack Technology
  • YCSB Benchmark
  • http://www.aerospike.com/free-aerospike-3-community-edition/
  • http://lod-cloud.net/versions/2011-09-19/lod-cloud.html
  • http://dbis.informatik.uni-freiburg.de/index.php?project=SP2B <br /> http://www.algebraixdata.com/algebraix-data-achieves-unrivaled-semantic-benchmark-performance/
  • http://demo.algebraixdata.com/#!/ss/math
  • Mathematics-based data management platform <br /> Kernel for any data model <br /> High performance <br /> High scalability <br /> Self-tuning <br /> Automatic data re-organization <br /> Small footprint <br />
  • http://www.algebraixdata.com/
  • http://gdeltproject.org/
  • http://martinfowler.com/articles/bigQueryPOC.html
  • https://developers.google.com/bigquery/pricing#data <br /> <br /> http://g-calculator.appspot.com/bigtable.html
  • http://www.megapivot.com/blog/posts/redshift-vs-bigquery-vs-hadoop.html <br /> <br /> http://courses.cs.washington.edu/courses/cse544/13sp/final-projects/p18-lijl.pdf
  • http://bigqueri.es/categories
  • https://developers.google.com/bigquery/third-party-tools <br /> <br /> http://bigquery.bimeanalytics.com/
  • http://bigqueri.es/ <br /> <br /> https://developers.google.com/bigquery/streaming-data-into-bigquery
  • https://cloud.google.com/developers/starterpack/
  • www.teachingkidsprogramming.org

Bleeding Edge Databases Bleeding Edge Databases Presentation Transcript

  • Bleeding Edge Databases @LynnLangit
  • Unstructured Data
  • Live Tweets on a Building
  • What is Aerospike?
  • Benchmark Results • 200,000 tps (read-write) & 300,000 tps (read-heavy) • 10X Faster for R/W loads on SSDs
  • DEMO
  • More Benchmark Results Config • 10G network • Aerospike 3 • Same hardware • 4-node CentOS Data • 500GB • 50M records Each Record • 100 bytes • 23 byte key • 10 fields
  • Aerospike Architecture
  • Example Architecture
  • How to try it out • Bare metal or pick a Cloud, set up a VM • Get the free community edition • Go…
  • Linked Open Data Cloud
  • What is Algebraix Data? IoT – Semantic Web Super Powerful 1 Billion Triples on 1 Node Native Mathematical Engine Triple store RDF (Graph)
  • SPARQL Server™ W3C & OGC compliant RDF / SPARQL Semantic Database Natively built with proprietary Math • Algebraix technology (and patents) Runs on commodity hardware • In the cloud (or on premise) • Scales Up and Down Significantly better benchmark performance • over leading RDF databases
  • Benchmark Results • SP2Bench SPARQL Performance Benchmark
  • SP^2 Benchmark Visualized
  • DEMO
  • It’s the Math…
  • Patents
  • Runs on common hardware • Any Cloud or • On Rremises High Performance & Capacity • Needs no indexes • Works particularly well w/sparse data Self-tuning • Retains results & intermediate sets • Supports point- in-time queries SPARQL Server™
  • Algebraix Solution Stack Data Algebra DatabaseNoSQL Relational RDF Semantic Applications Meaning Organization Optimization & Execution Conceptual Data Loaders Query Translators • Modern abstract algebra • Zermelo-Fraenkel set theory • Mathematics-based data management platform • Universal data language • Collection of I.P. • SPARQL Server – RDF • A2DB - Relational • Search • Analytics • Business Intelligence • Data Integration Algebraix Platform
  • How to try it out • Sign up on their website • Try out when notified (this July)
  • What is Google Big Query? QaaS – interactive RESTful web service SQL-like language Queries data stored in Google cloud Wide Column Tables Uses OAuth for access control Very Fast 750M Rows in <10 secs
  • Easy & Fast •Text or Json •Up to 100k inserts/sec (streaming) Load it •Supports core SQL query concepts •SELECT, FROM, JOIN, WHERE, ORDER BY, GROUP BY •Windowing functions (OVER / PARTITION) •Common Aggregates (SUM, COUNT, MAX) •Includes ‘analytic’ SQL •STDDEV, VARIANCE, CORRELATION •REGEXP_MATCH Query it •Query is $ 5 per TB processed •Storage is around $30 TB per month Pay (for) it
  • Benchmark Results • TCP-H Benchmark
  • DEMO
  • Partners and BigQuery Google Sheets Tableau QlikView Bime Excel
  • How to try it out • Set up a Google Cloud account • Upload or stream data • Query
  • Google Cloud Starter Pack Use code “gde-in”
  • Next steps Try them out @LynnLangit