• Like
Bleeding Edge Databases
Upcoming SlideShare
Loading in...5
×

Thanks for flagging this SlideShare!

Oops! An error has occurred.

Bleeding Edge Databases

  • 600 views
Published

On Aerospike, AlgebraixData and Google BigQuery for BigDataCampLA

On Aerospike, AlgebraixData and Google BigQuery for BigDataCampLA

Published in Technology , Education
  • Full Name Full Name Comment goes here.
    Are you sure you want to
    Your message goes here
No Downloads

Views

Total Views
600
On SlideShare
0
From Embeds
0
Number of Embeds
3

Actions

Shares
Downloads
10
Comments
1
Likes
1

Embeds 0

No embeds

Report content

Flagged as inappropriate Flag as inappropriate
Flag as inappropriate

Select your reason for flagging this presentation as inappropriate.

Cancel
    No notes for slide
  • http://db-engines.com/en/ranking_trend
  • http://documentary.net/the-art-of-data-visualization/
  • http://www.aerospike.com/blog/aerospike-doubles-in-memory-nosql-database-performance/

    8 CPU & 32 GB RAM
  • Results by Thumbtack Technology
  • YCSB Benchmark
  • http://www.aerospike.com/free-aerospike-3-community-edition/
  • http://lod-cloud.net/versions/2011-09-19/lod-cloud.html
  • http://dbis.informatik.uni-freiburg.de/index.php?project=SP2B
    http://www.algebraixdata.com/algebraix-data-achieves-unrivaled-semantic-benchmark-performance/
  • http://demo.algebraixdata.com/#!/ss/math
  • Mathematics-based data management platform
    Kernel for any data model
    High performance
    High scalability
    Self-tuning
    Automatic data re-organization
    Small footprint
  • http://www.algebraixdata.com/
  • http://gdeltproject.org/
  • http://martinfowler.com/articles/bigQueryPOC.html
  • https://developers.google.com/bigquery/pricing#data

    http://g-calculator.appspot.com/bigtable.html
  • http://www.megapivot.com/blog/posts/redshift-vs-bigquery-vs-hadoop.html

    http://courses.cs.washington.edu/courses/cse544/13sp/final-projects/p18-lijl.pdf
  • http://bigqueri.es/categories
  • https://developers.google.com/bigquery/third-party-tools

    http://bigquery.bimeanalytics.com/
  • http://bigqueri.es/

    https://developers.google.com/bigquery/streaming-data-into-bigquery
  • https://cloud.google.com/developers/starterpack/
  • www.teachingkidsprogramming.org

Transcript

  • 1. Bleeding Edge Databases @LynnLangit
  • 2. Unstructured Data
  • 3. Live Tweets on a Building
  • 4. What is Aerospike?
  • 5. Benchmark Results • 200,000 tps (read-write) & 300,000 tps (read-heavy) • 10X Faster for R/W loads on SSDs
  • 6. DEMO
  • 7. More Benchmark Results Config • 10G network • Aerospike 3 • Same hardware • 4-node CentOS Data • 500GB • 50M records Each Record • 100 bytes • 23 byte key • 10 fields
  • 8. Aerospike Architecture
  • 9. Example Architecture
  • 10. How to try it out • Bare metal or pick a Cloud, set up a VM • Get the free community edition • Go…
  • 11. Linked Open Data Cloud
  • 12. What is Algebraix Data? IoT – Semantic Web Super Powerful 1 Billion Triples on 1 Node Native Mathematical Engine Triple store RDF (Graph)
  • 13. SPARQL Server™ W3C & OGC compliant RDF / SPARQL Semantic Database Natively built with proprietary Math • Algebraix technology (and patents) Runs on commodity hardware • In the cloud (or on premise) • Scales Up and Down Significantly better benchmark performance • over leading RDF databases
  • 14. Benchmark Results • SP2Bench SPARQL Performance Benchmark
  • 15. SP^2 Benchmark Visualized
  • 16. DEMO
  • 17. It’s the Math…
  • 18. Patents
  • 19. Runs on common hardware • Any Cloud or • On Rremises High Performance & Capacity • Needs no indexes • Works particularly well w/sparse data Self-tuning • Retains results & intermediate sets • Supports point- in-time queries SPARQL Server™
  • 20. Algebraix Solution Stack Data Algebra DatabaseNoSQL Relational RDF Semantic Applications Meaning Organization Optimization & Execution Conceptual Data Loaders Query Translators • Modern abstract algebra • Zermelo-Fraenkel set theory • Mathematics-based data management platform • Universal data language • Collection of I.P. • SPARQL Server – RDF • A2DB - Relational • Search • Analytics • Business Intelligence • Data Integration Algebraix Platform
  • 21. How to try it out • Sign up on their website • Try out when notified (this July)
  • 22. What is Google Big Query? QaaS – interactive RESTful web service SQL-like language Queries data stored in Google cloud Wide Column Tables Uses OAuth for access control Very Fast 750M Rows in <10 secs
  • 23. Easy & Fast •Text or Json •Up to 100k inserts/sec (streaming) Load it •Supports core SQL query concepts •SELECT, FROM, JOIN, WHERE, ORDER BY, GROUP BY •Windowing functions (OVER / PARTITION) •Common Aggregates (SUM, COUNT, MAX) •Includes ‘analytic’ SQL •STDDEV, VARIANCE, CORRELATION •REGEXP_MATCH Query it •Query is $ 5 per TB processed •Storage is around $30 TB per month Pay (for) it
  • 24. Benchmark Results • TCP-H Benchmark
  • 25. DEMO
  • 26. Partners and BigQuery Google Sheets Tableau QlikView Bime Excel
  • 27. How to try it out • Set up a Google Cloud account • Upload or stream data • Query
  • 28. Google Cloud Starter Pack Use code “gde-in”
  • 29. Next steps Try them out @LynnLangit