SlideShare a Scribd company logo
@arnon86@sqreamtech
GPU DATABASES:
HOW TO USE THEM
AND WHAT THE FUTURE HOLDS
or
GD: HTUT AWTFH
for short
@arnon86@sqreamtech
Before we start…
•We offer a free consultation and assessment
to anyone here
•We can help you understand the benefits of
using a GPU database
@arnon86@sqreamtech
Who I am
•From Israel
•4 years at SQream
•Originally part of the dev team
•Tweet about animals a lot - @arnon86
@arnon86@sqreamtech
Who I am
•A big aviation nerd
@arnon86@sqreamtech
“Moore’s law is ending”
@arnon86@sqreamtech
“The consensus was that if we could keep
doing that, if we could go to chips with
1,000 cores, everything would be fine,”
@arnon86@sqreamtech
“It turns out that’s really hard”
Dr. Doug Burger, an expert in chip design at Microsoft.
@arnon86@sqreamtech
So we just take things parallel, right?
@arnon86@sqreamtech
Let’s talk BIG data
Hundreds of TB
(Sometimes even petabytes of data)
coming in at a rate of multiple terabytes per day
Up to 1-4TB
2010 20162008
Up to 10TB
Data is STILL growing exponentially
@arnon86@sqreamtech
530 PB
12000
PB
15000
PB
CERN NSA Google
We’re in the petabyte age
• Petabyte datasets are now the norm
• Even small companies have dozens of terabytes of data for analysis
• Some outliers have more:
– CERN processes 1 petabyte per day,
stores 530 PB total
– In 2012, Facebook analyzed 5 petabytes per day,
stores estimated a few exabytes
– The NSA might hold 12 exabytes
Are we only analyzing the tip of the iceberg?
@arnon86@sqreamtech
What we’ll talk about
•Why GPUs?
•What are GPU databases?
•When are GPU databases good?
•The future
@arnon86@sqreamtech
@arnon86@sqreamtech
What is a GPU?
• A processor specialized for display functions
• The GPU renders images, animations and video for the computer's screen.
@arnon86@sqreamtech
What is a GPGPU?
• A general-purpose GPU (GPGPU) is a GPU that performs non-specialized calculations that
would typically be conducted by the CPU.
• Put simply, it’s about taking the GPU and generalizing it for non-graphics.
• AMD and NVIDIA have their own APIs for doing GPGPU programming – rockM and CUDA
respectively.
@arnon86@sqreamtech
Let’s talk core count
@arnon86@sqreamtech
Tesla p100 – 3584 cuda cores
@arnon86@sqreamtech
it’s not a strange piece of hardware
@arnon86@sqreamtech
Gpus all around
• Pretty much all cloud providers now offer GPU instances
• Most hardware vendors offer specially tuned GPU servers
GPUCLOUD
@arnon86@sqreamtech
How gpu acceleration works
@arnon86@sqreamtech
What are GPU Databases?
• A GPU database is a database, relational or non-relational, that uses a GPU to perform
some database operations
• Most of the GPU databases tend to focus on analytics, and they’re offering it to a market
that was oversold on Hadoop for Big Data analytics
• And they’re typically pretty fast
And they’re not only disrupting the in-memory crowd
• GPU databases are more flexible in processing many different types of data, or much
larger amounts of data
@arnon86@sqreamtech
Why gpus in big data?
• High core count allows offloading of ‘heavy’ stuff like JOINs, ORDER BY, GROUP BY from the
CPU to the GPU
• Compression and Decompression processes reduce PCI and disk I/O. These are basically
free on the GPU
• Can also use GPU to do computationally intensive operations like deep learning,
cryptography.
@arnon86@sqreamtech
Today’s data market - databases
• A lot of new databases are in-memory, because “memory is cheap”
• In-memory can’t handle more than ~2TB without very expensive hardware
• Scaling out with in-memory gets very expensive, very fast:
8 SAP HANA machines for handling 40TB has a TCO of $22,000,000 for 4 years
@arnon86@sqreamtech
There’s more than one type of gpu database
In-memory GPU databases
• Typically for small datasets
• Stores data in-memory
• Very fast performance (milliseconds)
• For relatively simple queries
• Limited due to memory constraints
Big Data GPU databases
• Typically for giant datasets
• Stores data on-disk
• Fast performance (seconds-minutes)
• For complex queries
• Theoretically unlimited data-sets
• A good fit for today’s evolving needs
@arnon86@sqreamtech
Don’t BUY hardware, BUY the results
• Your boss (probably) does not care about the chips in the servers
• GPU is a cool buzzword, but buzzwords alone won’t get the job done
• Achieve incredible speeds without betting the (server) farm
• Evaluate databases based on functionality and what they can do for you
@arnon86@sqreamtech
@arnon86@sqreamtech
Understanding 40m telecom customers with sqream db
Tracking customer behaviour at a large national mobile telecom operator with Tableau and
SQream DB to improve offering and increase revenue
@arnon86@sqreamtech
Understanding 40m telecom customers with sqream db
Understanding 40 million customers with SQream DB
80 nodes – 5 full racks
7600 CPU cores
SQream DB v1.9.6
HP Server with NVIDIA Tesla
96 GB RAM + 6 TB storage
Ingest time
Reporting time
Cost of Ownership $$$10,000,000
120 m
300 m 20 m
10 m
$200,000
@arnon86@sqreamtech
33.70
4.0
56
12,000,000
The cost of performance
ACV calculation on 24 TB of data, 300B rows, 8 different tables - with complex, nested joins
31.70
4.7
4
500,000
Netezza
8 full 42U racks, 56 S-Blades
7 TB RAM
SQream DB v1.9.6
Dell C4130 with 4x NVIDIA Tesla K80
512 GB RAM + iSCSI JBOD (20TB)
Average query time
(seconds)
Processing Units
(S-Blade / GPUs)
Compression ratio
Cost of Ownership $$
Major ad-tech increased revenues by improving bids
A major ad-tech deployed an 8 GPU SQream DB instances to unlock more insights from their Hadoop
cluster
Why they chose SQream DB
• TRILLIONS of ad impressions monthly equate to 360TB (raw).
This was too slow with Hadoop / Phoenix.
• Live analytics was unavailable due to Hadoop limitations
• The need to construct bidding histograms for dynamic CPM campaigns was extremely time-consuming
in the current system – query time around 5 hours!
8x NVIDIA Tesla GPUs
Qumulo NAS – 360TB
@arnon86@sqreamtech
Let’s see it in action
@arnon86@sqreamtech
Genome Research - Speed & Scale
SQream and Sheba medical center cut cancer cure research time from years to weeks
200 GB
Average size of a single human
genome sequencing
2 Months
Time it takes a genome researcher to
compare a handful of sequences
1 PB
The amount of storage needed by a
genome research institute
2 Hours
Time it takes a researcher to
compare up to hundreds of
sequences with SQream DB
x100
Factor of
improvement over
existing methods
@arnon86@sqreamtech
Chanel says racks are fashionable. Our customers
think otherwise
@arnon86@sqreamtech
BE EFFICIENT with your hardware
This configuration can analyze ~40TB of data
SQream DB with Tesla cards
@arnon86@sqreamtech
Environmentally friendly
GPU
GPU
GPU
GPU
GPU
GPU
GPU
GPU
Certified servers
Enabled with
Certified storage
@arnon86@sqreamtech
Let’s talk about the future
@arnon86@sqreamtech
Don’t be afraid of the future
• We know new databases are scary
• It’s a risk, but the reward is big
• Innovate all aspects of your data pipeline
Incremental Cold Fusion
The
scary
zone
@arnon86@sqreamtech
How we see the future of GPU databases
• The future is not just GPU databases. Different databases for different needs.
The relational model is still king for most of us
• More data = more processing power needed.
Scalable database solutions that can handle growing data become more relevant
• GPUs used for compute intensive stuff, e.g. graph processing, machine learning, AI
• Rising GPU offerings in the public cloud will allow adoption by more companies
GPUCLOUD
@arnon86@sqreamtech
How we see the future – hardware/Stack
• Improved programming extensions and better compilers in new CUDA/rockM will make it
easier to write good GPU code
• Faster HBM2 memory and PCIe v5.0 to reduce overhead of GPU processing
• More tightly-knit hardware integration, like the Intel H-series integrated GPU processor
@arnon86@sqreamtech
Reminder
•We offer a free consultation and assessment
to anyone here
•We can help you understand the benefits of
using a GPU database
@arnon86@sqreamtech
Don’t BUY hardware, BUY the results
• Your boss (probably) does not care about the chips in the servers
• GPU is a cool buzzword, but buzzwords alone won’t get the job done
• Achieve incredible speeds without betting the (server) farm
• Evaluate databases based on functionality and what they can do for you

More Related Content

What's hot

SQREAM DB on IBM Power9
SQREAM DB on IBM Power9SQREAM DB on IBM Power9
SQREAM DB on IBM Power9
Ganesan Narayanasamy
 
R&D to Product Pipeline Using Apache Spark in AdTech: Spark Summit East talk ...
R&D to Product Pipeline Using Apache Spark in AdTech: Spark Summit East talk ...R&D to Product Pipeline Using Apache Spark in AdTech: Spark Summit East talk ...
R&D to Product Pipeline Using Apache Spark in AdTech: Spark Summit East talk ...
Spark Summit
 
Analytics at the Real-Time Speed of Business: Spark Summit East talk by Manis...
Analytics at the Real-Time Speed of Business: Spark Summit East talk by Manis...Analytics at the Real-Time Speed of Business: Spark Summit East talk by Manis...
Analytics at the Real-Time Speed of Business: Spark Summit East talk by Manis...
Spark Summit
 
Accelerate Analytics and ML in the Hybrid Cloud Era
Accelerate Analytics and ML in the Hybrid Cloud EraAccelerate Analytics and ML in the Hybrid Cloud Era
Accelerate Analytics and ML in the Hybrid Cloud Era
Alluxio, Inc.
 
Introduce_non-volatile_generic_object_programming_model_for_In-Memory_Computing
Introduce_non-volatile_generic_object_programming_model_for_In-Memory_ComputingIntroduce_non-volatile_generic_object_programming_model_for_In-Memory_Computing
Introduce_non-volatile_generic_object_programming_model_for_In-Memory_Computing
YanpingWang
 
Open Source RAPIDS GPU Platform to Accelerate Predictive Data Analytics
Open Source RAPIDS GPU Platform to Accelerate Predictive Data AnalyticsOpen Source RAPIDS GPU Platform to Accelerate Predictive Data Analytics
Open Source RAPIDS GPU Platform to Accelerate Predictive Data Analytics
inside-BigData.com
 
How to teach your data scientist to leverage an analytics cluster with Presto...
How to teach your data scientist to leverage an analytics cluster with Presto...How to teach your data scientist to leverage an analytics cluster with Presto...
How to teach your data scientist to leverage an analytics cluster with Presto...
Alluxio, Inc.
 
Apache Spark At Scale in the Cloud
Apache Spark At Scale in the CloudApache Spark At Scale in the Cloud
Apache Spark At Scale in the Cloud
Databricks
 
Operationalizing Machine Learning Using GPU Accelerated, In-Database Analytics
Operationalizing Machine Learning Using GPU Accelerated, In-Database AnalyticsOperationalizing Machine Learning Using GPU Accelerated, In-Database Analytics
Operationalizing Machine Learning Using GPU Accelerated, In-Database Analytics
Kinetica
 
Cassandra on Google Cloud Platform (Ravi Madasu, Google / Ben Lackey, DataSta...
Cassandra on Google Cloud Platform (Ravi Madasu, Google / Ben Lackey, DataSta...Cassandra on Google Cloud Platform (Ravi Madasu, Google / Ben Lackey, DataSta...
Cassandra on Google Cloud Platform (Ravi Madasu, Google / Ben Lackey, DataSta...
DataStax
 
A New “Sparkitecture” for Modernizing your Data Warehouse: Spark Summit East ...
A New “Sparkitecture” for Modernizing your Data Warehouse: Spark Summit East ...A New “Sparkitecture” for Modernizing your Data Warehouse: Spark Summit East ...
A New “Sparkitecture” for Modernizing your Data Warehouse: Spark Summit East ...
Spark Summit
 
C* Capacity Forecasting (Ajay Upadhyay, Jyoti Shandil, Arun Agrawal, Netflix)...
C* Capacity Forecasting (Ajay Upadhyay, Jyoti Shandil, Arun Agrawal, Netflix)...C* Capacity Forecasting (Ajay Upadhyay, Jyoti Shandil, Arun Agrawal, Netflix)...
C* Capacity Forecasting (Ajay Upadhyay, Jyoti Shandil, Arun Agrawal, Netflix)...
DataStax
 
Stsg17 speaker yousunjeong
Stsg17 speaker yousunjeongStsg17 speaker yousunjeong
Stsg17 speaker yousunjeong
Yousun Jeong
 
Apache Druid Design and Future prospect
Apache Druid Design and Future prospectApache Druid Design and Future prospect
Apache Druid Design and Future prospect
c-bslim
 
Data Science Across Data Sources with Apache Arrow
Data Science Across Data Sources with Apache ArrowData Science Across Data Sources with Apache Arrow
Data Science Across Data Sources with Apache Arrow
Databricks
 
Build Real-Time Applications with Databricks Streaming
Build Real-Time Applications with Databricks StreamingBuild Real-Time Applications with Databricks Streaming
Build Real-Time Applications with Databricks Streaming
Databricks
 
Real-world Cloud HPC at Scale, for Production Workloads (BDT212) | AWS re:Inv...
Real-world Cloud HPC at Scale, for Production Workloads (BDT212) | AWS re:Inv...Real-world Cloud HPC at Scale, for Production Workloads (BDT212) | AWS re:Inv...
Real-world Cloud HPC at Scale, for Production Workloads (BDT212) | AWS re:Inv...
Amazon Web Services
 
How Adobe uses Structured Streaming at Scale
How Adobe uses Structured Streaming at ScaleHow Adobe uses Structured Streaming at Scale
How Adobe uses Structured Streaming at Scale
Databricks
 
DIscover Spark and Spark streaming
DIscover Spark and Spark streamingDIscover Spark and Spark streaming
DIscover Spark and Spark streaming
Maturin BADO
 

What's hot (19)

SQREAM DB on IBM Power9
SQREAM DB on IBM Power9SQREAM DB on IBM Power9
SQREAM DB on IBM Power9
 
R&D to Product Pipeline Using Apache Spark in AdTech: Spark Summit East talk ...
R&D to Product Pipeline Using Apache Spark in AdTech: Spark Summit East talk ...R&D to Product Pipeline Using Apache Spark in AdTech: Spark Summit East talk ...
R&D to Product Pipeline Using Apache Spark in AdTech: Spark Summit East talk ...
 
Analytics at the Real-Time Speed of Business: Spark Summit East talk by Manis...
Analytics at the Real-Time Speed of Business: Spark Summit East talk by Manis...Analytics at the Real-Time Speed of Business: Spark Summit East talk by Manis...
Analytics at the Real-Time Speed of Business: Spark Summit East talk by Manis...
 
Accelerate Analytics and ML in the Hybrid Cloud Era
Accelerate Analytics and ML in the Hybrid Cloud EraAccelerate Analytics and ML in the Hybrid Cloud Era
Accelerate Analytics and ML in the Hybrid Cloud Era
 
Introduce_non-volatile_generic_object_programming_model_for_In-Memory_Computing
Introduce_non-volatile_generic_object_programming_model_for_In-Memory_ComputingIntroduce_non-volatile_generic_object_programming_model_for_In-Memory_Computing
Introduce_non-volatile_generic_object_programming_model_for_In-Memory_Computing
 
Open Source RAPIDS GPU Platform to Accelerate Predictive Data Analytics
Open Source RAPIDS GPU Platform to Accelerate Predictive Data AnalyticsOpen Source RAPIDS GPU Platform to Accelerate Predictive Data Analytics
Open Source RAPIDS GPU Platform to Accelerate Predictive Data Analytics
 
How to teach your data scientist to leverage an analytics cluster with Presto...
How to teach your data scientist to leverage an analytics cluster with Presto...How to teach your data scientist to leverage an analytics cluster with Presto...
How to teach your data scientist to leverage an analytics cluster with Presto...
 
Apache Spark At Scale in the Cloud
Apache Spark At Scale in the CloudApache Spark At Scale in the Cloud
Apache Spark At Scale in the Cloud
 
Operationalizing Machine Learning Using GPU Accelerated, In-Database Analytics
Operationalizing Machine Learning Using GPU Accelerated, In-Database AnalyticsOperationalizing Machine Learning Using GPU Accelerated, In-Database Analytics
Operationalizing Machine Learning Using GPU Accelerated, In-Database Analytics
 
Cassandra on Google Cloud Platform (Ravi Madasu, Google / Ben Lackey, DataSta...
Cassandra on Google Cloud Platform (Ravi Madasu, Google / Ben Lackey, DataSta...Cassandra on Google Cloud Platform (Ravi Madasu, Google / Ben Lackey, DataSta...
Cassandra on Google Cloud Platform (Ravi Madasu, Google / Ben Lackey, DataSta...
 
A New “Sparkitecture” for Modernizing your Data Warehouse: Spark Summit East ...
A New “Sparkitecture” for Modernizing your Data Warehouse: Spark Summit East ...A New “Sparkitecture” for Modernizing your Data Warehouse: Spark Summit East ...
A New “Sparkitecture” for Modernizing your Data Warehouse: Spark Summit East ...
 
C* Capacity Forecasting (Ajay Upadhyay, Jyoti Shandil, Arun Agrawal, Netflix)...
C* Capacity Forecasting (Ajay Upadhyay, Jyoti Shandil, Arun Agrawal, Netflix)...C* Capacity Forecasting (Ajay Upadhyay, Jyoti Shandil, Arun Agrawal, Netflix)...
C* Capacity Forecasting (Ajay Upadhyay, Jyoti Shandil, Arun Agrawal, Netflix)...
 
Stsg17 speaker yousunjeong
Stsg17 speaker yousunjeongStsg17 speaker yousunjeong
Stsg17 speaker yousunjeong
 
Apache Druid Design and Future prospect
Apache Druid Design and Future prospectApache Druid Design and Future prospect
Apache Druid Design and Future prospect
 
Data Science Across Data Sources with Apache Arrow
Data Science Across Data Sources with Apache ArrowData Science Across Data Sources with Apache Arrow
Data Science Across Data Sources with Apache Arrow
 
Build Real-Time Applications with Databricks Streaming
Build Real-Time Applications with Databricks StreamingBuild Real-Time Applications with Databricks Streaming
Build Real-Time Applications with Databricks Streaming
 
Real-world Cloud HPC at Scale, for Production Workloads (BDT212) | AWS re:Inv...
Real-world Cloud HPC at Scale, for Production Workloads (BDT212) | AWS re:Inv...Real-world Cloud HPC at Scale, for Production Workloads (BDT212) | AWS re:Inv...
Real-world Cloud HPC at Scale, for Production Workloads (BDT212) | AWS re:Inv...
 
How Adobe uses Structured Streaming at Scale
How Adobe uses Structured Streaming at ScaleHow Adobe uses Structured Streaming at Scale
How Adobe uses Structured Streaming at Scale
 
DIscover Spark and Spark streaming
DIscover Spark and Spark streamingDIscover Spark and Spark streaming
DIscover Spark and Spark streaming
 

Viewers also liked

Java on the GPU: Where are we now?
Java on the GPU: Where are we now?Java on the GPU: Where are we now?
Java on the GPU: Where are we now?
Dmitry Alexandrov
 
HPC DAY 2017 | HPE Strategy And Portfolio for AI, BigData and HPC
HPC DAY 2017 | HPE Strategy And Portfolio for AI, BigData and HPCHPC DAY 2017 | HPE Strategy And Portfolio for AI, BigData and HPC
HPC DAY 2017 | HPE Strategy And Portfolio for AI, BigData and HPC
HPC DAY
 
HPC DAY 2017 | Prometheus - energy efficient supercomputing
HPC DAY 2017 | Prometheus - energy efficient supercomputingHPC DAY 2017 | Prometheus - energy efficient supercomputing
HPC DAY 2017 | Prometheus - energy efficient supercomputing
HPC DAY
 
Database Security Threats - MariaDB Security Best Practices
Database Security Threats - MariaDB Security Best PracticesDatabase Security Threats - MariaDB Security Best Practices
Database Security Threats - MariaDB Security Best Practices
MariaDB plc
 
LinuxKit and OpenOverlay
LinuxKit and OpenOverlayLinuxKit and OpenOverlay
LinuxKit and OpenOverlay
Moby Project
 
HPC DAY 2017 | Accelerating tomorrow's HPC and AI workflows with Intel Archit...
HPC DAY 2017 | Accelerating tomorrow's HPC and AI workflows with Intel Archit...HPC DAY 2017 | Accelerating tomorrow's HPC and AI workflows with Intel Archit...
HPC DAY 2017 | Accelerating tomorrow's HPC and AI workflows with Intel Archit...
HPC DAY
 
HPC DAY 2017 | HPE Storage and Data Management for Big Data
HPC DAY 2017 | HPE Storage and Data Management for Big DataHPC DAY 2017 | HPE Storage and Data Management for Big Data
HPC DAY 2017 | HPE Storage and Data Management for Big Data
HPC DAY
 
HPC DAY 2017 | NVIDIA Volta Architecture. Performance. Efficiency. Availability
HPC DAY 2017 | NVIDIA Volta Architecture. Performance. Efficiency. AvailabilityHPC DAY 2017 | NVIDIA Volta Architecture. Performance. Efficiency. Availability
HPC DAY 2017 | NVIDIA Volta Architecture. Performance. Efficiency. Availability
HPC DAY
 
Latency tracing in distributed Java applications
Latency tracing in distributed Java applicationsLatency tracing in distributed Java applications
Latency tracing in distributed Java applications
Constantine Slisenka
 
Libnetwork updates
Libnetwork updatesLibnetwork updates
Libnetwork updates
Moby Project
 
HPC DAY 2017 | Altair's PBS Pro: Your Gateway to HPC Computing
HPC DAY 2017 | Altair's PBS Pro: Your Gateway to HPC ComputingHPC DAY 2017 | Altair's PBS Pro: Your Gateway to HPC Computing
HPC DAY 2017 | Altair's PBS Pro: Your Gateway to HPC Computing
HPC DAY
 
Model Simulation, Graphical Animation, and Omniscient Debugging with EcoreToo...
Model Simulation, Graphical Animation, and Omniscient Debugging with EcoreToo...Model Simulation, Graphical Animation, and Omniscient Debugging with EcoreToo...
Model Simulation, Graphical Animation, and Omniscient Debugging with EcoreToo...
Benoit Combemale
 
Raspberry home server
Raspberry home serverRaspberry home server
Raspberry home server
Massimiliano Perrone
 
Design patterns in Java - Monitis 2017
Design patterns in Java - Monitis 2017Design patterns in Java - Monitis 2017
Design patterns in Java - Monitis 2017
Arsen Gasparyan
 
Getting Started with Embedded Python: MicroPython and CircuitPython
Getting Started with Embedded Python: MicroPython and CircuitPythonGetting Started with Embedded Python: MicroPython and CircuitPython
Getting Started with Embedded Python: MicroPython and CircuitPython
Ayan Pahwa
 
An Introduction to OMNeT++ 5.1
An Introduction to OMNeT++ 5.1An Introduction to OMNeT++ 5.1
An Introduction to OMNeT++ 5.1
Alpen-Adria-Universität
 
Drive into calico architecture
Drive into calico architectureDrive into calico architecture
Drive into calico architecture
Anirban Sen Chowdhary
 
Vertx
VertxVertx
세션1. block chain as a platform
세션1. block chain as a platform세션1. block chain as a platform
세션1. block chain as a platform
Jay JH Park
 
Scylla Summit 2017: Repair, Backup, Restore: Last Thing Before You Go to Prod...
Scylla Summit 2017: Repair, Backup, Restore: Last Thing Before You Go to Prod...Scylla Summit 2017: Repair, Backup, Restore: Last Thing Before You Go to Prod...
Scylla Summit 2017: Repair, Backup, Restore: Last Thing Before You Go to Prod...
ScyllaDB
 

Viewers also liked (20)

Java on the GPU: Where are we now?
Java on the GPU: Where are we now?Java on the GPU: Where are we now?
Java on the GPU: Where are we now?
 
HPC DAY 2017 | HPE Strategy And Portfolio for AI, BigData and HPC
HPC DAY 2017 | HPE Strategy And Portfolio for AI, BigData and HPCHPC DAY 2017 | HPE Strategy And Portfolio for AI, BigData and HPC
HPC DAY 2017 | HPE Strategy And Portfolio for AI, BigData and HPC
 
HPC DAY 2017 | Prometheus - energy efficient supercomputing
HPC DAY 2017 | Prometheus - energy efficient supercomputingHPC DAY 2017 | Prometheus - energy efficient supercomputing
HPC DAY 2017 | Prometheus - energy efficient supercomputing
 
Database Security Threats - MariaDB Security Best Practices
Database Security Threats - MariaDB Security Best PracticesDatabase Security Threats - MariaDB Security Best Practices
Database Security Threats - MariaDB Security Best Practices
 
LinuxKit and OpenOverlay
LinuxKit and OpenOverlayLinuxKit and OpenOverlay
LinuxKit and OpenOverlay
 
HPC DAY 2017 | Accelerating tomorrow's HPC and AI workflows with Intel Archit...
HPC DAY 2017 | Accelerating tomorrow's HPC and AI workflows with Intel Archit...HPC DAY 2017 | Accelerating tomorrow's HPC and AI workflows with Intel Archit...
HPC DAY 2017 | Accelerating tomorrow's HPC and AI workflows with Intel Archit...
 
HPC DAY 2017 | HPE Storage and Data Management for Big Data
HPC DAY 2017 | HPE Storage and Data Management for Big DataHPC DAY 2017 | HPE Storage and Data Management for Big Data
HPC DAY 2017 | HPE Storage and Data Management for Big Data
 
HPC DAY 2017 | NVIDIA Volta Architecture. Performance. Efficiency. Availability
HPC DAY 2017 | NVIDIA Volta Architecture. Performance. Efficiency. AvailabilityHPC DAY 2017 | NVIDIA Volta Architecture. Performance. Efficiency. Availability
HPC DAY 2017 | NVIDIA Volta Architecture. Performance. Efficiency. Availability
 
Latency tracing in distributed Java applications
Latency tracing in distributed Java applicationsLatency tracing in distributed Java applications
Latency tracing in distributed Java applications
 
Libnetwork updates
Libnetwork updatesLibnetwork updates
Libnetwork updates
 
HPC DAY 2017 | Altair's PBS Pro: Your Gateway to HPC Computing
HPC DAY 2017 | Altair's PBS Pro: Your Gateway to HPC ComputingHPC DAY 2017 | Altair's PBS Pro: Your Gateway to HPC Computing
HPC DAY 2017 | Altair's PBS Pro: Your Gateway to HPC Computing
 
Model Simulation, Graphical Animation, and Omniscient Debugging with EcoreToo...
Model Simulation, Graphical Animation, and Omniscient Debugging with EcoreToo...Model Simulation, Graphical Animation, and Omniscient Debugging with EcoreToo...
Model Simulation, Graphical Animation, and Omniscient Debugging with EcoreToo...
 
Raspberry home server
Raspberry home serverRaspberry home server
Raspberry home server
 
Design patterns in Java - Monitis 2017
Design patterns in Java - Monitis 2017Design patterns in Java - Monitis 2017
Design patterns in Java - Monitis 2017
 
Getting Started with Embedded Python: MicroPython and CircuitPython
Getting Started with Embedded Python: MicroPython and CircuitPythonGetting Started with Embedded Python: MicroPython and CircuitPython
Getting Started with Embedded Python: MicroPython and CircuitPython
 
An Introduction to OMNeT++ 5.1
An Introduction to OMNeT++ 5.1An Introduction to OMNeT++ 5.1
An Introduction to OMNeT++ 5.1
 
Drive into calico architecture
Drive into calico architectureDrive into calico architecture
Drive into calico architecture
 
Vertx
VertxVertx
Vertx
 
세션1. block chain as a platform
세션1. block chain as a platform세션1. block chain as a platform
세션1. block chain as a platform
 
Scylla Summit 2017: Repair, Backup, Restore: Last Thing Before You Go to Prod...
Scylla Summit 2017: Repair, Backup, Restore: Last Thing Before You Go to Prod...Scylla Summit 2017: Repair, Backup, Restore: Last Thing Before You Go to Prod...
Scylla Summit 2017: Repair, Backup, Restore: Last Thing Before You Go to Prod...
 

Similar to GPU databases - How to use them and what the future holds

Deep Learning Computer Build
Deep Learning Computer BuildDeep Learning Computer Build
Deep Learning Computer Build
PetteriTeikariPhD
 
Infrastructure and Tooling - Full Stack Deep Learning
Infrastructure and Tooling - Full Stack Deep LearningInfrastructure and Tooling - Full Stack Deep Learning
Infrastructure and Tooling - Full Stack Deep Learning
Sergey Karayev
 
AWS re:Invent 2016: Deep Learning, 3D Content Rendering, and Massively Parall...
AWS re:Invent 2016: Deep Learning, 3D Content Rendering, and Massively Parall...AWS re:Invent 2016: Deep Learning, 3D Content Rendering, and Massively Parall...
AWS re:Invent 2016: Deep Learning, 3D Content Rendering, and Massively Parall...
Amazon Web Services
 
Ceph Day Shanghai - SSD/NVM Technology Boosting Ceph Performance
Ceph Day Shanghai - SSD/NVM Technology Boosting Ceph Performance Ceph Day Shanghai - SSD/NVM Technology Boosting Ceph Performance
Ceph Day Shanghai - SSD/NVM Technology Boosting Ceph Performance
Ceph Community
 
Mike Pittaro - High Performance Hardware for Data Analysis
Mike Pittaro - High Performance Hardware for Data Analysis Mike Pittaro - High Performance Hardware for Data Analysis
Mike Pittaro - High Performance Hardware for Data Analysis
PyData
 
High Performance Hardware for Data Analysis
High Performance Hardware for Data AnalysisHigh Performance Hardware for Data Analysis
High Performance Hardware for Data Analysis
Mike Pittaro
 
The state of SQL-on-Hadoop in the Cloud
The state of SQL-on-Hadoop in the CloudThe state of SQL-on-Hadoop in the Cloud
The state of SQL-on-Hadoop in the Cloud
Nicolas Poggi
 
GPU Computing for Data Science
GPU Computing for Data Science GPU Computing for Data Science
GPU Computing for Data Science
Domino Data Lab
 
[2C5]Map-D: A GPU Database for Interactive Big Data Analytics
[2C5]Map-D: A GPU Database for Interactive Big Data Analytics[2C5]Map-D: A GPU Database for Interactive Big Data Analytics
[2C5]Map-D: A GPU Database for Interactive Big Data Analytics
NAVER D2
 
Evolution of Supermicro GPU Server Solution
Evolution of Supermicro GPU Server SolutionEvolution of Supermicro GPU Server Solution
Evolution of Supermicro GPU Server Solution
NVIDIA Taiwan
 
Choose Your Weapon: Comparing Spark on FPGAs vs GPUs
Choose Your Weapon: Comparing Spark on FPGAs vs GPUsChoose Your Weapon: Comparing Spark on FPGAs vs GPUs
Choose Your Weapon: Comparing Spark on FPGAs vs GPUs
Databricks
 
Meta scale kognitio hadoop webinar
Meta scale kognitio hadoop webinarMeta scale kognitio hadoop webinar
Meta scale kognitio hadoop webinar
Kognitio
 
Advancing GPU Analytics with RAPIDS Accelerator for Spark and Alluxio
Advancing GPU Analytics with RAPIDS Accelerator for Spark and AlluxioAdvancing GPU Analytics with RAPIDS Accelerator for Spark and Alluxio
Advancing GPU Analytics with RAPIDS Accelerator for Spark and Alluxio
Alluxio, Inc.
 
BDW Chicago 2016 - Manny Puentes, CTO, Altitude digital - How We Built a Data...
BDW Chicago 2016 - Manny Puentes, CTO, Altitude digital - How We Built a Data...BDW Chicago 2016 - Manny Puentes, CTO, Altitude digital - How We Built a Data...
BDW Chicago 2016 - Manny Puentes, CTO, Altitude digital - How We Built a Data...
Big Data Week
 
High Performance Computing for LiDAR Data Production
High Performance Computing for LiDAR Data ProductionHigh Performance Computing for LiDAR Data Production
High Performance Computing for LiDAR Data Production
MattBethel1
 
Hadoop Hardware @Twitter: Size does matter.
Hadoop Hardware @Twitter: Size does matter.Hadoop Hardware @Twitter: Size does matter.
Hadoop Hardware @Twitter: Size does matter.
Michael Zhang
 
Going Real-Time: Creating Frequently-Updating Datasets for Personalization: S...
Going Real-Time: Creating Frequently-Updating Datasets for Personalization: S...Going Real-Time: Creating Frequently-Updating Datasets for Personalization: S...
Going Real-Time: Creating Frequently-Updating Datasets for Personalization: S...
Spark Summit
 
DatEngConf SF16 - Apache Kudu: Fast Analytics on Fast Data
DatEngConf SF16 - Apache Kudu: Fast Analytics on Fast DataDatEngConf SF16 - Apache Kudu: Fast Analytics on Fast Data
DatEngConf SF16 - Apache Kudu: Fast Analytics on Fast Data
Hakka Labs
 
Hadoop Hardware @Twitter: Size does matter!
Hadoop Hardware @Twitter: Size does matter!Hadoop Hardware @Twitter: Size does matter!
Hadoop Hardware @Twitter: Size does matter!
DataWorks Summit
 
FPGAs versus GPUs in Data centers
FPGAs versus GPUs in Data centersFPGAs versus GPUs in Data centers
FPGAs versus GPUs in Data centers
Mehedi Hasan Raju
 

Similar to GPU databases - How to use them and what the future holds (20)

Deep Learning Computer Build
Deep Learning Computer BuildDeep Learning Computer Build
Deep Learning Computer Build
 
Infrastructure and Tooling - Full Stack Deep Learning
Infrastructure and Tooling - Full Stack Deep LearningInfrastructure and Tooling - Full Stack Deep Learning
Infrastructure and Tooling - Full Stack Deep Learning
 
AWS re:Invent 2016: Deep Learning, 3D Content Rendering, and Massively Parall...
AWS re:Invent 2016: Deep Learning, 3D Content Rendering, and Massively Parall...AWS re:Invent 2016: Deep Learning, 3D Content Rendering, and Massively Parall...
AWS re:Invent 2016: Deep Learning, 3D Content Rendering, and Massively Parall...
 
Ceph Day Shanghai - SSD/NVM Technology Boosting Ceph Performance
Ceph Day Shanghai - SSD/NVM Technology Boosting Ceph Performance Ceph Day Shanghai - SSD/NVM Technology Boosting Ceph Performance
Ceph Day Shanghai - SSD/NVM Technology Boosting Ceph Performance
 
Mike Pittaro - High Performance Hardware for Data Analysis
Mike Pittaro - High Performance Hardware for Data Analysis Mike Pittaro - High Performance Hardware for Data Analysis
Mike Pittaro - High Performance Hardware for Data Analysis
 
High Performance Hardware for Data Analysis
High Performance Hardware for Data AnalysisHigh Performance Hardware for Data Analysis
High Performance Hardware for Data Analysis
 
The state of SQL-on-Hadoop in the Cloud
The state of SQL-on-Hadoop in the CloudThe state of SQL-on-Hadoop in the Cloud
The state of SQL-on-Hadoop in the Cloud
 
GPU Computing for Data Science
GPU Computing for Data Science GPU Computing for Data Science
GPU Computing for Data Science
 
[2C5]Map-D: A GPU Database for Interactive Big Data Analytics
[2C5]Map-D: A GPU Database for Interactive Big Data Analytics[2C5]Map-D: A GPU Database for Interactive Big Data Analytics
[2C5]Map-D: A GPU Database for Interactive Big Data Analytics
 
Evolution of Supermicro GPU Server Solution
Evolution of Supermicro GPU Server SolutionEvolution of Supermicro GPU Server Solution
Evolution of Supermicro GPU Server Solution
 
Choose Your Weapon: Comparing Spark on FPGAs vs GPUs
Choose Your Weapon: Comparing Spark on FPGAs vs GPUsChoose Your Weapon: Comparing Spark on FPGAs vs GPUs
Choose Your Weapon: Comparing Spark on FPGAs vs GPUs
 
Meta scale kognitio hadoop webinar
Meta scale kognitio hadoop webinarMeta scale kognitio hadoop webinar
Meta scale kognitio hadoop webinar
 
Advancing GPU Analytics with RAPIDS Accelerator for Spark and Alluxio
Advancing GPU Analytics with RAPIDS Accelerator for Spark and AlluxioAdvancing GPU Analytics with RAPIDS Accelerator for Spark and Alluxio
Advancing GPU Analytics with RAPIDS Accelerator for Spark and Alluxio
 
BDW Chicago 2016 - Manny Puentes, CTO, Altitude digital - How We Built a Data...
BDW Chicago 2016 - Manny Puentes, CTO, Altitude digital - How We Built a Data...BDW Chicago 2016 - Manny Puentes, CTO, Altitude digital - How We Built a Data...
BDW Chicago 2016 - Manny Puentes, CTO, Altitude digital - How We Built a Data...
 
High Performance Computing for LiDAR Data Production
High Performance Computing for LiDAR Data ProductionHigh Performance Computing for LiDAR Data Production
High Performance Computing for LiDAR Data Production
 
Hadoop Hardware @Twitter: Size does matter.
Hadoop Hardware @Twitter: Size does matter.Hadoop Hardware @Twitter: Size does matter.
Hadoop Hardware @Twitter: Size does matter.
 
Going Real-Time: Creating Frequently-Updating Datasets for Personalization: S...
Going Real-Time: Creating Frequently-Updating Datasets for Personalization: S...Going Real-Time: Creating Frequently-Updating Datasets for Personalization: S...
Going Real-Time: Creating Frequently-Updating Datasets for Personalization: S...
 
DatEngConf SF16 - Apache Kudu: Fast Analytics on Fast Data
DatEngConf SF16 - Apache Kudu: Fast Analytics on Fast DataDatEngConf SF16 - Apache Kudu: Fast Analytics on Fast Data
DatEngConf SF16 - Apache Kudu: Fast Analytics on Fast Data
 
Hadoop Hardware @Twitter: Size does matter!
Hadoop Hardware @Twitter: Size does matter!Hadoop Hardware @Twitter: Size does matter!
Hadoop Hardware @Twitter: Size does matter!
 
FPGAs versus GPUs in Data centers
FPGAs versus GPUs in Data centersFPGAs versus GPUs in Data centers
FPGAs versus GPUs in Data centers
 

Recently uploaded

Global Situational Awareness of A.I. and where its headed
Global Situational Awareness of A.I. and where its headedGlobal Situational Awareness of A.I. and where its headed
Global Situational Awareness of A.I. and where its headed
vikram sood
 
STATATHON: Unleashing the Power of Statistics in a 48-Hour Knowledge Extravag...
STATATHON: Unleashing the Power of Statistics in a 48-Hour Knowledge Extravag...STATATHON: Unleashing the Power of Statistics in a 48-Hour Knowledge Extravag...
STATATHON: Unleashing the Power of Statistics in a 48-Hour Knowledge Extravag...
sameer shah
 
End-to-end pipeline agility - Berlin Buzzwords 2024
End-to-end pipeline agility - Berlin Buzzwords 2024End-to-end pipeline agility - Berlin Buzzwords 2024
End-to-end pipeline agility - Berlin Buzzwords 2024
Lars Albertsson
 
一比一原版(UCSB文凭证书)圣芭芭拉分校毕业证如何办理
一比一原版(UCSB文凭证书)圣芭芭拉分校毕业证如何办理一比一原版(UCSB文凭证书)圣芭芭拉分校毕业证如何办理
一比一原版(UCSB文凭证书)圣芭芭拉分校毕业证如何办理
nuttdpt
 
一比一原版(Coventry毕业证书)考文垂大学毕业证如何办理
一比一原版(Coventry毕业证书)考文垂大学毕业证如何办理一比一原版(Coventry毕业证书)考文垂大学毕业证如何办理
一比一原版(Coventry毕业证书)考文垂大学毕业证如何办理
74nqk8xf
 
Influence of Marketing Strategy and Market Competition on Business Plan
Influence of Marketing Strategy and Market Competition on Business PlanInfluence of Marketing Strategy and Market Competition on Business Plan
Influence of Marketing Strategy and Market Competition on Business Plan
jerlynmaetalle
 
Predictably Improve Your B2B Tech Company's Performance by Leveraging Data
Predictably Improve Your B2B Tech Company's Performance by Leveraging DataPredictably Improve Your B2B Tech Company's Performance by Leveraging Data
Predictably Improve Your B2B Tech Company's Performance by Leveraging Data
Kiwi Creative
 
Population Growth in Bataan: The effects of population growth around rural pl...
Population Growth in Bataan: The effects of population growth around rural pl...Population Growth in Bataan: The effects of population growth around rural pl...
Population Growth in Bataan: The effects of population growth around rural pl...
Bill641377
 
A presentation that explain the Power BI Licensing
A presentation that explain the Power BI LicensingA presentation that explain the Power BI Licensing
A presentation that explain the Power BI Licensing
AlessioFois2
 
ViewShift: Hassle-free Dynamic Policy Enforcement for Every Data Lake
ViewShift: Hassle-free Dynamic Policy Enforcement for Every Data LakeViewShift: Hassle-free Dynamic Policy Enforcement for Every Data Lake
ViewShift: Hassle-free Dynamic Policy Enforcement for Every Data Lake
Walaa Eldin Moustafa
 
Analysis insight about a Flyball dog competition team's performance
Analysis insight about a Flyball dog competition team's performanceAnalysis insight about a Flyball dog competition team's performance
Analysis insight about a Flyball dog competition team's performance
roli9797
 
Everything you wanted to know about LIHTC
Everything you wanted to know about LIHTCEverything you wanted to know about LIHTC
Everything you wanted to know about LIHTC
Roger Valdez
 
Beyond the Basics of A/B Tests: Highly Innovative Experimentation Tactics You...
Beyond the Basics of A/B Tests: Highly Innovative Experimentation Tactics You...Beyond the Basics of A/B Tests: Highly Innovative Experimentation Tactics You...
Beyond the Basics of A/B Tests: Highly Innovative Experimentation Tactics You...
Aggregage
 
The Building Blocks of QuestDB, a Time Series Database
The Building Blocks of QuestDB, a Time Series DatabaseThe Building Blocks of QuestDB, a Time Series Database
The Building Blocks of QuestDB, a Time Series Database
javier ramirez
 
06-12-2024-BudapestDataForum-BuildingReal-timePipelineswithFLaNK AIM
06-12-2024-BudapestDataForum-BuildingReal-timePipelineswithFLaNK AIM06-12-2024-BudapestDataForum-BuildingReal-timePipelineswithFLaNK AIM
06-12-2024-BudapestDataForum-BuildingReal-timePipelineswithFLaNK AIM
Timothy Spann
 
一比一原版(UO毕业证)渥太华大学毕业证如何办理
一比一原版(UO毕业证)渥太华大学毕业证如何办理一比一原版(UO毕业证)渥太华大学毕业证如何办理
一比一原版(UO毕业证)渥太华大学毕业证如何办理
aqzctr7x
 
一比一原版(UMN文凭证书)明尼苏达大学毕业证如何办理
一比一原版(UMN文凭证书)明尼苏达大学毕业证如何办理一比一原版(UMN文凭证书)明尼苏达大学毕业证如何办理
一比一原版(UMN文凭证书)明尼苏达大学毕业证如何办理
nyfuhyz
 
一比一原版(GWU,GW文凭证书)乔治·华盛顿大学毕业证如何办理
一比一原版(GWU,GW文凭证书)乔治·华盛顿大学毕业证如何办理一比一原版(GWU,GW文凭证书)乔治·华盛顿大学毕业证如何办理
一比一原版(GWU,GW文凭证书)乔治·华盛顿大学毕业证如何办理
bopyb
 
The Ipsos - AI - Monitor 2024 Report.pdf
The  Ipsos - AI - Monitor 2024 Report.pdfThe  Ipsos - AI - Monitor 2024 Report.pdf
The Ipsos - AI - Monitor 2024 Report.pdf
Social Samosa
 
一比一原版(Harvard毕业证书)哈佛大学毕业证如何办理
一比一原版(Harvard毕业证书)哈佛大学毕业证如何办理一比一原版(Harvard毕业证书)哈佛大学毕业证如何办理
一比一原版(Harvard毕业证书)哈佛大学毕业证如何办理
zsjl4mimo
 

Recently uploaded (20)

Global Situational Awareness of A.I. and where its headed
Global Situational Awareness of A.I. and where its headedGlobal Situational Awareness of A.I. and where its headed
Global Situational Awareness of A.I. and where its headed
 
STATATHON: Unleashing the Power of Statistics in a 48-Hour Knowledge Extravag...
STATATHON: Unleashing the Power of Statistics in a 48-Hour Knowledge Extravag...STATATHON: Unleashing the Power of Statistics in a 48-Hour Knowledge Extravag...
STATATHON: Unleashing the Power of Statistics in a 48-Hour Knowledge Extravag...
 
End-to-end pipeline agility - Berlin Buzzwords 2024
End-to-end pipeline agility - Berlin Buzzwords 2024End-to-end pipeline agility - Berlin Buzzwords 2024
End-to-end pipeline agility - Berlin Buzzwords 2024
 
一比一原版(UCSB文凭证书)圣芭芭拉分校毕业证如何办理
一比一原版(UCSB文凭证书)圣芭芭拉分校毕业证如何办理一比一原版(UCSB文凭证书)圣芭芭拉分校毕业证如何办理
一比一原版(UCSB文凭证书)圣芭芭拉分校毕业证如何办理
 
一比一原版(Coventry毕业证书)考文垂大学毕业证如何办理
一比一原版(Coventry毕业证书)考文垂大学毕业证如何办理一比一原版(Coventry毕业证书)考文垂大学毕业证如何办理
一比一原版(Coventry毕业证书)考文垂大学毕业证如何办理
 
Influence of Marketing Strategy and Market Competition on Business Plan
Influence of Marketing Strategy and Market Competition on Business PlanInfluence of Marketing Strategy and Market Competition on Business Plan
Influence of Marketing Strategy and Market Competition on Business Plan
 
Predictably Improve Your B2B Tech Company's Performance by Leveraging Data
Predictably Improve Your B2B Tech Company's Performance by Leveraging DataPredictably Improve Your B2B Tech Company's Performance by Leveraging Data
Predictably Improve Your B2B Tech Company's Performance by Leveraging Data
 
Population Growth in Bataan: The effects of population growth around rural pl...
Population Growth in Bataan: The effects of population growth around rural pl...Population Growth in Bataan: The effects of population growth around rural pl...
Population Growth in Bataan: The effects of population growth around rural pl...
 
A presentation that explain the Power BI Licensing
A presentation that explain the Power BI LicensingA presentation that explain the Power BI Licensing
A presentation that explain the Power BI Licensing
 
ViewShift: Hassle-free Dynamic Policy Enforcement for Every Data Lake
ViewShift: Hassle-free Dynamic Policy Enforcement for Every Data LakeViewShift: Hassle-free Dynamic Policy Enforcement for Every Data Lake
ViewShift: Hassle-free Dynamic Policy Enforcement for Every Data Lake
 
Analysis insight about a Flyball dog competition team's performance
Analysis insight about a Flyball dog competition team's performanceAnalysis insight about a Flyball dog competition team's performance
Analysis insight about a Flyball dog competition team's performance
 
Everything you wanted to know about LIHTC
Everything you wanted to know about LIHTCEverything you wanted to know about LIHTC
Everything you wanted to know about LIHTC
 
Beyond the Basics of A/B Tests: Highly Innovative Experimentation Tactics You...
Beyond the Basics of A/B Tests: Highly Innovative Experimentation Tactics You...Beyond the Basics of A/B Tests: Highly Innovative Experimentation Tactics You...
Beyond the Basics of A/B Tests: Highly Innovative Experimentation Tactics You...
 
The Building Blocks of QuestDB, a Time Series Database
The Building Blocks of QuestDB, a Time Series DatabaseThe Building Blocks of QuestDB, a Time Series Database
The Building Blocks of QuestDB, a Time Series Database
 
06-12-2024-BudapestDataForum-BuildingReal-timePipelineswithFLaNK AIM
06-12-2024-BudapestDataForum-BuildingReal-timePipelineswithFLaNK AIM06-12-2024-BudapestDataForum-BuildingReal-timePipelineswithFLaNK AIM
06-12-2024-BudapestDataForum-BuildingReal-timePipelineswithFLaNK AIM
 
一比一原版(UO毕业证)渥太华大学毕业证如何办理
一比一原版(UO毕业证)渥太华大学毕业证如何办理一比一原版(UO毕业证)渥太华大学毕业证如何办理
一比一原版(UO毕业证)渥太华大学毕业证如何办理
 
一比一原版(UMN文凭证书)明尼苏达大学毕业证如何办理
一比一原版(UMN文凭证书)明尼苏达大学毕业证如何办理一比一原版(UMN文凭证书)明尼苏达大学毕业证如何办理
一比一原版(UMN文凭证书)明尼苏达大学毕业证如何办理
 
一比一原版(GWU,GW文凭证书)乔治·华盛顿大学毕业证如何办理
一比一原版(GWU,GW文凭证书)乔治·华盛顿大学毕业证如何办理一比一原版(GWU,GW文凭证书)乔治·华盛顿大学毕业证如何办理
一比一原版(GWU,GW文凭证书)乔治·华盛顿大学毕业证如何办理
 
The Ipsos - AI - Monitor 2024 Report.pdf
The  Ipsos - AI - Monitor 2024 Report.pdfThe  Ipsos - AI - Monitor 2024 Report.pdf
The Ipsos - AI - Monitor 2024 Report.pdf
 
一比一原版(Harvard毕业证书)哈佛大学毕业证如何办理
一比一原版(Harvard毕业证书)哈佛大学毕业证如何办理一比一原版(Harvard毕业证书)哈佛大学毕业证如何办理
一比一原版(Harvard毕业证书)哈佛大学毕业证如何办理
 

GPU databases - How to use them and what the future holds

  • 1. @arnon86@sqreamtech GPU DATABASES: HOW TO USE THEM AND WHAT THE FUTURE HOLDS or GD: HTUT AWTFH for short
  • 2. @arnon86@sqreamtech Before we start… •We offer a free consultation and assessment to anyone here •We can help you understand the benefits of using a GPU database
  • 3. @arnon86@sqreamtech Who I am •From Israel •4 years at SQream •Originally part of the dev team •Tweet about animals a lot - @arnon86
  • 6. @arnon86@sqreamtech “The consensus was that if we could keep doing that, if we could go to chips with 1,000 cores, everything would be fine,”
  • 7. @arnon86@sqreamtech “It turns out that’s really hard” Dr. Doug Burger, an expert in chip design at Microsoft.
  • 8. @arnon86@sqreamtech So we just take things parallel, right?
  • 9. @arnon86@sqreamtech Let’s talk BIG data Hundreds of TB (Sometimes even petabytes of data) coming in at a rate of multiple terabytes per day Up to 1-4TB 2010 20162008 Up to 10TB Data is STILL growing exponentially
  • 10. @arnon86@sqreamtech 530 PB 12000 PB 15000 PB CERN NSA Google We’re in the petabyte age • Petabyte datasets are now the norm • Even small companies have dozens of terabytes of data for analysis • Some outliers have more: – CERN processes 1 petabyte per day, stores 530 PB total – In 2012, Facebook analyzed 5 petabytes per day, stores estimated a few exabytes – The NSA might hold 12 exabytes
  • 11. Are we only analyzing the tip of the iceberg?
  • 12. @arnon86@sqreamtech What we’ll talk about •Why GPUs? •What are GPU databases? •When are GPU databases good? •The future
  • 14. @arnon86@sqreamtech What is a GPU? • A processor specialized for display functions • The GPU renders images, animations and video for the computer's screen.
  • 15. @arnon86@sqreamtech What is a GPGPU? • A general-purpose GPU (GPGPU) is a GPU that performs non-specialized calculations that would typically be conducted by the CPU. • Put simply, it’s about taking the GPU and generalizing it for non-graphics. • AMD and NVIDIA have their own APIs for doing GPGPU programming – rockM and CUDA respectively.
  • 18. @arnon86@sqreamtech it’s not a strange piece of hardware
  • 19. @arnon86@sqreamtech Gpus all around • Pretty much all cloud providers now offer GPU instances • Most hardware vendors offer specially tuned GPU servers GPUCLOUD
  • 21. @arnon86@sqreamtech What are GPU Databases? • A GPU database is a database, relational or non-relational, that uses a GPU to perform some database operations • Most of the GPU databases tend to focus on analytics, and they’re offering it to a market that was oversold on Hadoop for Big Data analytics • And they’re typically pretty fast And they’re not only disrupting the in-memory crowd • GPU databases are more flexible in processing many different types of data, or much larger amounts of data
  • 22. @arnon86@sqreamtech Why gpus in big data? • High core count allows offloading of ‘heavy’ stuff like JOINs, ORDER BY, GROUP BY from the CPU to the GPU • Compression and Decompression processes reduce PCI and disk I/O. These are basically free on the GPU • Can also use GPU to do computationally intensive operations like deep learning, cryptography.
  • 23. @arnon86@sqreamtech Today’s data market - databases • A lot of new databases are in-memory, because “memory is cheap” • In-memory can’t handle more than ~2TB without very expensive hardware • Scaling out with in-memory gets very expensive, very fast: 8 SAP HANA machines for handling 40TB has a TCO of $22,000,000 for 4 years
  • 24. @arnon86@sqreamtech There’s more than one type of gpu database In-memory GPU databases • Typically for small datasets • Stores data in-memory • Very fast performance (milliseconds) • For relatively simple queries • Limited due to memory constraints Big Data GPU databases • Typically for giant datasets • Stores data on-disk • Fast performance (seconds-minutes) • For complex queries • Theoretically unlimited data-sets • A good fit for today’s evolving needs
  • 25. @arnon86@sqreamtech Don’t BUY hardware, BUY the results • Your boss (probably) does not care about the chips in the servers • GPU is a cool buzzword, but buzzwords alone won’t get the job done • Achieve incredible speeds without betting the (server) farm • Evaluate databases based on functionality and what they can do for you
  • 27. @arnon86@sqreamtech Understanding 40m telecom customers with sqream db Tracking customer behaviour at a large national mobile telecom operator with Tableau and SQream DB to improve offering and increase revenue
  • 28. @arnon86@sqreamtech Understanding 40m telecom customers with sqream db Understanding 40 million customers with SQream DB 80 nodes – 5 full racks 7600 CPU cores SQream DB v1.9.6 HP Server with NVIDIA Tesla 96 GB RAM + 6 TB storage Ingest time Reporting time Cost of Ownership $$$10,000,000 120 m 300 m 20 m 10 m $200,000
  • 29. @arnon86@sqreamtech 33.70 4.0 56 12,000,000 The cost of performance ACV calculation on 24 TB of data, 300B rows, 8 different tables - with complex, nested joins 31.70 4.7 4 500,000 Netezza 8 full 42U racks, 56 S-Blades 7 TB RAM SQream DB v1.9.6 Dell C4130 with 4x NVIDIA Tesla K80 512 GB RAM + iSCSI JBOD (20TB) Average query time (seconds) Processing Units (S-Blade / GPUs) Compression ratio Cost of Ownership $$
  • 30. Major ad-tech increased revenues by improving bids A major ad-tech deployed an 8 GPU SQream DB instances to unlock more insights from their Hadoop cluster Why they chose SQream DB • TRILLIONS of ad impressions monthly equate to 360TB (raw). This was too slow with Hadoop / Phoenix. • Live analytics was unavailable due to Hadoop limitations • The need to construct bidding histograms for dynamic CPM campaigns was extremely time-consuming in the current system – query time around 5 hours! 8x NVIDIA Tesla GPUs Qumulo NAS – 360TB
  • 32. @arnon86@sqreamtech Genome Research - Speed & Scale SQream and Sheba medical center cut cancer cure research time from years to weeks 200 GB Average size of a single human genome sequencing 2 Months Time it takes a genome researcher to compare a handful of sequences 1 PB The amount of storage needed by a genome research institute 2 Hours Time it takes a researcher to compare up to hundreds of sequences with SQream DB x100 Factor of improvement over existing methods
  • 33. @arnon86@sqreamtech Chanel says racks are fashionable. Our customers think otherwise
  • 34. @arnon86@sqreamtech BE EFFICIENT with your hardware This configuration can analyze ~40TB of data SQream DB with Tesla cards
  • 37. @arnon86@sqreamtech Don’t be afraid of the future • We know new databases are scary • It’s a risk, but the reward is big • Innovate all aspects of your data pipeline Incremental Cold Fusion The scary zone
  • 38. @arnon86@sqreamtech How we see the future of GPU databases • The future is not just GPU databases. Different databases for different needs. The relational model is still king for most of us • More data = more processing power needed. Scalable database solutions that can handle growing data become more relevant • GPUs used for compute intensive stuff, e.g. graph processing, machine learning, AI • Rising GPU offerings in the public cloud will allow adoption by more companies GPUCLOUD
  • 39. @arnon86@sqreamtech How we see the future – hardware/Stack • Improved programming extensions and better compilers in new CUDA/rockM will make it easier to write good GPU code • Faster HBM2 memory and PCIe v5.0 to reduce overhead of GPU processing • More tightly-knit hardware integration, like the Intel H-series integrated GPU processor
  • 40. @arnon86@sqreamtech Reminder •We offer a free consultation and assessment to anyone here •We can help you understand the benefits of using a GPU database
  • 41. @arnon86@sqreamtech Don’t BUY hardware, BUY the results • Your boss (probably) does not care about the chips in the servers • GPU is a cool buzzword, but buzzwords alone won’t get the job done • Achieve incredible speeds without betting the (server) farm • Evaluate databases based on functionality and what they can do for you