3. Datawarehouse Appliance
What is Netezza?
H/W & S/W pre-bundled, pre-configured
Little configuration needed after deployment
Solves the traditional datawarehouse complexities!
8. Netezza Architecture (major) Principles
Processing close to the data source
Balanced massively parallel architecture
Appliance Simplicity
Flexible configurations and extreme scalability
9. SELECT DISTRICT, PRODUCTGRP, SUM (NRX)
FROM MTHLY_PROD_DATA
WHERE PDATE=“20140401”
AND MARKET = “2014”
And SPECIALITY = “GASTRO”
Slice of table
MTHLY_PROD_DATA
(Compressed)
SELECT
DISTRICT,
PRODUCTGRP,
NRX
SUM (NRX)
FPGA in Action!
13. What happens when you submit a query?
Host compiles the query & divides into snippets
Optimizer creates a query execution plan by making intelligent decisions like join order/
redistribution/broadcast
Each snippet has two elements: Compiled code & FPGA parameters
Object Cache: Improves query performance. You can avoid code compilation
Scheduler: Maintains maximum utilization and throughput
S-Blades execute these snippets in parallel. Sends the results back to host
Host accumulates the results and results will be returned to Client …
16. Various Datawarehouse appliances in the market!
IBM (Netezza)
HP (Vertica)
EMC (Greenplum)
SAP (HANA-High Performance Analytics Appliance)
Oracle (Exadata)
Teradata (Teradata, Asterdata)
Microsoft (DATAllegro)
http://ybigdata.blogspot.com/2013/01/vertica-vs-aster-data-vs-greenplum-vs.html
17. Netezza Delivers …
• Speed: 10-100x faster than traditional custom systems
• Simplicity & Ease: Minimal tuning & administration and greater resilience
• Fast time to value: 5 TB/Hour load speed
• Smart: Complex algorithms in minutes. A rich library of integrated analytics