Performance Evaluation of Cloudera Impala GA
Yukinori Suda
1.
Performance Evaluation of Cloudera Impala 1.0
May 1, 2013
CELLANT Corp. R&D Strategy Division
Yukinori SUDA @sudabon
Copyright © CELLANT Corp. All Rights Reserved. http://www.cellant.jp/
2.
Cloudera Impala GA was released!!
• Support for a subset of ANSI-92 SQL
  – CREATE, ALTER, SELECT, INSERT, JOIN, and subqueries
• Support for partitioned joins, fully distributed aggregations, and fully distributed top-n queries
• Support for a variety of data formats:
  – Hadoop native (Apache Avro, SequenceFile, RCFile with Snappy, GZIP, BZIP, or uncompressed)
  – text (uncompressed or LZO-compressed)
  – Parquet (Snappy or uncompressed)
• Support for all CDH4 64-bit packages:
  – RHEL 6.2/5.7, Ubuntu, Debian, SLES
• Connectivity via JDBC, ODBC, Hue GUI, or command-line shell
• Kerberos authentication and MR/Impala resource isolation
• etc.
3.
Our System Environment
• Installed using Cloudera Manager Free Edition 4.5.2
• Master: 3 servers (Active NameNode, Stand-by NameNode, JobTracker, statestored)
• Slave: 11 servers, each running DataNode, TaskTracker, and Impalad
• All servers are connected with 1Gbps Ethernet through an L2 switch
4.
Our “wimpy” Server Specification
• CPU
  – Intel Core 2 Duo 2.13 GHz with Hyper Threading
• Memory
  – 4GB
• Disk
  – 7,200 rpm SATA mechanical Hard Disk Drive × 1
• OS
  – CentOS 6.2
5.
Benchmark
• Used CDH4.2.1 + Impala version 1.0
• Used hivebench from the open-source benchmark tool “HiBench”
  – https://github.com/hibench
• Reduced the datasets to 1/10 scale
  – The default configuration generates a table with 1 billion rows
• Modified the query
  – Deleted “INSERT INTO TABLE …” to evaluate read-only performance
• Combined several storage formats with several compression methods
  – TextFile, SequenceFile, RCFile, ParquetFile
  – No compression, Gzip, Snappy
• Compared query job latency
• Reported the average job latency over 5 measurements
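The deck reports each latency as an average over 5 measurements but does not show how the runs were timed. The following is only a minimal sketch of that averaging step, assuming the query is wrapped in a hypothetical callable `run_query`; the function name `average_latency` and the stand-in workload are illustrative and not taken from the deck.

```python
import statistics
import time
from typing import Callable


def average_latency(run_query: Callable[[], object], repeats: int = 5) -> float:
    """Run `run_query` `repeats` times and return the mean wall-clock seconds."""
    latencies = []
    for _ in range(repeats):
        start = time.perf_counter()
        run_query()  # hypothetical: submit the query and wait for the result
        latencies.append(time.perf_counter() - start)
    return statistics.mean(latencies)


# Stand-in workload so the sketch runs without a cluster.
print(f"{average_latency(lambda: sum(range(100_000))):.6f} sec")
```

In a real measurement, `run_query` would submit the benchmark query to Hive or Impala and block until it finishes.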
6.
Modified Datasets
• Uservisits table
  – 100 million rows
  – 16,895 MB as TextFile
  – Table Definitions
    • sourceIP string
    • destURL string
    • visitDate string
    • adRevenue double
    • userAgent string
    • countryCode string
    • languageCode string
    • searchWord string
    • duration int
• Rankings table
  – 12 million rows
  – 744 MB as TextFile
  – Table Definitions
    • pageURL string
    • pageRank int
    • avgDuration int
7.
Modified Query

    SELECT sourceIP, sum(adRevenue) as totalRevenue, avg(pageRank)
    FROM rankings_t R
    JOIN (
        SELECT sourceIP, destURL, adRevenue
        FROM uservisits_t UV
        WHERE (datediff(UV.visitDate, '1999-01-01')>=0
           AND datediff(UV.visitDate, '2000-01-01')<=0)
    ) NUV
    ON (R.pageURL = NUV.destURL)
    group by sourceIP
    order by totalRevenue DESC
    limit 1;
8.
Benchmark Result (Hive)
cited from “Performance evaluation of Cloudera impala 0.6 beta...”
Avg. Job Latency [sec]
• TextFile, No Comp.: 235.843
• SequenceFile, Gzip: 227.883
• SequenceFile, Snappy: 213.616
• RCFile, Gzip: 234.289
• RCFile, Snappy: 197.894
9.
Benchmark Result (Impala)
Avg. Job Latency [sec]
• TextFile, No Comp.: 36.61
• SequenceFile, Gzip: 29.736
• SequenceFile, Snappy: 24.024
• RCFile, Gzip: 26.083
• RCFile, Snappy: 19.586
• ParquetFile, Snappy: 16.2
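The Hive and Impala charts share five format and compression combinations, so the latencies can be compared directly. The sketch below computes the Hive-to-Impala latency ratio for each shared combination. It assumes the values in each chart are listed in the same order as the bar labels; the dictionary layout and the helper name `speedups` are illustrative only.

```python
# Average job latency in seconds, copied from the Hive and Impala result slides.
# Assumption: each value pairs with the format/compression label in chart order.
hive = {
    ("TextFile", "none"): 235.843,
    ("SequenceFile", "Gzip"): 227.883,
    ("SequenceFile", "Snappy"): 213.616,
    ("RCFile", "Gzip"): 234.289,
    ("RCFile", "Snappy"): 197.894,
}
impala = {
    ("TextFile", "none"): 36.61,
    ("SequenceFile", "Gzip"): 29.736,
    ("SequenceFile", "Snappy"): 24.024,
    ("RCFile", "Gzip"): 26.083,
    ("RCFile", "Snappy"): 19.586,
    ("ParquetFile", "Snappy"): 16.2,  # no Hive counterpart in the deck
}


def speedups(hive_sec, impala_sec):
    """Return Hive latency divided by Impala latency for combinations in both."""
    return {key: hive_sec[key] / impala_sec[key] for key in hive_sec if key in impala_sec}


for (fmt, comp), ratio in speedups(hive, impala).items():
    print(f"{fmt:<12} {comp:<6} {ratio:4.1f}x")
```

Under that pairing, the ratios range from roughly 6x for uncompressed TextFile to roughly 10x for RCFile with Snappy. The Hive figures are cited from the earlier 0.6 beta evaluation, so the comparison is indicative rather than a like-for-like rerun.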
10.
Additional Experiments
• Exchange the order of the JOINed tables as below

    SELECT sourceIP, sum(adRevenue) as totalRevenue, avg(pageRank)
    FROM (
        SELECT sourceIP, destURL, adRevenue
        FROM uservisits_ps UV
        WHERE (datediff(UV.visitDate, '1999-01-01')>=0
           AND datediff(UV.visitDate, '2000-01-01')<=0)
    ) NUV
    JOIN rankings_ps R
    ON (R.pageURL = NUV.destURL)
    group by sourceIP
    order by totalRevenue DESC
    limit 1;

• Result
  – Parquet compressed as Snappy: 34.374 sec
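To put the join-order result in proportion, the swapped-order latency can be set against the 16.2 sec that the deck reports for Parquet with Snappy in the original table order. This is a small arithmetic sketch; the variable names are illustrative, and it assumes both figures come from the same tables and cluster.

```python
# Latencies in seconds for Parquet + Snappy, as reported in the deck.
original_order_sec = 16.2    # rankings JOIN (filtered uservisits subquery)
swapped_order_sec = 34.374   # (filtered uservisits subquery) JOIN rankings

# Ratio of the swapped-order latency to the original-order latency.
slowdown = swapped_order_sec / original_order_sec
print(f"Swapped join order takes {slowdown:.2f}x as long")
```

The swapped order takes a little over twice as long on the same data.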
11.
Conclusion
• Parquet + Snappy is the fastest
• Specifically,
  – ParquetFile compressed as Snappy: 16.2 sec
• The order of JOINed tables needs care
• Hoped-for future extensions
  – UDF support
  – Window functions
  – etc.
12.
Let’s try it out in your environment!!
Thanks!