Submit Search
Upload
Hive - Cost Based Optimizer
•
Download as PPTX, PDF
•
1 like
•
1,072 views
J
John pullokkaran
Follow
Cost Based Optimizer in Apache Hive
Read less
Read more
Software
Report
Share
Report
Share
1 of 10
Download now
Recommended
CBO-2
CBO-2
John pullokkaran
Stinger Initiative: Leveraging Hive & Yarn for High-Performance/Interactive Q...
Stinger Initiative: Leveraging Hive & Yarn for High-Performance/Interactive Q...
Caserta
Tuning up with Apache Tez
Tuning up with Apache Tez
Gal Vinograd
MySql to HBase in 5 Steps
MySql to HBase in 5 Steps
Scott Cinnamond
Stinger hadoop summit june 2013
Stinger hadoop summit june 2013
alanfgates
Apache Ambari: Managing Hadoop and YARN
Apache Ambari: Managing Hadoop and YARN
Hortonworks
February 2014 HUG : Hive On Tez
February 2014 HUG : Hive On Tez
Yahoo Developer Network
HBaseCon 2013: Being Smarter Than the Smart Meter
HBaseCon 2013: Being Smarter Than the Smart Meter
Cloudera, Inc.
Recommended
CBO-2
CBO-2
John pullokkaran
Stinger Initiative: Leveraging Hive & Yarn for High-Performance/Interactive Q...
Stinger Initiative: Leveraging Hive & Yarn for High-Performance/Interactive Q...
Caserta
Tuning up with Apache Tez
Tuning up with Apache Tez
Gal Vinograd
MySql to HBase in 5 Steps
MySql to HBase in 5 Steps
Scott Cinnamond
Stinger hadoop summit june 2013
Stinger hadoop summit june 2013
alanfgates
Apache Ambari: Managing Hadoop and YARN
Apache Ambari: Managing Hadoop and YARN
Hortonworks
February 2014 HUG : Hive On Tez
February 2014 HUG : Hive On Tez
Yahoo Developer Network
HBaseCon 2013: Being Smarter Than the Smart Meter
HBaseCon 2013: Being Smarter Than the Smart Meter
Cloudera, Inc.
Hortonworks Technical Workshop: HBase and Apache Phoenix
Hortonworks Technical Workshop: HBase and Apache Phoenix
Hortonworks
5. pivotal hd 2013
5. pivotal hd 2013
Chiou-Nan Chen
Qubole @ AWS Meetup Bangalore - July 2015
Qubole @ AWS Meetup Bangalore - July 2015
Joydeep Sen Sarma
Apache Tez - Accelerating Hadoop Data Processing
Apache Tez - Accelerating Hadoop Data Processing
hitesh1892
The future of Essbase: Hybrid database format
The future of Essbase: Hybrid database format
Erik De Rouck
Apache Hadoop YARN - The Future of Data Processing with Hadoop
Apache Hadoop YARN - The Future of Data Processing with Hadoop
Hortonworks
Hive+Tez: A performance deep dive
Hive+Tez: A performance deep dive
t3rmin4t0r
Orca: A Modular Query Optimizer Architecture for Big Data
Orca: A Modular Query Optimizer Architecture for Big Data
EMC
Hive LLAP: A High Performance, Cost-effective Alternative to Traditional MPP ...
Hive LLAP: A High Performance, Cost-effective Alternative to Traditional MPP ...
DataWorks Summit
HBaseCon 2013: Apache Drill - A Community-driven Initiative to Deliver ANSI S...
HBaseCon 2013: Apache Drill - A Community-driven Initiative to Deliver ANSI S...
Cloudera, Inc.
Mixing Analytic Workloads with Greenplum and Apache Spark
Mixing Analytic Workloads with Greenplum and Apache Spark
VMware Tanzu
PostgreSQL 12: What is coming up?, Enterprise Postgres Day
PostgreSQL 12: What is coming up?, Enterprise Postgres Day
EDB
LLAP: long-lived execution in Hive
LLAP: long-lived execution in Hive
DataWorks Summit
NextGen Apache Hadoop MapReduce
NextGen Apache Hadoop MapReduce
Hortonworks
Ambari Meetup: YARN
Ambari Meetup: YARN
Hortonworks
Smart Enterprise Big Data Bus for the Modern Responsive Enterprise
Smart Enterprise Big Data Bus for the Modern Responsive Enterprise
DataWorks Summit
An overview of reference architectures for Postgres
An overview of reference architectures for Postgres
EDB
Pivotal: Hadoop for Powerful Processing of Unstructured Data for Valuable Ins...
Pivotal: Hadoop for Powerful Processing of Unstructured Data for Valuable Ins...
EMC
Meet HBase 2.0 and Phoenix 5.0
Meet HBase 2.0 and Phoenix 5.0
DataWorks Summit
Apache Hadoop YARN 2015: Present and Future
Apache Hadoop YARN 2015: Present and Future
DataWorks Summit
Cost-based query optimization in Apache Hive 0.14
Cost-based query optimization in Apache Hive 0.14
Julian Hyde
Stinger Initiative - Deep Dive
Stinger Initiative - Deep Dive
Hortonworks
More Related Content
What's hot
Hortonworks Technical Workshop: HBase and Apache Phoenix
Hortonworks Technical Workshop: HBase and Apache Phoenix
Hortonworks
5. pivotal hd 2013
5. pivotal hd 2013
Chiou-Nan Chen
Qubole @ AWS Meetup Bangalore - July 2015
Qubole @ AWS Meetup Bangalore - July 2015
Joydeep Sen Sarma
Apache Tez - Accelerating Hadoop Data Processing
Apache Tez - Accelerating Hadoop Data Processing
hitesh1892
The future of Essbase: Hybrid database format
The future of Essbase: Hybrid database format
Erik De Rouck
Apache Hadoop YARN - The Future of Data Processing with Hadoop
Apache Hadoop YARN - The Future of Data Processing with Hadoop
Hortonworks
Hive+Tez: A performance deep dive
Hive+Tez: A performance deep dive
t3rmin4t0r
Orca: A Modular Query Optimizer Architecture for Big Data
Orca: A Modular Query Optimizer Architecture for Big Data
EMC
Hive LLAP: A High Performance, Cost-effective Alternative to Traditional MPP ...
Hive LLAP: A High Performance, Cost-effective Alternative to Traditional MPP ...
DataWorks Summit
HBaseCon 2013: Apache Drill - A Community-driven Initiative to Deliver ANSI S...
HBaseCon 2013: Apache Drill - A Community-driven Initiative to Deliver ANSI S...
Cloudera, Inc.
Mixing Analytic Workloads with Greenplum and Apache Spark
Mixing Analytic Workloads with Greenplum and Apache Spark
VMware Tanzu
PostgreSQL 12: What is coming up?, Enterprise Postgres Day
PostgreSQL 12: What is coming up?, Enterprise Postgres Day
EDB
LLAP: long-lived execution in Hive
LLAP: long-lived execution in Hive
DataWorks Summit
NextGen Apache Hadoop MapReduce
NextGen Apache Hadoop MapReduce
Hortonworks
Ambari Meetup: YARN
Ambari Meetup: YARN
Hortonworks
Smart Enterprise Big Data Bus for the Modern Responsive Enterprise
Smart Enterprise Big Data Bus for the Modern Responsive Enterprise
DataWorks Summit
An overview of reference architectures for Postgres
An overview of reference architectures for Postgres
EDB
Pivotal: Hadoop for Powerful Processing of Unstructured Data for Valuable Ins...
Pivotal: Hadoop for Powerful Processing of Unstructured Data for Valuable Ins...
EMC
Meet HBase 2.0 and Phoenix 5.0
Meet HBase 2.0 and Phoenix 5.0
DataWorks Summit
Apache Hadoop YARN 2015: Present and Future
Apache Hadoop YARN 2015: Present and Future
DataWorks Summit
What's hot
(20)
Hortonworks Technical Workshop: HBase and Apache Phoenix
Hortonworks Technical Workshop: HBase and Apache Phoenix
5. pivotal hd 2013
5. pivotal hd 2013
Qubole @ AWS Meetup Bangalore - July 2015
Qubole @ AWS Meetup Bangalore - July 2015
Apache Tez - Accelerating Hadoop Data Processing
Apache Tez - Accelerating Hadoop Data Processing
The future of Essbase: Hybrid database format
The future of Essbase: Hybrid database format
Apache Hadoop YARN - The Future of Data Processing with Hadoop
Apache Hadoop YARN - The Future of Data Processing with Hadoop
Hive+Tez: A performance deep dive
Hive+Tez: A performance deep dive
Orca: A Modular Query Optimizer Architecture for Big Data
Orca: A Modular Query Optimizer Architecture for Big Data
Hive LLAP: A High Performance, Cost-effective Alternative to Traditional MPP ...
Hive LLAP: A High Performance, Cost-effective Alternative to Traditional MPP ...
HBaseCon 2013: Apache Drill - A Community-driven Initiative to Deliver ANSI S...
HBaseCon 2013: Apache Drill - A Community-driven Initiative to Deliver ANSI S...
Mixing Analytic Workloads with Greenplum and Apache Spark
Mixing Analytic Workloads with Greenplum and Apache Spark
PostgreSQL 12: What is coming up?, Enterprise Postgres Day
PostgreSQL 12: What is coming up?, Enterprise Postgres Day
LLAP: long-lived execution in Hive
LLAP: long-lived execution in Hive
NextGen Apache Hadoop MapReduce
NextGen Apache Hadoop MapReduce
Ambari Meetup: YARN
Ambari Meetup: YARN
Smart Enterprise Big Data Bus for the Modern Responsive Enterprise
Smart Enterprise Big Data Bus for the Modern Responsive Enterprise
An overview of reference architectures for Postgres
An overview of reference architectures for Postgres
Pivotal: Hadoop for Powerful Processing of Unstructured Data for Valuable Ins...
Pivotal: Hadoop for Powerful Processing of Unstructured Data for Valuable Ins...
Meet HBase 2.0 and Phoenix 5.0
Meet HBase 2.0 and Phoenix 5.0
Apache Hadoop YARN 2015: Present and Future
Apache Hadoop YARN 2015: Present and Future
Similar to Hive - Cost Based Optimizer
Cost-based query optimization in Apache Hive 0.14
Cost-based query optimization in Apache Hive 0.14
Julian Hyde
Stinger Initiative - Deep Dive
Stinger Initiative - Deep Dive
Hortonworks
Speed up Interactive Analytic Queries over Existing Big Data on Hadoop with P...
Speed up Interactive Analytic Queries over Existing Big Data on Hadoop with P...
viirya
Apache Tez – Present and Future
Apache Tez – Present and Future
Jianfeng Zhang
Apache Tez – Present and Future
Apache Tez – Present and Future
Rajesh Balamohan
February 2014 HUG : Tez Details and Insides
February 2014 HUG : Tez Details and Insides
Yahoo Developer Network
April 2013 HUG: The Stinger Initiative - Making Apache Hive 100 Times Faster
April 2013 HUG: The Stinger Initiative - Making Apache Hive 100 Times Faster
Yahoo Developer Network
Apache Tez : Accelerating Hadoop Query Processing
Apache Tez : Accelerating Hadoop Query Processing
Bikas Saha
La big datacamp2014_vikram_dixit
La big datacamp2014_vikram_dixit
Data Con LA
Datacenter 2014: HP - Brian Andersen
Datacenter 2014: HP - Brian Andersen
Mediehuset Ingeniøren Live
Managing your Hadoop Clusters with Apache Ambari
Managing your Hadoop Clusters with Apache Ambari
DataWorks Summit
Gunther hagleitner:apache hive & stinger
Gunther hagleitner:apache hive & stinger
hdhappy001
Apache Tez - A New Chapter in Hadoop Data Processing
Apache Tez - A New Chapter in Hadoop Data Processing
DataWorks Summit
Munich HUG 21.11.2013
Munich HUG 21.11.2013
Emil Andreas Siemes
Deploying and Managing Hadoop Clusters with AMBARI
Deploying and Managing Hadoop Clusters with AMBARI
DataWorks Summit
Apache Tez: Accelerating Hadoop Query Processing
Apache Tez: Accelerating Hadoop Query Processing
Hortonworks
Apache Tez: Accelerating Hadoop Query Processing
Apache Tez: Accelerating Hadoop Query Processing
DataWorks Summit
Cloud Optimized Big Data
Cloud Optimized Big Data
Joydeep Sen Sarma
SharePoint 2016 Upgrade Planning
SharePoint 2016 Upgrade Planning
Chandima Kulathilake
Cursor Implementation in Apache Phoenix
Cursor Implementation in Apache Phoenix
Biju Nair
Similar to Hive - Cost Based Optimizer
(20)
Cost-based query optimization in Apache Hive 0.14
Cost-based query optimization in Apache Hive 0.14
Stinger Initiative - Deep Dive
Stinger Initiative - Deep Dive
Speed up Interactive Analytic Queries over Existing Big Data on Hadoop with P...
Speed up Interactive Analytic Queries over Existing Big Data on Hadoop with P...
Apache Tez – Present and Future
Apache Tez – Present and Future
Apache Tez – Present and Future
Apache Tez – Present and Future
February 2014 HUG : Tez Details and Insides
February 2014 HUG : Tez Details and Insides
April 2013 HUG: The Stinger Initiative - Making Apache Hive 100 Times Faster
April 2013 HUG: The Stinger Initiative - Making Apache Hive 100 Times Faster
Apache Tez : Accelerating Hadoop Query Processing
Apache Tez : Accelerating Hadoop Query Processing
La big datacamp2014_vikram_dixit
La big datacamp2014_vikram_dixit
Datacenter 2014: HP - Brian Andersen
Datacenter 2014: HP - Brian Andersen
Managing your Hadoop Clusters with Apache Ambari
Managing your Hadoop Clusters with Apache Ambari
Gunther hagleitner:apache hive & stinger
Gunther hagleitner:apache hive & stinger
Apache Tez - A New Chapter in Hadoop Data Processing
Apache Tez - A New Chapter in Hadoop Data Processing
Munich HUG 21.11.2013
Munich HUG 21.11.2013
Deploying and Managing Hadoop Clusters with AMBARI
Deploying and Managing Hadoop Clusters with AMBARI
Apache Tez: Accelerating Hadoop Query Processing
Apache Tez: Accelerating Hadoop Query Processing
Apache Tez: Accelerating Hadoop Query Processing
Apache Tez: Accelerating Hadoop Query Processing
Cloud Optimized Big Data
Cloud Optimized Big Data
SharePoint 2016 Upgrade Planning
SharePoint 2016 Upgrade Planning
Cursor Implementation in Apache Phoenix
Cursor Implementation in Apache Phoenix
Recently uploaded
WSO2Con2024 - Enabling Transactional System's Exponential Growth With Simplicity
WSO2Con2024 - Enabling Transactional System's Exponential Growth With Simplicity
WSO2
Devoxx UK 2024 - Going serverless with Quarkus, GraalVM native images and AWS...
Devoxx UK 2024 - Going serverless with Quarkus, GraalVM native images and AWS...
Bert Jan Schrijver
WSO2CON 2024 - How to Run a Security Program
WSO2CON 2024 - How to Run a Security Program
WSO2
%+27788225528 love spells in Boston Psychic Readings, Attraction spells,Bring...
%+27788225528 love spells in Boston Psychic Readings, Attraction spells,Bring...
masabamasaba
%+27788225528 love spells in Atlanta Psychic Readings, Attraction spells,Brin...
%+27788225528 love spells in Atlanta Psychic Readings, Attraction spells,Brin...
masabamasaba
WSO2CON2024 - It's time to go Platformless
WSO2CON2024 - It's time to go Platformless
WSO2
WSO2CON 2024 Slides - Open Source to SaaS
WSO2CON 2024 Slides - Open Source to SaaS
WSO2
%in Soweto+277-882-255-28 abortion pills for sale in soweto
%in Soweto+277-882-255-28 abortion pills for sale in soweto
masabamasaba
tonesoftg
tonesoftg
lanshi9
Announcing Codolex 2.0 from GDK Software
Announcing Codolex 2.0 from GDK Software
Jim McKeeth
Crypto Cloud Review - How To Earn Up To $500 Per DAY Of Bitcoin 100% On AutoP...
Crypto Cloud Review - How To Earn Up To $500 Per DAY Of Bitcoin 100% On AutoP...
SelfMade bd
WSO2CON 2024 - Freedom First—Unleashing Developer Potential with Open Source
WSO2CON 2024 - Freedom First—Unleashing Developer Potential with Open Source
WSO2
WSO2Con2024 - WSO2's IAM Vision: Identity-Led Digital Transformation
WSO2Con2024 - WSO2's IAM Vision: Identity-Led Digital Transformation
WSO2
%in tembisa+277-882-255-28 abortion pills for sale in tembisa
%in tembisa+277-882-255-28 abortion pills for sale in tembisa
masabamasaba
%+27788225528 love spells in Huntington Beach Psychic Readings, Attraction sp...
%+27788225528 love spells in Huntington Beach Psychic Readings, Attraction sp...
masabamasaba
AI & Machine Learning Presentation Template
AI & Machine Learning Presentation Template
Presentation.STUDIO
%in kempton park+277-882-255-28 abortion pills for sale in kempton park
%in kempton park+277-882-255-28 abortion pills for sale in kempton park
masabamasaba
WSO2CON 2024 - Cloud Native Middleware: Domain-Driven Design, Cell-Based Arch...
WSO2CON 2024 - Cloud Native Middleware: Domain-Driven Design, Cell-Based Arch...
WSO2
+971565801893>>SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHAB...
+971565801893>>SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHAB...
Health
WSO2Con2024 - From Code To Cloud: Fast Track Your Cloud Native Journey with C...
WSO2Con2024 - From Code To Cloud: Fast Track Your Cloud Native Journey with C...
WSO2
Recently uploaded
(20)
WSO2Con2024 - Enabling Transactional System's Exponential Growth With Simplicity
WSO2Con2024 - Enabling Transactional System's Exponential Growth With Simplicity
Devoxx UK 2024 - Going serverless with Quarkus, GraalVM native images and AWS...
Devoxx UK 2024 - Going serverless with Quarkus, GraalVM native images and AWS...
WSO2CON 2024 - How to Run a Security Program
WSO2CON 2024 - How to Run a Security Program
%+27788225528 love spells in Boston Psychic Readings, Attraction spells,Bring...
%+27788225528 love spells in Boston Psychic Readings, Attraction spells,Bring...
%+27788225528 love spells in Atlanta Psychic Readings, Attraction spells,Brin...
%+27788225528 love spells in Atlanta Psychic Readings, Attraction spells,Brin...
WSO2CON2024 - It's time to go Platformless
WSO2CON2024 - It's time to go Platformless
WSO2CON 2024 Slides - Open Source to SaaS
WSO2CON 2024 Slides - Open Source to SaaS
%in Soweto+277-882-255-28 abortion pills for sale in soweto
%in Soweto+277-882-255-28 abortion pills for sale in soweto
tonesoftg
tonesoftg
Announcing Codolex 2.0 from GDK Software
Announcing Codolex 2.0 from GDK Software
Crypto Cloud Review - How To Earn Up To $500 Per DAY Of Bitcoin 100% On AutoP...
Crypto Cloud Review - How To Earn Up To $500 Per DAY Of Bitcoin 100% On AutoP...
WSO2CON 2024 - Freedom First—Unleashing Developer Potential with Open Source
WSO2CON 2024 - Freedom First—Unleashing Developer Potential with Open Source
WSO2Con2024 - WSO2's IAM Vision: Identity-Led Digital Transformation
WSO2Con2024 - WSO2's IAM Vision: Identity-Led Digital Transformation
%in tembisa+277-882-255-28 abortion pills for sale in tembisa
%in tembisa+277-882-255-28 abortion pills for sale in tembisa
%+27788225528 love spells in Huntington Beach Psychic Readings, Attraction sp...
%+27788225528 love spells in Huntington Beach Psychic Readings, Attraction sp...
AI & Machine Learning Presentation Template
AI & Machine Learning Presentation Template
%in kempton park+277-882-255-28 abortion pills for sale in kempton park
%in kempton park+277-882-255-28 abortion pills for sale in kempton park
WSO2CON 2024 - Cloud Native Middleware: Domain-Driven Design, Cell-Based Arch...
WSO2CON 2024 - Cloud Native Middleware: Domain-Driven Design, Cell-Based Arch...
+971565801893>>SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHAB...
+971565801893>>SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHAB...
WSO2Con2024 - From Code To Cloud: Fast Track Your Cloud Native Journey with C...
WSO2Con2024 - From Code To Cloud: Fast Track Your Cloud Native Journey with C...
Hive - Cost Based Optimizer
1.
Cost Based Optimizer
in Apache Hive Page 1 Laljo John Pullokkaran 12/03/2014
2.
Agenda Page 2
• Status • Performance Numbers • Architecture • Future Directions • Q&A
3.
© Hortonworks Inc.
2013 Status Page 3 •Merged in to Apache Hive Trunk • Included in Hive 14.0 • All of Hive Test passes with CBO • ~ 3 man years of development effort • Contributors: Harish Bhutani, Ashutosh Chauhan, Julian Hyde, Sergey Shelukhin, Pengcheng Xiong, Mostafa Moktar, Laljo John Pullokkaran
4.
Performance Numbers ©
Hortonworks Inc. 2013 Page 4 • Average gains of 2X on TPCDS queries • 30X gains with TPCDS query 29
5.
Performance Numbers -
Query Break Down © Hortonworks Inc. 2013 Page 5
6.
Performance Numbers –
Why? © Hortonworks Inc. 2013 Page 6 • Reduced Intermediary data set • Bushy Joins results in better parallelism • Enables more Map Join as opposed to shuffle Join
7.
Architecture - Overview
© Hortonworks Inc. 2013 Page 7 Hive SQL Hive AST Hive QB Tree Hive AST Optimized Calcite Plan Calcite Plan Hive Operator Tree Optimized Hive OP tree TEZ/MR DAG
8.
Architecture - Optimizations
in Calcite © Hortonworks Inc. 2013 Page 8 • Projection Pruning • Partition Pruning • Predicate Push Down • Transitive Predicate inference • Push Down of Semi Joins • Join Simplification • Predicate Factorization • Join Ordering
9.
Architecture - Join
Ordering © Hortonworks Inc. 2013 Page 9 • Requires Column Statistics • Cost Model uses NDV based heuristics • Cost Model favors bushy join plans • Cost Model is agnostic of execution engine
10.
Future Directions ©
Hortonworks Inc. 2013 Page 10 • Histogram support • Join Algorithm selection • Calcite return path
Download now