858 views

benchmarks-sigmod09

This document compares approaches to large-scale data analysis using MapReduce and parallel database management systems (DBMSs). It presents results from running a benchmark of tasks on an open-source MapReduce system (Hadoop) and two parallel DBMSs using a cluster of 100 nodes. The parallel DBMSs showed significantly better performance than MapReduce for the tasks, but took much longer to load data and tune executions. The document discusses architectural differences between the approaches and their performance implications.

Technology◦

More Related Content

PDF

Parallel Data Processing with MapReduce: A Survey

byKyong-Ha Lee

PDF

HadoopXML: A Suite for Parallel Processing of Massive XML Data with Multiple ...

byKyong-Ha Lee

PDF

Hadoop Mapreduce Performance Enhancement Using In-Node Combiners

byijcsit

PPTX

KIISE:SIGDB Workshop presentation.

byKyong-Ha Lee

PDF

Implementation of p pic algorithm in map reduce to handle big data

byeSAT Publishing House

PDF

E031201032036

byijceronline

PDF

Application of MapReduce in Cloud Computing

byMohammad Mustaqeem

PDF

MapReduce in Cloud Computing

byMohammad Mustaqeem

Parallel Data Processing with MapReduce: A Survey

byKyong-Ha Lee

HadoopXML: A Suite for Parallel Processing of Massive XML Data with Multiple ...

byKyong-Ha Lee

Hadoop Mapreduce Performance Enhancement Using In-Node Combiners

byijcsit

KIISE:SIGDB Workshop presentation.

byKyong-Ha Lee

Implementation of p pic algorithm in map reduce to handle big data

byeSAT Publishing House

E031201032036

byijceronline

Application of MapReduce in Cloud Computing

byMohammad Mustaqeem

MapReduce in Cloud Computing

byMohammad Mustaqeem

What's hot

PPTX

CloudMC: A cloud computing map-reduce implementation for radiotherapy. RUBEN ...

byBig Data Spain

PPTX

Optimal Execution Of MapReduce Jobs In Cloud - Voices 2015

byDeanna Kosaraju

PPTX

Pig Experience

byTilani Gunawardena PhD(UNIBAS), BSc(Pera), FHEA(UK), CEng, MIESL

PDF

SASUM: A Sharing-based Approach to Fast Approximate Subgraph Matching for Lar...

byKyong-Ha Lee

PDF

Scalable and Adaptive Graph Querying with MapReduce

byKyong-Ha Lee

PDF

MAP REDUCE BASED ON CLOAK DHT DATA REPLICATION EVALUATION

byIJDMS

PDF

IRJET - Evaluating and Comparing the Two Variation with Current Scheduling Al...

byIRJET Journal

PDF

A Brief on MapReduce Performance

byAM Publications

PDF

A sql implementation on the map reduce framework

byeldariof

PDF

Python in an Evolving Enterprise System (PyData SV 2013)

byPyData

PDF

H04502048051

byijceronline

PDF

Jovian DATA: A multidimensional database for the cloud

byBharat Rane

PPTX

MapReduce: A useful parallel tool that still has room for improvement

byKyong-Ha Lee

PDF

Large Scale Data Analysis with Map/Reduce, part I

byMarin Dimitrov

PDF

MapReduce: Distributed Computing for Machine Learning

bybutest

PDF

Enhancing Performance and Fault Tolerance of Hadoop Cluster

byIRJET Journal

PPTX

MapReduce

byTilani Gunawardena PhD(UNIBAS), BSc(Pera), FHEA(UK), CEng, MIESL

PDF

Eg4301808811

byIJERA Editor

PDF

PERFORMANCE EVALUATION OF BIG DATA PROCESSING OF CLOAK-REDUCE

byijdpsjournal

PDF

A NOBEL HYBRID APPROACH FOR EDGE DETECTION

byijcses

CloudMC: A cloud computing map-reduce implementation for radiotherapy. RUBEN ...

byBig Data Spain

Optimal Execution Of MapReduce Jobs In Cloud - Voices 2015

byDeanna Kosaraju

Pig Experience

byTilani Gunawardena PhD(UNIBAS), BSc(Pera), FHEA(UK), CEng, MIESL

SASUM: A Sharing-based Approach to Fast Approximate Subgraph Matching for Lar...

byKyong-Ha Lee

Scalable and Adaptive Graph Querying with MapReduce

byKyong-Ha Lee

MAP REDUCE BASED ON CLOAK DHT DATA REPLICATION EVALUATION

byIJDMS

IRJET - Evaluating and Comparing the Two Variation with Current Scheduling Al...

byIRJET Journal

A Brief on MapReduce Performance

byAM Publications

A sql implementation on the map reduce framework

byeldariof

Python in an Evolving Enterprise System (PyData SV 2013)

byPyData

H04502048051

byijceronline

Jovian DATA: A multidimensional database for the cloud

byBharat Rane

MapReduce: A useful parallel tool that still has room for improvement

byKyong-Ha Lee

Large Scale Data Analysis with Map/Reduce, part I

byMarin Dimitrov

MapReduce: Distributed Computing for Machine Learning

bybutest

Enhancing Performance and Fault Tolerance of Hadoop Cluster

byIRJET Journal

MapReduce

byTilani Gunawardena PhD(UNIBAS), BSc(Pera), FHEA(UK), CEng, MIESL

Eg4301808811

byIJERA Editor

PERFORMANCE EVALUATION OF BIG DATA PROCESSING OF CLOAK-REDUCE

byijdpsjournal

A NOBEL HYBRID APPROACH FOR EDGE DETECTION

byijcses

Similar to benchmarks-sigmod09

PDF

Where Does Big Data Meet Big Database - QCon 2012

byBen Stopford

PPTX

Microsoft's Big Play for Big Data

byAndrew Brust

PPT

Microsoft's Big Play for Big Data- Visual Studio Live! NY 2012

byAndrew Brust

PPTX

A Survey of Advanced Non-relational Database Systems: Approaches and Applicat...

byQian Lin

PDF

Sqlmr

byAjay Ohri

PDF

Sqlmr

byblogboy

PDF

Sqlmr

byTeradata Aster

PDF

Sqlmr

byMap Reduce

PPTX

Silicon valley nosql meetup april 2012

byInfiniteGraph

PPTX

Information processing architectures

byRaji Gogulapati

PPTX

"Navigating the Database Universe" by Dr. Michael Stonebraker and Scott Jarr,...

bylisapaglia

PPTX

Hadoop DB

byTilani Gunawardena PhD(UNIBAS), BSc(Pera), FHEA(UK), CEng, MIESL

PPTX

Microsoft Openness Mongo DB

byHeriyadi Janwar

PDF

What Does Big Data Mean and Who Will Win

byBigDataCloud

PDF

Big data: analyzing large data sets

byR A Akerkar

PDF

Nosql intro

byHoang Nguyen

PPTX

Hadoop World 2011: Building Scalable Data Platforms ; Hadoop & Netezza Deploy...

byKrishnan Parasuraman

PPTX

Intro to Big Data and NoSQL

byDon Demcsak

PPTX

NoSQL for the SQL Server Pro

byLynn Langit

PDF

B036407011

bytheijes

Where Does Big Data Meet Big Database - QCon 2012

byBen Stopford

Microsoft's Big Play for Big Data

byAndrew Brust

Microsoft's Big Play for Big Data- Visual Studio Live! NY 2012

byAndrew Brust

A Survey of Advanced Non-relational Database Systems: Approaches and Applicat...

byQian Lin

Sqlmr

byAjay Ohri

Sqlmr

byblogboy

Sqlmr

byTeradata Aster

Sqlmr

byMap Reduce

Silicon valley nosql meetup april 2012

byInfiniteGraph

Information processing architectures

byRaji Gogulapati

"Navigating the Database Universe" by Dr. Michael Stonebraker and Scott Jarr,...

bylisapaglia

Hadoop DB

byTilani Gunawardena PhD(UNIBAS), BSc(Pera), FHEA(UK), CEng, MIESL

Microsoft Openness Mongo DB

byHeriyadi Janwar

What Does Big Data Mean and Who Will Win

byBigDataCloud

Big data: analyzing large data sets

byR A Akerkar

Nosql intro

byHoang Nguyen

Hadoop World 2011: Building Scalable Data Platforms ; Hadoop & Netezza Deploy...

byKrishnan Parasuraman

Intro to Big Data and NoSQL

byDon Demcsak

NoSQL for the SQL Server Pro

byLynn Langit

B036407011

bytheijes

More from Hiroshi Ono

PDF

Voltdb - wikipedia

benchmarks-sigmod09

More Related Content

What's hot

Similar to benchmarks-sigmod09

More from Hiroshi Ono

Recently uploaded