Embed presentation
Downloaded 44 times















This document compares approaches to large-scale data analysis using MapReduce and parallel database management systems (DBMSs). It presents results from running a benchmark of tasks on an open-source MapReduce system (Hadoop) and two parallel DBMSs using a cluster of 100 nodes. The parallel DBMSs showed significantly better performance than MapReduce for the tasks, but took much longer to load data and tune executions. The document discusses architectural differences between the approaches and their performance implications.













