SlideShare a Scribd company logo
1 of 8
Download to read offline
Hadoop Business
Cases
Dell | Hadoop White Paper
By Joey Jablonski




Dell | Hadoop White Paper Series
Dell | Hadoop White Paper Series: Hadoop Business Cases




Table of Contents
Hadoop brings new capabilities                                                                                      3
Management of the data fire hose                                                                                    5
Hadoop enters the enterprise                                                                                        5
   Analytics                                                                                                        6
   Risk modeling                                                                                                    6

Hadoop ecosystem                                                                                                    6
Hadoop futures                                                                                                      7
About the author                                                                                                    7
Special thanks                                                                                                      7
About Dell Next Generation Computing Solutions                                                                      7
References                                                                                                          8
To learn more                                                                                                       8




This White Paper is for informational purposes only, and may contain typographical errors and technical inaccuracies.
The content is provided as is, without express or implied warranties of any kind.


© 2011 Dell Inc. All rights reserved. Reproduction of this material in any manner whatsoever without the express written permission of Dell Inc. is strictly forbidden.
For more information, contact Dell. Dell, the Dell logo, and the Dell badge, and PowerEdge are trademarks of Dell Inc.




                                                                                    2
Dell | Hadoop White Paper Series: Hadoop Business Cases

Hadoop brings new capabilities
Growing data volumes and interconnected systems create a need for a tool capable of building the next generation of
analytics and data management solutions. Hadoop provides a framework for your company to analyze and manage
growing volumes of data while storing data longer than previously possible at a competitive price point. By extending the
life of data and not discarding it, you can enable staff to review historic data in new ways and analyze it as new methods
emerge.

The Hadoop taxonomy is outlined in Figure 1, showing the components common to all Hadoop environments. These
components are part of the core Apache Hadoop project. The Hadoop architecture is very pluggable, allowing any
component to be replaced with one optimized for a specific workload, while allowing a large variety of data presentation
layers to utilize the data stored in Hadoop. The vertical bars on the right designate components not included as part of a
default Hadoop distrobution; these components are commonly provided by IT providers to enhance their Hadoop
offerings.




Figure 1. Core components of a Hadoop deployment



In addition to the core Hadoop components shown in Figure 1, a variety of projects have developed as part of the
Hadoop ecosystem to provide specific solutions for using data within Hadoop in common ways. Many projects have
evolved for storing and processing specific types of data within Hadoop, allowing many industries to create specific
solutions built on a common storage and compute engine within Hadoop.




                                                               3
Dell | Hadoop White Paper Series: Hadoop Business Cases




Figure 2. The core Hadoop ecosystem with additional tools for data presentation



Components of the Hadoop ecosystem can be built on one or more of the three primary Hadoop use cases:


        Compute                                 Storage                                     Database

 Hadoop is commonly used as a           One primary component of the                The Hadoop ecosystem contains
 distributed compute platform for       Hadoop ecosystem is the Hadoop              components that allow the data
 analyzing or processing large          Distributed File System (HDFS). The         within the HDFS to be presented
 amounts of data. The Hadoop            HDFS allows users to have a single          in a SQL interface. This allows the
 ecosystem provides APIs                addressable namespace, spread               use of standard functions,
 necessary to distribute and track      across many hundreds or thousands           including INSERT, SELECT, and
 workloads as they are run on large     of servers, creating a single large file    UPDATE of data within the
 numbers of distributed machines.       system. Hadoop manages the                  Hadoop environment, with
                                        replication of the data within this file    minimal code changes to existing
                                        system to ensure hardware failures do       applications. These components
                                        not lead to data loss. Many                 allow developers to quickly
                                        organizations will use this scalable file   access the data stored within a
                                        system as a place to store large            Hadoop environment with tools
                                        amounts of data that is then accessed       they are experienced at using.
                                        by jobs run within Hadoop or by
                                        external systems.




Hadoop provides a consistent, scalable base of tools within an organization for storing, managing, and analyzing data,
without being tied to any specific department or framework. Hadoop enables your organization to use a single set of data


                                                               4
Dell | Hadoop White Paper Series: Hadoop Business Cases

for all departments’ reporting, analysis, and research needs. This single source enables better quality results and eliminates
the cost and complexity of managing multiple islands of data.

Business is changing quickly; the goals of any individual tool today may not be the same tomorrow. The same goes for
organizations and their areas of focus within a large corporation. Making the decision about what data to discard in the
current floods of data most companies are experiencing is a difficult challenge. Hadoop enables your company to store
more data, with less overhead than ever before. This enables your staff to ask questions of that data and analyze it in new
ways later that are not even thought of today.


Management of the data fire hose
The evolving community around “big data” (the industry
term for environments containing large volumes of
related but un-structured data) finds new ways for
analyzing and managing growing volumes of data. We
are also exploring the creation of new ways for making
sense of otherwise large piles of previously                                               Decisions
misunderstood data sets.

On any given day, most companies do not know what
questions to ask of certain data. When this has occurred
in the past, companies would purge that data because of
the cost of storing data with indeterminate value. Today,
                                                                            Questions
companies exploit tools like Hadoop for storing that data
for much longer periods of time, often until such time
that staff find new ways to understand how the data can
be used and what questions can be asked of the data.
                                                                                                         Data

Today, data is as valuable as any software written
by a company or any product it designs.

The data is the component that drives next-generation
products and enables maximum revenue attainment
from existing products. Hadoop provides a low-barrier-to-               Figure 3. Questions bring out the value of data.
entry solution for storing the additional data being created by
today’s companies.


Hadoop enters the enterprise
Hadoop is rarely initially deployed as a company-wide data analytics solution; more often, Hadoop is deployed by a single
department or organization that sees it as a solution to certain challenges. Hadoop inevitably is then used by more and
more departments, becoming a more critical piece of the corporation’s storage and analytics solutions.

Hadoop deployments commonly start with a smaller deployment within a virtual environment; this could be virtual
machines hosted on premise or in a public cloud environment. This method enables your IT staff to learn about managing
Hadoop and enables your developers to begin testing ideas they have about uses of Hadoop. This use of virtual
infrastructure will usually stop as soon as real workloads are tested, and this usually signifies a move to physical hardware
dedicated to Hadoop. This change is primarily driven by data volumes and performance needs. At a certain inflection
point, moving data to a public cloud becomes too time-consuming, so companies look to internally hosted and managed
Hadoop solutions.

It is important to understand the evolution of Hadoop in your environment to ensure that you adequatly plan each stage
of the evolution. Hadoop can rapidly become a large, complex component of your information technology (IT)
department. By understanding how Hadoop commonly evolves, you can better manage that evolution in your
environment and ensure Hadoop meets your company’s needs, without causing an undue operations burden.


                                                                  5
Dell | Hadoop White Paper Series: Hadoop Business Cases

Analytics
Analytics are becoming a more critical component in all business environments. Analytics are being used to provide near
real-time reporting on the state of a business, allowing leaders to make rapid decisions to correct the course of an
organization or to capitalize on the needs of the market. The emerging market of tools for analytics allows companies to
manipulate the raw data they get from a variety of sources and make intelligent decisions about the state of the business.

Many marketing and sales-focused organizations are now using Hadoop as the core of their analytics programs. Hadoop
is used to store a central copy of customer data and product usage information, allowing those developing pricing
models and sales models to refine the data in new ways, looking for new relationships. These analytics allow the analysts
to look for new relationships, not previously possible with traditional, separate relational database-driven data warehouse
environments.

Another example of using analytics to minimize operational expenses is in IT. By leveraging the hyperscale compute and
storage capabilities of Hadoop, your IT personnel can optimize system reporting, analyze system performance versus
operational expenses, detect potential cases for system failures, and minimize system downtime. Your CIO and IT
managers can analyze the most optimal operational models, determine operational inflection points, and plan the next
budget cycle.

Risk modeling
Many financial services firms are beginning to use Hadoop for risk modeling. Hadoop provides a base for storing and
processing large amounts of data, enabling firms to focus on algorithm development and optimization. Hadoop enables
companies to avoid the difficulty in massively parallel programming, while exploiting the capabilitie s provided by
commodity hardware and software.

By using Hadoop to enable your company’s risk modeling projects, data from many different sources can be pulled into a
single location and modeled by a single set of algorithms. A traditionally large company required risk modeling to occur at
business unit or departmental levels. This modeling was commonly was done in different ways by the different financial
analyst teams. Hadoop enables a single, companywide team to model a company’s exposure to risk and understand what
dynamics are at play against that risk position.




Figure 4. Hadoop enables a single, companywide team to model exposure to risk.


Hadoop ecosystem
The Hadoop ecosystem is a rapidly growing and evolving set of tools for Hadoop operations and tools specific to verticals
and uses for Hadoop. The Hadoop ecosystem contains many tools specific to operational use cases and the manipulation
of specific types of data. This large ecosystem makes Hadoop a strong platform for companies as they evaluate and grow
their analytics or business intelligence environments. Some of the most common tools within the Hadoop ecosystem for
supporting scale-out environments include Flume, Sqoop, and Zookeeper.

Flume is a commonly used tool within the Hadoop ecosystem for handling streaming data. Flume provides a framework
for agents on one or many servers to collect events and store them in a single HDFS namespace. Flume also provides the
necessary frameworks for developing work streams for processing those events, reporting on them, and taking action on
them.

Sqoop is a component within the Hadoop ecosystem for enabling connectivity between Hadoop environments and
traditional SQL environments, including relational databases and data warehouses. Sqoop enables automated processes
to be developed for moving data between Hadoop and data warehouses, enabling data warehouses to have access to

                                                                6
Dell | Hadoop White Paper Series: Hadoop Business Cases

large amounts of data traditionally stored in other environments or not available at all to business intelligence d evelopers
and analysts.

Zookeeper is a component commonly used by applications that exploit data stored in the HDFS. Zookeeper provides a
framework for managing distributed applications and the locks between them for consistent data access, providing
naming services, and providing synchronization between separate servers and processes that are part of a single, larger
application.


Hadoop futures
Most organizations have used specialized teams for business intelligence development and exploitation of a compan y’s
data. Hadoop enables that functionality to be pushed to a larger group of staff within the organization. Hadoop provides a
single unified interface and data store for many staff across all departments to use when analyzing company statistics and
developing new methods for success in a market.

Hadoop empowers all your employees to think of new ways to improve the bottom line and allows them access to the
necessary information to test their theories, develop strategies, and report on changes in the business.

Hadoop provides the base software and associated ecosystem to manage growing amounts of data. Hadoop enables
your company to store more data than ever before and provide it to a larger portion of the staff for analysis both today
and tomorrow. Hadoop can be used to enable near real-time decision making by your company leadership and allow
your staff to test new ideas and analyze data in new ways.


About the author
Joey Jablonski is a principal solution architect with Dell’s Data Center Solutions team. Joey works to define and
implement Dell’s solutions for Big Data, including solutions based on Apache Hadoop. Joey has spent more than 10 years
working in high performance computing, with an emphasis on interconnects, including Infiniband and parallel fi le
systems. Joey has led technical solution design and implementation at Sun Microsystems and Hewlett-Packard, as well as
consulted for customers, including Sandia National Laboratories, BP, ExxonMobil, E*Trade, Juelich Supercomputing
Centre, and Clumeq.


Special thanks
The author extends special thanks to:
    Rob Hirschfeld, Principal Cloud Solutions Architect, Dell
    Aurelian Dumitru, Principal Cloud Solutions Architect, Dell
    John Igoe, Executive Director, Next Generation Computing Solutions, Dell


About Dell Next Generation Computing Solutions
When cloud computing is the core of your business and its efficiency and vitality underpin your success, the Dell Next
Generation Computing Solutions are Dell’s response to your unique needs. We understand your challeng es—from
compute and power density to global scaling and environmental impact. Dell has the knowledge and expertise to tune
your company’s “factory” for maximum performance and efficiency.

Dell’s Next Generation Computing Solutions provide operational models backed by unique product solutions to meet the
needs of companies at all stages of their lifecycle. Solutions are designed to meet the needs of small startups while
allowing scalability as your company grows.

Deployment and support are tailored to your unique operational requirements. Dell’s Cloud Computing Solutions can
help you minimize the tangible operating costs that have hyper-scale impact on your business results.




                                                                 7
Dell | Hadoop White Paper Series: Hadoop Business Cases

References
Big Data
http://en.wikipedia.org/wiki/Big_data

Hadoop
http://hadoop.apache.org/

Public Cloud
http://en.wikipedia.org/wiki/Cloud_computing

Flume
https://github.com/cloudera/flume

HBase
http://wiki.apache.org/hadoop/Hbase?action=show&redirect=HBase

Zookeeper
http://wiki.apache.org/hadoop/ZooKeeper




    To learn more
    To learn more about Dell cloud solutions,
    contact your Dell representative or visit:
    www.dell.com/cloud


©2011 Dell Inc. All rights reserved. Trademarks and trade names may be used in this document to refer to either the entities claiming the marks and names or their products. Specifications are
correct at date of publication but are subject to availability or ch ange without notice at any time. Dell and its affiliates cannot be responsible for errors or omissions in typography or
photography. Dell’s Terms and Conditions of Sales and Service apply and are available on request. Dell service offerings do not affect consumer’s statutory rights.

Dell, the DELL logo, and the DELL badge, PowerConnect, and PowerVault are trademarks of Dell Inc.




                                                                                              8

More Related Content

What's hot

Using hadoop to expand data warehousing
Using hadoop to expand data warehousingUsing hadoop to expand data warehousing
Using hadoop to expand data warehousingDataWorks Summit
 
50 must read hadoop interview questions & answers - whizlabs
50 must read hadoop interview questions & answers - whizlabs50 must read hadoop interview questions & answers - whizlabs
50 must read hadoop interview questions & answers - whizlabsWhizlabs
 
Big data introduction, Hadoop in details
Big data introduction, Hadoop in detailsBig data introduction, Hadoop in details
Big data introduction, Hadoop in detailsMahmoud Yassin
 
Strata + Hadoop World 2012: Data Science on Hadoop: How Cloudera Impala Unloc...
Strata + Hadoop World 2012: Data Science on Hadoop: How Cloudera Impala Unloc...Strata + Hadoop World 2012: Data Science on Hadoop: How Cloudera Impala Unloc...
Strata + Hadoop World 2012: Data Science on Hadoop: How Cloudera Impala Unloc...Cloudera, Inc.
 
Hw09 Data Processing In The Enterprise
Hw09   Data Processing In The EnterpriseHw09   Data Processing In The Enterprise
Hw09 Data Processing In The EnterpriseCloudera, Inc.
 
Big data with Hadoop - Introduction
Big data with Hadoop - IntroductionBig data with Hadoop - Introduction
Big data with Hadoop - IntroductionTomy Rhymond
 
Overview of Big data, Hadoop and Microsoft BI - version1
Overview of Big data, Hadoop and Microsoft BI - version1Overview of Big data, Hadoop and Microsoft BI - version1
Overview of Big data, Hadoop and Microsoft BI - version1Thanh Nguyen
 
Why hadoop for data science?
Why hadoop for data science?Why hadoop for data science?
Why hadoop for data science?Hortonworks
 
Oracle Unified Information Architeture + Analytics by Example
Oracle Unified Information Architeture + Analytics by ExampleOracle Unified Information Architeture + Analytics by Example
Oracle Unified Information Architeture + Analytics by ExampleHarald Erb
 
Intro to HDFS and MapReduce
Intro to HDFS and MapReduceIntro to HDFS and MapReduce
Intro to HDFS and MapReduceRyan Tabora
 
Big Data Analysis and Its Scheduling Policy – Hadoop
Big Data Analysis and Its Scheduling Policy – HadoopBig Data Analysis and Its Scheduling Policy – Hadoop
Big Data Analysis and Its Scheduling Policy – HadoopIOSR Journals
 
BI, Hive or Big Data Analytics?
BI, Hive or Big Data Analytics? BI, Hive or Big Data Analytics?
BI, Hive or Big Data Analytics? Datameer
 
Hadoop by kamran khan
Hadoop by kamran khanHadoop by kamran khan
Hadoop by kamran khanKamranKhan587
 
Introducing the hadoop ecosystem
Introducing the hadoop ecosystemIntroducing the hadoop ecosystem
Introducing the hadoop ecosystemGeert Van Landeghem
 
Hdp r-google charttools-webinar-3-5-2013 (2)
Hdp r-google charttools-webinar-3-5-2013 (2)Hdp r-google charttools-webinar-3-5-2013 (2)
Hdp r-google charttools-webinar-3-5-2013 (2)Hortonworks
 
Etu L2 Training - Hadoop 企業應用實作
Etu L2 Training - Hadoop 企業應用實作Etu L2 Training - Hadoop 企業應用實作
Etu L2 Training - Hadoop 企業應用實作James Chen
 
Big Data and Hadoop Basics
Big Data and Hadoop BasicsBig Data and Hadoop Basics
Big Data and Hadoop BasicsSonal Tiwari
 

What's hot (20)

Using hadoop to expand data warehousing
Using hadoop to expand data warehousingUsing hadoop to expand data warehousing
Using hadoop to expand data warehousing
 
50 must read hadoop interview questions & answers - whizlabs
50 must read hadoop interview questions & answers - whizlabs50 must read hadoop interview questions & answers - whizlabs
50 must read hadoop interview questions & answers - whizlabs
 
Actian DataFlow Whitepaper
Actian DataFlow WhitepaperActian DataFlow Whitepaper
Actian DataFlow Whitepaper
 
Big data introduction, Hadoop in details
Big data introduction, Hadoop in detailsBig data introduction, Hadoop in details
Big data introduction, Hadoop in details
 
Strata + Hadoop World 2012: Data Science on Hadoop: How Cloudera Impala Unloc...
Strata + Hadoop World 2012: Data Science on Hadoop: How Cloudera Impala Unloc...Strata + Hadoop World 2012: Data Science on Hadoop: How Cloudera Impala Unloc...
Strata + Hadoop World 2012: Data Science on Hadoop: How Cloudera Impala Unloc...
 
Hw09 Data Processing In The Enterprise
Hw09   Data Processing In The EnterpriseHw09   Data Processing In The Enterprise
Hw09 Data Processing In The Enterprise
 
Oracle in Database Hadoop
Oracle in Database HadoopOracle in Database Hadoop
Oracle in Database Hadoop
 
Big data with Hadoop - Introduction
Big data with Hadoop - IntroductionBig data with Hadoop - Introduction
Big data with Hadoop - Introduction
 
Overview of Big data, Hadoop and Microsoft BI - version1
Overview of Big data, Hadoop and Microsoft BI - version1Overview of Big data, Hadoop and Microsoft BI - version1
Overview of Big data, Hadoop and Microsoft BI - version1
 
Why hadoop for data science?
Why hadoop for data science?Why hadoop for data science?
Why hadoop for data science?
 
Oracle Unified Information Architeture + Analytics by Example
Oracle Unified Information Architeture + Analytics by ExampleOracle Unified Information Architeture + Analytics by Example
Oracle Unified Information Architeture + Analytics by Example
 
Intro to HDFS and MapReduce
Intro to HDFS and MapReduceIntro to HDFS and MapReduce
Intro to HDFS and MapReduce
 
Big Data Analysis and Its Scheduling Policy – Hadoop
Big Data Analysis and Its Scheduling Policy – HadoopBig Data Analysis and Its Scheduling Policy – Hadoop
Big Data Analysis and Its Scheduling Policy – Hadoop
 
BI, Hive or Big Data Analytics?
BI, Hive or Big Data Analytics? BI, Hive or Big Data Analytics?
BI, Hive or Big Data Analytics?
 
Hadoop by kamran khan
Hadoop by kamran khanHadoop by kamran khan
Hadoop by kamran khan
 
Ddn 2017 10_dse_primer
Ddn 2017 10_dse_primerDdn 2017 10_dse_primer
Ddn 2017 10_dse_primer
 
Introducing the hadoop ecosystem
Introducing the hadoop ecosystemIntroducing the hadoop ecosystem
Introducing the hadoop ecosystem
 
Hdp r-google charttools-webinar-3-5-2013 (2)
Hdp r-google charttools-webinar-3-5-2013 (2)Hdp r-google charttools-webinar-3-5-2013 (2)
Hdp r-google charttools-webinar-3-5-2013 (2)
 
Etu L2 Training - Hadoop 企業應用實作
Etu L2 Training - Hadoop 企業應用實作Etu L2 Training - Hadoop 企業應用實作
Etu L2 Training - Hadoop 企業應用實作
 
Big Data and Hadoop Basics
Big Data and Hadoop BasicsBig Data and Hadoop Basics
Big Data and Hadoop Basics
 

Similar to Hadoop Business Cases

Similar to Hadoop Business Cases (20)

Big Data Hadoop Technology
Big Data Hadoop TechnologyBig Data Hadoop Technology
Big Data Hadoop Technology
 
Machine Learning Hadoop
Machine Learning HadoopMachine Learning Hadoop
Machine Learning Hadoop
 
Hadoop Training in Delhi
Hadoop Training in DelhiHadoop Training in Delhi
Hadoop Training in Delhi
 
Bigdata and hadoop
Bigdata and hadoopBigdata and hadoop
Bigdata and hadoop
 
Hadoop in the Enterprise
Hadoop in the EnterpriseHadoop in the Enterprise
Hadoop in the Enterprise
 
Big data and apache hadoop adoption
Big data and apache hadoop adoptionBig data and apache hadoop adoption
Big data and apache hadoop adoption
 
Introduction-to-Big-Data-and-Hadoop.pptx
Introduction-to-Big-Data-and-Hadoop.pptxIntroduction-to-Big-Data-and-Hadoop.pptx
Introduction-to-Big-Data-and-Hadoop.pptx
 
Hadoop info
Hadoop infoHadoop info
Hadoop info
 
Big data overview
Big data overviewBig data overview
Big data overview
 
Introduction to Apache hadoop
Introduction to Apache hadoopIntroduction to Apache hadoop
Introduction to Apache hadoop
 
View on big data technologies
View on big data technologiesView on big data technologies
View on big data technologies
 
Hadoop .pdf
Hadoop .pdfHadoop .pdf
Hadoop .pdf
 
A Glimpse of Bigdata - Introduction
A Glimpse of Bigdata - IntroductionA Glimpse of Bigdata - Introduction
A Glimpse of Bigdata - Introduction
 
Hadoop
HadoopHadoop
Hadoop
 
Hadoop in action
Hadoop in actionHadoop in action
Hadoop in action
 
HDFS
HDFSHDFS
HDFS
 
paper
paperpaper
paper
 
2.1-HADOOP.pdf
2.1-HADOOP.pdf2.1-HADOOP.pdf
2.1-HADOOP.pdf
 
What is hadoop
What is hadoopWhat is hadoop
What is hadoop
 
Big data Analytics Hadoop
Big data Analytics HadoopBig data Analytics Hadoop
Big data Analytics Hadoop
 

More from Joey Jablonski

PCA26 - Product Management in IT
PCA26 - Product Management in ITPCA26 - Product Management in IT
PCA26 - Product Management in ITJoey Jablonski
 
Feeding 10 Billion People with Cloud-Scale Compute and Analytics
Feeding 10 Billion People with  Cloud-Scale Compute and AnalyticsFeeding 10 Billion People with  Cloud-Scale Compute and Analytics
Feeding 10 Billion People with Cloud-Scale Compute and AnalyticsJoey Jablonski
 
Redefining Security for Big Data - Cassandra Summit 2013
Redefining Security for Big Data - Cassandra Summit 2013Redefining Security for Big Data - Cassandra Summit 2013
Redefining Security for Big Data - Cassandra Summit 2013Joey Jablonski
 
SNIA 2012 - Creating an Enterprise Hadoop Platform
SNIA 2012 - Creating an Enterprise Hadoop PlatformSNIA 2012 - Creating an Enterprise Hadoop Platform
SNIA 2012 - Creating an Enterprise Hadoop PlatformJoey Jablonski
 

More from Joey Jablonski (7)

PCA26 - Product Management in IT
PCA26 - Product Management in ITPCA26 - Product Management in IT
PCA26 - Product Management in IT
 
Feeding 10 Billion People with Cloud-Scale Compute and Analytics
Feeding 10 Billion People with  Cloud-Scale Compute and AnalyticsFeeding 10 Billion People with  Cloud-Scale Compute and Analytics
Feeding 10 Billion People with Cloud-Scale Compute and Analytics
 
Security for Big Data
Security for Big DataSecurity for Big Data
Security for Big Data
 
Big Data for Security
Big Data for SecurityBig Data for Security
Big Data for Security
 
Virtualized Hadoop
Virtualized HadoopVirtualized Hadoop
Virtualized Hadoop
 
Redefining Security for Big Data - Cassandra Summit 2013
Redefining Security for Big Data - Cassandra Summit 2013Redefining Security for Big Data - Cassandra Summit 2013
Redefining Security for Big Data - Cassandra Summit 2013
 
SNIA 2012 - Creating an Enterprise Hadoop Platform
SNIA 2012 - Creating an Enterprise Hadoop PlatformSNIA 2012 - Creating an Enterprise Hadoop Platform
SNIA 2012 - Creating an Enterprise Hadoop Platform
 

Recently uploaded

Spain Vs Italy 20 players confirmed for Spain's Euro 2024 squad, and three po...
Spain Vs Italy 20 players confirmed for Spain's Euro 2024 squad, and three po...Spain Vs Italy 20 players confirmed for Spain's Euro 2024 squad, and three po...
Spain Vs Italy 20 players confirmed for Spain's Euro 2024 squad, and three po...World Wide Tickets And Hospitality
 
Technical Data | Sig Sauer Easy6 BDX 1-6x24 | Optics Trade
Technical Data | Sig Sauer Easy6 BDX 1-6x24 | Optics TradeTechnical Data | Sig Sauer Easy6 BDX 1-6x24 | Optics Trade
Technical Data | Sig Sauer Easy6 BDX 1-6x24 | Optics TradeOptics-Trade
 
TAM Sports_IPL 17 Till Match 37_Celebrity Endorsement _Report.pdf
TAM Sports_IPL 17 Till Match 37_Celebrity Endorsement _Report.pdfTAM Sports_IPL 17 Till Match 37_Celebrity Endorsement _Report.pdf
TAM Sports_IPL 17 Till Match 37_Celebrity Endorsement _Report.pdfSocial Samosa
 
Atlanta Dream Exec Dan Gadd on Driving Fan Engagement and Growth, Serving the...
Atlanta Dream Exec Dan Gadd on Driving Fan Engagement and Growth, Serving the...Atlanta Dream Exec Dan Gadd on Driving Fan Engagement and Growth, Serving the...
Atlanta Dream Exec Dan Gadd on Driving Fan Engagement and Growth, Serving the...Neil Horowitz
 
大学假文凭《原版英国Imperial文凭》帝国理工学院毕业证制作成绩单修改
大学假文凭《原版英国Imperial文凭》帝国理工学院毕业证制作成绩单修改大学假文凭《原版英国Imperial文凭》帝国理工学院毕业证制作成绩单修改
大学假文凭《原版英国Imperial文凭》帝国理工学院毕业证制作成绩单修改atducpo
 
08448380779 Call Girls In International Airport Women Seeking Men
08448380779 Call Girls In International Airport Women Seeking Men08448380779 Call Girls In International Airport Women Seeking Men
08448380779 Call Girls In International Airport Women Seeking MenDelhi Call girls
 
Croatia vs Italy Euro Cup 2024 Three pitfalls for Spalletti’s Italy in Group ...
Croatia vs Italy Euro Cup 2024 Three pitfalls for Spalletti’s Italy in Group ...Croatia vs Italy Euro Cup 2024 Three pitfalls for Spalletti’s Italy in Group ...
Croatia vs Italy Euro Cup 2024 Three pitfalls for Spalletti’s Italy in Group ...Eticketing.co
 
Albania Vs Spain Albania is Loaded with Defensive Talent on their Roster.docx
Albania Vs Spain Albania is Loaded with Defensive Talent on their Roster.docxAlbania Vs Spain Albania is Loaded with Defensive Talent on their Roster.docx
Albania Vs Spain Albania is Loaded with Defensive Talent on their Roster.docxWorld Wide Tickets And Hospitality
 
CALL ON ➥8923113531 🔝Call Girls Saharaganj Lucknow best Female service 🦺
CALL ON ➥8923113531 🔝Call Girls Saharaganj Lucknow best Female service  🦺CALL ON ➥8923113531 🔝Call Girls Saharaganj Lucknow best Female service  🦺
CALL ON ➥8923113531 🔝Call Girls Saharaganj Lucknow best Female service 🦺anilsa9823
 
Jankipuram / Call Girls Lucknow | Whatsapp No 🫗 8923113531 🎳 VIP Escorts Serv...
Jankipuram / Call Girls Lucknow | Whatsapp No 🫗 8923113531 🎳 VIP Escorts Serv...Jankipuram / Call Girls Lucknow | Whatsapp No 🫗 8923113531 🎳 VIP Escorts Serv...
Jankipuram / Call Girls Lucknow | Whatsapp No 🫗 8923113531 🎳 VIP Escorts Serv...gurkirankumar98700
 
Who Is Emmanuel Katto Uganda? His Career, personal life etc.
Who Is Emmanuel Katto Uganda? His Career, personal life etc.Who Is Emmanuel Katto Uganda? His Career, personal life etc.
Who Is Emmanuel Katto Uganda? His Career, personal life etc.Marina Costa
 
🔝|97111༒99012🔝 Call Girls In {Delhi} Cr Park ₹5.5k Cash Payment With Room De...
🔝|97111༒99012🔝 Call Girls In  {Delhi} Cr Park ₹5.5k Cash Payment With Room De...🔝|97111༒99012🔝 Call Girls In  {Delhi} Cr Park ₹5.5k Cash Payment With Room De...
🔝|97111༒99012🔝 Call Girls In {Delhi} Cr Park ₹5.5k Cash Payment With Room De...Diya Sharma
 
大学学位办理《原版美国USD学位证书》圣地亚哥大学毕业证制作成绩单修改
大学学位办理《原版美国USD学位证书》圣地亚哥大学毕业证制作成绩单修改大学学位办理《原版美国USD学位证书》圣地亚哥大学毕业证制作成绩单修改
大学学位办理《原版美国USD学位证书》圣地亚哥大学毕业证制作成绩单修改atducpo
 
Slovenia Vs Serbia UEFA Euro 2024 Fixture Guide Every Fixture Detailed.docx
Slovenia Vs Serbia UEFA Euro 2024 Fixture Guide Every Fixture Detailed.docxSlovenia Vs Serbia UEFA Euro 2024 Fixture Guide Every Fixture Detailed.docx
Slovenia Vs Serbia UEFA Euro 2024 Fixture Guide Every Fixture Detailed.docxWorld Wide Tickets And Hospitality
 
08448380779 Call Girls In IIT Women Seeking Men
08448380779 Call Girls In IIT Women Seeking Men08448380779 Call Girls In IIT Women Seeking Men
08448380779 Call Girls In IIT Women Seeking MenDelhi Call girls
 
Top Call Girls In Jankipuram ( Lucknow ) 🔝 8923113531 🔝 Cash Payment
Top Call Girls In Jankipuram ( Lucknow  ) 🔝 8923113531 🔝  Cash PaymentTop Call Girls In Jankipuram ( Lucknow  ) 🔝 8923113531 🔝  Cash Payment
Top Call Girls In Jankipuram ( Lucknow ) 🔝 8923113531 🔝 Cash Paymentanilsa9823
 

Recently uploaded (20)

Spain Vs Italy 20 players confirmed for Spain's Euro 2024 squad, and three po...
Spain Vs Italy 20 players confirmed for Spain's Euro 2024 squad, and three po...Spain Vs Italy 20 players confirmed for Spain's Euro 2024 squad, and three po...
Spain Vs Italy 20 players confirmed for Spain's Euro 2024 squad, and three po...
 
Technical Data | Sig Sauer Easy6 BDX 1-6x24 | Optics Trade
Technical Data | Sig Sauer Easy6 BDX 1-6x24 | Optics TradeTechnical Data | Sig Sauer Easy6 BDX 1-6x24 | Optics Trade
Technical Data | Sig Sauer Easy6 BDX 1-6x24 | Optics Trade
 
TAM Sports_IPL 17 Till Match 37_Celebrity Endorsement _Report.pdf
TAM Sports_IPL 17 Till Match 37_Celebrity Endorsement _Report.pdfTAM Sports_IPL 17 Till Match 37_Celebrity Endorsement _Report.pdf
TAM Sports_IPL 17 Till Match 37_Celebrity Endorsement _Report.pdf
 
Atlanta Dream Exec Dan Gadd on Driving Fan Engagement and Growth, Serving the...
Atlanta Dream Exec Dan Gadd on Driving Fan Engagement and Growth, Serving the...Atlanta Dream Exec Dan Gadd on Driving Fan Engagement and Growth, Serving the...
Atlanta Dream Exec Dan Gadd on Driving Fan Engagement and Growth, Serving the...
 
大学假文凭《原版英国Imperial文凭》帝国理工学院毕业证制作成绩单修改
大学假文凭《原版英国Imperial文凭》帝国理工学院毕业证制作成绩单修改大学假文凭《原版英国Imperial文凭》帝国理工学院毕业证制作成绩单修改
大学假文凭《原版英国Imperial文凭》帝国理工学院毕业证制作成绩单修改
 
08448380779 Call Girls In International Airport Women Seeking Men
08448380779 Call Girls In International Airport Women Seeking Men08448380779 Call Girls In International Airport Women Seeking Men
08448380779 Call Girls In International Airport Women Seeking Men
 
Croatia vs Italy Euro Cup 2024 Three pitfalls for Spalletti’s Italy in Group ...
Croatia vs Italy Euro Cup 2024 Three pitfalls for Spalletti’s Italy in Group ...Croatia vs Italy Euro Cup 2024 Three pitfalls for Spalletti’s Italy in Group ...
Croatia vs Italy Euro Cup 2024 Three pitfalls for Spalletti’s Italy in Group ...
 
Call Girls Service Noida Extension @9999965857 Delhi 🫦 No Advance VVIP 🍎 SER...
Call Girls Service Noida Extension @9999965857 Delhi 🫦 No Advance  VVIP 🍎 SER...Call Girls Service Noida Extension @9999965857 Delhi 🫦 No Advance  VVIP 🍎 SER...
Call Girls Service Noida Extension @9999965857 Delhi 🫦 No Advance VVIP 🍎 SER...
 
Albania Vs Spain Albania is Loaded with Defensive Talent on their Roster.docx
Albania Vs Spain Albania is Loaded with Defensive Talent on their Roster.docxAlbania Vs Spain Albania is Loaded with Defensive Talent on their Roster.docx
Albania Vs Spain Albania is Loaded with Defensive Talent on their Roster.docx
 
Call Girls In RK Puram 📱 9999965857 🤩 Delhi 🫦 HOT AND SEXY VVIP 🍎 SERVICE
Call Girls In RK Puram 📱  9999965857  🤩 Delhi 🫦 HOT AND SEXY VVIP 🍎 SERVICECall Girls In RK Puram 📱  9999965857  🤩 Delhi 🫦 HOT AND SEXY VVIP 🍎 SERVICE
Call Girls In RK Puram 📱 9999965857 🤩 Delhi 🫦 HOT AND SEXY VVIP 🍎 SERVICE
 
CALL ON ➥8923113531 🔝Call Girls Saharaganj Lucknow best Female service 🦺
CALL ON ➥8923113531 🔝Call Girls Saharaganj Lucknow best Female service  🦺CALL ON ➥8923113531 🔝Call Girls Saharaganj Lucknow best Female service  🦺
CALL ON ➥8923113531 🔝Call Girls Saharaganj Lucknow best Female service 🦺
 
Call Girls 🫤 Paharganj ➡️ 9999965857 ➡️ Delhi 🫦 Russian Escorts FULL ENJOY
Call Girls 🫤 Paharganj ➡️ 9999965857  ➡️ Delhi 🫦  Russian Escorts FULL ENJOYCall Girls 🫤 Paharganj ➡️ 9999965857  ➡️ Delhi 🫦  Russian Escorts FULL ENJOY
Call Girls 🫤 Paharganj ➡️ 9999965857 ➡️ Delhi 🫦 Russian Escorts FULL ENJOY
 
Jankipuram / Call Girls Lucknow | Whatsapp No 🫗 8923113531 🎳 VIP Escorts Serv...
Jankipuram / Call Girls Lucknow | Whatsapp No 🫗 8923113531 🎳 VIP Escorts Serv...Jankipuram / Call Girls Lucknow | Whatsapp No 🫗 8923113531 🎳 VIP Escorts Serv...
Jankipuram / Call Girls Lucknow | Whatsapp No 🫗 8923113531 🎳 VIP Escorts Serv...
 
Who Is Emmanuel Katto Uganda? His Career, personal life etc.
Who Is Emmanuel Katto Uganda? His Career, personal life etc.Who Is Emmanuel Katto Uganda? His Career, personal life etc.
Who Is Emmanuel Katto Uganda? His Career, personal life etc.
 
🔝|97111༒99012🔝 Call Girls In {Delhi} Cr Park ₹5.5k Cash Payment With Room De...
🔝|97111༒99012🔝 Call Girls In  {Delhi} Cr Park ₹5.5k Cash Payment With Room De...🔝|97111༒99012🔝 Call Girls In  {Delhi} Cr Park ₹5.5k Cash Payment With Room De...
🔝|97111༒99012🔝 Call Girls In {Delhi} Cr Park ₹5.5k Cash Payment With Room De...
 
大学学位办理《原版美国USD学位证书》圣地亚哥大学毕业证制作成绩单修改
大学学位办理《原版美国USD学位证书》圣地亚哥大学毕业证制作成绩单修改大学学位办理《原版美国USD学位证书》圣地亚哥大学毕业证制作成绩单修改
大学学位办理《原版美国USD学位证书》圣地亚哥大学毕业证制作成绩单修改
 
Slovenia Vs Serbia UEFA Euro 2024 Fixture Guide Every Fixture Detailed.docx
Slovenia Vs Serbia UEFA Euro 2024 Fixture Guide Every Fixture Detailed.docxSlovenia Vs Serbia UEFA Euro 2024 Fixture Guide Every Fixture Detailed.docx
Slovenia Vs Serbia UEFA Euro 2024 Fixture Guide Every Fixture Detailed.docx
 
08448380779 Call Girls In IIT Women Seeking Men
08448380779 Call Girls In IIT Women Seeking Men08448380779 Call Girls In IIT Women Seeking Men
08448380779 Call Girls In IIT Women Seeking Men
 
Top Call Girls In Jankipuram ( Lucknow ) 🔝 8923113531 🔝 Cash Payment
Top Call Girls In Jankipuram ( Lucknow  ) 🔝 8923113531 🔝  Cash PaymentTop Call Girls In Jankipuram ( Lucknow  ) 🔝 8923113531 🔝  Cash Payment
Top Call Girls In Jankipuram ( Lucknow ) 🔝 8923113531 🔝 Cash Payment
 
Call Girls 🫤 Malviya Nagar ➡️ 9999965857 ➡️ Delhi 🫦 Russian Escorts FULL ENJOY
Call Girls 🫤 Malviya Nagar ➡️ 9999965857  ➡️ Delhi 🫦  Russian Escorts FULL ENJOYCall Girls 🫤 Malviya Nagar ➡️ 9999965857  ➡️ Delhi 🫦  Russian Escorts FULL ENJOY
Call Girls 🫤 Malviya Nagar ➡️ 9999965857 ➡️ Delhi 🫦 Russian Escorts FULL ENJOY
 

Hadoop Business Cases

  • 1. Hadoop Business Cases Dell | Hadoop White Paper By Joey Jablonski Dell | Hadoop White Paper Series
  • 2. Dell | Hadoop White Paper Series: Hadoop Business Cases Table of Contents Hadoop brings new capabilities 3 Management of the data fire hose 5 Hadoop enters the enterprise 5 Analytics 6 Risk modeling 6 Hadoop ecosystem 6 Hadoop futures 7 About the author 7 Special thanks 7 About Dell Next Generation Computing Solutions 7 References 8 To learn more 8 This White Paper is for informational purposes only, and may contain typographical errors and technical inaccuracies. The content is provided as is, without express or implied warranties of any kind. © 2011 Dell Inc. All rights reserved. Reproduction of this material in any manner whatsoever without the express written permission of Dell Inc. is strictly forbidden. For more information, contact Dell. Dell, the Dell logo, and the Dell badge, and PowerEdge are trademarks of Dell Inc. 2
  • 3. Dell | Hadoop White Paper Series: Hadoop Business Cases Hadoop brings new capabilities Growing data volumes and interconnected systems create a need for a tool capable of building the next generation of analytics and data management solutions. Hadoop provides a framework for your company to analyze and manage growing volumes of data while storing data longer than previously possible at a competitive price point. By extending the life of data and not discarding it, you can enable staff to review historic data in new ways and analyze it as new methods emerge. The Hadoop taxonomy is outlined in Figure 1, showing the components common to all Hadoop environments. These components are part of the core Apache Hadoop project. The Hadoop architecture is very pluggable, allowing any component to be replaced with one optimized for a specific workload, while allowing a large variety of data presentation layers to utilize the data stored in Hadoop. The vertical bars on the right designate components not included as part of a default Hadoop distrobution; these components are commonly provided by IT providers to enhance their Hadoop offerings. Figure 1. Core components of a Hadoop deployment In addition to the core Hadoop components shown in Figure 1, a variety of projects have developed as part of the Hadoop ecosystem to provide specific solutions for using data within Hadoop in common ways. Many projects have evolved for storing and processing specific types of data within Hadoop, allowing many industries to create specific solutions built on a common storage and compute engine within Hadoop. 3
  • 4. Dell | Hadoop White Paper Series: Hadoop Business Cases Figure 2. The core Hadoop ecosystem with additional tools for data presentation Components of the Hadoop ecosystem can be built on one or more of the three primary Hadoop use cases: Compute Storage Database Hadoop is commonly used as a One primary component of the The Hadoop ecosystem contains distributed compute platform for Hadoop ecosystem is the Hadoop components that allow the data analyzing or processing large Distributed File System (HDFS). The within the HDFS to be presented amounts of data. The Hadoop HDFS allows users to have a single in a SQL interface. This allows the ecosystem provides APIs addressable namespace, spread use of standard functions, necessary to distribute and track across many hundreds or thousands including INSERT, SELECT, and workloads as they are run on large of servers, creating a single large file UPDATE of data within the numbers of distributed machines. system. Hadoop manages the Hadoop environment, with replication of the data within this file minimal code changes to existing system to ensure hardware failures do applications. These components not lead to data loss. Many allow developers to quickly organizations will use this scalable file access the data stored within a system as a place to store large Hadoop environment with tools amounts of data that is then accessed they are experienced at using. by jobs run within Hadoop or by external systems. Hadoop provides a consistent, scalable base of tools within an organization for storing, managing, and analyzing data, without being tied to any specific department or framework. Hadoop enables your organization to use a single set of data 4
  • 5. Dell | Hadoop White Paper Series: Hadoop Business Cases for all departments’ reporting, analysis, and research needs. This single source enables better quality results and eliminates the cost and complexity of managing multiple islands of data. Business is changing quickly; the goals of any individual tool today may not be the same tomorrow. The same goes for organizations and their areas of focus within a large corporation. Making the decision about what data to discard in the current floods of data most companies are experiencing is a difficult challenge. Hadoop enables your company to store more data, with less overhead than ever before. This enables your staff to ask questions of that data and analyze it in new ways later that are not even thought of today. Management of the data fire hose The evolving community around “big data” (the industry term for environments containing large volumes of related but un-structured data) finds new ways for analyzing and managing growing volumes of data. We are also exploring the creation of new ways for making sense of otherwise large piles of previously Decisions misunderstood data sets. On any given day, most companies do not know what questions to ask of certain data. When this has occurred in the past, companies would purge that data because of the cost of storing data with indeterminate value. Today, Questions companies exploit tools like Hadoop for storing that data for much longer periods of time, often until such time that staff find new ways to understand how the data can be used and what questions can be asked of the data. Data Today, data is as valuable as any software written by a company or any product it designs. The data is the component that drives next-generation products and enables maximum revenue attainment from existing products. Hadoop provides a low-barrier-to- Figure 3. Questions bring out the value of data. entry solution for storing the additional data being created by today’s companies. Hadoop enters the enterprise Hadoop is rarely initially deployed as a company-wide data analytics solution; more often, Hadoop is deployed by a single department or organization that sees it as a solution to certain challenges. Hadoop inevitably is then used by more and more departments, becoming a more critical piece of the corporation’s storage and analytics solutions. Hadoop deployments commonly start with a smaller deployment within a virtual environment; this could be virtual machines hosted on premise or in a public cloud environment. This method enables your IT staff to learn about managing Hadoop and enables your developers to begin testing ideas they have about uses of Hadoop. This use of virtual infrastructure will usually stop as soon as real workloads are tested, and this usually signifies a move to physical hardware dedicated to Hadoop. This change is primarily driven by data volumes and performance needs. At a certain inflection point, moving data to a public cloud becomes too time-consuming, so companies look to internally hosted and managed Hadoop solutions. It is important to understand the evolution of Hadoop in your environment to ensure that you adequatly plan each stage of the evolution. Hadoop can rapidly become a large, complex component of your information technology (IT) department. By understanding how Hadoop commonly evolves, you can better manage that evolution in your environment and ensure Hadoop meets your company’s needs, without causing an undue operations burden. 5
  • 6. Dell | Hadoop White Paper Series: Hadoop Business Cases Analytics Analytics are becoming a more critical component in all business environments. Analytics are being used to provide near real-time reporting on the state of a business, allowing leaders to make rapid decisions to correct the course of an organization or to capitalize on the needs of the market. The emerging market of tools for analytics allows companies to manipulate the raw data they get from a variety of sources and make intelligent decisions about the state of the business. Many marketing and sales-focused organizations are now using Hadoop as the core of their analytics programs. Hadoop is used to store a central copy of customer data and product usage information, allowing those developing pricing models and sales models to refine the data in new ways, looking for new relationships. These analytics allow the analysts to look for new relationships, not previously possible with traditional, separate relational database-driven data warehouse environments. Another example of using analytics to minimize operational expenses is in IT. By leveraging the hyperscale compute and storage capabilities of Hadoop, your IT personnel can optimize system reporting, analyze system performance versus operational expenses, detect potential cases for system failures, and minimize system downtime. Your CIO and IT managers can analyze the most optimal operational models, determine operational inflection points, and plan the next budget cycle. Risk modeling Many financial services firms are beginning to use Hadoop for risk modeling. Hadoop provides a base for storing and processing large amounts of data, enabling firms to focus on algorithm development and optimization. Hadoop enables companies to avoid the difficulty in massively parallel programming, while exploiting the capabilitie s provided by commodity hardware and software. By using Hadoop to enable your company’s risk modeling projects, data from many different sources can be pulled into a single location and modeled by a single set of algorithms. A traditionally large company required risk modeling to occur at business unit or departmental levels. This modeling was commonly was done in different ways by the different financial analyst teams. Hadoop enables a single, companywide team to model a company’s exposure to risk and understand what dynamics are at play against that risk position. Figure 4. Hadoop enables a single, companywide team to model exposure to risk. Hadoop ecosystem The Hadoop ecosystem is a rapidly growing and evolving set of tools for Hadoop operations and tools specific to verticals and uses for Hadoop. The Hadoop ecosystem contains many tools specific to operational use cases and the manipulation of specific types of data. This large ecosystem makes Hadoop a strong platform for companies as they evaluate and grow their analytics or business intelligence environments. Some of the most common tools within the Hadoop ecosystem for supporting scale-out environments include Flume, Sqoop, and Zookeeper. Flume is a commonly used tool within the Hadoop ecosystem for handling streaming data. Flume provides a framework for agents on one or many servers to collect events and store them in a single HDFS namespace. Flume also provides the necessary frameworks for developing work streams for processing those events, reporting on them, and taking action on them. Sqoop is a component within the Hadoop ecosystem for enabling connectivity between Hadoop environments and traditional SQL environments, including relational databases and data warehouses. Sqoop enables automated processes to be developed for moving data between Hadoop and data warehouses, enabling data warehouses to have access to 6
  • 7. Dell | Hadoop White Paper Series: Hadoop Business Cases large amounts of data traditionally stored in other environments or not available at all to business intelligence d evelopers and analysts. Zookeeper is a component commonly used by applications that exploit data stored in the HDFS. Zookeeper provides a framework for managing distributed applications and the locks between them for consistent data access, providing naming services, and providing synchronization between separate servers and processes that are part of a single, larger application. Hadoop futures Most organizations have used specialized teams for business intelligence development and exploitation of a compan y’s data. Hadoop enables that functionality to be pushed to a larger group of staff within the organization. Hadoop provides a single unified interface and data store for many staff across all departments to use when analyzing company statistics and developing new methods for success in a market. Hadoop empowers all your employees to think of new ways to improve the bottom line and allows them access to the necessary information to test their theories, develop strategies, and report on changes in the business. Hadoop provides the base software and associated ecosystem to manage growing amounts of data. Hadoop enables your company to store more data than ever before and provide it to a larger portion of the staff for analysis both today and tomorrow. Hadoop can be used to enable near real-time decision making by your company leadership and allow your staff to test new ideas and analyze data in new ways. About the author Joey Jablonski is a principal solution architect with Dell’s Data Center Solutions team. Joey works to define and implement Dell’s solutions for Big Data, including solutions based on Apache Hadoop. Joey has spent more than 10 years working in high performance computing, with an emphasis on interconnects, including Infiniband and parallel fi le systems. Joey has led technical solution design and implementation at Sun Microsystems and Hewlett-Packard, as well as consulted for customers, including Sandia National Laboratories, BP, ExxonMobil, E*Trade, Juelich Supercomputing Centre, and Clumeq. Special thanks The author extends special thanks to:  Rob Hirschfeld, Principal Cloud Solutions Architect, Dell  Aurelian Dumitru, Principal Cloud Solutions Architect, Dell  John Igoe, Executive Director, Next Generation Computing Solutions, Dell About Dell Next Generation Computing Solutions When cloud computing is the core of your business and its efficiency and vitality underpin your success, the Dell Next Generation Computing Solutions are Dell’s response to your unique needs. We understand your challeng es—from compute and power density to global scaling and environmental impact. Dell has the knowledge and expertise to tune your company’s “factory” for maximum performance and efficiency. Dell’s Next Generation Computing Solutions provide operational models backed by unique product solutions to meet the needs of companies at all stages of their lifecycle. Solutions are designed to meet the needs of small startups while allowing scalability as your company grows. Deployment and support are tailored to your unique operational requirements. Dell’s Cloud Computing Solutions can help you minimize the tangible operating costs that have hyper-scale impact on your business results. 7
  • 8. Dell | Hadoop White Paper Series: Hadoop Business Cases References Big Data http://en.wikipedia.org/wiki/Big_data Hadoop http://hadoop.apache.org/ Public Cloud http://en.wikipedia.org/wiki/Cloud_computing Flume https://github.com/cloudera/flume HBase http://wiki.apache.org/hadoop/Hbase?action=show&redirect=HBase Zookeeper http://wiki.apache.org/hadoop/ZooKeeper To learn more To learn more about Dell cloud solutions, contact your Dell representative or visit: www.dell.com/cloud ©2011 Dell Inc. All rights reserved. Trademarks and trade names may be used in this document to refer to either the entities claiming the marks and names or their products. Specifications are correct at date of publication but are subject to availability or ch ange without notice at any time. Dell and its affiliates cannot be responsible for errors or omissions in typography or photography. Dell’s Terms and Conditions of Sales and Service apply and are available on request. Dell service offerings do not affect consumer’s statutory rights. Dell, the DELL logo, and the DELL badge, PowerConnect, and PowerVault are trademarks of Dell Inc. 8