"Big Data Use Cases" was presented to Lansing BigData and Hadoop Users Group Kickoff meeting on 2/24/2015 by Vijay Mandava and Lan Jiang. The demo was built on top of CDH 5.3, HDP 2.2 and AWS cloud
Extract business value by analyzing large volumes of multi-structured data from various sources such as databases, websites, blogs, social media, smart sensors...
How to get started in Big Data without Big Costs - StampedeCon 2016 (StampedeCon)
Looking to implement Hadoop but haven’t pulled the trigger yet? You are not alone. Many companies have heard the hype about how Hadoop can solve the challenges presented by big data, but few have actually implemented it. What’s preventing them from taking the plunge? Can it be done in small steps to ensure project success?
This session will discuss some of the items to consider when getting started with Hadoop and how to go about making the decision to move to the de facto big data platform. Starting small can be a good approach when your company is learning the basics and deciding what direction to take. There is no need to invest large amounts of time and money up front if a proof of concept is all you aim to provide. Using well known data sets on virtual machines can provide a low cost and effort implementation to know if your big data journey will be successful with Hadoop.
Webinar | Real-time Analytics for Healthcare: How Amara Turned Big Data into ... (DataStax)
Increasing regulations on patient data, expanding and ever-changing data volumes and formats, and the need for real-time analytics are adding new levels of complexity to database platforms, forcing Healthcare IT management to rethink legacy database environments.
Join Christopher Rosin, Ph.D., Chief Scientist at Amara Health Analytics, as he shares his knowledge about implementing a real-time predictive analytics platform to support clinicians in the early detection of critical disease states. Based on years of research and hands-on experience, Chris provides practical steps for guiding DataStax Enterprise initiatives from evaluation to successful implementation.
Watch to learn:
- The challenges in selecting the right database technology for dynamic, real-time data without the rigidity of relational systems
- How Amara’s SaaS model delivers real-time decision support, leveraging large amounts of unstructured and structured clinical data
- How DataStax Enterprise helps meet strict requirements on patient data privacy, data integrity, and system performance
"Big Data Use Cases" was presented to Lansing BigData and Hadoop Users Group Kickoff meeting on 2/24/2015 by Vijay Mandava and Lan Jiang. The demo was built on top of CDH 5.3, HDP 2.2 and AWS cloud
Extract business value by analyzing large volumes of multi-structured data from various sources such as databases, websites, blogs, social media, smart sensors...
How to get started in Big Data without Big Costs - StampedeCon 2016StampedeCon
Looking to implement Hadoop but haven’t pulled the trigger yet? You are not alone. Many companies have heard the hype about how Hadoop can solve the challenges presented by big data, but few have actually implemented it. What’s preventing them from taking the plunge? Can it be done in small steps to ensure project success?
This session will discuss some of the items to consider when getting started with Hadoop and how to go about making the decision to move to the de facto big data platform. Starting small can be a good approach when your company is learning the basics and deciding what direction to take. There is no need to invest large amounts of time and money up front if a proof of concept is all you aim to provide. Using well known data sets on virtual machines can provide a low cost and effort implementation to know if your big data journey will be successful with Hadoop.
Webinar | Real-time Analytics for Healthcare: How Amara Turned Big Data into ...DataStax
Increasing regulations on patient data, expanding and ever-changing data volumes and formats, and the need for real-time analytics are adding new levels of complexity to database platforms, forcing Healthcare IT management to rethink legacy database environments.
Join Christopher Rosin, Ph.D., Chief Scientist at Amara Health Analytics, as he shares his knowledge about implementing a real-time predictive analytics platform to support clinicians in the early detection of critical disease states. Based on years of research and hands-on experience, Chris provides practical steps for guiding DataStax Enterprise initiatives from evaluation to successful implementation.
Watch to learn:
- The challenges in selecting the right database technology for dynamic, real-time data without the rigidity of relational systems
- How Amara’s SaaS model delivers real-time decision support, leveraging large amounts of unstructured and structured clinical data
- How DataStax Enterprise helps meet strict requirements on patient data privacy, data integrity, and system performance
AzureDay - Introduction to Big Data Analytics (Łukasz Grala)
AzureDay North 2016, a conference about cloud solutions.
What is analytics? What is big data? Why is big data in the cloud? What does Microsoft offer for Big Data Analytics? How do you get started with Big Data Analytics or Advanced Analytics? This session introduces the fundamentals of Big Data and Advanced Analytics.
By Data Scientist as a Service
Rethink Analytics with an Enterprise Data Hub (Cloudera, Inc.)
Have you run into one or more of the following barriers or limitations with your existing data warehousing architecture:
> Increasingly high data storage and/or processing costs?
> Silos of data sources?
> Complexity of management and security?
> Lack of analytics agility?
zData Inc. Big Data Consulting and Services - Overview and Summary (zData Inc.)
This slide deck summarizes zData Inc., a leading Big Data consulting and services provider. zData focuses on commercial and enterprise corporations, employing experts in all areas of the field, from software engineers to data scientists. They work with top hardware and software providers for on-site and off-site consulting, managed services, training, and long-term scalable data solutions.
Kyvos Insights is unlocking the power of Big Data analytics with “OLAP on Hadoop” technology.
Kyvos is a solution which brings a new model of online analytical processing (OLAP) to Big Data that allows users to visually create and analyze cubes on Hadoop. This technology enables users to easily derive valuable insights for better, more informed business decisions through previously unattainable levels of scalability and interactivity.
The Big Data Journey – How Companies Adopt Hadoop - StampedeCon 2016 (StampedeCon)
Hadoop adoption is a journey. Depending on the business, the process can take weeks, months, or even years. Hadoop is a transformative technology, so the challenges have less to do with the technology itself and more to do with how a company adapts to a new way of thinking about data. Companies that have run application-driven businesses for the last two decades face real challenges in suddenly becoming data driven. They need to begin thinking less in terms of single, siloed servers and more about “the cluster”.
The cluster becomes the center of data gravity, drawing all the applications to it. Companies, especially their IT organizations, embark on a process of understanding how to maintain and operationalize this environment and provide the data lake as a service to the business. They must empower the business by providing resources for the use cases that drive both renovation and innovation. IT needs to adopt new technologies and new methodologies that enable these solutions. This is not technology for technology's sake: Hadoop is a data platform servicing and enabling all facets of an organization. Building out and expanding this platform is the ongoing journey, as word gets out to the business that it can have any data it wants, at any time. Success is what drives the journey.
The length of the journey varies from company to company. Sometimes the challenges come down to the size of the company, but more often they come down to the difficulty of unseating established IT processes that companies have adopted without forethought over the past two decades. Companies must sift through the noise to find the solutions that bring real value, and that takes time. As the platform matures and becomes mainstream, more and more companies are finding it easier to adopt Hadoop. Hundreds of companies have already taken many steps; hundreds more have already taken the first step. As the wave of successful Hadoop adoption continues, more and more companies will see the value in starting the journey and paving the way for others.
Building a scalable analytics environment to support diverse workloads (Alluxio, Inc.)
Data Orchestration Summit 2020 organized by Alluxio
https://www.alluxio.io/data-orchestration-summit-2020/
Building a scalable analytics environment to support diverse workloads
Tom Panozzo, Chief Technology Officer (Aunalytics)
About Alluxio: alluxio.io
Engage with the open source community on slack: alluxio.io/slack
This webinar follows the process of evaluating different big data platforms based on varying use cases and business requirements, and explains how big data professionals can choose the right technology to transform their business. During this session, Ooyala CTO Sean Knapp will discuss why Ooyala selected DataStax as the big data platform powering their business, and how they provide real-time video analytics that help media companies create deeply personalized viewing experiences for more than 1/4 of all Internet video viewers each month.
Big Data, IoT, data lake, unstructured data, Hadoop, cloud, and massively parallel processing (MPP) are all just fancy words unless you can find use cases for all this technology. Join me as I talk about the many use cases I have seen, from streaming data to advanced analytics, broken down by industry. I'll show you how all this technology fits together by discussing various architectures and the most common approaches to solving data problems, and hopefully set off light bulbs in your head on how big data can help your organization make better business decisions.
DesignMind is a technology consulting firm that develops Database, Business Intelligence, and Big Data solutions in San Francisco, Silicon Valley, and throughout the U.S.
Hadoop and the Data Warehouse: When to Use Which (DataWorks Summit)
In recent years, Apache™ Hadoop® has emerged from humble beginnings to disrupt the traditional disciplines of information management. As with all technology innovation, hype is rampant, and data professionals are easily overwhelmed by diverse opinions and confusing messages.
Even seasoned practitioners sometimes miss the point, claiming for example that Hadoop replaces relational databases and is becoming the new data warehouse. It is easy to see where these claims originate, since both Hadoop and Teradata® systems run in parallel, scale up to enormous data volumes, and have shared-nothing architectures. At a conceptual level it is easy to think they are interchangeable, but the differences overwhelm the similarities. This session will shed light on the differences and help architects, engineering executives, and data scientists identify when to deploy Hadoop and when it is best to use an MPP relational database in a data warehouse, discovery platform, or other workload-specific applications.
Two of the most trusted experts in their fields, Steve Wooledge, VP of Product Marketing at Teradata, and Jim Walker of Hortonworks, will examine how big data technologies are being used today by practical big data practitioners.
The ability to effectively analyze this kind of information is now seen as a key competitive advantage for better-informed decisions. To do so, organizations apply Sentiment Analysis (SA) techniques to these data. However, the usage of social media around the world is ever-increasing, which considerably accelerates massive data generation and leaves traditional SA systems unable to deliver useful insights. Such volumes of data can be efficiently analyzed by combining SA techniques with Big Data technologies. In fact, big data is not a luxury but an essential necessity for making valuable predictions. However, there are challenges associated with big data, such as quality, that can strongly affect the accuracy of SA systems that consume huge volumes of data. The quality aspect should therefore be addressed in order to build reliable and credible systems. The goal of our research work is thus to consider Big Data Quality Metrics (BDQM) in SA systems that rely on big data. In this paper, we first highlight the most relevant BDQM that should be considered throughout the Big Data Value Chain (BDVC) in any big data project. Then, we measure the impact of BDQM on the accuracy of a novel SA method in a real case study and present simulation results.
This presentation contains a broad introduction to big data and its technologies.
Big data is a term that describes the large volume of data – both structured and unstructured – that inundates a business on a day-to-day basis.
Big Data is a phrase used to mean a massive volume of both structured and unstructured data that is so large it is difficult to process using traditional database and software techniques. In most enterprise scenarios the volume of data is too big or it moves too fast or it exceeds current processing capacity.
Hitachi Data Systems Hadoop Solution. Customers are seeing exponential growth of unstructured data, from their social media websites to operational sources. Their enterprise data warehouses are not designed to handle such high volumes and varieties of data. Hadoop, a software platform that scales to process massive volumes of unstructured and semi-structured data by distributing the workload across clusters of servers, gives customers a new option for tackling data growth and deploying big data analysis to better understand their business. Hitachi Data Systems is launching its latest Hadoop reference architecture, pre-tested with the Cloudera Hadoop distribution to provide faster time to market for customers deploying Hadoop applications. HDS, Cloudera, and Hitachi Consulting will present together and explain how to get you there. Attend this WebTech and learn how to:
- Solve big data problems with Hadoop.
- Deploy Hadoop in your data warehouse environment to better manage your unstructured and structured data.
- Implement Hadoop using the HDS Hadoop reference architecture.
For more information on the Hitachi Data Systems Hadoop Solution, please read our blog: http://blogs.hds.com/hdsblog/2012/07/a-series-on-hadoop-architecture.html
Big data analytics is the process of examining large data sets containing a variety of data types, i.e., big data, to uncover hidden patterns, unknown correlations, market trends, customer preferences, and other useful business information. The analytical findings can lead to more effective marketing, new revenue opportunities, better customer service, improved operational efficiency, competitive advantages over rival organizations, and other business benefits. Enterprises are increasingly looking for actionable insights into their data. Many big data projects originate from the need to answer specific business questions. With the right big data analytics platforms in place, an enterprise can boost sales, increase efficiency, and improve operations, customer service, and risk management. Notably, the business area getting the most attention relates to increasing efficiencies and optimizing operations. By using big data analytics you can extract only the relevant information from terabytes, petabytes, and exabytes, and analyze it to transform your business decisions for the future. Becoming proactive with big data analytics isn't a one-time endeavour; it is more of a culture change, a new way of gaining ground.
Keywords: business, analytics, exabytes, efficiency, data sets
A brief intro to what Big Data is and its potential. This is primarily a basic study, and I have quoted the sources of infographics, stats, and text at the end. If I have missed any reference due to human error and you recognize another source, please mention it.
Similar to What is Hadoop & its Use cases - PromptCloud
Big Data’s Potential for the Real Estate Industry: 2021 (PromptCloud)
Many real estate firms have long made decisions based on a combination of intuition and traditional, retrospective data. Today, a host of new variables make it possible to paint more vivid pictures of a location’s future risks and opportunities.
In this quickly technologizing industry, arming your team with the most robust data available and making important decisions based on that data will determine who wins and loses. Big data will become the key basis of competition and growth for individual firms, enhancing productivity and creating significant value for the world economy. In this white paper, we explore the real estate outlook for financial investment in 2021 and use cases demonstrating the power of data in transforming the real estate industry.
Looking for a tool similar to Octoparse? We have conducted thorough research on tools that can process web data to draw actionable insights. The results were amazing, as most of the web scraping tools available in the market offer unique value propositions for unique data requirements, differing from business to business. As you read further, you will be able to figure out the best Octoparse competitors and alternatives for your organizational data needs.
Most users turn to Octoparse to figure out how the market is functioning and to conduct data verification. However, conducting broad-level research might not always work for companies operating in a niche domain. There are a lot of tools available today offering value-added services, such as ease of use, value for money, better user ratings, and structured data delivery, that could be a great fit for your business requirements. But first, let's understand how Octoparse web scraping works.
How to Choose the Right Competitors & Alternatives of ParseHub Web Scraping Software?
Web scraping is generally used to understand marketplaces and gain visibility into the pricing structure of your competitors in the niche your company is invested in. Getting a fair understanding of various web scraping products and ParseHub competitors and alternatives will enable you to make informed decisions to grow your business. Read further to learn how these tools work, how they scale and deliver, who their target customers are, and where they fall short, and to take a look at companies offering data services according to industry, user rating, accessibility, deliverables, speed, interface, customer service, and technical challenges. But before we dive into this, let's understand what web scraping is and how to access the ParseHub Web Scraping Software.
Product Visibility - What Is Seen First, Will ... (PromptCloud)
Putting your products on multiple eCommerce websites may give you a broad reach, but might not be enough for them to be “visible”. Creating quality blogs or short videos on several themes could help you find a wider reach! You can partake in multiple activities, like:
Talk about the USP of your products or highlight the star products.
Share a comparison of your products with your competitors.
Discuss topics related to your products and the services you deliver. When users go to a product page, right after the images they look at the heading and the description. Take a product listed on Amazon as an example to see how both headings and descriptions can increase the sales of your products. Read the complaints users have with similar products. Decide on the size and quantity options that would suit your user base best. Understand the price point that is desired. And lo and behold, you will have increased your product visibility!
Data plays a vital role in the fashion industry. It is used to drive decisions and strategy that generate sales, build a better understanding of customers, and boost overall profit. Fashion designers and companies use data on a daily basis to run a successful fashion business. However, the data fashion designers commonly work with differs from the standard mathematical statistics usually associated with the term “data”. Hence, data is not usually associated with the word fashion.
But, today’s top fashion houses are deploying several ways to use emerging analytical technologies in fashion retail today. We explore how the modern fashion industry uses data.
Data Standardization with Web Data Integration (PromptCloud)
Before analyzing data aggregated from multiple sources, it is essential to first standardize the datasets. At PromptCloud, we put special emphasis on this process and understand that as a web crawling company, our solution must enable our clients to integrate data efficiently.
Zipcode-based price benchmarking for retailers (PromptCloud)
Here's our case study of a popular e-commerce platform based out of the United States, seeking data to be extracted from the web to enhance its pricing and product strategy.
Analyzing Positiveness in 160+ Holiday Songs (PromptCloud)
It is known that during any kind of celebration music is indispensable and the holiday season is no different. Since this time of the year brings positiveness, we decided to analyze the holiday songs to uncover some interesting insights related to musical features and positiveness in songs.
What a year 2018 has been for the data ecosystem! We believe the high-magnitude and rapid demand for alt-data (especially web data) from companies of various sizes across industries is a remarkable element of this year.
For PromptCloud, it has always been about moving the needle when it comes to democratization of web data access. We’re fortunate enough to have built a team that absolutely loves the ease of information flow offered by the internet and wants to share the same with the businesses across the globe.
We’re on a journey to make a dent in the alt-data space with laser-focused teams that are paranoid about the data quality delivered to our customers. In honor of our successful clients and their incredible growth powered by our talented data wizards, let’s spare a moment to celebrate PromptCloud’s year in review.
10 Mobile App Ideas that can be Fueled by Web Scraping (PromptCloud)
We discuss various applications of web crawling and alternative data to fuel 10 potential mobile apps. The ideas range from an AI-powered reverse image search engine to voice-of-customer analysis in the ecommerce domain.
How Web Scraping Can Help Affiliate Marketers (PromptCloud)
This presentation discusses how web scraping services can be deployed to acquire trending ecommerce product data for better conversion in affiliate marketing.
In this study, we analyze the reviews for the top 10 most expensive and least expensive hotels based out of London to compare various aspects of the rating and review text.
Opendatabay - Open Data Marketplace (Opendatabay)
Opendatabay.com unlocks the power of data for everyone. Open Data Marketplace fosters a collaborative hub for data enthusiasts to explore, share, and contribute to a vast collection of datasets.
The first ever open hub for data enthusiasts to collaborate and innovate: a platform to explore, share, and contribute to a vast collection of datasets. Through robust quality control and innovative technologies like blockchain verification, Opendatabay ensures the authenticity and reliability of datasets, empowering users to make data-driven decisions with confidence. It leverages cutting-edge AI technologies to enhance the data exploration, analysis, and discovery experience.
From intelligent search and recommendations to automated data productisation and quotation, Opendatabay's AI-driven features streamline the data workflow. Finding the data you need shouldn't be complex. Opendatabay simplifies the data acquisition process with an intuitive interface and robust search tools. Effortlessly explore, discover, and access the data you need, allowing you to focus on extracting valuable insights. Opendatabay also breaks new ground with dedicated, AI-generated synthetic datasets.
Leverage these privacy-preserving datasets for training and testing AI models without compromising sensitive information. Opendatabay prioritizes transparency by providing detailed metadata, provenance information, and usage guidelines for each dataset, ensuring users have a comprehensive understanding of the data they're working with. By leveraging a powerful combination of distributed ledger technology and rigorous third-party audits, Opendatabay ensures the authenticity and reliability of every dataset. Security is at the core of Opendatabay: the marketplace implements stringent security measures, including encryption, access controls, and regular vulnerability assessments, to safeguard your data and protect your privacy.
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Data and AI
https://www.meetup.com/unstructured-data-meetup-new-york/
This meetup is for people working in unstructured data. Speakers will present on related topics such as vector databases, LLMs, and managing data at scale. The intended audience includes roles like machine learning engineers, data scientists, data engineers, software engineers, and PMs. This meetup was formerly the Milvus Meetup, and is sponsored by Zilliz, maintainers of Milvus.
3. The Apache™ Hadoop® project
•Open-source software for reliable, scalable, distributed computing
•Allows distributed processing of large data sets across clusters of computers
Designed to:
•Scale up from single servers to thousands of machines, each offering local computation and storage
•Detect and handle failures at the application layer
•Deliver a highly available service on top of a cluster of computers
4. •Hadoop Common: the common utilities that support the other Hadoop modules
•Hadoop Distributed File System (HDFS) for high-throughput access to application data
•Hadoop YARN for job scheduling and cluster resource management
•Hadoop MapReduce for parallel processing of large data sets (a minimal example follows below)
All modules are designed assuming that hardware failures are common and should be handled by the framework
The Apache™ Hadoop® project
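To make the division of labor among these modules concrete, below is a minimal sketch of the classic WordCount job written against Hadoop's Java MapReduce API: HDFS holds the input and output files, YARN schedules the map and reduce tasks across the cluster, and MapReduce runs the counting in parallel. The input and output paths are taken from the command line and are purely illustrative.

import java.io.IOException;
import java.util.StringTokenizer;

import org.apache.hadoop.conf.Configuration;
import org.apache.hadoop.fs.Path;
import org.apache.hadoop.io.IntWritable;
import org.apache.hadoop.io.Text;
import org.apache.hadoop.mapreduce.Job;
import org.apache.hadoop.mapreduce.Mapper;
import org.apache.hadoop.mapreduce.Reducer;
import org.apache.hadoop.mapreduce.lib.input.FileInputFormat;
import org.apache.hadoop.mapreduce.lib.output.FileOutputFormat;

public class WordCount {

  // Mapper: for every token in an input line, emit (word, 1)
  public static class TokenizerMapper extends Mapper<Object, Text, Text, IntWritable> {
    private static final IntWritable ONE = new IntWritable(1);
    private final Text word = new Text();

    @Override
    public void map(Object key, Text value, Context context)
        throws IOException, InterruptedException {
      StringTokenizer itr = new StringTokenizer(value.toString());
      while (itr.hasMoreTokens()) {
        word.set(itr.nextToken());
        context.write(word, ONE);
      }
    }
  }

  // Reducer: sum all counts emitted for the same word
  public static class IntSumReducer extends Reducer<Text, IntWritable, Text, IntWritable> {
    private final IntWritable result = new IntWritable();

    @Override
    public void reduce(Text key, Iterable<IntWritable> values, Context context)
        throws IOException, InterruptedException {
      int sum = 0;
      for (IntWritable val : values) {
        sum += val.get();
      }
      result.set(sum);
      context.write(key, result);
    }
  }

  public static void main(String[] args) throws Exception {
    Job job = Job.getInstance(new Configuration(), "word count");
    job.setJarByClass(WordCount.class);
    job.setMapperClass(TokenizerMapper.class);
    job.setCombinerClass(IntSumReducer.class); // local pre-aggregation on each mapper
    job.setReducerClass(IntSumReducer.class);
    job.setOutputKeyClass(Text.class);
    job.setOutputValueClass(IntWritable.class);
    FileInputFormat.addInputPath(job, new Path(args[0]));   // e.g. an HDFS input directory
    FileOutputFormat.setOutputPath(job, new Path(args[1])); // must not already exist
    System.exit(job.waitForCompletion(true) ? 0 : 1);
  }
}

Packaged into a jar, a job like this would typically be launched with something like: hadoop jar wordcount.jar WordCount /data/input /data/output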
6. RELATED PROJECTS
•Ambari™
A tool for provisioning, managing, and monitoring Apache Hadoop clusters
•Avro™
A data serialization system
•Cassandra™
A scalable multi-master database with no single points of failure
•Chukwa™
A data collection system for managing large distributed systems
7. •HBase™
A scalable, distributed database supporting structured data storage for large tables (a short client sketch follows this list)
•Hive™
A data warehouse infrastructure for data summarization and ad hoc querying
•Mahout™
A scalable machine learning and data mining library
•Pig™
A high-level data-flow language and execution framework for parallel computation
RELATED PROJECTS
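As one concrete example from this list, the following sketch uses the HBase Java client to write and read back a single cell. The table name ("users"), column family ("profile"), and qualifier ("email") are hypothetical, and the table is assumed to already exist.

import org.apache.hadoop.conf.Configuration;
import org.apache.hadoop.hbase.HBaseConfiguration;
import org.apache.hadoop.hbase.TableName;
import org.apache.hadoop.hbase.client.Connection;
import org.apache.hadoop.hbase.client.ConnectionFactory;
import org.apache.hadoop.hbase.client.Get;
import org.apache.hadoop.hbase.client.Put;
import org.apache.hadoop.hbase.client.Result;
import org.apache.hadoop.hbase.client.Table;
import org.apache.hadoop.hbase.util.Bytes;

public class HBasePutGet {
  public static void main(String[] args) throws Exception {
    // Cluster coordinates come from hbase-site.xml on the classpath
    Configuration conf = HBaseConfiguration.create();
    try (Connection connection = ConnectionFactory.createConnection(conf);
         Table table = connection.getTable(TableName.valueOf("users"))) { // hypothetical table

      // Write one cell: row key "user42", column profile:email
      Put put = new Put(Bytes.toBytes("user42"));
      put.addColumn(Bytes.toBytes("profile"), Bytes.toBytes("email"),
                    Bytes.toBytes("user42@example.com"));
      table.put(put);

      // Read it back by row key
      Result result = table.get(new Get(Bytes.toBytes("user42")));
      byte[] email = result.getValue(Bytes.toBytes("profile"), Bytes.toBytes("email"));
      System.out.println("email = " + Bytes.toString(email));
    }
  }
}

Because rows are retrieved directly by key, reads like this stay fast even when a table holds billions of rows, which is the property that makes HBase suit the large-table use case described above.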
8. •Spark™
A fast and general compute engine for Hadoop data (a short sketch follows this list)
•Tez™
A generalized data-flow programming framework providing a powerful and flexible engine to execute data processing for both batch and interactive use cases
•ZooKeeper™
A high-performance coordination service for distributed applications
RELATED PROJECTS
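As a contrast with the MapReduce WordCount shown earlier, the sketch below expresses the same computation using Spark's Java API in a handful of lines. Spark keeps intermediate data in memory across stages, which is a large part of why it is described as a fast engine. The HDFS paths are hypothetical.

import java.util.Arrays;

import org.apache.spark.SparkConf;
import org.apache.spark.api.java.JavaPairRDD;
import org.apache.spark.api.java.JavaRDD;
import org.apache.spark.api.java.JavaSparkContext;

import scala.Tuple2;

public class SparkWordCount {
  public static void main(String[] args) {
    SparkConf conf = new SparkConf().setAppName("spark word count");
    try (JavaSparkContext sc = new JavaSparkContext(conf)) {
      JavaRDD<String> lines = sc.textFile("hdfs:///data/input"); // hypothetical path
      JavaPairRDD<String, Integer> counts = lines
          .flatMap(line -> Arrays.asList(line.split("\\s+")).iterator()) // tokenize each line
          .mapToPair(word -> new Tuple2<>(word, 1))                      // emit (word, 1)
          .reduceByKey(Integer::sum);                                    // total per word
      counts.saveAsTextFile("hdfs:///data/output"); // hypothetical path
    }
  }
}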
9. Hadoop is useful because…
BIG DATA STORAGE
FAST PROCESSING
BETTER RESULTS & INSIGHTS
10. Hadoop is Big Data software that…
best meets industry needs
allows movement of large volumes of complex and relational data into a single repository
offers affordable storage and retrieval for analytic applications
makes raw data always available
processes Big Data in parallel by dividing it into multiple parts
11. Hadoop Uses
PUBLIC HEALTH
PRODUCT DEVELOPMENT
R&D
STOCK & COMMODITIES TRADING
SALES & MARKETING
12. The Hadoop Advantage…
Insights from everywhere, anywhere
•Hadoop can handle all types of data:
structured | unstructured | log files | pictures | audio files | communications records | email
•No prior need for a schema
•Lets you decide on queries later
•Makes all data usable, not just database records
13. The Hadoop Advantage…
Economics of everything online
•Legacy systems are far too expensive for general use with large data sets
•Hadoop relies on internally redundant storage on commodity hardware, so storing data that was not previously viable becomes possible
•Keep data for real-time interactive querying, business intelligence, analysis and visualization
14. The Hadoop Advantage…
Streamline Data Usage
•Unstructured data accounts for 90% of the data
•Data storage, management, and analytics must be rethought
•Legacy systems will complement Hadoop-optimized data management
•Hadoop is cost-effective, scalable, and provides a streamlined architecture
16. Hadoop helps
DATA PROCESSING
•extract, transform, and load (ETL) data from source systems
•transfer data stored in Hadoop to and from database management systems
•batch process large quantities of unstructured and semi-structured data (see the HDFS loading sketch below)
NETWORK MANAGEMENT
•capture, analyze, and display data collected from servers, storage devices, and other IT hardware
•monitor network activity and diagnose bottlenecks and other issues
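To ground the ETL load step, here is a small sketch that uses the HDFS FileSystem Java API to copy a local extract into the cluster and then list the target directory to confirm the load; the file and directory paths are hypothetical.

import org.apache.hadoop.conf.Configuration;
import org.apache.hadoop.fs.FileStatus;
import org.apache.hadoop.fs.FileSystem;
import org.apache.hadoop.fs.Path;

public class HdfsLoad {
  public static void main(String[] args) throws Exception {
    // Picks up core-site.xml / hdfs-site.xml from the classpath
    Configuration conf = new Configuration();
    try (FileSystem fs = FileSystem.get(conf)) {
      // Hypothetical paths: a local export file landing in a raw zone on HDFS
      Path local = new Path("/tmp/transactions.csv");
      Path remote = new Path("/data/raw/transactions/transactions.csv");
      fs.copyFromLocalFile(local, remote);

      // Confirm the load by listing the target directory
      for (FileStatus status : fs.listStatus(remote.getParent())) {
        System.out.println(status.getPath() + "  " + status.getLen() + " bytes");
      }
    }
  }
}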
17. RETAIL FRAUD
•monitor, model, and analyze high volumes of data from transactions
•by extracting features and patterns, retailers can help prevent credit card account fraud
RECOMMENDATION TOOL
•match and recommend users to one another
•compare products and services based on analysis of user profiles and behavioral data
Hadoop helps
18. SENTIMENT ANALYSIS
•advanced text analytics tools analyze unstructured text of social media
•analyze tweets and Facebook posts to determine user sentiment related to particular companies, brands, or products (a mapper sketch follows below)
FINANCIAL RISK MODELING
•analysis of large volumes of transactional data to determine the risk and exposure of financial assets
•prepare for potential "what-if" scenarios based on simulated market behavior
•due diligence tasks
•rate potential clients for risk
Hadoop helps
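To ground the sentiment-analysis use case, here is a deliberately tiny mapper sketch that scores social media posts against a hard-coded word list; a real system would use proper text analytics, and the input format (brand, tab, post text) is assumed purely for illustration. The IntSumReducer pattern from the WordCount sketch earlier can total the scores per brand.

import java.io.IOException;
import java.util.Arrays;
import java.util.HashSet;
import java.util.Set;

import org.apache.hadoop.io.IntWritable;
import org.apache.hadoop.io.LongWritable;
import org.apache.hadoop.io.Text;
import org.apache.hadoop.mapreduce.Mapper;

// Emits (brand, +1) for each positive opinion word and (brand, -1) for each
// negative one found in the post text.
public class SentimentMapper extends Mapper<LongWritable, Text, Text, IntWritable> {
  private static final Set<String> POSITIVE =
      new HashSet<>(Arrays.asList("love", "great", "awesome", "good"));
  private static final Set<String> NEGATIVE =
      new HashSet<>(Arrays.asList("hate", "terrible", "awful", "bad"));

  @Override
  protected void map(LongWritable key, Text value, Context context)
      throws IOException, InterruptedException {
    String[] parts = value.toString().split("\t", 2); // assumed "<brand>\t<post text>"
    if (parts.length < 2) {
      return; // skip malformed lines
    }
    Text brand = new Text(parts[0]);
    for (String token : parts[1].toLowerCase().split("\\W+")) {
      if (POSITIVE.contains(token)) {
        context.write(brand, new IntWritable(1));
      } else if (NEGATIVE.contains(token)) {
        context.write(brand, new IntWritable(-1));
      }
    }
  }
}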
19. MARKETING CAMPAIGN ANALYSIS
•monitor and determine the effectiveness of marketing campaigns
•increase the accuracy of analysis by incorporating higher volumes of detailed data
CUSTOMER INFLUENCER ANALYSIS
•Mine social networking data for mapping customer influence over others
•help enterprises determine which customers are most important and influential, for focused marketing
Hadoop helps
20. CUSTOMER EXPERIENCE ANALYSIS
•integrate data from previously siloed customer interaction channels
•understand the impact of customer interactions to optimize the customer lifecycle experience
RESEARCH & DEVELOPMENT
•comb through volumes of text-based research and historical data to support development of new products
Hadoop helps
21. Hadoop provides a solid foundation on which to build critical big data solutions. Using it the right way from the very beginning can help ensure success.
22. Visit our blog for more interesting articles on Big Data, Crawling & Extraction, and Analytics