SlideShare a Scribd company logo
1 of 6
Download to read offline
International Journal of Scientific Research and Engineering Development-– Volume 2 Issue 3, May –June 2019
Available at www.ijsred.com
ISSN : 2581-7175 ©IJSRED: All Rights are Reserved Page 704
Analyzing and Improving the Efficiency of Hadoop-Cluster for
Big Data Analysis
Deepak Kumar1, Saurabh Charaya2
1M.Tech Research Scholar, Department of Computer Science & Engineering, OM Institute of Technology and
Management Hisar(Haryana)
2Assistant Professor, Head of Department of Computer Science & Engineering, OM Institute of Technology
and Management Hisar(Haryana)
Abstract:
The extent of Digitization is continuously increasing by leaps and bounds now a days, resulting
in accumulation of large amount of data every second. The data can be a transaction, it can be a
social media chat or from any other source. Processing such a Big Data is a very time
consuming and tedious task. Though we have advanced systems and techniques to process this
data but still there are possibilities of improvements. This paper analysis and explores such
possibilities to improve the performance of Hadoop-cluster which is being used to process the
big data. In this paper, we first analysis the performance of the cluster and then suggest some
method to improve the overall performance of the system.
Keywords: Big Data, Hadoop-cluster, Fault tolerance, MapReduce.
1. INTRODUCTION
Since the inception of computers, our lives
have been greatly changed by the growing
technologies. Large amount of data is being
generated every second and stored in
physical storage media. It may be
meaningful or may not be but has to be
stored indeed. Such a huge storage of data
that is exponentially growing too is termed
as Big data. It is a very tedious task to
process such a large amount of data to find
out the valuable information. Challenges[1]
include examination, get, data curation, look
for, sharing, amassing, trade, portrayal, and
information assurance. The term
consistently suggests just to the usage of
perceptive examination or other certain
moved systems to expel an impetus from
data, and some of the time to a particular
size of enlightening gathering. Accuracy in
tremendous data may provoke dynamically
beyond any doubt essential initiative. Also,
better decisions can mean increasingly
imperative operational profitability, cost
declines and lessened danger.
1.1 Processing Big Data
Google distributed a paper on a
methodology called MapReduce[2] in 2004;
the MapReduce framework gives a parallel
getting ready model and related utilization to
process huge proportions of data. With
MapReduce, request are part and scattered
transversely over parallel center points and
dealt with in parallel (the Map step). The
RESEARCH ARTICLE OPEN ACCESS
International Journal of Scientific Research and Engineering Development-– Volume 2 Issue 3, May –June 2019
Available at www.ijsred.com
ISSN : 2581-7175 ©IJSRED: All Rights are Reserved Page 705
results are then aggregated and passed on
(the Reduce step). The structure was
particularly successful, so others expected to
copy the estimation. Thus, an execution of
the MapReduce framework was
implemented by an Apache open source
adventure named Hadoop [3].
Recent studies exhibit that the usage of an
alternate layer configuration is a probability
for overseeing colossal data. The Distributed
Parallel structure appropriates data over
different getting ready units and parallel
taking care of units give data significantly
faster. This kind of designing installs data
into a parallel DBMS, which executes the
usage of MapReduce and Hadoop
frameworks. This kind of structure wants to
make the dealing with power direct to the
end customer by using a front end
application server.
1.2 Mapreduce structure: Basic
engineering
The MapReduce framework[4] is introduced
as a fundamental and notable programming
model that engages straight forward
enhancement of flexible parallel applications
to process enormous proportions of data on
tremendous gatherings of thing machines
[1][2]. In particular, the execution portrayed
in the principal paper is predominantly
planned to achieve unrivaled on tremendous
packs of item PCs[2] One of the basic
inclinations of this approach is that it
separates the application from the nuances
of running a scattered program, for instance,
issues on data dissemination, booking, and
adjustment to inner disappointment. In this
model, the estimation takes a great deal of
key/regard sets data and delivers a ton of
key/regard consolidates as yield. The
MapReduce framework does perform
computation using two limits: Map and
Reduce. The Map work takes an information
consolidate and makes a ton of moderate
key/regard sets. The MapReduce framework
clusters together all widely appealing
regards related with a comparable
transitional key I and passes them to the
Reduce work. The Reduce work gets a
widely appealing key 1 with its game plan of
characteristics and solidifies them. Normally
just zero or one yield regard is conveyed per
Reduce conjuring. The guideline favored
point of view of this model is that it
empowers extensive figuring’s to be viably
parallelized and executed to be used as the
basic framework for adjustment to interior
disappointment. The arrangement of the
MapReduce framework has considered
going with essential guidelines [5].
—Unreliable Commodity Hardware with
low-cost
—A highly Scalable RAIN Cluster.
— Abstracted but highly parallel
— Easy to administer but fault tolerant.
1.3 Fault Tolerance
Fault tolerance[6] is described as, when the
system limits suitably with no data lost
paying little heed to whether some hardware
parts of the structure has failed. It is hard to
accomplish penny percent Fault tolerance
anyway faults can be persevered up
somewhat. HDFS give high throughput to
get to data application and proper to have
tremendous instructive records as their
information [7]. The essential inspiration
driving this Fault tolerances to oust
occasionally happening frustrations, which
happens as a rule and bothers the standard
International Journal of Scientific Research and Engineering Development-– Volume 2 Issue 3, May –June 2019
Available at www.ijsred.com
ISSN : 2581-7175 ©IJSRED: All Rights are Reserved Page 706
working of the structure. Single point
disillusionment center points happen when a
lone center dissatisfaction makes the entire
structure crashes. The fundamental
commitment of Fault tolerances to empty
such center point which bothers the entire
commonplace working of the structure [8].
The three standard courses of action which
are used to convey adjustment to inside
disappointment are data replication,
heartbeat messages and checkpoint and
recovery.
1.3.1 Data replication
An application can indicate the quantity of
reproductions of a document at the time it is
made, and this number can be changed
whenever after that. [9] The name hub
settles on all choices concerning square
replication. HDFS utilizes a keen imitation
position demonstrate for unwavering quality
and execution. Advancing copy position
makes HDFS extraordinary from most other
dispersed record frameworks, and is
encouraged by a rack-mindful imitation
arrangement approach that utilizes organize
data transmission effectively. A similar
duplicate of information is situated on a few
distinctive processing hubs so when that
information duplicate is required it is given
by any of the information hub which isn't
occupied in speaking with different hubs.
The significant favorable position of
utilizing this method is to give moment
recuperation from hub and information
disappointments. Yet, one fundamental
impediment is by utilizing this kind of
adaptation to internal failure component
high memory is devoured in putting away
same information on various hubs. There
additionally might be potential outcomes of
information irregularity. It is often utilized
strategy since this procedure gives fast
recuperation of information from
disappointments. When the information is
put away on the hub in the group same
duplicate of the information is imitated in
two additional hubs. for example absolutely
three duplicates of information is situated on
a similar bunch.
1.3.2 Heartbeat messages
The answer for the over two issues are
heartbeat messages. Here heartbeat message
is a message sent from a creator to the
endpoint to recognize whether and when the
innovator fizzles or is never again
accessible. Heartbeat messages are relentless
on an occasional repeating premise from the
creator's startup until the designer's
shutdown. At the point when the collector
recognizes absence of heartbeat messages
amid a foreseen entry period, the goal may
discover that the designer has fizzled,
shutdown, or is commonly no longer
accessible.
1.3.3 Checkpoint and recuperation
In this strategy, comparable idea as that of
rollback is utilized to endure Faults up to
some point. After a settled time span interim
the duplicate report has been spared and put
away. It only rollbacks to the last spare
moment that the fault happens and after that
it begins performing exchange once more. In
general execution time of framework is
expanded, in light of the fact that the
rollback tasks need to return and check for
the last spared steady stages which
increment the time. Likewise there is one
noteworthy downside of this strategy is that
it is very tedious technique contrasted with
first strategy however it requires less extra
assets.
International Journal of Scientific Research and Engineering Development-– Volume 2 Issue 3, May –June 2019
Available at www.ijsred.com
ISSN : 2581-7175 ©IJSRED: All Rights are Reserved Page 707
2. LITERATURE SURVEY
Expanded development of the information
and the need to move the information
between server farms, requests, an effective
information exchange apparatus which can,
exchange the information at higher rates as
well as handle the shortcomings happened
amid the exchange. Absence of fault
tolerance mechanisms, would need to
retransmit the entire information, if there
should arise an occurrence of any fault amid
the exchange. Thus, fault tolerance is an
essential part of big data tools.
There is enough enthusiasm for the
MapReduce(MR) perspective for generous
scale data examination [10]. Regardless of
the way that the basic control stream of this
structure has existed in parallel DBMS for
over 20 years, some have thought about MR
as a radically new handling model [2][11].
In this paper[11], depiction and connection
of the two perfect models have been made.
In addition, the both systems with respect to
execution and development have been
surveyed. To this end, a benchmark
containing a social occasion of endeavors
that they continued running on an open
source type of MR and furthermore on two
parallel DBMSs is portrayed. For each task,
every system's execution, for various
degrees of parallelism on a cluster of 100
nodes, is assessed. Their results revealed
some fascinating tradeoffs. Regardless of the
way that the methodology to stack data into
and tune the execution of parallel DBMSs
took any more extended than the MR
structure, the observed performance of these
DBMSs was too good. Hypothesis has been
made about the explanations behind the
thrilling execution qualification and
considers use thoughts that future systems
should take from the two sorts of structures.
Some investigation has been facilitated to
execution and appraisal of execution in
Hadoop [2][12][13]. Officer executed
MapReduce for shared memory systems.
Phoenix give a versatile execution with both
multi-focus and conventional symmetric
multi-processors. Bingsheng et al. made
Mars which is a Mapreduce framework for
plans multi-processors [12]. The goal of
Mars was to disguise the programming
capriciousness of GPU by giving clear
MapReduce interface.
There are two sorts of record systems
managing broad reports for bundles,
specifically, parallel file system and Internet
service file system [14]. Hadoop
distribution file system (HDFS) [2] is an
acclaimed Internet service file system that
gives the right reflection to data planning in
Mapreduce frameworks.
In recent times, High Performance
Computing (HPC) systems have been
moving from expensive immensely parallel
models to clusters of item PCs to use cost
and execution benefits properly. Fault
tolerance is creating a lot of stress for such
long running systems. In a paper[15] a short
review of the rate of failures of HPC
systems is given and besides think about the
Fault tolerance approaches for HPC systems
and issues with these strategies. Rollback-
recovery techniques are discussed in light of
the way that they are commonly used for
long-running applications on HPC
structures. Specifically, the component
necessities of rollback-recovery are
inspected and a logical order is made for in
excess of twenty common checkpoint/restart
game plans.
A paper[6] by T. Cowsalya and S.R.
Mugunthan likewise clarified different fault
International Journal of Scientific Research and Engineering Development-– Volume 2 Issue 3, May –June 2019
Available at www.ijsred.com
ISSN : 2581-7175 ©IJSRED: All Rights are Reserved Page 708
tolerance techniques for big data Hadoop
bunches. A checkpoint based algorithm[16]
has been presented by Peng Hu and Wei Dai
to enhance fault toerances and recovery of
data using hadoop cluster.
3. CONCLUSION
Big data is vast term and there is an
immense scope of improvement in the
processing of this data as well.
In this paper, we are analyzing the Hadoop
system for big data analysis using map
reduce algorithm and purposing just one of
those ways to improve the performance of a
Hadoop-cluster. The basic idea here is to
analyze the hadoop-cluster and suggest the
best methodology to increase the efficiency.
REFERENCES
[1] Katal, Avita; Wazid, Mohammad;
Goudar, R. H. -- [IEEE 2013 Sixth
International Conference on Contemporary
Computing (IC3) - Noida, India (2013.08.8-
2013.08.10)] 2013 S
[2] MapReduce: Simplifed Data Processing
on Large Clusters by Jeffrey Dean and
Sanjay Ghemawat
[3] http://hadoop.apache.org
[4] Sakr, S., Liu, A., and Fayoumi, A. G.
2013. The family of mapreduce and large-
scale data processing systems. ACM
Comput. Surv. 46, 1, Article 11 (October
2013)
[5] H.-c. Yang, A. Dasdan, R.-L. Hsiao, and
D. S. Parker. Map-reduce-merge: simplified
relational data processing on large clusters.
[6] Hadoop architecture and fault tolerance
based hadoop clusters in geographically
distributed data center by T. Cowsalya and
S.R. Mugunthan, SVS College of
Engineering, Coimbatore, Tamil Nadu, India
[7] The Hadoop Distributed File System:
Architecture and Design” by Dhruba
Borthakur,
http://hadoop.apache.org/docs/r0.18.0/hdfs_
design.pdf
[8] Selic,B. 2004. Fault tolerance
techniques for distributed systems. IBM.
[9] Joey Joblonski. Introduction to Hadoop.
A Dell technical white paper.
http://i.dell.com/sites/content/business/soluti
ons/whitepapers/en/Documents/hadoop-
introduction.pdf
[10] A Comparison of Approaches to Large-
Scale Data Analysis (by Andrew Pavlo, Erik
Paulson Alexander Rasin et al) 2009.
[11] D. A. Patterson. Technical Perspective:
The Data Center is the Computer. Commun.
ACM, 51(1):105–105, 2008.
[12] B.He, W.Fang, Q.Luo, N.Govindaraju,
and T.Wang. Mars: a MapReduce
framework on graphics processors. ACM,
2008.
[13] C. Ranger, R. Raghuraman, A.
Penmetsa, G. Bradski, and C. Kozyrakis.
Evaluating mapreduce for multi-core and
multiprocessor systems. High-Performance
Computer Architecture, International
Symposium on, 0:13–24, 2007.
[14] A scalable, high performance file
system.
http://lustre.org.
International Journal of Scientific Research and Engineering Development-– Volume 2 Issue 3, May –June 2019
Available at www.ijsred.com
ISSN : 2581-7175 ©IJSRED: All Rights are Reserved Page 709
[15] A survey of fault tolerance mechanisms
and checkpoint/restart implementations for
high performance computing systems
(Ifeanyi P. Egwutuoha · David Levy · Bran
Selic ·
Shiping Chen)
[16] Enhancing fault tolerance based on
hadoop cluster: Peng Hu and Wei Dai
(2014)
Deepak kumar is a MTECH student in
department of Computer Science &
Engineering from OM Institute of
Technology and Management,
Hisar(Haryana)
Er. Saurabh Charaya is presently working as
Head of the Department in the Department
of Computer Science & Engineering at Om
Institute of Technology and Management,
Hisar(Haryana) since August, 2011. He has
13 years of rich experience in the field of
Computer Science & Engineering. He did
MCA and MTech(CSE) from Ch. Devi Lal
University, Sirsa. He did MBA(Information
Technology) from Amity University, Noida.
He is pursuing Ph.D(CSE) from Bhagwant
University, Ajmer. He has published 25
research papers in International and 10
research papers in National Journals. He has
presented 5 research papers in International
and 5 research papers in National
Conferences.

More Related Content

What's hot

Towards a low cost etl system
Towards a low cost etl systemTowards a low cost etl system
Towards a low cost etl systemijdms
 
Cache mechanism to avoid dulpication of same thing in hadoop system to speed ...
Cache mechanism to avoid dulpication of same thing in hadoop system to speed ...Cache mechanism to avoid dulpication of same thing in hadoop system to speed ...
Cache mechanism to avoid dulpication of same thing in hadoop system to speed ...eSAT Journals
 
Distributed Feature Selection for Efficient Economic Big Data Analysis
Distributed Feature Selection for Efficient Economic Big Data AnalysisDistributed Feature Selection for Efficient Economic Big Data Analysis
Distributed Feature Selection for Efficient Economic Big Data AnalysisIRJET Journal
 
6.a survey on big data challenges in the context of predictive
6.a survey on big data challenges in the context of predictive6.a survey on big data challenges in the context of predictive
6.a survey on big data challenges in the context of predictiveEditorJST
 
Survey on load balancing and data skew mitigation in mapreduce applications
Survey on load balancing and data skew mitigation in mapreduce applicationsSurvey on load balancing and data skew mitigation in mapreduce applications
Survey on load balancing and data skew mitigation in mapreduce applicationsIAEME Publication
 
Improving performance of apriori algorithm using hadoop
Improving performance of apriori algorithm using hadoopImproving performance of apriori algorithm using hadoop
Improving performance of apriori algorithm using hadoopeSAT Journals
 
IRJET - Health Medicare Data using Tweets in Twitter
IRJET - Health Medicare Data using Tweets in TwitterIRJET - Health Medicare Data using Tweets in Twitter
IRJET - Health Medicare Data using Tweets in TwitterIRJET Journal
 
Synchronization and replication through ocmdbs
Synchronization and replication through ocmdbsSynchronization and replication through ocmdbs
Synchronization and replication through ocmdbsIAEME Publication
 
Application of Distributed processing and Big data in agricultural DSS
Application of Distributed processing and Big data in agricultural DSSApplication of Distributed processing and Big data in agricultural DSS
Application of Distributed processing and Big data in agricultural DSSAnusha Basavaraj
 
Efficient Cost Minimization for Big Data Processing
Efficient Cost Minimization for Big Data ProcessingEfficient Cost Minimization for Big Data Processing
Efficient Cost Minimization for Big Data ProcessingIRJET Journal
 
User Inspired Management of Scientific Jobs in Grids and Clouds
User Inspired Management of Scientific Jobs in Grids and CloudsUser Inspired Management of Scientific Jobs in Grids and Clouds
User Inspired Management of Scientific Jobs in Grids and CloudsEran Chinthaka Withana
 
Disaster Recovery & Data Backup Strategies
Disaster Recovery & Data Backup StrategiesDisaster Recovery & Data Backup Strategies
Disaster Recovery & Data Backup StrategiesSpiceworks
 
What every IT audit should know about backup and recovery
What every IT audit should know about backup and recoveryWhat every IT audit should know about backup and recovery
What every IT audit should know about backup and recoveryessbaih
 
Map reduce in BIG DATA
Map reduce in BIG DATAMap reduce in BIG DATA
Map reduce in BIG DATAGauravBiswas9
 
Redefining ETL Pipelines with Apache Technologies to Accelerate Decision-Maki...
Redefining ETL Pipelines with Apache Technologies to Accelerate Decision-Maki...Redefining ETL Pipelines with Apache Technologies to Accelerate Decision-Maki...
Redefining ETL Pipelines with Apache Technologies to Accelerate Decision-Maki...Eran Chinthaka Withana
 

What's hot (20)

Dremel
DremelDremel
Dremel
 
Data Dimensional Reduction by Order Prediction in Heterogeneous Environment
Data Dimensional Reduction by Order Prediction in Heterogeneous EnvironmentData Dimensional Reduction by Order Prediction in Heterogeneous Environment
Data Dimensional Reduction by Order Prediction in Heterogeneous Environment
 
Select enterprise backup software
Select enterprise backup softwareSelect enterprise backup software
Select enterprise backup software
 
Towards a low cost etl system
Towards a low cost etl systemTowards a low cost etl system
Towards a low cost etl system
 
[IJET V2I5P18] Authors:Pooja Mangla, Dr. Sandip Kumar Goyal
[IJET V2I5P18] Authors:Pooja Mangla, Dr. Sandip Kumar Goyal[IJET V2I5P18] Authors:Pooja Mangla, Dr. Sandip Kumar Goyal
[IJET V2I5P18] Authors:Pooja Mangla, Dr. Sandip Kumar Goyal
 
Cache mechanism to avoid dulpication of same thing in hadoop system to speed ...
Cache mechanism to avoid dulpication of same thing in hadoop system to speed ...Cache mechanism to avoid dulpication of same thing in hadoop system to speed ...
Cache mechanism to avoid dulpication of same thing in hadoop system to speed ...
 
Distributed Feature Selection for Efficient Economic Big Data Analysis
Distributed Feature Selection for Efficient Economic Big Data AnalysisDistributed Feature Selection for Efficient Economic Big Data Analysis
Distributed Feature Selection for Efficient Economic Big Data Analysis
 
6.a survey on big data challenges in the context of predictive
6.a survey on big data challenges in the context of predictive6.a survey on big data challenges in the context of predictive
6.a survey on big data challenges in the context of predictive
 
Survey on load balancing and data skew mitigation in mapreduce applications
Survey on load balancing and data skew mitigation in mapreduce applicationsSurvey on load balancing and data skew mitigation in mapreduce applications
Survey on load balancing and data skew mitigation in mapreduce applications
 
Fn3110961103
Fn3110961103Fn3110961103
Fn3110961103
 
Improving performance of apriori algorithm using hadoop
Improving performance of apriori algorithm using hadoopImproving performance of apriori algorithm using hadoop
Improving performance of apriori algorithm using hadoop
 
IRJET - Health Medicare Data using Tweets in Twitter
IRJET - Health Medicare Data using Tweets in TwitterIRJET - Health Medicare Data using Tweets in Twitter
IRJET - Health Medicare Data using Tweets in Twitter
 
Synchronization and replication through ocmdbs
Synchronization and replication through ocmdbsSynchronization and replication through ocmdbs
Synchronization and replication through ocmdbs
 
Application of Distributed processing and Big data in agricultural DSS
Application of Distributed processing and Big data in agricultural DSSApplication of Distributed processing and Big data in agricultural DSS
Application of Distributed processing and Big data in agricultural DSS
 
Efficient Cost Minimization for Big Data Processing
Efficient Cost Minimization for Big Data ProcessingEfficient Cost Minimization for Big Data Processing
Efficient Cost Minimization for Big Data Processing
 
User Inspired Management of Scientific Jobs in Grids and Clouds
User Inspired Management of Scientific Jobs in Grids and CloudsUser Inspired Management of Scientific Jobs in Grids and Clouds
User Inspired Management of Scientific Jobs in Grids and Clouds
 
Disaster Recovery & Data Backup Strategies
Disaster Recovery & Data Backup StrategiesDisaster Recovery & Data Backup Strategies
Disaster Recovery & Data Backup Strategies
 
What every IT audit should know about backup and recovery
What every IT audit should know about backup and recoveryWhat every IT audit should know about backup and recovery
What every IT audit should know about backup and recovery
 
Map reduce in BIG DATA
Map reduce in BIG DATAMap reduce in BIG DATA
Map reduce in BIG DATA
 
Redefining ETL Pipelines with Apache Technologies to Accelerate Decision-Maki...
Redefining ETL Pipelines with Apache Technologies to Accelerate Decision-Maki...Redefining ETL Pipelines with Apache Technologies to Accelerate Decision-Maki...
Redefining ETL Pipelines with Apache Technologies to Accelerate Decision-Maki...
 

Similar to IJSRED-V2I3P84

IRJET- Analysis for EnhancedForecastof Expense Movement in Stock Exchange
IRJET- Analysis for EnhancedForecastof Expense Movement in Stock ExchangeIRJET- Analysis for EnhancedForecastof Expense Movement in Stock Exchange
IRJET- Analysis for EnhancedForecastof Expense Movement in Stock ExchangeIRJET Journal
 
IRJET-Framework for Dynamic Resource Allocation and Efficient Scheduling Stra...
IRJET-Framework for Dynamic Resource Allocation and Efficient Scheduling Stra...IRJET-Framework for Dynamic Resource Allocation and Efficient Scheduling Stra...
IRJET-Framework for Dynamic Resource Allocation and Efficient Scheduling Stra...IRJET Journal
 
Deduplication on Encrypted Big Data in HDFS
Deduplication on Encrypted Big Data in HDFSDeduplication on Encrypted Big Data in HDFS
Deduplication on Encrypted Big Data in HDFSIRJET Journal
 
Big Data Processing with Hadoop : A Review
Big Data Processing with Hadoop : A ReviewBig Data Processing with Hadoop : A Review
Big Data Processing with Hadoop : A ReviewIRJET Journal
 
IRJET - A Prognosis Approach for Stock Market Prediction based on Term Streak...
IRJET - A Prognosis Approach for Stock Market Prediction based on Term Streak...IRJET - A Prognosis Approach for Stock Market Prediction based on Term Streak...
IRJET - A Prognosis Approach for Stock Market Prediction based on Term Streak...IRJET Journal
 
Managing Big data using Hadoop Map Reduce in Telecom Domain
Managing Big data using Hadoop Map Reduce in Telecom DomainManaging Big data using Hadoop Map Reduce in Telecom Domain
Managing Big data using Hadoop Map Reduce in Telecom DomainAM Publications
 
IRJET- Survey of Big Data with Hadoop
IRJET-  	  Survey of Big Data with HadoopIRJET-  	  Survey of Big Data with Hadoop
IRJET- Survey of Big Data with HadoopIRJET Journal
 
Data-Intensive Technologies for Cloud Computing
Data-Intensive Technologies for CloudComputingData-Intensive Technologies for CloudComputing
Data-Intensive Technologies for Cloud Computinghuda2018
 
Dataintensive
DataintensiveDataintensive
Dataintensivesulfath
 
Data Distribution Handling on Cloud for Deployment of Big Data
Data Distribution Handling on Cloud for Deployment of Big DataData Distribution Handling on Cloud for Deployment of Big Data
Data Distribution Handling on Cloud for Deployment of Big Dataneirew J
 
IRJET- A Comprehensive Review on Query Optimization for Distributed Databases
IRJET- A Comprehensive Review on Query Optimization for Distributed DatabasesIRJET- A Comprehensive Review on Query Optimization for Distributed Databases
IRJET- A Comprehensive Review on Query Optimization for Distributed DatabasesIRJET Journal
 
Whitepaper: Big Data - Infrastructure Considerations - Happiest Minds
Whitepaper: Big Data - Infrastructure Considerations - Happiest MindsWhitepaper: Big Data - Infrastructure Considerations - Happiest Minds
Whitepaper: Big Data - Infrastructure Considerations - Happiest MindsHappiest Minds Technologies
 
IRJET- Improving Data Availability by using VPC Strategy in Cloud Environ...
IRJET-  	  Improving Data Availability by using VPC Strategy in Cloud Environ...IRJET-  	  Improving Data Availability by using VPC Strategy in Cloud Environ...
IRJET- Improving Data Availability by using VPC Strategy in Cloud Environ...IRJET Journal
 
IRJET- Performing Load Balancing between Namenodes in HDFS
IRJET- Performing Load Balancing between Namenodes in HDFSIRJET- Performing Load Balancing between Namenodes in HDFS
IRJET- Performing Load Balancing between Namenodes in HDFSIRJET Journal
 
A Survey on Data Mapping Strategy for data stored in the storage cloud 111
A Survey on Data Mapping Strategy for data stored in the storage cloud  111A Survey on Data Mapping Strategy for data stored in the storage cloud  111
A Survey on Data Mapping Strategy for data stored in the storage cloud 111NavNeet KuMar
 
REAL-TIME INTRUSION DETECTION SYSTEM FOR BIG DATA
REAL-TIME INTRUSION DETECTION SYSTEM FOR BIG DATAREAL-TIME INTRUSION DETECTION SYSTEM FOR BIG DATA
REAL-TIME INTRUSION DETECTION SYSTEM FOR BIG DATAijp2p
 

Similar to IJSRED-V2I3P84 (20)

IRJET- Analysis for EnhancedForecastof Expense Movement in Stock Exchange
IRJET- Analysis for EnhancedForecastof Expense Movement in Stock ExchangeIRJET- Analysis for EnhancedForecastof Expense Movement in Stock Exchange
IRJET- Analysis for EnhancedForecastof Expense Movement in Stock Exchange
 
IRJET-Framework for Dynamic Resource Allocation and Efficient Scheduling Stra...
IRJET-Framework for Dynamic Resource Allocation and Efficient Scheduling Stra...IRJET-Framework for Dynamic Resource Allocation and Efficient Scheduling Stra...
IRJET-Framework for Dynamic Resource Allocation and Efficient Scheduling Stra...
 
Deduplication on Encrypted Big Data in HDFS
Deduplication on Encrypted Big Data in HDFSDeduplication on Encrypted Big Data in HDFS
Deduplication on Encrypted Big Data in HDFS
 
Big Data Processing with Hadoop : A Review
Big Data Processing with Hadoop : A ReviewBig Data Processing with Hadoop : A Review
Big Data Processing with Hadoop : A Review
 
E018142329
E018142329E018142329
E018142329
 
IRJET - A Prognosis Approach for Stock Market Prediction based on Term Streak...
IRJET - A Prognosis Approach for Stock Market Prediction based on Term Streak...IRJET - A Prognosis Approach for Stock Market Prediction based on Term Streak...
IRJET - A Prognosis Approach for Stock Market Prediction based on Term Streak...
 
Facade
FacadeFacade
Facade
 
Managing Big data using Hadoop Map Reduce in Telecom Domain
Managing Big data using Hadoop Map Reduce in Telecom DomainManaging Big data using Hadoop Map Reduce in Telecom Domain
Managing Big data using Hadoop Map Reduce in Telecom Domain
 
IRJET- Survey of Big Data with Hadoop
IRJET-  	  Survey of Big Data with HadoopIRJET-  	  Survey of Big Data with Hadoop
IRJET- Survey of Big Data with Hadoop
 
Data-Intensive Technologies for Cloud Computing
Data-Intensive Technologies for CloudComputingData-Intensive Technologies for CloudComputing
Data-Intensive Technologies for Cloud Computing
 
Dataintensive
DataintensiveDataintensive
Dataintensive
 
Data Distribution Handling on Cloud for Deployment of Big Data
Data Distribution Handling on Cloud for Deployment of Big DataData Distribution Handling on Cloud for Deployment of Big Data
Data Distribution Handling on Cloud for Deployment of Big Data
 
B1803031217
B1803031217B1803031217
B1803031217
 
P2P Cache Resolution System for MANET
P2P Cache Resolution System for MANETP2P Cache Resolution System for MANET
P2P Cache Resolution System for MANET
 
IRJET- A Comprehensive Review on Query Optimization for Distributed Databases
IRJET- A Comprehensive Review on Query Optimization for Distributed DatabasesIRJET- A Comprehensive Review on Query Optimization for Distributed Databases
IRJET- A Comprehensive Review on Query Optimization for Distributed Databases
 
Whitepaper: Big Data - Infrastructure Considerations - Happiest Minds
Whitepaper: Big Data - Infrastructure Considerations - Happiest MindsWhitepaper: Big Data - Infrastructure Considerations - Happiest Minds
Whitepaper: Big Data - Infrastructure Considerations - Happiest Minds
 
IRJET- Improving Data Availability by using VPC Strategy in Cloud Environ...
IRJET-  	  Improving Data Availability by using VPC Strategy in Cloud Environ...IRJET-  	  Improving Data Availability by using VPC Strategy in Cloud Environ...
IRJET- Improving Data Availability by using VPC Strategy in Cloud Environ...
 
IRJET- Performing Load Balancing between Namenodes in HDFS
IRJET- Performing Load Balancing between Namenodes in HDFSIRJET- Performing Load Balancing between Namenodes in HDFS
IRJET- Performing Load Balancing between Namenodes in HDFS
 
A Survey on Data Mapping Strategy for data stored in the storage cloud 111
A Survey on Data Mapping Strategy for data stored in the storage cloud  111A Survey on Data Mapping Strategy for data stored in the storage cloud  111
A Survey on Data Mapping Strategy for data stored in the storage cloud 111
 
REAL-TIME INTRUSION DETECTION SYSTEM FOR BIG DATA
REAL-TIME INTRUSION DETECTION SYSTEM FOR BIG DATAREAL-TIME INTRUSION DETECTION SYSTEM FOR BIG DATA
REAL-TIME INTRUSION DETECTION SYSTEM FOR BIG DATA
 

More from IJSRED

IJSRED-V3I6P13
IJSRED-V3I6P13IJSRED-V3I6P13
IJSRED-V3I6P13IJSRED
 
School Bus Tracking and Security System
School Bus Tracking and Security SystemSchool Bus Tracking and Security System
School Bus Tracking and Security SystemIJSRED
 
BigBasket encashing the Demonetisation: A big opportunity
BigBasket encashing the Demonetisation: A big opportunityBigBasket encashing the Demonetisation: A big opportunity
BigBasket encashing the Demonetisation: A big opportunityIJSRED
 
Quantitative and Qualitative Analysis of Plant Leaf Disease
Quantitative and Qualitative Analysis of Plant Leaf DiseaseQuantitative and Qualitative Analysis of Plant Leaf Disease
Quantitative and Qualitative Analysis of Plant Leaf DiseaseIJSRED
 
DC Fast Charger and Battery Management System for Electric Vehicles
DC Fast Charger and Battery Management System for Electric VehiclesDC Fast Charger and Battery Management System for Electric Vehicles
DC Fast Charger and Battery Management System for Electric VehiclesIJSRED
 
Growth Path Followed by France
Growth Path Followed by FranceGrowth Path Followed by France
Growth Path Followed by FranceIJSRED
 
Acquisition System
Acquisition SystemAcquisition System
Acquisition SystemIJSRED
 
Parallelization of Graceful Labeling Using Open MP
Parallelization of Graceful Labeling Using Open MPParallelization of Graceful Labeling Using Open MP
Parallelization of Graceful Labeling Using Open MPIJSRED
 
Study of Phenotypic Plasticity of Fruits of Luffa Acutangula Var. Amara
Study of Phenotypic Plasticity of  Fruits of Luffa Acutangula Var. AmaraStudy of Phenotypic Plasticity of  Fruits of Luffa Acutangula Var. Amara
Study of Phenotypic Plasticity of Fruits of Luffa Acutangula Var. AmaraIJSRED
 
Understanding Architecture of Internet of Things
Understanding Architecture of Internet of ThingsUnderstanding Architecture of Internet of Things
Understanding Architecture of Internet of ThingsIJSRED
 
Smart shopping cart
Smart shopping cartSmart shopping cart
Smart shopping cartIJSRED
 
An Emperical Study of Learning How Soft Skills is Essential for Management St...
An Emperical Study of Learning How Soft Skills is Essential for Management St...An Emperical Study of Learning How Soft Skills is Essential for Management St...
An Emperical Study of Learning How Soft Skills is Essential for Management St...IJSRED
 
Smart Canteen Management
Smart Canteen ManagementSmart Canteen Management
Smart Canteen ManagementIJSRED
 
Gandhian trusteeship and Economic Ethics
Gandhian trusteeship and Economic EthicsGandhian trusteeship and Economic Ethics
Gandhian trusteeship and Economic EthicsIJSRED
 
Impacts of a New Spatial Variable on a Black Hole Metric Solution
Impacts of a New Spatial Variable on a Black Hole Metric SolutionImpacts of a New Spatial Variable on a Black Hole Metric Solution
Impacts of a New Spatial Variable on a Black Hole Metric SolutionIJSRED
 
A Study to Assess the Effectiveness of Planned Teaching Programme on Knowledg...
A Study to Assess the Effectiveness of Planned Teaching Programme on Knowledg...A Study to Assess the Effectiveness of Planned Teaching Programme on Knowledg...
A Study to Assess the Effectiveness of Planned Teaching Programme on Knowledg...IJSRED
 
Inginious Trafalgar Contrivition System
Inginious Trafalgar Contrivition SystemInginious Trafalgar Contrivition System
Inginious Trafalgar Contrivition SystemIJSRED
 
Farmer's Analytical assistant
Farmer's Analytical assistantFarmer's Analytical assistant
Farmer's Analytical assistantIJSRED
 
Functions of Forensic Engineering Investigator in India
Functions of Forensic Engineering Investigator in IndiaFunctions of Forensic Engineering Investigator in India
Functions of Forensic Engineering Investigator in IndiaIJSRED
 
Participation Politique Feminine En Competition Électorale Au Congo-Kinshasa....
Participation Politique Feminine En Competition Électorale Au Congo-Kinshasa....Participation Politique Feminine En Competition Électorale Au Congo-Kinshasa....
Participation Politique Feminine En Competition Électorale Au Congo-Kinshasa....IJSRED
 

More from IJSRED (20)

IJSRED-V3I6P13
IJSRED-V3I6P13IJSRED-V3I6P13
IJSRED-V3I6P13
 
School Bus Tracking and Security System
School Bus Tracking and Security SystemSchool Bus Tracking and Security System
School Bus Tracking and Security System
 
BigBasket encashing the Demonetisation: A big opportunity
BigBasket encashing the Demonetisation: A big opportunityBigBasket encashing the Demonetisation: A big opportunity
BigBasket encashing the Demonetisation: A big opportunity
 
Quantitative and Qualitative Analysis of Plant Leaf Disease
Quantitative and Qualitative Analysis of Plant Leaf DiseaseQuantitative and Qualitative Analysis of Plant Leaf Disease
Quantitative and Qualitative Analysis of Plant Leaf Disease
 
DC Fast Charger and Battery Management System for Electric Vehicles
DC Fast Charger and Battery Management System for Electric VehiclesDC Fast Charger and Battery Management System for Electric Vehicles
DC Fast Charger and Battery Management System for Electric Vehicles
 
Growth Path Followed by France
Growth Path Followed by FranceGrowth Path Followed by France
Growth Path Followed by France
 
Acquisition System
Acquisition SystemAcquisition System
Acquisition System
 
Parallelization of Graceful Labeling Using Open MP
Parallelization of Graceful Labeling Using Open MPParallelization of Graceful Labeling Using Open MP
Parallelization of Graceful Labeling Using Open MP
 
Study of Phenotypic Plasticity of Fruits of Luffa Acutangula Var. Amara
Study of Phenotypic Plasticity of  Fruits of Luffa Acutangula Var. AmaraStudy of Phenotypic Plasticity of  Fruits of Luffa Acutangula Var. Amara
Study of Phenotypic Plasticity of Fruits of Luffa Acutangula Var. Amara
 
Understanding Architecture of Internet of Things
Understanding Architecture of Internet of ThingsUnderstanding Architecture of Internet of Things
Understanding Architecture of Internet of Things
 
Smart shopping cart
Smart shopping cartSmart shopping cart
Smart shopping cart
 
An Emperical Study of Learning How Soft Skills is Essential for Management St...
An Emperical Study of Learning How Soft Skills is Essential for Management St...An Emperical Study of Learning How Soft Skills is Essential for Management St...
An Emperical Study of Learning How Soft Skills is Essential for Management St...
 
Smart Canteen Management
Smart Canteen ManagementSmart Canteen Management
Smart Canteen Management
 
Gandhian trusteeship and Economic Ethics
Gandhian trusteeship and Economic EthicsGandhian trusteeship and Economic Ethics
Gandhian trusteeship and Economic Ethics
 
Impacts of a New Spatial Variable on a Black Hole Metric Solution
Impacts of a New Spatial Variable on a Black Hole Metric SolutionImpacts of a New Spatial Variable on a Black Hole Metric Solution
Impacts of a New Spatial Variable on a Black Hole Metric Solution
 
A Study to Assess the Effectiveness of Planned Teaching Programme on Knowledg...
A Study to Assess the Effectiveness of Planned Teaching Programme on Knowledg...A Study to Assess the Effectiveness of Planned Teaching Programme on Knowledg...
A Study to Assess the Effectiveness of Planned Teaching Programme on Knowledg...
 
Inginious Trafalgar Contrivition System
Inginious Trafalgar Contrivition SystemInginious Trafalgar Contrivition System
Inginious Trafalgar Contrivition System
 
Farmer's Analytical assistant
Farmer's Analytical assistantFarmer's Analytical assistant
Farmer's Analytical assistant
 
Functions of Forensic Engineering Investigator in India
Functions of Forensic Engineering Investigator in IndiaFunctions of Forensic Engineering Investigator in India
Functions of Forensic Engineering Investigator in India
 
Participation Politique Feminine En Competition Électorale Au Congo-Kinshasa....
Participation Politique Feminine En Competition Électorale Au Congo-Kinshasa....Participation Politique Feminine En Competition Électorale Au Congo-Kinshasa....
Participation Politique Feminine En Competition Électorale Au Congo-Kinshasa....
 

Recently uploaded

HARMONY IN THE HUMAN BEING - Unit-II UHV-2
HARMONY IN THE HUMAN BEING - Unit-II UHV-2HARMONY IN THE HUMAN BEING - Unit-II UHV-2
HARMONY IN THE HUMAN BEING - Unit-II UHV-2RajaP95
 
Application of Residue Theorem to evaluate real integrations.pptx
Application of Residue Theorem to evaluate real integrations.pptxApplication of Residue Theorem to evaluate real integrations.pptx
Application of Residue Theorem to evaluate real integrations.pptx959SahilShah
 
main PPT.pptx of girls hostel security using rfid
main PPT.pptx of girls hostel security using rfidmain PPT.pptx of girls hostel security using rfid
main PPT.pptx of girls hostel security using rfidNikhilNagaraju
 
High Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur EscortsHigh Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur EscortsCall Girls in Nagpur High Profile
 
OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...
OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...
OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...Soham Mondal
 
College Call Girls Nashik Nehal 7001305949 Independent Escort Service Nashik
College Call Girls Nashik Nehal 7001305949 Independent Escort Service NashikCollege Call Girls Nashik Nehal 7001305949 Independent Escort Service Nashik
College Call Girls Nashik Nehal 7001305949 Independent Escort Service NashikCall Girls in Nagpur High Profile
 
Biology for Computer Engineers Course Handout.pptx
Biology for Computer Engineers Course Handout.pptxBiology for Computer Engineers Course Handout.pptx
Biology for Computer Engineers Course Handout.pptxDeepakSakkari2
 
Call Girls Delhi {Jodhpur} 9711199012 high profile service
Call Girls Delhi {Jodhpur} 9711199012 high profile serviceCall Girls Delhi {Jodhpur} 9711199012 high profile service
Call Girls Delhi {Jodhpur} 9711199012 high profile servicerehmti665
 
(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...ranjana rawat
 
Study on Air-Water & Water-Water Heat Exchange in a Finned Tube Exchanger
Study on Air-Water & Water-Water Heat Exchange in a Finned Tube ExchangerStudy on Air-Water & Water-Water Heat Exchange in a Finned Tube Exchanger
Study on Air-Water & Water-Water Heat Exchange in a Finned Tube ExchangerAnamika Sarkar
 
ZXCTN 5804 / ZTE PTN / ZTE POTN / ZTE 5804 PTN / ZTE POTN 5804 ( 100/200 GE Z...
ZXCTN 5804 / ZTE PTN / ZTE POTN / ZTE 5804 PTN / ZTE POTN 5804 ( 100/200 GE Z...ZXCTN 5804 / ZTE PTN / ZTE POTN / ZTE 5804 PTN / ZTE POTN 5804 ( 100/200 GE Z...
ZXCTN 5804 / ZTE PTN / ZTE POTN / ZTE 5804 PTN / ZTE POTN 5804 ( 100/200 GE Z...ZTE
 
Architect Hassan Khalil Portfolio for 2024
Architect Hassan Khalil Portfolio for 2024Architect Hassan Khalil Portfolio for 2024
Architect Hassan Khalil Portfolio for 2024hassan khalil
 
Sachpazis Costas: Geotechnical Engineering: A student's Perspective Introduction
Sachpazis Costas: Geotechnical Engineering: A student's Perspective IntroductionSachpazis Costas: Geotechnical Engineering: A student's Perspective Introduction
Sachpazis Costas: Geotechnical Engineering: A student's Perspective IntroductionDr.Costas Sachpazis
 
Software and Systems Engineering Standards: Verification and Validation of Sy...
Software and Systems Engineering Standards: Verification and Validation of Sy...Software and Systems Engineering Standards: Verification and Validation of Sy...
Software and Systems Engineering Standards: Verification and Validation of Sy...VICTOR MAESTRE RAMIREZ
 
High Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur EscortsHigh Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur Escortsranjana rawat
 
Call Girls Narol 7397865700 Independent Call Girls
Call Girls Narol 7397865700 Independent Call GirlsCall Girls Narol 7397865700 Independent Call Girls
Call Girls Narol 7397865700 Independent Call Girlsssuser7cb4ff
 
Internship report on mechanical engineering
Internship report on mechanical engineeringInternship report on mechanical engineering
Internship report on mechanical engineeringmalavadedarshan25
 

Recently uploaded (20)

Exploring_Network_Security_with_JA3_by_Rakesh Seal.pptx
Exploring_Network_Security_with_JA3_by_Rakesh Seal.pptxExploring_Network_Security_with_JA3_by_Rakesh Seal.pptx
Exploring_Network_Security_with_JA3_by_Rakesh Seal.pptx
 
HARMONY IN THE HUMAN BEING - Unit-II UHV-2
HARMONY IN THE HUMAN BEING - Unit-II UHV-2HARMONY IN THE HUMAN BEING - Unit-II UHV-2
HARMONY IN THE HUMAN BEING - Unit-II UHV-2
 
Application of Residue Theorem to evaluate real integrations.pptx
Application of Residue Theorem to evaluate real integrations.pptxApplication of Residue Theorem to evaluate real integrations.pptx
Application of Residue Theorem to evaluate real integrations.pptx
 
main PPT.pptx of girls hostel security using rfid
main PPT.pptx of girls hostel security using rfidmain PPT.pptx of girls hostel security using rfid
main PPT.pptx of girls hostel security using rfid
 
High Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur EscortsHigh Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur Escorts
 
OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...
OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...
OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...
 
College Call Girls Nashik Nehal 7001305949 Independent Escort Service Nashik
College Call Girls Nashik Nehal 7001305949 Independent Escort Service NashikCollege Call Girls Nashik Nehal 7001305949 Independent Escort Service Nashik
College Call Girls Nashik Nehal 7001305949 Independent Escort Service Nashik
 
Biology for Computer Engineers Course Handout.pptx
Biology for Computer Engineers Course Handout.pptxBiology for Computer Engineers Course Handout.pptx
Biology for Computer Engineers Course Handout.pptx
 
Call Girls Delhi {Jodhpur} 9711199012 high profile service
Call Girls Delhi {Jodhpur} 9711199012 high profile serviceCall Girls Delhi {Jodhpur} 9711199012 high profile service
Call Girls Delhi {Jodhpur} 9711199012 high profile service
 
(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
 
Study on Air-Water & Water-Water Heat Exchange in a Finned Tube Exchanger
Study on Air-Water & Water-Water Heat Exchange in a Finned Tube ExchangerStudy on Air-Water & Water-Water Heat Exchange in a Finned Tube Exchanger
Study on Air-Water & Water-Water Heat Exchange in a Finned Tube Exchanger
 
★ CALL US 9953330565 ( HOT Young Call Girls In Badarpur delhi NCR
★ CALL US 9953330565 ( HOT Young Call Girls In Badarpur delhi NCR★ CALL US 9953330565 ( HOT Young Call Girls In Badarpur delhi NCR
★ CALL US 9953330565 ( HOT Young Call Girls In Badarpur delhi NCR
 
🔝9953056974🔝!!-YOUNG call girls in Rajendra Nagar Escort rvice Shot 2000 nigh...
🔝9953056974🔝!!-YOUNG call girls in Rajendra Nagar Escort rvice Shot 2000 nigh...🔝9953056974🔝!!-YOUNG call girls in Rajendra Nagar Escort rvice Shot 2000 nigh...
🔝9953056974🔝!!-YOUNG call girls in Rajendra Nagar Escort rvice Shot 2000 nigh...
 
ZXCTN 5804 / ZTE PTN / ZTE POTN / ZTE 5804 PTN / ZTE POTN 5804 ( 100/200 GE Z...
ZXCTN 5804 / ZTE PTN / ZTE POTN / ZTE 5804 PTN / ZTE POTN 5804 ( 100/200 GE Z...ZXCTN 5804 / ZTE PTN / ZTE POTN / ZTE 5804 PTN / ZTE POTN 5804 ( 100/200 GE Z...
ZXCTN 5804 / ZTE PTN / ZTE POTN / ZTE 5804 PTN / ZTE POTN 5804 ( 100/200 GE Z...
 
Architect Hassan Khalil Portfolio for 2024
Architect Hassan Khalil Portfolio for 2024Architect Hassan Khalil Portfolio for 2024
Architect Hassan Khalil Portfolio for 2024
 
Sachpazis Costas: Geotechnical Engineering: A student's Perspective Introduction
Sachpazis Costas: Geotechnical Engineering: A student's Perspective IntroductionSachpazis Costas: Geotechnical Engineering: A student's Perspective Introduction
Sachpazis Costas: Geotechnical Engineering: A student's Perspective Introduction
 
Software and Systems Engineering Standards: Verification and Validation of Sy...
Software and Systems Engineering Standards: Verification and Validation of Sy...Software and Systems Engineering Standards: Verification and Validation of Sy...
Software and Systems Engineering Standards: Verification and Validation of Sy...
 
High Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur EscortsHigh Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur Escorts
 
Call Girls Narol 7397865700 Independent Call Girls
Call Girls Narol 7397865700 Independent Call GirlsCall Girls Narol 7397865700 Independent Call Girls
Call Girls Narol 7397865700 Independent Call Girls
 
Internship report on mechanical engineering
Internship report on mechanical engineeringInternship report on mechanical engineering
Internship report on mechanical engineering
 

IJSRED-V2I3P84

  • 1. International Journal of Scientific Research and Engineering Development-– Volume 2 Issue 3, May –June 2019 Available at www.ijsred.com ISSN : 2581-7175 ©IJSRED: All Rights are Reserved Page 704 Analyzing and Improving the Efficiency of Hadoop-Cluster for Big Data Analysis Deepak Kumar1, Saurabh Charaya2 1M.Tech Research Scholar, Department of Computer Science & Engineering, OM Institute of Technology and Management Hisar(Haryana) 2Assistant Professor, Head of Department of Computer Science & Engineering, OM Institute of Technology and Management Hisar(Haryana) Abstract: The extent of Digitization is continuously increasing by leaps and bounds now a days, resulting in accumulation of large amount of data every second. The data can be a transaction, it can be a social media chat or from any other source. Processing such a Big Data is a very time consuming and tedious task. Though we have advanced systems and techniques to process this data but still there are possibilities of improvements. This paper analysis and explores such possibilities to improve the performance of Hadoop-cluster which is being used to process the big data. In this paper, we first analysis the performance of the cluster and then suggest some method to improve the overall performance of the system. Keywords: Big Data, Hadoop-cluster, Fault tolerance, MapReduce. 1. INTRODUCTION Since the inception of computers, our lives have been greatly changed by the growing technologies. Large amount of data is being generated every second and stored in physical storage media. It may be meaningful or may not be but has to be stored indeed. Such a huge storage of data that is exponentially growing too is termed as Big data. It is a very tedious task to process such a large amount of data to find out the valuable information. Challenges[1] include examination, get, data curation, look for, sharing, amassing, trade, portrayal, and information assurance. The term consistently suggests just to the usage of perceptive examination or other certain moved systems to expel an impetus from data, and some of the time to a particular size of enlightening gathering. Accuracy in tremendous data may provoke dynamically beyond any doubt essential initiative. Also, better decisions can mean increasingly imperative operational profitability, cost declines and lessened danger. 1.1 Processing Big Data Google distributed a paper on a methodology called MapReduce[2] in 2004; the MapReduce framework gives a parallel getting ready model and related utilization to process huge proportions of data. With MapReduce, request are part and scattered transversely over parallel center points and dealt with in parallel (the Map step). The RESEARCH ARTICLE OPEN ACCESS
  • 2. International Journal of Scientific Research and Engineering Development-– Volume 2 Issue 3, May –June 2019 Available at www.ijsred.com ISSN : 2581-7175 ©IJSRED: All Rights are Reserved Page 705 results are then aggregated and passed on (the Reduce step). The structure was particularly successful, so others expected to copy the estimation. Thus, an execution of the MapReduce framework was implemented by an Apache open source adventure named Hadoop [3]. Recent studies exhibit that the usage of an alternate layer configuration is a probability for overseeing colossal data. The Distributed Parallel structure appropriates data over different getting ready units and parallel taking care of units give data significantly faster. This kind of designing installs data into a parallel DBMS, which executes the usage of MapReduce and Hadoop frameworks. This kind of structure wants to make the dealing with power direct to the end customer by using a front end application server. 1.2 Mapreduce structure: Basic engineering The MapReduce framework[4] is introduced as a fundamental and notable programming model that engages straight forward enhancement of flexible parallel applications to process enormous proportions of data on tremendous gatherings of thing machines [1][2]. In particular, the execution portrayed in the principal paper is predominantly planned to achieve unrivaled on tremendous packs of item PCs[2] One of the basic inclinations of this approach is that it separates the application from the nuances of running a scattered program, for instance, issues on data dissemination, booking, and adjustment to inner disappointment. In this model, the estimation takes a great deal of key/regard sets data and delivers a ton of key/regard consolidates as yield. The MapReduce framework does perform computation using two limits: Map and Reduce. The Map work takes an information consolidate and makes a ton of moderate key/regard sets. The MapReduce framework clusters together all widely appealing regards related with a comparable transitional key I and passes them to the Reduce work. The Reduce work gets a widely appealing key 1 with its game plan of characteristics and solidifies them. Normally just zero or one yield regard is conveyed per Reduce conjuring. The guideline favored point of view of this model is that it empowers extensive figuring’s to be viably parallelized and executed to be used as the basic framework for adjustment to interior disappointment. The arrangement of the MapReduce framework has considered going with essential guidelines [5]. —Unreliable Commodity Hardware with low-cost —A highly Scalable RAIN Cluster. — Abstracted but highly parallel — Easy to administer but fault tolerant. 1.3 Fault Tolerance Fault tolerance[6] is described as, when the system limits suitably with no data lost paying little heed to whether some hardware parts of the structure has failed. It is hard to accomplish penny percent Fault tolerance anyway faults can be persevered up somewhat. HDFS give high throughput to get to data application and proper to have tremendous instructive records as their information [7]. The essential inspiration driving this Fault tolerances to oust occasionally happening frustrations, which happens as a rule and bothers the standard
  • 3. International Journal of Scientific Research and Engineering Development-– Volume 2 Issue 3, May –June 2019 Available at www.ijsred.com ISSN : 2581-7175 ©IJSRED: All Rights are Reserved Page 706 working of the structure. Single point disillusionment center points happen when a lone center dissatisfaction makes the entire structure crashes. The fundamental commitment of Fault tolerances to empty such center point which bothers the entire commonplace working of the structure [8]. The three standard courses of action which are used to convey adjustment to inside disappointment are data replication, heartbeat messages and checkpoint and recovery. 1.3.1 Data replication An application can indicate the quantity of reproductions of a document at the time it is made, and this number can be changed whenever after that. [9] The name hub settles on all choices concerning square replication. HDFS utilizes a keen imitation position demonstrate for unwavering quality and execution. Advancing copy position makes HDFS extraordinary from most other dispersed record frameworks, and is encouraged by a rack-mindful imitation arrangement approach that utilizes organize data transmission effectively. A similar duplicate of information is situated on a few distinctive processing hubs so when that information duplicate is required it is given by any of the information hub which isn't occupied in speaking with different hubs. The significant favorable position of utilizing this method is to give moment recuperation from hub and information disappointments. Yet, one fundamental impediment is by utilizing this kind of adaptation to internal failure component high memory is devoured in putting away same information on various hubs. There additionally might be potential outcomes of information irregularity. It is often utilized strategy since this procedure gives fast recuperation of information from disappointments. When the information is put away on the hub in the group same duplicate of the information is imitated in two additional hubs. for example absolutely three duplicates of information is situated on a similar bunch. 1.3.2 Heartbeat messages The answer for the over two issues are heartbeat messages. Here heartbeat message is a message sent from a creator to the endpoint to recognize whether and when the innovator fizzles or is never again accessible. Heartbeat messages are relentless on an occasional repeating premise from the creator's startup until the designer's shutdown. At the point when the collector recognizes absence of heartbeat messages amid a foreseen entry period, the goal may discover that the designer has fizzled, shutdown, or is commonly no longer accessible. 1.3.3 Checkpoint and recuperation In this strategy, comparable idea as that of rollback is utilized to endure Faults up to some point. After a settled time span interim the duplicate report has been spared and put away. It only rollbacks to the last spare moment that the fault happens and after that it begins performing exchange once more. In general execution time of framework is expanded, in light of the fact that the rollback tasks need to return and check for the last spared steady stages which increment the time. Likewise there is one noteworthy downside of this strategy is that it is very tedious technique contrasted with first strategy however it requires less extra assets.
  • 4. International Journal of Scientific Research and Engineering Development-– Volume 2 Issue 3, May –June 2019 Available at www.ijsred.com ISSN : 2581-7175 ©IJSRED: All Rights are Reserved Page 707 2. LITERATURE SURVEY Expanded development of the information and the need to move the information between server farms, requests, an effective information exchange apparatus which can, exchange the information at higher rates as well as handle the shortcomings happened amid the exchange. Absence of fault tolerance mechanisms, would need to retransmit the entire information, if there should arise an occurrence of any fault amid the exchange. Thus, fault tolerance is an essential part of big data tools. There is enough enthusiasm for the MapReduce(MR) perspective for generous scale data examination [10]. Regardless of the way that the basic control stream of this structure has existed in parallel DBMS for over 20 years, some have thought about MR as a radically new handling model [2][11]. In this paper[11], depiction and connection of the two perfect models have been made. In addition, the both systems with respect to execution and development have been surveyed. To this end, a benchmark containing a social occasion of endeavors that they continued running on an open source type of MR and furthermore on two parallel DBMSs is portrayed. For each task, every system's execution, for various degrees of parallelism on a cluster of 100 nodes, is assessed. Their results revealed some fascinating tradeoffs. Regardless of the way that the methodology to stack data into and tune the execution of parallel DBMSs took any more extended than the MR structure, the observed performance of these DBMSs was too good. Hypothesis has been made about the explanations behind the thrilling execution qualification and considers use thoughts that future systems should take from the two sorts of structures. Some investigation has been facilitated to execution and appraisal of execution in Hadoop [2][12][13]. Officer executed MapReduce for shared memory systems. Phoenix give a versatile execution with both multi-focus and conventional symmetric multi-processors. Bingsheng et al. made Mars which is a Mapreduce framework for plans multi-processors [12]. The goal of Mars was to disguise the programming capriciousness of GPU by giving clear MapReduce interface. There are two sorts of record systems managing broad reports for bundles, specifically, parallel file system and Internet service file system [14]. Hadoop distribution file system (HDFS) [2] is an acclaimed Internet service file system that gives the right reflection to data planning in Mapreduce frameworks. In recent times, High Performance Computing (HPC) systems have been moving from expensive immensely parallel models to clusters of item PCs to use cost and execution benefits properly. Fault tolerance is creating a lot of stress for such long running systems. In a paper[15] a short review of the rate of failures of HPC systems is given and besides think about the Fault tolerance approaches for HPC systems and issues with these strategies. Rollback- recovery techniques are discussed in light of the way that they are commonly used for long-running applications on HPC structures. Specifically, the component necessities of rollback-recovery are inspected and a logical order is made for in excess of twenty common checkpoint/restart game plans. A paper[6] by T. Cowsalya and S.R. Mugunthan likewise clarified different fault
  • 5. International Journal of Scientific Research and Engineering Development-– Volume 2 Issue 3, May –June 2019 Available at www.ijsred.com ISSN : 2581-7175 ©IJSRED: All Rights are Reserved Page 708 tolerance techniques for big data Hadoop bunches. A checkpoint based algorithm[16] has been presented by Peng Hu and Wei Dai to enhance fault toerances and recovery of data using hadoop cluster. 3. CONCLUSION Big data is vast term and there is an immense scope of improvement in the processing of this data as well. In this paper, we are analyzing the Hadoop system for big data analysis using map reduce algorithm and purposing just one of those ways to improve the performance of a Hadoop-cluster. The basic idea here is to analyze the hadoop-cluster and suggest the best methodology to increase the efficiency. REFERENCES [1] Katal, Avita; Wazid, Mohammad; Goudar, R. H. -- [IEEE 2013 Sixth International Conference on Contemporary Computing (IC3) - Noida, India (2013.08.8- 2013.08.10)] 2013 S [2] MapReduce: Simplifed Data Processing on Large Clusters by Jeffrey Dean and Sanjay Ghemawat [3] http://hadoop.apache.org [4] Sakr, S., Liu, A., and Fayoumi, A. G. 2013. The family of mapreduce and large- scale data processing systems. ACM Comput. Surv. 46, 1, Article 11 (October 2013) [5] H.-c. Yang, A. Dasdan, R.-L. Hsiao, and D. S. Parker. Map-reduce-merge: simplified relational data processing on large clusters. [6] Hadoop architecture and fault tolerance based hadoop clusters in geographically distributed data center by T. Cowsalya and S.R. Mugunthan, SVS College of Engineering, Coimbatore, Tamil Nadu, India [7] The Hadoop Distributed File System: Architecture and Design” by Dhruba Borthakur, http://hadoop.apache.org/docs/r0.18.0/hdfs_ design.pdf [8] Selic,B. 2004. Fault tolerance techniques for distributed systems. IBM. [9] Joey Joblonski. Introduction to Hadoop. A Dell technical white paper. http://i.dell.com/sites/content/business/soluti ons/whitepapers/en/Documents/hadoop- introduction.pdf [10] A Comparison of Approaches to Large- Scale Data Analysis (by Andrew Pavlo, Erik Paulson Alexander Rasin et al) 2009. [11] D. A. Patterson. Technical Perspective: The Data Center is the Computer. Commun. ACM, 51(1):105–105, 2008. [12] B.He, W.Fang, Q.Luo, N.Govindaraju, and T.Wang. Mars: a MapReduce framework on graphics processors. ACM, 2008. [13] C. Ranger, R. Raghuraman, A. Penmetsa, G. Bradski, and C. Kozyrakis. Evaluating mapreduce for multi-core and multiprocessor systems. High-Performance Computer Architecture, International Symposium on, 0:13–24, 2007. [14] A scalable, high performance file system. http://lustre.org.
  • 6. International Journal of Scientific Research and Engineering Development-– Volume 2 Issue 3, May –June 2019 Available at www.ijsred.com ISSN : 2581-7175 ©IJSRED: All Rights are Reserved Page 709 [15] A survey of fault tolerance mechanisms and checkpoint/restart implementations for high performance computing systems (Ifeanyi P. Egwutuoha · David Levy · Bran Selic · Shiping Chen) [16] Enhancing fault tolerance based on hadoop cluster: Peng Hu and Wei Dai (2014) Deepak kumar is a MTECH student in department of Computer Science & Engineering from OM Institute of Technology and Management, Hisar(Haryana) Er. Saurabh Charaya is presently working as Head of the Department in the Department of Computer Science & Engineering at Om Institute of Technology and Management, Hisar(Haryana) since August, 2011. He has 13 years of rich experience in the field of Computer Science & Engineering. He did MCA and MTech(CSE) from Ch. Devi Lal University, Sirsa. He did MBA(Information Technology) from Amity University, Noida. He is pursuing Ph.D(CSE) from Bhagwant University, Ajmer. He has published 25 research papers in International and 10 research papers in National Journals. He has presented 5 research papers in International and 5 research papers in National Conferences.