SlideShare a Scribd company logo
1 of 5
Download to read offline
International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056
Volume: 05 Issue: 03 | Mar-2018 www.irjet.net p-ISSN: 2395-0072
© 2018, IRJET | Impact Factor value: 6.171 | ISO 9001:2008 Certified Journal | Page 2054
Generate Distributed Metadata using Blockchain Technology within
HDFS Environment
Shilpa K. Sangode1, Harish K. Barapatre2
1,2 Department of Computer Engineering, Yadavrao Tasgaonkar Institute of Engineering and Technology
Bhivpuri Road, Karjat
---------------------------------------------------------------------***----------------------------------------------------------------------
Abstract - Big Data processing and analyzing is very crucial
across various industries. Rapidly increasing data volume and
multiple sources of data generation it has created a challenge
to analyze the huge amount of data. Traditional relational
database management system RDBMS cannotprocesstheload
of data generated. Hadoop Distributed File System (HDFS) is a
distributed and enormously parallel processing system suited
for big data whereby the storage is created on the distributed
file system. Single point of failure is one of the key challenges
due to the existence only one single Name Node in the existing
HDFS architecture. Metadata summarizes basic information
about the distribution of data blocks, block size information
and replica management. Newer HDFS architecture for big
data storage eliminates the concept of Name Node with the
functionalities of the Name Node being stores the metadata
using blockchain technology. mathematical cryptographic
techniques like hashcode with the help of BlockChain Miners.
Keywords—Big data; Hadoop Distributed File System;
Blockchain; Hashcode; Metadeta; MINER, HDFC
architecture, Single point of failure.
I. INTRODUCTION
With the fast expansion of semiconductor industry, the
manufacturing process became more multifaceted and
meticulous. With the advanced nature of technologies and
machines data processing became more precise and huge
scale. The five facets of the big data are Volume (size ofdata),
Velocity (swiftness of data), Variety (analysis of streaming
data), Veracity (data uncertainty) and Value (data
quantification) [1]. Day by day the data set is growing.
Conventional data processing application is not enough to
dispense desired analysis and usability. Nature of data may
be unstructured, semi-structured or structured. Majority of
the relational database management systems (RDBMS) are
process only structured data. Data are being generated and
collected from variety of standalone sources in different
formats. Post generation of the data it is collated together to
analyze and to create insights for givenproblems.Businesses
and institutes use intelligence to forecast future trends, for
improving decision making. These insights are a valuable
asset in current market. Hadoop was created to lessen the
time and efforts required to query the data of numerous data
formats.
Apache Hadoop is an open-source software framework.
It’s one of the main application is in distributed storage in
Hadoop Distributed File System (HDFS); and big data set
processing by means of MapReduce programming model. It
contains computer clusters madeupofcommodityhardware
[2]. HDFS is formed of a Name Node and a cluster of Data
Nodes. HDFS is must for storage and processing of terabytes
or petabytes of data; especially when hard disk capacity is
limited on local machines. The client data is split in multiple
data nodes in a cluster which uses physical hard disk
(commodity hardware’s) in the nodes.Metadata containsthe
information about the block size, distribution of data,replica
maintenance. In latest versions of Map Reduce framework it
is a layer configured with block size either 64MB (default) or
128MB placed on top of prevailing operating system [3] [4].
With possible hypothesis of a HDFS architecture for big
data storage which will eliminate the model of Name Node
with the functionalities of the Name Node storing the
metadata by means of blockchain technology. The vital role
of Name Node is generating and recording metadata
including the information about the position ofblocksstored
the size of the files, hierarchy and permissions. Additional
tasks include block report from Data Nodes, maintaining and
managing Data Nodes, reception of heartbeat using the
blockchain technology. Blockchain stores the metadata in
encrypted format. Bitcoin, the digital crypto currency uses
Blockchain technology. It keeps the metadata information in
a distributive fashion. Particularly, business transactions
authorizations with particular mathematical cryptographic
methods such as hashcode with theaidofBlockChainMiners.
II. ELEMENTS OF HDFS DESIGN
The Hadoop Distributed File System is created to gather a
mammoth volume of data in severalservers/clusters. Datain
a Hadoop cluster is fragmented in small blocks (default is
64MB) and then dispensed throughout the clusters. Rack
aware,a block placement algorithm, defines the placementof
the blocks based on the replication factor, which is mostly 3
by default.
The master slave configuration decides the working
principle of HDFS. A master service comprises of NameNode,
secondary Name Node and job tracker besides the slave
services involving Data Node and task tracker.
Master node and slave nodes are capable of
communicating with each other. The slave nodes talk with
each other. The job tracker can converse with the Task
Tracker as illustrated in figure 1.
International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056
Volume: 05 Issue: 03 | Mar-2018 www.irjet.net p-ISSN: 2395-0072
© 2018, IRJET | Impact Factor value: 6.171 | ISO 9001:2008 Certified Journal | Page 2055
Figure 1: Elements of HDFS architecture.
NameNode: An HDFS cluster consists of an only one
NameNode and it is master node. It is responsible for storage
of the Meta data of files and directories. It preserves the file
system tree and the metadata for all the files and directories
in the tree. [3] The NameNode contains information about all
the blocks of fragmented files in a cluster.
HDFS clients constantly sends the updates of metadata to
namenode; when they release the write operations such as
create, rename, delete, mkdir, etc. Datanodes send block
report to namenode cyclically to notify it whether there are
corrupted data block or not. All these modifications in
metadata are stored in files, known as FsImage and EditLog
resident on namenode’s local disk. The FsImage comprisesof
a snapshot of the HDFS metadata. [5] If there is a change to
metadata, it will be appended to the EditLog file. It’s like a
database transactionlog,wherealterationswillnotbeapplied
to the data file immediately, however appended into a log.
When namenode fails for a definite amount of time,
changes are noted in EditLog would be combined with
FsImage, so that namenode could have latest report HDFS
metadata. Next to combining the changes from EditLog to
FsImage, namenode eliminates the old FsImage copy and
exchanges it with fresher one as it haslatestupdatedFsImage
which denotes present state of HDFS and then it starts up
latest EditLog. In HDFS cluster, namenode is the single point
of failure for the reason that if namenode upholds the
namespace and Editlog, if these records are distorted or lost,
entire cluster willgodown.[5]Namenodealsohasbecomethe
performance block of the HDFS cluster because there is only
one active namenode exists in cluster.
SecondaryNameNode: In HDFS the Name Node
experiences single point of failure, to recover the data some
tough additions are to be performed. For the same, it is
recommended to convert the Name Space Image to some
remote NFS mount device, from time to time. The secondary
Name Node is combines the Name Space Image and the Edit
log called checkpoints and then writes it to a file. This
supports the Name Node to free the main memory occupied
by the edit log till the point of last checkpoint. [4] Just in case
of failure, the Hadoop administrator boots up the new Name
Node and the previous data from the remote NFS mount
should be recovered by hand and configure the new node.
JobTracker: JobTracker going on master node [6].
[4]JobTracker proceeding a separate node. Job tracker not
usually on a Data Node and is a part of MapReduce execution
in MRv1.In MRv2 job tracker got replaced by Resource
Manager/Application Master. JobTracker receives the
requests from the client. It then ask the Name Node to
determine the location of the data. JobTracker identifies the
best TaskTracker nodes to execute tasks depending on the
data locality to execute a task on a given node. JobTracker
observers the individual TaskTrackers and presents the
complete status of the job back to the client. JobTracker
process is significant totheHadoopclusterinordertoexecute
MapReduce. When the JobTracker is not working, HDFS will
still be operational howevertheMapReduceexecutioncannot
be started; the existing MapReduce jobs will be stopped.
DataNode: TaskTrackers on slave nodes [6]. Data Nodes
are inexpensive commodity hardware with storage and
processing capacity which are assembled within different
clusters. HDFSshowsafilesystemnamespaceandallowsuser
information to be kept in files. The input file is divided into a
number of files blocks and the blocks are kept in a set of Data
Nodesas per the metadata report from given the NameNode.
The Name Node performs operations like opening, closing,
and renaming files and directories. It also governs the
mapping of blocks to Data Nodes. The Data Nodes helps to
read and write requests from the file system’s clients. The
Data Nodes also execute block creation, deletion, and
replication upon command from the Name Node. [4]When a
Data Node starts up, it scans across its local file system,
creates a listing of all HDFS datablockscorrespondingtoallof
these local files besides sending the report to the NameNode,
which is the Block report. Heartbeat is the signal sent by all
the data nodes regularly to the Name Node for retaining the
relationship.
Task Tracker: TaskTracker runs upon all Data Nodes.
Node Manager replaces it in MapReduce NextGen aka YARN
aka MRv2. The Mapper and Reducer tasks performed, as part
of Map reduce programming,onDataNodesareadministered
by TaskTrackers. TaskTrackers will be allocated Mapper and
Reducer tasks to be accomplished by JobTracker. [4]
TaskTracker will be in continuous interaction with the
JobTracker indicating the advancement of the task in
execution.
III. BLOCKCHAIN
A. Background
Blockchain was initially developed forthecryptocurrency
Bitcoin. It was firstly explained by Satoshi Nakamoto’s in a
2008 whitepaper. Nakamoto addressed the challenge of
International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056
Volume: 05 Issue: 03 | Mar-2018 www.irjet.net p-ISSN: 2395-0072
© 2018, IRJET | Impact Factor value: 6.171 | ISO 9001:2008 Certified Journal | Page 2056
ownership pertaining to digital currencies plus proposed a
possible way out by use of a Blockchain preceding to the
settlement of transactions [8].
All the nodes within thecryptocurrencynetworkmaintain
the blockchain public ledger. The blockchain warehouses all
the transactions that have ever happened in the
cryptocurrency system. [7] Unlike conventional centralized
banking and payment systems, which have the capacity to
control the system, the blockchain network does not rely on
any central trusted authority or agency. Instead, trust is
achieved as an emergent property from the exchanges
between nodes contained by the network. Digital signatures
and Cryptography mathematical models are used for
generating Hash codes for every single transaction. It is
believed that breaking or hacking into any transaction is
highly impossible due to monitoring by all peers in the
system. Transaction won’t be committed unless and until by
peers authorized it [3]. Any peer in the cluster can read or
write transactions.
Special nodes in the [3] Cluster acts as Miners, whose
securely authenticate, validate the transaction with
mathematical models. The first Miner which does the
mathematical cryptographic calculations and authorize the
transaction will get remuneratedwithBitCoincalled“Proofof
Work” illustrated in Figure 2.
Miners: Miners are special nodes competing the
transactions for validation, authentication and providing
instantaneous reply to all other peers to substantiate on
authentication of transaction. It uses Cryptographic
mathematical models to create the necessary Hash Codes
intended forthetransactionsbasedonprecedingtransactions
plus connect the historical linked transactions altogether.[3]
By confirming this “proof of work” it is remunerated.
Figure 2: Blockchain Network Revolution.
B. BlockChain Hashing
Bitcoin uses hash code proof of work algorithm shown in
Figure 3. The transactions are gathered as blocks. All blocks
encompass latest balances of user’s in the network. A
blockchain is established with all the transactions (blocks)
from genesis block. A block header contains fields: Version,
hash of the previous block (hashPrevBlock), root hash of the
merkletreeofalltransactionsintheblock(hashMerkleBlock),
Timestamp, Nonce. The hash code algorithm repeatedly
hashes the block header and raises the counter with nonce
[3].
Figure 3: BlockChain Hashing.
ď‚· Version: The block version number.
ď‚· HashPrevBlock: This field obtain all the
information in all the previous blocks that led to
this one.
ď‚· HashMerkleRoot:Thisfieldmerklehashbasedon
all of the transactions in the block.
ď‚· Timestamps: It is time field when Miners founds
the block.
ď‚· Nonce: This field value initially set to 0 and is
incremented for each hash.
ď‚· Difficulty target: This refers to difficulty of
locating a new block with the help of floating
point value.
With hashcodealgorithm,allthe block headers excluding
the transaction data is hashed to form the block hash. This
generated block hash is used as proof that the other parts of
the header have not been modified, and later this block hash
is used as a reference by the succeeding block in blockchain.
IV. METHODOLOGY
In this HDFS architecture, there is a collection of Data
Nodes also known as HDFS_DN data nodes are made from
cluster of commodity hardware.
By eliminating the Name Node in this architecture, the
design [4] becomes simpler and cheaper. However with this
the metadata storage and management becomes a critical
concern in HDFS design due to lack of the central authority-
the Name Node. HDFS should have the capability to store,
International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056
Volume: 05 Issue: 03 | Mar-2018 www.irjet.net p-ISSN: 2395-0072
© 2018, IRJET | Impact Factor value: 6.171 | ISO 9001:2008 Certified Journal | Page 2057
manage, and access file metadata in reliable manner, and
provide faster access to the metadata storage.
The storage of metadata is accomplished by means of
spreading it throughout the Data Nodes by a master node
called HDFS_MINER (one among the Data Nodes) and is
stored in an encrypted form using the HDFS_Clients’ public
key. HDFS_MINER can be allocated to any Data Nodes
depending on any arbitration mechanism in a master/slave
configuration.
Blockchain technology links the data blocks in different
Data Nodes in a serial manner for easy access. The block
diagram of blockchain is shown in figure 4. Timestamping of
attached blocks ascertains the ordering of data blocks Right
back to what’s called the genesis block, the very first one in
the chain. The data blocks are attached with two hash values.
The previous data block hashis calledas H1. Thecurrentdata
block hash is called as H2. [4]For the first data block, H1 is
from metadata and H2 computed from thecurrentdatablock.
Second block onwards,H1istheprevioushashvalueobtained
from the relevant Data Nodes; H2 is calculated by the local
Data Node itself. The different data blocks of the same file
which are spread over various Data Nodes are connected
together by means of the hash (H1) of the precedingblock.[4]
The current hash value (H2) stored in each block which
preserves data integrity. The distributed consensus is
achieved in blockchain by identifying the proper nonce value
as a proof-of-work.
Figure 4: Blocks in the blockchain.
The following are components of HDFS
1) HDFS_Client: It requests for the storage and executes
integrity check subsequent to retrieval.
2) HDFS_DN: Specialized Data Nodes whose primary goal
is storage of the data blocks and a memory area earmarked
for storing the metadata blocks.
3) HDFS_MINER: One or more specially delegated nodes
from the Data Nodes called as MINER nodes. They help in
metadata creation along with data block storage.
A key feature of this architecture is to store metadata
securely throughout theclusterofDataNodesinanencrypted
form. In the HDFS architecture, the concept of Name Node is
not employed. The simple block diagram of new HDFS
architecture with using blockchain technology is shown in
figure 5.
Figure 5: HDFS architecture with using blockchain
Technology.
Working procedure HDFS architecture with blockchain
technology
1. HDFS_Client requests the Admin to acquire the key pair
for encrypting the corresponding metadata as well as data
blocks, as per requirement.
2. By means of Token passing method or any alternate
Arbitration mechanism, selects one SHDFS_MINER from the
data nodes.
3. On receipt the key pair, HDFS_Client requests the
HDFS_MINER for data storage.
4. Data is first written to a HDFS_MINER when a client is
writing data to an HDFS file. Then HDFS_MINER creates the
metadata .This contains the detailsofDataNodesinwhichthe
data blocks and its replica blocks to be stored .The metadata
will be stored in a temporary storage. Hadoop configuration
specifies the default block size and replication factor. Block
size, the number of replicas, and the replication factor for a
specific file at its creation time are specifiedbyanapplication.
5. Heartbeats, failure report and Block report
• All peer Data Nodes sends a heartbeat signal
mutually to recognize the failurenodes within the cluster.Ifa
particular node doesn’t receive the heart beat within the
International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056
Volume: 05 Issue: 03 | Mar-2018 www.irjet.net p-ISSN: 2395-0072
© 2018, IRJET | Impact Factor value: 6.171 | ISO 9001:2008 Certified Journal | Page 2058
specified time period, peer node will sent a failure report to
the MINER and the MINER will construct the another
metadata.
• The Data Nodes send Block report following
successful storage of the data blocks to the MINER.
6. Once all the block reports are received in MINER, it will
send the replica of metadata encrypted by means of the
HDFS_Client’s public key to all the related Data Nodes.
• The replica management is done by Rack Awareness
standard rules. The replica copies are limited to 3; two
replicas are positioned on one rack and the third one is in a
different rack.
7. Blockchain creation: Each block of the blockchain is
composed of two hash values (H1, H2) and the data fragment
(64MB/128MB). The hash of the previous block stores the
first hash value (H1); second hash (H2) stores the hash
computed from the present data fragment. The first hash
value (H1) is used to attach different data fragments in
different HDFS_DNs.
8. HDFS_Client can read the metadata from HDFS_MINER
stored in encrypted form. This can be decrypted using its
private key provided by admin and reading data from the
corresponding DataNode. After getting the first blockofdata,
it is simple to access the successive blocks within the
blockchain.
9. Data Nodes, HDFS_MINER failure:
• If the Data Node fails, it is detected from the failure
report to the HDFS_MINER and it modifies the metadata.
• If the MINER node fails, another node will act as
HDFS_MINER through arbitration mechanism.
V. CONCLUSION
The current HDFS architecture has limitation from single
point of failure and a contemporary architecture was
suggested for small sized cluster of nodes to ease the trouble
of single point of failure. Also the secondary name node is
utilized to create checkpoints to the remote NFS mounted
device. Both are maintained employing expensive reliable
hardware. In the new architecture both expensive hardware
is being removed and consequently making it a cost effective
strategy. The blockchain stores replicas ofmetadataonarack
by rack basis with one Data Node / rack and cost of
maintaining blockchain. The application of blockchain
technology, there is no point of recovery due to the presence
of several replicas of the metadata being spread throughout
one DataNodeper rack. Hence the Name nodeandsecondary
name nodeis not essentialintheproposedHDFSarchitecture.
REFERENCES
[1] Sheng Kang1, Wei-Ting Kary Chien1, Jun Gang Yang2”
A Study for Big-Data (Hadoop) Application in Semiconductor
Manufacturing”, Proceedings of the 2016 IEEE IEEM, p 1893-
1897. [2] Roscoe Lin, Veda Shankar “Performance Study of
Ceph Storage with Intel Cache Acceleration Software”,2017
IEEE 4th International Conference on Cyber Security and
Cloud Computing, p 10-12.
[3] Lakshmana Kumar Yeddu, Shireesha Yeddu” Hadoop:
Bitcoin-BlockChain - A New Era Needed In Distributed
Computing”, International Journal of Computer Applications
(0975 – 8887) Volume 154 – No.6, November 2016.
[4] Deepa S. Kumar, M. Abdul Rahman” Simplified HDFS
Architecture with Blockchain Distribution of Metadata”,
International Journal of Applied Engineering Research ISSN
0973-4562 Volume 12, Number 21 (2017) pp. 11374-11382.
[5] Zhanye Wang, Dongsheng Wang” NCluster: Using
Multiple Active Namenodes to Achieve High Availability for
HDFS”, 2013 IEEE International Conference on High
Performance Computing and Communications & 2013 IEEE
International Conference on Embedded and Ubiquitous
Computing, p2291-2297. [6] Shiori KURAZUMI , Tomoaki
TSUMURA, Shoichi SAITO, and Hiroshi MATSUO” Dynamic
processing slots scheduling for I/O intensive jobs of Hadoop
MapReduce” IEEE 2012 Third International Conference on
Networking and Computing.
[7] Xiwei Xu, Cesare Pautasso, Liming Zhu, Vincent
Gramoli,AlexanderPonomarev,ShipingChen”TheBlockchain
as a Software Connector”, 2016 13th Working IEEE/IFIP
Conference on Software Architecture.
[8] Matthias Mettler” Blockchain Technology in
Healthcare”, 2016 IEEE 18th International Conference on e-
Health Networking, Applications and Services (Healthcom)
[9]SaharMirzayi,MohammadMehrzad”Bitcoin,AnSWOT
Analysis”, 7th International Conference on Computer and
Knowledge Engineering (ICCKE 2017), October 26-27 2017,
Ferdowsi University of Mashhad,pp 205-510.
[10] Nazri Abdullah, Anne HĂĄkansson, Esmiralda
Moradian” Blockchain based Approach to Enhance Big Data
Authentication in Distributed Environment”, 2017 IEEE,pp
888-892.

More Related Content

What's hot

Big data processing using - Hadoop Technology
Big data processing using - Hadoop TechnologyBig data processing using - Hadoop Technology
Big data processing using - Hadoop TechnologyShital Kat
 
An experimental evaluation of performance
An experimental evaluation of performanceAn experimental evaluation of performance
An experimental evaluation of performanceijcsa
 
MAP REDUCE BASED ON CLOAK DHT DATA REPLICATION EVALUATION
MAP REDUCE BASED ON CLOAK DHT DATA REPLICATION EVALUATIONMAP REDUCE BASED ON CLOAK DHT DATA REPLICATION EVALUATION
MAP REDUCE BASED ON CLOAK DHT DATA REPLICATION EVALUATIONijdms
 
Cloud Computing Ambiance using Secluded Access Control Method
Cloud Computing Ambiance using Secluded Access Control MethodCloud Computing Ambiance using Secluded Access Control Method
Cloud Computing Ambiance using Secluded Access Control MethodIRJET Journal
 
Hadoop by kamran khan
Hadoop by kamran khanHadoop by kamran khan
Hadoop by kamran khanKamranKhan587
 
Implementation of Multi-node Clusters in Column Oriented Database using HDFS
Implementation of Multi-node Clusters in Column Oriented Database using HDFSImplementation of Multi-node Clusters in Column Oriented Database using HDFS
Implementation of Multi-node Clusters in Column Oriented Database using HDFSIJEACS
 
Performance Improvement of Heterogeneous Hadoop Cluster using Ranking Algorithm
Performance Improvement of Heterogeneous Hadoop Cluster using Ranking AlgorithmPerformance Improvement of Heterogeneous Hadoop Cluster using Ranking Algorithm
Performance Improvement of Heterogeneous Hadoop Cluster using Ranking AlgorithmIRJET Journal
 
Harnessing Hadoop: Understanding the Big Data Processing Options for Optimizi...
Harnessing Hadoop: Understanding the Big Data Processing Options for Optimizi...Harnessing Hadoop: Understanding the Big Data Processing Options for Optimizi...
Harnessing Hadoop: Understanding the Big Data Processing Options for Optimizi...Cognizant
 
Big Data: hype or necessity?
Big Data: hype or necessity?Big Data: hype or necessity?
Big Data: hype or necessity?Bart Vandewoestyne
 
A Big-Data Process Consigned Geographically by Employing Mapreduce Frame Work
A Big-Data Process Consigned Geographically by Employing Mapreduce Frame WorkA Big-Data Process Consigned Geographically by Employing Mapreduce Frame Work
A Big-Data Process Consigned Geographically by Employing Mapreduce Frame WorkIRJET Journal
 
Big Data with Hadoop – For Data Management, Processing and Storing
Big Data with Hadoop – For Data Management, Processing and StoringBig Data with Hadoop – For Data Management, Processing and Storing
Big Data with Hadoop – For Data Management, Processing and StoringIRJET Journal
 

What's hot (20)

Big data processing using - Hadoop Technology
Big data processing using - Hadoop TechnologyBig data processing using - Hadoop Technology
Big data processing using - Hadoop Technology
 
Hadoop
HadoopHadoop
Hadoop
 
Srinivasan2-10-12
Srinivasan2-10-12Srinivasan2-10-12
Srinivasan2-10-12
 
An experimental evaluation of performance
An experimental evaluation of performanceAn experimental evaluation of performance
An experimental evaluation of performance
 
MAP REDUCE BASED ON CLOAK DHT DATA REPLICATION EVALUATION
MAP REDUCE BASED ON CLOAK DHT DATA REPLICATION EVALUATIONMAP REDUCE BASED ON CLOAK DHT DATA REPLICATION EVALUATION
MAP REDUCE BASED ON CLOAK DHT DATA REPLICATION EVALUATION
 
Cloud Computing Ambiance using Secluded Access Control Method
Cloud Computing Ambiance using Secluded Access Control MethodCloud Computing Ambiance using Secluded Access Control Method
Cloud Computing Ambiance using Secluded Access Control Method
 
Seminar Report Vaibhav
Seminar Report VaibhavSeminar Report Vaibhav
Seminar Report Vaibhav
 
A data analyst view of Bigdata
A data analyst view of Bigdata A data analyst view of Bigdata
A data analyst view of Bigdata
 
Hadoop by kamran khan
Hadoop by kamran khanHadoop by kamran khan
Hadoop by kamran khan
 
Implementation of Multi-node Clusters in Column Oriented Database using HDFS
Implementation of Multi-node Clusters in Column Oriented Database using HDFSImplementation of Multi-node Clusters in Column Oriented Database using HDFS
Implementation of Multi-node Clusters in Column Oriented Database using HDFS
 
IJARCCE_49
IJARCCE_49IJARCCE_49
IJARCCE_49
 
Big data ppt
Big data pptBig data ppt
Big data ppt
 
Hadoop paper
Hadoop paperHadoop paper
Hadoop paper
 
Performance Improvement of Heterogeneous Hadoop Cluster using Ranking Algorithm
Performance Improvement of Heterogeneous Hadoop Cluster using Ranking AlgorithmPerformance Improvement of Heterogeneous Hadoop Cluster using Ranking Algorithm
Performance Improvement of Heterogeneous Hadoop Cluster using Ranking Algorithm
 
paper
paperpaper
paper
 
Harnessing Hadoop: Understanding the Big Data Processing Options for Optimizi...
Harnessing Hadoop: Understanding the Big Data Processing Options for Optimizi...Harnessing Hadoop: Understanding the Big Data Processing Options for Optimizi...
Harnessing Hadoop: Understanding the Big Data Processing Options for Optimizi...
 
Big Data: hype or necessity?
Big Data: hype or necessity?Big Data: hype or necessity?
Big Data: hype or necessity?
 
A Big-Data Process Consigned Geographically by Employing Mapreduce Frame Work
A Big-Data Process Consigned Geographically by Employing Mapreduce Frame WorkA Big-Data Process Consigned Geographically by Employing Mapreduce Frame Work
A Big-Data Process Consigned Geographically by Employing Mapreduce Frame Work
 
Big Data with Hadoop – For Data Management, Processing and Storing
Big Data with Hadoop – For Data Management, Processing and StoringBig Data with Hadoop – For Data Management, Processing and Storing
Big Data with Hadoop – For Data Management, Processing and Storing
 
Hdf5
Hdf5Hdf5
Hdf5
 

Similar to IRJET- Generate Distributed Metadata using Blockchain Technology within HDFS Environment

Geo distributed parallelization pacts in map reduce
Geo distributed parallelization pacts in map reduceGeo distributed parallelization pacts in map reduce
Geo distributed parallelization pacts in map reduceeSAT Publishing House
 
IRJET- A Study of Comparatively Analysis for HDFS and Google File System ...
IRJET-  	  A Study of Comparatively Analysis for HDFS and Google File System ...IRJET-  	  A Study of Comparatively Analysis for HDFS and Google File System ...
IRJET- A Study of Comparatively Analysis for HDFS and Google File System ...IRJET Journal
 
IRJET- Big Data-A Review Study with Comparitive Analysis of Hadoop
IRJET- Big Data-A Review Study with Comparitive Analysis of HadoopIRJET- Big Data-A Review Study with Comparitive Analysis of Hadoop
IRJET- Big Data-A Review Study with Comparitive Analysis of HadoopIRJET Journal
 
Survey Paper on Big Data and Hadoop
Survey Paper on Big Data and HadoopSurvey Paper on Big Data and Hadoop
Survey Paper on Big Data and HadoopIRJET Journal
 
assignment3
assignment3assignment3
assignment3Kirti J
 
Hadoop File system (HDFS)
Hadoop File system (HDFS)Hadoop File system (HDFS)
Hadoop File system (HDFS)Prashant Gupta
 
Design and Research of Hadoop Distributed Cluster Based on Raspberry
Design and Research of Hadoop Distributed Cluster Based on RaspberryDesign and Research of Hadoop Distributed Cluster Based on Raspberry
Design and Research of Hadoop Distributed Cluster Based on RaspberryIJRESJOURNAL
 
IRJET - A Secure Access Policies based on Data Deduplication System
IRJET - A Secure Access Policies based on Data Deduplication SystemIRJET - A Secure Access Policies based on Data Deduplication System
IRJET - A Secure Access Policies based on Data Deduplication SystemIRJET Journal
 
IRJET - Survey Paper on Map Reduce Processing using HADOOP
IRJET - Survey Paper on Map Reduce Processing using HADOOPIRJET - Survey Paper on Map Reduce Processing using HADOOP
IRJET - Survey Paper on Map Reduce Processing using HADOOPIRJET Journal
 
Performance Enhancement using Appropriate File Formats in Big Data Hadoop Eco...
Performance Enhancement using Appropriate File Formats in Big Data Hadoop Eco...Performance Enhancement using Appropriate File Formats in Big Data Hadoop Eco...
Performance Enhancement using Appropriate File Formats in Big Data Hadoop Eco...IRJET Journal
 
Big Data & Hadoop
Big Data & HadoopBig Data & Hadoop
Big Data & HadoopAnkan Banerjee
 
Introduction to hadoop ecosystem
Introduction to hadoop ecosystem Introduction to hadoop ecosystem
Introduction to hadoop ecosystem Rupak Roy
 
Hadoop file system
Hadoop file systemHadoop file system
Hadoop file systemJohn Veigas
 
IRJET- Performing Load Balancing between Namenodes in HDFS
IRJET- Performing Load Balancing between Namenodes in HDFSIRJET- Performing Load Balancing between Namenodes in HDFS
IRJET- Performing Load Balancing between Namenodes in HDFSIRJET Journal
 
Introduction to Big Data and Hadoop using Local Standalone Mode
Introduction to Big Data and Hadoop using Local Standalone ModeIntroduction to Big Data and Hadoop using Local Standalone Mode
Introduction to Big Data and Hadoop using Local Standalone Modeinventionjournals
 
BDA Mod2@AzDOCUMENTS.in.pdf
BDA Mod2@AzDOCUMENTS.in.pdfBDA Mod2@AzDOCUMENTS.in.pdf
BDA Mod2@AzDOCUMENTS.in.pdfKUMARRISHAV37
 
Vikram Andem Big Data Strategy @ IATA Technology Roadmap
Vikram Andem Big Data Strategy @ IATA Technology Roadmap Vikram Andem Big Data Strategy @ IATA Technology Roadmap
Vikram Andem Big Data Strategy @ IATA Technology Roadmap IT Strategy Group
 

Similar to IRJET- Generate Distributed Metadata using Blockchain Technology within HDFS Environment (20)

Geo distributed parallelization pacts in map reduce
Geo distributed parallelization pacts in map reduceGeo distributed parallelization pacts in map reduce
Geo distributed parallelization pacts in map reduce
 
IRJET- A Study of Comparatively Analysis for HDFS and Google File System ...
IRJET-  	  A Study of Comparatively Analysis for HDFS and Google File System ...IRJET-  	  A Study of Comparatively Analysis for HDFS and Google File System ...
IRJET- A Study of Comparatively Analysis for HDFS and Google File System ...
 
IRJET- Big Data-A Review Study with Comparitive Analysis of Hadoop
IRJET- Big Data-A Review Study with Comparitive Analysis of HadoopIRJET- Big Data-A Review Study with Comparitive Analysis of Hadoop
IRJET- Big Data-A Review Study with Comparitive Analysis of Hadoop
 
Hadoop Cluster Analysis and Assessment
Hadoop Cluster Analysis and AssessmentHadoop Cluster Analysis and Assessment
Hadoop Cluster Analysis and Assessment
 
Survey Paper on Big Data and Hadoop
Survey Paper on Big Data and HadoopSurvey Paper on Big Data and Hadoop
Survey Paper on Big Data and Hadoop
 
assignment3
assignment3assignment3
assignment3
 
Lecture 2 part 1
Lecture 2 part 1Lecture 2 part 1
Lecture 2 part 1
 
G017143640
G017143640G017143640
G017143640
 
Hadoop File system (HDFS)
Hadoop File system (HDFS)Hadoop File system (HDFS)
Hadoop File system (HDFS)
 
Design and Research of Hadoop Distributed Cluster Based on Raspberry
Design and Research of Hadoop Distributed Cluster Based on RaspberryDesign and Research of Hadoop Distributed Cluster Based on Raspberry
Design and Research of Hadoop Distributed Cluster Based on Raspberry
 
IRJET - A Secure Access Policies based on Data Deduplication System
IRJET - A Secure Access Policies based on Data Deduplication SystemIRJET - A Secure Access Policies based on Data Deduplication System
IRJET - A Secure Access Policies based on Data Deduplication System
 
IRJET - Survey Paper on Map Reduce Processing using HADOOP
IRJET - Survey Paper on Map Reduce Processing using HADOOPIRJET - Survey Paper on Map Reduce Processing using HADOOP
IRJET - Survey Paper on Map Reduce Processing using HADOOP
 
Performance Enhancement using Appropriate File Formats in Big Data Hadoop Eco...
Performance Enhancement using Appropriate File Formats in Big Data Hadoop Eco...Performance Enhancement using Appropriate File Formats in Big Data Hadoop Eco...
Performance Enhancement using Appropriate File Formats in Big Data Hadoop Eco...
 
Big Data & Hadoop
Big Data & HadoopBig Data & Hadoop
Big Data & Hadoop
 
Introduction to hadoop ecosystem
Introduction to hadoop ecosystem Introduction to hadoop ecosystem
Introduction to hadoop ecosystem
 
Hadoop file system
Hadoop file systemHadoop file system
Hadoop file system
 
IRJET- Performing Load Balancing between Namenodes in HDFS
IRJET- Performing Load Balancing between Namenodes in HDFSIRJET- Performing Load Balancing between Namenodes in HDFS
IRJET- Performing Load Balancing between Namenodes in HDFS
 
Introduction to Big Data and Hadoop using Local Standalone Mode
Introduction to Big Data and Hadoop using Local Standalone ModeIntroduction to Big Data and Hadoop using Local Standalone Mode
Introduction to Big Data and Hadoop using Local Standalone Mode
 
BDA Mod2@AzDOCUMENTS.in.pdf
BDA Mod2@AzDOCUMENTS.in.pdfBDA Mod2@AzDOCUMENTS.in.pdf
BDA Mod2@AzDOCUMENTS.in.pdf
 
Vikram Andem Big Data Strategy @ IATA Technology Roadmap
Vikram Andem Big Data Strategy @ IATA Technology Roadmap Vikram Andem Big Data Strategy @ IATA Technology Roadmap
Vikram Andem Big Data Strategy @ IATA Technology Roadmap
 

More from IRJET Journal

TUNNELING IN HIMALAYAS WITH NATM METHOD: A SPECIAL REFERENCES TO SUNGAL TUNNE...
TUNNELING IN HIMALAYAS WITH NATM METHOD: A SPECIAL REFERENCES TO SUNGAL TUNNE...TUNNELING IN HIMALAYAS WITH NATM METHOD: A SPECIAL REFERENCES TO SUNGAL TUNNE...
TUNNELING IN HIMALAYAS WITH NATM METHOD: A SPECIAL REFERENCES TO SUNGAL TUNNE...IRJET Journal
 
STUDY THE EFFECT OF RESPONSE REDUCTION FACTOR ON RC FRAMED STRUCTURE
STUDY THE EFFECT OF RESPONSE REDUCTION FACTOR ON RC FRAMED STRUCTURESTUDY THE EFFECT OF RESPONSE REDUCTION FACTOR ON RC FRAMED STRUCTURE
STUDY THE EFFECT OF RESPONSE REDUCTION FACTOR ON RC FRAMED STRUCTUREIRJET Journal
 
A COMPARATIVE ANALYSIS OF RCC ELEMENT OF SLAB WITH STARK STEEL (HYSD STEEL) A...
A COMPARATIVE ANALYSIS OF RCC ELEMENT OF SLAB WITH STARK STEEL (HYSD STEEL) A...A COMPARATIVE ANALYSIS OF RCC ELEMENT OF SLAB WITH STARK STEEL (HYSD STEEL) A...
A COMPARATIVE ANALYSIS OF RCC ELEMENT OF SLAB WITH STARK STEEL (HYSD STEEL) A...IRJET Journal
 
Effect of Camber and Angles of Attack on Airfoil Characteristics
Effect of Camber and Angles of Attack on Airfoil CharacteristicsEffect of Camber and Angles of Attack on Airfoil Characteristics
Effect of Camber and Angles of Attack on Airfoil CharacteristicsIRJET Journal
 
A Review on the Progress and Challenges of Aluminum-Based Metal Matrix Compos...
A Review on the Progress and Challenges of Aluminum-Based Metal Matrix Compos...A Review on the Progress and Challenges of Aluminum-Based Metal Matrix Compos...
A Review on the Progress and Challenges of Aluminum-Based Metal Matrix Compos...IRJET Journal
 
Dynamic Urban Transit Optimization: A Graph Neural Network Approach for Real-...
Dynamic Urban Transit Optimization: A Graph Neural Network Approach for Real-...Dynamic Urban Transit Optimization: A Graph Neural Network Approach for Real-...
Dynamic Urban Transit Optimization: A Graph Neural Network Approach for Real-...IRJET Journal
 
Structural Analysis and Design of Multi-Storey Symmetric and Asymmetric Shape...
Structural Analysis and Design of Multi-Storey Symmetric and Asymmetric Shape...Structural Analysis and Design of Multi-Storey Symmetric and Asymmetric Shape...
Structural Analysis and Design of Multi-Storey Symmetric and Asymmetric Shape...IRJET Journal
 
A Review of “Seismic Response of RC Structures Having Plan and Vertical Irreg...
A Review of “Seismic Response of RC Structures Having Plan and Vertical Irreg...A Review of “Seismic Response of RC Structures Having Plan and Vertical Irreg...
A Review of “Seismic Response of RC Structures Having Plan and Vertical Irreg...IRJET Journal
 
A REVIEW ON MACHINE LEARNING IN ADAS
A REVIEW ON MACHINE LEARNING IN ADASA REVIEW ON MACHINE LEARNING IN ADAS
A REVIEW ON MACHINE LEARNING IN ADASIRJET Journal
 
Long Term Trend Analysis of Precipitation and Temperature for Asosa district,...
Long Term Trend Analysis of Precipitation and Temperature for Asosa district,...Long Term Trend Analysis of Precipitation and Temperature for Asosa district,...
Long Term Trend Analysis of Precipitation and Temperature for Asosa district,...IRJET Journal
 
P.E.B. Framed Structure Design and Analysis Using STAAD Pro
P.E.B. Framed Structure Design and Analysis Using STAAD ProP.E.B. Framed Structure Design and Analysis Using STAAD Pro
P.E.B. Framed Structure Design and Analysis Using STAAD ProIRJET Journal
 
A Review on Innovative Fiber Integration for Enhanced Reinforcement of Concre...
A Review on Innovative Fiber Integration for Enhanced Reinforcement of Concre...A Review on Innovative Fiber Integration for Enhanced Reinforcement of Concre...
A Review on Innovative Fiber Integration for Enhanced Reinforcement of Concre...IRJET Journal
 
Survey Paper on Cloud-Based Secured Healthcare System
Survey Paper on Cloud-Based Secured Healthcare SystemSurvey Paper on Cloud-Based Secured Healthcare System
Survey Paper on Cloud-Based Secured Healthcare SystemIRJET Journal
 
Review on studies and research on widening of existing concrete bridges
Review on studies and research on widening of existing concrete bridgesReview on studies and research on widening of existing concrete bridges
Review on studies and research on widening of existing concrete bridgesIRJET Journal
 
React based fullstack edtech web application
React based fullstack edtech web applicationReact based fullstack edtech web application
React based fullstack edtech web applicationIRJET Journal
 
A Comprehensive Review of Integrating IoT and Blockchain Technologies in the ...
A Comprehensive Review of Integrating IoT and Blockchain Technologies in the ...A Comprehensive Review of Integrating IoT and Blockchain Technologies in the ...
A Comprehensive Review of Integrating IoT and Blockchain Technologies in the ...IRJET Journal
 
A REVIEW ON THE PERFORMANCE OF COCONUT FIBRE REINFORCED CONCRETE.
A REVIEW ON THE PERFORMANCE OF COCONUT FIBRE REINFORCED CONCRETE.A REVIEW ON THE PERFORMANCE OF COCONUT FIBRE REINFORCED CONCRETE.
A REVIEW ON THE PERFORMANCE OF COCONUT FIBRE REINFORCED CONCRETE.IRJET Journal
 
Optimizing Business Management Process Workflows: The Dynamic Influence of Mi...
Optimizing Business Management Process Workflows: The Dynamic Influence of Mi...Optimizing Business Management Process Workflows: The Dynamic Influence of Mi...
Optimizing Business Management Process Workflows: The Dynamic Influence of Mi...IRJET Journal
 
Multistoried and Multi Bay Steel Building Frame by using Seismic Design
Multistoried and Multi Bay Steel Building Frame by using Seismic DesignMultistoried and Multi Bay Steel Building Frame by using Seismic Design
Multistoried and Multi Bay Steel Building Frame by using Seismic DesignIRJET Journal
 
Cost Optimization of Construction Using Plastic Waste as a Sustainable Constr...
Cost Optimization of Construction Using Plastic Waste as a Sustainable Constr...Cost Optimization of Construction Using Plastic Waste as a Sustainable Constr...
Cost Optimization of Construction Using Plastic Waste as a Sustainable Constr...IRJET Journal
 

More from IRJET Journal (20)

TUNNELING IN HIMALAYAS WITH NATM METHOD: A SPECIAL REFERENCES TO SUNGAL TUNNE...
TUNNELING IN HIMALAYAS WITH NATM METHOD: A SPECIAL REFERENCES TO SUNGAL TUNNE...TUNNELING IN HIMALAYAS WITH NATM METHOD: A SPECIAL REFERENCES TO SUNGAL TUNNE...
TUNNELING IN HIMALAYAS WITH NATM METHOD: A SPECIAL REFERENCES TO SUNGAL TUNNE...
 
STUDY THE EFFECT OF RESPONSE REDUCTION FACTOR ON RC FRAMED STRUCTURE
STUDY THE EFFECT OF RESPONSE REDUCTION FACTOR ON RC FRAMED STRUCTURESTUDY THE EFFECT OF RESPONSE REDUCTION FACTOR ON RC FRAMED STRUCTURE
STUDY THE EFFECT OF RESPONSE REDUCTION FACTOR ON RC FRAMED STRUCTURE
 
A COMPARATIVE ANALYSIS OF RCC ELEMENT OF SLAB WITH STARK STEEL (HYSD STEEL) A...
A COMPARATIVE ANALYSIS OF RCC ELEMENT OF SLAB WITH STARK STEEL (HYSD STEEL) A...A COMPARATIVE ANALYSIS OF RCC ELEMENT OF SLAB WITH STARK STEEL (HYSD STEEL) A...
A COMPARATIVE ANALYSIS OF RCC ELEMENT OF SLAB WITH STARK STEEL (HYSD STEEL) A...
 
Effect of Camber and Angles of Attack on Airfoil Characteristics
Effect of Camber and Angles of Attack on Airfoil CharacteristicsEffect of Camber and Angles of Attack on Airfoil Characteristics
Effect of Camber and Angles of Attack on Airfoil Characteristics
 
A Review on the Progress and Challenges of Aluminum-Based Metal Matrix Compos...
A Review on the Progress and Challenges of Aluminum-Based Metal Matrix Compos...A Review on the Progress and Challenges of Aluminum-Based Metal Matrix Compos...
A Review on the Progress and Challenges of Aluminum-Based Metal Matrix Compos...
 
Dynamic Urban Transit Optimization: A Graph Neural Network Approach for Real-...
Dynamic Urban Transit Optimization: A Graph Neural Network Approach for Real-...Dynamic Urban Transit Optimization: A Graph Neural Network Approach for Real-...
Dynamic Urban Transit Optimization: A Graph Neural Network Approach for Real-...
 
Structural Analysis and Design of Multi-Storey Symmetric and Asymmetric Shape...
Structural Analysis and Design of Multi-Storey Symmetric and Asymmetric Shape...Structural Analysis and Design of Multi-Storey Symmetric and Asymmetric Shape...
Structural Analysis and Design of Multi-Storey Symmetric and Asymmetric Shape...
 
A Review of “Seismic Response of RC Structures Having Plan and Vertical Irreg...
A Review of “Seismic Response of RC Structures Having Plan and Vertical Irreg...A Review of “Seismic Response of RC Structures Having Plan and Vertical Irreg...
A Review of “Seismic Response of RC Structures Having Plan and Vertical Irreg...
 
A REVIEW ON MACHINE LEARNING IN ADAS
A REVIEW ON MACHINE LEARNING IN ADASA REVIEW ON MACHINE LEARNING IN ADAS
A REVIEW ON MACHINE LEARNING IN ADAS
 
Long Term Trend Analysis of Precipitation and Temperature for Asosa district,...
Long Term Trend Analysis of Precipitation and Temperature for Asosa district,...Long Term Trend Analysis of Precipitation and Temperature for Asosa district,...
Long Term Trend Analysis of Precipitation and Temperature for Asosa district,...
 
P.E.B. Framed Structure Design and Analysis Using STAAD Pro
P.E.B. Framed Structure Design and Analysis Using STAAD ProP.E.B. Framed Structure Design and Analysis Using STAAD Pro
P.E.B. Framed Structure Design and Analysis Using STAAD Pro
 
A Review on Innovative Fiber Integration for Enhanced Reinforcement of Concre...
A Review on Innovative Fiber Integration for Enhanced Reinforcement of Concre...A Review on Innovative Fiber Integration for Enhanced Reinforcement of Concre...
A Review on Innovative Fiber Integration for Enhanced Reinforcement of Concre...
 
Survey Paper on Cloud-Based Secured Healthcare System
Survey Paper on Cloud-Based Secured Healthcare SystemSurvey Paper on Cloud-Based Secured Healthcare System
Survey Paper on Cloud-Based Secured Healthcare System
 
Review on studies and research on widening of existing concrete bridges
Review on studies and research on widening of existing concrete bridgesReview on studies and research on widening of existing concrete bridges
Review on studies and research on widening of existing concrete bridges
 
React based fullstack edtech web application
React based fullstack edtech web applicationReact based fullstack edtech web application
React based fullstack edtech web application
 
A Comprehensive Review of Integrating IoT and Blockchain Technologies in the ...
A Comprehensive Review of Integrating IoT and Blockchain Technologies in the ...A Comprehensive Review of Integrating IoT and Blockchain Technologies in the ...
A Comprehensive Review of Integrating IoT and Blockchain Technologies in the ...
 
A REVIEW ON THE PERFORMANCE OF COCONUT FIBRE REINFORCED CONCRETE.
A REVIEW ON THE PERFORMANCE OF COCONUT FIBRE REINFORCED CONCRETE.A REVIEW ON THE PERFORMANCE OF COCONUT FIBRE REINFORCED CONCRETE.
A REVIEW ON THE PERFORMANCE OF COCONUT FIBRE REINFORCED CONCRETE.
 
Optimizing Business Management Process Workflows: The Dynamic Influence of Mi...
Optimizing Business Management Process Workflows: The Dynamic Influence of Mi...Optimizing Business Management Process Workflows: The Dynamic Influence of Mi...
Optimizing Business Management Process Workflows: The Dynamic Influence of Mi...
 
Multistoried and Multi Bay Steel Building Frame by using Seismic Design
Multistoried and Multi Bay Steel Building Frame by using Seismic DesignMultistoried and Multi Bay Steel Building Frame by using Seismic Design
Multistoried and Multi Bay Steel Building Frame by using Seismic Design
 
Cost Optimization of Construction Using Plastic Waste as a Sustainable Constr...
Cost Optimization of Construction Using Plastic Waste as a Sustainable Constr...Cost Optimization of Construction Using Plastic Waste as a Sustainable Constr...
Cost Optimization of Construction Using Plastic Waste as a Sustainable Constr...
 

Recently uploaded

HARDNESS, FRACTURE TOUGHNESS AND STRENGTH OF CERAMICS
HARDNESS, FRACTURE TOUGHNESS AND STRENGTH OF CERAMICSHARDNESS, FRACTURE TOUGHNESS AND STRENGTH OF CERAMICS
HARDNESS, FRACTURE TOUGHNESS AND STRENGTH OF CERAMICSRajkumarAkumalla
 
High Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur EscortsHigh Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur Escortsranjana rawat
 
High Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur EscortsHigh Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur EscortsCall Girls in Nagpur High Profile
 
VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130
VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130
VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130Suhani Kapoor
 
SPICE PARK APR2024 ( 6,793 SPICE Models )
SPICE PARK APR2024 ( 6,793 SPICE Models )SPICE PARK APR2024 ( 6,793 SPICE Models )
SPICE PARK APR2024 ( 6,793 SPICE Models )Tsuyoshi Horigome
 
Architect Hassan Khalil Portfolio for 2024
Architect Hassan Khalil Portfolio for 2024Architect Hassan Khalil Portfolio for 2024
Architect Hassan Khalil Portfolio for 2024hassan khalil
 
OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...
OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...
OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...Soham Mondal
 
(MEERA) Dapodi Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Escorts
(MEERA) Dapodi Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Escorts(MEERA) Dapodi Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Escorts
(MEERA) Dapodi Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Escortsranjana rawat
 
chaitra-1.pptx fake news detection using machine learning
chaitra-1.pptx  fake news detection using machine learningchaitra-1.pptx  fake news detection using machine learning
chaitra-1.pptx fake news detection using machine learningmisbanausheenparvam
 
Model Call Girl in Narela Delhi reach out to us at 🔝8264348440🔝
Model Call Girl in Narela Delhi reach out to us at 🔝8264348440🔝Model Call Girl in Narela Delhi reach out to us at 🔝8264348440🔝
Model Call Girl in Narela Delhi reach out to us at 🔝8264348440🔝soniya singh
 
Introduction and different types of Ethernet.pptx
Introduction and different types of Ethernet.pptxIntroduction and different types of Ethernet.pptx
Introduction and different types of Ethernet.pptxupamatechverse
 
College Call Girls Nashik Nehal 7001305949 Independent Escort Service Nashik
College Call Girls Nashik Nehal 7001305949 Independent Escort Service NashikCollege Call Girls Nashik Nehal 7001305949 Independent Escort Service Nashik
College Call Girls Nashik Nehal 7001305949 Independent Escort Service NashikCall Girls in Nagpur High Profile
 
Processing & Properties of Floor and Wall Tiles.pptx
Processing & Properties of Floor and Wall Tiles.pptxProcessing & Properties of Floor and Wall Tiles.pptx
Processing & Properties of Floor and Wall Tiles.pptxpranjaldaimarysona
 
Structural Analysis and Design of Foundations: A Comprehensive Handbook for S...
Structural Analysis and Design of Foundations: A Comprehensive Handbook for S...Structural Analysis and Design of Foundations: A Comprehensive Handbook for S...
Structural Analysis and Design of Foundations: A Comprehensive Handbook for S...Dr.Costas Sachpazis
 
VIP Call Girls Service Kondapur Hyderabad Call +91-8250192130
VIP Call Girls Service Kondapur Hyderabad Call +91-8250192130VIP Call Girls Service Kondapur Hyderabad Call +91-8250192130
VIP Call Girls Service Kondapur Hyderabad Call +91-8250192130Suhani Kapoor
 
What are the advantages and disadvantages of membrane structures.pptx
What are the advantages and disadvantages of membrane structures.pptxWhat are the advantages and disadvantages of membrane structures.pptx
What are the advantages and disadvantages of membrane structures.pptxwendy cai
 
MANUFACTURING PROCESS-II UNIT-5 NC MACHINE TOOLS
MANUFACTURING PROCESS-II UNIT-5 NC MACHINE TOOLSMANUFACTURING PROCESS-II UNIT-5 NC MACHINE TOOLS
MANUFACTURING PROCESS-II UNIT-5 NC MACHINE TOOLSSIVASHANKAR N
 
main PPT.pptx of girls hostel security using rfid
main PPT.pptx of girls hostel security using rfidmain PPT.pptx of girls hostel security using rfid
main PPT.pptx of girls hostel security using rfidNikhilNagaraju
 
The Most Attractive Pune Call Girls Budhwar Peth 8250192130 Will You Miss Thi...
The Most Attractive Pune Call Girls Budhwar Peth 8250192130 Will You Miss Thi...The Most Attractive Pune Call Girls Budhwar Peth 8250192130 Will You Miss Thi...
The Most Attractive Pune Call Girls Budhwar Peth 8250192130 Will You Miss Thi...ranjana rawat
 

Recently uploaded (20)

HARDNESS, FRACTURE TOUGHNESS AND STRENGTH OF CERAMICS
HARDNESS, FRACTURE TOUGHNESS AND STRENGTH OF CERAMICSHARDNESS, FRACTURE TOUGHNESS AND STRENGTH OF CERAMICS
HARDNESS, FRACTURE TOUGHNESS AND STRENGTH OF CERAMICS
 
High Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur EscortsHigh Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur Escorts
 
High Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur EscortsHigh Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur Escorts
 
VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130
VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130
VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130
 
SPICE PARK APR2024 ( 6,793 SPICE Models )
SPICE PARK APR2024 ( 6,793 SPICE Models )SPICE PARK APR2024 ( 6,793 SPICE Models )
SPICE PARK APR2024 ( 6,793 SPICE Models )
 
Architect Hassan Khalil Portfolio for 2024
Architect Hassan Khalil Portfolio for 2024Architect Hassan Khalil Portfolio for 2024
Architect Hassan Khalil Portfolio for 2024
 
OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...
OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...
OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...
 
(MEERA) Dapodi Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Escorts
(MEERA) Dapodi Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Escorts(MEERA) Dapodi Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Escorts
(MEERA) Dapodi Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Escorts
 
chaitra-1.pptx fake news detection using machine learning
chaitra-1.pptx  fake news detection using machine learningchaitra-1.pptx  fake news detection using machine learning
chaitra-1.pptx fake news detection using machine learning
 
Model Call Girl in Narela Delhi reach out to us at 🔝8264348440🔝
Model Call Girl in Narela Delhi reach out to us at 🔝8264348440🔝Model Call Girl in Narela Delhi reach out to us at 🔝8264348440🔝
Model Call Girl in Narela Delhi reach out to us at 🔝8264348440🔝
 
Introduction and different types of Ethernet.pptx
Introduction and different types of Ethernet.pptxIntroduction and different types of Ethernet.pptx
Introduction and different types of Ethernet.pptx
 
College Call Girls Nashik Nehal 7001305949 Independent Escort Service Nashik
College Call Girls Nashik Nehal 7001305949 Independent Escort Service NashikCollege Call Girls Nashik Nehal 7001305949 Independent Escort Service Nashik
College Call Girls Nashik Nehal 7001305949 Independent Escort Service Nashik
 
Roadmap to Membership of RICS - Pathways and Routes
Roadmap to Membership of RICS - Pathways and RoutesRoadmap to Membership of RICS - Pathways and Routes
Roadmap to Membership of RICS - Pathways and Routes
 
Processing & Properties of Floor and Wall Tiles.pptx
Processing & Properties of Floor and Wall Tiles.pptxProcessing & Properties of Floor and Wall Tiles.pptx
Processing & Properties of Floor and Wall Tiles.pptx
 
Structural Analysis and Design of Foundations: A Comprehensive Handbook for S...
Structural Analysis and Design of Foundations: A Comprehensive Handbook for S...Structural Analysis and Design of Foundations: A Comprehensive Handbook for S...
Structural Analysis and Design of Foundations: A Comprehensive Handbook for S...
 
VIP Call Girls Service Kondapur Hyderabad Call +91-8250192130
VIP Call Girls Service Kondapur Hyderabad Call +91-8250192130VIP Call Girls Service Kondapur Hyderabad Call +91-8250192130
VIP Call Girls Service Kondapur Hyderabad Call +91-8250192130
 
What are the advantages and disadvantages of membrane structures.pptx
What are the advantages and disadvantages of membrane structures.pptxWhat are the advantages and disadvantages of membrane structures.pptx
What are the advantages and disadvantages of membrane structures.pptx
 
MANUFACTURING PROCESS-II UNIT-5 NC MACHINE TOOLS
MANUFACTURING PROCESS-II UNIT-5 NC MACHINE TOOLSMANUFACTURING PROCESS-II UNIT-5 NC MACHINE TOOLS
MANUFACTURING PROCESS-II UNIT-5 NC MACHINE TOOLS
 
main PPT.pptx of girls hostel security using rfid
main PPT.pptx of girls hostel security using rfidmain PPT.pptx of girls hostel security using rfid
main PPT.pptx of girls hostel security using rfid
 
The Most Attractive Pune Call Girls Budhwar Peth 8250192130 Will You Miss Thi...
The Most Attractive Pune Call Girls Budhwar Peth 8250192130 Will You Miss Thi...The Most Attractive Pune Call Girls Budhwar Peth 8250192130 Will You Miss Thi...
The Most Attractive Pune Call Girls Budhwar Peth 8250192130 Will You Miss Thi...
 

IRJET- Generate Distributed Metadata using Blockchain Technology within HDFS Environment

  • 1. International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056 Volume: 05 Issue: 03 | Mar-2018 www.irjet.net p-ISSN: 2395-0072 © 2018, IRJET | Impact Factor value: 6.171 | ISO 9001:2008 Certified Journal | Page 2054 Generate Distributed Metadata using Blockchain Technology within HDFS Environment Shilpa K. Sangode1, Harish K. Barapatre2 1,2 Department of Computer Engineering, Yadavrao Tasgaonkar Institute of Engineering and Technology Bhivpuri Road, Karjat ---------------------------------------------------------------------***---------------------------------------------------------------------- Abstract - Big Data processing and analyzing is very crucial across various industries. Rapidly increasing data volume and multiple sources of data generation it has created a challenge to analyze the huge amount of data. Traditional relational database management system RDBMS cannotprocesstheload of data generated. Hadoop Distributed File System (HDFS) is a distributed and enormously parallel processing system suited for big data whereby the storage is created on the distributed file system. Single point of failure is one of the key challenges due to the existence only one single Name Node in the existing HDFS architecture. Metadata summarizes basic information about the distribution of data blocks, block size information and replica management. Newer HDFS architecture for big data storage eliminates the concept of Name Node with the functionalities of the Name Node being stores the metadata using blockchain technology. mathematical cryptographic techniques like hashcode with the help of BlockChain Miners. Keywords—Big data; Hadoop Distributed File System; Blockchain; Hashcode; Metadeta; MINER, HDFC architecture, Single point of failure. I. INTRODUCTION With the fast expansion of semiconductor industry, the manufacturing process became more multifaceted and meticulous. With the advanced nature of technologies and machines data processing became more precise and huge scale. The five facets of the big data are Volume (size ofdata), Velocity (swiftness of data), Variety (analysis of streaming data), Veracity (data uncertainty) and Value (data quantification) [1]. Day by day the data set is growing. Conventional data processing application is not enough to dispense desired analysis and usability. Nature of data may be unstructured, semi-structured or structured. Majority of the relational database management systems (RDBMS) are process only structured data. Data are being generated and collected from variety of standalone sources in different formats. Post generation of the data it is collated together to analyze and to create insights for givenproblems.Businesses and institutes use intelligence to forecast future trends, for improving decision making. These insights are a valuable asset in current market. Hadoop was created to lessen the time and efforts required to query the data of numerous data formats. Apache Hadoop is an open-source software framework. It’s one of the main application is in distributed storage in Hadoop Distributed File System (HDFS); and big data set processing by means of MapReduce programming model. It contains computer clusters madeupofcommodityhardware [2]. HDFS is formed of a Name Node and a cluster of Data Nodes. HDFS is must for storage and processing of terabytes or petabytes of data; especially when hard disk capacity is limited on local machines. The client data is split in multiple data nodes in a cluster which uses physical hard disk (commodity hardware’s) in the nodes.Metadata containsthe information about the block size, distribution of data,replica maintenance. In latest versions of Map Reduce framework it is a layer configured with block size either 64MB (default) or 128MB placed on top of prevailing operating system [3] [4]. With possible hypothesis of a HDFS architecture for big data storage which will eliminate the model of Name Node with the functionalities of the Name Node storing the metadata by means of blockchain technology. The vital role of Name Node is generating and recording metadata including the information about the position ofblocksstored the size of the files, hierarchy and permissions. Additional tasks include block report from Data Nodes, maintaining and managing Data Nodes, reception of heartbeat using the blockchain technology. Blockchain stores the metadata in encrypted format. Bitcoin, the digital crypto currency uses Blockchain technology. It keeps the metadata information in a distributive fashion. Particularly, business transactions authorizations with particular mathematical cryptographic methods such as hashcode with theaidofBlockChainMiners. II. ELEMENTS OF HDFS DESIGN The Hadoop Distributed File System is created to gather a mammoth volume of data in severalservers/clusters. Datain a Hadoop cluster is fragmented in small blocks (default is 64MB) and then dispensed throughout the clusters. Rack aware,a block placement algorithm, defines the placementof the blocks based on the replication factor, which is mostly 3 by default. The master slave configuration decides the working principle of HDFS. A master service comprises of NameNode, secondary Name Node and job tracker besides the slave services involving Data Node and task tracker. Master node and slave nodes are capable of communicating with each other. The slave nodes talk with each other. The job tracker can converse with the Task Tracker as illustrated in figure 1.
  • 2. International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056 Volume: 05 Issue: 03 | Mar-2018 www.irjet.net p-ISSN: 2395-0072 © 2018, IRJET | Impact Factor value: 6.171 | ISO 9001:2008 Certified Journal | Page 2055 Figure 1: Elements of HDFS architecture. NameNode: An HDFS cluster consists of an only one NameNode and it is master node. It is responsible for storage of the Meta data of files and directories. It preserves the file system tree and the metadata for all the files and directories in the tree. [3] The NameNode contains information about all the blocks of fragmented files in a cluster. HDFS clients constantly sends the updates of metadata to namenode; when they release the write operations such as create, rename, delete, mkdir, etc. Datanodes send block report to namenode cyclically to notify it whether there are corrupted data block or not. All these modifications in metadata are stored in files, known as FsImage and EditLog resident on namenode’s local disk. The FsImage comprisesof a snapshot of the HDFS metadata. [5] If there is a change to metadata, it will be appended to the EditLog file. It’s like a database transactionlog,wherealterationswillnotbeapplied to the data file immediately, however appended into a log. When namenode fails for a definite amount of time, changes are noted in EditLog would be combined with FsImage, so that namenode could have latest report HDFS metadata. Next to combining the changes from EditLog to FsImage, namenode eliminates the old FsImage copy and exchanges it with fresher one as it haslatestupdatedFsImage which denotes present state of HDFS and then it starts up latest EditLog. In HDFS cluster, namenode is the single point of failure for the reason that if namenode upholds the namespace and Editlog, if these records are distorted or lost, entire cluster willgodown.[5]Namenodealsohasbecomethe performance block of the HDFS cluster because there is only one active namenode exists in cluster. SecondaryNameNode: In HDFS the Name Node experiences single point of failure, to recover the data some tough additions are to be performed. For the same, it is recommended to convert the Name Space Image to some remote NFS mount device, from time to time. The secondary Name Node is combines the Name Space Image and the Edit log called checkpoints and then writes it to a file. This supports the Name Node to free the main memory occupied by the edit log till the point of last checkpoint. [4] Just in case of failure, the Hadoop administrator boots up the new Name Node and the previous data from the remote NFS mount should be recovered by hand and configure the new node. JobTracker: JobTracker going on master node [6]. [4]JobTracker proceeding a separate node. Job tracker not usually on a Data Node and is a part of MapReduce execution in MRv1.In MRv2 job tracker got replaced by Resource Manager/Application Master. JobTracker receives the requests from the client. It then ask the Name Node to determine the location of the data. JobTracker identifies the best TaskTracker nodes to execute tasks depending on the data locality to execute a task on a given node. JobTracker observers the individual TaskTrackers and presents the complete status of the job back to the client. JobTracker process is significant totheHadoopclusterinordertoexecute MapReduce. When the JobTracker is not working, HDFS will still be operational howevertheMapReduceexecutioncannot be started; the existing MapReduce jobs will be stopped. DataNode: TaskTrackers on slave nodes [6]. Data Nodes are inexpensive commodity hardware with storage and processing capacity which are assembled within different clusters. HDFSshowsafilesystemnamespaceandallowsuser information to be kept in files. The input file is divided into a number of files blocks and the blocks are kept in a set of Data Nodesas per the metadata report from given the NameNode. The Name Node performs operations like opening, closing, and renaming files and directories. It also governs the mapping of blocks to Data Nodes. The Data Nodes helps to read and write requests from the file system’s clients. The Data Nodes also execute block creation, deletion, and replication upon command from the Name Node. [4]When a Data Node starts up, it scans across its local file system, creates a listing of all HDFS datablockscorrespondingtoallof these local files besides sending the report to the NameNode, which is the Block report. Heartbeat is the signal sent by all the data nodes regularly to the Name Node for retaining the relationship. Task Tracker: TaskTracker runs upon all Data Nodes. Node Manager replaces it in MapReduce NextGen aka YARN aka MRv2. The Mapper and Reducer tasks performed, as part of Map reduce programming,onDataNodesareadministered by TaskTrackers. TaskTrackers will be allocated Mapper and Reducer tasks to be accomplished by JobTracker. [4] TaskTracker will be in continuous interaction with the JobTracker indicating the advancement of the task in execution. III. BLOCKCHAIN A. Background Blockchain was initially developed forthecryptocurrency Bitcoin. It was firstly explained by Satoshi Nakamoto’s in a 2008 whitepaper. Nakamoto addressed the challenge of
  • 3. International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056 Volume: 05 Issue: 03 | Mar-2018 www.irjet.net p-ISSN: 2395-0072 © 2018, IRJET | Impact Factor value: 6.171 | ISO 9001:2008 Certified Journal | Page 2056 ownership pertaining to digital currencies plus proposed a possible way out by use of a Blockchain preceding to the settlement of transactions [8]. All the nodes within thecryptocurrencynetworkmaintain the blockchain public ledger. The blockchain warehouses all the transactions that have ever happened in the cryptocurrency system. [7] Unlike conventional centralized banking and payment systems, which have the capacity to control the system, the blockchain network does not rely on any central trusted authority or agency. Instead, trust is achieved as an emergent property from the exchanges between nodes contained by the network. Digital signatures and Cryptography mathematical models are used for generating Hash codes for every single transaction. It is believed that breaking or hacking into any transaction is highly impossible due to monitoring by all peers in the system. Transaction won’t be committed unless and until by peers authorized it [3]. Any peer in the cluster can read or write transactions. Special nodes in the [3] Cluster acts as Miners, whose securely authenticate, validate the transaction with mathematical models. The first Miner which does the mathematical cryptographic calculations and authorize the transaction will get remuneratedwithBitCoincalled“Proofof Work” illustrated in Figure 2. Miners: Miners are special nodes competing the transactions for validation, authentication and providing instantaneous reply to all other peers to substantiate on authentication of transaction. It uses Cryptographic mathematical models to create the necessary Hash Codes intended forthetransactionsbasedonprecedingtransactions plus connect the historical linked transactions altogether.[3] By confirming this “proof of work” it is remunerated. Figure 2: Blockchain Network Revolution. B. BlockChain Hashing Bitcoin uses hash code proof of work algorithm shown in Figure 3. The transactions are gathered as blocks. All blocks encompass latest balances of user’s in the network. A blockchain is established with all the transactions (blocks) from genesis block. A block header contains fields: Version, hash of the previous block (hashPrevBlock), root hash of the merkletreeofalltransactionsintheblock(hashMerkleBlock), Timestamp, Nonce. The hash code algorithm repeatedly hashes the block header and raises the counter with nonce [3]. Figure 3: BlockChain Hashing. ď‚· Version: The block version number. ď‚· HashPrevBlock: This field obtain all the information in all the previous blocks that led to this one. ď‚· HashMerkleRoot:Thisfieldmerklehashbasedon all of the transactions in the block. ď‚· Timestamps: It is time field when Miners founds the block. ď‚· Nonce: This field value initially set to 0 and is incremented for each hash. ď‚· Difficulty target: This refers to difficulty of locating a new block with the help of floating point value. With hashcodealgorithm,allthe block headers excluding the transaction data is hashed to form the block hash. This generated block hash is used as proof that the other parts of the header have not been modified, and later this block hash is used as a reference by the succeeding block in blockchain. IV. METHODOLOGY In this HDFS architecture, there is a collection of Data Nodes also known as HDFS_DN data nodes are made from cluster of commodity hardware. By eliminating the Name Node in this architecture, the design [4] becomes simpler and cheaper. However with this the metadata storage and management becomes a critical concern in HDFS design due to lack of the central authority- the Name Node. HDFS should have the capability to store,
  • 4. International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056 Volume: 05 Issue: 03 | Mar-2018 www.irjet.net p-ISSN: 2395-0072 © 2018, IRJET | Impact Factor value: 6.171 | ISO 9001:2008 Certified Journal | Page 2057 manage, and access file metadata in reliable manner, and provide faster access to the metadata storage. The storage of metadata is accomplished by means of spreading it throughout the Data Nodes by a master node called HDFS_MINER (one among the Data Nodes) and is stored in an encrypted form using the HDFS_Clients’ public key. HDFS_MINER can be allocated to any Data Nodes depending on any arbitration mechanism in a master/slave configuration. Blockchain technology links the data blocks in different Data Nodes in a serial manner for easy access. The block diagram of blockchain is shown in figure 4. Timestamping of attached blocks ascertains the ordering of data blocks Right back to what’s called the genesis block, the very first one in the chain. The data blocks are attached with two hash values. The previous data block hashis calledas H1. Thecurrentdata block hash is called as H2. [4]For the first data block, H1 is from metadata and H2 computed from thecurrentdatablock. Second block onwards,H1istheprevioushashvalueobtained from the relevant Data Nodes; H2 is calculated by the local Data Node itself. The different data blocks of the same file which are spread over various Data Nodes are connected together by means of the hash (H1) of the precedingblock.[4] The current hash value (H2) stored in each block which preserves data integrity. The distributed consensus is achieved in blockchain by identifying the proper nonce value as a proof-of-work. Figure 4: Blocks in the blockchain. The following are components of HDFS 1) HDFS_Client: It requests for the storage and executes integrity check subsequent to retrieval. 2) HDFS_DN: Specialized Data Nodes whose primary goal is storage of the data blocks and a memory area earmarked for storing the metadata blocks. 3) HDFS_MINER: One or more specially delegated nodes from the Data Nodes called as MINER nodes. They help in metadata creation along with data block storage. A key feature of this architecture is to store metadata securely throughout theclusterofDataNodesinanencrypted form. In the HDFS architecture, the concept of Name Node is not employed. The simple block diagram of new HDFS architecture with using blockchain technology is shown in figure 5. Figure 5: HDFS architecture with using blockchain Technology. Working procedure HDFS architecture with blockchain technology 1. HDFS_Client requests the Admin to acquire the key pair for encrypting the corresponding metadata as well as data blocks, as per requirement. 2. By means of Token passing method or any alternate Arbitration mechanism, selects one SHDFS_MINER from the data nodes. 3. On receipt the key pair, HDFS_Client requests the HDFS_MINER for data storage. 4. Data is first written to a HDFS_MINER when a client is writing data to an HDFS file. Then HDFS_MINER creates the metadata .This contains the detailsofDataNodesinwhichthe data blocks and its replica blocks to be stored .The metadata will be stored in a temporary storage. Hadoop configuration specifies the default block size and replication factor. Block size, the number of replicas, and the replication factor for a specific file at its creation time are specifiedbyanapplication. 5. Heartbeats, failure report and Block report • All peer Data Nodes sends a heartbeat signal mutually to recognize the failurenodes within the cluster.Ifa particular node doesn’t receive the heart beat within the
  • 5. International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056 Volume: 05 Issue: 03 | Mar-2018 www.irjet.net p-ISSN: 2395-0072 © 2018, IRJET | Impact Factor value: 6.171 | ISO 9001:2008 Certified Journal | Page 2058 specified time period, peer node will sent a failure report to the MINER and the MINER will construct the another metadata. • The Data Nodes send Block report following successful storage of the data blocks to the MINER. 6. Once all the block reports are received in MINER, it will send the replica of metadata encrypted by means of the HDFS_Client’s public key to all the related Data Nodes. • The replica management is done by Rack Awareness standard rules. The replica copies are limited to 3; two replicas are positioned on one rack and the third one is in a different rack. 7. Blockchain creation: Each block of the blockchain is composed of two hash values (H1, H2) and the data fragment (64MB/128MB). The hash of the previous block stores the first hash value (H1); second hash (H2) stores the hash computed from the present data fragment. The first hash value (H1) is used to attach different data fragments in different HDFS_DNs. 8. HDFS_Client can read the metadata from HDFS_MINER stored in encrypted form. This can be decrypted using its private key provided by admin and reading data from the corresponding DataNode. After getting the first blockofdata, it is simple to access the successive blocks within the blockchain. 9. Data Nodes, HDFS_MINER failure: • If the Data Node fails, it is detected from the failure report to the HDFS_MINER and it modifies the metadata. • If the MINER node fails, another node will act as HDFS_MINER through arbitration mechanism. V. CONCLUSION The current HDFS architecture has limitation from single point of failure and a contemporary architecture was suggested for small sized cluster of nodes to ease the trouble of single point of failure. Also the secondary name node is utilized to create checkpoints to the remote NFS mounted device. Both are maintained employing expensive reliable hardware. In the new architecture both expensive hardware is being removed and consequently making it a cost effective strategy. The blockchain stores replicas ofmetadataonarack by rack basis with one Data Node / rack and cost of maintaining blockchain. The application of blockchain technology, there is no point of recovery due to the presence of several replicas of the metadata being spread throughout one DataNodeper rack. Hence the Name nodeandsecondary name nodeis not essentialintheproposedHDFSarchitecture. REFERENCES [1] Sheng Kang1, Wei-Ting Kary Chien1, Jun Gang Yang2” A Study for Big-Data (Hadoop) Application in Semiconductor Manufacturing”, Proceedings of the 2016 IEEE IEEM, p 1893- 1897. [2] Roscoe Lin, Veda Shankar “Performance Study of Ceph Storage with Intel Cache Acceleration Software”,2017 IEEE 4th International Conference on Cyber Security and Cloud Computing, p 10-12. [3] Lakshmana Kumar Yeddu, Shireesha Yeddu” Hadoop: Bitcoin-BlockChain - A New Era Needed In Distributed Computing”, International Journal of Computer Applications (0975 – 8887) Volume 154 – No.6, November 2016. [4] Deepa S. Kumar, M. Abdul Rahman” Simplified HDFS Architecture with Blockchain Distribution of Metadata”, International Journal of Applied Engineering Research ISSN 0973-4562 Volume 12, Number 21 (2017) pp. 11374-11382. [5] Zhanye Wang, Dongsheng Wang” NCluster: Using Multiple Active Namenodes to Achieve High Availability for HDFS”, 2013 IEEE International Conference on High Performance Computing and Communications & 2013 IEEE International Conference on Embedded and Ubiquitous Computing, p2291-2297. [6] Shiori KURAZUMI , Tomoaki TSUMURA, Shoichi SAITO, and Hiroshi MATSUO” Dynamic processing slots scheduling for I/O intensive jobs of Hadoop MapReduce” IEEE 2012 Third International Conference on Networking and Computing. [7] Xiwei Xu, Cesare Pautasso, Liming Zhu, Vincent Gramoli,AlexanderPonomarev,ShipingChen”TheBlockchain as a Software Connector”, 2016 13th Working IEEE/IFIP Conference on Software Architecture. [8] Matthias Mettler” Blockchain Technology in Healthcare”, 2016 IEEE 18th International Conference on e- Health Networking, Applications and Services (Healthcom) [9]SaharMirzayi,MohammadMehrzad”Bitcoin,AnSWOT Analysis”, 7th International Conference on Computer and Knowledge Engineering (ICCKE 2017), October 26-27 2017, Ferdowsi University of Mashhad,pp 205-510. [10] Nazri Abdullah, Anne HĂĄkansson, Esmiralda Moradian” Blockchain based Approach to Enhance Big Data Authentication in Distributed Environment”, 2017 IEEE,pp 888-892.