Submit Search
Upload
IRJET - Survey Paper on Map Reduce Processing using HADOOP
•
0 likes
•
21 views
IRJET Journal
Follow
https://www.irjet.net/archives/V7/i3/IRJET-V7I3982.pdf
Read less
Read more
Engineering
Report
Share
Report
Share
1 of 4
Download now
Download to read offline
Recommended
Seminar Report Vaibhav
Seminar Report Vaibhav
Vaibhav Dhattarwal
Big data Presentation
Big data Presentation
himanshu arora
Big data
Big data
Abilash Mavila
Big Data with Hadoop – For Data Management, Processing and Storing
Big Data with Hadoop – For Data Management, Processing and Storing
IRJET Journal
Big Data Processing with Hadoop : A Review
Big Data Processing with Hadoop : A Review
IRJET Journal
Hadoop Seminar Report
Hadoop Seminar Report
Bhushan Kulkarni
A Comprehensive Study on Big Data Applications and Challenges
A Comprehensive Study on Big Data Applications and Challenges
ijcisjournal
A data analyst view of Bigdata
A data analyst view of Bigdata
Venkata Reddy Konasani
Recommended
Seminar Report Vaibhav
Seminar Report Vaibhav
Vaibhav Dhattarwal
Big data Presentation
Big data Presentation
himanshu arora
Big data
Big data
Abilash Mavila
Big Data with Hadoop – For Data Management, Processing and Storing
Big Data with Hadoop – For Data Management, Processing and Storing
IRJET Journal
Big Data Processing with Hadoop : A Review
Big Data Processing with Hadoop : A Review
IRJET Journal
Hadoop Seminar Report
Hadoop Seminar Report
Bhushan Kulkarni
A Comprehensive Study on Big Data Applications and Challenges
A Comprehensive Study on Big Data Applications and Challenges
ijcisjournal
A data analyst view of Bigdata
A data analyst view of Bigdata
Venkata Reddy Konasani
Unstructured Datasets Analysis: Thesaurus Model
Unstructured Datasets Analysis: Thesaurus Model
Editor IJCATR
10 Popular Hadoop Technical Interview Questions
10 Popular Hadoop Technical Interview Questions
ZaranTech LLC
Hadoop
Hadoop
RittikaBaksi
Hadoop info
Hadoop info
Nikita Sure
Big data and hadoop
Big data and hadoop
Chanchal Tripathi
BIGDATA- Survey on Scheduling Methods in Hadoop MapReduce
BIGDATA- Survey on Scheduling Methods in Hadoop MapReduce
Mahantesh Angadi
Hadoop
Hadoop
Anil Reddy
Hadoop MapReduce Framework
Hadoop MapReduce Framework
Edureka!
Big Data Concepts
Big Data Concepts
Ahmed Salman
Hadoop(Term Paper)
Hadoop(Term Paper)
Dux Chandegra
IRJET- Systematic Review: Progression Study on BIG DATA articles
IRJET- Systematic Review: Progression Study on BIG DATA articles
IRJET Journal
Big data
Big data
revathireddyb
Big Data Analysis and Its Scheduling Policy – Hadoop
Big Data Analysis and Its Scheduling Policy – Hadoop
IOSR Journals
Introduction to Big data & Hadoop -I
Introduction to Big data & Hadoop -I
Edureka!
Seminar_Report_hadoop
Seminar_Report_hadoop
Varun Narang
Twitter word frequency count using hadoop components 150331221753
Twitter word frequency count using hadoop components 150331221753
pradip patel
hive lab
hive lab
marwa baich
Big data and hadoop
Big data and hadoop
AshishRathore72
Survey Paper on Big Data and Hadoop
Survey Paper on Big Data and Hadoop
IRJET Journal
Social Media Market Trender with Dache Manager Using Hadoop and Visualization...
Social Media Market Trender with Dache Manager Using Hadoop and Visualization...
IRJET Journal
Big Data
Big Data
Kirubaburi R
Big data
Big data
revathireddyb
More Related Content
What's hot
Unstructured Datasets Analysis: Thesaurus Model
Unstructured Datasets Analysis: Thesaurus Model
Editor IJCATR
10 Popular Hadoop Technical Interview Questions
10 Popular Hadoop Technical Interview Questions
ZaranTech LLC
Hadoop
Hadoop
RittikaBaksi
Hadoop info
Hadoop info
Nikita Sure
Big data and hadoop
Big data and hadoop
Chanchal Tripathi
BIGDATA- Survey on Scheduling Methods in Hadoop MapReduce
BIGDATA- Survey on Scheduling Methods in Hadoop MapReduce
Mahantesh Angadi
Hadoop
Hadoop
Anil Reddy
Hadoop MapReduce Framework
Hadoop MapReduce Framework
Edureka!
Big Data Concepts
Big Data Concepts
Ahmed Salman
Hadoop(Term Paper)
Hadoop(Term Paper)
Dux Chandegra
IRJET- Systematic Review: Progression Study on BIG DATA articles
IRJET- Systematic Review: Progression Study on BIG DATA articles
IRJET Journal
Big data
Big data
revathireddyb
Big Data Analysis and Its Scheduling Policy – Hadoop
Big Data Analysis and Its Scheduling Policy – Hadoop
IOSR Journals
Introduction to Big data & Hadoop -I
Introduction to Big data & Hadoop -I
Edureka!
Seminar_Report_hadoop
Seminar_Report_hadoop
Varun Narang
Twitter word frequency count using hadoop components 150331221753
Twitter word frequency count using hadoop components 150331221753
pradip patel
hive lab
hive lab
marwa baich
Big data and hadoop
Big data and hadoop
AshishRathore72
What's hot
(18)
Unstructured Datasets Analysis: Thesaurus Model
Unstructured Datasets Analysis: Thesaurus Model
10 Popular Hadoop Technical Interview Questions
10 Popular Hadoop Technical Interview Questions
Hadoop
Hadoop
Hadoop info
Hadoop info
Big data and hadoop
Big data and hadoop
BIGDATA- Survey on Scheduling Methods in Hadoop MapReduce
BIGDATA- Survey on Scheduling Methods in Hadoop MapReduce
Hadoop
Hadoop
Hadoop MapReduce Framework
Hadoop MapReduce Framework
Big Data Concepts
Big Data Concepts
Hadoop(Term Paper)
Hadoop(Term Paper)
IRJET- Systematic Review: Progression Study on BIG DATA articles
IRJET- Systematic Review: Progression Study on BIG DATA articles
Big data
Big data
Big Data Analysis and Its Scheduling Policy – Hadoop
Big Data Analysis and Its Scheduling Policy – Hadoop
Introduction to Big data & Hadoop -I
Introduction to Big data & Hadoop -I
Seminar_Report_hadoop
Seminar_Report_hadoop
Twitter word frequency count using hadoop components 150331221753
Twitter word frequency count using hadoop components 150331221753
hive lab
hive lab
Big data and hadoop
Big data and hadoop
Similar to IRJET - Survey Paper on Map Reduce Processing using HADOOP
Survey Paper on Big Data and Hadoop
Survey Paper on Big Data and Hadoop
IRJET Journal
Social Media Market Trender with Dache Manager Using Hadoop and Visualization...
Social Media Market Trender with Dache Manager Using Hadoop and Visualization...
IRJET Journal
Big Data
Big Data
Kirubaburi R
Big data
Big data
revathireddyb
IRJET - A Prognosis Approach for Stock Market Prediction based on Term Streak...
IRJET - A Prognosis Approach for Stock Market Prediction based on Term Streak...
IRJET Journal
SURVEY ON BIG DATA PROCESSING USING HADOOP, MAP REDUCE
SURVEY ON BIG DATA PROCESSING USING HADOOP, MAP REDUCE
AM Publications,India
IJET-V2I6P25
IJET-V2I6P25
IJET - International Journal of Engineering and Techniques
Map reduce advantages over parallel databases report
Map reduce advantages over parallel databases report
Ahmad El Tawil
hadoop seminar training report
hadoop seminar training report
Sarvesh Meena
A Survey on Big Data Analysis Techniques
A Survey on Big Data Analysis Techniques
ijsrd.com
Big data and Hadoop overview
Big data and Hadoop overview
Nitesh Ghosh
Twitter word frequency count using hadoop components 150331221753
Twitter word frequency count using hadoop components 150331221753
pradip patel
Hadoop and Big Data Analytics | Sysfore
Hadoop and Big Data Analytics | Sysfore
Sysfore Technologies
G017143640
G017143640
IOSR Journals
TSE_Pres12.pptx
TSE_Pres12.pptx
ssuseracaaae2
Big Data & Hadoop
Big Data & Hadoop
Krishna Sujeer
Hadoop
Hadoop
Veera Sundari
Big data with hadoop
Big data with hadoop
Anusha sweety
Big data: Descoberta de conhecimento em ambientes de big data e computação na...
Big data: Descoberta de conhecimento em ambientes de big data e computação na...
Rio Info
Big Data Testing Using Hadoop Platform
Big Data Testing Using Hadoop Platform
IRJET Journal
Similar to IRJET - Survey Paper on Map Reduce Processing using HADOOP
(20)
Survey Paper on Big Data and Hadoop
Survey Paper on Big Data and Hadoop
Social Media Market Trender with Dache Manager Using Hadoop and Visualization...
Social Media Market Trender with Dache Manager Using Hadoop and Visualization...
Big Data
Big Data
Big data
Big data
IRJET - A Prognosis Approach for Stock Market Prediction based on Term Streak...
IRJET - A Prognosis Approach for Stock Market Prediction based on Term Streak...
SURVEY ON BIG DATA PROCESSING USING HADOOP, MAP REDUCE
SURVEY ON BIG DATA PROCESSING USING HADOOP, MAP REDUCE
IJET-V2I6P25
IJET-V2I6P25
Map reduce advantages over parallel databases report
Map reduce advantages over parallel databases report
hadoop seminar training report
hadoop seminar training report
A Survey on Big Data Analysis Techniques
A Survey on Big Data Analysis Techniques
Big data and Hadoop overview
Big data and Hadoop overview
Twitter word frequency count using hadoop components 150331221753
Twitter word frequency count using hadoop components 150331221753
Hadoop and Big Data Analytics | Sysfore
Hadoop and Big Data Analytics | Sysfore
G017143640
G017143640
TSE_Pres12.pptx
TSE_Pres12.pptx
Big Data & Hadoop
Big Data & Hadoop
Hadoop
Hadoop
Big data with hadoop
Big data with hadoop
Big data: Descoberta de conhecimento em ambientes de big data e computação na...
Big data: Descoberta de conhecimento em ambientes de big data e computação na...
Big Data Testing Using Hadoop Platform
Big Data Testing Using Hadoop Platform
More from IRJET Journal
TUNNELING IN HIMALAYAS WITH NATM METHOD: A SPECIAL REFERENCES TO SUNGAL TUNNE...
TUNNELING IN HIMALAYAS WITH NATM METHOD: A SPECIAL REFERENCES TO SUNGAL TUNNE...
IRJET Journal
STUDY THE EFFECT OF RESPONSE REDUCTION FACTOR ON RC FRAMED STRUCTURE
STUDY THE EFFECT OF RESPONSE REDUCTION FACTOR ON RC FRAMED STRUCTURE
IRJET Journal
A COMPARATIVE ANALYSIS OF RCC ELEMENT OF SLAB WITH STARK STEEL (HYSD STEEL) A...
A COMPARATIVE ANALYSIS OF RCC ELEMENT OF SLAB WITH STARK STEEL (HYSD STEEL) A...
IRJET Journal
Effect of Camber and Angles of Attack on Airfoil Characteristics
Effect of Camber and Angles of Attack on Airfoil Characteristics
IRJET Journal
A Review on the Progress and Challenges of Aluminum-Based Metal Matrix Compos...
A Review on the Progress and Challenges of Aluminum-Based Metal Matrix Compos...
IRJET Journal
Dynamic Urban Transit Optimization: A Graph Neural Network Approach for Real-...
Dynamic Urban Transit Optimization: A Graph Neural Network Approach for Real-...
IRJET Journal
Structural Analysis and Design of Multi-Storey Symmetric and Asymmetric Shape...
Structural Analysis and Design of Multi-Storey Symmetric and Asymmetric Shape...
IRJET Journal
A Review of “Seismic Response of RC Structures Having Plan and Vertical Irreg...
A Review of “Seismic Response of RC Structures Having Plan and Vertical Irreg...
IRJET Journal
A REVIEW ON MACHINE LEARNING IN ADAS
A REVIEW ON MACHINE LEARNING IN ADAS
IRJET Journal
Long Term Trend Analysis of Precipitation and Temperature for Asosa district,...
Long Term Trend Analysis of Precipitation and Temperature for Asosa district,...
IRJET Journal
P.E.B. Framed Structure Design and Analysis Using STAAD Pro
P.E.B. Framed Structure Design and Analysis Using STAAD Pro
IRJET Journal
A Review on Innovative Fiber Integration for Enhanced Reinforcement of Concre...
A Review on Innovative Fiber Integration for Enhanced Reinforcement of Concre...
IRJET Journal
Survey Paper on Cloud-Based Secured Healthcare System
Survey Paper on Cloud-Based Secured Healthcare System
IRJET Journal
Review on studies and research on widening of existing concrete bridges
Review on studies and research on widening of existing concrete bridges
IRJET Journal
React based fullstack edtech web application
React based fullstack edtech web application
IRJET Journal
A Comprehensive Review of Integrating IoT and Blockchain Technologies in the ...
A Comprehensive Review of Integrating IoT and Blockchain Technologies in the ...
IRJET Journal
A REVIEW ON THE PERFORMANCE OF COCONUT FIBRE REINFORCED CONCRETE.
A REVIEW ON THE PERFORMANCE OF COCONUT FIBRE REINFORCED CONCRETE.
IRJET Journal
Optimizing Business Management Process Workflows: The Dynamic Influence of Mi...
Optimizing Business Management Process Workflows: The Dynamic Influence of Mi...
IRJET Journal
Multistoried and Multi Bay Steel Building Frame by using Seismic Design
Multistoried and Multi Bay Steel Building Frame by using Seismic Design
IRJET Journal
Cost Optimization of Construction Using Plastic Waste as a Sustainable Constr...
Cost Optimization of Construction Using Plastic Waste as a Sustainable Constr...
IRJET Journal
More from IRJET Journal
(20)
TUNNELING IN HIMALAYAS WITH NATM METHOD: A SPECIAL REFERENCES TO SUNGAL TUNNE...
TUNNELING IN HIMALAYAS WITH NATM METHOD: A SPECIAL REFERENCES TO SUNGAL TUNNE...
STUDY THE EFFECT OF RESPONSE REDUCTION FACTOR ON RC FRAMED STRUCTURE
STUDY THE EFFECT OF RESPONSE REDUCTION FACTOR ON RC FRAMED STRUCTURE
A COMPARATIVE ANALYSIS OF RCC ELEMENT OF SLAB WITH STARK STEEL (HYSD STEEL) A...
A COMPARATIVE ANALYSIS OF RCC ELEMENT OF SLAB WITH STARK STEEL (HYSD STEEL) A...
Effect of Camber and Angles of Attack on Airfoil Characteristics
Effect of Camber and Angles of Attack on Airfoil Characteristics
A Review on the Progress and Challenges of Aluminum-Based Metal Matrix Compos...
A Review on the Progress and Challenges of Aluminum-Based Metal Matrix Compos...
Dynamic Urban Transit Optimization: A Graph Neural Network Approach for Real-...
Dynamic Urban Transit Optimization: A Graph Neural Network Approach for Real-...
Structural Analysis and Design of Multi-Storey Symmetric and Asymmetric Shape...
Structural Analysis and Design of Multi-Storey Symmetric and Asymmetric Shape...
A Review of “Seismic Response of RC Structures Having Plan and Vertical Irreg...
A Review of “Seismic Response of RC Structures Having Plan and Vertical Irreg...
A REVIEW ON MACHINE LEARNING IN ADAS
A REVIEW ON MACHINE LEARNING IN ADAS
Long Term Trend Analysis of Precipitation and Temperature for Asosa district,...
Long Term Trend Analysis of Precipitation and Temperature for Asosa district,...
P.E.B. Framed Structure Design and Analysis Using STAAD Pro
P.E.B. Framed Structure Design and Analysis Using STAAD Pro
A Review on Innovative Fiber Integration for Enhanced Reinforcement of Concre...
A Review on Innovative Fiber Integration for Enhanced Reinforcement of Concre...
Survey Paper on Cloud-Based Secured Healthcare System
Survey Paper on Cloud-Based Secured Healthcare System
Review on studies and research on widening of existing concrete bridges
Review on studies and research on widening of existing concrete bridges
React based fullstack edtech web application
React based fullstack edtech web application
A Comprehensive Review of Integrating IoT and Blockchain Technologies in the ...
A Comprehensive Review of Integrating IoT and Blockchain Technologies in the ...
A REVIEW ON THE PERFORMANCE OF COCONUT FIBRE REINFORCED CONCRETE.
A REVIEW ON THE PERFORMANCE OF COCONUT FIBRE REINFORCED CONCRETE.
Optimizing Business Management Process Workflows: The Dynamic Influence of Mi...
Optimizing Business Management Process Workflows: The Dynamic Influence of Mi...
Multistoried and Multi Bay Steel Building Frame by using Seismic Design
Multistoried and Multi Bay Steel Building Frame by using Seismic Design
Cost Optimization of Construction Using Plastic Waste as a Sustainable Constr...
Cost Optimization of Construction Using Plastic Waste as a Sustainable Constr...
Recently uploaded
VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130
VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130
Suhani Kapoor
(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
ranjana rawat
9953056974 Call Girls In South Ex, Escorts (Delhi) NCR.pdf
9953056974 Call Girls In South Ex, Escorts (Delhi) NCR.pdf
9953056974 Low Rate Call Girls In Saket, Delhi NCR
The Most Attractive Pune Call Girls Budhwar Peth 8250192130 Will You Miss Thi...
The Most Attractive Pune Call Girls Budhwar Peth 8250192130 Will You Miss Thi...
ranjana rawat
(MEERA) Dapodi Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Escorts
(MEERA) Dapodi Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Escorts
ranjana rawat
Coefficient of Thermal Expansion and their Importance.pptx
Coefficient of Thermal Expansion and their Importance.pptx
Asutosh Ranjan
Call Girls Service Nagpur Tanvi Call 7001035870 Meet With Nagpur Escorts
Call Girls Service Nagpur Tanvi Call 7001035870 Meet With Nagpur Escorts
Call Girls in Nagpur High Profile
Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...
Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...
Christo Ananth
High Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur Escorts
Call Girls in Nagpur High Profile
UNIT-III FMM. DIMENSIONAL ANALYSIS
UNIT-III FMM. DIMENSIONAL ANALYSIS
rknatarajan
247267395-1-Symmetric-and-distributed-shared-memory-architectures-ppt (1).ppt
247267395-1-Symmetric-and-distributed-shared-memory-architectures-ppt (1).ppt
ssuser5c9d4b1
SPICE PARK APR2024 ( 6,793 SPICE Models )
SPICE PARK APR2024 ( 6,793 SPICE Models )
Tsuyoshi Horigome
(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
ranjana rawat
MANUFACTURING PROCESS-II UNIT-5 NC MACHINE TOOLS
MANUFACTURING PROCESS-II UNIT-5 NC MACHINE TOOLS
SIVASHANKAR N
Microscopic Analysis of Ceramic Materials.pptx
Microscopic Analysis of Ceramic Materials.pptx
purnimasatapathy1234
APPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICS
APPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICS
KurinjimalarL3
Roadmap to Membership of RICS - Pathways and Routes
Roadmap to Membership of RICS - Pathways and Routes
M Maged Hegazy, LLM, MBA, CCP, P3O
VIP Call Girls Service Kondapur Hyderabad Call +91-8250192130
VIP Call Girls Service Kondapur Hyderabad Call +91-8250192130
Suhani Kapoor
MANUFACTURING PROCESS-II UNIT-2 LATHE MACHINE
MANUFACTURING PROCESS-II UNIT-2 LATHE MACHINE
SIVASHANKAR N
Model Call Girl in Narela Delhi reach out to us at 🔝8264348440🔝
Model Call Girl in Narela Delhi reach out to us at 🔝8264348440🔝
soniya singh
Recently uploaded
(20)
VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130
VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130
(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
9953056974 Call Girls In South Ex, Escorts (Delhi) NCR.pdf
9953056974 Call Girls In South Ex, Escorts (Delhi) NCR.pdf
The Most Attractive Pune Call Girls Budhwar Peth 8250192130 Will You Miss Thi...
The Most Attractive Pune Call Girls Budhwar Peth 8250192130 Will You Miss Thi...
(MEERA) Dapodi Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Escorts
(MEERA) Dapodi Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Escorts
Coefficient of Thermal Expansion and their Importance.pptx
Coefficient of Thermal Expansion and their Importance.pptx
Call Girls Service Nagpur Tanvi Call 7001035870 Meet With Nagpur Escorts
Call Girls Service Nagpur Tanvi Call 7001035870 Meet With Nagpur Escorts
Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...
Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...
High Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur Escorts
UNIT-III FMM. DIMENSIONAL ANALYSIS
UNIT-III FMM. DIMENSIONAL ANALYSIS
247267395-1-Symmetric-and-distributed-shared-memory-architectures-ppt (1).ppt
247267395-1-Symmetric-and-distributed-shared-memory-architectures-ppt (1).ppt
SPICE PARK APR2024 ( 6,793 SPICE Models )
SPICE PARK APR2024 ( 6,793 SPICE Models )
(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
MANUFACTURING PROCESS-II UNIT-5 NC MACHINE TOOLS
MANUFACTURING PROCESS-II UNIT-5 NC MACHINE TOOLS
Microscopic Analysis of Ceramic Materials.pptx
Microscopic Analysis of Ceramic Materials.pptx
APPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICS
APPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICS
Roadmap to Membership of RICS - Pathways and Routes
Roadmap to Membership of RICS - Pathways and Routes
VIP Call Girls Service Kondapur Hyderabad Call +91-8250192130
VIP Call Girls Service Kondapur Hyderabad Call +91-8250192130
MANUFACTURING PROCESS-II UNIT-2 LATHE MACHINE
MANUFACTURING PROCESS-II UNIT-2 LATHE MACHINE
Model Call Girl in Narela Delhi reach out to us at 🔝8264348440🔝
Model Call Girl in Narela Delhi reach out to us at 🔝8264348440🔝
IRJET - Survey Paper on Map Reduce Processing using HADOOP
1.
International Research Journal
of Engineering and Technology (IRJET) e-ISSN: 2395-0056 Volume: 07 Issue: 03 | Mar 2020 www.irjet.net p-ISSN: 2395-0072 © 2020, IRJET | Impact Factor value: 7.34 | ISO 9001:2008 Certified Journal | Page 4905 Survey Paper on “MAP REDUCE PROCESSING USING HADOOP”. Manisha R Kaila1, Harshita N Shetty2, Meet S Bhanushali3 1Student, Computer Engineering, SIGCE, Navi Mumbai, Maharashtra, India 2Student, Computer Engineering, SIGCE, Navi Mumbai, Maharashtra, India 3Student, Computer Engineering, SIGCE, Navi Mumbai, Maharashtra, India ----------------------------------------------------------------------***--------------------------------------------------------------------- Abstract - With the development of web based applications and mobile computer technology there is a rapid growth of data, their computations and analysis continuously in the recent years. Various fields around the globe are facing a big problem with this large scale data which highly supports in decision making. Even the traditional relational DBMS’s and classical data mining are not suitable for handling this big data. Map Reduce is used my many popular andwell-knownIT companies such as Google, Yahoo, Facebook etc. In Big data world Map Reduce helps in achieving the increasing demands on computing resources affected by voluminous data sets. Hadoop is an approved open-source map-reduce implementation which is being used to store and process extremely large data sets on commodity hardware. Key Words: Big Data, Hadoop, MapReduce, Bigdata Analytics 1. INTRODUCTION Ever since the developmentoftechnology,data hasalsobeen growing every single day. If you go back in time in70sor80s not many people were using computers there were only a fraction of people who were dealing with computers and that’s why the data fed into the computer system was also quite less but today everyone owns a gadget like laptops, mobile phones and they are generating data from there every single day. Another factor behind the rise of bigdata is social media. Billions of people today use social media for posting photos, videos and to interact with others. It has been seen that in Facebook the user generates almost 4 million likes in every 60 seconds and similarly on Twitter there is almost 300 thousand tweets every minute. Now that’s a lot of data and it has been rising exponentially over years. Big Data is the term for collection of data sets so large and complex that is becomes difficult to process using on hand database management tools or traditional data processing applications.It becomes verydifficulttodeal with this kind of data that is being generated with such a high speed and that’s why big data becomes an issue because traditional system fails to store and process such big data. The big data problems are identified by 5 V’s – Volume, Variety, Velocity, Value, and Veracity. Volume- It implies that the amount of data the client is using is so huge that it becomes increasingly difficult for the client to store the data into the traditional system. Variety- As there are many sources which lead to Big Data, the type of data they are generating is different. The data generated can be structured, semi-structured or unstructured. Almost 90% of the data are unstructured which is a problem because traditional systems are incapable of processing this unstructured data. Velocity- Velocity refers to the speed at which data is being generated and processed to meet the demands. 72 hours of YouTube video is uploaded every minute, this is velocity Value – It is very important to have useful data and you should be able to extract the right information from it because there are many datasets lying around which are unnecessary. Veracity- Veracity is sparseness of data. Veracity says that you cannot expect data to be always correct or reliable in today’s world. There might be data whichhasmissingvalues and we may have to work with various types of data that is incorrect. 2. APACHE HADOOP Apache Hadoop is an open-source framework that allow us to store and process large data sets in parallel and distributed fashion. It can handle hardware failure automatically. Several applications like Google, Facebook, Yahoo, YouTube, Amazon etc. uses Hadoop. Hadoop Framework can be divided into two parts- Hadoop Distributed File System (HDFS) which helps to store data across clusters and Map Reduce whichdealswithprocessing of data stored in HDFS.HDFS is a distributed file system that
2.
International Research Journal
of Engineering and Technology (IRJET) e-ISSN: 2395-0056 Volume: 07 Issue: 03 | Mar 2020 www.irjet.net p-ISSN: 2395-0072 © 2020, IRJET | Impact Factor value: 7.34 | ISO 9001:2008 Certified Journal | Page 4906 allows you to store large data across the clusters. In Apache Hadoop splits the file into set of blocks andeachblock length is 64 megabytes or 128 megabytes, these blocks are circulated across multiple nodes of cluster. HDFS follows Master/Slave architecture. It means that you have one master node and remaining all other nodes are slave nodes. In HDFS, Master Node are called Name nodes whereas Slave Nodes are called Data Nodes. Now Data Nodes are actually responsible for storing the actual data andNameNodebeing the master node manages all the slave nodes or data nodes. Other responsibility of Name Nodeincludesmaintainingand managing metadata. Metadata is the information about data that is present inside the data nodes. Along with that Data Node are supposed to send some signal so as to ensure that all the Data Nodes are working properly. If anyone Data Node stops sending signal, then name node will assume that the particular Data Node has been failed and same will be notified to admin. It distributes the file among thenodesand allows the system to continue work in case of any node failure. This approach reduces the risk of catastrophic system failure. Apache Hadoop consists of the Hadoop kernel, Hadoop distributed file system (HDFS), map reduce, and related projects are zookeeper, HBase, Apache Hive. Hadoop is scalable, fault tolerant and has high availability. Storage and analysis forlargescaleprocessingisprovidedby Hadoop components. Now a day’s Hadoop is being used by hundreds of companies. Fig (a) HDFS 3. MAPREDUCE Map Reduce is a processing unit of Hadoop using which we can process the Big Data that is storedinHadoopDistributed File System (HDFS). The Big Data that is stored on HDFS is not stored in a traditional manner. Thedata getsdividedinto blocks of data which is stored in respective Data Nodes. There is no complete data that is present in one single location. Hence native client application cannot processthat data so we need a special framework that has the capability of processing such data and so Hadoop Map Reduce is used. Map Reduce is used for Indexing and Searching, Classification and it is used as recommendation engine by Amazon, Flipkart and many more.ItisalsousedforAnalytics by several companies.MapReducehasbeenimplemented by Google and it has been adopted by Apache Hadoop for processing data using Pig, Hive and HBase. The Map reduce programming model is based on two functions which are map function and reduce function. It is also called as master and slave architecture. Job tracker is the master process in MapReduce whereas task tracker is the slaveinMapReduce. These are the processes which are used to process the data present in HDFS. Job tracker is running under name node machine whereas task trackerrunondata nodes.sub-project of the Apache Hadoop project. The reduce operation combines those data tuples basedonthekeyandaccordingly modifies the value of the key. A Map Reduce framework can be classified into two steps such as: Map Task and Reduce Task. The Map Task gets an input and the output yield from map task after processing is given as inputtothereducetask and then the final output is given to the client. A Map Reduce works on key value pair. A Map takes key value pair as input and gives an output as list of key value. Now this list of key value goes through a shuffle phase and an input of key and list of values is given to the reducer. Finally, the reducers give a list of key value pair as output. Fig (b) Map Reduce 3.1 Input reader The input reader divides the inputintoapplicablesize'splits' (in follow, typically, 64 MB to 128 MB) also the framework assigns one split to every Map perform. The input reader reads information from stable storage (typically, a distributed file system) and generates key/value pairs. A common example can scan a directory filled with text files and return each line as a record. 3.2. Map function The Map function takes a series ofkey/valuepairs,processes each, and generates zero or more output key/value pairs. The input and output forms of the map usually can be (and often are) completely different from one other. If the appliance is doing a word count, the map function would break the line into words and output a key/value combine for every word. Each output combine would contain the word as the key and also the variety of instances of that word within the line as the value. 3.3. Partition function Each Map function output is allocated toa particularreducer the application partition for sharing purpose. The partition perform is given the key, and also thevarietyofreducers and returns the index of the required reducer. A typical defaultis to hash the key and use the hash price modulothenumberof reducers. It is necessary to choose a partition perform that
3.
International Research Journal
of Engineering and Technology (IRJET) e-ISSN: 2395-0056 Volume: 07 Issue: 03 | Mar 2020 www.irjet.net p-ISSN: 2395-0072 © 2020, IRJET | Impact Factor value: 7.34 | ISO 9001:2008 Certified Journal | Page 4907 provides AN around uniform distribution of knowledge per fragment for load-balancing functions, otherwise the Map Reduce operation are often delayed waiting for slow reducers to finish (i.e. The reducers assigned the larger shares of the non-uniformly divided data). Between the maps and cut back stages, the data are shuffled (parallel- sorted / exchanged between nodes) to move the data from the map node that produced them to the shard in whichthey will be reduced. The shuffle will generally take longer than the computation time counting on network information measure, CPU speeds, data produced, andtimetakenby map and reduce computations. 3.4. Comparison function The input for every cut back is forced from the machine where the Map ran and sorted using the application's comparison perform. 3.5. Reduce function The framework calls the application’s Reduce function once for each unique key in the sorted order. The cut back will retell through the values that are related to that key and manufacture zero, or additional outputs. In the word count example, the reduce perform takes the input values, adds them and generates one output of the wordandalsothefinal sum. 3.6. Output writer The output writer writes the output of the Reduce to the stable storage. IV. ADVANTAGES 4.1 Scalability: Hadoop is highly scalable. This is because of its ability to store as well as distribute large data sets across plenty of servers. It provides a model in which data can be processed by spreading the data over multiple inexpensive machines which can be increased or decreased as per the enterprise requirement 4.2 Cost-effective solution: Hadoop, on the other hand, is designed as a scale-out architecture that can affordably store all ofa company'sdata for later use. The cost savings are staggering: rather than cost accounting thousands to tens of thousands of pounds per computer memory unit, Hadoop offers computing and storage capabilities for many pounds per computermemory unit. 4.3 Flexibility: Hadoop allows businesses to simply access new knowledge sources and faucet into differing of information (both structured and unstructured) to come up with worth from that data. This means businesses will use Hadoop to derive valuable business insight fromknowledgesourceslikesocial media, email conversations or click stream knowledge .In addition, Hadoop may be used for a large type of functions, like log process, recommendation systems, knowledge deposit, market campaign analysis and fraud detection. 4.4 Parallel processing: One of the first aspects of the operating of Map Reduce programming is that it divides tasks in a very manner that enables their execution in parallel. Parallel processing permits multiple processors to take on these divided tasks, such that they run entire programs in less time. V. CONCLUSION This paper analyses the concept of big data analysisandhow it can be simplified as from existing traditional relational database technologies. This paper clearly specifies the Hadoop environment, its architecture and how it can be implemented using Map Reduce along with various functions. Hadoop map reduce programming overview and HDFS are progressively being utilized for processing vast and unstructured data sets. VI. REFERENCES [1] http://hadoop.apache.org.2010 [2] https://en.wikipedia.org/wiki/MapReduce [3] T. White, Pro Hadoop: The Definitive Guide. O’Reilly Media, Yahoo!Press, June 5,2009. [4]http://www.tutorialspoint.com/hadoop/hadoop_hdfs_ov erview.htm [5]V. Patil, V.B.Nikam, “study of mining algorithm in cloud computing using MapReduce Framework”, Journal of engineering, computers & applied sciences (JEC&AS) vol.2,No.7,July 2013 BIOGRAPHIES Manisha R Kaila is pursuing Bachelordegree(B.E.)inComputer Engineering from Smt. Indira Gandhi College of Engineering, Navi Mumbai. Harshita N Shetty is pursuing Bachelor’s degree (B.E.) in Computer Engineering from Smt. Indira Gandhi College of Engineering, Navi Mumbai.
4.
International Research Journal
of Engineering and Technology (IRJET) e-ISSN: 2395-0056 Volume: 07 Issue: 03 | Mar 2020 www.irjet.net p-ISSN: 2395-0072 © 2020, IRJET | Impact Factor value: 7.34 | ISO 9001:2008 Certified Journal | Page 4908 Meet S Bhanushali is pursuing Bachelordegree(B.E.)inComputer Engineering from Smt. Indira Gandhi College of Engineering, Navi Mumbai.
Download now