SlideShare a Scribd company logo
Hadoop Installation
Xuhong Zhang, Jiangling Yin
Advisor: Dr. Jun Wang
Hadoop Installation
• Documentation
Goes to http://hadoop.apache.org/
Documentation
Install Hadoop in a Cluster
UCF CASS:
http://cass.eecs.ucf.edu/ganglia/?p=2&c=CASS
Prerequisites
• Several machines
• Linux for production platform (Linux centos in
this example)
• Java installed (Version 6 or later)
• SSH installed
Install Hadoop
• Two steps:
1. Download Hadoop from release page
http://hadoop.apache.org/releases.html#Download
In this example, we use Hadoop 2.2.0
1. Config configuration files
Download and unpack Hadoop
• Unpackage
$ tar hadoop-2.2.0.tar.gz
• ‘cd’ to directory hadoop
$ cd hadoop-2.2.0/
• Inside hadoop-2.2.0 directory
Configuration (1)
• All configuration files are under hadoop-
2.2.0/etc/hadoop directory:
Configuration (2) –
Environment variables
• Java
Set JAVA_HOME to the location of your jdk
for example:
$export JAVA_HOME=/home/ji453898/jan/jdk1.7.0_03
• Hadoop
Set HADOOP_HOME to the location of your hadoop
folder
export HADOOP_HOME=/home/xzhang/hadoop-2.2.0
export PATH=$PATH:$HADOOP_HOME/bin
Configuration (3) –
core-site.xml
Configuration (4) –
hdfs-site.xml
Configuration (4) –
hdfs-site.xml
Configuration (5) –
mapred-site.xml
Configuration (5) –
mapred-site.xml
Configuration (6) –
yarn-site.xml
Configuration (7) –
slaves
A list of machines (one per line) that each run
a datanode and a tasktracker.
Configuration-SSH passwordless
login
SSH passwordless login from master to slaves
• Generate SSH key pairs(public and private)
Configuration-SSH passwordless
login
• Append public key into authorized_keys
$ cat ~/.ssh/id_rsa.pub >> ~/.ssh/authorized_keys
• Copy id_rsa.pub file into all slave’s ~/.ssh/ folder
Run Hadoop
• Format Name node
• Start dfs(HDFS)
• Start Yarn(resourcemanager,nodemanager)
• Check if success
Format Namenode
• Inside bin directory
$./hadoop namenode –format
Start DFS and Yarn
• Start hadoop
Check
• On namenode
• On datanode
Run a sample MapReduce (1)
• Upload one file into dfs:
Link to hadoop file system shell documentation
http://hadoop.apache.org/docs/r2.4.0/hadoop-project-dist/hadoop-common/FileSystemShell.html
Run a sample MapReduce (2)
• Run a MapReduce job:
$ hadoop jar hadoop-mapreduce-examples-2.4.2-SNAPSHOT.jar wordcount
/wordcount/input /wordcount/output
……./hadoop/share/hadoop/mapreduce/
compiling
• compile WordCount.java
$ javac -classpath hadoop-core-0.20.203.0.jar -d
wordcount WordCount.java
• create a jar
$jar -cvf ./word.jar -C wordcount .
• Look up the clasess:
$ jar tf word.jar

More Related Content

What's hot

New microsoft power point presentation
New microsoft power point presentationNew microsoft power point presentation
New microsoft power point presentationrajsandhu1989
 
Friends of Solr - Nutch & HDFS
Friends of Solr - Nutch & HDFSFriends of Solr - Nutch & HDFS
Friends of Solr - Nutch & HDFSSaumitra Srivastav
 
Global State Management of Micro Services
Global State Management of Micro ServicesGlobal State Management of Micro Services
Global State Management of Micro ServicesSalesforce Engineering
 
Dockertaipei 20150528-dockerswarm
Dockertaipei 20150528-dockerswarmDockertaipei 20150528-dockerswarm
Dockertaipei 20150528-dockerswarmWei-Ting Kuo
 
Hadoop enhancements using next gen IA technologies
Hadoop enhancements using next gen IA technologiesHadoop enhancements using next gen IA technologies
Hadoop enhancements using next gen IA technologiesBigdata Meetup Kochi
 
Introduction of mesos persistent storage
Introduction of mesos persistent storageIntroduction of mesos persistent storage
Introduction of mesos persistent storageZhou Weitao
 
Meet Solr For The Tirst Again
Meet Solr For The Tirst AgainMeet Solr For The Tirst Again
Meet Solr For The Tirst AgainVarun Thacker
 
Ceph-Mesos framework
Ceph-Mesos frameworkCeph-Mesos framework
Ceph-Mesos frameworkZhongyue Luo
 
An Introduction to Elasticsearch for Beginners
An Introduction to Elasticsearch for BeginnersAn Introduction to Elasticsearch for Beginners
An Introduction to Elasticsearch for BeginnersAmir Sedighi
 
SLE12 SP2 : High Availability et Geo Cluster
SLE12 SP2 : High Availability et Geo ClusterSLE12 SP2 : High Availability et Geo Cluster
SLE12 SP2 : High Availability et Geo ClusterSUSE
 
SUSE Storage: Sizing and Performance (Ceph)
SUSE Storage: Sizing and Performance (Ceph)SUSE Storage: Sizing and Performance (Ceph)
SUSE Storage: Sizing and Performance (Ceph)Lars Marowsky-Brée
 
Containerized Data Persistence on Mesos
Containerized Data Persistence on MesosContainerized Data Persistence on Mesos
Containerized Data Persistence on MesosJoe Stein
 
The Proto-Burst Buffer: Experience with the flash-based file system on SDSC's...
The Proto-Burst Buffer: Experience with the flash-based file system on SDSC's...The Proto-Burst Buffer: Experience with the flash-based file system on SDSC's...
The Proto-Burst Buffer: Experience with the flash-based file system on SDSC's...Glenn K. Lockwood
 
Developing with-devstack
Developing with-devstackDeveloping with-devstack
Developing with-devstackDeepak Garg
 
Making Distributed Data Persistent Services Elastic (Without Losing All Your ...
Making Distributed Data Persistent Services Elastic (Without Losing All Your ...Making Distributed Data Persistent Services Elastic (Without Losing All Your ...
Making Distributed Data Persistent Services Elastic (Without Losing All Your ...C4Media
 
HBaseConEast2016: HBase on Docker with Clusterdock
HBaseConEast2016: HBase on Docker with ClusterdockHBaseConEast2016: HBase on Docker with Clusterdock
HBaseConEast2016: HBase on Docker with ClusterdockMichael Stack
 
Develop with linux containers and docker
Develop with linux containers and dockerDevelop with linux containers and docker
Develop with linux containers and dockerFabio Fumarola
 

What's hot (18)

New microsoft power point presentation
New microsoft power point presentationNew microsoft power point presentation
New microsoft power point presentation
 
Friends of Solr - Nutch & HDFS
Friends of Solr - Nutch & HDFSFriends of Solr - Nutch & HDFS
Friends of Solr - Nutch & HDFS
 
Global State Management of Micro Services
Global State Management of Micro ServicesGlobal State Management of Micro Services
Global State Management of Micro Services
 
Dockertaipei 20150528-dockerswarm
Dockertaipei 20150528-dockerswarmDockertaipei 20150528-dockerswarm
Dockertaipei 20150528-dockerswarm
 
Hadoop enhancements using next gen IA technologies
Hadoop enhancements using next gen IA technologiesHadoop enhancements using next gen IA technologies
Hadoop enhancements using next gen IA technologies
 
Introduction of mesos persistent storage
Introduction of mesos persistent storageIntroduction of mesos persistent storage
Introduction of mesos persistent storage
 
Meet Solr For The Tirst Again
Meet Solr For The Tirst AgainMeet Solr For The Tirst Again
Meet Solr For The Tirst Again
 
Dev ops meetup
Dev ops meetupDev ops meetup
Dev ops meetup
 
Ceph-Mesos framework
Ceph-Mesos frameworkCeph-Mesos framework
Ceph-Mesos framework
 
An Introduction to Elasticsearch for Beginners
An Introduction to Elasticsearch for BeginnersAn Introduction to Elasticsearch for Beginners
An Introduction to Elasticsearch for Beginners
 
SLE12 SP2 : High Availability et Geo Cluster
SLE12 SP2 : High Availability et Geo ClusterSLE12 SP2 : High Availability et Geo Cluster
SLE12 SP2 : High Availability et Geo Cluster
 
SUSE Storage: Sizing and Performance (Ceph)
SUSE Storage: Sizing and Performance (Ceph)SUSE Storage: Sizing and Performance (Ceph)
SUSE Storage: Sizing and Performance (Ceph)
 
Containerized Data Persistence on Mesos
Containerized Data Persistence on MesosContainerized Data Persistence on Mesos
Containerized Data Persistence on Mesos
 
The Proto-Burst Buffer: Experience with the flash-based file system on SDSC's...
The Proto-Burst Buffer: Experience with the flash-based file system on SDSC's...The Proto-Burst Buffer: Experience with the flash-based file system on SDSC's...
The Proto-Burst Buffer: Experience with the flash-based file system on SDSC's...
 
Developing with-devstack
Developing with-devstackDeveloping with-devstack
Developing with-devstack
 
Making Distributed Data Persistent Services Elastic (Without Losing All Your ...
Making Distributed Data Persistent Services Elastic (Without Losing All Your ...Making Distributed Data Persistent Services Elastic (Without Losing All Your ...
Making Distributed Data Persistent Services Elastic (Without Losing All Your ...
 
HBaseConEast2016: HBase on Docker with Clusterdock
HBaseConEast2016: HBase on Docker with ClusterdockHBaseConEast2016: HBase on Docker with Clusterdock
HBaseConEast2016: HBase on Docker with Clusterdock
 
Develop with linux containers and docker
Develop with linux containers and dockerDevelop with linux containers and docker
Develop with linux containers and docker
 

Viewers also liked

An example Hadoop Install
An example Hadoop InstallAn example Hadoop Install
An example Hadoop InstallMike Frampton
 
Installing hadoop on ubuntu 16
Installing hadoop on ubuntu 16Installing hadoop on ubuntu 16
Installing hadoop on ubuntu 16Enrique Davila
 
Introducción a Big Data. HDInsight - Webcast Technet SolidQ
Introducción a Big Data. HDInsight - Webcast Technet SolidQIntroducción a Big Data. HDInsight - Webcast Technet SolidQ
Introducción a Big Data. HDInsight - Webcast Technet SolidQSolidQ
 
Big Data para Dummies
Big Data para DummiesBig Data para Dummies
Big Data para DummiesStratebi
 
Install Apache Hadoop for Development/Production
Install Apache Hadoop for  Development/ProductionInstall Apache Hadoop for  Development/Production
Install Apache Hadoop for Development/ProductionIMC Institute
 
Big data para principiantes
Big data para principiantesBig data para principiantes
Big data para principiantesCarlos Toxtli
 
Introducción al Big Data
Introducción al Big DataIntroducción al Big Data
Introducción al Big DataDavid Alayón
 
Deploy hadoop cluster
Deploy hadoop clusterDeploy hadoop cluster
Deploy hadoop clusterChirag Ahuja
 
When the faith rubber meets the road mile 3
When the faith rubber meets the road   mile 3When the faith rubber meets the road   mile 3
When the faith rubber meets the road mile 3Donald Jacobs
 
怪异的Google文化
怪异的Google文化怪异的Google文化
怪异的Google文化sogopoo
 
Prevención del sida en la escuela secundaria
Prevención del sida en la escuela secundariaPrevención del sida en la escuela secundaria
Prevención del sida en la escuela secundarialucmcgreb
 
H.Y.P.E. 2010 Newsletter
H.Y.P.E. 2010 NewsletterH.Y.P.E. 2010 Newsletter
H.Y.P.E. 2010 Newsletterhoustonsyouth
 
Rijden Op Waterstof In Overijssel (3)
Rijden Op Waterstof In Overijssel (3)Rijden Op Waterstof In Overijssel (3)
Rijden Op Waterstof In Overijssel (3)kristelijn
 
Austraalia meestele
Austraalia meesteleAustraalia meestele
Austraalia meesteleTuuli Kotov
 
2011 Subaru Tribeca For Sale Near Manchester NH | Singer Subaru
2011 Subaru Tribeca For Sale Near Manchester NH | Singer Subaru2011 Subaru Tribeca For Sale Near Manchester NH | Singer Subaru
2011 Subaru Tribeca For Sale Near Manchester NH | Singer SubaruSinger Subaru, LLC
 

Viewers also liked (20)

Ppt recentschoolnieuws
Ppt recentschoolnieuwsPpt recentschoolnieuws
Ppt recentschoolnieuws
 
An example Hadoop Install
An example Hadoop InstallAn example Hadoop Install
An example Hadoop Install
 
Installing hadoop on ubuntu 16
Installing hadoop on ubuntu 16Installing hadoop on ubuntu 16
Installing hadoop on ubuntu 16
 
Introducción a Big Data. HDInsight - Webcast Technet SolidQ
Introducción a Big Data. HDInsight - Webcast Technet SolidQIntroducción a Big Data. HDInsight - Webcast Technet SolidQ
Introducción a Big Data. HDInsight - Webcast Technet SolidQ
 
Big Data para Dummies
Big Data para DummiesBig Data para Dummies
Big Data para Dummies
 
Install Apache Hadoop for Development/Production
Install Apache Hadoop for  Development/ProductionInstall Apache Hadoop for  Development/Production
Install Apache Hadoop for Development/Production
 
Big data para principiantes
Big data para principiantesBig data para principiantes
Big data para principiantes
 
Introducción al Big Data
Introducción al Big DataIntroducción al Big Data
Introducción al Big Data
 
Deploy hadoop cluster
Deploy hadoop clusterDeploy hadoop cluster
Deploy hadoop cluster
 
When the faith rubber meets the road mile 3
When the faith rubber meets the road   mile 3When the faith rubber meets the road   mile 3
When the faith rubber meets the road mile 3
 
Avain Green Tourism of Finland GTF® ympäristömerkkiin - 2018
Avain Green Tourism of Finland GTF®   ympäristömerkkiin  - 2018Avain Green Tourism of Finland GTF®   ympäristömerkkiin  - 2018
Avain Green Tourism of Finland GTF® ympäristömerkkiin - 2018
 
El verbo
El verboEl verbo
El verbo
 
怪异的Google文化
怪异的Google文化怪异的Google文化
怪异的Google文化
 
Prevención del sida en la escuela secundaria
Prevención del sida en la escuela secundariaPrevención del sida en la escuela secundaria
Prevención del sida en la escuela secundaria
 
myriadeclecticviews
myriadeclecticviewsmyriadeclecticviews
myriadeclecticviews
 
H.Y.P.E. 2010 Newsletter
H.Y.P.E. 2010 NewsletterH.Y.P.E. 2010 Newsletter
H.Y.P.E. 2010 Newsletter
 
Rijden Op Waterstof In Overijssel (3)
Rijden Op Waterstof In Overijssel (3)Rijden Op Waterstof In Overijssel (3)
Rijden Op Waterstof In Overijssel (3)
 
Bab iii
Bab iiiBab iii
Bab iii
 
Austraalia meestele
Austraalia meesteleAustraalia meestele
Austraalia meestele
 
2011 Subaru Tribeca For Sale Near Manchester NH | Singer Subaru
2011 Subaru Tribeca For Sale Near Manchester NH | Singer Subaru2011 Subaru Tribeca For Sale Near Manchester NH | Singer Subaru
2011 Subaru Tribeca For Sale Near Manchester NH | Singer Subaru
 

Similar to Install hadoop in a cluster

Single node setup
Single node setupSingle node setup
Single node setupKBCHOW123
 
Hadoop single node setup
Hadoop single node setupHadoop single node setup
Hadoop single node setupMohammad_Tariq
 
Single node hadoop cluster installation
Single node hadoop cluster installation Single node hadoop cluster installation
Single node hadoop cluster installation Mahantesh Angadi
 
02 Hadoop deployment and configuration
02 Hadoop deployment and configuration02 Hadoop deployment and configuration
02 Hadoop deployment and configurationSubhas Kumar Ghosh
 
Apache HDFS - Lab Assignment
Apache HDFS - Lab AssignmentApache HDFS - Lab Assignment
Apache HDFS - Lab AssignmentFarzad Nozarian
 
Apache Hadoop & Hive installation with movie rating exercise
Apache Hadoop & Hive installation with movie rating exerciseApache Hadoop & Hive installation with movie rating exercise
Apache Hadoop & Hive installation with movie rating exerciseShiva Rama Krishna Dasharathi
 
Hadoop single node installation on ubuntu 14
Hadoop single node installation on ubuntu 14Hadoop single node installation on ubuntu 14
Hadoop single node installation on ubuntu 14jijukjoseph
 
Hadoop cluster 安裝
Hadoop cluster 安裝Hadoop cluster 安裝
Hadoop cluster 安裝recast203
 
Big data with hadoop Setup on Ubuntu 12.04
Big data with hadoop Setup on Ubuntu 12.04Big data with hadoop Setup on Ubuntu 12.04
Big data with hadoop Setup on Ubuntu 12.04Mandakini Kumari
 
Hadoop installation on windows
Hadoop installation on windows Hadoop installation on windows
Hadoop installation on windows habeebulla g
 
Deployment and Management of Hadoop Clusters
Deployment and Management of Hadoop ClustersDeployment and Management of Hadoop Clusters
Deployment and Management of Hadoop ClustersAmal G Jose
 
Hadoop 2.0 cluster setup on ubuntu 14.04 (64 bit)
Hadoop 2.0 cluster setup on ubuntu 14.04 (64 bit)Hadoop 2.0 cluster setup on ubuntu 14.04 (64 bit)
Hadoop 2.0 cluster setup on ubuntu 14.04 (64 bit)Nag Arvind Gudiseva
 
Hadoop installation and Running KMeans Clustering with MapReduce Program on H...
Hadoop installation and Running KMeans Clustering with MapReduce Program on H...Hadoop installation and Running KMeans Clustering with MapReduce Program on H...
Hadoop installation and Running KMeans Clustering with MapReduce Program on H...Titus Damaiyanti
 
Big data using Hadoop, Hive, Sqoop with Installation
Big data using Hadoop, Hive, Sqoop with InstallationBig data using Hadoop, Hive, Sqoop with Installation
Big data using Hadoop, Hive, Sqoop with Installationmellempudilavanya999
 
Hadoop Installation
Hadoop InstallationHadoop Installation
Hadoop InstallationAhmed Salman
 

Similar to Install hadoop in a cluster (20)

Single node setup
Single node setupSingle node setup
Single node setup
 
Hadoop single node setup
Hadoop single node setupHadoop single node setup
Hadoop single node setup
 
Single node hadoop cluster installation
Single node hadoop cluster installation Single node hadoop cluster installation
Single node hadoop cluster installation
 
02 Hadoop deployment and configuration
02 Hadoop deployment and configuration02 Hadoop deployment and configuration
02 Hadoop deployment and configuration
 
Apache HDFS - Lab Assignment
Apache HDFS - Lab AssignmentApache HDFS - Lab Assignment
Apache HDFS - Lab Assignment
 
Apache Hadoop & Hive installation with movie rating exercise
Apache Hadoop & Hive installation with movie rating exerciseApache Hadoop & Hive installation with movie rating exercise
Apache Hadoop & Hive installation with movie rating exercise
 
Hadoop 설치
Hadoop 설치Hadoop 설치
Hadoop 설치
 
Run wordcount job (hadoop)
Run wordcount job (hadoop)Run wordcount job (hadoop)
Run wordcount job (hadoop)
 
Exp-3.pptx
Exp-3.pptxExp-3.pptx
Exp-3.pptx
 
Hadoop 2.4 installing on ubuntu 14.04
Hadoop 2.4 installing on ubuntu 14.04Hadoop 2.4 installing on ubuntu 14.04
Hadoop 2.4 installing on ubuntu 14.04
 
Hadoop single node installation on ubuntu 14
Hadoop single node installation on ubuntu 14Hadoop single node installation on ubuntu 14
Hadoop single node installation on ubuntu 14
 
Hadoop cluster 安裝
Hadoop cluster 安裝Hadoop cluster 安裝
Hadoop cluster 安裝
 
Hdfs java api
Hdfs java apiHdfs java api
Hdfs java api
 
Big data with hadoop Setup on Ubuntu 12.04
Big data with hadoop Setup on Ubuntu 12.04Big data with hadoop Setup on Ubuntu 12.04
Big data with hadoop Setup on Ubuntu 12.04
 
Hadoop installation on windows
Hadoop installation on windows Hadoop installation on windows
Hadoop installation on windows
 
Deployment and Management of Hadoop Clusters
Deployment and Management of Hadoop ClustersDeployment and Management of Hadoop Clusters
Deployment and Management of Hadoop Clusters
 
Hadoop 2.0 cluster setup on ubuntu 14.04 (64 bit)
Hadoop 2.0 cluster setup on ubuntu 14.04 (64 bit)Hadoop 2.0 cluster setup on ubuntu 14.04 (64 bit)
Hadoop 2.0 cluster setup on ubuntu 14.04 (64 bit)
 
Hadoop installation and Running KMeans Clustering with MapReduce Program on H...
Hadoop installation and Running KMeans Clustering with MapReduce Program on H...Hadoop installation and Running KMeans Clustering with MapReduce Program on H...
Hadoop installation and Running KMeans Clustering with MapReduce Program on H...
 
Big data using Hadoop, Hive, Sqoop with Installation
Big data using Hadoop, Hive, Sqoop with InstallationBig data using Hadoop, Hive, Sqoop with Installation
Big data using Hadoop, Hive, Sqoop with Installation
 
Hadoop Installation
Hadoop InstallationHadoop Installation
Hadoop Installation
 

Install hadoop in a cluster