Hadoop Multinode Cluster Installation

The document describes the steps to install Apache Hadoop 2.7.1 in a multinode cluster configuration with 1 name node and 3 data nodes on Ubuntu 12.04. It involves setting up the environment, configuring hostnames and IPs, installing Java and Hadoop, configuring configuration files, enabling passwordless SSH login, formatting the name node, and starting and stopping the Hadoop daemons.


Multinode Cluster Installation Mode

Apache Hadoop v2.7.1


Linux Operating System (Ubuntu 12.04)

Environment Setup:
No. of Nodes = 4 (1 Namenode, 3 Datanodes)
Hostnames:
Namenode – namenode
Datanodes – datanode1, datanode2, datanode3

Installation and Configuration:


In “namenode”
1. Create a new user “multinode” for this installation procedure.
~$ sudo adduser multinode

2. Edit the “/etc/hosts” file providing the IP addresses of the cluster nodes.
~$ sudo vim /etc/hosts
namenode-ip-address namenode
datanode1-ip-address datanode1
datanode2-ip-address datanode2
datanode3-ip-address datanode3

Comment out the line containing “localhost”.


After making the above-mentioned changes, save and close the file.
Note: Also make sure all four nodes are reachable from one another over the network.
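
For illustration only (these addresses are hypothetical), the finished file might look like:
192.168.1.100 namenode
192.168.1.101 datanode1
192.168.1.102 datanode2
192.168.1.103 datanode3

A quick ping from the namenode confirms each datanode is reachable:
~$ ping -c 1 datanode1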

3. Switch to the newly created user account


~$ su - multinode

4. Download the Apache Hadoop 2.7.1 tarball distribution.

5. Download the Java 7 (JDK 1.7) tarball. Check the machine architecture, 32-bit (i386,
i586, i686) or 64-bit (x86_64), before downloading.
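
For reference, the Hadoop 2.7.1 tarball should still be available from the Apache archive (the JDK tarball has to be fetched manually from Oracle's download page after accepting the license; the file name used below matches the one extracted in step 6):
~$ wget https://archive.apache.org/dist/hadoop/common/hadoop-2.7.1/hadoop-2.7.1.tar.gz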

6. Assuming the downloaded tarballs are in the home directory of the user, extract them:

~$ tar -xvf hadoop-2.7.1.tar.gz


~$ tar -xvf jdk-7u79-linux-x86_64.gz

7. After extracting, set up the environment variables in ~/.bashrc file


~$ vi .bashrc
export JAVA_HOME=/home/multinode/jdk1.7.0_79
export HADOOP_PREFIX=/home/multinode/hadoop-2.7.1
export HADOOP_HOME=${HADOOP_PREFIX}
export HADOOP_CONF_DIR=${HADOOP_PREFIX}/etc/hadoop
export PATH=$JAVA_HOME/bin:$HADOOP_HOME/bin:$HADOOP_HOME/sbin:$PATH
After appending these lines, save and close the file.

8. For these variables to be set for the current shell, source the file.
~$ source ~/.bashrc
Check whether the changes have been applied properly
~$ echo $JAVA_HOME
~$ hadoop version
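If everything is in place, the first command should print the JDK path set above (/home/multinode/jdk1.7.0_79) and "hadoop version" should report a banner beginning with "Hadoop 2.7.1".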

9. Next, edit the Hadoop configuration files.


~$ cd $HADOOP_CONF_DIR

~hadoop-2.7.1/etc/hadoop$ vi hadoop-env.sh
export JAVA_HOME=/home/multinode/jdk1.7.0_79

~hadoop-2.7.1/etc/hadoop$ vi core-site.xml
<configuration>
  <property>
    <name>fs.defaultFS</name>
    <value>hdfs://namenode:8020</value>
  </property>
</configuration>

~hadoop-2.7.1/etc/hadoop$ vi hdfs-site.xml
<configuration>
  <property>
    <name>dfs.namenode.name.dir</name>
    <value>/home/multinode/name</value>
  </property>
  <property>
    <name>dfs.datanode.data.dir</name>
    <value>/home/multinode/data</value>
  </property>
  <property>
    <name>dfs.namenode.http-address</name>
    <value>namenode:50070</value>
  </property>
</configuration>
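
The name and data directories referenced above are initialized by the namenode format in step 11 and by the daemons themselves. Optionally, they can be created up front so that ownership or permission problems surface early (a convenience step, not part of the original procedure):
~$ mkdir -p /home/multinode/name /home/multinode/data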

~hadoop-2.7.1/etc/hadoop$ vi yarn-site.xml
<configuration>
  <property>
    <name>yarn.resourcemanager.hostname</name>
    <value>namenode</value>
  </property>
  <property>
    <name>yarn.nodemanager.aux-services</name>
    <value>mapreduce_shuffle</value>
  </property>
</configuration>

~hadoop-2.7.1/etc/hadoop$ cp mapred-site.xml.template mapred-site.xml

~hadoop-2.7.1/etc/hadoop$ vi mapred-site.xml
<configuration>
  <property>
    <name>mapreduce.framework.name</name>
    <value>yarn</value>
  </property>
</configuration>

Note: These configurations assume that the NameNode, ResourceManager and
JobHistoryServer daemons all run on the “namenode” host.

~hadoop-2.7.1/etc/hadoop$ vi slaves
datanode1
datanode2
datanode3

In “datanodes”:
Repeat the steps from 1 to 9 on all datanodes.
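
Instead of repeating the manual edits, one possible shortcut (an assumption, not part of the original steps) is to copy the already-configured directories and the .bashrc from the namenode once the “multinode” user exists on each datanode:
~$ scp -r ~/hadoop-2.7.1 ~/jdk1.7.0_79 ~/.bashrc multinode@datanode1:~/
Repeat the copy for datanode2 and datanode3 (this will still prompt for a password, since step 10 has not been done yet).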

10. To enable passwordless login from the namenode to all datanodes through SSH
In “namenode”:
~$ ssh-keygen
~$ ssh-copy-id -i ~/.ssh/id_rsa.pub namenode
~$ ssh-copy-id -i ~/.ssh/id_rsa.pub datanode1
~$ ssh-copy-id -i ~/.ssh/id_rsa.pub datanode2
~$ ssh-copy-id -i ~/.ssh/id_rsa.pub datanode3

This avoids password prompts when the start scripts launch the daemons.
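
A quick check that the key-based login works is to run a remote command against each datanode; it should print the hostname without asking for a password:
~$ ssh datanode1 hostname
~$ ssh datanode2 hostname
~$ ssh datanode3 hostname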

11. Format the namenode before starting the daemons.


~$ hdfs namenode -format

This formats the dfs.namenode.name.dir location and creates the files and folders
required by the namenode.

Note: Steps 10 and 11 are one-time procedures.

12. Start the cluster


~$ start-dfs.sh
~$ start-yarn.sh
~$ mr-jobhistory-daemon.sh start historyserver
Alternatively, to start all the daemons
~$ start-all.sh
~$ mr-jobhistory-daemon.sh start historyserver

13. To check the running daemons, use jps (Java process status)
~$ jps
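With the configuration above, jps on the namenode would typically list NameNode, SecondaryNameNode, ResourceManager and JobHistoryServer (plus Jps itself), while each datanode would list DataNode and NodeManager; the process IDs shown will differ from cluster to cluster.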

14. To stop the cluster


~$ stop-yarn.sh
~$ stop-dfs.sh
~$ mr-jobhistory-daemon.sh stop historyserver
To stop all the daemons in one go
~$ stop-all.sh
~$ mr-jobhistory-daemon.sh stop historyserver

Note: To start or stop daemons individually:


~$ hadoop-daemon.sh <start | stop> <namenode | datanode>
~$ yarn-daemon.sh <start | stop> <resourcemanager | nodemanager>

To stop or start all datanodes


~$ hadoop-daemons.sh <start | stop> datanode

To stop or start all nodemanagers


~$ yarn-daemons.sh <start | stop> nodemanager
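
Once the daemons are up, the web interfaces offer another quick health check (the NameNode port is the one configured in hdfs-site.xml; the other two are the YARN and JobHistory defaults):
http://namenode:50070 - NameNode / HDFS status
http://namenode:8088 - ResourceManager / YARN applications
http://namenode:19888 - MapReduce JobHistoryServer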
