Installing Hadoop in Ubuntu in Virtual Box Instructions

This document provides instructions for installing Hadoop 2.6 in single-node pseudo-distributed mode on an Ubuntu 14.04 virtual machine. It covers installing Java, disabling IPv6, adding a dedicated hadoop user, installing SSH and setting up SSH keys, downloading and extracting Hadoop, editing the configuration files, formatting the HDFS filesystem, starting all Hadoop processes, testing the installation, and stopping the Hadoop processes.

Uploaded by vidhyadeepa

https://youtu.be/SaVFs_iDMPo - video link


Install Hadoop 2.6 -- Virtual Box Ubuntu 14.04 LTS -- Single Node Pseudo Mode
-On Virtual Box Ubuntu 14.04 LTS
1. Install Java
2. Disable IPv6
3. Add a dedicated Hadoop User
4. Install SSH
5. Give hduser Sudo Permission
6. Set up SSH Certificates
7. Install Hadoop
8. Configure Hadoop
   a. .bashrc
   b. hadoop-env.sh
   c. core-site.xml
   d. mapred-site.xml (from mapred-site.xml.template)
   e. hdfs-site.xml
9. Format Hadoop filesystem
10. Start Hadoop
11. Testing that it is running
12. Stopping Hadoop
------------------------------------------------------------------------------------------
1. Install Java
sudo apt-get update
sudo apt-get install default-jdk
java -version
2. Disable IPv6
sudo apt-get install vim
sudo vim /etc/sysctl.conf
# disable ipv6
net.ipv6.conf.all.disable_ipv6 = 1
net.ipv6.conf.default.disable_ipv6 = 1
net.ipv6.conf.lo.disable_ipv6 = 1
sudo sysctl -p ... (reload the settings)
cat /proc/sys/net/ipv6/conf/all/disable_ipv6 ... (should return 1 once IPv6 is disabled)
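The check above can be wrapped in a small helper. This is an illustrative sketch, not part of the original guide; the function name is made up, and the flag-file path is passed as an argument so the logic can be tried against a sample file.

```shell
# Helper (illustrative) that checks whether IPv6 is disabled by reading
# a kernel flag file whose path is given as the first argument.
ipv6_disabled() {
  # succeeds (exit 0) only if the flag file contains 1
  [ "$(cat "$1")" = "1" ]
}

# Against the real kernel flag (value is 1 only after sysctl -p):
# ipv6_disabled /proc/sys/net/ipv6/conf/all/disable_ipv6 && echo "IPv6 disabled"
```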
3. Adding a dedicated Hadoop User
sudo addgroup hadoop
sudo adduser --ingroup hadoop hduser
4. Install SSH
sudo apt-get install ssh
5. Give hduser Sudo Permission
sudo adduser hduser sudo

6. Setup SSH Certificates
su hduser
ssh-keygen -t rsa -P ""
cat $HOME/.ssh/id_rsa.pub >> $HOME/.ssh/authorized_keys
ssh localhost
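The key setup above can be sanity-checked without actually logging in: the public key must appear verbatim in authorized_keys. A minimal sketch, run in a temporary directory so the real ~/.ssh is untouched (assumes ssh-keygen from openssh-client is installed):

```shell
# Rehearse the step-6 key setup in a temp dir, leaving ~/.ssh alone.
dir=$(mktemp -d)
ssh-keygen -t rsa -P "" -f "$dir/id_rsa" -q
cat "$dir/id_rsa.pub" >> "$dir/authorized_keys"

# The appended authorized_keys entry must match the public key exactly:
if cmp -s "$dir/id_rsa.pub" "$dir/authorized_keys"; then
  echo "key installed"
fi
```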
7. Install Hadoop
su hduser
wget http://mirrors.sonic.net/apache/hadoop/common/hadoop-2.6.0/hadoop-2.6.0.tar.gz
tar xvzf hadoop-2.6.0.tar.gz
cd hadoop-2.6.0
sudo mkdir /usr/local/hadoop
sudo mv * /usr/local/hadoop
8. Set up the Configuration files
a. vim ~/.bashrc
#HADOOP VARIABLES START
export JAVA_HOME=/usr/lib/jvm/java-7-openjdk-amd64
export HADOOP_INSTALL=/usr/local/hadoop
export PATH=$PATH:$HADOOP_INSTALL/bin
export PATH=$PATH:$HADOOP_INSTALL/sbin
export HADOOP_MAPRED_HOME=$HADOOP_INSTALL
export HADOOP_COMMON_HOME=$HADOOP_INSTALL
export HADOOP_HDFS_HOME=$HADOOP_INSTALL
export YARN_HOME=$HADOOP_INSTALL
export HADOOP_COMMON_LIB_NATIVE_DIR=$HADOOP_INSTALL/lib/native
export HADOOP_OPTS="-Djava.library.path=$HADOOP_INSTALL/lib"
#HADOOP VARIABLES END
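After editing ~/.bashrc, the new variables take effect only in a fresh shell or after running `source ~/.bashrc`. A quick self-contained check that the exports actually extend PATH (the variables are set inline here so the snippet runs on its own):

```shell
# Verify the PATH additions from the exports above. Setting the
# variables inline makes the snippet self-contained; in practice they
# come from `source ~/.bashrc`.
export HADOOP_INSTALL=/usr/local/hadoop
export PATH=$PATH:$HADOOP_INSTALL/bin:$HADOOP_INSTALL/sbin

case ":$PATH:" in
  *":/usr/local/hadoop/bin:"*) echo "hadoop bin on PATH" ;;
  *) echo "hadoop bin missing from PATH" ;;
esac
```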
b. vim /usr/local/hadoop/etc/hadoop/hadoop-env.sh
export JAVA_HOME=/usr/lib/jvm/java-7-openjdk-amd64
c. sudo mkdir -p /app/hadoop/tmp
sudo chown hduser:hadoop /app/hadoop/tmp
vim /usr/local/hadoop/etc/hadoop/core-site.xml
(add the following properties inside the <configuration> element)
<property>
  <name>hadoop.tmp.dir</name>
  <value>/app/hadoop/tmp</value>
  <description>A base for other temporary directories.</description>
</property>
<property>
  <name>fs.default.name</name>
  <value>hdfs://localhost:54310</value>
  <description>The name of the default file system. A URI whose
  scheme and authority determine the FileSystem implementation. The
  uri's scheme determines the config property (fs.SCHEME.impl) naming
  the FileSystem implementation class. The uri's authority is used to
  determine the host, port, etc. for a filesystem.</description>
</property>
d. cp /usr/local/hadoop/etc/hadoop/mapred-site.xml.template /usr/local/hadoop/etc/hadoop/mapred-site.xml
vim /usr/local/hadoop/etc/hadoop/mapred-site.xml
(add the following property inside the <configuration> element)
<property>
  <name>mapred.job.tracker</name>
  <value>localhost:54311</value>
  <description>The host and port that the MapReduce job tracker runs
  at. If "local", then jobs are run in-process as a single map
  and reduce task.
  </description>
</property>
e. sudo mkdir -p /usr/local/hadoop_store/hdfs/namenode
sudo mkdir -p /usr/local/hadoop_store/hdfs/datanode
sudo chown -R hduser:hadoop /usr/local/hadoop_store
vim /usr/local/hadoop/etc/hadoop/hdfs-site.xml
(add the following properties inside the <configuration> element)
<property>
  <name>dfs.replication</name>
  <value>1</value>
  <description>Default block replication.
  The actual number of replications can be specified when the file is created.
  The default is used if replication is not specified at create time.
  </description>
</property>
<property>
  <name>dfs.namenode.name.dir</name>
  <value>file:/usr/local/hadoop_store/hdfs/namenode</value>
</property>
<property>
  <name>dfs.datanode.data.dir</name>
  <value>file:/usr/local/hadoop_store/hdfs/datanode</value>
</property>
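For reference, here is a sketch of what the finished hdfs-site.xml should look like as a whole, with the properties wrapped in the <configuration> element. It is written with a heredoc instead of vim, and the target path is a temp file so the sketch is safe to run; in the real setup the path would be /usr/local/hadoop/etc/hadoop/hdfs-site.xml.

```shell
# Write a complete hdfs-site.xml (illustrative; real target is
# /usr/local/hadoop/etc/hadoop/hdfs-site.xml, here a temp file).
target=$(mktemp)
cat > "$target" <<'EOF'
<?xml version="1.0"?>
<configuration>
  <property>
    <name>dfs.replication</name>
    <value>1</value>
  </property>
  <property>
    <name>dfs.namenode.name.dir</name>
    <value>file:/usr/local/hadoop_store/hdfs/namenode</value>
  </property>
  <property>
    <name>dfs.datanode.data.dir</name>
    <value>file:/usr/local/hadoop_store/hdfs/datanode</value>
  </property>
</configuration>
EOF

grep -c '<property>' "$target"   # three property blocks
```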
9. Format Hadoop filesystem
hdfs namenode -format
(the older "hadoop namenode -format" also works in 2.6, but prints a deprecation warning)
10. Starting Hadoop
su hduser
sudo chown -R hduser:hadoop /usr/local/hadoop/
cd /usr/local/hadoop/sbin
start-all.sh
(start-all.sh is deprecated in Hadoop 2.x; running start-dfs.sh and then start-yarn.sh is equivalent)

11. Testing that it is running

jps
netstat -plten | grep java
http://localhost:50070/
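If everything started, jps should list the HDFS daemons (NameNode, DataNode, SecondaryNameNode) and the YARN daemons (ResourceManager, NodeManager). A small helper, not part of the original guide, that scans a jps-style listing for those five names; it reads the listing from stdin so it can be tried on sample text as well as live jps output.

```shell
# Check a `jps`-style listing (lines of "<pid> <name>") for the five
# daemons expected on a single-node Hadoop 2.6 install.
check_daemons() {
  listing=$(cat)
  for d in NameNode DataNode SecondaryNameNode ResourceManager NodeManager; do
    # anchor at end of line so "NameNode" does not match "SecondaryNameNode"
    if ! printf '%s\n' "$listing" | grep -q " $d\$"; then
      echo "missing: $d"
      return 1
    fi
  done
  echo "all daemons running"
}

# Live usage: jps | check_daemons
```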
12. Stopping Hadoop
stop-all.sh
