
Experiment-1

Installation of Hadoop Single Node Cluster

Infrastructure-as-a-service (IaaS)
• With IaaS, you rent IT infrastructure from a cloud provider on a pay-as-you-go basis: servers and virtual machines (VMs), storage, networks, and operating systems.

Platform as a service (PaaS)


• Platform-as-a-service (PaaS) supplies an on-demand environment for developing, testing, delivering, and managing software applications.

• PaaS is designed to make it easier for developers to quickly create web or mobile apps, without worrying about setting up or managing the underlying infrastructure of servers, storage, networks, and databases needed for development.

Software as a service (SaaS)
• Software-as-a-service (SaaS) is a model for delivering software applications over the Internet, on demand and typically on a subscription basis.

• With SaaS, cloud providers host and manage the software application and the underlying infrastructure, and handle any maintenance, such as software upgrades.

VMware Workstation
• VMware Workstation is a hosted hypervisor that runs on x64
versions of Windows and Linux operating systems.
• It enables users to set up virtual machines on a single physical machine and use them simultaneously alongside the host machine.

REQUIREMENTS

• A laptop or desktop with a 32- or 64-bit Windows operating system (a Linux guest OS is used inside the VM)

• Minimum of 2 GB RAM

• Minimum of 100 GB hard disk

• At minimum, a VGA display is required

Installation of Hadoop Single Node Cluster: Configuration Steps

1. Downloading the required software
2. Untarring the software
3. Bashrc configuration
4. Hadoop configuration files
5. Sharing the public key with localhost
6. Formatting the NameNode
7. Starting the Hadoop daemons
8. Checking the running Hadoop daemons

STEP-1
SOFTWARE REQUIRED

• hadoop-1.2.0-bin.tar.gz
• jdk-7u67-linux-i586.tar.gz

Link for JDK:
http://www.oracle.com/technetwork/java/javase/downloads/index.html

Link for Hadoop:
http://mirror.fibergrid.in/apache/hadoop/common/hadoop-1.2.0/
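If the Apache mirror above is still reachable, the Hadoop archive can also be fetched directly from the terminal (a sketch only; the JDK must be downloaded through Oracle's page because it requires accepting a licence):

wget -P ~/Desktop http://mirror.fibergrid.in/apache/hadoop/common/hadoop-1.2.0/hadoop-1.2.0-bin.tar.gz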

STEP-2
Untar the software
• Open the terminal window

• Type the command: cd Desktop

• Type the ls command to list the downloaded archives

• tar -zxvf jdk-7u67-linux-i586.tar.gz

• tar -zxvf hadoop-1.2.0-bin.tar.gz
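After extraction, listing the Desktop should show the two unpacked directories referenced in the next step (directory names assumed to match the archive versions above):

ls ~/Desktop
(expected to include jdk1.7.0_67 and hadoop-1.2.0)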

Step 3

Bashrc configurations
Open the terminal type the command
sudo gedit ~/.bashrc

export JAVA_HOME=/home/user/Desktop/jdk1.7.0_67
export PATH=$PATH:$JAVA_HOME/bin
export HADOOP_HOME=/home/user/Desktop/hadoop-1.2.0
export PATH=$PATH:$HADOOP_HOME/bin

Open a new terminal (or run source ~/.bashrc) and check the Java and Hadoop versions.
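In the new terminal, both versions can be checked with the following commands, which should succeed if the paths exported in ~/.bashrc match the directories extracted on the Desktop:

java -version
hadoop version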

Step-4
Hadoop Configuration Files

1. core-site.xml

(Configuration settings for Hadoop core, such as I/O settings that are common to HDFS and MapReduce)

2. hdfs-site.xml

(Configuration settings for the HDFS daemons: the NameNode, Secondary NameNode, and DataNodes, as well as the replication factor)

3. mapred-site.xml

(Configuration settings for the MapReduce daemons: the JobTracker and the TaskTrackers)

4. hadoop-env.sh

(Environment variables that are used in the scripts that run Hadoop)

• Open a new terminal window
• Go to the Hadoop folder:
• cd Desktop
• cd hadoop-1.2.0 (press Tab to complete the path)
• cd conf
• Type ls
• All of the files mentioned above are listed

• Go to terminal and type

sudo gedit core-site.xml


<configuration>
<property>
<name>fs.default.name</name>
<value>hdfs://localhost:9000</value>
</property>
</configuration>
Note: the value is a URI of the form hdfs://<host>:<port>. Here the NameNode is addressed by the loopback host localhost (127.0.0.1) on port 9000; any reachable IP address or hostname and an unused port would work as well.

• Go to terminal and type
sudo gedit hdfs-site.xml
<configuration>
<property>
<name>dfs.replication</name>
<value>1</value>
</property>
<property>
<name>dfs.name.dir</name>
<value>/home/user/Desktop/hadoop-1.2.0/name/data</value>
</property>
</configuration>

• Go to terminal and type

sudo gedit mapred-site.xml


<configuration>
<property>
<name>mapred.job.tracker</name>
<value>hdfs://localhost:9001</value>
</property>
</configuration>

• Go to terminal and type

sudo gedit hadoop-env.sh

export JAVA_HOME=/home/user/Desktop/jdk1.7.0_67
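Optionally, the same file can silence the "$HADOOP_HOME is deprecated" warning that Hadoop 1.x prints at start-up (not required for the daemons to run):

export HADOOP_HOME_WARN_SUPPRESS=1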

STEP-5
SSH Configuration
• sudo apt-get install ssh
• ssh-keygen -t rsa -P ""

Sharing the public key with the host

• ssh-copy-id -i ~/.ssh/id_rsa.pub user@ubuntuvm

• Then check: ssh localhost (it should not ask for a password)
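If ssh-copy-id is not available, the public key can be appended by hand instead (an equivalent sketch, assuming the default key path created by the ssh-keygen command above):

cat ~/.ssh/id_rsa.pub >> ~/.ssh/authorized_keys
chmod 600 ~/.ssh/authorized_keys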

STEP 6
Formatting the NameNode
• $ hadoop namenode -format

• Formats a new distributed file system

• The process creates an empty file system and initializes the storage directories
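After a successful format, the directory configured as dfs.name.dir should contain the new file system metadata (an optional check; the current subdirectory is the Hadoop 1.x default layout and may differ in other versions):

ls /home/user/Desktop/hadoop-1.2.0/name/data/current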

STEP-7&8
Starting Hadoop Daemons
Open the terminal window and type

$ start-all.sh

and type

$ jps

user@ubuntuvm:~$ start-all.sh
Warning: $HADOOP_HOME is deprecated.
starting namenode, logging to /home/user/Downloads/hadoop-1.2.0/libexec/../logs/hadoop-user-namenode-ubuntuvm.out
localhost: datanode running as process 2978.
localhost: secondarynamenode running as process 3123.
jobtracker running as process 3204.
localhost: tasktracker running as process 3342.
user@ubuntuvm:~$ jps
4020 Jps
3342 TaskTracker
3204 JobTracker
3123 SecondaryNameNode
3606 NameNode
2978 DataNode
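As a further check, the Hadoop 1.x web interfaces should be reachable once the daemons are up (the ports below are the 1.x defaults and may differ if reconfigured): the NameNode UI at http://localhost:50070 and the JobTracker UI at http://localhost:50030. When finished, the daemons can be stopped with:

$ stop-all.sh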

