
Apasoft Training

Big Data Labs
1. Working with HDFS
1.1. The hdfs dfs command
• Run the "hdfs dfs" command. This command lets you work with the files stored in HDFS.
• Almost all of its options are similar to the equivalent Linux commands
hdfs dfs
Usage: hadoop fs [generic options]
[-appendToFile <localsrc> ... <dst>]
[-cat [-ignoreCrc] <src> ...]
[-checksum <src> ...]
[-chgrp [-R] GROUP PATH...]
[-chmod [-R] <MODE[,MODE]... | OCTALMODE> PATH...]
[-chown [-R] [OWNER][:[GROUP]] PATH...]
[-copyFromLocal [-f] [-p] [-l] [-d] <localsrc> ... <dst>]
[-copyToLocal [-f] [-p] [-ignoreCrc] [-crc] <src> ... <localdst>]
[-count [-q] [-h] [-v] [-t [<storage type>]] [-u] [-x] <path> ...]
[-cp [-f] [-p | -p[topax]] [-d] <src> ... <dst>]
[-createSnapshot <snapshotDir> [<snapshotName>]]
[-deleteSnapshot <snapshotDir> <snapshotName>]
[-df [-h] [<path> ...]]
[-du [-s] [-h] [-x] <path> ...]
[-expunge]
[-find <path> ... <expression> ...]
[-get [-f] [-p] [-ignoreCrc] [-crc] <src> ... <localdst>]
[-getfacl [-R] <path>]
[-getfattr [-R] {-n name | -d} [-e en] <path>]
[-getmerge [-nl] [-skip-empty-file] <src> <localdst>]
[-help [cmd ...]]
[-ls [-C] [-d] [-h] [-q] [-R] [-t] [-S] [-r] [-u] [<path> ...]]
[-mkdir [-p] <path> ...]
[-moveFromLocal <localsrc> ... <dst>]
[-moveToLocal <src> <localdst>]
[-mv <src> ... <dst>]


[-put [-f] [-p] [-l] [-d] <localsrc> ... <dst>]


[-renameSnapshot <snapshotDir> <oldName> <newName>]
[-rm [-f] [-r|-R] [-skipTrash] [-safely] <src> ...]
[-rmdir [--ignore-fail-on-non-empty] <dir> ...]
[-setfacl [-R] [{-b|-k} {-m|-x <acl_spec>} <path>]|[--set <acl_spec> <path>]]
[-setfattr {-n name [-v value] | -x name} <path>]
[-setrep [-R] [-w] <rep> <path> ...]
[-stat [format] <path> ...]
[-tail [-f] <file>]
[-test -[defsz] <path>]
[-text [-ignoreCrc] <src> ...]
[-touchz <path> ...]
[-truncate [-w] <length> <path> ...]
[-usage [cmd ...]]

Generic options supported are:


-conf <configuration file> specify an application configuration file
-D <property=value> define a value for a given property
-fs <file:///|hdfs://namenode:port> specify default filesystem URL to use, overrides
'fs.defaultFS' property from configurations.
-jt <local|resourcemanager:port> specify a ResourceManager
-files <file1,...> specify a comma-separated list of files to be copied to the
map reduce cluster
-libjars <jar1,...> specify a comma-separated list of jar files to be included
in the classpath
-archives <archive1,...> specify a comma-separated list of archives to be
unarchived on the compute machines

The general command line syntax is:


command [genericOptions] [commandOptions]
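• As a quick sketch of how a generic option combines with a subcommand, -D overrides a configuration property for a single invocation (the replication factor of 2 and the placeholders are only illustrative):
hdfs dfs -D dfs.replication=2 -put <localsrc> <dst>   # upload a file with replication 2 for this command only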
• Let's look at the contents of our HDFS. At this point it should be empty
hdfs dfs -ls /
• We can also check that it is empty from the administration web UI, under the menu Utilities -> Browse the file system
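• The same listing is also available through the WebHDFS REST API (a sketch; it assumes the NameNode web UI listens on the Hadoop 2.x default port, 50070):
curl -s "http://localhost:50070/webhdfs/v1/?op=LISTSTATUS"   # JSON listing of the HDFS root directory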


• Let's create a new directory


hdfs dfs -mkdir /datos
• Check that it exists
hdfs dfs -ls /
Found 1 items
drwxr-xr-x - hadoop supergroup 0 2018-01-06 18:31 /datos
• We can also see it in the web UI

• Create a file in the /tmp directory containing a short sentence


echo "Esto es una prueba" >/tmp/prueba.txt
• Copy it into HDFS, specifically into the /datos directory, using the "put" command
hdfs dfs -put /tmp/prueba.txt /datos


• Check that it is there
hdfs dfs -ls /datos
Found 1 items
-rw-r--r-- 1 hadoop supergroup 19 2018-01-06 18:34 /datos/prueba.txt
• We can also see it in the web UI, where we can check its replication factor and its size.

• Display its contents
hdfs dfs -cat /datos/prueba.txt
Esto es una prueba
• Let's check what HDFS has created at block level.
• Go to the web UI and click on the file name.
• Something similar to the following should appear


• We can see that only one block has been created: the default HDFS block size is 128 MB, so our small file only needs one.
• The page also shows the BLOCK_ID and the nodes where the replicas were created. Since our replication factor is 1, only nodo1 appears. When we get to the full cluster we will see more nodes
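• The same block and replica information can be obtained from the terminal with the fsck tool (a sketch, using the file created above):
hdfs fsck /datos/prueba.txt -files -blocks -locations   # show blocks, their sizes and the DataNodes holding each replica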
• Back in the operating system, change to the following directory. Obviously the BP-XXXX subdirectory will be different in your case; it corresponds to the Block Pool ID that Hadoop generates automatically.
/datos/datanode/current/BP-344905797-192.168.56.101-1515254230192/current/finalized
• Inside this subdirectory, Hadoop builds a tree of subdirectories that holds the data blocks, with the format subdirN/subdirN, in this case subdir0/subdir0.
• Change into it.
cd subdir0/
cd subdir0/
ls -l
total 8
-rw-rw-r--. 1 hadoop hadoop 19 ene 6 18:34 blk_1073741825
-rw-rw-r--. 1 hadoop hadoop 11 ene 6 18:34 blk_1073741825_1001.meta
• We can see that there are two files with the same BLOCK_ID shown on the web page.
o One contains the data

www.apasoft-training.com 5
Apasoft Training

o The other contains its metadata


• We can verify this by displaying its contents
cat blk_1073741825
Esto es una prueba
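• If you do not want to walk the subdirN tree by hand, a simple find over the DataNode data directory used in this lab locates the block files directly:
find /datos/datanode/current -name 'blk_*' -type f   # every block file and .meta file stored on this DataNode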
• Of course, when we have very large files, or files that are not plain text, this is of no practical use; we only do it here to understand HDFS properly.
• Let's build another example with a large file.
• Run the following command to generate a 1 GB file in /tmp, called fic_grande.dat, filled with zeros (the Linux dd command can do this, among many other things)
dd if=/dev/zero of=/tmp/fic_grande.dat bs=1024 count=1000000
1000000+0 records in
1000000+0 records out
1024000000 bytes (1.0 GB) copied, 5.1067 s, 201 MB/s
• Upload it to the /datos directory of our HDFS
hdfs dfs -put /tmp/fic_grande.dat /datos
• In the web UI we can check that it has been split into multiple 128 MB blocks
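• The block size can also be overridden per upload with the generic -D option seen earlier (a sketch; 67108864 bytes = 64 MB is just an illustrative value, and the destination name is made up for the example):
hdfs dfs -D dfs.blocksize=67108864 -put /tmp/fic_grande.dat /datos/fic_grande_64m.dat   # same file stored as 64 MB blocks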


• If we look at the subdir0 directory again, we can see the corresponding blocks
ls -l
total 1007852
-rw-rw-r--. 1 hadoop hadoop 19 ene 6 18:34 blk_1073741825
-rw-rw-r--. 1 hadoop hadoop 11 ene 6 18:34 blk_1073741825_1001.meta
-rw-rw-r--. 1 hadoop hadoop 134217728 ene 6 18:59 blk_1073741826
-rw-rw-r--. 1 hadoop hadoop 1048583 ene 6 18:59 blk_1073741826_1002.meta
-rw-rw-r--. 1 hadoop hadoop 134217728 ene 6 18:59 blk_1073741827
-rw-rw-r--. 1 hadoop hadoop 1048583 ene 6 18:59 blk_1073741827_1003.meta
-rw-rw-r--. 1 hadoop hadoop 134217728 ene 6 18:59 blk_1073741828
-rw-rw-r--. 1 hadoop hadoop 1048583 ene 6 18:59 blk_1073741828_1004.meta
-rw-rw-r--. 1 hadoop hadoop 134217728 ene 6 18:59 blk_1073741829
-rw-rw-r--. 1 hadoop hadoop 1048583 ene 6 18:59 blk_1073741829_1005.meta
-rw-rw-r--. 1 hadoop hadoop 134217728 ene 6 18:59 blk_1073741830
-rw-rw-r--. 1 hadoop hadoop 1048583 ene 6 18:59 blk_1073741830_1006.meta
-rw-rw-r--. 1 hadoop hadoop 134217728 ene 6 18:59 blk_1073741831
-rw-rw-r--. 1 hadoop hadoop 1048583 ene 6 18:59 blk_1073741831_1007.meta
-rw-rw-r--. 1 hadoop hadoop 134217728 ene 6 18:59 blk_1073741832
-rw-rw-r--. 1 hadoop hadoop 1048583 ene 6 18:59 blk_1073741832_1008.meta
-rw-rw-r--. 1 hadoop hadoop 84475904 ene 6 19:00 blk_1073741833
-rw-rw-r--. 1 hadoop hadoop 659975 ene 6 19:00 blk_1073741833_1009.meta

• Let's create another directory called "practicas"


hdfs dfs -mkdir /practicas
• Copy prueba.txt from /datos to /practicas
hdfs dfs -cp /datos/prueba.txt /practicas/prueba.txt
• Check the contents
hdfs dfs -ls /practicas
Found 1 items
-rw-r--r-- 1 hadoop supergroup 19 2018-01-06 19:08 /practicas/prueba.txt
• Delete the file
hdfs dfs -rm /practicas/prueba.txt
Deleted /practicas/prueba.txt
• As you can see, the commands are very similar to their Linux counterparts. A few more are sketched below.
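• Some other everyday, read-only operations that follow the same Linux-like pattern (a sketch; the paths reuse the directories created in this lab):
hdfs dfs -du -h /datos                   # per-file space usage, human readable
hdfs dfs -df -h /                        # overall HDFS capacity and free space
hdfs dfs -stat %r /datos/prueba.txt      # replication factor of a single file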


1.2. Our first Hadoop job

• Let's run our first Hadoop job; we will look at it in more detail later on.
• Hadoop ships with a number of examples, found in the following file (mind the version number)
/opt/hadoop/share/hadoop/mapreduce/hadoop-mapreduce-examples-2.9.0.jar

• To launch a Hadoop MapReduce job we use the command

hadoop jar library.jar program
• In this case, to see which programs that "jar" contains, run the following without a program name at the end
hadoop jar /opt/hadoop/share/hadoop/mapreduce/hadoop-mapreduce-examples-2.9.0.jar
An example program must be given as the first argument.
Valid program names are:
aggregatewordcount: An Aggregate based map/reduce program that counts the
words in the input files.
aggregatewordhist: An Aggregate based map/reduce program that computes the
histogram of the words in the input files.
bbp: A map/reduce program that uses Bailey-Borwein-Plouffe to compute exact
digits of Pi.
dbcount: An example job that count the pageview counts from a database.
distbbp: A map/reduce program that uses a BBP-type formula to compute exact bits
of Pi.
grep: A map/reduce program that counts the matches of a regex in the input.
join: A job that effects a join over sorted, equally partitioned datasets
multifilewc: A job that counts words from several files.
pentomino: A map/reduce tile laying program to find solutions to pentomino
problems.
pi: A map/reduce program that estimates Pi using a quasi-Monte Carlo method.
randomtextwriter: A map/reduce program that writes 10GB of random textual data
per node.
randomwriter: A map/reduce program that writes 10GB of random data per node.
secondarysort: An example defining a secondary sort to the reduce.
sort: A map/reduce program that sorts the data written by the random writer.
sudoku: A sudoku solver.
teragen: Generate data for the terasort
terasort: Run the terasort
teravalidate: Checking results of terasort
wordcount: A map/reduce program that counts the words in the input files.


wordmean: A map/reduce program that counts the average length of the words in
the input files.
wordmedian: A map/reduce program that counts the median length of the words in
the input files.
wordstandarddeviation: A map/reduce program that counts the standard deviation
of the length of the words in the input files.
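• Any of these programs can be run by appending its name and arguments after the jar. For example, the pi estimator makes a quick smoke test of the installation (its two arguments are the number of map tasks and the number of samples per map; the small values here are only illustrative):
hadoop jar /opt/hadoop/share/hadoop/mapreduce/hadoop-mapreduce-examples-2.9.0.jar pi 2 10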
• We can see there is a program called "wordcount".
• It counts the words that appear in one or more files.
• Create a couple of files containing words (some repeated), for example as sketched below, and upload them to that directory
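• The lab does not show the contents of the two files; one possibility consistent with the word counts printed at the end of this section is:
echo "Esto es una prueba con el primer fichero" > /tmp/palabras.txt    # first sample file
echo "esto es una prueba con el segundo fichero" > /tmp/palabras1.txt  # second sample file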
hdfs dfs -put /tmp/palabras.txt /practicas
hdfs dfs -put /tmp/palabras1.txt /practicas
• Launch the job. The last two arguments are the input directory and the output directory, which must not exist yet.
hadoop jar /opt/hadoop/share/hadoop/mapreduce/hadoop-mapreduce-examples-2.9.0.jar wordcount /practicas /salida
INFO mapreduce.Job: Counters: 38
File System Counters
FILE: Number of bytes read=812740
FILE: Number of bytes written=1578775
FILE: Number of read operations=0
FILE: Number of large read operations=0
FILE: Number of write operations=0
HDFS: Number of bytes read=211
HDFS: Number of bytes written=74
HDFS: Number of read operations=25
HDFS: Number of large read operations=0
HDFS: Number of write operations=5
Map-Reduce Framework
Map input records=2
Map output records=16
Map output bytes=147
Map output materialized bytes=191
Input split bytes=219
Combine input records=16
Combine output records=16
Reduce input groups=10
Reduce shuffle bytes=191
Reduce input records=16
Reduce output records=10
Spilled Records=32
Shuffled Maps =2


Failed Shuffles=0
Merged Map outputs=2
GC time elapsed (ms)=131
CPU time spent (ms)=0
Physical memory (bytes) snapshot=0
Virtual memory (bytes) snapshot=0
Total committed heap usage (bytes)=549138432
Shuffle Errors
BAD_ID=0
CONNECTION=0
IO_ERROR=0
WRONG_LENGTH=0
WRONG_MAP=0
WRONG_REDUCE=0
File Input Format Counters
Bytes Read=84
File Output Format Counters
Bytes Written=74

• We can now look at the contents of the output directory


hdfs dfs -ls /salida
Found 2 items
-rw-r--r-- 1 hadoop supergroup 0 2015-04-20 07:52 /salida/_SUCCESS
-rw-r--r-- 1 hadoop supergroup 74 2015-04-20 07:52 /salida/part-r-00000
[hadoop@localhost ~]$ hadoop fs -cat /salida/part-r-00000
Esto 1
con 2
el 2
es 2
esto 1
fichero 2
primer 1
prueba 2
segundo 1
una 2
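• To bring the result back to the local filesystem as a single file, getmerge concatenates all the part-r-* files in the output directory (a sketch; the local destination path is just an example):
hdfs dfs -getmerge /salida /tmp/salida_wordcount.txt   # merge every output part into one local file
cat /tmp/salida_wordcount.txt                          # same word counts, now on the local disk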
