0% found this document useful (0 votes)

28 views

Practical 2 Hadoop Distributed File System (HDFS)

Uploaded by

black hello

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

28 views

Practical 2 Hadoop Distributed File System (HDFS)

Uploaded by

black hello

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

You are on page 1/ 4

1 of 4

BMCS2013 DATA ENGINEERING

PRACTICAL 2 Hadoop Distributed File System (HDFS)

1. Launch the Ubuntu-22.04-de distro:

In PowerShell (run as administrator), launch the distro for this course:
PS C:\Users\TARUMT> wsl ~
hduser@PC25:~$

~ As the user hduser ~

2. Start HDFS and YARN
2.1. Start the HDFS service
hduser@PC25:~$ start-dfs.sh
Starting namenodes on [localhost]
Starting datanodes
Starting secondary namenodes [PC25]

To check the services currently running, use the jps command:

hduser@PC25:~$ jps
1392 Jps
1114 SecondaryNameNode
876 DataNode
685 NameNode

2.2. Start the YARN service

hduser@PC25:~$ start-yarn.sh
Starting resourcemanager
Starting nodemanagers

hduser@PC25:~$ jps
2112 Jps
1696 NodeManager
1542 ResourceManager
1114 SecondaryNameNode
876 DataNode
685 NameNode

FYI only, the following actions have already been completed in the distro:
# Create the directories named user and tmp in the distributed file system:
# The /user directory is where all Hadoop users’ home directories will be created later on.
hduser@PC25:~$ hdfs dfs -mkdir /user
hduser@PC25:~$ hdfs dfs -mkdir /tmp

# Give full permissions for all users to the tmp directory:

hduser@PC25:~$ hdfs dfs -chmod -R 777 /tmp
2 of 4

3. Create User Directories in HDFS

3.1. Create a HDFSuser directory for student:
hduser@PC25:~$ hdfs dfs -mkdir /user/student

3.2. Change ownership for the newly created directory:

hduser@PC25:~$ hdfs dfs -chown student:hduser /user/student

Note (FYI only):

HDFS file permissions are similar to Linux file permissions.

E.g., to change the permission of the file shakespeare.txt to 664:
$ hdfs dfs -chmod 664 shakespeare.txt
where 664 is an octal representation of the flags to set for the permission triple.
The above statement changes the permissions to -rw-rw-r--:
● 6 is 110, which means read and write, but not execute.
● 7 is 111, which means complete permissions.
● 4 is 100, which means read-only.

~ As the user student ~

4. Switch user to student

hduser@PC25:~$ su - student
student@PC25:~$

5. HDFS Basic File System Operations

5.1. See the available commands in the dfs shell
student@PC25:~$ hdfs dfs -help

5.2. Download the shakespeare.txt file from Google Drive into your local file
system (Ubuntu 22.04)
student@PC25:~$ wget --no-check-certificate
'https://docs.google.com/uc?
export=download&id=122PnuKaSaA_OyYOKnxQOdlMc5awdyf5v' -O
shakespeare.txt

💡 Remember to confirm that the above action is successful.

5.3. Copy the downloaded file shakespeare.txt from the local file system to HDFS
student@PC25:~$ hdfs dfs -put shakespeare.txt shakespeare.txt

💡 Remember to confirm that the above action is successful.

3 of 4
4 of 4

5.4. Read the contents of the file in HDFS using the cat command, and then pipe the
output to less in order to view the contents of the remote file.
student@PC25:~$ hdfs dfs -cat shakespeare.txt | less

Note: use the arrow keys to navigate the file. Type q to quit.

5.5. Copy the file from HDFS to the local file system and rename it as shakespeare-
dfs.txt.
student@PC25:~$ hdfs dfs -get shakespeare.txt ./shakespeare-
dfs.txt
💡 Remember to confirm that the above action is successful.

6. To end your practical sessions

6.1. Logout from the student account
student@PC25:~$ exit
hduser@PC25:~$ su - student

~ As the user hduser ~

6.2. Terminate the YARN service
hduser@PC25:~$ stop-yarn.sh

6.3. Terminate the HDFS service

hduser@PC25:~$ stop-dfs.sh

6.4. Logout from the hduser account

hduser@PC25:~$ exit
PS C:\Users\TARUMT>

6.5. Terminate the WSL instance

PS C:\Users\TARUMT> exit

Other HDFS Commands

Recall that the HDFS shell commands are similar to POSIX-like commands and invoked using:
$ hdfs dfs <args> <command>

Other HDFS commands include:

cat chown ls rm
chgrp cp mkdir stat
chmod du mv tail

Black Belt - Intersight - Presales - Stage 2 - Quiz 1
No ratings yet
Black Belt - Intersight - Presales - Stage 2 - Quiz 1
4 pages
Statement of Work: (Insert Project Name)
No ratings yet
Statement of Work: (Insert Project Name)
9 pages
Pspice Mosfets
No ratings yet
Pspice Mosfets
6 pages
CS50 Quiz 1 Cheat Sheet
No ratings yet
CS50 Quiz 1 Cheat Sheet
2 pages
Unit 2-HDFS SGS
No ratings yet
Unit 2-HDFS SGS
29 pages
Data Storage Data Processing: Hadoop Distributed File System (HDFS) Mapreduce
No ratings yet
Data Storage Data Processing: Hadoop Distributed File System (HDFS) Mapreduce
35 pages
lab2_BD
No ratings yet
lab2_BD
20 pages
Amrita CC 3.1
No ratings yet
Amrita CC 3.1
7 pages
Hadoop Distributed File System HDFS 1688981751
No ratings yet
Hadoop Distributed File System HDFS 1688981751
49 pages
Hadoop Distributed File System
No ratings yet
Hadoop Distributed File System
7 pages
Week 1 in Terminal
No ratings yet
Week 1 in Terminal
10 pages
Hadoop Tutorial
No ratings yet
Hadoop Tutorial
13 pages
HDFS (Hadoop Distributed File System) : HDFS Architecture Components of The Architecture
No ratings yet
HDFS (Hadoop Distributed File System) : HDFS Architecture Components of The Architecture
10 pages
Hands-On Hadoop Tutorial
100% (1)
Hands-On Hadoop Tutorial
13 pages
Lab2_BigData-HDFSp
No ratings yet
Lab2_BigData-HDFSp
4 pages
Quick Configuration of Openldap and Kerberos in Linux and Authenicating Linux to Active Directory
From Everand
Quick Configuration of Openldap and Kerberos in Linux and Authenicating Linux to Active Directory
Dr. Hidaia Mahmood Alassouli
No ratings yet
Extreme Computing Lab Exercises Session One: 1 Getting Started
No ratings yet
Extreme Computing Lab Exercises Session One: 1 Getting Started
6 pages
Deepshikha Agrawal Pushp B.Sc. (IT), MBA (IT) Certification-Hadoop, Spark, Scala, Python, Tableau, ML (Assistant Professor JLBS)
No ratings yet
Deepshikha Agrawal Pushp B.Sc. (IT), MBA (IT) Certification-Hadoop, Spark, Scala, Python, Tableau, ML (Assistant Professor JLBS)
74 pages
Hadoop1
No ratings yet
Hadoop1
15 pages
Big Data AnalyticUnit2
No ratings yet
Big Data AnalyticUnit2
19 pages
Unit 3.1
No ratings yet
Unit 3.1
88 pages
l2 Hdfs and Mapreduce Model 2022s2
No ratings yet
l2 Hdfs and Mapreduce Model 2022s2
52 pages
05 - Introduction To HDFS
No ratings yet
05 - Introduction To HDFS
27 pages
PDC All Labs
100% (1)
PDC All Labs
129 pages
Lista de Comandos HDFS
No ratings yet
Lista de Comandos HDFS
8 pages
Running Hadoop On Ubuntu Linux
No ratings yet
Running Hadoop On Ubuntu Linux
15 pages
Hadoop-HDFS-commands
No ratings yet
Hadoop-HDFS-commands
1 page
Exp3 BDI 60004200124
No ratings yet
Exp3 BDI 60004200124
5 pages
Hadoop Linux Commands
No ratings yet
Hadoop Linux Commands
8 pages
Big-Data Computing: Hadoop Distributed File System: B. Ramamurthy
No ratings yet
Big-Data Computing: Hadoop Distributed File System: B. Ramamurthy
43 pages
Configuration of a Simple Samba File Server, Quota and Schedule Backup
From Everand
Configuration of a Simple Samba File Server, Quota and Schedule Backup
Dr. Hidaia Mahmood Alassouli
No ratings yet
10 Dfs
No ratings yet
10 Dfs
5 pages
HadoopfilePP
No ratings yet
HadoopfilePP
83 pages
CC Hadoop Lab
No ratings yet
CC Hadoop Lab
6 pages
BDA UNIT -3 Updated (1).docx
No ratings yet
BDA UNIT -3 Updated (1).docx
25 pages
Linux Commands By Example
From Everand
Linux Commands By Example
Khaled Jamal
4.5/5 (3)
5-Practicas+BigData Trabajar Hdfs
No ratings yet
5-Practicas+BigData Trabajar Hdfs
10 pages
Chapter 4 - Hadoop Ecosystem
No ratings yet
Chapter 4 - Hadoop Ecosystem
24 pages
Hadoop Single Node Cluster Setup Steps
No ratings yet
Hadoop Single Node Cluster Setup Steps
7 pages
BDA Lab Assignment 1 PDF
No ratings yet
BDA Lab Assignment 1 PDF
20 pages
BDT - Unit - II - Hdfs and Hadoop Io
No ratings yet
BDT - Unit - II - Hdfs and Hadoop Io
42 pages
HDFS File System Shell Guide
No ratings yet
HDFS File System Shell Guide
10 pages
2 HDFS Commands
No ratings yet
2 HDFS Commands
7 pages
Installing Standalone and Pseudocode Hadoop Cluster: 1. Setting Up Vmware Virtual Machine
No ratings yet
Installing Standalone and Pseudocode Hadoop Cluster: 1. Setting Up Vmware Virtual Machine
14 pages
3_HDFS-Hive-HBase-Pig
No ratings yet
3_HDFS-Hive-HBase-Pig
8 pages
Exp1 Bda
No ratings yet
Exp1 Bda
11 pages
3 Hadoop
No ratings yet
3 Hadoop
40 pages
Big Data Analytics Lab Experiments
No ratings yet
Big Data Analytics Lab Experiments
16 pages
Exp-2 Hadoop Commands
No ratings yet
Exp-2 Hadoop Commands
6 pages
Bda Lab
No ratings yet
Bda Lab
37 pages
Hadoop Installation Cluster
No ratings yet
Hadoop Installation Cluster
9 pages
1) Discuss The Design of Hadoop Distributed File System (HDFS) and Concept in Detail
No ratings yet
1) Discuss The Design of Hadoop Distributed File System (HDFS) and Concept in Detail
11 pages
Hadoop Hdfs Commands
No ratings yet
Hadoop Hdfs Commands
2 pages
3rd Units Big Data
No ratings yet
3rd Units Big Data
6 pages
IntroductionHPC_Session03_Mar23
No ratings yet
IntroductionHPC_Session03_Mar23
52 pages
bigdatamanual(2)
No ratings yet
bigdatamanual(2)
45 pages
Steps To Install Hadoop 2.x Release (Yarn or Next-Gen) On Single Node Cluster Setup
No ratings yet
Steps To Install Hadoop 2.x Release (Yarn or Next-Gen) On Single Node Cluster Setup
7 pages
Hadoop Lab
100% (2)
Hadoop Lab
6 pages
bos.adt.lib bos.adt.libm bos.adt.syscalls bos.rte.SRC bos.rte.libc bos.rte.libcfg bos.rte.libcur bos.rte.libpthreads bos.rte.odm 如果您要安装并行的资源组，还要安装下面的包： bos.rte.lvm.rte5.1.0.25 or higher bos.clvm.enh
No ratings yet
bos.adt.lib bos.adt.libm bos.adt.syscalls bos.rte.SRC bos.rte.libc bos.rte.libcfg bos.rte.libcur bos.rte.libpthreads bos.rte.odm 如果您要安装并行的资源组，还要安装下面的包： bos.rte.lvm.rte5.1.0.25 or higher bos.clvm.enh
16 pages
Wa Introhdfs PDF
No ratings yet
Wa Introhdfs PDF
11 pages
Lecture 2
No ratings yet
Lecture 2
70 pages
Hadoop
No ratings yet
Hadoop
51 pages
Big-Data Computing: Hadoop Distributed File System: B. Ramamurthy
No ratings yet
Big-Data Computing: Hadoop Distributed File System: B. Ramamurthy
45 pages
Bash Command Line Pro Tips
From Everand
Bash Command Line Pro Tips
Jason Cannon
4.5/5 (8)
Chap01 - Intro to Programming
No ratings yet
Chap01 - Intro to Programming
37 pages
Guide to Install Visual Studio 2019
No ratings yet
Guide to Install Visual Studio 2019
3 pages
Setup - Firebase
No ratings yet
Setup - Firebase
9 pages
Chapter 6 Network Layer_July 2023
No ratings yet
Chapter 6 Network Layer_July 2023
58 pages
Chapter 6 - Multimedia Element Video
No ratings yet
Chapter 6 - Multimedia Element Video
44 pages
Chapter 2 Network Protocols _ Communication_July 2023
No ratings yet
Chapter 2 Network Protocols _ Communication_July 2023
56 pages
Chapter 4 Data Link Layer (OSI Model)_July 2023
No ratings yet
Chapter 4 Data Link Layer (OSI Model)_July 2023
39 pages
Chapter 10 Application Layer_July 2023
No ratings yet
Chapter 10 Application Layer_July 2023
36 pages
Practical 1 Slide
No ratings yet
Practical 1 Slide
20 pages
L08 Hierachical agglomerative clustering
No ratings yet
L08 Hierachical agglomerative clustering
41 pages
L10 Neural Network
No ratings yet
L10 Neural Network
52 pages
L03 Generalization, Train Test Splits and Validation
No ratings yet
L03 Generalization, Train Test Splits and Validation
49 pages
L04 Decision Trees
No ratings yet
L04 Decision Trees
34 pages
L02 Classification and Regression
No ratings yet
L02 Classification and Regression
26 pages
L05 Unsupervised learning - Overview
No ratings yet
L05 Unsupervised learning - Overview
16 pages
L01 Introduction to ML
No ratings yet
L01 Introduction to ML
16 pages
Catalogo de ThermoScientific
No ratings yet
Catalogo de ThermoScientific
8 pages
Doors Geze Slimdrive SL Usi PLIANT en
100% (1)
Doors Geze Slimdrive SL Usi PLIANT en
48 pages
Brkaci 2102
No ratings yet
Brkaci 2102
121 pages
Entry Level Data Analyst Resume Example
No ratings yet
Entry Level Data Analyst Resume Example
1 page
201 To 300 Multiple Choice Questions For MS Word - MCQ Sets
No ratings yet
201 To 300 Multiple Choice Questions For MS Word - MCQ Sets
12 pages
Vagas 2 Anos #Tammyindica
No ratings yet
Vagas 2 Anos #Tammyindica
3 pages
Blocking and Confounding in Factorial Dessigns
No ratings yet
Blocking and Confounding in Factorial Dessigns
13 pages
Iot Device For Sewage Gas Monitoring and Alert System
No ratings yet
Iot Device For Sewage Gas Monitoring and Alert System
7 pages
Neuro Glow[1]
No ratings yet
Neuro Glow[1]
11 pages
ARC SoundWave UserManual NT8
No ratings yet
ARC SoundWave UserManual NT8
21 pages
ARC Project Presentation - Indo European Skilling Centers For Mehcatronics and Industrial Robotics PDF
No ratings yet
ARC Project Presentation - Indo European Skilling Centers For Mehcatronics and Industrial Robotics PDF
26 pages
Infrastructure Penetration Testing Course (Online)
No ratings yet
Infrastructure Penetration Testing Course (Online)
5 pages
Fddwin32 Manual en
No ratings yet
Fddwin32 Manual en
56 pages
Advantages and Disadvantages of Multimed
No ratings yet
Advantages and Disadvantages of Multimed
4 pages
Screen Resolution en
No ratings yet
Screen Resolution en
3 pages
Javascript Excercises v2 v.1
No ratings yet
Javascript Excercises v2 v.1
4 pages
Form 1
No ratings yet
Form 1
3 pages
Image Manipulation Activity Answered
No ratings yet
Image Manipulation Activity Answered
7 pages
NewTek Ebook - Stream Like A Pro A - Comprehensive Guide To Live Streaming Success
No ratings yet
NewTek Ebook - Stream Like A Pro A - Comprehensive Guide To Live Streaming Success
11 pages
The Importance of Educational Technology in Teaching
No ratings yet
The Importance of Educational Technology in Teaching
5 pages
Mobile App Development - Course Outline
No ratings yet
Mobile App Development - Course Outline
3 pages
05 Slide
No ratings yet
05 Slide
39 pages
Diagnostico de Trascabo Caterpillar 925k
No ratings yet
Diagnostico de Trascabo Caterpillar 925k
2 pages
Linux Server Troubleshooting
No ratings yet
Linux Server Troubleshooting
8 pages
CNB HDxE Manual ENG20111209TW - 2 PDF
No ratings yet
CNB HDxE Manual ENG20111209TW - 2 PDF
60 pages
Chainlink: A Decentralized Oracle Network Steve Ellis, Ari Juels, and Sergey Nazarov
No ratings yet
Chainlink: A Decentralized Oracle Network Steve Ellis, Ari Juels, and Sergey Nazarov
38 pages