Unix Commands
The core of the UNIX operating system is the kernel. It can be thought of as a
piece of software that handles all the communications between user software
and computer hardware. It is the kernel that decides how to communicate with
peripheral devices, how to share time among users of the system, how to
allocate memory for different programs running, and how to utilize the processor
and other resources. The kernel keeps records of all programs (commonly called
processes) running on the system and shares time among these processes
according to a well-defined policy.
Users of the UNIX system don't have a direct interaction with the kernel. Instead,
the user always works with a program called the UNIX shell. A shell can be
considered a command interpreter. The shell takes user commands, interprets
them, and takes the necessary action to execute them. It also provides the output
of these commands to the user.
The Superuser
UNIX systems have a special user called root. This user is created
during the installation process of HP-UX. The root user has privileges to do all
system administration tasks, such as adding and deleting users, administration of
printers, and other routine system maintenance tasks. Usually, the systems
administrator uses the root user name. It is a common practice to call the root
user the superuser to show the power associated with the name.
Each user in the UNIX system has a unique number associated with the user
name, called a User ID. The root user has ID 0, and any user with that ID
has superuser privileges.
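Because superuser status depends on the numeric ID rather than on the name, a script can test the ID directly. The following is a minimal sketch, assuming the standard id command is available; the function name is_superuser is hypothetical and used only for illustration.

```shell
# is_superuser: succeed if the given numeric user ID is 0.
# With no argument, check the ID of the user running the script (id -u).
is_superuser() {
    uid=${1:-$(id -u)}
    [ "$uid" -eq 0 ]
}

if is_superuser; then
    echo "Running with superuser privileges (user ID 0)."
else
    echo "Running as an ordinary user (user ID $(id -u))."
fi
```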
Logging In
When logging in, you'll see a login prompt on your terminal screen similar to the
one shown in Figure 1-1.
This prompt shows that the system is waiting for a user to enter a login name and
password. At the login prompt, the user should enter the login name. The system
then prompts for the password assigned to the login name. The user then enters
the password. If the login name and the password match one of the users on the
system, the user is allowed to log into the system. If the login name and
password do not match an existing user, then access to the system is denied and
the error message, Login incorrect, is displayed. Figure 1-2 shows an example
of a complete login session where a user named boota logs into HP-UX.
Figure 1-2. Login session for user boota.
Once the user sees the $ symbol, the login process is complete.
Shell Prompts
As soon as a user logs into HP-UX, the shell assigned to the user starts and
displays the shell prompt (also called the command prompt). Superuser and
common user command prompts differ: $ represents a common user, while #
represents a superuser. When you see either of these command prompts, you
can issue any command for which you are authorized.
Note
Be careful when you are logged in as the root user; a mistaken command can
accidentally damage system integrity.
Logging Out
Once you have finished working on HP-UX, you need to log out of the system.
The logout process finishes your session. To log out, use the exit command at
the shell prompt. The system will terminate your session and display the login
prompt. A typical logout process is as follows.
$ exit
logout
Console Login :
Some of the commands that a user issues are intrinsic commands, which are
built into the UNIX shell being used. Other commands, stored as separate files,
are called extrinsic commands. When a user issues a command, the shell first
checks if it is an intrinsic command. If it is not, then extrinsic commands are
checked. The command for changing a directory (cd) is a typical intrinsic
command, and the command for listing files (ls) is a typical extrinsic command.
The extrinsic and intrinsic commands are also called external and internal
commands, respectively.
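If you are unsure whether a command is intrinsic or extrinsic, you can ask the shell. The sketch below assumes a POSIX-style shell that provides the type and command -v built-ins; the exact wording of the messages differs from shell to shell, so no output is shown.

```shell
# type reports how the shell would interpret a command name.
type cd    # a built-in (intrinsic) command
type ls    # normally an external (extrinsic) command, reported with its path

# command -v prints the path name of an external command,
# or just the name for a built-in.
command -v cd
command -v ls
```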
Changing Your Password
Perhaps the first command every user should know is how to change the
password. When the superuser or the system administrator creates a new
account for you, your initial password is typically (but not always) set. When you
log into HP-UX for the first time, you should change your password to secure
your account. The password is changed with the passwd command. When you
issue this command, you will be asked to enter the current password. For
security reasons, this password will not be displayed on the screen. If you enter
this old password correctly, you are prompted to enter the new password. Once
you enter the new password and press the Enter key, the system will ask you to
reenter the new password. Now you enter the new password again. A typical
password changing session would be like the following.
$ passwd
Old password :
New password :
Re-enter new password :
Password changed
$
Please note that you must have a password between six and eight characters
long, and it must be a combination of characters and numbers. At least two
characters of the password must be letters and one must be a number or a
special character, such as a dash (-), underscore (_), or asterisk (*). If you use a
password of length greater than eight, any characters after the eighth character
are ignored. For security reasons, it is better not to use any dictionary word as a
password.
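The rules above can be sketched as a small shell function. This is only an illustration of the rules as stated here, not the way the passwd command itself is implemented; the function name check_password is hypothetical, and for simplicity any character that is not a letter is counted as a "number or special character."

```shell
# check_password: hypothetical helper that applies the rules described above.
# Only the first eight characters are significant, so test those.
check_password() {
    significant=$(printf '%.8s' "$1")
    letters=$(printf '%s' "$significant" | tr -cd 'A-Za-z')
    others=$(printf '%s' "$significant" | tr -d 'A-Za-z')

    if [ "${#significant}" -lt 6 ]; then
        echo "too short"
    elif [ "${#letters}" -lt 2 ]; then
        echo "needs at least two letters"
    elif [ "${#others}" -lt 1 ]; then
        echo "needs a number or special character"
    else
        echo "ok"
    fi
}

check_password "abc"         # too short
check_password "secret"      # needs a number or special character
check_password "my_pass1"    # ok
```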
The HP-UX shell keeps the last executed commands in a shell history. The shell
history is used for future reference to the same commands or reexecuting the
commands without retyping them. The command history is saved in a file called
.sh_history (in which an underscore separates the characters .sh and the word
history) for each user. This file is kept in the user's home directory. Users can
set the number of entries in the history through a control variable, which will be
discussed later in Chapter 3.
To repeat a previous command, a user presses the Esc-k key combination.
The previous command appears on the command prompt. If the user wants to go
another step backward, simply pressing the k key again moves one step back.
Similarly, you can go back as far as you want until the history is finished. When
the correct command is displayed, just press the Enter key to execute it.
For example, consider you have used the following three commands in
sequence.
$ date
Sun Aug 29 15:36:23 EDT 1999
$ whoami
boota
$ pwd
/home/boota
$
Now you want to execute the date command again. Just press Esc-k once
and you will see the pwd command appear on the command prompt. Now
press the k key again and whoami will appear. Press the k key a
third time and the date command will appear on the command prompt. Now you can
execute the command by pressing the Enter key.
The history command is designed to recall commands which have been used
previously. For example, the following command shows the last three
commands.
$ history -3
date
whoami
pwd
$
Command Aliases
An alias is a different name for a UNIX command. Aliases can also be used to
remember commands by assigning them more meaningful names. They can be
set using the alias command at the command prompt. For example, the ls
command is used to list the file and directory names in HP-UX. If you want to use
dir instead of ls, you can use an alias for that purpose as follows.
$ alias dir=ls
$
If you use the dir command after that, it functions exactly as the ls command
does. Aliases are very useful for situations in which a user has to repeatedly
issue a complex or long command.
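When the command being aliased contains options or spaces, the definition must be quoted. The following sketch uses a hypothetical alias name, lt, for a long listing sorted by modification time. It only defines, displays, and removes the alias, because some shells do not expand aliases inside scripts.

```shell
# Quote the definition when the command contains spaces or options.
alias lt='ls -lt'     # hypothetical name: long listing, newest first

# alias followed by a name prints its definition; unalias removes it.
alias lt
unalias lt
```

At an interactive prompt, typing lt after the first line would run ls -lt.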
The system startup file is used for tasks that are common to all system users.
Examples of these tasks are setting your time zone (TZ), and the global search
path for extrinsic commands. The system startup file is called profile and is
present in the /etc directory of HP-UX. It is the responsibility of the system
administrator to manage this file and make it more useful, keeping in mind a
particular UNIX environment. The system startup file can also be used by a
system administrator to send messages that are displayed to users as soon as
someone logs into the system.
Where Am I?
In the HP-UX environment, it is always important to know what directory you are
in before you move to another directory. You can use the pwd (print working
directory) command at any time to find out the current directory name.
$ pwd
/home/boota
$
The date command tells you the current date and time set on the HP-UX system
clock.
$ date
Sun Aug 29 15:36:23 EDT 1999
$
The date command is also used for changing the date and time. This feature of
the command is accessible by the superuser only. So if you are logged in as user
root, you can set a new date and time, in case the displayed date or time is
incorrect. Please note that there is another HP-UX command, time, which is
used for another purpose (not for displaying or setting system time).
What if you see an unattended terminal to which a user is logged in? You can
use the whoami command to identify who is logged in. When you use this
command, you are actually asking the system to tell "you" about "yourself."
$ whoami
boota
$
There are some other interesting uses of this command. For example, if you write
a shell program that will do something different depending on which user
executes the program, this little command does a great job of detecting who
initiated the program.
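A shell program of this kind might look like the following sketch. The function name greet and the messages are hypothetical; the optional argument exists only so that the function can be tried with different user names.

```shell
# greet: print a message that depends on the user name passed in.
# The default is the name reported by whoami.
greet() {
    user=${1:-$(whoami)}
    if [ "$user" = "root" ]; then
        echo "Logged in as the superuser; proceed with care."
    else
        echo "Logged in as ordinary user $user."
    fi
}

greet
```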
There is another useful command that is even shorter than this one. This is the
who command, which tells you the login names of all users logged into the
system, along with their login time and the terminal line they are using. The
output of the who command may vary depending on how many users are logged
in.
$ who
operator pts/ta Aug 30 16:05
boota pts/tb Aug 30 15:59
$
It is interesting that an even shorter command exists in HP-UX, which is w, and it
does a similar job. Additionally, the w command tells how long the system has
been up, what the current time is, and what the logged-in users are doing.
$ w
  4:27pm  up 1 day, 12:10,  2 users,  load average: 0.07, 0.08, 0.09
User     tty       login@  idle   JCPU   PCPU  what
operator pts/ta    4:05pm    12                -sh
boota    pts/tb    3:59pm                      w
$
In HP-UX, some commands are linked to other commands such that the result of
these commands is the same. For example, the w command is linked to the
uptime -w command and both have the same output.
Every HP-UX system has a system name. If you are curious to know what the
system name is or which version of HP-UX is running on the system, use the
uname command.
$ uname -a
HP-UX myhp B.11.00 E 9000/800 71678 8-user license
$
Here the command output shows that it is an HP-UX system. The system name
is myhp and it is running HP-UX version 11.00.
Printing Banners
Banners are very useful for printing welcome messages at login time and for
separating printed pages of different users when many users are using a
common printer. You can print a fancy design of your name using the banner
command.
$ banner BOOTA
######  ####### ####### #######    #
#     # #     # #     #    #      # #
#     # #     # #     #    #     #   #
######  #     # #     #    #    #     #
#     # #     # #     #    #    #######
#     # #     # #     #    #    #     #
######  ####### #######    #    #     #
$
The Calendar
A calendar is displayed by cal, which is a short and handy command for printing
the calendar of a particular month. In the simplest case, the cal command prints
the calendar of the current month, but you can use the command in a number of
ways to print the desired calendar.
$ cal
    August 1999
 S  M Tu  W Th  F  S
 1  2  3  4  5  6  7
 8  9 10 11 12 13 14
15 16 17 18 19 20 21
22 23 24 25 26 27 28
29 30 31
$
UNIX systems provide a very powerful and useful feature that provides detailed
help on all of the commands. This tool is called manual pages or man pages.
You can use the man command to get help for any UNIX command as follows:
$ man cal
cal(1) cal(1)
NAME
cal - print calendar
SYNOPSIS
cal [[month] year]
DESCRIPTION
Cal prints a calendar for the specified year. If a
month is also specified, a calendar for just that
month is printed. If neither is specified, a calen-
dar for the present month is printed. Year can be
between 1 and 9999. Month is a decimal number be-
tween 1 and 12. The calendar produced is a Gregorian
calendar.
EXTERNAL INFLUENCES
Environment Variables
LANG determines the locale to use for the locale
categories when both LC_ALL and the corresponding en-
vironment variable (beginning with LC_)do not specify
a locale. If LANG is not set or is set to the empty
string, a default of "C" (see lang(5)) is used.
. . .
. . .
$
How would a user get help on the man command itself? Just as man cal provides
help on the cal command, the man man command helps with the man command.
We shall describe the manual pages in more detail in Chapter 8.
The shell allows users to enter multiple commands on a single line. In this case,
each command is separated by a semicolon. See the result of the command
execution as given here.
$ cal;date
    August 1999
 S  M Tu  W Th  F  S
 1  2  3  4  5  6  7
 8  9 10 11 12 13 14
15 16 17 18 19 20 21
22 23 24 25 26 27 28
29 30 31
Mon Aug 30 17:57:51 EDT 1999
$
Options and arguments are used to enhance a command feature. Anything typed
after the command name is either an option or an argument. A minus (-) symbol
precedes any option. Options change command behavior as shown with the
history command where you used -3 as an option. Arguments provide
additional information to a command. You used the man command with argument
cal, which provided information to the man command to show manual pages for
the cal command.
Creating a File
A file is a named area on the disk(s) where you can store information. The cat
command is a basic command used for creating new files containing text. For
example, if you want to create a file with the name newfile, containing three
lines, the process is as follows:
$ cat > newfile
This is first line. <ENTER>
This is the second line. <ENTER>
This is third and last line. <ENTER>
<CTRL-d>
$
Note that you press the Enter key at the end of each line. When you have
finished entering the text, you press Ctrl-d (pressing the Control and d
keys simultaneously) to end the text entry process and save the file.
Please note that use of the cat command for creating a new file is not very
common but it is the simplest way to do so. Most of the time you will be using the
vi editor to create or modify files. The vi editor is discussed in more detail in
Chapter 5.
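When the text is known in advance, for example inside a script, the same file can be created without typing at the keyboard. The sketch below replaces the interactive input with a here-document, a standard shell feature that is not covered in this chapter. The scratch directory name is an assumption, chosen so that no existing file is overwritten.

```shell
# Work in a scratch directory so existing files are left alone.
demo_dir=${TMPDIR:-/tmp}/catdemo.$$
mkdir "$demo_dir"

# Everything up to the line containing only EOF becomes the input of cat.
cat > "$demo_dir/newfile" <<'EOF'
This is first line.
This is the second line.
This is third and last line.
EOF

# Read the file back to confirm what was stored, then clean up.
contents=$(cat "$demo_dir/newfile")
echo "$contents"
rm -rf "$demo_dir"
```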
Listing Files
Now that you have created a file, you can verify it by listing it using the ls
command.
$ ls newfile
newfile
$
The ls command shows that our newly created file, newfile, does exist and that
the file creation process was successful. What if you want to see a list of all other
files? This is very simple; you use the ls command without any argument.
$ ls
FORMAT FORMAT.ZIP myf newfile rafeeq.zip
$
Now the ls command shows that there are five files with the names shown
above. Please note that UNIX is case sensitive, meaning it differentiates between
lowercase and uppercase letters. So the file name myfile is different from
MyFile.
HP-UX has another popular command to list files. This is the ll (long listing)
command.
$ ll
total 350
-rw-r-----   1 boota      users       104230 Aug 27 19:04 FORMAT
-rw-rw-rw-   1 boota      users            0 Aug 30 20:47 myf
-rw-rw-rw-   1 boota      users           72 Aug 30 20:47 newfile
$
This command shows that there are three files, with the names displayed in the
last column of the output. If you are wondering what the -rw-rw-rw- characters
displayed in the first column are, just leave these for the time being. These are
the file permissions showing who is allowed to read, write, and execute a
particular file. We will be discussing file permissions in more detail in Chapter 7.
As you may remember from the first chapter, some commands are linked to other
commands; ll is another example. This command is linked to the ls -l
command. The ls command has many options, and you can have a look at these
using its manual pages.
Now we try to figure out the other columns in the file listing. The second column
shows how many links are associated with this file. A 1 (numeric one) means
there is no other link to this file. The next column shows the owner of the file,
and the column after it shows the group that owns the file; here, users is the
group of the user boota who owns this file. The next column
shows the file size in number of bytes. Then we have the date and time of last
modification to the file, and in the last column the file name is displayed.
Deleting Files
To keep the system clean, you need to delete unwanted files from time to time.
The files are deleted with the rm command.
$ rm newfile
$
We already have used the cat command for creating new files. The same
command is used to display contents of text files.
$ cat newfile
This is first line.
This is the second line.
This is third and last line.
$
We have just omitted the ">" symbol from the command line. The cat command
displays the entire contents of a file in one step, no matter how long the file is. As
a result, the user is able to see only the last page of text displayed. There is
another useful command with the name more that displays one page of text at a
time. After displaying the first page, it stops until the user hits the spacebar. The
more command then displays the next page of text and so on. Figure 2-1 shows a
screen shot of the more command while displaying the .profile file.
Generally a file name in UNIX can be as long as 255 characters. The rules that
apply to the file names are as follows.
1. A file name can be a combination of letters, numbers, and special characters.
2. All letters, both upper- (A–Z) and lowercase (a–z) can be used.
3. Numbers from 0 to 9 can be used.
4. Special characters like plus (+), minus (-), underscore (_), or dot (.) can be
used.
5. As mentioned earlier, UNIX is case sensitive, and uppercase and lowercase
letters are treated separately. So file names myfile, Myfile, MyFile, and
myfilE are different names.
6. There are no special names for executable files in UNIX; the file permissions
show which file is executable and which is not.
Hidden Files
Any file whose name starts with a dot (.) is not displayed when using the ll or ls
command. These are hidden or invisible files. Usually these files are used to
store configuration information. The user startup file, .profile, is one example
of a hidden file. To display the hidden files, use the ls -a
command.
$ ls -a
.profile newfile testfile.zip
$
Hidden files have some protection against an rm command that is used to delete all
files in a directory. The * wildcard does not match names that start with a dot,
so a command such as rm * does not delete hidden files.
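The effect can be seen safely in a scratch directory. In the sketch below, the directory and file names are invented for the demonstration; only the file whose name begins with a dot survives the rm * command.

```shell
demo_dir=${TMPDIR:-/tmp}/hidden.$$
mkdir "$demo_dir"
cd "$demo_dir" || exit 1

# Create one hidden file and two ordinary files (empty, for illustration).
: > .settings
: > report
: > notes

# The * pattern expands to "notes report" only; .settings is not matched.
rm *

# List what is left, including hidden entries.
remaining=$(ls -a)
echo "$remaining"

cd / && rm -rf "$demo_dir"
```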
Creating Directories
Deleting Directories
Directories are deleted with the rmdir command. This command deletes only empty
directories. If the directory contains a file or another directory, that file or
directory must be deleted first. If a user needs to delete a directory that is not
empty, the rm -rf command can be used instead; it deletes the directory
together with everything it contains.
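The difference between the two commands can be tried out in a scratch directory. This is a minimal sketch with invented directory and file names; it uses the standard mkdir command to build the directories that are then removed.

```shell
demo_dir=${TMPDIR:-/tmp}/rmdirdemo.$$
mkdir "$demo_dir"
mkdir "$demo_dir/project"
: > "$demo_dir/project/notes"

# rmdir refuses to remove a directory that still contains a file.
if rmdir "$demo_dir/project" 2>/dev/null; then
    first_try="removed"
else
    first_try="refused"
fi
echo "rmdir on a nonempty directory: $first_try"

# Empty the directory first, and then rmdir succeeds.
rm "$demo_dir/project/notes"
rmdir "$demo_dir/project" && echo "rmdir on the empty directory: removed"

# rm -rf removes a directory and everything below it in one step.
rm -rf "$demo_dir"
```

Because rm -rf asks for no confirmation, check the directory name carefully before pressing Enter.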
Parent and child directories are just relative to each other. For example, home
directory is a child directory of the root directory but it is a parent directory for the
boota directory.
The directory names are referenced relative to the root directory. A complete
reference name to a directory is called a path name. For example, the path name
of the home directory is /home. Similarly, the path name of directory boota is
/home/boota. It is easy to judge from the path name that boota is a child directory
of home, which in turn is a child directory of the root directory. Files also have
path names similar to directories. For example, a complete path name for a file
created in directory /home/boota with name myfile is /home/boota/myfile. A
path name that starts with the "/" symbol is called the absolute path name. We
can also use relative path names, which start from the current directory. For
example, to refer to a file with the name alpha in the parent directory of the
current directory, we may use a path name ../alpha.
Whenever a new directory is created, two entries are created in the new directory
automatically. These are "." and ".." where "." is a reference to the current
directory and ".." is a reference to the parent directory of the current directory.
You used the pwd command in Chapter 1. This command was used to check the
current directory. The cd (change directory) command is used to move to some
other directory in the directory tree. This command, like other UNIX commands,
can be used both with absolute and relative path names. You already know that
a user automatically goes to the home directory just after the login process. We
again consider the example of user boota who has just logged in and is in home
directory, /home/boota. To confirm that she is indeed in her home directory and
then move to the /etc directory, the user issues the following commands.
$ pwd
/home/boota
$ cd /etc
$
$ pwd
/etc
$
The last pwd command showed that the user has moved to the destination
directory /etc. In this example, we used an absolute path. As an example of
a relative path, suppose the user boota is in her home directory /home/boota and
wants to move to the /home (the parent) directory. She can use the cd .. or cd
/home command, and either will have the same effect. With cd .., she asks the
shell to move to the parent directory of the current directory. What if you use cd
../..?
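The following sketch rebuilds a similar directory tree under a scratch directory (an invented location, so that it runs without touching the real /home) and moves around in it with absolute and relative path names.

```shell
# Build a small tree that mirrors home/boota under a scratch directory.
top=${TMPDIR:-/tmp}/cddemo.$$
mkdir "$top" "$top/home" "$top/home/boota"

cd "$top/home/boota"      # absolute path name
cd ..                     # relative: parent of the current directory
after_one=$(pwd)          # now in .../home
cd boota                  # relative: child of the current directory
cd ../..                  # relative: two levels up
after_two=$(pwd)          # now at the top of the scratch tree

echo "$after_one"
echo "$after_two"
cd / && rm -rf "$top"
```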
Study Break
Once again create the same directory and the same file
inside it. Now use the rm -rf command to delete the
nonempty directory.
Copying Files
The files are copied with the cp command. The source and destination file names
are supplied to the cp command as arguments. The first argument is the source
file name, and second argument is the destination file name.
$ cp myfile anotherfile
$
This command copies myfile from the current directory to anotherfile in the
current directory. It is possible to copy files from any directory to any other
directory using the path names of the files. For example, if you want to copy
profile from the /etc directory to the current directory with the name myprofile,
the command is cp /etc/profile myprofile.
The mv command is used for renaming files and moving files from one place to
another in the directory structure. Like the cp command, it takes source and
destination file names as arguments. If both source and destination names are
specified without any path (absolute or relative), the file is renamed. On the other
hand, if either or both of the file names contain a path name, the file is moved from
the source location to the destination location.
RENAME A FILE
$ mv myfile newfile
$
Make sure that the operation was successful by using the ll command.
MOVE A FILE
$ mv myfile /tmp/myfile
$
Two or more files can be moved simultaneously using the mv command. The
destination must be a directory name. The following command moves two files,
file1 and file2, to directory /tmp.
$ mv file1 file2 /tmp
$
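The copy, rename, and move operations can be tried together in a scratch directory. The sketch below uses invented file and directory names and records what is left in each place at the end.

```shell
demo_dir=${TMPDIR:-/tmp}/mvdemo.$$
mkdir "$demo_dir" "$demo_dir/archive"
cd "$demo_dir" || exit 1
echo "sample text" > myfile

cp myfile anotherfile          # copy: both names now exist
mv anotherfile renamed         # no path in either name: a rename
mv myfile renamed archive      # last argument is a directory: a move

top_level=$(ls)
in_archive=$(ls archive)
echo "top level: $top_level"
echo "archive:" $in_archive
cd / && rm -rf "$demo_dir"
```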
2.5 Wildcards
When you want to use many file names in one command, such as the one where
grep is used to search a pattern in many files, it is very inconvenient to type all
these names at the command line. Wildcard characters are used as a shortcut to
refer to many files. Two wildcards are used in UNIX, the asterisk character (*)
and the question mark (?). The * matches zero or more characters, whereas ?
matches only one character. There is a third type of character matching
mechanism that checks a range of characters. This is the [] pattern, and a range
is specified inside the square brackets. Sometimes this is called the third
wildcard.
Use of *
Suppose you use the ls command to list files and the following list appears.
$ ls
myfile myfile00 myfile01 myfile010 myf xyz
$
Now we can use the * character to list files we want to be displayed. If we want
to list all files that start with myfile, the command is:
$ ls myfile*
myfile myfile00 myfile01 myfile010
$
To list all files that start with my, we use:
$ ls my*
myfile myfile00 myfile01 myfile010 myf
$
Use of ?
The ? matches only a single character. For example, if you want to list all files
whose names consist of myfile0 followed by exactly one more character, the command is:
$ ls myfile0?
myfile00 myfile01
$
Now try to figure out why myfile010 did not appear in the list.
The wildcard characters can be used wherever you need to specify more than
one file. For example, if you want to copy all files from the current directory to the
/tmp directory, the command will be:
$ cp * /tmp
$
Similarly, if you want to search for the word root in all files of the /etc directory,
you can use this command.
$ grep root /etc/*
The wildcard characters are very useful, and if you master these, you can save a
lot of time in your daily computer use.
Use of [ ] Wildcard
This wildcard matches any one character from the set given inside the square
brackets. The set can be a range or a list. For example, [a-m] means any one
character between "a" and "m". Similarly, [acx] means character "a", "c", or "x".
The characters in a list are written without separators; a comma placed inside
the brackets would itself be treated as one of the characters to match.
$ ls /etc/[wx]*
/etc/wall /etc/whodo /etc/wtmp /etc/xtab
$
The above command lists all files in the /etc directory that start with a "w" or "x"
character.
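To see exactly which names the shell substitutes for a pattern, you can pass the pattern to echo instead of ls. The sketch below recreates the file names used in this section as empty files in a scratch directory (an invented location) and expands one pattern of each kind.

```shell
demo_dir=${TMPDIR:-/tmp}/globdemo.$$
mkdir "$demo_dir"
cd "$demo_dir" || exit 1
# Create the empty files used in the listings above.
touch myfile myfile00 myfile01 myfile010 myf xyz

# echo prints the names the shell substitutes for each pattern.
star=$(echo myfile*)       # zero or more characters after "myfile"
question=$(echo myfile0?)  # exactly one character after "myfile0"
bracket=$(echo [u-z]*)     # names starting with one character from u to z

echo "myfile*  -> $star"
echo "myfile0? -> $question"
echo "[u-z]*   -> $bracket"
cd / && rm -rf "$demo_dir"
```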
A Text File
$ file /etc/profile
/etc/profile: ascii text
$
A Directory
$ file /etc
/etc: directory
$
An Executable File
$ file /bin/ls
/bin/ls: PA-RISC1.1 shared executable
$
A Shared Library
$ file /lib/libc.1
/lib/libc.1: PA-RISC1.1 shared library -not stripped
$
A Shell Script
$ file abc
abc: commands text
$
Similarly, the file command is able to detect a number of other file types. The
file command uses the /etc/magic file to determine different file types by
finding a magic string inside the file. A detailed discussion on magic numbers is
out of the scope of this book, but you can see man pages for /etc/magic for
further information on magic numbers. The file command is very useful in
situations where you want to determine the type of file before performing an
operation on it. It is quite possible that your display would be garbled if you were
to use the cat command on a binary file.
Study Break
Copying and Moving Files Using Wildcards and Finding the Type of a
File
You can find out the type of a file by using the file
command. Try to find a shared executable file on the
system by applying this command to different files.
Here we show how you can find whether a particular user exists by applying the
grep command on the /etc/passwd file.
$ grep Mark /etc/passwd
mstyle:elBY:2216:125:Mark Style,,,:/home/mstyle:/usr/bin/sh
mgany:iF5UeWQ:2259:125:Mark Gany,,,:/home/mgany:/usr/bin/sh
mbuna:tQfwUNo:2318:125:Mark Buna,,,:/home/mbuna:/usr/bin/sh
mblack:ipCg:2388:125:Mark Black,,,:/home/mblack:/usr/bin/sh
$
This command shows that there are four users on the system with the name
Mark. If you want to make a search case insensitive, you may use grep -i
instead of grep. If you want to know how many lines contain the string,
without displaying the lines themselves, use grep -c. You can
even reverse the selection of lines with grep -v. In this case, all lines that don't
match the string pattern are displayed.
If you want to search using a string of multiple words, enclose the words with
double quotes. For example, if you want to search for "Mark Black" in
/etc/passwd, you will use the grep command.
$ grep "Mark Black" /etc/passwd
mblack:ipCg:2388:125:Mark Black,,,:/home/mblack:/usr/bin/sh
$
For a case-insensitive search of "Mark Black," use the following command.
$ grep -i "mark black" /etc/passwd
mblack:ipCg:2388:125:Mark Black,,,:/home/mblack:/usr/bin/sh
$
As I mentioned earlier, the grep command can be used to search multiple files for
a matching string. You need to specify all file names in which you want to search
for the text string. For example, if you search for the word root in the
/etc/passwd and /etc/group files, the following result is displayed.
$ grep root /etc/passwd /etc/group
/etc/passwd:root:8JgNSmFv806dA:0:3:,,,:/home/root:/sbin/sh
/etc/group:root::0:root
/etc/group:other::1:root,hpdb
/etc/group:bin::2:root,bin
$
The command shows that the word root appears on one line of the /etc/passwd file and
on three lines of the /etc/group file.
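The grep options described above can be tried on a small sample file instead of the real /etc/passwd. In the sketch below, the file location and all three entries are invented for illustration.

```shell
# Build a small sample file in the /etc/passwd format (all entries invented).
sample=${TMPDIR:-/tmp}/grepdemo.$$
cat > "$sample" <<'EOF'
mblack:x:2388:125:Mark Black,,,:/home/mblack:/usr/bin/sh
mgany:x:2259:125:Mark Gany,,,:/home/mgany:/usr/bin/sh
jdoe:x:2400:125:Jane Doe,,,:/home/jdoe:/usr/bin/sh
EOF

matching=$(grep -c Mark "$sample")                 # lines containing Mark
ignoring_case=$(grep -ci "mark black" "$sample")   # case-insensitive count
others=$(grep -v Mark "$sample")                   # lines without Mark

echo "lines with Mark: $matching"
echo "lines with 'mark black', any case: $ignoring_case"
echo "lines without Mark: $others"
rm -f "$sample"
```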
Sometimes you need to view only the first or last few lines of a text file. By
default, the head command lists the first ten lines of a text file, and the tail
command lists the last ten lines of a file. For example, if you want to see the first
ten lines of the /etc/passwd file (used to store user names and passwords), the
command and its output will be:
$ head /etc/passwd
root:8JgNSmFv806dA:0:3:,,,:/home/root:/sbin/sh
mmsecad:ETxUQ5wSQZCAk:0:3::/:/sbin/sh
daemon:*:1:5::/:/sbin/sh
bin:*:2:2::/usr/bin:/sbin/sh
sys:*:3:3::/:
adm:*:4:4::/var/adm:/sbin/sh
uucp:*:5:3::/var/spool/uucppublic:/usr/lbin/uucp/uucico
lp:*:9:7::/var/spool/lp:/sbin/sh
nuucp:*:11:11::/var/spool/uucppublic:/usr/lbin/uucp/uucico
hpdb:*:27:1:ALLBASE:/:/sbin/sh
$
Additional parameters can be used with both the head and tail commands to
view any number of lines of text. For example, tail -n 3 /etc/passwd shows the last
three lines of the file. If you want to see what is being added to a text file by a
process in real time, you can use the tail -f command. This is a very useful
tool for watching text being added to a log file.
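The line counts are easier to follow with a file whose contents are predictable. The sketch below generates a 20-line sample file (an invented name and location) and then takes lines from each end of it.

```shell
# Generate a 20-line sample file: "line 1" through "line 20".
sample=${TMPDIR:-/tmp}/headdemo.$$
i=1
while [ "$i" -le 20 ]; do
    echo "line $i"
    i=$((i + 1))
done > "$sample"

first_default=$(head "$sample")        # first ten lines by default
first_two=$(head -n 2 "$sample")       # first two lines
last_three=$(tail -n 3 "$sample")      # last three lines

echo "$first_two"
echo "$last_three"
rm -f "$sample"
```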
Many times, you want to know how many characters, words, or lines there are in
a file. In the /etc/passwd file, for example, there is one line for every user. You
can count the number of users on the HP-UX system if you count the number of
lines in the file. We use the wc (word count) command for this purpose. It displays
the number of lines, words, and characters, respectively.
$ wc /etc/profile
171 470 3280 /etc/profile
$
It shows that there are 171 lines, 470 words, and 3280 characters in the
/etc/profile file. If you want to count only the number of lines in a file, you can
use wc -l. Similarly, wc -w counts only the words and wc -c counts only the
characters.
$ wc -l /etc/passwd
2414 /etc/passwd
$
It shows that there are 2414 lines in /etc/passwd, which is an indirect way to find
out the number of users on this system.
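When a count is needed inside a script, it is convenient to capture only the number. The following sketch uses a small invented sample file and reads it through standard input, so that wc prints the count without the file name.

```shell
sample=${TMPDIR:-/tmp}/wcdemo.$$
printf 'one two three\nfour five\n' > "$sample"

# Reading from standard input makes wc print the count without a file name.
# The arithmetic expansion strips any padding spaces from the number.
lines=$(( $(wc -l < "$sample") ))
words=$(( $(wc -w < "$sample") ))
chars=$(( $(wc -c < "$sample") ))

# Prints: 2 lines, 5 words, 24 characters
echo "$lines lines, $words words, $chars characters"
rm -f "$sample"
```

Strictly speaking, wc -c counts bytes; for plain ASCII text such as this sample, that is the same as the number of characters.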
Link Files
Many times you need to refer to the same file by different names. You can
create a link file that is not the actual file but points to some other file to which it
is linked. There are two types of links, hard and soft. A hard link must be on the
same file system as the file it refers to, whereas soft links may be
established across file systems. The soft link is a special type of file; the first
character of its line in the ll command listing is "l". To create a link, the ln
command is used. For example, to create a hard link, abc, to a file, myfile, we
use:
$ ln myfile abc
$
To create a soft link, we use the -s option.
$ ln -s myfile abc
$
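The difference between the two kinds of link shows up when the original name is removed. The sketch below works in a scratch directory and uses two invented link names, hardname and softname, so that both links can exist at the same time.

```shell
demo_dir=${TMPDIR:-/tmp}/lndemo.$$
mkdir "$demo_dir"
cd "$demo_dir" || exit 1
echo "original text" > myfile

ln myfile hardname        # hard link: a second name for the same file
ln -s myfile softname     # soft link: a small file that points to myfile

# A change made through one name is visible through the others.
echo "added line" >> hardname
via_soft=$(cat softname)

# Removing the original name leaves the hard link usable,
# but the soft link now points to a name that no longer exists.
rm myfile
via_hard=$(cat hardname)
if [ -f softname ]; then soft_state="valid"; else soft_state="dangling"; fi

echo "$via_hard"
echo "soft link is $soft_state"
cd / && rm -rf "$demo_dir"
```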
Chapter 3. Environment Variables
Chapter Syllabus
As soon as a user logs into HP-UX, the shell is invoked and waits for commands
from the user. To execute these commands, the shell needs to know some
information about the environment being used. For example, to correctly display
a file, the shell needs to know which type of terminal is attached to the system.
Similarly, when a user issues an extrinsic command, the shell needs to know in
which directories it should look for the command. In UNIX terminology, we call
this type of information the shell environment.
The shell stores environment information in environment variables. Usually,
many types of variables are set in the system startup file (/etc/profile)
managed by the system administrator. The users can also set environment
variables through the use of a user startup file kept in the home directory
(.profile).
Any program that needs environment variables checks the existence and value
of these variables at the startup time. For example, the editor program vi needs
your terminal information to correctly display and scroll text. When you start the vi
editor, it will check the TERM variable. If it understands the terminal type set by
the TERM variable, it will start in full screen mode; otherwise, it will start in line
editing mode, where you can edit or display only one line of text at a time.
Similarly, the more command needs the TERM variable to display a particular
number of text lines, depending on the type of terminal being used.
You can change the environment variables set by the system
administrator. The system administrator usually sets the PATH variable that
shows the search path for the executable commands. But as you start using the
UNIX system, you also create your own programs and scripts, and you want the
shell to look into the directories containing your own programs as well. For this
purpose, you can add your own directory names in the PATH variable.
In this chapter, you will see the difference between environment and shell
variables and how to set and display variables. There are many predefined
environment variables, and the most important of these will be discussed. Then
you will learn how to increase the visibility of a shell variable by exporting it. The
default HP-UX command prompt shows little information, and you will see how to
add some useful information to it using variables. Since PATH is an important
variable, you will learn more about it at the end of the chapter.
If you want to list all variables known to your current shell, use the set command.
$ set
EDITOR=vi
EPC_DISABLED=TRUE
ERASE=^H
FCEDIT=/usr/bin/ed
HISTFILE=/home/root/.sh_history
HISTSIZE=400
HOME=/home/boota
INTR=^C
LINENO=1
LOGNAME=boota
MAIL=/var/mail/boota
MAILCHECK=600
MANPATH=/usr/share/man/%L:/usr/share/man:/usr/contrib/man:/usr/local/man/%L:/usr/local/man
NAME=12
OPTIND=1
PATH=/usr/sbin:/baan/bse/bin:/usr/bin:/usr/ccs/bin:/usr/contrib/bin:/usr/bin/X11:/usr/contrib/bin/X11:/opt/perf/bin:/usr/sbin:/sbin
PPID=26709
PS1='boota on myhp $PWD => '
PS2='> '
PS3='#? '
PS4='+ '
SHELL=/sbin/sh
TERM=vt100
TMOUT=0
TZ=EST5EDT
VAR3=TestVar
_=set
$
This list will change from system to system. It also depends on what applications
are running on the system, as applications also set their environment variables.
We will discuss some of the common variables in the next pages.
Often a variable has a value that contains space characters in it. If you try to set
a variable containing spaces in the normal way, you will get an error message as
follows.
$ NAME=Mike Ron
sh: Ron: not found.
$
The shell thought that you were setting a variable NAME with value Mike while Ron
is a UNIX command. The shell then tried to execute this command and failed. To
set variables containing multiple words, we use single or double quotes.
$ NAME="Mike Ron"
$
$ echo $NAME
Mike Ron
$
Single quotes may also be used.
$ NAME='Mike Ron'
$
There is a slight difference between single- and double-quote characters that I
will soon elaborate on.
The echo command can be used to display a variable and additional text at the
same time. For example, just after displaying the NAME variable, we want to
display the number 7. What if we use command echo $NAME7?
$ echo $NAME7
sh: NAME7: Parameter not set.
$
The shell actually started looking for a variable name NAME7 instead of NAME but
could not find it. To avoid this ambiguity, we use {} to separate a variable from
the rest of the text as follows.
$ echo ${NAME}7
Mike Ron7
$
Many UNIX users put {} around variable names to avoid any ambiguity. The
curly brackets must be used wherever a variable name is immediately followed by
a letter, digit, or underscore that is not part of the name.
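A common case where the braces matter is building a new name from a variable. The sketch below is only an illustration; the variable FILE and the suffix _old.txt are made-up names.

```shell
NAME="Mike Ron"
FILE=report

# Without braces the shell would look for variables named NAME7 and FILE_old.
with_number="${NAME}7"
backup_name="${FILE}_old.txt"

echo "$with_number"
echo "$backup_name"
```

The period in _old.txt cannot be part of a variable name, so it ends the name on its own, but the underscore cannot, which is why the braces are needed here.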
Modifying a Variable
Assigning a new value to the same variable name modifies the previous value of
the variable. It can be done in two ways. If we just assign a new value, the old
value of the variable is destroyed. We can also append to the old value by putting
the variable name on the right-hand side of the = symbol at the time of
assignment. For example, if we want to add a third part to the NAME variable, it
can be done as follows.
$ NAME="$NAME Junior"
$
$ echo $NAME
Mike Ron Junior
$
Note
This is a very useful way to add your own directories to the PATH
variable. The PATH variable set by the system administrator contains a
list of directories where command files are located. When finding a
command, if you want the shell to also search in your own directories,
you can use the above method to append your own directory names to
the PATH variable.
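As a minimal sketch of the method described in the note, the commands below append a personal directory to the search path. The directory $HOME/bin and the variable name MYBIN are assumptions chosen for illustration.

```shell
# Hypothetical personal directory holding your own scripts.
MYBIN="$HOME/bin"

# Append it to the existing search path. The old value is kept
# because $PATH is expanded inside the double quotes.
PATH="$PATH:$MYBIN"
export PATH

echo "$PATH"
```

Because the new directory is placed at the end, the shell still searches the directories set by the system administrator first.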
Now we come to the difference between single and double quotes. Consider the
above command example by replacing the double quotes with single quotes and
watch the result carefully.
$ NAME='$NAME Junior'
$
$ echo $NAME
$NAME Junior
$
This is not what we wanted! What happens is that single-quote characters do not
expand any variable name inside them to its value. Instead, anything inside the
single quotes is taken as is and assigned to the variable. One must be careful
when using single quotes! The same rule applies when you use single and
double quotes with other commands. See the results of two echo commands.
$ NAME="Mike Ron"
$
$ echo "$NAME Junior"
Mike Ron Junior
$ echo '$NAME Junior'
$NAME Junior
$
Removing a Variable
A shell variable can be removed by the unset command on HP-UX. Please note
that this command is not available in all UNIX shells.
$ NAME="Mike Ron"
$ echo $NAME
Mike Ron
$ unset NAME
$ echo $NAME
sh: NAME: Parameter not set.
$
On most keyboards, the back quote character is displayed when you press the
"~" key without the SHIFT key. It is used to assign the result of a command to a
variable. If you want to assign your login name to a variable NAME, you can use
the following command.
$ NAME=`whoami`
$
$ echo $NAME
boota
$
You can also use the back quote character anywhere that you want to substitute
the result of a command. In the following example, it is used with echo command.
$ echo "My login name is `whoami`"
My login name is boota
$
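The same substitution works with any command that writes to its output. The following sketch is not from the original text; the variable names TODAY, MESSAGE, and BASE are illustrative, and the path /home/boota is reused from the sample output earlier in the chapter.

```shell
# Store the output of a command in a variable using back quotes.
TODAY=`date +%Y-%m-%d`

# Back quotes are also expanded inside double quotes.
MESSAGE="Report generated on $TODAY by `basename /home/boota`"
echo "$MESSAGE"

# The $(command) form does the same job in POSIX shells.
BASE=$(basename /home/boota)
echo "$BASE"
```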
Study Break
You will see a ">" symbol in the next line. You get back
neither the command prompt nor any other display. You
may be wondering what happened to the command.
Actually, you issued an incomplete command, missing the
closing double quotes. The symbol ">" is the value of the
PS2 environment variable that is displayed whenever you
You can play some tricks with the command prompt to display useful information.
For example, it is very handy if the prompt displays the current directory, as it can
be hard to remember which directory you are in. We can make a slight change to
PS1 to display the current directory path.
$ PS1="My Computer \$PWD =>"
My Computer /home/boota =>
As you can see, the prompt changes and the current directory is there. Now,
whenever you use the cd command to change the current directory, the PWD
environment variable changes, and your prompt also changes to show the new
current directory. Similarly, it is possible to have your login name and the
computer name in the command prompt along with the current directory, as
shown in this example.
$ PS1="`whoami` on `uname -n` \$PWD =>"
boota on myhp /home/boota =>
Note
This is very useful if you work in a network environment and often log
into different machines with different names. It is the command prompt
that tells you which machine you are on, which login name you are
using, and what the current directory is. Does this seem interesting to
you? You can also try to change your prompt to include the date and
time, blinking characters, and so on, depending on the type of terminal
you are using.
This is because, if you accidentally use a program or script name, the same as a
standard HP-UX command, your program will be executed instead of the HP-UX
command, as it will be found first by the shell.
1:
Just after login, you issue the command echo HOME. What will be the output
of this command?
2:
What is not true about the PATH variable?
3:
You have a variable "ABC" with value "Alphabets". You use the following
command to change its value.
ABC='All $ABC'
What will be the new value of the variable?
A. All $ABC
B. All Alphabets
C. All ABC
D. The variable value can't be changed with this command. It remains
"Alphabets".
4:
To assign the output of a command to a variable, we use:
A. double-quote characters
B. comma characters
C. single-quote characters
D. back-quote characters
5:
The value of the PS2 variable on your system is ">" (greater-than symbol).
You issue an incomplete command. What will be the shell's response?
A. The shell will issue an error message showing that the command is
incomplete.
B. The shell will display "PS2".
C. The shell will try to execute that part of the command that is supplied.
D. The shell will give a ">" prompt and wait for you to complete the
command.
6:
What is wrong with the shell variable name 3Xyab2?
Most UNIX commands are designed to take simple text (alphanumeric) data and
punctuation as input. Usually, the output is also of simple text. Whenever you
start a UNIX command, it opens three standard data streams: standard input
(stdin), standard output (stdout), and standard error (stderr). Every UNIX
command takes input data from stdin and sends its normal output to stdout and
error messages to stderr. These data streams are often called standard
input/output. UNIX associates numbers known as file descriptors with all open
files. File descriptor 0 is used with standard input, 1 with standard output, and 2
with standard error.
Standard input, usually the user keyboard, is normally the place where a program
reads its input from. Standard output, usually your terminal screen, is where the
results of a command or program are displayed. In normal cases, standard error
messages are also displayed on the terminal screen, but it is always possible to
separate stdout from stderr. The UNIX shell can redirect any of these streams to
a file, a device, or some other command, as required by the user. We call this
process I/O redirection. You studied one example of output redirection in Chapter
2, when you created a new file with the cat command. In its normal use, the cat
command reads from the keyboard (stdin) and writes to the terminal screen
(stdout). We used the ">" symbol to redirect output from the stdout to a file.
Similarly, when we displayed the contents of a file with the cat command, we
redirected input to the cat command from the keyboard (stdin) to the file. Figure
4-1 shows the standard location of input, output, and error for any UNIX
command or program.
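Input and output redirection can be combined on one command line. The sketch below assumes a scratch directory under /tmp and uses made-up file names (names, sorted) and sample login names.

```shell
workdir=${TMPDIR:-/tmp}/redir.$$
mkdir "$workdir" || exit 1

# ">" sends stdout to a file instead of the terminal screen.
printf 'linda\nboota\nmike\n' > "$workdir/names"

# "<" makes sort read stdin from the file instead of the keyboard,
# and ">" stores its stdout in a second file.
sort < "$workdir/names" > "$workdir/sorted"

first_name=`head -n 1 "$workdir/sorted"`
cat "$workdir/sorted"

rm -r "$workdir"
```

The sort command itself is unchanged; only the places where it reads and writes have been swapped by the shell.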
Another useful feature of UNIX is the pipe, with which we can send output of one
command to the input of another command. This is often used to process and
format data produced by a command and make it more understandable. Many
commands are used as filters in UNIX, which take input from a command, filter
the required data, and throw away the garbage. For example, the cat
/etc/passwd command displays the contents of the password file, but using a
This is another way of copying text files. As you go through the book,
you will find how versatile the UNIX commands are and how many
different ways these commands can be used. Until now, you have used
the cat command to create a new file, display contents of a file, and
copy a text file using redirection. The same command is used for other
purposes as well, and you will learn more uses of the cat command
later in this chapter.
As another example, consider the who command. We redirected its output to a file
with the name whofile. We can verify the contents of whofile with the more or
cat command.
If a file with the name file1 already exists, it will be overwritten by using
the above command without any warning.
Two or more files can be joined into a single file by the use of the cat command
and redirecting output to a file. Let us suppose there are three files in your home
directory, with the names file1, file2, and file3. If you use the cat command
with file1 and file2 as its arguments, it will show you the contents of file1 and
file2, respectively. What if we use the cat * command? It will display the
contents of all files in the directory. Now, by simply redirecting the output to
another file, the command will concatenate all of these files.
$ cat file1 file2 >file4
$
This command created file4, which contains the contents of both file1 and
file2. The following command creates file5, containing all files in the directory.
$ cat * >file5
$
Note
Another use of the cat command is joining two or more files.
Appending to a File
In the case of output redirection with the ">" symbol, the file to which we redirect
the output of a command is overwritten. It means that the previous contents of
the file are destroyed. We can use the double redirection symbol ">>" to keep the
previous contents of the file. In such a situation, the output of a command is
appended to the file. Consider the following example.
$ cat file1 >>file2
$
After this command, file2 still contains its old contents. In
addition, the contents of file1 are added to the end of file2. If file2
does not exist, it is created. This is a very useful feature and is used in many
situations. For example, if we want to check how many users are logged in every
hour, we can ask UNIX to run date and who commands every hour and redirect
(append) the output of both of these commands to a log file. The date command
will append the current date and time and the who command will append a list of
users. Later on we can view this log file to get the desired information.
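The logging idea described above could be sketched as follows. The log file location and the function name log_users are assumptions made for this illustration, and the hourly scheduling itself is left out.

```shell
# Hypothetical log file location.
LOGFILE=${TMPDIR:-/tmp}/userlog.$$

# Append a timestamp and the list of logged-in users to the log.
log_users() {
    date >> "$LOGFILE"
    who >> "$LOGFILE"
}

log_users
log_users    # a second call adds to the file instead of replacing it

entries=`grep -c . "$LOGFILE"`
cat "$LOGFILE"
rm -f "$LOGFILE"
```

Each call adds at least one line (the date), so the file grows with every run. With ">" in place of ">>", only the output of the last command run would survive.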
Sometimes you can use the same redirection method to print simple text files, if
the printer is directly connected to the HP-UX machine and you know the device
name for the printer.
When redirecting output, keep in mind that stderr is not redirected automatically
with the output. If the command you issue generates an error message, it will still
be displayed on your own terminal screen.
There is no special symbol for redirecting stderr. The same ">" symbol is used
but with the number 2 attached in front of it. If you remember from previous
pages, there are three file descriptors opened whenever a command is issued.
These file descriptors are shown in Table 4-1.
Table 4-1. Standard File Descriptors
File Descriptor Number Description
0 Standard input
1 Standard output
2 Standard error
We use "2>" for stderr redirection to tell the shell that we want to redirect the error
messages instead of stdout (for which the file descriptor value is 1). Consider the
following command.
$ ll xyz
xyz not found.
$
We tried to list a file with name xyz and the command result shows that this file
does not exist. This is an error message of the ll command. Now see the
following command.
$ ll xyz >abc
xyz not found.
$
We tried to redirect the output, but still the message is displayed on our screen.
The redirection had no effect because we are trying to redirect stdout while the
command is generating stderr messages. Now let us see what happens if we
change ">" to "2>".
$ ll xyz 2>abc
$
Now there is nothing displayed because the error message has been stored in a
file with name abc. You can use the cat command to verify that the error
message was indeed stored in the abc file.
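Both streams can be redirected in the same command line. The following is a minimal sketch under assumed names: it uses ls instead of ll, a scratch directory, an existing file called present, and a file xyz that is assumed not to exist.

```shell
workdir=${TMPDIR:-/tmp}/stderr.$$
mkdir "$workdir" || exit 1
cd "$workdir" || exit 1
echo "data" > present

# Normal output goes to out.txt and the error message to err.txt.
ls present xyz > out.txt 2> err.txt || echo "ls reported a problem"

# "2>&1" sends stderr to wherever stdout currently points,
# so both kinds of message end up in the same file.
ls present xyz > both.txt 2>&1 || echo "ls reported a problem"

out_lines=`grep -c . out.txt`
err_lines=`grep -c . err.txt`
both_lines=`grep -c . both.txt`
echo "stdout: $out_lines, stderr: $err_lines, combined: $both_lines"

cd / && rm -r "$workdir"
```

The order matters in the second command: "2>&1" must come after "> both.txt", because it copies whatever stdout refers to at that moment.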
the same file. Wait for five minutes and again use the date
and who commands and append their output to logfile.
Now use the cat command to display logfile. You will see
that it contains a line for time and date and then a list of
users who were logged in at that time. You can use this
technique with the UNIX scheduler (cron) to create a log for
a whole day and study it later.
Many times we don't need all of the output produced by a command. In such a
case, we can filter the desired information from the output produced by a
command. Filtering means extracting useful data and throwing away the rest. We
have already studied the who command, which is used to see the names of
logged-in users. In large systems, where hundreds of users are logged in
simultaneously, it is difficult to find out whether a particular user is currently
logged in. In this situation, we use the filter to get the desired information. We
can use the who command with the grep command, where grep acts as a filter.
Consider the next example, where we want to find if a user "mike" is logged in.
First we use only the who command and then we combine the who and grep
commands.
$ who
operator pts/ta Aug 30 16:05
boota pts/tb Aug 30 15:59
mike pts/tc Aug 30 15:44
linda pts/td Aug 30 14:34
$
Now we use a pipe to filter out our required information.
$ who | grep mike
mike pts/tc Aug 30 15:44
$
As you can see, only the line containing the word "mike" is now displayed. We
have used the grep command previously to find a string in one or more
files. The grep command, at that time, took its input from named files. In this example,
it did the same thing but took its input from the pipe.
How did grep know that no more data were coming from the pipe and that it
should stop processing? Well, this is quite simple. When the who command has
finished sending output, it closes its end of the pipe. The grep command then
detects the end of file (EOF) condition on its input and stops execution. In case
there are no data in the pipe and the pipe has not been closed, grep will just
wait until it gets more data or reaches EOF.
As another example, we can get only login names from the who command by
using another filter known as cut. We will discuss the cut command in more
detail in the last chapter, but for the time being just see how we use it to extract
the first word of each line and throw away the rest.
$ who | cut -f 1 -d " "
operator
boota
mike
linda
$
The cut command takes its input as fields separated by space characters and
picks the first field from each input line. Since the first field of all output lines is
the login name, we got the login names only in the output.
You can also use multiple levels of pipes as shown below.
$ who | cut -f 1 -d " "| grep mike
mike
$
Try to explain what is happening here. We have filtered the output of one
command and then again filtered the output of the second command. You can
continue this process as far as you want.
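To experiment with multiple pipes without depending on who is logged in, the who output shown above can be replaced with fixed sample lines. In this sketch the function name sample_who is an assumption, and the final stage uses the standard wc command to count lines.

```shell
# Sample lines in the same layout as the who output shown above.
sample_who() {
    printf '%s\n' \
        'operator pts/ta Aug 30 16:05' \
        'boota pts/tb Aug 30 15:59' \
        'mike pts/tc Aug 30 15:44' \
        'linda pts/td Aug 30 14:34'
}

# Stage 1 produces the lines, stage 2 keeps the first field,
# stage 3 keeps only lines containing "mike".
result=`sample_who | cut -f 1 -d " " | grep mike`
echo "$result"

# A different last stage counts the login names instead.
# tr removes the padding that some versions of wc add.
count=`sample_who | cut -f 1 -d " " | wc -l | tr -d ' '`
echo "$count users"
```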
As we have used pipes for filtering data, we can also use them for reorganizing
and manipulating data. What if you need to get output of a command in sorted
form? Yes, it is quite simple if you pass it through the sort command using a
pipe. Consider the above example of using the who command. See how the
output changes without and with a sort pipe.
$ who
operator pts/ta Aug 30 16:05
boota pts/tb Aug 30 15:59
mike pts/tc Aug 30 15:44
linda pts/td Aug 30 14:34
$
Now we use a pipe with the sort command.
$ who | sort
boota pts/tb Aug 30 15:59
linda pts/td Aug 30 14:34
mike pts/tc Aug 30 15:44
operator pts/ta Aug 30 16:05
$
The sort command has arranged the output of the who command in alphabetical
order. If there are many users logged in, the output of the who command just
scrolls up and you see only the last page. In that case, you can use the more
command as a filter to stop the scrolling at the end of each page.
$ who | more
Filters can do many things for you in a very simple way. If you were using some
other operating system, you might need to write separate programs!
The tee command is used to form a T-junction. It takes its input from stdin and
writes the same thing to stdout as well as to another file at the same time.
Consider the same example of the who command. If you want to display the
output of the who command at the terminal as well as save it in whofile for future
use, the command line and result will be as follows.
$ who | tee whofile
operator pts/ta Aug 30 16:05
boota pts/tb Aug 30 15:59
mike pts/tc Aug 30 15:44
linda pts/td Aug 30 14:34
$
Now if we see the contents of the whofile, it will contain the same data.
$ cat whofile
operator pts/ta Aug 30 16:05
boota pts/tb Aug 30 15:59
mike pts/tc Aug 30 15:44
linda pts/td Aug 30 14:34
$
Like ordinary pipes and redirection symbols, multiple levels of t-junction pipe can
be used to send data to many places. Can you use the sort or head commands
with the tee command now? How about using the spell command to check
spellings of a command output?
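One possible answer to the sort question is sketched below. It uses made-up login names in place of real who output, and the names of the function (names), the scratch directory, and the two files are assumptions.

```shell
workdir=${TMPDIR:-/tmp}/tee.$$
mkdir "$workdir" || exit 1

# Made-up login names standing in for real who output.
names() { printf '%s\n' operator boota mike linda; }

# The first tee saves the unsorted list and passes it on to sort.
# The second tee saves the sorted list and still passes it on.
sorted_list=`names | tee "$workdir/raw.txt" | sort | tee "$workdir/sorted.txt"`
echo "$sorted_list"

raw_first=`head -n 1 "$workdir/raw.txt"`
sorted_first=`head -n 1 "$workdir/sorted.txt"`
rm -r "$workdir"
```

Each tee captures the data as it looks at that point in the pipeline, so raw.txt and sorted.txt hold the same names in different order.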
Table 4-2 is a summary of the redirection and pipe symbols used in HP-UX.
Table 4-2. Standard I/O Redirection
Symbol   Function                                            Syntax
>        Redirect stdout and overwrite or create a file      prog > file
<        Redirect stdin                                      prog < file
>>       Redirect stdout and append to, or create a file     prog >> file
2>       Redirect stderr                                     prog 2> file
2>&1     Send stderr and stdout to the same file             prog > file 2>&1
|&       Pipe stdout and stderr of prog1 to stdin of prog2   prog1 |& prog2
1:
What is the file descriptor used for stderr?
A. 1
B. 0
C. 2
D. 3
2:
The symbol used to append to a file when redirecting stdout to that file is:
A. >
B. >>
C. <
D. 2>
3:
When you redirect both stdout and stderr to the same location, you use:
A. 2&>
B. 2&>1
C. 2>&1
D. 1>&2
4:
A pipe is used to:
5:
Which is not true?
Chapter Syllabus
Until now you have been creating new files but had no way to edit a file. This
chapter builds skill in using the vi editor, which is the most widely used editor on
UNIX. The vi editor is available with every distribution of the UNIX operating
system. Initially, all users of UNIX find the vi editor difficult to use. But as they
become used to it, they start to like it due to its simplicity and power. It is not like
common editors found on desktops with menus and screen displays. It gives you
the full screen for displaying text. The bottom line is mostly used for vi
commands when needed.
This editor is capable of handling multiple files simultaneously. All open files are
called buffers in vi terminology. You can perform routine editing tasks within one
buffer as well as across buffers. You can cut, copy, and paste text, search and
replace text, export and import text to and from other files, and spell check. In
addition, it is possible to configure the vi editor according to your requirements
using a configuration file.
In the beginning, it is difficult to remember vi commands because there are no
menus. As you continue using vi, you will soon remember these commands
because they are simple. I have seen many experienced UNIX users who use vi,
although other GUI-based editors are available, just because of its simplicity and
power. In addition to the use of vi for general file editing, it is also used for typing
email and editing the command line. You have already learned the EDITOR
environment variable in Chapter 3. If you set the value of this variable to vi, all of
your command line editing will use the same commands you use in vi.
This chapter does not explain all of the powers of the vi editor but provides
sufficient explanation for the commonly used features. You will learn most of the
file editing tasks in this chapter. We will start with vi modes and cursor
movement and then move toward other text editing features.
active buffer. The insert mode is also called the text entry mode.
When you start vi, it is in command mode. This means that whatever you type is
considered a command by the editor. You can switch to text entry mode or insert
mode by pressing i at any time in the command mode. After that, whatever you
type is inserted into the file buffer and is displayed on the terminal screen. A file
buffer is an area in memory where files are loaded and edited by vi. When you
are in insert mode, you can't issue any command until you return to the
command mode. To go back to command mode from insert mode, you press the
Esc key on your keyboard. During a file editing session, you may switch
between command mode and insert mode many times.
Note
If you are not sure at any time which mode you are in, just press the
Esc key once or twice. If you are in command mode, you will remain in
command mode and if you are in insert mode, you will go into command
mode.
The editor is started when you use a vi command and give a file name as its
argument. For example, if you want to edit a file with the name myfile, you will
use:
$ vi myfile
This command starts vi and allocates a memory buffer for file myfile. If the file
already exists, text in the file will be displayed on the screen. The bottom line,
which is used as a status line, will display a file name, line number, and number
of characters in the file. Remember that you can't add any text to the file until you
go into insert mode because vi started in the command mode. You will see
something like Figure 5-1 when you start vi.
changes to it with the :q command, it will refuse. The exclamation symbol "!"
forces vi to quit even if the file buffer is changed. You can add the exclamation
symbol to any command when you want to force vi to do something.
Note
Most vi beginners make mistakes while using vi, resulting in unexpected
changes to the file being edited. It is a good habit to use the :w command every
few minutes to save the file being edited. Then if you make a mistake and don't
understand what has happened, just quit vi with the :q! command and restart it
with the saved file. In this case, you will lose only the data typed after the last
save.
To move the cursor, you must be in command mode. On most of the modern
terminals, you can move the cursor with the arrow keys on your keyboard. If this
is not possible with your keyboard, you can move your cursor using other
commands, as mentioned in Table 5-1.
Table 5-1. Cursor Movement Commands
Command Effect
l Move one character right
h Move one character left
j Move one line down
k Move one line up
<space> Move one character right
G Go to last line of the file
nG Go to line number n in the file
$ Go to end of current line
^ Go to start of line
w Go to beginning of next word
b Go to beginning of previous word
e Move to end of word
H Go to first line of screen
M Go to middle line of screen
L Go to last line of screen
( Go to beginning of sentence
) Go to end of sentence
{ Go to beginning of paragraph
} Go to end of paragraph
Before using any of these commands, make sure that you are in the command
mode by pressing the Esc key. The G command is used to go to a line number
in the file. If you want to go to the start of the file, use 1G. To go to end of file, use
G. To go to line number 100, use 100G. You can press Ctrl-G at any time to
find out which line you are on. You can also instruct the vi editor to display the
line number with each line using the :set number command.
The start and end of a sentence are considered with reference to the dot "."
character. The start and end of a paragraph are indicated with reference to a
blank line. In other words, text within two blank lines is considered to be a
paragraph.
We can combine vi commands. For example, $ is used to go to the end of a line,
and G is used to go to the last line of the file. We can combine these two
commands as G$ to go to the end of the last line of a file. Other commands can be
combined in a similar fashion.
Text insertion takes place only when you are in insert mode. Text deletion tasks
are performed in command mode. You already know how to switch between
these two modes. Here we will introduce some more commands to switch to
insert mode from command mode.
When you use the i command to go to insert mode, new text is entered where
the cursor is at the time you press the character i. When you insert new text, the
previous text on the right-hand side of the cursor moves toward the right. If you
use the a command to go to insert mode, the new text entry takes place after the
current position of the cursor. You can use the I command to start entering new
text at the beginning of a line and the A command to start entering at the end of a
line. If you want to enter text at the end of the file, you can combine the G and A
commands for this purpose. See Table 5-2 for a list of text entry commands.
Table 5-2. Text Insertion Commands
Command Effect
i Start inserting text at the current cursor location.
I Start inserting text at the beginning of the current line.
a Start inserting text at the next character position relative to the current
cursor location.
A Start inserting text at the end of the current line.
o Append a blank line just below the current line and start inserting text from
the beginning of that line.
O Append a blank line just above the current line and start inserting text from
the beginning of that line.
To delete text, press the Esc key to go to command mode and then use any of the
commands described in Table 5-3.
Table 5-3. Text Deletion Commands
Command Effect
x Delete character at current cursor location.
nx Delete n characters starting at current cursor location.
X Delete previous character from the current cursor location.
nX Delete n previous characters from the current cursor location.
dd Delete current line.
db Delete previous word.
dw Delete from current cursor location to the end of word.
dG Delete to the end of file including current line.
In addition to inserting and deleting text, you can also replace existing text with
new text. You can replace a single character or many lines of the text. The r
command is used to replace a single character. When you press r while in
command mode, nothing happens, but as soon as you press the next character,
it appears on the screen replacing the character at the current cursor location. To
replace one character and then go to insert mode, use the s command. You will
see a $ symbol and vi will put you in the insert mode just after replacing the
current character. You can also replace multiple characters with the R command.
These and other replacement commands are listed in Table 5-4.
Table 5-4. Text Replacement Commands
Command Effect
r Replace current character remaining in command mode.
s Replace current character and go to insert mode.
R Replace multiple characters until the Esc key is pressed.
cw Change to the beginning of next word.
cc Change entire line.
cG Change to the end of file.
Undo and Redo
If you make a change by mistake, you can always undo that change. You use the
u command to undo the last change. You can also use the U command to undo
all changes made to the current line. To redo something changed by undo, you
can use the "." (dot) command. Undo and redo are opposite to each other.
You can scroll text in both the upward and downward directions. Table 5-5 shows
the commands to scroll text. Before using any command, press the Esc key to go to
command mode.
Search and replace is a necessary editing feature found in all good editors. If you
want to find a text pattern in vi, you can use the /text command, where text is
the string you want to search. This command searches the text in the forward
direction from the current cursor position. You can search in the backward
direction if you replace / with ? in the command. To repeat the search once you
find a string, just use / or ? without typing the string again.
You can also replace text after a search is made. For example, if you want to
replace the word "Atlanta" with "Chicago," you can use the command
:s/Atlanta/Chicago/ to replace the first occurrence of "Atlanta" on the current
line with "Chicago." You can apply the search-and-replace feature to a range of
lines or to the whole file. Table 5-6 shows a summary of search-and-replace
commands used in vi.
Table 5-6. Text Searching and Replacing Commands
Command                 Effect
/text                   Search text in forward direction starting from current cursor location.
?text                   Search text in backward direction starting from current cursor location.
/                       Repeat previous search in forward direction.
?                       Repeat previous search in backward direction.
n                       Repeat search in the same direction.
N                       Repeat search in the opposite direction.
:s/oldtext/newtext      Replace the first occurrence of oldtext on the current line with newtext.
:m,ns/oldtext/newtext   Replace the first occurrence of oldtext on each line from line m to line n with newtext.
:%s/oldtext/newtext/g   Replace every occurrence of oldtext in the entire file with newtext.
/<space>text            A space between the / and the text is part of the pattern, so only text that follows a space (the start of a word) is matched.
/^text                  Search text only in the beginning of a line.
/text$                  Search text only in the end of a line.
/(More than one word)   Use parentheses to search multiple words.
Some characters have special meaning in vi search patterns. For example, the $ character is
used to show the end of a line. If you want to search for the $ sign in vi, you need to
put a backslash (\) in front of it to tell vi that it should be considered an ordinary
character and no special meaning of $ should be taken. You will use "/\$" to
search for the $ character in a file. Similarly, other special characters (^, *, /, .)
must also be escaped.
You have already seen commands used to delete text, such as dd and dw. These
commands cut the text and put it on a cut buffer. Text from the cut buffer can be
pasted anyplace using the p command. The text that you want to copy from one
place and paste at another is yanked (copied) first. We use the yy command to
yank one line of text. You can also yank multiple lines of text by using the nyy
command where n is the number of lines starting from the current cursor position.
To paste the text at a new place in the file, move the cursor to that place and use
the p command to place the text after the cursor position. You can also use the P
command to paste the text before the cursor position.
The cut-paste combination is the same as moving text from one place to another.
You can move text with the m command. The m command moves one line of text
from the current cursor position to a new position given to m. For example, m 7
will move the current line to just after line number 7. You can also move multiple lines of
text with the m command, for example, "1,15m76" will move lines 1 to 15 and
paste them after line number 76.
The line number method can also be used to copy and paste text. As an
example, "7,23t55" will copy lines 7 to 23 and will paste these lines after line
number 55. To remind you, you can use the :set number command to see line
numbers with each line. Table 5-7 shows commands related to cut, copy, and
paste.
Table 5-7. Cut, Copy, and Paste Commands
Command Effect
yy Copy or yank current line.
nyy Copy n lines starting from current line position.
p Paste yanked text after the current cursor position.
P Paste yanked text before the current cursor position.
:m a Move current line and paste after line number a.
:a,bmc Move lines from a to b and paste after line number c.
:a,btc Copy lines from a to b and paste after line number c.
The vi editor lets you deal with files in many ways. We can open multiple files,
copy text from an opened file to save it to another file on disk (export), or insert a
disk file into the editor at a specific location (import).
Importing a Disk File into Editor
To insert a disk file into a location in the opened file, we use the :r filename
command. The filename is the name of a file, with full path. If the path is not
specified, vi searches for the file in the current directory. For example, if you
have opened a file file1 in the editor and want to insert another file file2 after
line number 8, you first move the cursor to line number 8. Then you issue the :r
file2 command. All contents of file file2 are inserted into the editor after the
current line.
You can export any number of lines so that they are saved as a new file on the
disk. We use the w command for this purpose. As an example, if you have
opened file1 in the editor and want to save lines 3 to 47 as file3, you can use
the command :3,47w file3.
If you want to save the opened file as a new file (to make a backup), the same
command can be used without line numbers. To save the current file as file4,
you use :w file4.
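If you want the same result from the shell prompt instead of from inside vi, a rough counterpart can be sketched with sed and cp. This is only an illustration under assumed conditions: it builds a 50-line sample file in a scratch directory, and the variable names are hypothetical. The file names file1, file3, and file4 follow the text above.

```sh
# Work in a scratch directory so no real files are touched.
workdir=$(mktemp -d)

# Build a 50-line sample file (file1 is the name used in the text).
i=1
while [ "$i" -le 50 ]; do
    echo "line $i"
    i=$((i + 1))
done > "$workdir/file1"

# Shell counterpart of :3,47w file3: print only lines 3 to 47.
sed -n '3,47p' "$workdir/file1" > "$workdir/file3"

# Shell counterpart of :w file4: copy the whole file.
cp "$workdir/file1" "$workdir/file4"

exported_lines=$(wc -l < "$workdir/file3" | tr -d ' ')
first_exported=$(sed -n '1p' "$workdir/file3")
copied_lines=$(wc -l < "$workdir/file4" | tr -d ' ')
echo "file3 has $exported_lines lines, starting with: $first_exported"

rm -rf "$workdir"
```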
You have already seen how you can tell vi to display line numbers with every
line. For this you used the command :set number. A list of vi options is shown in
Table 5-8.
Table 5-8. vi Options
Option Effect
:set number Sets line number in front of each line in vi.
:set all Lists all available options.
:set autoindent The next line is indented the same number of characters as the
current line.
:set readonly Sets the current file as read-only. No change is saved.
:set wrapmargin=n Sets the right wrap margin equal to n. If we are using 80-column
display and the wrap margin is set to 6, every line will be wrapped to
the next line after 74 characters.
:set showmode Shows the user when the user is in "insert mode," "replace one
character mode," or "replace mode."
You can also use abbreviations for these commands. For example, you can use
:se nu instead of :set number.
The vi editor has a configuration file with the name .exrc, which is stored in the
home directory of each user. You can put vi options in this file to make the
settings permanent. After creating this file, whenever you start the editor, it will
first read the $HOME/.exrc file and set the options automatically. For example, if
you are writing C programs, you may want to set autoindent and number every
time you start editing a C program. You can create an .exrc file with the following
two lines in it.
se nu
se ai
Now these two settings are permanent until you make a change in the
$HOME/.exrc file manually.
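As a slightly larger illustration, the sketch below writes an .exrc that combines several of the options from Table 5-8. It is an example only: the file is created in a scratch directory rather than in your real home directory, the choice of options and the wrap margin value are assumptions, and the variable names are hypothetical.

```sh
# Write a sample .exrc into a scratch directory instead of the real $HOME.
scratch_home=$(mktemp -d)

cat > "$scratch_home/.exrc" <<'EOF'
" Lines starting with a double quote are comments.
set number
set autoindent
set showmode
set wrapmargin=6
EOF

# Count the option lines that were written, then show the file.
option_count=$(grep -c '^set ' "$scratch_home/.exrc")
cat "$scratch_home/.exrc"

rm -rf "$scratch_home"
```

Note that the commands in .exrc are written without the leading colon that you type in the editor.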
1:
You have made changes to a file and want to quit vi without saving these
changes. What command will you use?
A. :wq
B. x
C. q!
D. w!
2:
You want to replace cat with dog in your file at all locations in the vi editor.
You use:
A. /s/cat/dog
B. s/cat/dog/
C. s/cat/dog/g
D. s/dog/cat/g
3:
While in command mode you press "a"; what happens?
All human languages have idioms and phrases. These are made up of
combinations of words not used in their ordinary meanings. Regular expressions
can be considered as idioms of the UNIX shell. These are used for string pattern
matching in many UNIX commands. As idioms and phrases convey a handful of
meanings in few words, regular expressions are also very useful where you need
to match complex text patterns and ordinary methods are just not applicable.
Regular expressions consist of strings of characters, position specifiers or anchor
characters, and meta characters that have special meanings. Each regular
expression is expanded into its meaning before the UNIX shell executes a
command containing a regular expression. Before we actually use regular
expressions in this chapter, we will start with the command execution process.
We will then discuss basic meta characters used in regular expressions. You will
learn the use of regular expressions with some simple commands. At the end of
the chapter, you will be able to use regular expressions to search and replace
character strings in files and in stdin and stdout.
All HP-UX commands consist of two basic parts. The first one is the command
name and the second part consists of options and arguments. Before executing a
command, the shell looks for a valid command in the path specified by the PATH
variable. If it finds an executable command, it checks for any meta characters or
position specifiers used in the arguments. These meta characters and position
specifiers are discussed later in this chapter. If the shell finds any of these
characters in the arguments, it starts expanding the argument according to
predetermined rules. After expansion, the shell then passes the arguments to the
command and invokes it for the execution process. The shell then displays any
output or error message generated by the command on the user terminal. It also
checks to see if the command execution was successful and keeps a record until
a next command is executed.
The command execution process is completed in the following steps.
1. The shell looks for a valid command by searching all directories specified
by the PATH variable.
2. Options and arguments are parsed and arguments are expanded
depending on the special characters used.
3. The command is invoked.
4. The results of the command are displayed back to the user.
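The steps above can be observed from the shell itself. The following is a minimal sketch, assuming a POSIX shell; the scratch directory, the file names a1, a2, and b1, and the variable names are made up for the example.

```sh
workdir=$(mktemp -d)
touch "$workdir/a1" "$workdir/a2" "$workdir/b1"

# Step 1: ask the shell which file it would run for a command name.
ls_path=$(command -v ls)

# Step 2: the shell expands a* before the command ever sees it.
expanded=$(cd "$workdir" && echo a*)

# Steps 3 and 4: the command is invoked and its output is displayed.
(cd "$workdir" && ls a*)

# The shell keeps the exit status of the last command in $?.
ls "$workdir/no_such_file" 2>/dev/null
status=$?

echo "ls is $ls_path; a* expanded to: $expanded; failing ls returned $status"
rm -rf "$workdir"
```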
Position specifiers are characters that are used to specify the position of text
within a line. Sometimes these are also called anchor characters. The caret
character (^) is the starting position specifier. It is used to match a text string
occurring at the start of a line of text. The dollar sign ($) is the end-position
specifier and is used to refer to a line that ends with a particular string.
Table 6-1 shows the uses of position specifiers.
Table 6-1. Uses of Position Specifiers
Use of $
The dollar sign $ is used to match a string if it occurs at the end of a line.
Consider a file with the name myfile having contents as shown below after using
the cat command.
$ cat myfile
Finally I got it done. The procedure for adding a
new template is completed in three steps.
IMPORTANT
===========
The template will not work if the node name specified
in it is unknown to ITO server. In our template we
specified batch_server which was unknown to ITO server
node name in the template. Finally I got out the node
name which is more convenient as ITO automatically takes
current node name if the name is not specified in the
template.
Template Options
===============
1- It runs every minute. Scans the file only if it is
modified.
2- User initiated action is specified to run restart.
3- A short instruction is provided to run the script.
It needs to be modified to make more meaningful.
$
Let us use the grep command to find all lines in the file that contain the word
node.
$ grep node myfile
2- Assign this template to a node with this procedure.
message browser showing that update process on the node
The template will not work if the node name specified
node name in the template. Finally I got out the node
current node name if the name is not specified in the
$
You found out that there are five lines in the file containing the word node. Now
let us find only those lines that end with this word by using the $ position
specifier.
$ grep node$ myfile
message browser showing that update process on the node
node name in the template. Finally I got out the node
$
The position specifiers can be used with any command that deals with text-type
data.
Use of ^
The caret character (^) matches a string at the start of a line. Using the same
example of finding the word node, now at the start of a line, enter the following
command and watch the result.
$ grep ^node myfile
node name in the template. Finally I got out the node
$
As another example, you can list all users on your system with login names
starting with the letter "m" as follows.
$ grep ^m /etc/passwd
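Both position specifiers can be compared side by side on a small file. This is a sketch with made-up sample lines and hypothetical variable names; grep -c is used so that only the number of matching lines is reported. The patterns are placed in single quotes so that the shell passes $ and ^ to grep unchanged.

```sh
sample=$(mktemp)
cat > "$sample" <<'EOF'
node name in the template
update process on the node
the node name is not specified
EOF

contains=$(grep -c 'node' "$sample")    # lines containing node anywhere
at_end=$(grep -c 'node$' "$sample")     # lines ending with node
at_start=$(grep -c '^node' "$sample")   # lines starting with node

echo "anywhere: $contains, at end: $at_end, at start: $at_start"
rm -f "$sample"
```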
Sometimes the actual string contains one of the position specifiers or meta
characters. If you pass this string as-is to a command, the shell will expand the
meta character to its special meaning, and you will not get correct results. To
instruct the shell not to expand a character to its special meaning, you need to
escape that character. For this purpose, you use a backslash (\) before the
character. For example, if you want to search for the $ character in a file, you will
use the grep '\$' command instead of grep $. The single quotes keep the shell from
removing the backslash before grep sees it. If you don't escape the $ character,
this command will display all contents of the file.
Please note that \ is also a special character. To match a backslash, you need to
use two backslashes \\ in the string.
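The effect of escaping is easiest to see by counting matches. The sketch below uses a made-up three-line file and hypothetical variable names. Each pattern is enclosed in single quotes, which is an assumption about how you invoke grep: without the quotes the shell would consume the backslash before grep receives the pattern.

```sh
sample=$(mktemp)
cat > "$sample" <<'EOF'
price is $5
no special characters here
a back\slash
EOF

# Unescaped $ is the end-of-line anchor, so every line matches.
every_line=$(grep -c '$' "$sample")

# \$ inside single quotes reaches grep intact and matches a literal $.
dollar_lines=$(grep -c '\$' "$sample")

# Two backslashes match one literal backslash.
backslash_lines=$(grep -c '\\' "$sample")

echo "unescaped: $every_line, escaped dollar: $dollar_lines, backslash: $backslash_lines"
rm -f "$sample"
```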
Meta characters are those that have special meaning when used within a regular
expression. You already have seen two meta characters used as position
specifiers. A list of other meta characters and their meanings is shown in Table
6-2.
Table 6-2. Meta Characters Used in Regular Expressions
Character Description
* Matches any number of occurrences of the preceding character, including zero.
. Matches any character, one at a time.
[] One of the enclosed characters is matched. The enclosed characters may be a
list of characters or a range.
\{n1,n2\} Matches minimum of n1 and maximum of n2 occurrences of the preceding
character or regular expression.
\< Matches at the beginning of the word.
\> Matches at the end of the word.
\ The character following acts as a regular character, not a meta character. It is
used for escaping a meta character.
Use of the Asterisk * Character
including zero. Now each word that does not start with the letter "m" is also
matched because it has zero occurrences of "m". So one must be careful when
using the asterisk (*) character in regular expressions.
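The zero-occurrence behavior can be checked with a short experiment. This is a sketch with made-up sample words and hypothetical variable names; it compares m* with two patterns that require at least one "m".

```sh
sample=$(mktemp)
cat > "$sample" <<'EOF'
message
node
hammer
EOF

# m* allows zero occurrences of m, so every line matches.
star_only=$(grep -c 'm*' "$sample")

# mm* requires at least one m somewhere on the line.
at_least_one=$(grep -c 'mm*' "$sample")

# ^m.* matches only lines that start with m.
starts_with_m=$(grep -c '^m.*' "$sample")

echo "m*: $star_only, mm*: $at_least_one, ^m.*: $starts_with_m"
rm -f "$sample"
```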
The dot character matches any character excluding the new line character, one
at a time. See the example below where we used the dot to match all words
containing the letter "s" followed by any character, followed by the letter "e".
$ grep s.e myfile
new template is completed in three steps.
If step 3 is successful, a message appears on ITO
The template will not work if the node name specified
specified batch_server which was unknown to ITO server
current node name if the name is not specified in the
1- It runs every minute. Scans the file only if it is
2- User initiated action is specified to run restart
$
In every line shown above, there is a word containing an "s" followed by another
character and then "e". The second-to-last line is of special interest, where this
letter combination occurs when we combine the two words "runs every." Here
"s" is followed by a space and then an "e".
Use of Range Characters [...]
Consider that you want to list all files in a directory that start with the letters a, b,
c, d, or e. You can use a command such as:
$ ls a* b* c* d* e*
This is not convenient if this list grows. The alternate way is to use a range
pattern like the following.
$ ls [a-e]*
Square brackets are used to specify ranges of characters. For example, if you
want to match all words that contain any of the capital letters from A to D, you can
use [A-D] in the regular expression.
$ grep [A-D] myfile
1- Create a new template.
2- Assign this template to a node with this procedure.
Action -> Agents -> Assign Templates -> Add -> Enter
3- After assignment, the template is still on the ITO
Action -> Agents -> Install/Update SW & Config ->
IMPORTANT
3- A short instruction is provided to run the script.
$
Similarly, if you need to find words starting with lowercase vowels, [aeiou] will
serve the purpose. If such words are desired to be at the beginning of a line, we
can use ^[aeiou]. Multiple ranges can also be used, such as ^A[a-z0-9], which
matches words that are at the start of a line, have "A" as the first character, and
either a lowercase letter or a number as the second character.
The selection criteria can also be reversed using ^ as the first character within
the square brackets. An expression [^0-9] matches any character other than a
number.
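The dot and bracket patterns discussed so far can be tried together on a small file. The following is a sketch only: the sample lines and variable names are invented, and the C locale is set explicitly on the assumption that you want ranges such as [a-z] to follow plain ASCII order.

```sh
# Use the C locale so that ranges such as [a-z] follow plain ASCII order.
LC_ALL=C
export LC_ALL

sample=$(mktemp)
cat > "$sample" <<'EOF'
apple pie
A1 steak sauce
Adam
12345
under
Bread
It runs every minute
EOF

dot_matches=$(grep -c 's.e' "$sample")                  # s, any character, e
vowel_start=$(grep -c '^[aeiou]' "$sample")             # line starts with a lowercase vowel
a_then_lower_or_digit=$(grep -c '^A[a-z0-9]' "$sample") # A, then lowercase letter or digit
has_capital_a_to_d=$(grep -c '[A-D]' "$sample")         # any capital letter A through D
has_non_digit=$(grep -c '[^0-9]' "$sample")             # any character that is not a digit

echo "$dot_matches $vowel_start $a_then_lower_or_digit $has_capital_a_to_d $has_non_digit"
rm -f "$sample"
```

Only the line 12345 fails the [^0-9] test, because it contains nothing but digits.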
These two sets of meta characters can be used to match complete words. The \<
character matches the start of a word and \> checks the end of a word. Without
these meta characters, all regular expressions match a string irrespective of its
presence in the start, end, or middle of a word. If we want to match all
occurrences of "this" or "This" as a whole word in a file, we can use the
following grep command.
$ grep '\<[tT]his\>' myfile
If you use \< only, the pattern is matched if it occurs in the start of a word. Using
only \> matches a pattern occurring in the end of a word.
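The difference between the two word anchors shows up clearly when matches are counted. This sketch assumes that your grep supports \< and \> (the grep described here and GNU grep do); the sample lines, including the made-up word "mathis", and the variable names are hypothetical. The patterns are quoted so that the shell passes the backslashes through to grep.

```sh
sample=$(mktemp)
cat > "$sample" <<'EOF'
This is it
thistle
do this now
mathis
EOF

anywhere=$(grep -c '[tT]his' "$sample")        # no anchors: matches inside words too
word_start=$(grep -c '\<[tT]his' "$sample")    # must begin a word
word_end=$(grep -c '[tT]his\>' "$sample")      # must end a word
whole_word=$(grep -c '\<[tT]his\>' "$sample")  # must be a complete word

echo "anywhere: $anywhere, start: $word_start, end: $word_end, whole: $whole_word"
rm -f "$sample"
```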
6.4 Standard and Extended Regular Expressions
start with the words "From:" and "Date:". Using a standard regular expression it
would be very difficult to extract this information. The egrep command uses an
extended regular expression as opposed to the grep command that uses
standard regular expressions. If you use parentheses and the logical OR
operator (|) in extended regular expressions with the egrep command, the
above-mentioned information can be extracted as follows.
$ egrep '^(From|Date):' $HOME/mbox
Note that we don't use \ prior to parentheses in extended regular expressions.
You may think that this task can also be accomplished using a standard regular
expression with the following command; it might seem correct at the first sight but
it is not.
$ grep '[FD][ra][ot][me]:' $HOME/mbox
This command does not work because it will also match "Fate," "Drom,"
"Droe," and so on.
Extended regular expressions are used with the egrep and awk commands.
Sometimes it is more convenient to use standard expressions. At other times,
extended regular expressions may be more useful. There is no hard and fast rule
as to which type of expression you should use. I use both of these and
sometimes combine commands using both types of expressions with pipes to get
a desired result. With practice you will come to know the appropriate use.
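The over-matching of the bracket version can be demonstrated on a tiny mailbox-like file. This is a sketch with invented header lines and hypothetical variable names. It uses grep -E, which accepts the same extended regular expressions as egrep; that substitution is an assumption made for portability, not something the text above prescribes.

```sh
mbox=$(mktemp)
cat > "$mbox" <<'EOF'
From: boota
Date: Sep 8
Fate: not a header
Subject: hello
EOF

# Extended expression: only lines starting with From: or Date: match.
ere_matches=$(grep -E -c '^(From|Date):' "$mbox")

# Bracket version: any mix of the listed letters matches, including Fate:.
bracket_matches=$(grep -c '^[FD][ra][ot][me]:' "$mbox")

echo "extended: $ere_matches, brackets: $bracket_matches"
rm -f "$mbox"
```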
A. *
B. \
C. $
D. -
Chapter 7. File Permissions
Chapter Syllabus
In a multiuser operating system like UNIX, many people are using system
resources including disks and files. It is important to keep individual user files
secure so that other users are not able to read, modify, or delete them. At the
same time, users may be working on the same project and need to share files
among themselves. It becomes important that a user be able to grant read or
write access to some of the other users. If a group of programmers is working on
some software development project, they need to share code and information.
They also need to protect this information from people outside the group.
UNIX implements a file security policy by dividing users into three categories.
These are the owner of a file, the group of users to which the owner of a file
belongs, and all other users of the system. Files and directories can be granted
read, write, or execute permissions to one or more user groups. In addition to this
file permission scheme, HP-UX also implements another scheme, which is
known as access control lists (ACLs). Using an ACL, individual users can also be
granted specific permissions in addition to group permissions.
In this chapter, we will go through some basic concepts of UNIX file permissions.
You will learn categories of UNIX users and how file permissions are managed
among them. You will study types of file permissions and how to change them.
Here some commands to change or modify file permissions will be introduced. All
users can't use all of these commands, and you will see who is permitted to do
what. There is a set of default file permissions, and you will see how to deal with
it. Many times you need to change the owner or group of a file, and you will learn
some commands to do so. You will see what the special file permissions are and
how these affect system security. Then there will be a method to find files with a
particular set of permissions. In the end, ACLs will be introduced and a method
for listing and changing the ACL associated with a file will be presented.
After going through this chapter, you will be able to list and modify file and
directory permissions and ACLs. You will also be able to find potential security
problems related to wrong file permissions.
The users on a UNIX system are divided into groups. Every user must be a
member of one of the groups. The group information is maintained in the
/etc/group file. Although a user may be member of more than one group, he or
she has a primary group membership. All other groups are secondary groups for
a particular user. The file security and permission system is designed on the
group information. A user who creates a file is the owner of that file. The owner of
a file has the privilege to assign or revoke file permissions to other users. The
owner can assign any permissions to the members of any group of which the
owner is a member. All other users of the system who don't belong to this group
are considered "others" in UNIX terminology. Sometimes these "other" users are
also called "world," and permissions granted to them are "world" permissions.
From this discussion, we conclude that there are three types of users in UNIX.
They are:
1. the owner
2. the group
3. others
Every file and directory has a permission set that tells which user has what
permission. No one except the owner or the superuser can alter this permission
set.
The division of users in this scheme provides a lot of convenience for securing as
well as sharing files among different users. Multiple group membership for a user
is also very useful. For example, an accounts manager may be a member of the
accounts group and the managers group. The accounts manager can then share
files related to management tasks with other managers, but the files are still
protected from other members of the accounts group. Similarly, there may be an
applications group, members of which may be able to run or stop applications on
the system. Some of these members may also be assigned some of the system
administration tasks and placed in a special group created by the system administrator.
Every file in UNIX has one or more of three types of permissions. A file may be
readable, writable, or executable. A user who has read-only permission for a
particular file can only read it and is not able to change its contents or to delete
the file. A write permission allows a user to modify the file or delete it. As we
mentioned in Chapter 2, there are no special names for executable files in UNIX.
Any file can be executable if a user has permission to execute it and if it is a valid
executable file or shell script. If a user compiles a program but nobody has the
right to execute it, it will not be executed. On the other hand, if you assign
execute permission to a file that does not contain anything to execute, UNIX still
tries to execute and displays an error message.
File permissions can be displayed using the ll (or ls -l) command in HP-UX.
This command shows the long listing of files. The first field consists of 10
characters. The first character shows the type of the file. File types are explained
in Table 7-1. The next three characters (2nd to 4th) show permissions for the
owner of the file. These permissions are always listed in rwx format. In rwx
format, "r" represents read permission, "w" is used for write permission, and "x" is
for execute permission. If one of these characters is present, it shows that the
permission is granted. If the character is not there, a hyphen "-" is placed,
showing that the permission is withheld. After the file owner permissions, the next
combination of rwx shows permissions of the group to which the owner belongs.
The last combination of rwx is for all other users of the UNIX system.
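To make the layout of the 10-character field concrete, the sketch below splits a sample mode string into its four parts with the cut command. The mode string and the variable names are chosen for illustration only; in practice the string would come from the first field of an ll or ls -l listing.

```sh
# A sample first field from a long listing (hypothetical value).
mode='-rwxr-x---'

file_type=$(printf '%s\n' "$mode" | cut -c1)       # 1st character: file type
owner_perms=$(printf '%s\n' "$mode" | cut -c2-4)   # 2nd to 4th: owner
group_perms=$(printf '%s\n' "$mode" | cut -c5-7)   # 5th to 7th: group
other_perms=$(printf '%s\n' "$mode" | cut -c8-10)  # 8th to 10th: others

echo "type=$file_type owner=$owner_perms group=$group_perms others=$other_perms"
```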
Table 7-1. File Types as Displayed by the ll Command
The next file in the listing is a directory with the name usage. We came to know
that this is a directory by looking at the first character of the line, which is "d". You
can see from the listing that user root is the owner of this directory and that it
belongs to group sys. Everyone has read, write, and execute permissions for this
directory. Then we have a file qwe and we can see that other users have no
permission for this file.
Read permission is the basic file permission. Other permissions are not of much
use if someone doesn't have the read permission. If a regular user has no read
permission for a file but has execute permission, he or she can't execute the file,
as you need to read the file before executing it. Also, having write permission
without a read permission will allow a user to delete the file. The user will not be
able to edit the file because you need to read the file before you edit it.
For example, prog2 is a program file having the following permissions. The owner
can't execute it because the read permission is denied.
---xrw-rw- 1 boota users 14 Sep 3 18:28 prog2
If the owner of the file tries to execute the file, the following message appears.
$ prog2
sh: prog2: Cannot find or open the file.
$
As you can see from the file permissions for the owner (--x), although the owner
has execute permission, the shell is not able to read the program file. And if you
can't read a program, how can you execute it?
In UNIX, the permissions are read from left to right. It means that the first allow or
deny that comes to your path is applied. For example, if the owner of a file
doesn't have execute permission to a file and the group has the execute
permission, the owner will not be able to execute the file. This is because, first of
all, owner permissions were analyzed by the shell and it came to know that the
execute permission is revoked. The shell did not go further in checking the group
or other permissions and stopped any further action on the file with an error
message. Consider the same program we used in the previous example with
permission set as shown below.
-rw-rwxrwx 1 boota users 14 Sep 3 18:28 prog2
Here the owner has read and write permission but all others have execute
permission in addition to read and write. When the owner executes the program,
the following message appears.
$ prog2
sh: prog2: Execute permission denied.
$
The UNIX way of interpreting file permissions is different from some other
operating systems that analyze all permissions applied to a user and take the
most allowed or most restricted permissions set.
Directory Permissions
• If read permission for a directory is not set, no file inside the directory can be
listed or accessed.
• If execute permission of a directory is not set, files inside the directory can be
listed with names only. This means ls will work but ll will not work. Also, no
files inside the directory can be read or executed. Because of this, the
execute permission for a directory is also called list permission.
• If only execute permission is set and read or write permissions are not set, a
user can go into the directory with the cd command and execute a program
inside the directory if the program name is known. Also, a file can be viewed
with the cat command if the file name is already known. It means you can
execute programs but can't see the files.
• If a directory contains a file that a user can't delete (no write permission), he
or she can't delete the directory even though write permission is granted for
the directory.
There are four user modes that can be used with the chmod command. We
represent different users with these modes. Also, there are file modes which are
represented with characters we have studied earlier. User and file modes are
listed in Table 7-2.
Table 7-2. User and File Modes
Mode Meaning
u User or owner of a file
g Group membership of the file
o Others (not owner or member of group)
a All users
r Read permission
w Write permission
x Execute permission
To grant permission to a user, we use the "+" symbol between the user and file
modes. To revoke a permission, use "-" between the user and file modes. To
exactly assign a permission, regardless of the previous permission, we use the
"=" symbol.
For example, to grant execute permission to the owner of file myprog, the chmod
command will be used as follows. We have used the ll command to show old
and new file permissions.
$ ll myprog
-rw-rw-rw- 1 boota users 103 Sep 8 18:06 myprog
$ chmod u+x myprog
$ ll myprog
-rwxrw-rw- 1 boota users 103 Sep 8 18:06 myprog
$
To revoke write permission of other users, we can use the following command.
$ ll myprog
-rwxrw-rw- 1 boota users 103 Sep 8 18:06 myprog
$ chmod o-w myprog
$ ll myprog
-rwxrw-r-- 1 boota users 103 Sep 8 18:06 myprog
$
Now the owner has all read, write, and execute permissions, members of the
users group have read and write permissions, and all other users have only read
permissions. If you want to grant all users only read permissions, you can use the
following command.
$ ll myprog
-rwxrw-r-- 1 boota users 103 Sep 8 18:06 myprog
$ chmod a=r myprog
$ ll myprog
-r--r--r-- 1 boota users 103 Sep 8 18:06 myprog
$
As you have seen, when we use the "=" symbol, the new file permissions are set
regardless of the previous permissions.
You can also combine more than one mode in the command line to change
multiple permissions simultaneously, as in the following example.
$ ll myprog
-r--r--r-- 1 boota users 103 Sep 8 18:06 myprog
$ chmod u+x,o-r myprog
$ ll myprog
-r-xr----- 1 boota users 103 Sep 8 18:06 myprog
$
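The whole sequence of symbolic changes can be replayed in one short script. This is a sketch that works on a scratch copy named myprog in a temporary directory; the helper show_mode and the variable names are hypothetical, and ls -l is used in place of ll on the assumption that ll may not exist on every system.

```sh
workdir=$(mktemp -d)
prog="$workdir/myprog"
touch "$prog"

# Helper (hypothetical name): print only the 10-character mode field.
show_mode() { ls -l "$1" | cut -c1-10; }

chmod 666 "$prog";      start=$(show_mode "$prog")           # starting point
chmod u+x "$prog";      after_grant=$(show_mode "$prog")     # grant with +
chmod o-w "$prog";      after_revoke=$(show_mode "$prog")    # revoke with -
chmod a=r "$prog";      after_assign=$(show_mode "$prog")    # assign exactly with =
chmod u+x,o-r "$prog";  after_combined=$(show_mode "$prog")  # two changes at once

printf '%s\n' "$start" "$after_grant" "$after_revoke" "$after_assign" "$after_combined"
rm -rf "$workdir"
```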
Each group of rwx can be represented by an octal number. In the binary number
system each position has a weight associated with it. These weights double at
each step as we move from right to left, the right-most character carrying weight
1. If we map this scheme to rwx, the "x" carries weight 1, "w" carries weight 2 and
"r" carries weight 4. We can change symbolic modes with these weights. A file
that has all the permissions set can be considered as carrying weight 7 (4+2+1).
A file with only read and write permissions will carry weight 6 (4+2). As three
binary characters make one octal digit, we add weights of three modes for each
user group (owner, group, others) to make an octal number containing three
digits. As an example, if we want to grant all three (read, write, execute)
permissions to all users, we can use 777 with chmod as follows.
$ ll myprog
-r--r--r-- 1 boota users 103 Sep 8 18:06 myprog
$ chmod 777 myprog
$ ll myprog
-rwxrwxrwx 1 boota users 103 Sep 8 18:06 myprog
$
If you want to grant all permissions to the owner (4+2+1=7), read and execute
permissions to the group (4+1=5), and no permission for other users (0), the
command is:
$ ll myprog
-r--r--r-- 1 boota users 103 Sep 8 18:06 myprog
$ chmod 750 myprog
$ ll myprog
-rwxr-x--- 1 boota users 103 Sep 8 18:06 myprog
$
If you feel comfortable with octal numbers, this is a quicker way to grant or
revoke multiple file permissions.
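The weight arithmetic can be written out as a small shell function. This is an illustrative sketch, not a standard utility: the function name rwx_to_octal and its variables are hypothetical, and it expects exactly nine permission characters (the mode field without the leading file-type character).

```sh
# Convert a nine-character rwx string (for example rwxr-x---) to octal.
rwx_to_octal() {
    perms=$1
    result=""
    # Each triplet starts at character 1, 4, or 7 of the string.
    for start in 1 4 7; do
        triplet=$(printf '%s\n' "$perms" | cut -c"$start"-"$((start + 2))")
        digit=0
        case $triplet in r??) digit=$((digit + 4)) ;; esac   # r carries weight 4
        case $triplet in ?w?) digit=$((digit + 2)) ;; esac   # w carries weight 2
        case $triplet in ??x) digit=$((digit + 1)) ;; esac   # x carries weight 1
        result="$result$digit"
    done
    printf '%s\n' "$result"
}

rwx_to_octal rwxrwxrwx
rwx_to_octal rwxr-x---
rwx_to_octal rw-r--r--
```

The three calls correspond to the modes 777, 750, and 644.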
What will be the permissions of a newly created file or directory? By default, all
new files carry rw-rw-rw- and all new directories have rwxrwxrwx permission. We
can control default file permissions with the umask command. The umask
command sets the mask for new files. A mask is a three-digit octal number similar
to the one we used to set file permission. It shows which permissions will be
revoked when new files or directories are created. For example, if you set umask
to value 022, the write permission for group and other users will be withheld for a
new file. New default permissions for files will be rw-r--r--, and for directories
these will be rwxr-xr-x. See the following command sequence for the role of
umask on new files.
$ touch file1
$ ll file1
-rw-rw-rw- 1 boota users 0 Sep 8 18:06 file1
$ umask 022
$ touch file2
$ ll file2
-rw-r--r-- 1 boota users 0 Sep 8 18:06 file2
$
This happened because the octal digit 2 represents "w" in the symbolic
mode. When this appears in the group and other places, it revokes "w" or write
permission from both of these. Also note that we have used the touch command,
which creates a zero-size file if the file does not already exist. The touch
command changes file update time to the current time if the file already exists.
The current mask value is displayed with the umask command when used without
any argument.
$ umask
022
$
A new mask value can be set at any time. A better place for setting the mask
value is the user startup file $HOME/.profile so that the value is set as soon as a
user logs in.
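The effect of the mask can also be worked out by arithmetic: the mask bits are cleared from the default mode. The sketch below checks this for a mask of 022, both by creating a file and a directory in a scratch location and by computing the result directly. The names file2 and dir2 and the variable names are illustrative, and the umask command is run inside a subshell so that your own mask is left unchanged.

```sh
workdir=$(mktemp -d)

# Each command substitution is a subshell, so the caller's umask is untouched.
file_mode=$(umask 022; touch "$workdir/file2"; ls -l "$workdir/file2" | cut -c1-10)
dir_mode=$(umask 022; mkdir "$workdir/dir2"; ls -ld "$workdir/dir2" | cut -c1-10)

# The same result by arithmetic: clear the mask bits from the default mode.
file_octal=$(printf '%o' $((0666 & ~0022)))
dir_octal=$(printf '%o' $((0777 & ~0022)))

echo "file: $file_mode ($file_octal)  directory: $dir_mode ($dir_octal)"
rm -rf "$workdir"
```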
When a user creates a new file, it belongs to the user who created it and to the
current group of the user. Any user other than the owner of the file can't change
ownership of a file, except the superuser. A user can change group membership
of a file only if he or she is member of both the old and new groups. In normal
circumstances, the system administrator needs to change file and group
ownership when copying files to new user directories or when making files
available to new users or groups.
In HP-UX, you use chown (CHange OWNer) for changing the owner of a file and
chgrp (CHange GRouP) for changing group membership of a file. Let us see
what happens when we change the owner of a file from user boota to jim.
$ ll file1
-rwxrwxrwx 1 boota users 0 Sep 8 18:06 file1
$ chown jim file1
$ ll file1
-rwxrwxrwx 1 jim users 0 Sep 8 18:06 file1
$
Note that the user name is changed in the third field. We don't need to mention
the old owner name when changing ownership of a file with the chown command.
A similar method is used for changing the group ownership of file1 from users
to root with the chgrp command.
$ ll file1
-rwxrwxrwx 1 jim users 0 Sep 8 18:06 file1
$ chgrp root file1
$ ll file1
-rwxrwxrwx 1 jim root 0 Sep 8 18:06 file1
$
Multiple file names and wildcard characters can also be used with chown and
chgrp. Names of a new owner and group must be present in the /etc/passwd and
/etc/group files, respectively. The chown command can also be used to change
the owner and group in one step. See the following example, where the owner
and group are used together with a colon symbol.
$ ll file1
-rwxrwxrwx 1 boota users 0 Sep 8 18:06 file1
$ chown jim:root file1
$ ll file1
-rwxrwxrwx 1 jim root 0 Sep 8 18:06 file1
$
To change the ownership or group of a directory tree, you can use the -R option
with both of these commands and with the directory name as argument.
Whenever you change ownership of a link file, ownership of the file to which the
link points is also changed. To avoid this, use the -h option with the chown or
chgrp commands.
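A recursive change can be tried without superuser privileges by assigning a scratch tree to the IDs you already hold, which an ordinary user is allowed to do. This is only a sketch: the directory names are hypothetical, and the numeric IDs printed by the id command stand in for the user and group names an administrator would normally type.

```shell
# Build a small scratch tree (hypothetical names).
tree=${TMPDIR:-/tmp}/chown_demo.$$
mkdir -p "$tree/sub/deeper" || exit 1
touch "$tree/sub/deeper/file1"

# owner:group in one step, applied to the directory and everything below it.
chown -R "$(id -u):$(id -g)" "$tree"
chown_status=$?

# ls -n prints numeric IDs; field 3 is the owner and field 4 is the group.
file_owner=$(ls -n "$tree/sub/deeper/file1" | awk '{print $3}')
echo "status=$chown_status owner=$file_owner"

rm -r "$tree"
```

The same form with -R works for chgrp, in which case only the group name or ID is given.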
Study Break
and create another file with the name myfile. Did you find
any difference between the file permissions of testfile
and myfile? Now try to change the contents of myfile
using the vi editor. When you try to save the file, vi tells
you that the file is read-only. This is because you had
changed the mask value such that the owner didn't have
write permission. Quit the vi editor without saving the file
and use the chmod command to grant write access to the
owner of the file.
There are three types of special file attributes: set user ID (SETUID), set group ID
(SETGID), and sticky bit. In the general case, if a user executes a file owned by
someone else, the process created in memory is owned by the user who
executes the file. In the case of SETUID, the process created is owned by the
owner of the file. A similar rule applies to the SETGID bit: the process runs with
the group of the file rather than the group of the user who executes it. The
sticky bit, when set on a directory, protects the files in that directory from
being deleted by users who do not own them.
We use the chmod command to set these special permissions to a file. If you are
using a symbolic method, use u+s for setting SETUID and g+s for setting SETGID. In
case you use octal numbers, add a fourth octal digit on the left-hand side of the
file permissions. Digit 4 represents SETUID and 2 represents SETGID. Examples of
symbolic and octal number use are given below.
$ ll file1
-rwxrwxrwx 1 boota users 0 Sep 8 18:06 file1
$ chmod u+s file1
$ ll file1
-rwsrwxrwx 1 boota users 0 Sep 8 18:06 file1
$ chmod 2777 file1
$ ll file1
-rwxrwsrwx 1 boota users 0 Sep 8 18:06 file1
$
As you can see, "x" is replaced by "s" in the file permission representation when
either the SETUID or the SETGID bit is set.
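The symbolic and octal forms can be compared side by side on a scratch file. The sketch below assumes a POSIX shell and uses ls -l in place of ll; the file name is hypothetical. The chgrp line is a precaution added here, since some systems refuse g+s when the file's group is not one you belong to.

```shell
f=${TMPDIR:-/tmp}/setid_demo.$$
touch "$f" || exit 1
# Make sure the file's group is one we belong to.
chgrp "$(id -g)" "$f"

chmod 755 "$f"
chmod u+s "$f"                  # symbolic form: add SETUID to mode 755
perm_suid=$(ls -l "$f" | cut -c1-10)

chmod 2755 "$f"                 # octal form: SETGID only, SETUID is cleared
perm_sgid=$(ls -l "$f" | cut -c1-10)

chmod 6755 "$f"                 # 4 + 2 in the fourth digit: both bits at once
perm_both=$(ls -l "$f" | cut -c1-10)

echo "$perm_suid $perm_sgid $perm_both"
rm -f "$f"
```

Note the difference between the two methods: u+s adds one bit to whatever is already set, while an octal mode replaces the whole permission set of a regular file.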
The SUID bit plays an important role when you want to execute a program with
higher privileges. For example, when you change your password, you modify the
/etc/passwd file. Only root has permission to modify this file, so how can every
system user modify it? This becomes possible because the command you use
for a password change (/bin/passwd) is owned by root and has the SETUID bit
set. So whenever any user executes this command, the command runs as root
and has the privilege to modify the /etc/passwd file.
This also creates a serious security risk. If a program has the SUID bit set,
anybody executing that program gets the privileges of the owner of the program
for the duration of its execution. Now, if by chance you also allow write
permission to that program file, someone can change the contents of the program
and execute it with the owner's privileges. Imagine that someone has write
permission to a file that is owned by root and has the SETUID bit set: that user
can replace its contents with some other command and damage the whole file
system!
Sticky Bit
The sticky bit is represented by "t" and can be set using the chmod command with
the u+t symbolic method or 1 (one) in the fourth digit position of octal numbers.
One example is:
$ ll file1
-rwxrwxrwx 1 boota users 0 Sep 8 18:06 file1
$ chmod u+t file1
$ ll file1
-rwxrwxrwt 1 boota users 0 Sep 8 18:06 file1
$
The use of the sticky bit for directories has a significant advantage. If the sticky
bit for a directory is set, users can use that directory as a public area for file
sharing. Any file present in a directory with the sticky bit set can only be deleted
by the owner of the file. It may be useful to set the sticky bit for the /tmp directory
where users can safely put and delete their temporary or sharable files.
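A shared directory of this kind can be set up as sketched below. The directory name is hypothetical, and the octal form is used because it sets the sticky bit and the ordinary permissions in one step.

```shell
shared=${TMPDIR:-/tmp}/sticky_demo.$$
mkdir "$shared" || exit 1

# 1 in the fourth (leftmost) octal digit is the sticky bit; 777 opens the
# directory to everyone, the usual arrangement for a public area.
chmod 1777 "$shared"

# ls -ld lists the directory itself rather than its contents.
perm_sticky=$(ls -ld "$shared" | cut -c1-10)
echo "$perm_sticky"

rmdir "$shared"
```

The "t" in the last position of the listing is what distinguishes such a directory from one that is simply writable by everyone.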
You have already used the find command to search for files in the file system
hierarchy. You can also use the find command to list files with particular file
permissions. For example, if the system administrator wants to list all files in
the /usr/bin directory for which the SUID bit is set, the following command is
useful. Here I have shown only partial output, as the actual output is quite
long.
# find /usr/bin -perm -u+s -exec ll {} \;
-r-sr-xr-x 5 root bin 49152 Apr 9 1998 /usr/bin/chfn
-r-sr-xr-x 1 root bin 49152 Nov 6 1997 /usr/bin/chkey
-r-sr-xr-x 5 root bin 49152 Apr 9 1998 /usr/bin/chsh
-r-sr-xr-x 1 root bin 53248 Apr 9 1998 /usr/bin/ct
-r-sr-xr-x 1 root bin 40960 Apr 9 1998 /usr/bin/cu
-r-sr-xr-x 1 root bin 73728 Feb 2 1998 /usr/bin/df
-r-sr-xr-x 1 root bin 49152 Nov 7 1997 /usr/bin/login
-r-sr-xr-x 1 root bin 45056 Jun 15 1998 /usr/bin/lp
-r-sr-xr-x 1 root bin 40960 Oct 27 1997 /usr/bin/lpalt
-r-sr-sr-x 2 root mail 45056 Nov 7 1997 /usr/bin/mail
#
This output shows that whenever someone uses one of the above commands, he
or she runs it with superuser privileges for the duration of the command. You
can use a similar command to check the files in your home directory for security
holes caused by mistakes in granting permissions. The following command lists
all files for which SETUID is set and to which both group and others also have
write permission.
$ find / -perm -u+s,g+w,o+w
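The matching rule behind -perm with a leading minus sign is easier to see on a small scratch directory. The sketch below uses the octal form of the mode, which is equivalent to the symbolic one; the directory and file names are hypothetical.

```shell
audit=${TMPDIR:-/tmp}/perm_audit.$$
mkdir "$audit" || exit 1
touch "$audit/safe" "$audit/risky"
chmod 4755 "$audit/safe"     # SETUID, writable by the owner only
chmod 4777 "$audit/risky"    # SETUID and writable by group and others

# -perm -4022 matches only files with ALL of these bits set:
# SETUID (4000), group write (020), and others write (002).
risky_found=$(find "$audit" -type f -perm -4022)

# -perm -4000 matches every SETUID file, whatever else is set.
suid_count=$(find "$audit" -type f -perm -4000 | wc -l)

echo "$risky_found"
rm -r "$audit"
```

Because every listed bit must be present, a file that is writable by the group but not by others would not be reported by the first search; a separate search such as -perm -4020 would be needed for that case.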
7.8 Access Control Lists
Access control lists are used to grant or deny permissions to users or groups in
addition to those specified with traditional UNIX file access permissions. This
mechanism is also called discretionary access control (DAC) and is supported
under the older HFS file system but not under the modern JFS file system. To
grant or deny specific file accesses to a user or group, users are specified as
shown in Table 7-3.
Table 7-3. Access Control List User Specification
Pattern       Description
user.group    ACL for a specific user and a specific group
user.%        ACL for a specific user and all groups
%.group       ACL for all users and a specific group
%.%           ACL for all users and all groups
Listing ACL
Changing ACL
The chacl command is used for changing ACLs. You can grant another user
mary read, write, and execute (rwx) permissions to myfile.
$ chacl "mary.%=rwx" myfile
$ lsacl myfile
(boota.%, rwx) (jim.%,rwx) (mary.%,rwx) (%.users,r-x)
(%.%,r--) myfile
$
7.9 Miscellaneous Commands
Here are a few other commands that are not used directly to modify file
permissions but are related to this process.
If a user is a member of more than one group, all new files created by that user
belong to the current group. If the user wants the new files to belong to another
group, the newgrp command changes the current group membership temporarily.
In the following example, user boota belongs to group users (default group) and
class. See how the new files created by the touch command are affected by
The su Command
The switch user ID (su) command is used to change the user ID temporarily just
as you used the newgrp command to change the group ID. You need to know the
password for the user you are changing to. The following command changes the
user ID to jim.
$ su jim
Password:
$
To switch back to the actual user ID, use the exit command. After changing the
user ID, all new files created belong to the new user. The environment of user
jim is not loaded unless you use the su - jim command instead of su jim.
If you don't specify any user name with the su command, it will change to user
root. You can become the superuser temporarily by using this command if you
The id Command
The id command is used to display the current user and group IDs. A typical
result of this command is:
$ id
uid=415(boota) gid=100(users)
$
The command shows that the current user is boota having a user ID of 415. The
current group membership is users and the group ID is 100.
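In scripts it is often more convenient to ask id for a single numeric value. The sketch below uses the standard -u and -g options; the variable names are chosen here for illustration. It relies on the rule stated earlier that user ID 0 carries superuser privileges.

```shell
# id -u and id -g print only the numeric effective user and group IDs.
my_uid=$(id -u)
my_gid=$(id -g)

# User ID 0 means superuser privileges, whatever the login name is.
if [ "$my_uid" -eq 0 ]; then
    role=superuser
else
    role="ordinary user"
fi
echo "uid=$my_uid gid=$my_gid role=$role"
```

A check of this kind is commonly placed at the top of administration scripts that must only be run by root.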
Chapter Summary
In this chapter, you learned the concept of file permissions and access control
lists. These are very important from a file- and system-security point of view. The
following items were presented in this chapter.
• There are three types of users in UNIX: the owner, members of the owner's
group, and all other users.
• The three types of permissions can be set for every file and directory. These
are read, write, and execute permissions.
• A file's permissions can be changed with the chmod command.
• The owner and group of a file can be changed with the chown and chgrp
commands.
• Use of the SETUID and SETGID bits and security problems associated with
these.
• Use of the sticky bit and its effect on files and directories.
• Finding files with specific file permissions using the find command.
• Use of Access Control Lists (ACLs) with the lsacl and chacl commands,
which are used to list and change ACLs, respectively.
In addition, you have also used some other commands that don't affect file
permissions directly. The newgrp command is used to change group membership
temporarily while the su command is used to change user ID temporarily. The id
command is used to check the current user and group membership.
In the next chapter, we will see the file system structure of HP-UX and how
different directories are arranged in the file system. We will also see which
directory contains what type of files.
1:
A file has rwxr-xr-- permissions. It is owned by a user mark belonging to a
group users. You are logged in as user jim belonging to group users. What
permissions do you have for this file?
2:
You are logged in as user jim and create a file myfile and want to give it to
a user mark by changing its ownership. The command for this is:
3:
The id command without any argument lists:
A. user ID
B. group ID
C. both user and group IDs
D. system ID
4:
You want to change your group ID temporarily. Which command will you
use?
5:
The system administrator wants a command to be executed with superuser
permissions no matter which user executes it. He or she will set:
6:
A file myfile already exists. You use command touch myfile. What will be
the effect of this command?
A. It will erase the file and create a new file with the same name and zero
byte size.
B. It will rename the file as myfile.bak and create a new file with the name
myfile and zero byte size.
C. It will change the file modification date and time to the current value.
D. It will do nothing, as the file already exists.
7:
You are logged in as user boota belonging to group users. When you list
files using the ll command, you see the following list.
-rwxrw-r-- 1 jim class 0 Sep 8 18:06 myfile
What operations can you perform on this file?
8:
You use the command chmod 764 myfile. It provides:
UNIX files and directories are arranged in a systematic way, like a tree structure.
The root directory is represented by the slash symbol (/). It serves as the root of
this directory tree. By file system layout, we mean the arrangement of files and
directories in a directory tree. It does not mean the actual physical format of the
disk drive but shows only the logical arrangement. Directories in UNIX are named
and arranged in a fashion that has particular meaning for users. Every directory
contains a particular set of files related to some defined purpose or task. For
example, all files in the /etc directory are related to some sort of system
configuration task. Similarly, the home directories of all users are usually kept in
the /home directory.
There are two major types of file system hierarchies used in UNIX distributions.
One of these is based on the Berkeley Software Distribution (BSD) and the other
on AT&T System V. The file system layout of HP-UX version 10.x and above is
based on AT&T System V Release 4 (SVR4). The directory structure starts with
the root directory. All other directories are named relative to the root directory.
There are two major groups of these directories. One is the group in which
system files are present and don't change frequently. This is the static part of the
directory hierarchy. The other group contains those files and directories that are
changed on a routine basis and are sometimes called dynamic files and
directories. An example of this type of file is log files that change on a day-to-day
basis. Another division of directories may also be based on shared and
nonshared files.
This chapter contains an introduction to the directory tree and the types of files in
each directory. We will describe what directories are related to system files and
what are used by HP-UX users. Through a careful study of the chapter, a user
will build understanding of how to quickly find different files and directories used
for particular purposes. As examples, you will learn where files containing
manual pages are located and what directory contains files related to network
administration.
Static files and directories contain information that usually does not change after
system installation. The files that fall into this category are executable files,
libraries, applications, and so on. This part contains /sbin, /usr, and the
application directories under /opt. As these files are also shared among many
systems in a network where diskless or dataless clients are used, they are also
called shared files. The dynamic files are those that are found in other directories
like the /var directory, which holds most of the log files. In fact, the name var
also comes from "variable." Similarly, you place temporary files in the /tmp
directory. Most of the applications also put their temporary files in the /var/tmp
directory and it continuously changes. Contents of the home directory also
change whenever a user is created or deleted or whenever a user makes
changes to any of his or her files. Figure 8-1 shows the division of a file system
into static and variable directories.
To keep the logical division of files and directories, HP-UX keeps all application
directories separate from system directories so that these two types of files never
get mixed with each other. Similarly, executable and nonexecutable files and
directories are also kept separate from each other. In the following part of this
chapter, you will get a brief introduction to the contents of each directory under
the root directory. The directory tree is specific to HP-UX version 10.x and
above.
All of the file system is viewed with reference to the root directory. The name
root comes from the logical position of this directory. This is the top-level
directory in the tree structure and hence the name root is used for it. It is
represented by a slash character (/). It must be kept in mind that the root
directory or any other directory is not related to physical disks in any respect. The
directory structure is just a logical arrangement regardless of the number of disks
in your system or how these disks are partitioned.
The device directory contains all of the device files. Device files are used to
represent devices attached to the system. Each device file has a major and a
minor number. The major number of a device file represents a device driver built
into the HP-UX kernel, while the minor number represents the device's hardware
address and (optionally) certain device-specific options. An instance means a
logical numbering of the same type of devices. For example, there may be more
than one SCSI disk in a system. All of these SCSI disks use the same device
drivers but represent different instances of disks and have different minor
numbers.
We will be discussing more about device files in Chapter 17. At the moment,
however, it is useful to mention that the device directory contains many
subdirectories depending on the types of devices. This is used to group device
files in a more logical way. Files related to one type of device are kept in one
subdirectory under /dev. Figure 8-2 shows some of these subdirectories.
have their own directories in /dev that contain device files for logical volumes.
We will see more about logical volumes in Chapter 18. Files starting with tty in
the /dev directory represent terminal devices.
Almost all of the files in the /dev directory are special files, and if you take a
listing with the ll command, you will find a special character in the first column of
the output showing whether a file represents a character-type device, block
device, or something else. A listing of /dev/rdsk is presented as an example
here.
$ ll /dev/rdsk
total 0
crw-r----- 1 bin sys 188 0x003000 Nov 25 1998 c0t3d0
crw-r----- 1 bin sys 188 0x004000 Nov 25 1998 c0t4d0
crw-r----- 1 bin sys 188 0x005000 Nov 25 1998 c0t5d0
crw-r----- 1 bin sys 188 0x006000 Oct 5 16:57 c0t6d0
crw-r----- 1 bin sys 188 0x0a0000 Aug 31 13:40 c10t0d0
crw-r----- 1 bin sys 188 0x0a0100 Aug 31 13:40 c10t0d1
crw-r----- 1 bin sys 188 0x0a0200 Aug 31 13:40 c10t0d2
crw-r----- 1 bin sys 188 0x0a0300 Aug 31 13:40 c10t0d3
crw-r----- 1 bin sys 188 0x0a0400 Aug 31 13:40 c10t0d4
crw-r----- 1 bin sys 188 0x0a0500 Aug 31 13:40 c10t0d5
crw-r----- 1 bin sys 188 0x0a0600 Aug 31 13:40 c10t0d6
crw-r----- 1 bin sys 188 0x0a0700 Aug 31 13:40 c10t0d7
crw-r----- 1 bin sys 188 0x0a1000 Aug 31 13:40 c10t1d0
crw-r----- 1 bin sys 188 0x0a1100 Sep 19 04:58 c10t1d1
$
As you can see, all of these are character-type devices.
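The same distinction can be made in a script with the test operators of the shell, without reading the first column of a listing. The function name below is hypothetical, and /dev/null is used because it exists as a character device on practically every UNIX system.

```shell
# Classify a path by the kind of file it is.
file_kind() {
    if [ -c "$1" ]; then
        echo "character device"
    elif [ -b "$1" ]; then
        echo "block device"
    elif [ -d "$1" ]; then
        echo "directory"
    else
        echo "other"
    fi
}

null_kind=$(file_kind /dev/null)
dev_kind=$(file_kind /dev)
echo "/dev/null: $null_kind"
echo "/dev: $dev_kind"
```

The -c and -b operators correspond to the "c" and "b" characters shown in the first column of a long listing.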
8.4 The /etc Directory
The /etc directory is where a system administrator spends much of his or her
time. All of the system configuration files are placed in this directory. The files
here control system startup and shutdown procedures as well as the startup and
shutdown of server and daemon processes. It also contains configuration files for
applications installed on the HP-UX system. The /etc directory also contains
subdirectories, as shown in Figure 8-3.
Files in the /etc/rc.config.d directory are configuration and control files for
system startup and shutdown and other server and daemon processes.
The /etc/opt directory contains configuration files for applications installed
on the system.
Traditionally, this directory contains the home directories of all system users. A
home directory for user linda will be /home/linda. All users are owners of their
respective directories and have full rights to the files under these directories. The
size of this directory usually depends on the number of users and varies widely
as users add and delete files in their home directories.
User-related configuration files are located under the home directory of each user.
You have already used the $HOME/.profile and $HOME/.exrc files. There may be
many other user configuration files. An example would be the file that controls
how the X-Window system should start up for a particular user and what should
be displayed on the user's desktop.
In a network where user information is shared, the home directories are kept on a
central file server and are exported from that server to all other machines. A user
can log into any one of the available systems, and he/she will find the same
home directory everywhere. At this time, it must also be kept in mind that it is not
mandatory to keep all home directories under the /home directory. The system
administrator is free to make any choice for home directories. The name /home is
just a convention. At the same time, some other directory names are mandatory.
For example, a system administrator cannot place system configuration files in a
directory other than /etc.
This directory contains executable files needed at boot time. Under HP-UX, this
directory also contains system scripts required at startup and shutdown time.
This directory also contains subdirectories. The most important script in the /sbin
directory is the rc script that controls all of the system startup and shutdown
processes. Another important program is init, which is used to control different
system states like single-user mode, multiuser mode, etc. There is the
/sbin/init.d directory that contains all scripts executed at system startup and
shutdown or at the time when the system changes its state (e.g., from single-user
to multiuser). A partial /sbin directory subtree is shown in Figure 8-4.
This is the directory where the HP-UX kernel is kept. The kernel configuration file
is also present in this directory. The kernel file is /stand/vmunix and the
configuration file is /stand/system. In addition, you can keep multiple kernel files
in this directory so that if a new kernel fails to boot, you can use an old kernel
file to recover from the situation. It also contains the /stand/build directory,
which is used to build a new kernel.
The /opt directory is used for installing applications on an HP-UX system. Each
application has a subdirectory in this directory. Starting from that subdirectory,
each application may have its own directory tree containing binary files, manual
pages, libraries, and so forth.
Putting applications in one directory solves many problems related to system
administration. Not all vendors that supply software for HP-UX specify a
particular directory for their application. Using /opt makes it easy to keep multiple
versions of the same application on the system, as each version has its own
directory structure, which is independent from the rest of the system or any other
application. Also, it makes it easy to keep track of these applications when they
are installed under one top directory, exactly like it is easy to maintain home
directories of users when all of these exist under one main directory /home. You
should keep in mind, however, that although application files are kept under this
directory, some applications maintain configuration files in /etc/opt and log
files in /var/opt directories. This scheme provides another advantage when
application directories are shared among many systems on a network, since
each sharing system can have its own separate configuration and log files
depending on local system configuration.
A sample application directory tree is shown in Figure 8-5 showing the HP Ignite-
UX application used for system recovery.
8.10 The Temporary File Directory (/tmp)
This directory has permissions for everybody to create or delete files. Most of the
time, temporary files are placed in this directory and are deleted from time to
time. Many system administrators prefer to empty this directory at boot time to
avoid the piling up of unnecessary files. Anybody creating files in this directory
must be aware of the public nature of this directory, and no important file should
be placed here. It is a good idea to enable the sticky bit on this directory so that
a user cannot delete files created in it by other users.
Any applications that need to create temporary files should do so in the /var/tmp
directory instead of /tmp.
/usr/bin
/usr/contrib
/usr/include
/usr/lib
Libraries for programming and machine-dependent database files. A user may
need to look into this directory to check the existence of certain libraries in case
there are compilation or run-time errors.
/usr/sbin
/usr/share/man
This directory contains basically three types of files: log files, spool files, and
temporary files created by applications. The name of the directory comes from
the fact that all of these file types are variable in nature. It means these files grow
and shrink on a continual basis. A sample directory subtree is shown in Figure 8-7.
Log Files
Log files are arranged in many directories depending on the type of log. Some of
these are explained here.
/var/adm    This directory is used for log files related to system administration
            programs. The software installation or removal files are in the
            /var/adm/sw directory. Log files for cron are placed in the
            /var/adm/cron directory. Log files generated by syslog are in the
            /var/adm/syslog directory. Crash dumps are kept in the
            /var/adm/crash directory.
/var/opt    Application run-time files and log files are placed in this directory.
Spool files related to print services are kept in the /var/spool/lp directory. Spool
files for electronic mail go into /var/mail. Other spool files are kept in a directory
under /var/spool.
These files are usually created by the fsck command, which is used to check file
system integrity from time to time and at boot time. The files that have no valid
links are copied to this directory. Each file system contains one lost+found
directory.
Study Break
If you understand file system hierarchy, it will not be difficult for you to find a file
in UNIX. However, there are some useful commands to help you in your file
search.
Another command is whereis, which can search and locate source, binary, and
manual pages. The following command shows two files that can be used as the
cat command and the path to the manual page for the command.
$ whereis cat
cat: /sbin/cat /usr/bin/cat /usr/share/man/man1.Z/cat.1
$
You can also use the man -k command to search all man pages related to a
keyword. The command below searches man pages and shows all commands
related to mkdir.
$ man -k mkdir
dosmkdir(1) - make a DOS directory
mkdir(1) - make a directory
mkdir(2) - make a directory file
mkdirp(), rmdirp()(3G) - create, remove directories in a path
nis_server, nis_mkdir, nis_rmdir, nis_servstate, nis_stats, nis_getservlist, nis_freeservlist, nis_freetags(3N) - miscellaneous NIS+ functions
nismkdir(1) - create NIS+ directories
stmkdirs(1) - Utility to build Scalable Type fonts.dir and charsets.dir files
$
Before you can use this command, you must create the /usr/share/lib/whatis
file using the catman command. This file contains a list of all manual pages, with
a short description of each command.
In the above listing generated by the man -k mkdir command, you can see that
there is a number specified in parentheses after the name of a command. This
number shows the section of the manual pages in which that entry is found. For
example, you can see that both sections 1 and 2 contain a manual entry for
mkdir. When you use the man mkdir command, the first manual page found by
the man command is displayed. If you want to display a particular manual page,
you need to specify the section as the argument to the man command. As an
example, to display the manual page for mkdir in Section 2, you need to use the
man 2 mkdir command instead of man mkdir.
A. It is an executable program.
B. It is a configuration file.
C. It is a log file.
D. It is a spool file.
A. /mail
B. /var/spool/mail
C. /home/mail
D. /var/mail
A. whatis
B. which
C. whois
D. find
A. /sbin
B. /usr/sbin
C. /startup
D. /etc
A. /kernel
B. /var/kernel
C. /usr
D. /stand
Chapter 9. Working with the POSIX Shell and Job
Control
Chapter Syllabus
9.5 Substitution
The shell is an interactive program that gets user input, interprets it, and then
takes necessary action accordingly. It acts like a command interpreter for the
user. As soon as a command is received from a user, the shell looks for a
matching intrinsic command and executes it. If it does not find an intrinsic
command, it searches for a disk file corresponding to the command and executes
it. During this process it substitutes any variables and other commands listed in
the command line. It also handles I/O redirection and can execute batch files
containing other UNIX commands and control structures.
You are already familiar with many of the shell features, such as environment
variables, intrinsic and extrinsic commands, shell history, and shell configuration
files. In this chapter, more features of the POSIX shell are presented. These
include file name completion, setting terminal options, and job control. There is
more information in this chapter on some of the features you are already familiar
with. These include shell history, command line editing, and substitution. We will
start with a comparison of the capabilities of different UNIX shells and see what
the subshells are. Then there is a brief introduction to the ulimit command that
is used for limiting system resources. File name completion and shell history will
be discussed next. Then you will find some more information on command line
editing and substitution. In the substitution process, you will see how command
substitution, variable substitution, and tilde substitution work. While working on
different terminals, you may need to set some of the terminals' parameters, and
you will learn the stty command that is used for this purpose. In the last part of
the chapter, job control features of the POSIX shell are discussed.
9.1 POSIX Shell Capabilities
The POSIX shell is the default shell for HP-UX users. Other shells are also
available on HP-UX. Important shells and their corresponding executable files
are listed in Table 9-1.
Table 9-1. Important UNIX Shells
C shell /usr/bin/csh
There are two other shells used in HP-UX. One of these is called the restricted
shell (/usr/bin/rsh), which is used to provide restricted access to some users.
The key shell (/usr/bin/keysh) is a context-sensitive shell that may be handy to
use with HP terminals.
The capabilities of POSIX and other important shells available on HP-UX
systems are presented in Table 9-2.
Table 9-2. Features of Different UNIX Shells
A subshell can be invoked inside a shell at any time by using the shell path as
the command line. A subshell inherits all environment variables from the parent
shell but doesn't inherit any shell variables. Sometimes a user may need to
invoke a subshell to have a particular shell environment. The current shell also
invokes subshells if required by a command. Shell scripts are usually executed in
subshells. You can exit from a subshell using the exit command at any time.
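The difference between the two kinds of variables can be observed by starting a new shell by its path and asking it what it sees. This is a minimal sketch; the variable names LOCAL_ONLY and SHARED are made up for illustration, and /bin/sh stands in for whichever shell path you would normally use.

```shell
LOCAL_ONLY="shell variable"        # set but not exported
SHARED="environment variable"      # exported on the next line
export SHARED

# The single quotes keep the parent shell from expanding the variables,
# so the expansion is done by the newly started shell.
child_view=$(/bin/sh -c 'echo "LOCAL_ONLY=${LOCAL_ONLY:-unset} SHARED=${SHARED:-unset}"')
echo "$child_view"
```

Only the exported variable reaches the new shell, which is why settings that scripts depend on are exported in the startup files.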
The ulimit command is an intrinsic command of the POSIX shell and is used to
limit user resources. If you specify the -a switch with this command, it lists
current resource limits. A typical output of the command is as follows.
$ ulimit -a
time(seconds) unlimited
file(blocks) unlimited
data(kbytes) 135168
stack(kbytes) 8192
memory(kbytes) unlimited
coredump(blocks) 4194303
nofiles(descriptors) 256
$
This command is very useful if you want to specify a limit for resources. For
example, if you don't want the shell to create a core file larger than 1024 blocks,
you can use the following command.
$ ulimit -c 1024
$ ulimit -a
time(seconds) unlimited
file(blocks) unlimited
data(kbytes) 135168
stack(kbytes) 8192
memory(kbytes) unlimited
coredump(blocks) 1024
nofiles(descriptors) 256
$
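A limit applies to the shell in which it is set and to the processes started from it. One cautious way to experiment, sketched below, is to change the limit inside a command substitution, which runs in a subshell, so that the calling shell keeps its own value. The variable names are hypothetical, and the -c option is assumed to behave as in the listing above.

```shell
# The lowered limit exists only inside the subshell created by $( ... ).
limit_in_child=$(ulimit -c 0 && ulimit -c)

# The calling shell still reports its original core file limit.
limit_in_parent=$(ulimit -c)

echo "child=$limit_in_child parent=$limit_in_parent"
```

Trying a value this way is useful because a limit that has been lowered in the login shell may not be raisable again during the same session.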
File name completion is used when you are typing commands with long file
names. Using this feature of the shell, you type in the first few characters of the
name of a file and then press the Esc key twice, and the shell completes the
name of the file. Consider a directory that contains three files, where you want
to see the contents of the file beta.
$ ls
alpha-334 alpha-434 beta
$ cat b Esc Esc
$ cat beta
As soon as you press the Esc key twice, the shell completes the file name as
shown in the last line.
If there is more than one file that starts with the same characters you have typed,
the shell completes the file name to the extent that all files have the same
characters. After that you can press the Esc and = keys to list the available
choices and then complete the file name by typing additional characters. Please
see the next example, where two files start with the letter a. When you press the
Esc key twice, the shell completes the file name up to alpha-. After that you
can press the Esc and = keys to see the choices, and then complete the
command using the usual command line editing method or by typing additional
characters and again pressing the Esc key twice.
$ ls
alpha-334 alpha-434 beta
$ cat a Esc Esc
$ cat alpha- Esc =
1) alpha-334
2) alpha-434
$
After displaying the matching files, you can type a few more characters on the
command line to distinguish files and again press the Esc key twice to complete
the file name.
The file name completion mechanism is very useful for typing long commands
with fewer keystrokes and fewer errors.
You have already seen that the shell keeps the history of the commands you
type. The history command can be used to list previously used commands. You
can use an option to list a particular number of previous commands. To list the
three previous commands, you use:
$ history -3
457 cat /etc/profile
458 more /home/boota/.profile
459 ls -l /etc|more
$
To list a range of commands, you provide the starting and ending numbers of the
commands in the history.
$ history 451 453
451 date
452 ls
453 mkdir tmp
To reexecute a previous command, you use the r command with the command
number. The r command executes a numbered command from the history and
prints the command as well as its result on stdout. In the next example, the
command executed is the date command.
$ r 451
date
Wed Oct 13 22:33:17 EDT 1999
$
As you saw earlier, previous commands can be reused with the help of the
history feature. But the history feature executes commands without any
modification. If you want to execute a previous command with some modification,
you can use the vi editing features for command editing. If you press the Esc-k
keystroke combination, the last command appears on your command
prompt. If you continue pressing k, you can go back as far as the history
supports. To go to the next command, you can press the j key during this
process. If you want to edit a particular command, first bring that command on
your command prompt using this procedure and then use l to move right and
h to move left. As you can see, all of these are vi commands. You can use
other vi commands to insert or delete any text on the command line.
Use of vi commands with the command line editing feature is controlled by the
EDITOR environment variable. The value of this variable must be set to vi to use
vi commands.
Study Break
1. landiag
2. last
3. lastb
4. lastcomm
9.5 Substitution
There are three types of substitutions used in the POSIX shell. These will be
explained next.
Variable Substitution
In a variable substitution, a variable name is used with the $ sign in the command
line. The value of the variable is then substituted in the command line before it is
executed. For example, the echo command prints whatever it receives on the
command line.
$ echo HOME
HOME
$ echo $HOME
/home/boota
$
In the first case, the echo command printed what you typed at the command line.
In the second case, the shell replaced $HOME with the value of the HOME variable
before running echo, so echo printed that value.
Command Substitution
Tilde Substitution
Where HOME points to your home directory, PWD has the value of the current
directory and OLDPWD has the value of the previous working directory. A good use
of tilde substitution is to go to your home directory from any other directory. See
the following where you move from the /etc directory to your home directory
(/home/boota) using tilde substitution.
$ pwd
/etc
$ cd ~/
$ pwd
/home/boota
$
A variety of terminals having different capabilities are used with HP-UX systems.
Many of these terminals have different keyboards and key mappings. As an
example, different keys may be used for the delete and backspace characters on
different types of terminals. HP-UX provides the facility to change and modify
control key mappings with the Set Terminal Type (stty) command. This
command can be used to map a certain control character to a key sequence. The
stty command with no arguments shows current control commands and their
respective key mappings. It also shows current baud rate, start/stop bits, parity,
flow control, and other information when invoked with the -a option.
$ stty -a
speed 9600 baud; line = 0;
rows = 24; columns = 132
min = 4; time = 0;
intr = ^C; quit = ^\; erase = ^H; kill = ^U
eof = ^D; eol = ^@; eol2 = <undef>; swtch = <undef>
stop = ^S; start = ^Q; susp = <undef>; dsusp = <undef>
werase = <undef>; lnext = <undef>
parenb -parodd cs7 -cstopb hupcl -cread -clocal -loblk -crts
-ignbrk brkint ignpar -parmrk -inpck istrip -inlcr -igncr
icrnl -iuclc
ixon -ixany ixoff -imaxbel -rtsxoff -ctsxon ienqak
isig icanon -iexten -xcase echo echoe echok -echonl -noflsh
-echoctl -echoprt -echoke -flusho -pendin
opost -olcuc onlcr -ocrnl -onocr -onlret -ofill -ofdel -tostop
$
If you want to set the underscore character (_) as the backspace key, you can
use the following command. After that, you can use the underscore key to work
as the backspace key.
$ stty erase _
$ stty -a
speed 9600 baud; line = 0;
rows = 24; columns = 132
min = 4; time = 0;
intr = ^C; quit = ^\; erase = _; kill = ^U
eof = ^D; eol = ^@; eol2 = <undef>; swtch = <undef>
stop = ^S; start = ^Q; susp = <undef>; dsusp = <undef>
werase = <undef>; lnext = <undef>
parenb -parodd cs7 -cstopb hupcl -cread -clocal -loblk -crts
-ignbrk brkint ignpar -parmrk -inpck istrip -inlcr -igncr
icrnl -iuclc
ixon -ixany ixoff -imaxbel -rtsxoff -ctsxon -ienqak
isig icanon -iexten -xcase echo echoe echok -echonl -noflsh
-echoctl -echoprt -echoke -flusho -pendin
opost -olcuc onlcr -ocrnl -onocr -onlret -ofill -ofdel -tostop
$
As you can see in the fourth line of the output of the second stty -a
command, the value of erase has changed. Usually, people establish such settings
in the startup files. If a setting is needed for all system users, the system
administrator sets it in /etc/profile.
A job is a running process in memory. Job control is a shell feature that moves
jobs from the foreground to background, suspends running jobs, and restarts
suspended jobs. You can also put a job into a wait state, where it waits for
another job to finish and then restarts. The shell provides a mechanism to carry
out these tasks with the help of some intrinsic shell commands. The process of
job control is explained next.
Many times you start a job in the foreground and then you want to do something
else without abandoning the program running in the foreground. For example,
you may have started the vi editor and you need to copy files without
abandoning the editor. The POSIX shell provides a mechanism to suspend a
current job temporarily. To suspend a running job, you can use the key sequence
represented by the susp value in the stty -a output, often defined as
Ctrl-z. Pressing that key sequence suspends the current job and gives you
the command prompt. The job is suspended, and if you use the jobs command,
you will see the job is stopped. You will also see a plus (+) symbol showing that
this is the current job.
$ jobs
[1] + Stopped vi myfile
[2] - Stopped vi file2
$
To find the value of the susp sequence, use the stty -a command. If this value is
not set, you can use the stty command to set its value as follows.
$ stty susp ^z
$
The caret (^) symbol represents the Control key. After this command, you can use the
Ctrl-z key sequence to suspend a job.
All suspended jobs can be resumed with the foreground (fg) command. The
same command is used to bring background jobs to the foreground. To bring a
job to the foreground, you need to know its job ID, which the jobs command
shows. If you don't specify any job ID, the current job is brought into the foreground. For
example, when you suspend the vi editor to do some other work, the fg
command will bring the vi screen back, and you can use it again in the normal
way.
$ jobs
[1] + Stopped vi myfile
[2] - Stopped vi file2
$ fg %2
Job numbers are used with the percent (%) symbol with the fg command. You
can also use the command name with the fg command, which is sometimes
more convenient when writing shell scripts.
Study Break
Job Control
At any point, if you want to wait for background jobs to be finished, just use the
wait command. This command holds the command prompt until all background
jobs are finished. If you want to stop the wait command, just press the interrupt
key (the intr value in the stty -a output). You can also wait for a particular job
to finish by specifying the job number as an argument to the wait command.
$ wait %2
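A minimal sketch of the wait command inside a script follows. It assumes a non-interactive shell, where job numbers such as %2 may not be available, so it waits on the process ID stored in $! instead. The sleep command and the variable names BG_PID and WAIT_STATUS are placeholders chosen for this illustration.

```shell
# Start a placeholder job in the background.
sleep 1 &
BG_PID=$!            # $! holds the process ID of the last background job

# Block here until that particular job has finished.
wait "$BG_PID"
WAIT_STATUS=$?       # wait returns the exit status of the job it waited for

echo "Background job finished with status $WAIT_STATUS"
```

Waiting on a specific job, rather than on all of them, lets a script continue as soon as the result it actually needs is ready.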
1:
The default HP-UX shell is:
A. Bourne Shell
B. Korn Shell
C. POSIX Shell
D. C Shell
2:
You are in a directory having three files, file1, file2, and afile. You type a
command ls f and then press the Esc key followed by the = key. What
happens?
3:
You use the date command to see the current system time. Just after that,
you press the Esc key followed by the k key. What happens?
4:
What does the command r 3 do?
A. It reads three lines of user input and then displays them on the screen.
B. It repeats the last three commands.
C. It returns three steps back in the command history.
D. It executes command number 3 in the command history.
5:
For what purpose is the stty command used?
6:
Your home directory is /home/boota. You moved from your home directory to
the /etc directory. How can you go back to your home directory?
A. using the cd HOME command
B. using the cd ~/ command
C. using the cd OLDPWD command
D. all of the above
7:
A job running in the foreground can be suspended by:
A. the bg command
B. the fg command
C. using the susp control character
D. using the suspend command
8:
Background jobs are always in:
A. a suspended state
B. a stopped state
C. a running state
D. none of the above
Chapter 10. Introduction to Shell Programming
Chapter Syllabus
10.7 Branching
Let us go directly to our first program and analyze it. I have used the file name
script-00 for this program. Contents of this file are shown below using the cat
command.
$ cat script-00
#!/usr/bin/sh
# This is to show what a script looks like.
echo "Our first script"
echo "----------------"
echo # This inserts an empty line in output.
echo "We are currently in the following directory"
pwd
echo
echo "This directory contains the following files"
ls
$
Before looking into the program and explaining what each line does, let us see
what happens if we execute it. We execute it from the current directory with the
command line:
$ ./script-00
Our first script
----------------
1. the full path of the subshell that will be used to execute the program
2. some comment lines
3. commands and control structures
The current shell executes all programs unless otherwise specified. In case you
need to execute a program in a specific shell (Bourne, C, or POSIX), you can
specify it in your program. In that case, a subshell is created as soon as a
program starts execution. The first program line shows which HP-UX shell will be
used to execute commands found in the program. This line always starts with the
"#!" character combination and shows the full path of the executable program
that will be used as shell. All HP-UX extrinsic commands have the same syntax
no matter which shell is used to execute them. The difference is in the execution
of intrinsic commands. For example, the method of setting a shell variable in the
C shell is different from the one used in the POSIX shell. So you need to execute
your program in the proper shell. Depending on the information provided, your
current shell will spawn the appropriate subshell to execute the program. As an
example, you can't use the setenv command in a program that is expected to be
run in a POSIX or Bourne shell because this command is specific to the C shell
only.
In the example used here, the subshell that will be used to execute the program
is /usr/bin/sh, which is the POSIX shell. You can use other shells, such as C,
by changing this to /usr/bin/csh. It is a good habit to provide the shell name in
the program so that if somebody else is using it in a different shell, the correct
subshell is created and the program runs without any error.
The second line in our example program contains a comment. A comment is that
part of a program that is not executed. It is used for providing reference
information about the program.
All comments start with a pound sign (#) except for the special combination used
in the first line for specifying the subshell. A comment can be placed anywhere in
the file. If a line starts with the "#" sign, all of the line is treated as a comment. If
the "#" sign is placed somewhere else in a line, anything after that sign is
considered a comment. In the example program script-00, comments are used
in the second and fifth lines. The second line starts with the "#" sign, so all of the
line is a comment, and nothing in this line is executed. The fifth line contains a
command echo and after that a comment string. The command is executed but
the comment is ignored.
This is the most important part of the program, where you put actual commands
that are executed. The commands may be simple ones that the shell executes
one-by-one, in the order they are placed in the program file. In the example
program, we have used the commands pwd, echo, and ls. All of these commands
are executed in order and their result is displayed on your terminal screen as you
have already seen. The echo command without any argument just prints a blank
line.
The control structures are used for branching and looping. The decision of
branching or looping is made depending on the result of a test performed on
some variables or constants. We will discuss branching at the end of this chapter
and looping in the next chapter.
A shell program is created in two basic steps. In the first step, a file is created
that contains commands and control structures. This file is saved on the disk.
Usually this file is not executable. In the second step, you need to modify file
permissions to make it executable. If you are not sharing the program with
others, you can use the chmod u+x command to make it executable. If you are
sharing your program with other users, you need to set the appropriate
permissions for this purpose.
You can also execute a shell program without the execute bit set if you use the
program name as an argument to sh as shown below.
$ sh script-00
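The two steps can be sketched as follows. This is only an illustration: the scratch directory, the file name hello-script, and the variables OUT_SH and OUT_DIRECT are hypothetical, and the generated file uses #!/bin/sh as an assumption for portability (on HP-UX the first line would be #!/usr/bin/sh, as in script-00).

```shell
# Step 1: create the program file in a scratch directory.
DEMO_DIR=${TMPDIR:-/tmp}/script-demo.$$
mkdir -p "$DEMO_DIR"
cat > "$DEMO_DIR/hello-script" <<'EOF'
#!/bin/sh
echo "hello from the script"
EOF

# The file is not executable yet, but it can be passed to sh as an argument.
OUT_SH=$(sh "$DEMO_DIR/hello-script")
echo "$OUT_SH"

# Step 2: add execute permission for the owner, then run the file by its path.
chmod u+x "$DEMO_DIR/hello-script"
OUT_DIRECT=$("$DEMO_DIR/hello-script")
echo "$OUT_DIRECT"

# Remove the scratch directory again.
rm -rf "$DEMO_DIR"
```

Both invocations print the same line; the difference is only in whether the execute bit is needed.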
After setting appropriate execute permissions, you can execute a program. Care
must be taken while naming shell programs such that the names do not match
with any existing HP-UX commands.
If the current directory is not included in the PATH variable, you will not be able
to execute the program by simply typing its name on the command line. For that
purpose you need to specify the full path of the file. You can give the full path on
the command line in either the absolute or relative form. The better way is the
relative form, where you use "./" (dot slash) to refer to the current directory.
Note
Many times new script writers wonder why the script is not being executed, even
though they have placed the correct commands and have the execution bit set.
The reason is the directory in which they are placing the program is not included
in the PATH variable, and they are not specifying the path to the file explicitly.
Sometimes it may happen that your current directory is included in the PATH
variable at the end. When you execute your program without specifying the full
path to the file, you get unexpected results. This is the case when you use a file
name for a program that already exists on your system. What happens is the
shell starts searching the file name from the first directory specified in your PATH
variable. It gets the other file before it reaches the current directory and executes
it. So it is always recommended to use "./" when you are testing your program
for the first time to make sure that the shell is indeed executing the correct file.
line on your terminal screen before executing it. The actual line present in the
program is shown with a plus (+) sign in the start of the line. After that, its output
is displayed. This method can be used to identify which program line is causing a
problem. Below is the output of our example program script-00 after this
modification. Note that comments are not displayed.
$ ./script-00
+ echo Our first script
Our first script
+ echo ----------------
----------------
+ echo
Study Break
intend it to do.
Variables can be set and used in the same way you have used them on the
command line. Any variable that is set during the execution of a shell program is
not visible after the execution is finished. Shell programs can read environment
variables and can also modify their values for the duration of the execution of the
program. Variables are also a useful way of passing data to shell programs. Let
us see another program named script-01 where we have set two variables TAB
and FUR. These variables are then displayed using the echo command. The
program is shown below.
$ cat script-01
#!/usr/bin/sh
echo "Use of Variables"
echo "----------------"
echo
TAB=table
FUR=furniture
echo "The $TAB is an example of $FUR"
$
When this program is executed, the results are:
$ ./script-01
Use of Variables
----------------
the variable value, and you find out that it is the same as it was before executing
the program.
$ COLOR=red
$ export COLOR
$ echo $COLOR
red
$ ./script-04
The current COLOR variable is
red
The new COLOR variable is
green
$
$ echo $COLOR
red
$
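The same behavior can be sketched without a separate script file. Here sh -c stands in for the program (an assumption made only to keep the example self-contained), and CHILD_VALUE is a hypothetical variable used to capture what the child shell prints.

```shell
COLOR=red
export COLOR

# The child shell receives its own copy of COLOR and changes only that copy.
CHILD_VALUE=$(sh -c 'COLOR=green; echo "$COLOR"')

echo "Inside the child:   $CHILD_VALUE"
echo "Back in the parent: $COLOR"
```

The child sees green, while the parent shell still holds red after the child has finished.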
The command line arguments are stored in variables that show the position of
the argument in the command line. That is why these are also called positional
parameters. The variables that store command line arguments have names from
$0 to $9. Beginning with the tenth command line argument, the argument number
The shift command is used to move the command line arguments one position
left. The first argument is lost when you use the shift command. Shifting
command line arguments is useful when you perform a similar action to all
arguments, one-by-one, without changing the variable name. The shift
command throws away the left-most variable (argument number 1) and reassigns
values to the remaining variables. The value in $2 moves to $1, the value in $3
moves to $2, and so on. Let's modify script-05 into script-06 as shown below
using the cat command.
$ cat script-06
#!/usr/bin/sh
echo "Total number of command line arguments is: $#"
echo "These arguments are: $*"
echo "The first argument is: $1"
shift
echo "New first argument after shift: $1"
shift
echo "First argument after another shift: $1"
$
Now let's execute script-06 with the same three arguments we used with
script-05. You can see from the next result that after every shift, a new value is
assigned to $1. This value is the variable that is just on the right side of $1 (i.e.,
$2).
$ ./script-06 red green blue
Total number of command line arguments is: 3
These arguments are: red green blue
The first argument is: red
New first argument after shift: green
First argument after another shift: blue
$
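Processing every argument one-by-one through $1 might look like the sketch below. The function name print_each is hypothetical, and the while loop it relies on is covered in the next chapter; the loop simply runs as long as $# (the argument count) is greater than zero.

```shell
print_each() {
    # Keep going while at least one argument is left.
    while [ $# -gt 0 ]
    do
        echo "Processing: $1"
        shift            # discard $1; the old $2 becomes the new $1
    done
}

print_each red green blue
```

Because each pass handles only $1, the same code works for any number of arguments.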
During the first shift operation, the value of $1 is lost forever and can't be recovered by
the program. The shift command can also do multiple shift operations in one
step. For this you need to supply an argument to the shift command. For
example, shift 2 will shift two arguments in one step, such that the old values of
$1 and $2 will be lost, the value of $3 will be assigned to $1, the value of $4 will be
Study Break
message before the read command is executed. This message informs the user
of what the program is expecting. These programs are used in situations where a
program first checks some system parameter and then requires user input to
perform an operation on it. As an example, if you want to talk to another user on
the system, you may first want to see who is logged into the system. After getting
a list of users, you may initiate conversation with a particular user using the talk
command.
The read command takes one line of input from the user and assigns it to a
variable. The variable name is provided as an argument to the read command.
After entering some text, the user presses the Enter key. Below is script-07,
which lists all users currently logged into the system and then waits for you to
enter a user name. After getting the user name, it initiates conversation with that
user using the talk command.
$ cat script-07
#!/usr/bin/sh
echo "Currently logged in users are:"
who
echo
echo
echo "Enter the name of the user to whom you want to talk"
read NAME
echo "initiating talk with $NAME"
talk $NAME
$
After you select a user, the program rings the other party and displays a
message asking the other user to respond to your talk request. If that user
accepts your request, a talk window appears. Before the talk window appears,
the program executes as shown below. Here you initiate a talk session with a
user linda.
$ ./script-07
Currently logged in users are:
boota pts/t0 Oct 18 17:53
linda pts/0 Oct 18 22:13
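To try the read command without starting a real talk session, a cut-down variant can take its input from a pipe. This is a sketch under stated assumptions: the function name ask_name is hypothetical, the talk command is deliberately left out, and the answer linda is supplied on standard input instead of being typed.

```shell
ask_name() {
    echo "Enter the name of the user to whom you want to talk"
    read NAME                      # reads one line from standard input
    echo "initiating talk with $NAME"
}

# Supply the answer through a pipe instead of the keyboard.
echo "linda" | ask_name
```

The read command does not care whether the line comes from the keyboard or a pipe, which makes interactive programs easy to test.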
You have already used the echo command to display text on your screen. This
command uses escape characters that can be used to format the displayed text
to enhance its readability. The escape characters used with the echo command
are listed in Table 10-2.
Table 10-2. Escape Characters Used with the echo Command
Character Effect
\a Alert character (beep)
\b Backspace
\c Suppress new line at the end of displayed text
\f Form feed
\n Insert a new line character
\r Carriage return
\t Insert a tab character
\\ Backslash
\nnn Character having ASCII value nnn in the octal format. The
first n is 0.
In the example of script-07, the cursor goes to the next line after the message,
"Enter the name of the user to whom you want to talk". If you want the
cursor to stay in the same line until you enter the user name, you need to
suppress the new line character by using \c in the echo command as follows.
echo "Enter user name to whom you want to talk \c"
You can also use \a to add a beep to your program as soon as this command is
executed.
When a program terminates its execution, it returns a result code to the shell that
shows the termination or exit status of the program. In the case of termination
after successful execution, this code is zero. If the program terminates
abnormally, the exit code is not equal to zero. The exit code of the last executed
program can be checked by displaying the value of a special variable $?. See the
following examples to get an idea of how the exit code works.
$ ls
PHCO_18132.depot phco_18132.txt script-02
PHCO_18132.text scr script-03
phco_18131.txt script-01 script-05
$ echo $?
0
$
$ mv
Usage: mv [-f] [-i] [-e warn|force|ignore] f1 f2
mv [-f] [-i] [-e warn|force|ignore] f1 ... fn d1
mv [-f] [-i] [-e warn|force|ignore] d1 d2
$ echo $?
1
$
The first command is the ls command, which executed successfully. After that,
you used the echo $? command, and it showed you an exit code zero. The
second command was mv. You did not provide any argument to this command,
which is a must, so the command terminated abnormally and returned a value of
1.
You can use exit codes in your program to check the execution status of any
commands used within the program. If a command used in the program did not
execute successfully, you can take corrective measures.
You can also terminate your shell program at any point by using the exit
command. The exit command can be supplied with an argument that will show
the exit status of your program and can be used for debugging purposes. As an
example, if you are writing a program and it needs at least one command line
argument, you can terminate its execution if no argument is supplied.
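A minimal sketch of that argument check follows. The function name need_one_arg and the exit status 2 are choices made for this illustration only. The function body is written in parentheses so that it runs in a subshell; exit then ends only that subshell, much as it would end a separate program. The if structure used here is explained in the branching section.

```shell
need_one_arg() (
    if [ $# -lt 1 ]
    then
        echo "At least one argument is required"
        exit 2          # hypothetical status chosen for this sketch
    fi
    echo "First argument is: $1"
    exit 0
)

# Without arguments the check fails and $? holds the status given to exit.
need_one_arg || echo "Status without arguments: $?"

# With one argument the program runs to the end and returns zero.
need_one_arg red && echo "Status with one argument: $?"
```

Using a distinct nonzero status for each kind of failure makes it easier to see, from $? alone, why a program stopped.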
Branching decisions are made depending on the result of a test command. The
test command can perform tests on numeric and string data as well as on files.
The test command returns a true or false value. The true value is always zero,
while false is a number other than zero. Usually this number is one. You can
check the result code of the test command to make a branching decision. The
test command can be used in explicit or implicit modes. In the explicit mode, the
Tests can be performed to compare two or more integers. The relations that can
be used with numeric data are shown in Table 10-3.
Table 10-3. Numeric Tests
Relation Description
-eq Equality check
-ne Not equal
-lt Less than
-gt Greater than
-le Less than or equal to
-ge Greater than or equal to
Numeric testing will be used in shell programs later in this chapter.
The string values can be checked for equality and nonequality. Other than that, a
single string can be tested if it has a zero length or not. The string operations are
shown in Table 10-4.
Table 10-4. String Tests
Operation Description
string1 = string2 True if string1 and string2 are equal
string1 != string2 True if string1 is not equal to string2
-z string True if string length is zero
-n string True if string length is nonzero
string True if string length is nonzero
Testing Files
Testing on files can be performed in many ways. Some of these are shown in
Table 10-5. A list of other supported file tests can be found using the
man sh-posix command.
Operation Description
-d file True if the file is a directory
-f file True if the file exists and is a normal file (not a directory)
-s file True if the file is more than zero bytes in length
-r file True if the file is readable
-w file True if the file is writable
-e file True if the file exists
-L file True if the file is a symbolic link
file1 -nt file2 True if file1 is newer than file2
file1 -ot file2 True if file1 is older than file2
-x file True if the file is executable
For example, if file file1 exists, and you use the following command to check its
existence, you will get an exit code of zero (true).
$ [ -f file1 ]
$
$ echo $?
0
$
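The numeric, string, and file tests can be tried together in a short sketch like the one below. The variable names NUM_RESULT, STR_RESULT, and DIR_RESULT are hypothetical; each one simply records the exit code of one test.

```shell
# Numeric test: 5 is less than 10, so the exit code is 0 (true).
[ 5 -lt 10 ]
NUM_RESULT=$?

# String test: the two strings differ, so the exit code is nonzero (false).
STR_RESULT=0
[ "red" = "green" ] || STR_RESULT=$?

# File test: the root directory exists and is a directory, so this is true.
[ -d / ]
DIR_RESULT=$?

echo "numeric=$NUM_RESULT string=$STR_RESULT directory=$DIR_RESULT"
```

Only the string comparison fails here, so it is the only test that leaves a nonzero exit code.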
Logical operations can be performed on two expressions with one of the logical
operators shown in Table 10-6.
Table 10-6. Logical Operators
Operation Description
expr1 -o expr2 Logical OR, true if either expr1 or expr2 is true
expr1 -a expr2 Logical AND, true if both expr1 and expr2 are true
! expr Logical NOT, true if expr is false
The following code segment tests files represented by the first two command line
arguments and prints a message if both files exist.
#!/usr/bin/sh
# Print a message only if both arguments name existing regular files.
if [ -f "$1" -a -f "$2" ]
then
    echo "Test successful"
fi
10.7 Branching
The if-then-fi structure is used to check a condition with the help of a test
command. If the test returns a true value, then an action is performed. If the
test returns a false value (not true), the action part of the program is not
This structure is used when you want to perform one of two actions, depending
on the result of a test. The general syntax of the structure is:
if expr
then
action1
else
action2
fi
If the result of expr is true, action1 is performed. In the other case, action2 is
performed. Each of action1 and action2 may be a set of one or more commands.
The flow chart of this structure is shown in Figure 10-2.
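As a small illustration of this structure, the sketch below chooses between two messages according to a file test. The function name check_file and the path /no/such/path are hypothetical and serve only as an example.

```shell
check_file() {
    if [ -d "$1" ]
    then
        echo "$1 is a directory"        # action1: the test was true
    else
        echo "$1 is not a directory"    # action2: the test was false
    fi
}

check_file /
check_file /no/such/path
```

Exactly one of the two echo commands runs on each call.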
The case structure is used where you want to branch to multiple program
segments depending on the value of a variable. The general syntax of the case
structure is:
case var in
pattern1)
commands
;;
pattern2)
commands
;;
…
patternn)
commands
;;
*)
commands
;;
esac
The value of var is checked. If this is equal to pattern1, the commands in the first
block are executed. The first block ends at the ;; pattern. If the value of var
matches pattern2, the commands in the second block are executed. If var
matches none of the pattern values, the commands in the last block after "*)" are
executed. This last part is optional and can be omitted. The case statement is
shown in the flow diagram of Figure 10-3.
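The general syntax can be filled in as in the sketch below. The function name describe_answer and the patterns yes and no are hypothetical choices for illustration.

```shell
describe_answer() {
    case $1 in
        yes)
            echo "You agreed"
            ;;
        no)
            echo "You declined"
            ;;
        *)
            # Reached only when no earlier pattern matched.
            echo "Unrecognized answer"
            ;;
    esac
}

describe_answer yes
describe_answer maybe
```

Patterns are tried from top to bottom, so the catch-all "*)" block belongs at the end.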
1: You create a shell program and save it into a file with name "more". Your
current directory name is included in the PATH variable at the end. When
you run this program by typing "more", nothing happens and the cursor just
stops at the next line. What may be the problem?
A. You have written some commands in the program that the shell does
not understand.
B. The shell program file is not executable.
C. You have used a program name that matches a standard HP-UX
command.
D. There is a problem with your terminal and you need to reset it.
3: You use the echo $? command. The result is 2. What do you conclude from
this?
4: You used shift 3 in your shell program. What will be its effect?
A. It will shift the first three command line arguments to the left.
B. It will shift the last three command line arguments to the left.
C. It will shift all command line arguments by three places toward the left.
D. It will shift all command line arguments by three places toward the
right.
7: A shell script with the name myscript does not have the execution bit set.
How can you execute it?
A. exec myscript
B. sh myscript
C. run myscript
D. No shell script can be executed until its execution bit is set.
A. using $*
B. using $#
C. using the shift command
D. using the list command
A. 1
B. 0
C. any positive number
D. any number not equal to zero
10: You have a shell script as shown here. What will be the result when it is
executed?
#!/usr/bin/sh
ABC=aac
case $ABC in
a) echo "First"
;;
[aa]c) echo "Second"
;;
a*) echo "Third"
;;
*) echo "Last"
;;
esac
A. First
B. Second
C. Third
D. Last
Chapter 11. Advanced Shell Programming
Chapter Syllabus
There are three basic types of loops. The first one is the for-do-done loop, which
is used to execute a block of commands a fixed number of times. The
while-do-done loop checks for a condition and goes on executing a block of commands
until that condition becomes false. The until-do-done loop repeats the execution
of a block of commands until a condition becomes true. As soon as the condition
becomes true, the loop terminates.
All of these loops are controlled by a variable known as the control variable. This
variable gets a new value on every repetition of the loop. The let command is
also used to make arithmetic, logic, and assignment operations inside the loops
and to change the value of the control variable.
In this chapter, we will start with arithmetic and logic operations performed with
the let command. The three loops will be discussed one-by-one. You will find
the general syntax of each loop as well as a flow diagram. In the end, you will
find some text processing examples and their use in loops.
The let command performs both arithmetic and logic operations. The use of the
let command is important because all loops depend on the control variable. The
value of this control variable must be changed during the execution of the loop. Usually
this value is incremented or decremented with the help of the let command. The
loop structures also need logic operations, used for testing the value of the
control variable. This is the second use of the let command. Like the test
command, the let command also has explicit and implicit modes.
In the explicit mode, the word let is used in the command line. Consider the
following example of the use of the command.
$ A=5
$ B=3
$ let "C=A+B"
$ echo $C
8
$
You created two new shell variables A and B and assigned these variables
numeric values. Then you used the let command to sum these two values and
assign the result to a third variable C. To display the value of this variable, you
used the echo command. Like this arithmetic operation, you can also perform
logic operations with the let command as shown in the following example.
$ var1=5
$ var2=3
$ let "var1<var2"
$ echo $?
1
$ let "var1>var2"
$ echo $?
0
$
In this example, you compared two variables. The first comparison was not true,
so the result code returned is 1. The second comparison is true and the result
code is zero.
You can replace the word let with double parentheses on each side of the
expression. The above example, where you added two variables, can also be
accomplished as follows.
$ A=5
$ B=3
$ ((C=A+B))
$ echo $C
8
$
The let command can also perform complex operations like the one shown here.
((A=A+(3*B)/(A-1)))
While evaluating the result of an expression, the usual arithmetic rules are
applied. Parentheses can be used to alter the order of evaluation.
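To see the evaluation order with concrete numbers, the sketch below works through the same expression with A=5 and B=3. Note the assumption: it uses the arithmetic expansion form $(( )) rather than let or (( )), so that it also runs in shells that lack the let command; the arithmetic rules are the same.

```shell
A=5
B=3

# 3*B is 9 and A-1 is 4. Integer division drops the fraction, so 9/4 is 2,
# and the new value of A is 5 + 2.
A=$((A + (3*B)/(A-1)))

echo "$A"
```

The division is performed on integers, which is why the result is 7 and not 7.25.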
Table 11-1 lists the operators that can be used with the let command.
The first two operators are unary operators that need only one operand. All other
operators are binary operators and need two operands. You will find many
examples of the use of the let command in this chapter.
Table 11-1. Operators Used with the let Command
Operator Description
- Unary minus (same value but with a negative sign)
! Logical negation
= Assignment
+ Addition
- Subtraction
* Multiplication
/ Integer division
% Remainder
< Less than
> Greater than
<= Less than or equal to
>= Greater than or equal to
== Comparison for equality
!= Comparison for nonequality
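The following sketch exercises a few of the operators from Table 11-1. The
variable names X, Y, Q, R, N, and STATUS are chosen only for this illustration.

```shell
X=17
Y=5
((Q = X / Y))      # integer division: 17 / 5 gives 3
((R = X % Y))      # remainder: 17 % 5 gives 2
((N = -X))         # unary minus: -17
((X != Y))         # the comparison is true, so the exit code is 0
STATUS=$?
echo "$Q $R $N $STATUS"
```

With these values the last line prints "3 2 -17 0".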
a value of VAR1 equal to 1. You add 2 to the value of VAR1 at each step. You
compare the value of the variable with 10. This condition is never fulfilled
because the value of the variable never becomes 10. It goes from 9 to 11,
skipping the value to which the comparison is made. By changing "!=" to "<=",
you can solve the problem. The program script-21 is shown here.
#!/usr/bin/sh
echo "The while loop example"
echo
VAR1=1
while ((VAR1 != 10))
do
echo "Value of the variable is : $VAR1"
((VAR1 = VAR1 + 2))
done
echo
echo "The loop execution is finished"
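The fix mentioned above, replacing "!=" with "<=", can be sketched as follows.
The COUNT variable is not part of script-21; it is added here only to show how
many times the corrected loop runs.

```shell
VAR1=1
COUNT=0
# The loop now stops as soon as VAR1 goes past 10, even though it never equals 10
while ((VAR1 <= 10))
do
    echo "Value of the variable is : $VAR1"
    ((VAR1 = VAR1 + 2))
    ((COUNT = COUNT + 1))
done
echo "The loop ran $COUNT times"
```

The loop displays the values 1, 3, 5, 7, and 9 and then ends with VAR1 equal
to 11.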
Another example of an infinite loop is when you forget to modify the control
variable inside the loop, such as in the code segment that follows.
VAR1=1
while ((VAR1 != 10))
do
echo "Value of the variable is : $VAR1"
done
Here the value of VAR1 is always 1, and the condition remains true, resulting in an
infinite loop.
The until-do-done loop is like the while-do-done loop. The only difference is
that it tests the condition and goes on executing as long as the condition remains
false. It terminates execution as soon as the condition becomes true. The
general syntax of this loop is:
until condition
do
command block
done
The flow diagram of the until-do-done loop is shown in Figure 11-2.
The following script is an example of an until-do-done loop.
#!/usr/bin/sh
echo "The until loop example"
echo
VAR1=1
until (( VAR1 > 100 ))
do
echo "Value of the variable is : $VAR1"
((VAR1 = VAR1 * 2))
done
echo
echo "The loop execution is finished"
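Because the until loop runs as long as its condition is false, any until loop
can be rewritten as a while loop with the opposite condition. The sketch below
runs both forms side by side; VAR2 is an extra variable introduced only for
the comparison.

```shell
VAR1=1
until (( VAR1 > 100 ))      # run until VAR1 exceeds 100
do
    ((VAR1 = VAR1 * 2))
done

VAR2=1
while (( VAR2 <= 100 ))     # the same loop with the condition reversed
do
    ((VAR2 = VAR2 * 2))
done
echo "$VAR1 $VAR2"
```

Both loops stop when the variable reaches 128, the first power of 2 greater
than 100.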
Let's suppose you want to change the access and modification times of all
files in your current directory to the current time. You can use the touch
command with a small shell script as shown here. The file name is quoted so
that names containing spaces are handled correctly.
for FILE in *
do
touch "$FILE"
done
There may be situations when you want to break or discontinue the execution of
commands inside the command block of a loop. This is done when a particular
condition is met and you don't want any further execution of commands in the
command block. You may also need to check an error condition and discontinue
execution of the program depending on that error condition.
The shell provides three mechanisms for breaking the normal execution of loops.
These are break, continue, and exit.
The continue command is slightly different from the break command. When
encountered, it skips the remaining part of the loop and transfers control to
the start of the loop for the next iteration. Program script-26 does the same
job as script-25 but uses the continue command. In this script, the test
condition is changed: the loop goes on executing until you enter a name that
is not a regular file. At that point, the break command ends the loop and the
command after the loop is executed.
#!/usr/bin/sh
while true
do
echo "Enter name of file to be displayed: \c"
read FILE
# Quote $FILE so that an empty reply or a name with spaces is tested correctly
if [ -f "$FILE" ]
then
cat "$FILE"
continue
fi
echo "This is not a regular file"
break
done
echo "Good bye"
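The continue command is not limited to while loops. As a minimal sketch, the
following for loop uses continue to skip odd numbers and adds up only the even
ones. The variable names NUM and SUM are illustrative.

```shell
SUM=0
for NUM in 1 2 3 4 5 6
do
    # For odd numbers, skip the rest of the block and start the next iteration
    if ((NUM % 2 == 1))
    then
        continue
    fi
    ((SUM = SUM + NUM))
done
echo "Sum of even numbers: $SUM"
```

Only 2, 4, and 6 reach the addition, so the sum displayed is 12.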
The exit command completely terminates the program. It returns an exit code
that is optionally provided as its argument in the program. If the exit command
doesn't have any arguments, it returns the exit code of the command executed
just before it. This command is used when a critical error is encountered, and
further execution of the program may cause faulty results. For example, dividing
a number by zero is illegal, so you want to check this condition before a
command is executed that divides a number by zero. Program script-27 reads a
number entered by the user and then divides 100 by this number. It then displays
the quotient and the remainder. If you try to divide by zero, the program displays
an error message and terminates immediately.
#!/usr/bin/sh
NUM=100
while true
do
echo "Enter a divisor for integer 100 : \c"
read DIV
if [ $DIV -eq 0 ]
then
echo "Divide by zero is not permitted"
exit 1
fi
(( QUO = NUM / DIV ))
(( REM = NUM % DIV ))
echo "The quotient is : $QUO"
echo "The remainder is : $REM"
done
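The two forms of the exit command described above can be compared with a small
sketch. It is an illustration only: each command group in parentheses runs in
a subshell, so that exit ends the subshell rather than the script itself, and
the names CODE1 and CODE2 are hypothetical.

```shell
CODE1=0
# No argument: exit passes on the exit code of the previous command (false gives 1)
( false; exit ) || CODE1=$?

CODE2=0
# Explicit argument: the exit code is 3 regardless of the previous command
( true; exit 3 ) || CODE2=$?

echo "$CODE1 $CODE2"
```

The last line prints "1 3".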
You have used the grep command as a filter to extract or delete lines containing
a particular text pattern. Here you will learn two more commands that are useful
for text processing. The sed command is a stream editor that takes text from
stdin and sends it to stdout after editing it. The cut command is used to extract a
desired part of text from a line. It also takes its input from stdin and sends its
output to stdout.
Using sed
The stream editor is a useful tool to edit large amounts of text at one time. For
example, you may need to search for a word in a large file and replace it with
another word. Let's try to replace the word "echo" with "ECHO" in script-27. The
sed command will do the job as follows.
$ sed s/echo/ECHO/g script-27
#!/usr/bin/sh
NUM=100
while true
do
ECHO "Enter a divisor for integer 100 : \c"
read DIV
if [ $DIV -eq 0 ]
then
ECHO "Divide by zero is not permitted"
exit 1
fi
(( QUO = NUM / DIV ))
(( REM = NUM % DIV ))
ECHO "The quotient is : $QUO"
ECHO "The remainder is : $REM"
done
If you want to do an operation on all files, you can write a shell program to
accomplish the job. Program script-28 shown here replaces "echo" with "ECHO"
in all files of the current directory.
#!/usr/bin/sh
for FILE in *
do
# Skip directories and other entries that are not regular files
[ -f "$FILE" ] || continue
sed 's/echo/ECHO/g' "$FILE" >tempfile
cp tempfile "$FILE"
done
rm tempfile
As you can see, this is a very useful tool for making changes to a large
number of files that could otherwise take a long time. Suppose you are writing
a book and want to change "figure" to "Fig" in all chapters. If you don't know
how to do it efficiently, you may start editing all files manually, spending
hours on a job that should take only a few minutes.
There are many ways to use sed that make it a very useful tool. For additional
information, consult the sed manual pages.
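As a brief sketch of two common uses, the following commands apply the
substitute (s) and delete (d) commands of sed to a few lines of sample text.
The sample text and the variable names TEXT, RESULT, and REST are made up for
this illustration.

```shell
TEXT="see figure 1
see figure 2
no match here"

# Replace every occurrence of "figure" with "Fig"
RESULT=$(echo "$TEXT" | sed 's/figure/Fig/g')

# Delete every line that contains the word "figure"
REST=$(echo "$TEXT" | sed '/figure/d')

echo "$RESULT"
echo "$REST"
```

The single quotes around the sed command keep the shell from interpreting any
special characters in the pattern.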
Using cut
The cut command is used to extract a particular part of data from a line of text. If
the data are in the form of fields, you can extract particular fields. For example, if
you want to list all user names on your system, you can use the cut command on
the /etc/passwd file as follows:
cut -f 1 -d : /etc/passwd
or
cat /etc/passwd | cut -f 1 -d :
Here the -f 1 option tells the command that you want to extract field number 1.
The -d : option specifies that the fields in the data are separated by a colon
":". Since user names are at the start of each line in /etc/passwd and are
followed by a colon, the command extracts all user names from the file.
You may also use the cut command to extract a particular number of characters
from a file. To extract the first eight characters from every line in the /etc/passwd
file, you may use the following command.
cat /etc/passwd | cut -c 1-8
Here you specified a range of characters using the -c 1-8 option. See the
manual pages for more information on the cut command.
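The -f option also accepts a comma-separated list of fields. The sketch below
applies the options discussed above to a single sample line in the format of
/etc/passwd. The line itself and the variable names are invented for this
illustration and do not come from a real system.

```shell
LINE="jsmith:x:1001:20:John Smith:/home/jsmith:/usr/bin/sh"

# Field 1 only: the user name
NAME=$(echo "$LINE" | cut -f 1 -d :)

# Fields 1 and 3: the user name and the user ID, still separated by a colon
NAME_UID=$(echo "$LINE" | cut -f 1,3 -d :)

# The first four characters of the line
FIRST4=$(echo "$LINE" | cut -c 1-4)

echo "$NAME"
echo "$NAME_UID"
echo "$FIRST4"
```

The three lines displayed are jsmith, jsmith:1001, and jsmi.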
Let us use the cut command in a shell program script-29. This script is used to
send an email message to all users on a system. The message contents are
stored in a file mailfile. You use the mailx command to send the message.
#!/usr/bin/sh
for USER in $(cut -f 1 -d : /etc/passwd)
do
mailx -s "Test mail" $USER <mailfile
done
You have used the cut command to create a list of user names and then send a
mail message to each name in the list.
The sleep command is used to suspend execution for a certain amount of time.
You provide the number of seconds as the argument to the sleep command. The
following code segment lists all files in the current directory with a pause of five
seconds between every file.
for FILE in *
do
ll $FILE
sleep 5
done
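The sleep command can be combined with any type of loop. The following
minimal sketch makes two passes through a while loop and pauses for one second
at the end of each pass. The COUNT variable and the messages are illustrative.

```shell
COUNT=1
while ((COUNT <= 2))
do
    echo "Pass $COUNT"
    sleep 1                  # wait one second before the next pass
    ((COUNT = COUNT + 1))
done
echo "Finished"
```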
1: Which command will you use to add the values of two variables VAR1 and
VAR2, and store the result in VAR3?
2: You want to wait for 10 seconds at the end of the loop in each loop cycle.
Which command will you use?
A. sleep
B. pause
C. wait
D. Any of the three commands can be used.
3: Consider the following code segment. How many times does the loop
execute?
A=1
until [ $A < 10 ]
do
echo $A
(( $A=$A+1))
done
A. zero
B. one
C. nine
D. ten
5: While writing a program, you meet a situation where you want to break the
normal execution and shift control to the beginning of the loop, skipping the
remaining commands in the loop. Which command will you use?
A. break
B. continue
C. exit
D. shift