0% found this document useful (0 votes)
17 views40 pages

POSIX Threads Programming

Uploaded by

Sara
Copyright
© © All Rights Reserved
Available Formats
Download as PDF, TXT or read online on Scribd
Download as pdf or txt
0% found this document useful (0 votes)
17 views40 pages

POSIX Threads Programming

Uploaded by

Sara
Copyright
© © All Rights Reserved
Available Formats
Download as PDF, TXT or read online on Scribd
Download as pdf or txt
Download as pdf or txt
You are on page 1/ 40

POSIX Threads Programming

Privacy & Legal Notice

POSIX THREADS

Table of Contents

1. Pthreads Overview
1. What is a Thread?
2. What are Pthreads?
3. Why Pthreads?
4. Designing Threaded Programs
2. The Pthreads API
3. Thread Management
1. Creating Threads
2. Terminating Thread Execution
3. Example: Pthread Creation and Termination
4. Passing Arguments to Threads
5. Thread Identifiers
6. Joining Threads
7. Detaching / Undetaching Threads
8. Example: Joining Threads
4. Mutex Variables
1. Mutex Variables Overview
2. Creating / Destroying Mutexes
3. Locking / Unlocking Mutexes
4. Example: Using Mutexes
5. Condition Variables
1. Condition Variables Overview
2. Creating/Destroying Condition Variables
3. Waiting / Signalling on Condition Variables
4. Example: Using Condition Variables
6. Pthreads, MPI, SMPs, AIX and IBM's PE
7. Pthread Library Routines Reference
8. References and More Information

file:///D|/TEMP-KNJIGE%20OD%20NILE/E-books%20(benco)/linux/POSIX%20Threads%20Programming.htm (1 of 40) [23.9.2003 11:40:06]


POSIX Threads Programming

Pthreads Overview

What is a Thread?

● A thread is an independent, schedulable, stream of instructions. In the UNIX environment, a


thread exists within a process, and uses the process resources. However, a thread possesses its own
independent flow of control. There can be multiple threads within a process. The concept of a
"procedure", which can run independently within a process, may best describe a thread.

● To better understand what comprises a thread, it is helpful to understand the relationship between
a process and a thread. A process is created by the operating system. Processes contain
information about program resources and program execution state, including:
❍ Process ID, process group ID, user ID, and group ID

❍ Environment

❍ Working directory.

❍ Program instructions

❍ Registers

❍ Stack

❍ Common address space, data and memory heap

❍ File descriptors

❍ Signal actions

❍ Shared libraries

❍ Inter-process communication tools (such as message queues, pipes, semaphores, or shared

memory).

file:///D|/TEMP-KNJIGE%20OD%20NILE/E-books%20(benco)/linux/POSIX%20Threads%20Programming.htm (2 of 40) [23.9.2003 11:40:06]


POSIX Threads Programming

● Threads use and exist within these process resources, yet are able to be scheduled by the operating
system and run as independent entities within a process.

● A thread can possess an independent flow of control and be schedulable because it maintains its
own:
❍ Stack

❍ Scheduling properties (such as policy or priority)

❍ Set of pending and blocked signals

❍ Thread specific data.

file:///D|/TEMP-KNJIGE%20OD%20NILE/E-books%20(benco)/linux/POSIX%20Threads%20Programming.htm (3 of 40) [23.9.2003 11:40:06]


POSIX Threads Programming

● A process can have multiple threads, all of which share the resources within a process and all of
which execute within the same address space. Within a multi-threaded program, there are at any
time multiple points of execution.

● Because threads within the same process share resources:


❍ Changes made by one thread to shared system resources (such as closing a file) will be

seen by all other threads.


❍ Two pointers having the same value point to the same data.

❍ Reading and writing to the same memory locations is possible, and therefore requires

explicit synchronization by the programmer.

● On a uniprocessor, multi-threaded processes provide for concurrent execution. On a


multiprocessor system, a process with multiple threads provides potential parallelism.

● Threads are peers. All threads, except the initial thread are automatically created when a process is
created, are on the same hierarchical level. A thread does not maintain a list of created threads, nor
does it know the thread that created it.

file:///D|/TEMP-KNJIGE%20OD%20NILE/E-books%20(benco)/linux/POSIX%20Threads%20Programming.htm (4 of 40) [23.9.2003 11:40:06]


POSIX Threads Programming

Pthreads Overview

What are Pthreads?

● Historically, hardware vendors have implemented their own proprietary versions of threads. These
implementations differed substantially from each other making it difficult for programmers to
develop portable threaded applications.

● In order to take full advantage of the capabilities provided by threads, a standardized


programming interface was required. For UNIX systems, this interface has been specified by the
IEEE POSIX 1003.1c standard (1995). Implementations which adhere to this standard are referred
to as POSIX threads, or Pthreads. Most hardware vendors now offer Pthreads in addition to their
proprietary API's.

● Pthreads are defined as a set of C language programming types and procedure calls. Vendors
usually provide a Pthreads implementation in the form of a header/include file and a library which
you link with your program.

● There are several drafts of the POSIX threads standard. It is important to be aware of the draft
number of a given implementation, because there are differences between drafts which can cause
problems.
❍ Some versions of DECthreads and IBM's AIX 4.2 threads follow draft 7 of the POSIX

standard.
❍ Windows NT does not follow the POSIX standard for threads but has its own proprietary

implementation.
❍ The Solaris 2 operating system follows the Unix International standard for its thread

implementation which is slightly different from the POSIX standard.


❍ Draft 10 of the POSIX 1003.1c eventually became the standard. This version is used by

AIX 4.3.

Pthreads Overview

Why Pthreads?

file:///D|/TEMP-KNJIGE%20OD%20NILE/E-books%20(benco)/linux/POSIX%20Threads%20Programming.htm (5 of 40) [23.9.2003 11:40:06]


POSIX Threads Programming

● The primary motivation for using Pthreads is to realize potential program performance gains.

● When compared to the cost of creating and managing a process, a thread can be created with much
less operating system overhead. Managing threads requires fewer system resources than managing
processes.

For example, the following table compares timing results for the fork() subroutine and the
pthreads_create() subroutine. Timings reflect 50,000 process/thread creations, were
performed with the timex utility, and units are in seconds.

fork() pthread_create()
IBM Architecture
real user sys real user sys

332 MHz 604e 92.42 2.66 105.29 8.72 4.97 3.93


4 CPUs/node
512 MB Memory

112 MHz 604 259.21 8.84 249.26 28.79 17.82 21.03


8 CPUs/node
1 GB Memory

160 MHz P2SC 55.54 2.31 43.49 5.05 2.54 2.41


1 CPU/node
512 MB Memory

222 MHz POWER3 80.05 3.71 82.30 8.64 3.74 5.84


8 CPU/node
4 GB Memory

fork_vs_thread.txt

● All threads within a process share the same address space. Inter-thread communication is more
efficient and in many cases, easier to use than inter-process communication.

● Threaded applications offer potential performance gains and practical advantages over non-
threaded applications in several other ways:
❍ Overlapping CPU work with I/O: For example, a program may have sections where it is

performing a long I/O operation. While one thread is waiting for an I/O system call to
complete, CPU intensive work can be performed by other threads.
❍ Priority/real-time scheduling: tasks which are more important can be scheduled to

file:///D|/TEMP-KNJIGE%20OD%20NILE/E-books%20(benco)/linux/POSIX%20Threads%20Programming.htm (6 of 40) [23.9.2003 11:40:06]


POSIX Threads Programming

supersede or interrupt lower priority tasks.


❍ Asynchronous event handling: tasks which service events of indeterminate frequency and
duration can be interleaved. For example, a web server can both transfer data from previous
requests and manage the arrival of new requests.

● Multi-threaded applications will work on a uniprocessor system, yet naturally take advantage of a
multiprocessor system, without recompiling.

● In a multiprocessor environment, the most important reason for using Pthreads is to take
advantage of potential parallelism. This will be the focus of the remainder of this tutorial.

Pthreads Overview

Designing Threaded Programs

● In order for a program to take advantage of Pthreads, it must be able to be organized into discrete,
independent tasks which can execute concurrently. For example, if routine1 and routine2 can be
interchanged, interleaved and/or overlapped in real time, they are candidates for threading.

● Tasks which may be suitable for threading include tasks that:

file:///D|/TEMP-KNJIGE%20OD%20NILE/E-books%20(benco)/linux/POSIX%20Threads%20Programming.htm (7 of 40) [23.9.2003 11:40:06]


POSIX Threads Programming

❍ Block for potentially long waits


❍ Use many CPU cycles
❍ Must respond to asynchronous events
❍ Are of lesser or greater importance than other tasks
❍ Are able to be performed in parallel with other tasks

● A threaded program must possess reentrant functions/tasks. A reentrant function:


❍ Does not hold static data over successive calls, nor does it return a pointer to static data.

Reentrant functions use local variables which are dynamically allocated on the stack.
❍ Provides necessary synchronization and locking mechanisms when working with

global/shared data.
❍ Must not call non-reentrant functions, including routines from other libraries which are not

thread-safe. A workaround (which may/may not degrade performance) for thread-unsafe


routines: just put a lock around the entire routine.

● Several common models for threaded programs exist:

❍ Manager/worker: a single thread, the manager assigns work to other threads, the workers.
Typically, the manager handles all input and parcels out work to the other tasks. At least
two forms of the manager/worker model are common: static worker pool and dynamic
worker pool.

❍ Pipeline: a task is broken into a series of suboperations, each of which is handled in series,
but concurrently, by a different thread. An automobile assembly line best describes this
model.

❍ Peer: similar to the manager/worker model, but after the main thread creates other threads,
it participates in the work.

The Pthreads API

● The Pthreads API is defined in the ANSI/IEEE POSIX 1003.1 - 1995 standard. Unlike MPI, this
standard is not freely available on the Web - it must be purchased from IEEE.

● The subroutines which comprise the Pthreads API can be informally grouped into three major
classes:

file:///D|/TEMP-KNJIGE%20OD%20NILE/E-books%20(benco)/linux/POSIX%20Threads%20Programming.htm (8 of 40) [23.9.2003 11:40:06]


POSIX Threads Programming

1. Thread management: The first class of functions work directly on threads - creating,
detaching, joining, etc. They include functions to set/query thread attributes (joinable,
scheduling etc.)

2. Mutexes: The second class of functions deal with a coarse type of synchronization, called a
"mutex", which is an abbreviation for "mutual exclusion". Mutex functions provide for
creating, destroying, locking and unlocking mutexes. They are also supplemented by mutex
attribute functions that set or modify attributes associated with mutexes.

3. Condition variables:The third class of functions deal with a finer type of synchronization -
based upon programmer specified conditions. This class includes functions to create,
destroy, wait and signal based upon specified variable values. Functions to set/query
condition variable attributes are also included.

● Naming conventions: All identifiers in the threads library begin with pthread_

Routine Prefix Functional Group

pthread_ Threads themselves and miscellaneous subroutines

pthread_attr_ Thread attributes objects

pthread_mutex_ Mutexes

pthread_mutexattr_ Mutex attributes objects.

pthread_cond_ Condition variables

pthread_condattr_ Condition attributes objects

pthread_key_ Thread-specific data keys

● The concept of opaque objects pervades the design of the API. The basic calls work to create or
modify opaque objects - the opaque objects can be modified by calls to attribute functions, which
deal with opaque attributes.

● The Pthreads API contains over 60 subroutines. This tutorial will focus on a subset of these -
specifically, those which are most likely to be immediately useful to the beginning Pthreads
programmer.

● The pthreads.h header file must be included in each source file using the Pthreads library.

● The current POSIX standard is defined only for the C language. Efforts to define it for Fortran are

file:///D|/TEMP-KNJIGE%20OD%20NILE/E-books%20(benco)/linux/POSIX%20Threads%20Programming.htm (9 of 40) [23.9.2003 11:40:06]


POSIX Threads Programming

not yet complete. Fortran programmers can use wrappers around C function calls.

● Implementation notes:

IBM's API includes a Fortran interface which may be used for convenience at the price of
portability. An example IBM Fortran Pthreads program is provided here.

● A number of excellent books about Pthreads are available. Several of these are listed in the
References section of this tutorial.

Thread Management

Creating Threads

● Initially, your main() program comprises a single, default thread. All other threads must be
explicitly created by the programmer.

● Routines:

pthread_create (thread,attr,start_routine,arg)

● This routine creates a new thread and makes it executable. Typically, threads are first created from
within main() inside a single process. Once created, threads are peers, and may create other
threads.

● The pthread_create subroutine returns the new thread ID via the thread argument. The caller can
use this thread ID to perform various operations on the thread. This ID should be checked to
ensure that the thread was successfully created.

● The attr parameter is used to set thread attributes. You can specify a thread attributes object, or
NULL for the default values. Thread attributes are discussed later.

● The start_routine is the C routine that the thread will execute once it is created.

● A single argument may be passed to start_routine via arg. It must be passed by reference as a

file:///D|/TEMP-KNJIGE%20OD%20NILE/E-books%2...nco)/linux/POSIX%20Threads%20Programming.htm (10 of 40) [23.9.2003 11:40:06]


POSIX Threads Programming

pointer cast of type void.

● Questions:

How do you know when a thread will be scheduled to run by the operating system after it
has been created...especially on an SMP machine?
Answer

● Implementation notes:

Check your local implementation for limits on thread creation. For example, AIX 4.2
specifies a maximum of 512 threads per process with a default thread stack size of 56K
bytes.

Thread Management

Terminating Thread Execution

● There are several ways in which a Pthread may be terminated:


❍ The thread returns from its starting routine (the main routine for the initial thread). By

default, the Pthreads library will reclaim any system resources used by the thread. This is
similar to a process terminating when it reaches the end of main.
❍ The thread makes a call to the pthread_exit subroutine (covered below).

❍ The thread is canceled by another thread via the pthread_cancel routine (not covered

here).
❍ The thread receives a signal that terminates it

❍ The entire process is terminated due to a call to either the exec or exit subroutines.

● Routines:

pthread_exit (status)

● This routine is used to explicitly exit a thread. Typically, the pthread_exit() routine is called
after a thread has completed its work and is no longer required to exists.

file:///D|/TEMP-KNJIGE%20OD%20NILE/E-books%2...nco)/linux/POSIX%20Threads%20Programming.htm (11 of 40) [23.9.2003 11:40:06]


POSIX Threads Programming

● If main() finishes before the threads it has created, and exits with pthread_exit(), the
other threads will continue to execute. Otherwise, they will be automatically terminated when
main() finishes.

● The programmer may optionally specify a termination status, which is stored as a void pointer for
any thread that may join the calling thread.

● Cleanup: the pthread_exit() routine does not close files; any files opened inside the thread
will remain open after the thread is terminated.

● Recommendations:

❍ Use pthread_exit() to exit from all threads...especially main().

Thread Management

Example: Pthread Creation and Termination

● This simple example code creates 5 threads with the pthread_create() routine. Each thread
prints a "Hello World!" message, and then terminates with a call to pthread_exit().

Example Code - Pthread Creation and Termination

#include <pthread.h>
#include <stdio.h>
#define NUM_THREADS 5

void *PrintHello(void *threadid)


{
printf("\n%d: Hello World!\n", threadid);
pthread_exit(NULL);
}

int main()
{
pthread_t threads[NUM_THREADS];
int rc, t;
for(t=0;t < NUM_THREADS;t++){

file:///D|/TEMP-KNJIGE%20OD%20NILE/E-books%2...nco)/linux/POSIX%20Threads%20Programming.htm (12 of 40) [23.9.2003 11:40:06]


POSIX Threads Programming

printf("Creating thread %d\n", t);


rc = pthread_create(&threads[t], NULL, PrintHello, (void *)t);
if (rc){
printf("ERROR; return code from pthread_create() is %d\n", rc);
exit(-1);
}
}
pthread_exit(NULL);
}

Thread Management

Passing Arguments to Threads

● The pthread_create() routine permits the programmer to pass one argument to the thread
start routine. For cases where multiple arguments must be passed, this limitation is easily
overcome by creating a structure which contains all of the arguments, and then passing a pointer
to that structure in the pthread_create() routine.

● All arguments must be passed by reference and cast to (void *).

● Questions:

How can you safely pass data to newly created threads, given their non-deterministic start-
up and scheduling?
Answer

● Three examples of argument passing to newly created threads follow:

Thread Argument Passing - Example 1

file:///D|/TEMP-KNJIGE%20OD%20NILE/E-books%2...nco)/linux/POSIX%20Threads%20Programming.htm (13 of 40) [23.9.2003 11:40:06]


POSIX Threads Programming

This code fragment demostrates how to pass a simple integer to each thread. The
calling thread uses a unique data structure for each thread, insuring that each thread's
argument remains intact throughout the program.
int *taskids[NUM_THREADS];

for(t=0;t < NUM_THREADS;t++)


{
taskids[t] = (int *) malloc(sizeof(int));
*taskids[t] = t;
printf("Creating thread %d\n", t);
rc = pthread_create(&threads[t], NULL, PrintHello,
(void *) taskids[t]);
...
}

Thread Argument Passing - Example 2

This example shows how to setup/pass multiple arguments via a structure.


struct thread_data{
int thread_id;
int sum;
char *message;
};

struct thread_data thread_data_array[NUM_THREADS];

void *PrintHello(void *threadarg)


{
struct thread_data *my_data;
...
my_data = (struct thread_data *) threadarg;
taskid = my_data->thread_id;
sum = my_data->sum;
hello_msg = my_data->message;
...
}

int main()
{
...
thread_data_array[t].thread_id = t;

file:///D|/TEMP-KNJIGE%20OD%20NILE/E-books%2...nco)/linux/POSIX%20Threads%20Programming.htm (14 of 40) [23.9.2003 11:40:06]


POSIX Threads Programming

thread_data_array[t].sum = sum;
thread_data_array[t].message = messages[t];
rc = pthread_create(&threads[t], NULL, PrintHello,
(void *) &thread_data_array[t]);
...
}

Thread Argument Passing - Example 3 (Incorrect)

This example performs argument passing incorrectly. The loop which creates threads
modifies the contents of the address passed as an argument, possibly before the
created threads can access it.
int rc, t;

for(t=0;t < NUM_THREADS;t++)


{
printf("Creating thread %d\n", t);
rc = pthread_create(&threads[t], NULL, PrintHello,
(void *) &t);
...
}

Important: threads initially access their data structures in the parent thread's memory space.
That data structure must not be corrupted/modified until the thread has finished accessing
it. The incorrect example above demonstrates what can happen.

Thread Management

Thread Identifiers

file:///D|/TEMP-KNJIGE%20OD%20NILE/E-books%2...nco)/linux/POSIX%20Threads%20Programming.htm (15 of 40) [23.9.2003 11:40:06]


POSIX Threads Programming

● Routines:

pthread_self ()

pthread_equal (thread1,thread2)

● The pthread_self() routine returns the unique, system assigned thread ID of the calling
thread.

● The pthread_equal() routine compares two thread IDs. If the two IDs are different 0 is
returned, otherwise a non-zero value is returned.

● Note that for both of these routines, the thread identifier objects are opaque and can not be easily
inspected. Because thread IDs are opaque objects, the C language equivalence operator == should
not be used to compare two thread IDs against each other, or to compare a single thread ID against
another value.

Thread Management

Joining Threads

● "Joining" is one way to accomplish synchronization between threads. Two other ways, mutexes
and condition variables will be discussed later.

● Routines:

pthread_join (threadid,status)

● The pthread_join() subroutine blocks the calling thread until the specified threadid thread
terminates.

● The programmer is able to obtain the target thread's termination return status if specified through
pthread_exit(), in the status parameter.

file:///D|/TEMP-KNJIGE%20OD%20NILE/E-books%2...nco)/linux/POSIX%20Threads%20Programming.htm (16 of 40) [23.9.2003 11:40:06]


POSIX Threads Programming

● It is impossible to join a detached thread (discussed next)

Thread Management

Detaching / Undetaching Threads

● When a thread is created, one of its possible attributes defines whether or not it may be joined.
Detached means it can not be joined. Undetached means it is joinable.

● Routines:

pthread_attr_init (attr)

pthread_attr_setdetachstate (attr,detachstate)

pthread_attr_getdetachstate (attr,detachstate)

pthread_attr_destroy (attr)

pthread_detach (threadid,status)

● Pthreads may be created in either a detached or undetached state. This is done by using the attr
argument in the pthread_create() routine. The typical use of the attribute involves 4 steps:
1. Declaring a pthread attribute variable of the pthread_attr_t data type
2. Initializing the attribute variable with pthread_attr_init()
3. Setting the attribute detached status with pthread_attr_setdetachstate()
4. Free library resources used by the attribute with pthread_attr_destroy()

● The pthread_detach() routine can be used to explicitly detach a thread even though it was
created as undetached/joinable. There is no converse routine.

● Recommendations:

file:///D|/TEMP-KNJIGE%20OD%20NILE/E-books%2...nco)/linux/POSIX%20Threads%20Programming.htm (17 of 40) [23.9.2003 11:40:06]


POSIX Threads Programming

❍ Consider detaching a thread only if it will never need to synchronize with any other thread.
Detached threads can not be joined. The only benefit for using detached threads might be
some reduction in the pthreads library overhead - some system resources may be freed.

● Implementation notes:

Current implementations differ in the default detached status at thread creation time. The final
standard specifies undetached as the default.

IBM's AIX 4.2 implementation creates threads as unjoinable (detached) by default,


whereas AIX 4.3 creates threads as joinable (undetached).
SGI creates threads as joinable (undetached).

Thread Management

Example: Pthread Joining

● This example demonstrates how to "wait" for thread completions by using the Pthread join
routine. Since not all current implementations of Pthreads create threads in a joinable state, the
threads in this example are explicitly created in an undetached state so that they can be joined
later.

Example Code - Pthread Joining

file:///D|/TEMP-KNJIGE%20OD%20NILE/E-books%2...nco)/linux/POSIX%20Threads%20Programming.htm (18 of 40) [23.9.2003 11:40:06]


POSIX Threads Programming

#include <pthread.h>
#include <stdio.h>
#define NUM_THREADS 3

void *BusyWork(void *null)


{
int i;
double result=0.0;
for (i=0; i < 1000000; i++)
{
result = result + (double)random();
}
printf("result = %d\n",result);
pthread_exit((void *) 0);
}

void main()
{
pthread_t thread[NUM_THREADS];
pthread_attr_t attr;
int rc, t, status;

/* Initialize and set thread detached attribute */


pthread_attr_init(&attr);
pthread_attr_setdetachstate(&attr, PTHREAD_CREATE_UNDETACHED);

for(t=0;t < NUM_THREADS;t++)


{
printf("Creating thread %d\n", t);
rc = pthread_create(&thread[t], &attr, BusyWork, NULL);
if (rc)
{
printf("ERROR; return code from pthread_create()
is %d\n", rc);
exit(-1);
}
}

/* Free attribute and wait for the other threads */


pthread_attr_destroy(&attr);
for(t=0;t < NUM_THREADS;t++)
{
rc = pthread_join(thread[t], (void **)&status);
if (rc)
{
printf("ERROR; return code from pthread_join()
is %d\n", rc);
exit(-1);
}
printf("Completed join with thread %d status= %d\n",t, status);
}

file:///D|/TEMP-KNJIGE%20OD%20NILE/E-books%2...nco)/linux/POSIX%20Threads%20Programming.htm (19 of 40) [23.9.2003 11:40:06]


POSIX Threads Programming

pthread_exit(NULL);
}

Mutex Variables

Overview

● Mutex is an abbreviation for "mutual exclusion". Mutex variables are one of the primary means of
implementing thread synchronization.

● A mutex variable acts like a "lock" protecting access to a shared data resource. The basic concept
of a mutex as used in Pthreads is that only one thread can lock (or own) a mutex variable at any
given time. Thus, even if several threads try to lock a mutex only one thread will be successful.
No other thread can own that mutex until the owning thread unlocks that mutex. Threads must
"take turns" accessing protected data.

● Mutexes can be used to prevent "race" conditions. An example of a race condition involving a
bank transaction is shown below:

Thread 1 Thread 2 Balance

Read balance: $1000 $1000

Read balance: $1000 $1000

Deposit $200 $1000

Deposit $200 $1000

Update balance $1000+$200 $1200

Update balance $1000+$200 $1200

● In the above example, a mutex should be used to lock the "Balance" while a thread is using this

file:///D|/TEMP-KNJIGE%20OD%20NILE/E-books%2...nco)/linux/POSIX%20Threads%20Programming.htm (20 of 40) [23.9.2003 11:40:06]


POSIX Threads Programming

shared data resource.

● Very often the action performed by a thread owning a mutex is the updating of global variables.
This is a safe way to ensure that when several threads update the same variable, the final value is
the same as what it would be if only one thread performed the update. The variables being updated
belong to a "critical section".

● A typical sequence in the use of a mutex is as follows:


❍ Create and initialize a mutex variable

❍ Several threads attempt to lock the mutex

❍ Only one succeeds and that thread owns the mutex

❍ The owner thread performs some set of actions

❍ The owner unlocks the mutex

❍ Another thread acquires the mutex and repeats the process

❍ Finally the mutex is destroyed

● When several threads compete for a mutex, the losers block at that call - an unblocking call is
available with "trylock" instead of the "lock" call.

Mutex Variables

Creating / Destroying Mutexes

● Routines:

pthread_mutex_init (mutex,attr)

pthread_mutex_destroy (mutex)

pthread_mutexattr_init (attr)

pthread_mutexattr_destroy (attr)

● Mutex variables must be declared with type pthread_mutex_t, and must be initialized before

file:///D|/TEMP-KNJIGE%20OD%20NILE/E-books%2...nco)/linux/POSIX%20Threads%20Programming.htm (21 of 40) [23.9.2003 11:40:06]


POSIX Threads Programming

they can be used. There are two ways to initialize a mutex variable:

1. Statically, when it is declared. For example:


pthread_mutex_t mymutex = PTHREAD_MUTEX_INITIALIZER;

2. Dynamically, with the pthread_mutex_init() routine. This method permits setting


mutex object attributes, attr.

The mutex is initially unlocked.

● The attr object is used to establish properties for the mutex object, and must be of type
pthread_mutexattr_t if used (may be specified as NULL to accept defaults). The Pthreads
standard defines three optional mutex attributes:
❍ Protocol: Specifies the protocol used to prevent priority inversions for a mutex.

❍ Prioceiling: Specifies the priority ceiling of a mutex.

❍ Process-shared: Specifies the process sharing of a mutex.

● The pthread_mutexattr_init() and pthread_mutexattr_destroy() routines are


used to create and destroy mutex attribute objects respectively.

● pthread_mutex_destroy() should be used to free a mutex object which is no longer


needed.

● Implementation notes:

Your implementation may/may not provide the three optional mutex attributes.

In AIX 4.2 and 4.3 mutex attributes are not defined. NULL should be used instead.

Mutex Variables

Locking / Unlocking Mutexes

● Routines:

file:///D|/TEMP-KNJIGE%20OD%20NILE/E-books%2...nco)/linux/POSIX%20Threads%20Programming.htm (22 of 40) [23.9.2003 11:40:06]


POSIX Threads Programming

pthread_mutex_lock (mutex)

pthread_mutex_trylock (mutex)

pthread_mutex_unlock (mutex)

● The pthread_mutex_lock() routine is used by a thread to acquire a lock on the specified


mutex variable. If the mutex is already locked by another thread, this call will block the calling
thread until the mutex is unlocked.

● pthread_mutex_trylock() will attempt to lock a mutex. However, if the mutex is already


locked, the routine will return immediately with a "busy" error code. This routine may be useful in
preventing deadlock conditions, as in a priority-inversion situation.

● pthread_mutex_unlock() will unlock a mutex if called by the owning thread. Calling this
routine is required after a thread has completed its use of protected data if other threads are to
acquire the mutex for their work with the protected data. An error will be returned if:
❍ If the mutex was already unlocked

❍ If the mutex is owned by another thread

● Questions:

Mutex contention: when more than one thread is waiting for a locked mutex, which thread
will be granted the lock first after it is released?
Answer

Mutex Variables

Example: Using Mutexes

● This example program illustrates the use of mutex variables in a threads program that performs a
dot product. The main data is made available to all threads through a globally accessible structure.
Each thread works on a different part of the data. The main thread waits for all the threads to
complete their computations, and then it prints the resulting sum.

file:///D|/TEMP-KNJIGE%20OD%20NILE/E-books%2...nco)/linux/POSIX%20Threads%20Programming.htm (23 of 40) [23.9.2003 11:40:06]


POSIX Threads Programming

Example Code - Using Mutexes

#include <pthread.h>
#include <stdio.h>
#include <malloc.h>

/*
The following structure contains the necessary information
to allow the function "dotprod" to access its input data and
place its output into the structure. This structure is
unchanged from the sequential version.
*/

typedef struct
{
double *a;
double *b;
double sum;
int veclen;
} DOTDATA;

/* Define globally accessible variables and a mutex */

#define NUMTHRDS 4
#define VECLEN 100
DOTDATA dotstr;
pthread_t callThd[NUMTHRDS];
pthread_mutex_t mutexsum;

/*
The function dotprod is activated when the thread is created.
All input to this routine is obtained from a structure
of type DOTDATA and all output from this function is written into
this structure. The benefit of this approach is apparent for the
multi-threaded program: when a thread is created we pass a single
argument to the activated function - typically this argument
is a thread number. All the other information required by the
function is accessed from the globally accessible structure.
*/

void *dotprod(void *arg)


{

/* Define and use local variables for convenience */

int i, start, end, offset, len ;


double mysum, *x, *y;
offset = (int)arg;

len = dotstr.veclen;

file:///D|/TEMP-KNJIGE%20OD%20NILE/E-books%2...nco)/linux/POSIX%20Threads%20Programming.htm (24 of 40) [23.9.2003 11:40:06]


POSIX Threads Programming

start = offset*len;
end = start + len;
x = dotstr.a;
y = dotstr.b;

/*
Perform the dot product and assign result
to the appropriate variable in the structure.
*/

mysum = 0;
for (i=start; i < end ; i++)
{
mysum += (x[i] * y[i]);
}

/*
Lock a mutex prior to updating the value in the shared
structure, and unlock it upon updating.
*/
pthread_mutex_lock (&mutexsum);
dotstr.sum += mysum;
pthread_mutex_unlock (&mutexsum);

pthread_exit((void*) 0);
}

/*
The main program creates threads which do all the work and then
print out result upon completion. Before creating the threads,
the input data is created. Since all threads update a shared structure,
we need a mutex for mutual exclusion. The main thread needs to wait for
all threads to complete, it waits for each one of the threads. We specify
a thread attribute value that allow the main thread to join with the
threads it creates. Note also that we free up handles when they are
no longer needed.
*/

void main (int argc, char *argv[])


{
int i;
double *a, *b;
int status;
pthread_attr_t attr;

/* Assign storage and initialize values */


a = (double*) malloc (NUMTHRDS*VECLEN*sizeof(double));
b = (double*) malloc (NUMTHRDS*VECLEN*sizeof(double));

for (i=0; i < VECLEN*NUMTHRDS; i++)


{
a[i]=1;

file:///D|/TEMP-KNJIGE%20OD%20NILE/E-books%2...nco)/linux/POSIX%20Threads%20Programming.htm (25 of 40) [23.9.2003 11:40:06]


POSIX Threads Programming

b[i]=a[i];
}

dotstr.veclen = VECLEN;
dotstr.a = a;
dotstr.b = b;
dotstr.sum=0;

pthread_mutex_init(&mutexsum, NULL);

/* Create threads to perform the dotproduct */


pthread_attr_init(&attr);
pthread_attr_setdetachstate(&attr, PTHREAD_CREATE_UNDETACHED);

for(i=0;i < NUMTHRDS;i++)


{
/*
Each thread works on a different set of data.
The offset is specified by 'i'. The size of
the data for each thread is indicated by VECLEN.
*/
pthread_create( &callThd[i], &attr, dotprod, (void *)i);
}

pthread_attr_destroy(&attr);

/* Wait on the other threads */


for(i=0;i < NUMTHRDS;i++)
{
pthread_join( callThd[i], (void **)&status);
}

/* After joining, print out the results and cleanup */


printf ("Sum = %f \n", dotstr.sum);
free (a);
free (b);
pthread_mutex_destroy(&mutexsum);
pthread_exit(NULL);
}

Serial version

Pthreads version

Condition Variables

file:///D|/TEMP-KNJIGE%20OD%20NILE/E-books%2...nco)/linux/POSIX%20Threads%20Programming.htm (26 of 40) [23.9.2003 11:40:06]


POSIX Threads Programming

Overview

● Condition variables provide yet another way for threads to synchronize. While mutexes implement
synchronization by controlling thread access to data, condition variables allow threads to
synchronize based upon the actual value of data.

● Without condition variables, the programmer would need to have threads continually polling
(possibly in a critical section), to check if the condition is met. This can be very resource
consuming since the thread would be continuously busy in this activity. A condition variable is a
way to achieve the same goal without polling.

● A condition variable is always used in conjunction with a mutex lock.

● A representative sequence for using condition variables is shown below.

Calling Thread

Declare and initialize global data/variables which require synchronization (such as


"count")

Declare and initialize a condition variable object

Declare and initialize an associated mutex

Create threads A and B to do work

Thread A Thread B

❍ Do work up to the point where a certain ❍ Do work


condition must occur (such as "count" must ❍ Lock associated
reach a specified value) mutex
❍ Lock associated mutex and check value of a ❍ Change the value of
global variable the global variable
❍ Call pthread_cond_wait() to perform a that Thread-A is
blocking wait for signal from Thread-B. Note waiting upon.
that a call to pthread_cond_wait() ❍ Check value of the
automatically and atomically unlocks the global Thread-A wait
associated mutex variable so that it can be used variable. If it fulfills
by Thread-B. the desired condition,
❍ When signalled, wake up. Mutex is signal Thread-A.
automatically and atomically locked. ❍ Unlock mutex.
❍ Explicitly unlock mutex ❍ Continue

file:///D|/TEMP-KNJIGE%20OD%20NILE/E-books%2...nco)/linux/POSIX%20Threads%20Programming.htm (27 of 40) [23.9.2003 11:40:06]


POSIX Threads Programming

❍ Continue

Join / Continue

Condition Variables

Creating / Destroying Condition Variables

● Routines:

pthread_cond_init (condition,attr)

pthread_cond_destroy (condition)

pthread_condattr_init (attr)

pthread_condattr_destroy (attr)

● Condition variables must be declared with type pthread_cond_t, and must be initialized
before they can be used. There are two ways to initialize a condition variable:

1. Statically, when it is declared. For example:


pthread_cond_t myconvar = PTHREAD_COND_INITIALIZER;

2. Dynamically, with the pthread_cond_init() routine. The ID of the created


condition variable is returned to the calling thread through the condition parameter. This
method permits setting condition variable object attributes, attr.

● The optional attr object is used to set condition variable attributes. There is only one attribute
defined for condition variables: process-shared, which allows the condition variable to be seen by
threads in other processes. The attribute object, if used, must be of type
pthread_condattr_t (may be specified as NULL to accept defaults).

file:///D|/TEMP-KNJIGE%20OD%20NILE/E-books%2...nco)/linux/POSIX%20Threads%20Programming.htm (28 of 40) [23.9.2003 11:40:06]


POSIX Threads Programming

● The pthread_condattr_init() and pthread_condattr_destroy() routines are


used to create and destroy condition variable attribute objects.

● pthread_cond_destroy() should be used to free a condition variable that is no longer


needed.

● Implementation notes:

Check your implementation for the ability to use the condition variable attributes object.

In AIX 4.2 and 4.3 this option is not implemented and the value "NULL" should be used.

Condition Variables

Waiting / Signalling on Condition Variables

● Routines:

pthread_cond_wait (condition,mutex)

pthread_cond_signal (condition)

pthread_cond_broadcast (condition)

● pthread_cond_wait() blocks the calling thread until the specified condition is signalled.
This routine should be called while mutex is locked, and it will automatically release the mutex
while it waits. Should also unlock mutex after signal has been received.

● The pthread_cond_signal() routine is used to signal (or wake up) another thread which is
waiting on the condition variable. It should be called after mutex is locked, and must unlock mutex
in order for pthread_cond_wait() routine to complete.

● The pthread_cond_broadcast() routine should be used instead of

file:///D|/TEMP-KNJIGE%20OD%20NILE/E-books%2...nco)/linux/POSIX%20Threads%20Programming.htm (29 of 40) [23.9.2003 11:40:06]


POSIX Threads Programming

pthread_cond_signal() if more than one thread is in a blocking wait state.

● It is a logical error to call pthread_cond_signal() before calling


pthread_cond_wait().

Proper locking and unlocking of the associated mutex variable is essential when using these
routines. For example:

● Failing to lock the mutex before calling pthread_cond_wait() may cause it NOT to
block.

● Failing to unlock the mutex after calling pthread_cond_signal() may not allow a
matching pthread_cond_wait() routine to complete (it will remain blocked).

Condition Variables

Example: Using Condition Variables

● This simple example code demonstrates the use of several Pthread condition variable routines. The
main routine creates three threads. Two of the threads perform work and update a "count"
variable. The third thread waits until the count variable reaches a specified value.

Example Code - Using Condition Variables

#include <pthread.h>
#include <stdio.h>

#define NUM_THREADS 3
#define TCOUNT 10
#define COUNT_LIMIT 12

int count = 0;
int thread_ids[3] = {0,1,2};
pthread_mutex_t count_mutex;
pthread_cond_t count_threshold_cv;

void *inc_count(void *idp)


{

file:///D|/TEMP-KNJIGE%20OD%20NILE/E-books%2...nco)/linux/POSIX%20Threads%20Programming.htm (30 of 40) [23.9.2003 11:40:06]


POSIX Threads Programming

int j,i;
double result=0.0;
int *my_id = idp;

for (i=0; i < TCOUNT; i++) {


pthread_mutex_lock(&count_mutex);
count++;

/*
Check the value of count and signal waiting thread when condition is
reached. Note that this occurs while mutex is locked.
*/
if (count == COUNT_LIMIT) {
pthread_cond_signal(&count_threshold_cv);
printf("inc_count(): thread %d, count = %d Threshold reached.\n",
*my_id, count);
}
printf("inc_count(): thread %d, count = %d, unlocking mutex\n",
*my_id, count);
pthread_mutex_unlock(&count_mutex);

/* Do some work so threads can alternate on mutex lock */


for (j=0; j < 1000; j++)
result = result + (double)random();
}
pthread_exit(NULL);
}

void *watch_count(void *idp)


{
int *my_id = idp;

printf("Starting watch_count(): thread %d\n", *my_id);

/*
Lock mutex and wait for signal. Note that the pthread_cond_wait
routine will automatically and atomically unlock mutex while it waits.
Also, note that if COUNT_LIMIT is reached before this routine is run by
the waiting thread, the loop will be skipped to prevent pthread_cond_wait
from never returning.
*/
pthread_mutex_lock(&count_mutex);
while (count < COUNT_LIMIT) {
pthread_cond_wait(&count_threshold_cv, &count_mutex);
printf("watch_count(): thread %d Condition signal
received.\n", *my_id);
}
pthread_mutex_unlock(&count_mutex);
pthread_exit(NULL);
}

void main()

file:///D|/TEMP-KNJIGE%20OD%20NILE/E-books%2...nco)/linux/POSIX%20Threads%20Programming.htm (31 of 40) [23.9.2003 11:40:06]


POSIX Threads Programming

{
int i, rc;
pthread_t threads[3];
pthread_attr_t attr;

/* Initialize mutex and condition variable objects */


pthread_mutex_init(&count_mutex, NULL);
pthread_cond_init (&count_threshold_cv, NULL);

/*
For portability, explicitly create threads in an undetached state
so that they can be joined later.
*/
pthread_attr_init(&attr);
pthread_attr_setdetachstate(&attr, PTHREAD_CREATE_UNDETACHED);
pthread_create(&threads[0], &attr, inc_count, (void *)&thread_ids[0]);
pthread_create(&threads[1], &attr, inc_count, (void *)&thread_ids[1]);
pthread_create(&threads[2], &attr, watch_count, (void *)&thread_ids[2]);

/* Wait for all threads to complete */


for (i = 0; i < NUM_THREADS; i++) {
pthread_join(threads[i], NULL);
}
printf ("Main(): Waited on %d threads. Done.\n", NUM_THREADS);

/* Clean up and exit */


pthread_attr_destroy(&attr);
pthread_mutex_destroy(&count_mutex);
pthread_cond_destroy(&count_threshold_cv);
pthread_exit(NULL);

Pthreads, MPI, SMPs, AIX and IBM's PE

● The primary motivation for considering the use of Pthreads within an MPI application on an IBM
SMP node is to achieve maximum communication performance and CPU utilization.

● For an MPI-only application, the most efficient use of the CPUs in an SMP node is achieved by
running at least one (and usually only one) MPI task on each CPU. For an IBM SP, this means

file:///D|/TEMP-KNJIGE%20OD%20NILE/E-books%2...nco)/linux/POSIX%20Threads%20Programming.htm (32 of 40) [23.9.2003 11:40:06]


POSIX Threads Programming

there are multiple MPI tasks on an SP node - usually one per CPU.

● However, communications between MPI-only tasks on an IBM SMP node are not always
optimal!

❍ In versions of the Parallel Environment software before 2.4, it was impossible for more
than one task at a time to use User Space (fast) communications on a node. Version 2.4
solves this problem, allowing up to 4 User Space tasks per node (one per 604e CPU). But...

❍ MPI intraprocessor communication bandwidth is less than interprocessor communication


bandwidth. That is, two tasks using User Space protocol on the same node communicate
slower than two tasks on different nodes.

Comparison of MPI Communications on Various IBM SMPs

Number of Per Task Per Task


SMP Processor MPI Bandwidth Bandwidth
Type Tasks Per US Protocol IP Protocol
Node (MB/sec) (MB/sec)
1 86 32
604e 332 MHz
2 67 30
(4 cpus) 1
4 32 26
POWER3 200 MHz 1 139 n/a
(2 cpus) 2 2 70 n/a
1 111 48
POWER3 222 MHz 2 64 55
(8 cpus) 1 4 33 51
8 n/a 46

❍ IBM's most recent versions of AIX and Parallel Environment software, running on
POWER3 SMP hardware provide the highest MPI-only communications bandwidth. When
the MP_SHARED_MEMORY (currently undocumented) environment variable is set to
"yes", intra-node communications are done using node shared memory instead of the SP
switch.

■ User Space intra-node task bandwidth = @230 MB/sec

file:///D|/TEMP-KNJIGE%20OD%20NILE/E-books%2...nco)/linux/POSIX%20Threads%20Programming.htm (33 of 40) [23.9.2003 11:40:06]


POSIX Threads Programming

■ IP intra-node task bandwidth = @223 MB/sec

● Using Pthreads to replace MPI on-node interprocessor communications may improve an


application's performance. The potential "bandwidth" of using pointer/global variable
"communications" in shared memory with pthreads, far exceeds any MPI-only communication
bandwidths (no memory copy operations), being limited only by memory/CPU bandwidth.

❍ 604e memory-cpu bus speed: 1.3 GB/sec

❍ POWER3 memory-cpu bus speed: 1.6 GB/sec

● An example of how pthreads performs better than MPI for same-node communications is shown
below. In this test, the MPI code passed one integer messages between two processes 100,000
times. The pthreads code used two threads, each of which performed 100,000 concurrent updates
to a mutex protected integer global variable. All results are in seconds using User Space
communications.

MPI MPI
IBM SMP MPI intra-node
inter- intra- pthreads
Architecture MP_SHARED_MEMORY=yes
node node

604e 5.66 9.00 n/a 0.11


332 MHz
4 CPUs

POWER3 5.60 7.97 3.98 0.08


222 MHz
8 CPUs

mpi_vs_thread.txt

● Compiling with threads: Use the following compiler commands as appropriate to insure thread-
safe/enabled executables:

Compiler
Description
Command

xlc_r
C pthreads compiler with default language level of ANSI
cc_r

xlf_r Fortran compiler with IBM Pthreads API (non-portable).

file:///D|/TEMP-KNJIGE%20OD%20NILE/E-books%2...nco)/linux/POSIX%20Threads%20Programming.htm (34 of 40) [23.9.2003 11:40:06]


POSIX Threads Programming

xlf90_r Fortran 90 pthreads compiler

xlC_r C++ pthreads compiler

mpcc_r C MPI-pthreads compiler script

mpxlf_r Fortran MPI-pthreads compiler script

mpCC_r C++ MPI-pthreads compiler script

● An example code, which uses both MPI and Pthreads on an IBM SMP system is available below.
The serial, threads-only, MPI-only and MPI-with-threads versions are all available for comparison
and review.
❍ Serial

❍ Pthreads only

❍ MPI only

❍ MPI with pthreads

❍ makefile

● Other Considerations:

❍ The AIX implementation of Pthreads does not provide all of the optional features cited in
the POSIX standard, such as mutex and condition variable attributes.

❍ Depending upon the version of AIX, threads are created in a either "detached" mode (AIX
4.2) or "undetached" mode (AIX 4.3). For older verions of AIX, threads must be explicitly
created "undetached" if they are to be joined later. See /usr/include/pthreads.h for more
AIX implementation details

Pthread Library Routines Reference

Pthread Functions

Thread pthread_create
Management
pthread_exit

file:///D|/TEMP-KNJIGE%20OD%20NILE/E-books%2...nco)/linux/POSIX%20Threads%20Programming.htm (35 of 40) [23.9.2003 11:40:06]


POSIX Threads Programming

pthread_join

pthread_once

pthread_kill

pthread_self

pthread_equal

pthread_yield

pthread_detach

Thread Specific pthread_key_create


Data
pthread_key_delete

pthread_getspecific

pthread_setspecific

Thread pthread_cancel
Cancellation
pthread_cleanup_pop

pthread_cleanup_push

pthread_setcancelstate

pthread_getcancelstate

pthread_testcancel

Thread pthread_getschedparam
Scheduling
pthread_setschedparam

Signals pthread_sigmask

Pthread Attribute Functions

Basic pthread_attr_init
Management
pthread_attr_destroy

file:///D|/TEMP-KNJIGE%20OD%20NILE/E-books%2...nco)/linux/POSIX%20Threads%20Programming.htm (36 of 40) [23.9.2003 11:40:06]


POSIX Threads Programming

Detachable or pthread_attr_setdetachstate
Joinable
pthread_attr_getdetachstate

Specifying Stack pthread_attr_getstackaddr


Information
pthread_attr_getstacksize

pthread_attr_setstackaddr

pthread_attr_setstacksize

Thread pthread_attr_getschedparam
Scheduling
Attributes pthread_attr_setschedparam

pthread_attr_getschedpolicy

pthread_attr_setschedpolicy

pthread_attr_setinheritsched

pthread_attr_getinheritsched

pthread_attr_setscope

pthread_attr_getscope

Mutex Functions

Mutex pthread_mutex_init
Management
pthread_mutex_destroy

pthread_mutex_lock

pthread_mutex_unlock

pthread_mutex_trylock

Priority pthread_mutex_setprioceiling
Management
pthread_mutex_getprioceiling

file:///D|/TEMP-KNJIGE%20OD%20NILE/E-books%2...nco)/linux/POSIX%20Threads%20Programming.htm (37 of 40) [23.9.2003 11:40:06]


POSIX Threads Programming

Mutex Attribute Functions

Basic pthread_mutexattr_init
Management
pthread_mutexattr_destroy

Sharing pthread_mutexattr_getpshared

pthread_mutexattr_setpshared

Protocol pthread_mutexattr_getprotocol
Attributes
pthread_mutexattr_setprotocol

Priority pthread_mutexattr_setprioceiling
Management
pthread_mutexattr_getprioceiling

Condition Variable Functions

Basic pthread_cond_init
Management
pthread_cond_destroy

pthread_cond_signal

pthread_cond_broadcast

pthread_cond_wait

pthread_cond_timedwait

Condition Variable Attribute Functions

Basic pthread_condattr_init
Management
pthread_condattr_destroy

Sharing pthread_condattr_getpshared

pthread_condattr_setpshared

file:///D|/TEMP-KNJIGE%20OD%20NILE/E-books%2...nco)/linux/POSIX%20Threads%20Programming.htm (38 of 40) [23.9.2003 11:40:06]


POSIX Threads Programming

References and More Information

● "Pthreads Programming". B. Nichols et al. O'Reilly and Associates.

● "Threads Primer". B. Lewis and D. Berg. Prentice Hall

● "Programming With POSIX Threads". D. Butenhoff. Addison Wesley

● "Programming With Threads". S. Kleiman et al. Prentice Hall

● XL Fortran for AIX Language Reference, Version 5 Release 1. Pages 547-568 describe the IBM
Fortran API for Pthreads. Example IBM Fortran Pthread program available here.

Notes

1 Timing results were obtained on LLNL production (604e) and testing (POWER3 8-cpu) systems
using point-to-point, non-blocking send-receive pairs. These figures are approximations for what
to expect with coupled send-receives.
2 Source: "SP Switch Performance" whitepaper, IBM Corp. August 1999. No information on
which MPI routines were used or how they were used.

file:///D|/TEMP-KNJIGE%20OD%20NILE/E-books%2...nco)/linux/POSIX%20Threads%20Programming.htm (39 of 40) [23.9.2003 11:40:06]


POSIX Threads Programming

file:///D|/TEMP-KNJIGE%20OD%20NILE/E-books%2...nco)/linux/POSIX%20Threads%20Programming.htm (40 of 40) [23.9.2003 11:40:06]

You might also like