Skip to content
Snippets Groups Projects
6_unix_processes.md 14.5 KiB
Newer Older
Laurent Modolo's avatar
Laurent Modolo committed
---
title: Unix Processes


---

# Unix Processes

Laurent Modolo's avatar
Laurent Modolo committed
[![cc_by_sa](./cc_by_sa.png)](http://creativecommons.org/licenses/by-sa/4.0/)
Laurent Modolo's avatar
Laurent Modolo committed

Objective: Understand how process works in GNU/Linux

Laurent Modolo's avatar
Laurent Modolo committed
A program is a list of instructions to be executed on a computer. These instructions are written in one or many files in a format readable by the computer (binary files), or interpretable by the computer (text file). The interpretable format needs to be processed by an interpreter who is in binary format.
Laurent Modolo's avatar
Laurent Modolo committed

The execution of a program on the system is represented as one or many processes. The program is the file of instruction while the process is the instructions being read.

`mkdir` is the program, when you type `mkdir my_folder`, you launch an `mkdir` process.

Your shell is a process to manipulate other processes.

> In multitasking operating systems, processes (running programs) need a way to create new processes, e.g. to run other programs. [Fork](https://en.wikipedia.org/wiki/Fork_(system_call)) and its  variants are typically the only way of doing so in Unix-like systems.  For a process to start the execution of a different program, it first  forks to create a copy of itself. Then, the copy, called the "[child process](https://en.wikipedia.org/wiki/Child_process)", calls the [exec](https://en.wikipedia.org/wiki/Exec_(system_call)) system call to overlay itself with the other program: it ceases execution of its former program in favor of the other.

Laurent Modolo's avatar
Laurent Modolo committed
Some commands in your shell don’t have an associated process, for example there is no `cd` program, it’s a functionality of your shell. The `cd` command tell your `bash` process to do something not to fork another process.
Laurent Modolo's avatar
Laurent Modolo committed
## Process attributes

- PID : the **p**rocess **id**entifier is an integer, at a given time each PID is unique to a process
- PPID : the **p**arent **p**rocess **id**entifier is the PID of the process that has stared the current process
Laurent Modolo's avatar
Laurent Modolo committed
- UID : the **u**ser **id**entifier is the identifier of the user that has started the process, except for SE Linux, the process will have th same right as the user launching it.
- PGID : the **p**rocess **g**roup **id**entifier (like users, processes have groups)
Laurent Modolo's avatar
Laurent Modolo committed

Laurent Modolo's avatar
Laurent Modolo committed
You can use the command `ps` to see the processes launched by your user
Laurent Modolo's avatar
Laurent Modolo committed

```sh
ps
```

Laurent Modolo's avatar
Laurent Modolo committed
Like for the command `ls` you can use the switch `-l` to have more details.
Laurent Modolo's avatar
Laurent Modolo committed

Laurent Modolo's avatar
Laurent Modolo committed
Open another tab in your browser to log again on your VM. Keep this tab open, we are going to use both of them in this session.
Laurent Modolo's avatar
Laurent Modolo committed

Laurent Modolo's avatar
Laurent Modolo committed
In this new tab, you are going to launch a `less` process.
Laurent Modolo's avatar
Laurent Modolo committed

```sh
less .bashrc
```

Comme back, to your first tab, can you see your `less` process with the command `ps` ?

Laurent Modolo's avatar
Laurent Modolo committed
The `ps` option `-u [login]` list all the processes with `UID`the `UID`associated with `[login]`
Laurent Modolo's avatar
Laurent Modolo committed

```sh
ps -l -u etudiant
```

Is the number of `bash` processes consistent with the number of tabs you openned ?

What is the PPID of the `less` process ? Can you identify in which `bash` process `less` is running ?

Did you launch the `systemd` and `(sd-pam)` process ?

Laurent Modolo's avatar
Laurent Modolo committed
`pam` stand for [**p**luggable **a**uthentication **m**odules](https://www.linux.com/news/understanding-pam/), it’s a collection of tools that handle identification and resource access restriction. From the PPID of the `(sd-pam)` can you find which process launched `(sd-pam)` ? What is the PPID of this process ?
Laurent Modolo's avatar
Laurent Modolo committed

Laurent Modolo's avatar
Laurent Modolo committed
The option `-C` allows you to filter process by name
Laurent Modolo's avatar
Laurent Modolo committed

```sh
ps -l -C systemd
```

Who launched the first `systemd` process ?

Laurent Modolo's avatar
Laurent Modolo committed
## Processes tree

Laurent Modolo's avatar
Laurent Modolo committed
From PPID to PPID, you can guess that like the file system, processes are organized in a tree. The command `pstree` can give you a nice representation of this tree.

Laurent Modolo's avatar
Laurent Modolo committed
The following `ps` command shows information on the process with PID 1
Laurent Modolo's avatar
Laurent Modolo committed

```sh
ps -l -1
```

Laurent Modolo's avatar
Laurent Modolo committed
Is this output coherent with what you know on PID and the previous `ps` command ? Can you look at the corresponding program (with the command `which`) ?
Laurent Modolo's avatar
Laurent Modolo committed

Laurent Modolo's avatar
Laurent Modolo committed
Can you look for information on PID 0 ?
Laurent Modolo's avatar
Laurent Modolo committed

The root of the processes tree is the PID 1.

Laurent Modolo's avatar
Laurent Modolo committed
What is the UID of the `dockerd` process, can you guess why we were able to gain `sudo` access in the previous section by using a `docker` command ?

Laurent Modolo's avatar
Laurent Modolo committed
`ps` give you a static snapshot of the processes but processes are dynamic. To see them running you can use the command `top`. While `top` is functional, most system have `htop` with a more accessible interface. You can test `top` and `htop`.
Laurent Modolo's avatar
Laurent Modolo committed

Laurent Modolo's avatar
Laurent Modolo committed
Like `ps` you can use `-u etduiant` with htop to only display your user processes.
Laurent Modolo's avatar
Laurent Modolo committed

Laurent Modolo's avatar
Laurent Modolo committed
With the `F6` key, you can change the column on which to sort your process.
Laurent Modolo's avatar
Laurent Modolo committed

- Which process is consuming the most of CPU ?
- Which process is consuming the most of memory ?

Laurent Modolo's avatar
Laurent Modolo committed
What is the difference between `M_SIZE` (`VIRT` column), `M_RESIDENT` (`RES` column) and `M_SHARE` (`SHR` column) ? To which value, the column `MEM%` corresponds to ?
Laurent Modolo's avatar
Laurent Modolo committed

-  `M_SIZE` : The  total  amount  of  virtual  memory  used  by the task.  It  includes all code, data and shared libraries  plus  pages  that  have  been  swapped out and pages that have been mapped but not  used (If an application requests 1 GB of memory but uses only 1 MB, then `VIRT` will report 1 GB).
Laurent Modolo's avatar
Laurent Modolo committed
-  `M_RESIDENT` :  what’s currently in the physical memory. This does not include the swapped out memory and some of the memory may be shared with other processes (If a process uses 1 GB of memory and it calls `fork()`, the result of forking will be two processes whose `RES` is both 1 GB but only 1 GB will actually be used since Linux uses copy-on-write).
Laurent Modolo's avatar
Laurent Modolo committed
- `M_SHARE` : The amount of shared memory used by a task. It simply reflects memory that could be potentially shared with other processes.

Wait what is swapped out memory ?

> Linux divides its physical RAM (random access memory) into chucks of  memory called pages. Swapping is the process whereby a page of memory is copied to the preconfigured space on the hard disk, called swap space,  to free up that page of memory. The combined sizes of the physical  memory and the swap space is the amount of virtual memory available.

Laurent Modolo's avatar
Laurent Modolo committed
And as you HDD (even your fast SSD) is way slower than your RAM, when you run out of RAM and the system start to swap out memory, things will start to go really slowly on your computer. Generally, you want to avoid swapping. The swap space is often a dedicated partition in the *Linux_swap* format.
Laurent Modolo's avatar
Laurent Modolo committed

From the `htop` command, what is the size of the swap space on your VM ?

Laurent Modolo's avatar
Laurent Modolo committed
You have control over all the process launched with your UID. To test this control we are going to use the well-named command `stress`. Check the **man**ual of the `stress` command.
Laurent Modolo's avatar
Laurent Modolo committed
Launch the `stress` for 1 cpu and 3600 second.
Laurent Modolo's avatar
Laurent Modolo committed
You don’t have a prompt, it means that the last command (`stress`) is running.
Laurent Modolo's avatar
Laurent Modolo committed
## Terminate
Laurent Modolo's avatar
Laurent Modolo committed
Instead of taking a nap and come back at the end of this session, we may want to interrupt this command. The first way to do that is to ask the system to terminate the `stress` process.
Laurent Modolo's avatar
Laurent Modolo committed
From your terminal you can press `ctrl` + `c`. This short cut terminates the current process, it works everywhere except for programs like `man` or `less` which can be closed with the key `q`.
Laurent Modolo's avatar
Laurent Modolo committed
Launch another long `stress` process and switch to your other terminal tab and list your active process.
Laurent Modolo's avatar
Laurent Modolo committed

```sh
ps -l -u etudiant
```

Laurent Modolo's avatar
Laurent Modolo committed
You ask `stress` to launch a worker using 100% of one cpu (you can also see that with `htop`). You can see that the `stress` process you launched (with the PPID of your `bash`) forked another `stress` process.

Another way to terminate a process is with the command `kill`. `kill` is used to send a signal to a process with the command:
Laurent Modolo's avatar
Laurent Modolo committed

```sh
kill -15 PID
```

Laurent Modolo's avatar
Laurent Modolo committed
The `-15` option is the default option for `kill` so you can also write `kill PID`.

Laurent Modolo's avatar
Laurent Modolo committed
We can do the same thing as with the command `ctrl` + `c`: ask nicely the process to terminate itself. The `-15` signal is called the SIGTERM.

Laurent Modolo's avatar
Laurent Modolo committed
>  On rare occasions a buggy process will not be able to listen to signals anymore. The signal `-9` will kill a process (not nicely at all). The `-9` signal is called the SIGKILL signal. There are 64 different signals that you can send with the `kill` command.
Laurent Modolo's avatar
Laurent Modolo committed
Use the `kill` command to terminate the worker process of your stress command. Go to the other tab where stress was running. What is the difference with your previous `ctrl` + `c` ?

In your current terminal type the `bash` command, nothing happens. You have a shell within a shell. Launch a long `stress` command and switch to the other tab.
Laurent Modolo's avatar
Laurent Modolo committed

You can use the `ps` command to check that `sleep` is running within a `bash` within a `bash`

```sh
ps -l --forest -u etudiant
```

Nicely terminate the intermediate `bash`. What happened ?

Try not nicely. What happened ?

Laurent Modolo's avatar
Laurent Modolo committed
A process with a PPID of 1 is called a **daemon**, daemons are processes that run in the background. Congratulations you created your first daemon.

Kill the remaining `stress `processes with the command `pkill`. You can check the **man**ual on how to do that.
Laurent Modolo's avatar
Laurent Modolo committed
## Suspend
Laurent Modolo's avatar
Laurent Modolo committed

Launch `htop` then press `ctrl` + `z`. What happened ? 

```sh
ps -l -u etudiant
```

The manual of the `ps` command say the following about process state

> ```
>                D    uninterruptible sleep (usually IO)                                                                                                                                                                                                         
>                I    Idle kernel thread                                                                                                                                                                                                                         
>                R    running or runnable (on run queue)                                                                                                                                                                                                         
>                S    interruptible sleep (waiting for an event to complete)                                                                                                                                                                                     
>                T    stopped by job control signal                                                                                                                                                                                                              
>                t    stopped by debugger during the tracing                                                                                                                                                                                                     
>                W    paging (not valid since the 2.6.xx kernel)                                                                                                                                                                                                 
>                X    dead (should never be seen)                                                                                                                                                                                                                
>                Z    defunct ("zombie") process, terminated but not reaped by its parent                                                                                                                                                                        
> ```

Laurent Modolo's avatar
Laurent Modolo committed
You can use the command `fg` to put `htop` in the **f**ore**g**round.
Laurent Modolo's avatar
Laurent Modolo committed
Close `htop` and type two time de following command (`-i 1` is for simulating **io** **i**nput **o**utput operations) :
Laurent Modolo's avatar
Laurent Modolo committed

```sh
Laurent Modolo's avatar
Laurent Modolo committed
stress -i 1 -t 3600 &
Laurent Modolo's avatar
Laurent Modolo committed
Type the command `jobs`. What do you see ? You can specify which `stress` process you want to bring to the foreground with the command `fg %N` with `N` the number of the job.
Laurent Modolo's avatar
Laurent Modolo committed
Bring the 2nd `htop` to the foreground. Put it back to the background with `ctrl` + `z`. What is now the differences between your two `stress` processes ?

The command `bg` allow you to resume a job stopped in the background. You can restart your stopped `stress` process with this command. You can use the `kill %N` syntax to kill your two `stress` processes.

## Priority

We have seen that we can launch a `stress `process to use 100% of a cpu. Launch two `stress` process like that in the background.

What happened ? What can you see with the `htop` command ?

> In Linux the [Scheduler](https://en.wikipedia.org/wiki/Completely_Fair_Scheduler), is a system process that manages the order of execution of the task by the CPU(s). Linux and most Unix systems are also multiprocesses OS which means that your OS is constantly switching between process that has access to the CPU(s). From a universal Turing machine point of view, the head of the machine would be constantly switching back and forth on the tape.

You are working on a computer with a graphical interface, think about the processes drawing your windows, the processes reading and rendering your mouse, checking for your mail, loading and rendering your web pages, reading your keystrokes to send them back over the network to the NSA. The scheduler of your OS has to jungle between everything’s without losing anything (don’t be hard on windows OS).

The **nice** state of a process indicates it’s priority for the scheduler. **nice** value range from -20 (the highest priority) to 19 (the lowest priority). The default **nice** value is 0. The command `renice` allows you to change the **nice** value of a process:

```sh
renice -n N -p PID
```

 With `N` the **nice** value.

Use `renice` to set the **nice** value of the first `stress` process worker to 19. Use the command `htop` to check the result.

Can we increase the difference between the two processes ? Use re renice command to set the **nice** value of the second `stress` process worker to -20. What happened ?

Only the *root* user can lower the **nice** value of a process. You can also start start a new process with a given **nice** value with the command `nice`:

```sh
nice -n 10 stress -c 1 -t 3600 &
```

Without root access you can only set value higher than 0.

> We have seen the commands:
>
> - `ps` to display processes
> - `pstree` to display a tree of processes
> - `which` to display the PATH of a program
> - `top`/`htop` for a dynamic view of processes
> - `stress` to stress your system
> - `kill`/`pkill` to stop a process
> - `fg` to bring to the foreground a background processes
> - `jobs` to display background processes
> - `bg` to start a background process
> - `stress` to launch mock computation
> - `nice`/`renince` to change the nice value of a process

Laurent Modolo's avatar
Laurent Modolo committed
[To learn how to articulate processes you can head to the next section.](https://http://perso.ens-lyon.fr/laurent.modolo/unix/7_streams_and_pipes.html)