Differences

doku:monitoring [2022/06/22 16:53] – rewrite in markdown msiegel
doku:monitoring [2022/06/23 08:30] – added gpu load msiegel
Line 111:
  
 //  compile: mpiicc -qopenmp -o ompMpiCoreIds ompMpiCoreIds.c -lhwloc
 +```
 +
 +## GPU Load
 +
 +We assume your program uses a GPU and runs as expected, so could it
 +be faster? On the node where your job runs (see the CPU load
 +section), open a new terminal and type `watch nvidia-smi` to start a
 +simple task manager for the graphics card. By default, `watch`
 +repeats a command every 2 seconds, so it acts as a live monitor for
 +the GPU. In the example below, GPU utilisation is around 80% most of
 +the time, which is already very good.
 +
 +```
 +Every 2.0s: nvidia-smi                                 Wed Jun 22 16:42:52 2022
 +Wed Jun 22 16:42:52 2022
 ++-----------------------------------------------------------------------------+
 +| NVIDIA-SMI 460.32.03    Driver Version: 460.32.03    CUDA Version: 11.2     |
 +|-------------------------------+----------------------+----------------------+
 +| GPU  Name        Persistence-M| Bus-Id        Disp.A | Volatile Uncorr. ECC |
 +| Fan  Temp  Perf  Pwr:Usage/Cap|         Memory-Usage | GPU-Util  Compute M. |
 +|                               |                      |               MIG M. |
 +|===============================+======================+======================|
 +|   0  GeForce GTX 1080    Off  | 00000000:02:00.0 Off |                  N/A |
 +| 36%   59C    P2   112W / 180W |    161MiB /  8119MiB |     83%      Default |
 +|                               |                      |                  N/A |
 ++-------------------------------+----------------------+----------------------+
 +
 ++-----------------------------------------------------------------------------+
 +| Processes:                                                                  |
 +|  GPU   GI   CI        PID   Type   Process name                  GPU Memory |
 +|        ID   ID                                                   Usage      |
 +|=============================================================================|
 +|    0   N/A  N/A     21045      C   gmx_mpi                           159MiB |
 ++-----------------------------------------------------------------------------+
 ```
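
If you want to log the GPU load instead of watching it, the same numbers can be read in machine-readable form. The following is only a minimal sketch, not part of the original page: it assumes Python 3 is available on the compute node and uses the `--query-gpu` and `--format=csv,noheader,nounits` options of `nvidia-smi`. The function names `parse_gpu_line` and `query_gpus` are hypothetical helpers chosen for this illustration.

```python
import subprocess

# Fields requested from nvidia-smi, in this order.
QUERY = "utilization.gpu,memory.used,memory.total"


def _to_int(field):
    """Convert a CSV field to int; return None for values such as '[N/A]'."""
    field = field.strip()
    return int(field) if field.isdigit() else None


def parse_gpu_line(line):
    """Parse one CSV line such as '83, 161, 8119' into a dict (hypothetical helper)."""
    util, used, total = (_to_int(f) for f in line.split(","))
    return {"util_percent": util, "mem_used_mib": used, "mem_total_mib": total}


def query_gpus():
    """Run nvidia-smi once and return one dict per GPU (hypothetical helper)."""
    result = subprocess.run(
        ["nvidia-smi", f"--query-gpu={QUERY}", "--format=csv,noheader,nounits"],
        capture_output=True,
        text=True,
        check=True,  # raise if nvidia-smi exits with an error
    )
    return [parse_gpu_line(l) for l in result.stdout.splitlines() if l.strip()]
```

Calling `query_gpus()` on the node where the job runs returns one entry per GPU. Parsing is kept in a separate function so that it can be checked without a GPU: for the values shown in the example above, `parse_gpu_line("83, 161, 8119")` gives a utilisation of 83 and a memory usage of 161 out of 8119 MiB.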
  
  • doku/monitoring.txt
  • Last modified: 2023/03/14 12:56
  • by goldenberg