Differences
This shows you the differences between two versions of the page.
Both sides previous revision Previous revision | Next revisionBoth sides next revision | ||
doku:monitoring [2022/06/22 16:53] – rewrite in markdown msiegel | doku:monitoring [2022/06/23 08:30] – added gpu load msiegel | ||
---|---|---|---|
Line 111: | Line 111: | ||
// compile: mpiicc -qopenmp -o ompMpiCoreIds ompMpiCoreIds.c -lhwloc | // compile: mpiicc -qopenmp -o ompMpiCoreIds ompMpiCoreIds.c -lhwloc | ||
+ | ``` | ||
+ | |||
+ | ## GPU Load | ||
+ | |||
+ | We assume you program uses a GPU, and your program runs as expected, | ||
+ | so could it be faster? On the same node where your job runs (see CPU | ||
+ | load section), maybe in a new terminal, type `watch nvidia-smi`, | ||
+ | start a simple task manager for the graphics card. `watch` just | ||
+ | repeats a command every 2 seconds, acts as a live monitor for the | ||
+ | GPU. In our example below the GPU utilisation is around 80% the most | ||
+ | time, which is very good already. | ||
+ | |||
+ | ``` | ||
+ | Every 2.0s: nvidia-smi | ||
+ | Wed Jun 22 16:42:52 2022 | ||
+ | +-----------------------------------------------------------------------------+ | ||
+ | | NVIDIA-SMI 460.32.03 | ||
+ | |-------------------------------+----------------------+----------------------+ | ||
+ | | GPU Name Persistence-M| Bus-Id | ||
+ | | Fan Temp Perf Pwr: | ||
+ | | | ||
+ | |===============================+======================+======================| | ||
+ | | | ||
+ | | 36% | ||
+ | | | ||
+ | +-------------------------------+----------------------+----------------------+ | ||
+ | |||
+ | +-----------------------------------------------------------------------------+ | ||
+ | | Processes: | ||
+ | | GPU | ||
+ | | ID | ||
+ | |=============================================================================| | ||
+ | | 0 | ||
+ | +-----------------------------------------------------------------------------+ | ||
``` | ``` | ||