Differences
This shows you the differences between two versions of the page.
pandoc:introduction-to-mul-cluster:01_introduction:04_slurm [2018/02/02 10:51] – Pandoc Auto-commit pandoc | pandoc:introduction-to-mul-cluster:01_introduction:04_slurm [2020/10/20 08:09] (current) – Pandoc Auto-commit pandoc | ||
---|---|---|---|
Line 289: | Line 289: | ||
===== SLURM ===== | ===== SLURM ===== | ||
- | ==== Interactive jobs (2) ==== | + | ==== Interactive jobs (2) (exercise) ==== |
Alternatively the '' | Alternatively the '' | ||
< | < | ||
- | salloc -N 1 -J test -p E5-2690v4 --qos E5-2690v4-batch | + | salloc -N 1 -J test -p E5-2690v4 --qos E5-2690v4-batch |
</ | </ | ||
+ | Then find out where your job is running: | ||
+ | |||
+ | < | ||
+ | squeue -u < | ||
+ | </ | ||
+ | or | ||
+ | |||
+ | < | ||
+ | srun hostname | ||
+ | </ | ||
+ | and connect to it: | ||
+ | |||
+ | < | ||
+ | ssh < | ||
+ | </ | ||
+ | |||
+ | ---- | ||
+ | |||
+ | ===== SLURM ===== | ||
+ | |||
+ | ==== Interactive jobs (2) (exercise) ==== | ||
+ | |||
+ | To get direct interactive access to a compute node, try: | ||
+ | |||
+ | < | ||
+ | salloc -N 1 -J test -p E5-2690v4 --qos E5-2690v4-batch --mem=10G | ||
+ | </ | ||
+ | |||
---- | ---- | ||
Line 309: | Line 337: | ||
#SBATCH --mem=2G | #SBATCH --mem=2G | ||
</ | </ | ||
- | The cores and the requested memory are then exclusively assigned to the processes of this job via cgroups. | + | The cores and the requested memory are then exclusively assigned to the processes of this job via cgroups. |
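A minimal sketch of a batch script that makes such a request. The partition, QOS and `--mem=2G` are taken from the examples on this page; the job name, `--cpus-per-task=2` and the helper function `report_job_resources` are illustrative assumptions, not part of the original slides. The `SLURM_*` variables are only set inside a running job, so the function prints `unset` when run outside one.

```shell
#!/bin/bash
#SBATCH -J memtest                 # job name (assumed for illustration)
#SBATCH -N 1                       # one node
#SBATCH --ntasks=1                 # one task
#SBATCH --cpus-per-task=2          # cores for that task (assumed value)
#SBATCH --mem=2G                   # memory per node, as in the example above
#SBATCH -p E5-2690v4               # partition from the salloc example
#SBATCH --qos E5-2690v4-batch      # QOS from the salloc example

# Hypothetical helper: print what the job was actually given.
report_job_resources() {
    echo "host: $(hostname)"
    echo "cpus: ${SLURM_CPUS_PER_TASK:-unset}"
    echo "mem_per_node: ${SLURM_MEM_PER_NODE:-unset}"
}

report_job_resources
```

Saved as, say, `memtest.sh`, the script would be submitted with `sbatch memtest.sh`; the three lines then appear in the job's output file.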
+ | |||
+ | |||
+ | ---- | ||
+ | |||
+ | ===== SLURM: memory ===== | ||
+ | |||
+ | * you **have to** specify memory | ||
+ | * SLURM does not accept your job without a memory specification | ||
+ | * choose the right amount of memory: | ||
+ |   * not too little | ||
+ |   * not too much | ||
+ | * too **little** memory: | ||
+ |   * could make the job very slow because of swapping | ||
+ |   * could crash the job (experienced with Abaqus) | ||
+ | * too **much** memory: | ||
+ |   * does not hurt performance and does not kill your job | ||
+ |   * but it costs you more of your fair share | ||
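One way to judge whether a request was neither too little nor too much is to compare the requested memory with the peak usage of a finished job. The sketch below assumes that job accounting is enabled on the cluster so that `sacct` returns data; the function name `show_job_memory` is hypothetical, and the job ID has to be supplied by the user.

```shell
#!/bin/bash
# Hypothetical helper: show requested vs. peak memory for a finished job.
# Usage: show_job_memory <jobid>
show_job_memory() {
    local jobid=$1
    if ! command -v sacct >/dev/null 2>&1; then
        echo "sacct not found (not on a Slurm cluster?)" >&2
        return 1
    fi
    # ReqMem = requested memory, MaxRSS = peak resident memory per job step
    sacct -j "$jobid" --format=JobID,JobName,ReqMem,MaxRSS,Elapsed,State
}
```

If `MaxRSS` stays far below `ReqMem` over several runs, the request can probably be lowered to save fair share; if it comes close, leave some headroom.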
+ | |||
+ | |||
+ | ---- | ||
+ | |||
+ | ===== SLURM: memory ===== | ||
+ | |||
+ | ==== why have this annoying feature anyway? ==== | ||
+ | |||
+ | * because of shared usage of nodes | ||
+ | * if nodes were used **only exclusively**, this would not be necessary | ||
Line 404: | Line 459: | ||
---- | ---- | ||
+ | |||