====== Quick start guide for VSC-5 ======
  
===== Connecting =====
  
  
==== New SPACK without environments ====
  
Having worked with spack environments for some time, we encountered several severe issues that convinced us to find a more practical way of maintaining software packages at VSC.
  
There are now three separate spack installation trees corresponding to the CPU/GPU architectures on VSC:
  
  * skylake - Intel CPUs; works on Intel Skylake and Cascadelake CPUs
  * zen - AMD CPUs; works on Zen 2 and 3 CPUs
  * cuda-zen - AMD CPUs + NVIDIA GPUs; works on all nodes equipped with graphics cards
  
By default, the spack installation tree suitable for the current compute or login node is activated; the active tree is indicated by a **prefix** on the command line, e.g.:
  
<code>
zen [user@l51 ~]$
</code>
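
The usual spack commands then operate on this active tree; a short sketch (''gcc'' is only an example package name, use whatever ''spack find'' actually lists):

<code>
$ spack find -l         # list packages installed in the active tree
$ spack find -l gcc     # search for a specific package
$ spack load gcc        # load a package into the current shell
</code>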
  
Read more about SPACK at:
  * [[doku:spack-transition | Transition to new SPACK without Environments]]
  * [[doku:spack]]
  * [[https://spack.readthedocs.io/en/latest/basic_usage.html|Official documentation of SPACK]]
  
==== Load a module ====
  
Please always use spack, see [[doku:spack|SPACK - a package manager for HPC systems]].

Of course, ''module'' commands still work, e.g. ''module avail xyz''.
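
For example (''xyz'' stands for any module name; use ''module avail'' to see what is actually available):

<code>
$ module avail          # list modules compatible with the active tree
$ module load xyz       # load a module by name
</code>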
  
  
===== Compile code =====
  
A program needs to be compiled on the hardware it will later run on.
128 physical cores (core-id 0-127) and 256 virtual cores available.
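
A minimal sketch, assuming GCC is provided by the active software tree: building directly on a node of the target partition lets the compiler optimize for the local CPU:

<code>
$ gcc -O3 -march=native -o my_program my_program.c
</code>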
  
The A100 GPU nodes have 512GB RAM and the two NVIDIA A100 cards have 40GB RAM each.
60 A100 nodes are installed.

The A40 GPU nodes have 256GB RAM and the two NVIDIA A40 cards have 46GB each.
45 A40 nodes are installed.
<code>
$ nvidia-smi
</code>
  
===== SLURM =====

For the partition/queue setup see [[doku:vsc5_queue|Queue | Partition setup on VSC-5]].
Type ''sinfo -o %P'' to see the available partitions.
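
An illustrative sketch of the output (partition names taken from the job examples below; the exact list may differ):

<code>
$ sinfo -o %P
PARTITION
zen3_0512*
zen3_0512_a100x2
zen3_1024
zen3_2048
cascadelake_0384
</code>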
  
==== Submit a Job ====

To submit a job to the default partition:

<file>
#!/bin/sh
#SBATCH -J <meaningful name for job>
#SBATCH -N 1
./my_program
</file>
  
This will submit a job in the default partition (zen3_0512) using the default QoS (zen3_0512).
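
Assuming the script above is saved as ''job.sh'' (the file name is just an example), it is submitted with:

<code>
$ sbatch job.sh
</code>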
  
To submit a job to the cascadelake nodes:

<file>
#!/bin/sh
#SBATCH -J <meaningful name for job>
#SBATCH -N 1
#SBATCH --partition=cascadelake_0384
#SBATCH --qos cascadelake_0384
./my_program
</file>

To explicitly request the default zen3_0512 partition and QoS:

<file>
#!/bin/sh
#SBATCH -J <meaningful name for job>
#SBATCH -N 1
#SBATCH --partition=zen3_0512
#SBATCH --qos zen3_0512
./my_program
</file>

To submit a job to the zen3_1024 nodes (1TB RAM):

<file>
#!/bin/sh
#SBATCH -J <meaningful name for job>
#SBATCH -N 1
#SBATCH --partition=zen3_1024
#SBATCH --qos zen3_1024
./my_program
</file>

To submit a job to the zen3_2048 nodes (2TB RAM):

<file>
#!/bin/sh
#SBATCH -J <meaningful name for job>
#SBATCH -N 1
#SBATCH --partition=zen3_2048
#SBATCH --qos zen3_2048
./my_program
</file>

To submit a job to the GPU nodes, using both GPUs of a node:

<file>
#!/bin/sh
#SBATCH -J <meaningful name for job>
#SBATCH -N 1
#SBATCH --partition=zen3_0512_a100x2
#SBATCH --qos zen3_0512_a100x2
#SBATCH --gres=gpu:2
./my_program
</file>

To request only one of the two GPUs:

<file>
#!/bin/sh
#SBATCH -J <meaningful name for job>
#SBATCH --partition=zen3_0512_a100x2
#SBATCH --qos zen3_0512_a100x2
#SBATCH --gres=gpu:1
./my_program
</file>
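
After submitting, standard SLURM commands show and control the job, for example:

<code>
$ squeue -u $USER      # list your queued and running jobs
$ scancel <job_id>     # cancel a job by its id
</code>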