# VSC Wiki

## Systems

Decomissioned

Decomissioned

### Parallel computing

##### VSC-3:
doku:vsc3_storage

## VSC-3 Storage

This article is about the $GLOBAL and $HOME filesystems of VSC-3. If you are searching for info about the bioinformatics storage, the article can be found here.

VSC-3 provides three facilities for persisting data: the high-performance BeeGFS Parallel Filesystem (former Fraunhofer Parallel Filesystem, FhGFS), the Network File System (NFS) and a node-local ramdisk. They are accessible under:

• NFS: $HOME which expands to /home/lv<project>/<username> • BeeGFS (former FhGFS): • $GLOBAL expands to /fhgfs/global/lv<project>/<username>,
• $SCRATCH to /fhgfs/<node> (node local) • Scratch RAM Disk $TMPDIR </tmp>

### Usage ''$HOME'' $HOME is the location of the user UNIX home directory. It can be accessed from login and compute nodes. $HOME can be used to hold results, settings, source code etc. - data for which high concurrent job throughput and support for large file sizes is not required. Conversely, the parallel BeeGFS filesystem (see below) should utilized to persist temporary data in compute runs. Backup of $HOME is user responsibility.

$HOME is provided from file servers with disk arrays that are exported over the network file system (NFS). Even on highly scaled storage such on VSC-3, the number of concurrent file operations is bound by spinning disk physics: small file (write) operations can easily saturate capacity. Hence, please mind that $HOME is a shared resource over all projects on a given NFS server. In case your project requires persistence over a large number of small files please contact VSC administration in advance.

### Scratch Space Usage: ''\$GLOBAL'' and ''\$SCRATCH''

The BeeGFS parallel file system (former FhGFS) on VSC-3 provides a large (initially approx. 0.5 PB) scratch space. The environment variable $GLOBAL expands to: $ echo $GLOBAL /global/lv70999/username The directory is writeable as user and readable by the group members. It is advisable to make use of these directories in particular for jobs with heavy I/O operations. In addition it will reduce the load on the fileserver holding the$HOME directories.

The BeeGFS (former Fraunhofer parallel file system) is shared by all users and by all nodes. Single jobs producing heavy load (»1000 requests per second) have been observed to reduce responsiveness for all jobs and all users.

Lifetime of data is limited, see table below.

#### Per-node Scratch Directories ''$SCRATCH'' Local scratch directories on each node are provided as a link to the BeeGFS parallel file system and can thus be viewed also via the login nodes as '/fhgfs/nodelocal/rXXnXX/'. The parallel file system (and thus the performance) is identical between$SCRATCH and $GLOBAL. The variable $SCRATCH expands to:

$echo$SCRATCH
/scratch

These directories are purged after job execution.

### Fair Use

The storage resources underlying NFS and BeeGFS (former FhGFS) are shared. Please utilize BeeGFS primarily for large I/O intensive runs. The number of files per run or per project is not hard limited. Yet, it is strongly discouraged to create/operate on O(10E5) and above number of files. If millions of (small) files are required for a code, please contact system operation in advance as performance impact on other users can occur.

### Support

Parallel filesystems used in large scale computing are unlike desktop file systems. Contact VSC staff in planning for high I/O computation. Also, VSC can support architecting one-time and recurrent large ingress-egress data pipelines, recurrent large data transfer workflows, and support optimizing codes for parallel I/O.

### Backup Policy

Backup of user files independent of location is solely the responsibility of each user.

VSC-3 NFS and BeeGFS (former FhGFS) servers utilize RAID-6 that can sustain up to 2 disks failing concurrently. The data path is otherwise not redundant. Data loss may also occur due to failure modes including, but not limited to natural disaster, cooling failure, disk controller failure and filesystem software faults.

User data on VSC-2 is not backuped.