====== Documentation for VSC-1 ======
===== Troubleshooting =====
Solutions for common problems [[doku:troubleshooting|can be found under Troubleshooting]].
===== VSC Newsletter =====
*[[doku:newsletter20100610|Newsletter 10.6.2010]]
*[[doku:newsletter20101011|Newsletter 11.10.2010]]
===== User Documentation =====
*[[vsc1#Quick Start|Quick Start]]
*[[doku:advanced_sge|Advanced SGE topics]]
*[[software_vsc1|Available software packages on VSC-1]]
*[[doku:epilog|Copying back data from a temp. directory after a job is terminated: user defined epilog]]
*[[doku:mathematica|Notes on running Mathematica jobs]]
*[[doku:matlab|Example for Matlab jobs]]
*[[doku:paraview|Visualization software]]
*[[doku:gaussian09|Notes on Gaussian09]]
*[[vsc1#Cluster Monitoring: Ganglia|Cluster Monitoring: Ganglia]]
*[[vsc1#Acknowledgement in publications|Acknowledgement in publications]]
*[[doku:totalview-tutorial|Totalview: debugging software for parallel codes]]
*[[doku:gpu|GPU CUDA: documentation]]
*[[doku:mpi_tuning|MPI TUNING]]
*[[doku:xeonphi|Using Intel Xeon Phi]]
==== Quick Start ====
- Log in to your university's designated login server via SSH: \\ # Uni Wien
ssh <username>@vsc.univie.ac.at
# TU Wien
ssh <username>@vsc.tuwien.ac.at
# Boku Wien
ssh <username>@vsc.boku.ac.at
- Transfer your programs and data/input files to your home directory.
- Compile (or recompile) your application. Please use the latest Intel MPI environment, as described in [[doku:mpi|MPI Environment]].
- Write a job script for your application: \\ #$ -N <job_name>
#$ -pe mpich <slots>
#$ -V
mpirun -m $TMPDIR/machines -np $NSLOTS <executable>
where "<job_name>" is a freely chosen descriptive name and "<slots>" is the number of processor cores that you want to use for the calculation. To ensure that the compute nodes are reserved exclusively for your job, "<slots>" must be a multiple of 8. Substitute the path to your MPI-enabled application for "<executable>" and you are ready to run!\\ To receive email notifications about job events (b .. beginning, e .. end, a .. abort or reschedule, s .. suspend), use these lines in your job script: \\ #$ -M <email_address>
#$ -m beas # all job events sent via email
It is often advisable to also specify the job's runtime as #$ -l h_rt=hh:mm:ss
in particular when you know that your job will run for only a few hours or even minutes. The scheduler can then "backfill" such a job into gaps in the queue, which avoids the very long waiting times that can occur while a highly parallel job is waiting for free resources. \\ Here is an example job script that requests 32 processor cores, runs for a maximum of 3 hours, and sends emails at the beginning and at the end of the job:
#$ -N hitchhiker
#$ -pe mpich 32
#$ -V
#$ -M my.name@example.at
#$ -m be
#$ -l h_rt=03:00:00
mpirun -m $TMPDIR/machines -np $NSLOTS ./myjob
- Submit your job:\\ qsub <job_script>
where "<job_script>" is the name of the file you just created.
- Check if and where your job has been scheduled:\\ qstat
- Inspect the job output. Assuming your job was assigned the id "42" and your job's name was "hitchhiker", you should be able to find the following files in the directory you started it from: \\ $ ls -l
hitchhiker.o42
hitchhiker.e42
hitchhiker.po42
hitchhiker.pe42
In this example, hitchhiker.o42 contains the output of your job and hitchhiker.e42 contains any error messages. hitchhiker.po42 and hitchhiker.pe42 may contain additional information related to the parallel computing environment.
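The job script rules from the steps above (a core count that is a multiple of 8, a runtime in the hh:mm:ss form, and the naming of the output files) can be sketched as a few small shell helpers. This is only an illustration: the function names ''round_up_slots'', ''valid_h_rt'', ''make_job_script'' and ''job_output_files'' are hypothetical and are not part of SGE or of the VSC installation, and the runtime check is assumed to be stricter than what SGE itself may accept.

```shell
#!/bin/sh
# Hypothetical helpers for illustration only; not part of SGE or VSC tooling.

# Round a requested core count up to the next multiple of 8,
# so that whole compute nodes are reserved for the job.
round_up_slots() {
    echo $(( ($1 + 7) / 8 * 8 ))
}

# Succeed only if the argument has the hh:mm:ss form used for h_rt above.
valid_h_rt() {
    printf '%s\n' "$1" | grep -Eq '^[0-9]+:[0-5][0-9]:[0-5][0-9]$'
}

# Print a job script to stdout.
# Arguments: job name, requested cores, runtime (hh:mm:ss), executable path.
make_job_script() {
    if ! valid_h_rt "$3"; then
        echo "invalid runtime: $3" >&2
        return 1
    fi
    # \$ keeps "$" literal so that SGE, not this helper, expands the variables.
    cat <<EOF
#\$ -N $1
#\$ -pe mpich $(round_up_slots "$2")
#\$ -V
#\$ -l h_rt=$3
mpirun -m \$TMPDIR/machines -np \$NSLOTS $4
EOF
}

# List the output files to expect for a given job name and job id.
job_output_files() {
    for suffix in o e po pe; do
        echo "${1}.${suffix}${2}"
    done
}
```

For example, ''make_job_script hitchhiker 30 03:00:00 ./myjob > job.sh'' would write a script that requests 32 slots, which could then be submitted with ''qsub job.sh''. Rounding up rather than down is a design choice of this sketch: it never gives the job fewer cores than requested.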
==== Cluster Monitoring: Ganglia ====
Ganglia is available via:
http://ganglia.vsc.tuwien.ac.at
==== Acknowledgement in publications ====
For publications based on calculations performed on the VSC, the following acknowledgement can be used:
- "Die präsentierten Rechenergebnisse wurden [zum Teil] am Vienna Scientific Cluster (VSC) erzielt."
- "The computational results presented have been achieved [in part] using the Vienna Scientific Cluster (VSC)."