Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

...

Please visit the following website to monitor real-time system status.

Code Block
languagebash
titleHAL Real-time System Monitor
https://hal-monitor.ncsa.illinois.edu:3000/

...

Connect to HAL cluster

There are two 2 methods to log on to the HAL system. The first method is to SSH via terminal,

Code Block
languagebash
titleSSH
ssh <username>@hal.ncsa.illinois.edu

...

Submit Interactive Job to HAL

There are 2 ways to submit interactive jobs to the HAL system. The first one is to use the Slurm Wrapper Suite,

Code Block
languagebash
swrun -p gpux1

and the second method is to submit with Slurm directly.

Code Block
languagebash


srun --partition=gpux1 --pty --nodes=1 --ntasks-per-node=12 \
  --cores-per-socket=3 --threads-per-core=4 --sockets-per-node=1 \
  --gres=gpu:v100:1 --mem-per-cpu=1500 --time=2:00:00 \
  --wait=0 --export=ALL /bin/bash

Submit Batch Job to HAL

There are 2 ways to submit batch jobs to the HAL system. The first one is to use the Slurm Wrapper Suite,

Code Block
languagebash
swbatch run_script.swb

sbatch run_script.sb

...

ManufacturerSoftware PackageVersion
IBMRedHat Linux7.6
NVidiaCUDA10.1.105
NVidiaPGI Compiler19.4
IBMAdvance Toolchain12.0
IBMXLC/XLF16.1.1
IBMPowerAI1.6.1
SchedMDSlurm19.05.2
OSCOpen OnDemand1.6.20

Job Management with Slurm

...