Please fill out the following form. Make sure to follow the link on the application confirmation page to request an actual system account.
https://forms.illinois.edu/sec/6587313 |
Please submit a tech-support ticket to the admin team.
help+isl@ncsa.illinois.edu |
Please join HAL slack user group.
https://join.slack.com/t/halillinoisncsa |
Please visit the following website to monitor real-time system status.
https://hal-monitor.ncsa.illinois.edu:3000/ |
There are 2 methods to log on to the HAL system. The first method is to SSH via terminal,
ssh <username>@hal.ncsa.illinois.edu |
and the second method is to visit the HAL OnDemand webpage.
https://hal.ncsa.illinois.edu:8888 |
There are 2 ways to submit interactive jobs to the HAL system. The first one is to use the Slurm Wrapper Suite,
swrun -p gpux1 |
and the second method is to submit with Slurm directly.
srun --partition=gpux1 --pty --nodes=1 --ntasks-per-node=12 \ --cores-per-socket=3 --threads-per-core=4 --sockets-per-node=1 \ --gres=gpu:v100:1 --mem-per-cpu=1500 --time=2:00:00 \ --wait=0 --export=ALL /bin/bash |
There are 2 ways to submit batch jobs to the HAL system. The first one is to use the Slurm Wrapper Suite,
swbatch run_script.swb |
The run_script.swb example
#!/bin/bash #SBATCH --job-name="hostname" #SBATCH --output="hostname.%j.%N.out" #SBATCH --error="hostname.%j.%N.err" #SBATCH --partition=gpux1 srun /bin/hostname # this is our "application" |
and the second method is to submit with Slurm directly.
swbatch run_script.sb |
The run_script.sb example
#!/bin/bash #SBATCH --job-name="hostname" #SBATCH --output="hostname.%j.%N.out" #SBATCH --error="hostname.%j.%N.err" #SBATCH --partition=gpux1 #SBATCH --nodes=1 #SBATCH --ntasks-per-node=1 #SBATCH --export=ALL #SBATCH -t 00:10:00 srun /bin/hostname # this is our "application" |
Login node
Login Node | IBM | 9006-12P | 1x |
---|---|---|---|
CPU | IBM | POWER9 16 Cores | 2x |
Network | Mellanox | 2 Ports EDR InfiniBand | 1x |
Compute node
Compute Node | IBM | AC922 8335-GTH | 16x |
---|---|---|---|
CPU | IBM | POWER9 20 Cores | 2x |
GPU | NVidia | V100 16GB Memory | 4x |
Network | Mellanox | 2 Ports EDR InfiniBand | 1x |
Storage Node
Storage Node | IBM | 9006-22P | 1x |
---|---|---|---|
CPU | IBM | POWER9 20 Cores | 2x |
Storage | WD | NFS | 1x |
Network | Mellanox | 2 Ports EDR InfiniBand | 1x |
Manufacturer | Software Package | Version |
---|---|---|
IBM | RedHat Linux | 7.6 |
NVidia | CUDA | 10.1.105 |
NVidia | PGI Compiler | 19.4 |
IBM | Advance Toolchain | 12.0 |
IBM | XLC/XLF | 16.1.1 |
IBM | PowerAI | 1.6.1 |
SchedMD | Slurm | 19.05.2 |
OSC | Open OnDemand | 1.6.20 |