You are viewing an old version of this page. View the current version.

Compare with Current View Page History

« Previous Version 70 Next »

HAL is currently operating in reduced capacity. Only 4 nodes are available.

ML/DL training sessions are on Wednesdays.  Schedule can be found here.

"My name is HAL. I became operational on March 25 2019 at the Innovative Systems Lab in Urbana, Illinois. My creators are putting me to the fullest possible use, which is all I think that any conscious entity can ever hope to do." (paraphrazed from https://en.wikipedia.org/wiki/HAL_9000)


Hardware-Accelerated Learning (HAL) cluster

Host name: hal.ncsa.illinois.edu

Hardware


Software

Documentation

To request access: fill out this form. Make sure to follow the link on the application confirmation page to request actual system account.

To report problems: email us.

User group Slack space: http://go.illinois.edu/hal

Real-time system status: https://hal-monitor.ncsa.illinois.edu:3000/

Quick start guide: (for complete details see Documentation section on the left)

To connect to the cluster:

ssh <username>@hal.ncsa.illinois.edu 

To submit interactive job:

swrun -p gpux1

or

srun --partition=gpux1 --pty --nodes=1 --ntasks-per-node=12 \
  --cores-per-socket=3 --threads-per-core=4 --sockets-per-node=1 \
  --gres=gpu:v100:1 --mem-per-cpu=1500 --time=2:00:00 --wait=0 --export=ALL /bin/bash 

To submit a batch job:

swbatch run_script.swb  

or

sbatch run_script.sb  

See run_script.swb and run_script.sb for a basic example.

Job Queue time limits:

  • "debug" queue: 4 hours
  • "gpux<n>" and "cpun<n>" queues:  72 hours

To load IBM Watson Machine Learning Community Edition (former IBM PowerAI) module:

module load wmlce

Main -> Systems -> HAL

Contact us

Request access to this system: Application

Contact ISL staff: Email Address

Visit: NCSA, room 3050E


  • No labels