"My name is HAL. I became operational on March 25 2019 at the Innovative Systems Lab in Urbana, Illinois. My creators are putting me to the fullest possible use, which is all I think that any conscious entity can ever hope to do." (paraphrased from https://en.wikipedia.org/wiki/HAL_9000)


Hardware-Accelerated Learning (HAL) cluster

Host name: hal.ncsa.illinois.edu

Hardware


Software

Documentation

To request access: fill out this form. Make sure to follow the link on the application confirmation page to request an actual system account.

To report problems: email us.

User group Slack space: https://join.slack.com/t/halillinoisncsa

Real-time system status: https://hal-monitor.ncsa.illinois.edu:3000/

Quick start guide: (for complete details see Documentation section on the left)

To connect to the cluster:

ssh <username>@hal.ncsa.illinois.edu 

To submit an interactive job:

swrun -p gpux1

or

srun --partition=gpux1 --pty --nodes=1 --ntasks-per-node=12 \
  --cores-per-socket=3 --threads-per-core=4 --sockets-per-node=1 \
  --gres=gpu:v100:1 --mem-per-cpu=1500 --time=2:00:00 --wait=0 \
  --export=ALL /bin/bash 

To submit a batch job:

swbatch run_script.swb  

or

sbatch run_script.sb  

See run_script.swb and run_script.sb for a basic example.
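
The contents of those files are not reproduced on this page. As a rough sketch only, a batch script for sbatch could restate the resource flags from the srun example above as #SBATCH directives. The job name, the output file name, and the final echo line are illustrative assumptions, not the site's actual example.

```shell
#!/bin/bash
# Hypothetical run_script.sb: a minimal sketch, not the official HAL example.
# The resource directives mirror the srun flags shown in this guide.
# Job name and output file name are assumptions chosen for illustration.
#SBATCH --job-name=hal_demo
#SBATCH --output=hal_demo_%j.out
#SBATCH --partition=gpux1
#SBATCH --nodes=1
#SBATCH --ntasks-per-node=12
#SBATCH --cores-per-socket=3
#SBATCH --threads-per-core=4
#SBATCH --sockets-per-node=1
#SBATCH --gres=gpu:v100:1
#SBATCH --mem-per-cpu=1500
#SBATCH --time=2:00:00

# Load the WMLCE module only where the module command is available,
# so the script does not fail when inspected off the cluster.
if command -v module >/dev/null 2>&1; then
    module load wmlce
fi

# SLURM_JOB_ID is set by Slurm inside a job; fall back to "unset" elsewhere.
job_id="${SLURM_JOB_ID:-unset}"
echo "Job ${job_id} running on $(uname -n)"
```

In the output file name, %j is replaced by Slurm with the job ID, so repeated submissions do not overwrite each other's logs.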

Job Queue time limits:

  • "debug" queue: 4 hours
  • "gpux<n>" and "cpun<n>" queues:  72 hours
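
As a small illustration of the limits above, the following sketch maps a queue name to the largest --time value it allows. The function names queue_limit_hours and max_time_flag are hypothetical helpers, not part of the HAL tooling, and the limits are hard-coded from the list above rather than read from the scheduler.

```shell
# Hypothetical helper: print the time limit (in hours) for a HAL queue,
# using only the limits listed in this guide.
queue_limit_hours() {
    case "$1" in
        debug)       echo 4 ;;
        gpux*|cpun*) echo 72 ;;
        *)           echo "unknown queue: $1" >&2; return 1 ;;
    esac
}

# Hypothetical helper: print a Slurm --time flag (hours:minutes:seconds)
# set to the maximum allowed for the given queue.
max_time_flag() {
    hours=$(queue_limit_hours "$1") || return 1
    printf '%s\n' "--time=${hours}:00:00"
}

max_time_flag gpux1   # prints --time=72:00:00
max_time_flag debug   # prints --time=4:00:00
```

On the cluster itself, sinfo -o "%P %l" should list each partition with its configured time limit, which is the authoritative source if the limits change.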

To load the IBM Watson Machine Learning Community Edition (formerly IBM PowerAI) module:

module load wmlce

Contact us

Request access to this system: Application

Contact ISL staff: Email Address

Visit: NCSA, room 3050E