Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

...

Once on the compute node, load PowerAI module using one of these:

Code Block
module load poweraiwmlce/1.6.01-py2.7 # for python2 environment
module load poweraiwmlce/1.6.01-py3.6 # for python3 environment
module load poweraiwmlce             # python3 environment by default

...

Get node for interactive use:

Code Block
srunswrun --partition=debug --pty --nodes=1 --ntasks-per-node=8 --gres=gpu:v100:1 -t 01:30:00 --wait=0 --export=ALL /bin/bashp gpux1

Once on the compute node, load PowerAI module using one of these:

Code Block
module load poweraiwmlce/1.6.01-py2.7 # for python2 environment
module load poweraiwmlce/1.6.01-py3.6 # for python3 environment
module load wmlce  powerai           # python3 environment by default

...

Get node for interactive use:

Code Block
srunswrun --partition=debug --pty --nodes=1 --ntasks-per-node=8 --gres=gpu:v100:1 -t 01:30:00 --wait=0 --export=ALL /bin/bashp gpux1

Once on the compute node, load PowerAI module using one of these:

Code Block
module load poweraiwmlce/1.6.01-py2.7 # for python2 environment
module load poweraiwmlce/1.6.01-py3.6 # for python3 environment
module load wmlce powerai            # python3 environment by default

...

Get node for interactive use:

Code Block
srunswrun --partition=debug --pty --nodes=1 --ntasks-per-node=8 --gres=gpu:v100:1 -t 01:30:00 --wait=0 --export=ALL /bin/bashp gpux1

Once on the compute node, load PowerAI module using one of these:

Code Block
module load poweraiwmlce/1.6.01-py2.7 # for python2 environment
module load poweraiwmlce/1.6.01-py3.6 # for python3 environment
module load wmlce  powerai           # python3 environment by default

...

After job completed the TensorFlow log files can be found in "~/tensorflow/mnist/logs", start the TensorBoard server on login node:

Code Block
module load poweraiwmlce
tensorboard --logdir ~/tensorflow/mnist/logs/ --port [user_pick_port] # please use random number within [6500-6999]

...

Get node for interactive use:

Code Block
srunswrun --partition=debug --pty --nodes=1 --ntasks-per-node=8 --gres=gpu:v100:1 -t 01:30:00 --wait=0 --export=ALL /bin/bashp gpux1

Once on the compute node, load PowerAI module using one of these:

Code Block
module load poweraiwmlce/1.6.01-py2.7 # for python2 environment
module load poweraiwmlce/1.6.01-py3.6 # for python3 environment
module load poweraiwmlce             # python3 environment by default

...