Containers on Polaris
Polaris, powered by NVIDIA A100 GPUs, benefits from container-based workloads for seamless compatibility across NVIDIA systems. This guide details the use of containers on Polaris, including custom container creation, large-scale execution, and common pitfalls.
Apptainer Setup
Polaris employs Apptainer (formerly known as Singularity) for container management. To set up Apptainer, run:
ml use /soft/modulefiles
ml load spack-pe-base/0.8.1
ml load apptainer
ml load e2fsprogs
apptainer version #1.2.2
The Apptainer version on Polaris is 1.2.2. Detailed user documentation is available here
Building from Docker or Argonne GitHub Container Registry
Containers on Polaris can be built by writing Dockerfiles on a local machine and then publish the container to DockerHub, or by directly building them on ALCF compute node by writing an Apptainer recipe file. If you prefer to use existing containers, you can pull them from various registries like DockerHub and run them on Polaris.
Since Docker requires root privileges, which users do not have on Polaris, existing Docker containers must be converted to Apptainer. To build a Docker-based container on Polaris, use the following as an example:
qsub -I -A <Project> -q debug -l select=1 -l walltime=01:00:00 -l filesystems=home:eagle -l singularity_fakeroot=true
export HTTP_PROXY=http://proxy.alcf.anl.gov:3128
export HTTPS_PROXY=http://proxy.alcf.anl.gov:3128
export http_proxy=http://proxy.alcf.anl.gov:3128
export https_proxy=http://proxy.alcf.anl.gov:3128
ml use /soft/modulefiles
ml load spack-pe-base/0.8.1
ml load apptainer
ml load e2fsprogs
apptainer build --fakeroot pytorch:22.06-py3.sing docker://nvcr.io/nvidia/pytorch:22.06-py3
Note: Currently container build and executions are only supported on the Polaris compute nodes
Running Containers on Polaris
To run a container on Polaris you can use the submission script described here. Below is an explanation of the job submission script.
#!/bin/sh
#PBS -l select=2:system=polaris
#PBS -q debug
#PBS -l place=scatter
#PBS -l walltime=0:30:00
#PBS -l filesystems=home:eagle
#PBS -A <project_name>
cd ${PBS_O_WORKDIR}
echo $CONTAINER
We move to current working directory and enable network access at run time by setting the proxy. We also load apptainer.
# SET proxy for internet access
ml use /soft/modulefiles
ml load spack-pe-base/0.8.1
ml load apptainer
ml load e2fsprogs
export HTTP_PROXY=http://proxy.alcf.anl.gov:3128
export HTTPS_PROXY=http://proxy.alcf.anl.gov:3128
export http_proxy=http://proxy.alcf.anl.gov:3128
export https_proxy=http://proxy.alcf.anl.gov:3128
This is important for system (Polaris - Cray) mpich to bind to containers mpich. Set the following environment variables
ADDITIONAL_PATH=/opt/cray/pe/pals/1.2.12/lib
module load cray-mpich-abi
export APPTAINERENV_LD_LIBRARY_PATH="$CRAY_LD_LIBRARY_PATH:$LD_LIBRARY_PATH:$ADDITIONAL_PATH"
Set the number of ranks per node spread as per your scaling requirements
# MPI example w/ 16 MPI ranks per node spread evenly across cores
NODES=`wc -l < $PBS_NODEFILE`
PPN=16
PROCS=$((NODES * PPN))
echo "NUM_OF_NODES= ${NODES} TOTAL_NUM_RANKS= ${PROCS} RANKS_PER_NODE= ${PPN}"
Finally launch your script
echo C++ MPI
mpiexec -hostfile $PBS_NODEFILE -n $PROCS -ppn $PPN apptainer exec -B /opt -B /var/run/palsd/ $CONTAINER /usr/source/mpi_hello_world
echo Python MPI
mpiexec -hostfile $PBS_NODEFILE -n $PROCS -ppn $PPN apptainer exec -B /opt -B /var/run/palsd/ $CONTAINER python3 /usr/source/mpi_hello_world.py
The job can be submitted using:
Recipe-Based Container Building
As mentioned earlier, you can build Apptainer containers from recipe files. Instructions are available here. See available containers for more recipes.
Note: You can also build custom recipes by bootstrapping from prebuilt images. For e.g the first two lines in a recipe to use our custom TensorFlow implementation would be
Bootstrap: oras
followed byFrom: ghcr.io/argonne-lcf/tf2-mpich-nvidia-gpu:latest
Available containers
If you just want to know what containers are available, here you go.
-
Examples for running MPICH containers can be found here
-
Examples for running databases can be found here
-
For using shpc - that allows for running containers as modules. It can be found here
The latest containers are updated periodically. If you have trouble using containers, or request a newer or a different container please contact ALCF support at support@alcf.anl.gov
.
Troubleshooting Common Issues
Permission Denied Error: If you encounter permission errors during the build
-
Check your quota and delete any unnecessary files.
-
Clean up Apptainer cache,
~/.apptainer/cache
, and set the Apptainer tmp and cache directories as below. If your home directory is full and if you are building your container on a compute node, then set the tmpdir and cachedir to local scratch -
Make sure you are not on a directory accessed with a symbolic link, i.e. check if
pwd
andpwd -P
returns the same path. -
If any of the above doesn't work, try running the build in your home directory.
Mapping to rank 0 on all nodes: Ensure that the container's MPI aligns with the system MPI. For e.g. follow the additional steps outlined in the container registry documentation for MPI on Polaris
libmpi.so.40 not found: This can happen if the container's application has an OpenMPI dependency which is not currently supported on Polaris. It can also spring up if the container's base environment is not a Debian-based architecture such as Ubuntu. Ensure the application has an MPICH implementation as well. Also try removing .conda/
, .cache/
, and .local/
folders from your home directory and rebuilding the container.
Disabled Port mapping, user namespace and [network virtualization] Network virtualization is disabled for the container due to security constraints. See issue #2533