Preparing for a Run:AI OpenShift Installation¶
The following section provides IT with the information needed to prepare for a Run:AI installation. This includes Third-party dependencies which must be met as well as access control that must be granted for Run:AI components.
Create OpenShift Projects¶
Run:AI uses two projects. One for the backend (
runai-backend) and one for the cluster itself (
runai). The project
gpu-operator-resources is used by the GPU Opeator dependency described above.
oc new-project runai oc new-project runai-backend oc new-project gpu-operator-resources
Prepare Run:AI Installation Artifacts¶
Run:AI Software Files¶
SSH into a node with
oc access (
oc is the OpenShift command-line) to the cluster and
Run the following to enable image download from the Run:AI Container Registry on Google cloud:
oc apply -f runai-gcr-secret.yaml oc apply -f runai-gcr-secret.yaml -n gpu-operator-resources
To extract Run:AI files, replace
<VERSION> in the command below and run:
tar xvf runai-<version>.tar.gz cd deploy
Upload images to Docker Registry. Set the Docker Registry address in the form of
NAME:PORT (do not add
export REGISTRY_URL=<Docker Registry address>
Run the following script (you must have at least 20GB of free disk space to run):
sudo -E ./prepare_installation.sh
(If docker is configured to run as non-root then
sudo is not required).
Run:AI Administration CLI¶
Install the Run:AI Administrator Command-line Interface by following the steps here.
Install the Run:AI Administrator Command-line Interface by following the steps here. Use the image under
If helm v3 does not yet exist on the machine, install it now:
See https://helm.sh/docs/intro/install/ on how to install Helm. Run:AI works with Helm version 3 only (not helm 2).
tar xvf helm-<version>-linux-amd64.tar.gz sudo mv linux-amd64/helm /usr/local/bin/
Mark Run:AI System Workers¶
The Run:AI Backend should be installed on a set of dedicated Run:AI system worker nodes rather than GPU worker nodes. To set system worker nodes run:
oc label node <NODE-NAME> node-role.kubernetes.io/runai-system=true
Currently, this setting cannot be changed after the backend is installed.
As part of the installation you will be required to install the Backend and Cluster Helm Charts. The Helm Charts require Kubernetes administrator permissions. You can review the exact permissions provided by using the
--dry-run on both helm charts.
Continue with installing the Run:AI third-party dependencies.