Skip to content

Whats New 2020

December 28th, 2020

It is now possible to allocate a specific amount of GPU memory rather than use the fraction syntax. Use --gpu-memory=5G.

December 15th, 2020

Project and Departments can now be set to not allocate resources beyond the assigned GPUs. This is useful for budget-conscious Projects/Departments.

December 1st, 2020

New integration documents:

November 25th, 2020

Syntax changes in CLI:

  • runai <object> list has been replaced by runai list <object>.
  • runai get has been replaced by runai describe job.
  • runai <object> set has been replaced by runai config <object>.

The older style will still work with a deprecation notice.

runai top node has been revamped.

November 12th, 2020

An Admin can now create templates for the Command-line interface. Both a default template and specific templates, that can be used with the --template flag. The new templates allow for mandatory values, defaults, and run-time environment variable resolution.

It is now also possible to pass Secrets to Job. see here

November 2nd, 2020

Several changes and additions to the Command-line interface:

  • Passing a command and arguments is now done docker-style by adding -- at the end of the command
  • You no longer need to provide a Job name. If you don't, a Job name will be generated automatically. You can also control the job-name prefix using an additional flag.
  • New --image-pull-policy flag, allowing Researcher support for updating images without tagging.

For further information see runai submit

September 6th, 2020

We released a module that helps the Researcher perform Hyperparameter optimization (HPO). HPO is about running many smaller experiments with varying parameters to help determine the optimal parameter set Hyperparameter Optimization Quickstart

September 3rd, 2020

GPU Fractions now run in training and not only interactive. GPU Fractions training Job can be preempted, bin-packed and consolidated like any integer Job. See Run:ai Scheduler Fraction for more.

August 10th, 2020

Run:ai Now supports Distributed Training and Gang Scheduling. For further information, see the Launch Distributed Training Workloads quickstart.

August 4th, 2020

There is now an optional second level of Project hierarchy called Departments. For further information on how to configure and use Departments, see Working with Departments

July 28th, 2020

You can now enforce a cluster-wise setting that mandates all containers running using the Run:ai CLI to run as non root. For further information, see Enforce non-root Containers

July 21th, 2020

It is now possible to mount a Persistent Storage Claim using the Run:ai CLI. See the --pvc flag in the runai submit CLI flag

June 13th, 2020

New Settings for the Allocation of CPU and Memory

It is now possible to set limits for CPU and memory as well as to establish defaults based on the ratio of GPU to CPU and GPU to memory.

For further information see: Allocation of CPU and Memory

June 3rd, 2020

Node Group Affinity

Projects now support Node Affinity. This feature allows the Administrator to assign specific Projects to run only on specific nodes (machines). Example use cases:

  • The Project team needs specialized hardware (e.g. with enough memory)
  • The Project team is the owner of specific hardware which was acquired with a specialized budget
  • We want to direct build/interactive workloads to work on weaker hardware and direct longer training/unattended workloads to faster nodes

For further information see: Working with Projects

Limit Duration of Interactive Jobs

Researchers frequently forget to close Interactive Job. This may lead to a waste of resources. Some organizations prefer to limit the duration of interactive Job and close them automatically.

For further information on how to set up duration limits see: Working with Projects

May 24th, 2020

Kubernetes Operators

Cluster installation now works with Kubernetes Operators. Operators make it easy to install, update, and delete a Run:ai cluster.

For further information see: Upgrading a Run:ai Cluster Installation and Deleting a a Run:ai Cluster Installation

March 3rd, 2020

Admin Overview Dashboard

A new admin overview dashboard that shows a more holistic view of multiple clusters. Applicable for customers with more than one cluster.