November 25th, 2020¶
Syntax changes in CLI:
runai <object> listhas been replaced by
runai list <object>.
runai gethas been replaced by
runai describe job.
runai <object> sethas been replaced by
runai config <object>.
The older style will still work with a deprecation notice.
runai top node has been revamped.
November 12th, 2020¶
An Admin can now create templates for the Command-line interface. Both a default template and specific templates that can be used with the --template flag. The new templates allow for mandatory values, defaults and run-time environment variable resolution. See here for more.
It is now also possible to pass Secrets to jobs. see here
November 2nd, 2020¶
Several changes and additions to the Command-line interface:
- Passing a command and arguments is now done docker-style by adding
--at the end of the command
- You no longer need to provide a job name. If you don't, a job name will be generated automatically. You can also control the job-name prefix using an additional flag.
--image-pull-policyflag, allowing Researcher support for updating images without tagging.
For further information see runai submit
September 6th, 2020¶
We released a module that helps the Researcher perform Hyperparameter optimization (HPO). HPO is about running many smaller experiments with varying parameters to help determine the optimal parameter set Hyperparameter Optimization Quickstart
August 10th, 2020¶
Run:AI Now supports Distributed Training and Gang Scheduling. For further information , see the Launch Distributed Training Workloads quickstart.
August 4th, 2020¶
There is now an optional second level of Project hierarchy called Departments. For further information on how to configure and use Departments, see Working with Departments
July 28th, 2020¶
You can now enforce a cluster-wise setting that mandates all containers running using the Run:AI CLI to run as non root. For further information, see Enforce non-root Containers
July 21th, 2020¶
It is now possible to mount a Persistent Storage Claim using the Run:AI CLI. See the
--pvc flag in the runai submit CLI flag
June 13th, 2020¶
New Settings for the Allocation of CPU and Memory¶
It is now possible to set limits for CPU and memory as well as to establish defaults based on the ratio of GPU to CPU and GPU to memory.
For further information see: Allocation of CPU and Memory
June 3rd, 2020¶
Node Group Affinity¶
Projects now support Node Affinity. This feature allows the administrator to assign specific projects to run only on specific nodes (machines). Example use cases:
- The project team needs specialized hardware (e.g. with enough memory)
- The project team is the owner of specific hardware which was acquired with a specialized budget
- We want to direct build/interactive workloads to work on weaker hardware and direct longer training/unattended workloads to faster nodes
For further information see: Working with Projects
Limit Duration of Interactive Jobs¶
Researchers frequently forget to close Interactive jobs. This may lead to a waste of resources. Some organizations prefer to limit the duration of interactive jobs and close them automatically.
For further information on how to set up duration limits see: Working with Projects
May 24th, 2020¶
Cluster installation now works with Kubernetes Operators. Operators make it easy to install, update, and delete a Run:AI cluster.
March 3rd, 2020¶
Admin Overview Dashboard¶
A new admin overview dashboard which shows a more holistic view of multiple clusters. Applicable for customers with more than one cluster.