Run:ai Documentation Library
Jupyter Notebook
Use a Jupyter Notebook with a Run:ai Job
See the Jupyter Notebook Quickstart.