Getting Started¶
Requirements¶
- uv >= 0.4
- Python >= 3.12
- Linux with NVIDIA GPU and
nvidia-smiin PATH
Installation¶
Make gpumod available globally (recommended)¶
Install gpumod as a uv tool so
the command is always on your PATH — no need to activate a virtualenv or
prefix with uv run:
The -e (editable) flag means changes to the source are picked up
immediately. uv places the binary in ~/.local/bin/ (or the uv tool bin
directory). If that's not on your PATH yet, run:
After this, gpumod works from any directory:
Alternative: use uv run or activate the venv
If you prefer not to install globally, you can still use:
First-time Setup¶
Initialize the database and load built-in presets:
This creates ~/.config/gpumod/gpumod.db with:
- GPU hardware profile (detected from
nvidia-smi) - Built-in service presets loaded from
presets/ - Default mode definitions
Configuration¶
gpumod is configured via environment variables with the GPUMOD_ prefix. A
.env.example file is included in the repository root — copy it to .env and
uncomment the variables you want to override.
See Configuration for all available settings including preflight thresholds, LLM backends, and database path.
Deploying a Service¶
gpumod auto-generates systemd unit files from presets — you never write them by hand.
1. Preview the generated unit file¶
2. Enable user-level systemd lingering¶
gpumod uses user-level systemd (systemctl --user) so no sudo is
needed for service management. To ensure your services start at boot and
persist after logout, enable lingering:
3. Install the unit file¶
This writes the unit to ~/.config/systemd/user/:
After installing, reload the user daemon so systemd picks up the new unit:
4. Start and manage services¶
# Start a service
gpumod service start vllm-chat
# Check service status
gpumod service status vllm-chat
# Stop a service
gpumod service stop vllm-chat
Basic Workflow¶
# Check GPU status and VRAM usage
gpumod status
# List registered services
gpumod service list
# Simulate a mode switch before committing
gpumod simulate mode coding-mode
# Switch to a mode (starts/stops services to fit VRAM budget)
gpumod mode switch coding-mode
# Launch the interactive TUI
gpumod tui
See the CLI Reference for all available commands.