Open WebUI Standalone Frontend On Proxmox

This is the split-frontend path.

The inference box already exists. Maybe it is a dedicated Ollama container. Maybe it is a llama.cpp server you want to keep under tighter control. Either way, the browser layer does not need to live in the same guest forever.

This page keeps Open WebUI in its own small LXC, points it at remote backends, and leaves the heavy GPU work somewhere else. If you still want the shortest all-in-one path, stay with Open WebUI And Ollama On Proxmox. If the lab wants an assistant instead of a browser tab, continue later to OpenClaw On Proxmox.

This is still homelab-grade, non-production plumbing. The point here is cleaner resource boundaries, not pretending the frontend suddenly became enterprise software.

Why Split Open WebUI From Inference

The combined deployment works. It is also an easy way to leave too many resources attached to the part of the stack that is only serving HTML, API glue, document indexing, and session state.

Splitting makes sense when the lab already trusts the inference node and wants the interface to become lighter, easier to rebuild, and easier to move.¹

Benefit	Detail
Resource efficiency	Open WebUI needs 2-4 cores and 4 GB RAM. The inference server gets the rest.
Independent scaling	Size the inference container for your largest model without affecting the web UI.
No GPU needed for UI	Open WebUI does not perform inference in this layout, so there is no GPU passthrough or NVIDIA driver install in the UI container.
Cleaner upgrades	Update Open WebUI without disturbing the runtime that actually serves tokens.
Multiple backends	One Open WebUI instance can connect to both Ollama and llama.cpp at the same time.
Separation of concerns	Frontend and inference have different failure modes, resource profiles, and maintenance windows.

Target Layout

The shape looks like this:

Proxmox host
  -> CT 103 Open WebUI only      (192.168.50.30:42)
  -> CT 100 Ollama only          (192.168.50.40:58008)
  -> remote llama.cpp endpoint   (OpenAI-compatible API)

If the lab already standardised on llama.cpp Inference On Proxmox or llama.cpp Router Mode On Proxmox, the same OpenAI-compatible connection flow applies here. The commands below keep the original remote-backend values from the source notes so the working examples stay intact.

What The Script Still Installs

The Proxmox helper script still installs a full Open WebUI application environment rather than a tiny static frontend. That matters because the container stays light compared with the inference guests, but it is not empty.²³

Installed

Component	Size (approx.)	Why It Is There
Python 3.12 (via `uv`)	~50 MB	Open WebUI is a Python/FastAPI application
`open-webui[all]`	~500 MB	The supported application install path
PyTorch (CPU-only)	~2 GB	Pulled in by Open WebUI's dependency tree
`sentence-transformers`	~100 MB	Used for RAG embeddings
`faster-whisper`	~50 MB	Speech-to-text support
`chromadb`	~50 MB	Default vector database for RAG
`ffmpeg`	~80 MB	Audio processing for uploads
`zstd`	~1 MB	Compression used by some Python packages

Not Installed In This Layout

Component	Why It Is Skipped
NVIDIA userspace libraries	No GPU passthrough, so `setup_hwaccel` is not triggered
NVIDIA `.run` driver in the container	No GPU work happens here
Ollama binary	Declined during the script prompt
GPU device passthrough entries	Disabled in the build inputs

What Still Works Without GPU

Feature	GPU Required?	Notes
Chat with remote models	No	All inference happens on remote Ollama or llama.cpp
RAG	No	Embeddings are generated on CPU
Speech-to-text	No	`faster-whisper` falls back to CPU
Image generation	No	Requests are forwarded to remote APIs
OCR	No	Runs on CPU
Web search	No	Pure API calls

Create The LXC Container

Container Sizing

Setting	Value	Rationale
Container ID	103	Keeps the UI separate from the earlier combined stack
Hostname	`openwebui`	Reflects the frontend-only role
Disk Size	20-30 GB	App, Python deps, SQLite data, uploaded documents
CPU Cores	2-4	FastAPI web server and indexing overhead
RAM	4096 MiB	Enough for the app, embeddings, and headroom
OS	Debian 13	Same default as the other current LXC guides
Bridge	`vmbr0`	Same network as the inference containers
IPv4	192.168.50.30/24	Keeps the browser endpoint predictable
Gateway	`192.168.50.1`	Router
GPU Passthrough	No	This is the key difference from the combined stack
Nesting	Enabled	Required for systemd in Debian 13

Pre-Configure App Defaults

The Community-Scripts framework already supports per-app defaults files. That is the cleanest way to skip the long wizard and keep the build reproducible.²³

On the Proxmox VE host shell:

# Create the directory (if it doesn't exist)
mkdir -p /usr/local/community-scripts/defaults
 
# Write the vars file
cat > /usr/local/community-scripts/defaults/openwebui.vars << 'EOF'
# OpenWebUI Standalone (Frontend Only) - App Defaults
# No GPU, no Ollama — lightweight UI container
var_cpu=4
var_ram=4096
var_disk=25
var_unprivileged=1
var_gpu=no
var_brg=vmbr0
var_net=192.168.50.30/24
var_gateway=192.168.50.1
var_hostname=openwebui
var_os=debian
var_version=13
var_ssh=yes
var_nesting=1
var_protection=yes
var_tags=ai;interface
var_timezone=Australia/Melbourne
var_container_storage=local-zfs
var_template_storage=local
EOF

Variable	Description	Example
`var_cpu`	CPU cores	`4`
`var_ram`	RAM in MiB	`4096`
`var_disk`	Disk in GB	`25`
`var_unprivileged`	`1` = unprivileged, `0` = privileged	`1`
`var_gpu`	GPU passthrough	`yes` / `no`
`var_brg`	Network bridge	`vmbr0`
`var_net`	Static IP (CIDR)	`192.168.50.30/24`
`var_gateway`	Default gateway	`192.168.50.1`
`var_hostname`	Container hostname	`openwebui`
`var_os`	OS template	`debian`
`var_version`	OS version	`13`
`var_ssh`	Enable SSH	`yes` / `no`
`var_nesting`	Enable nesting	`1`
`var_protection`	Prevent accidental deletion	`yes`
`var_tags`	Proxmox tags	`ai;interface`
`var_timezone`	Timezone	`Australia/Melbourne`
`var_container_storage`	Storage pool for container	`local-zfs`
`var_template_storage`	Storage pool for templates	`local`
`var_vlan`	VLAN tag	`50`
`var_mtu`	MTU size	`1500`

Run The Community Script

bash -c "$(curl -fsSL https://raw.githubusercontent.com/community-scripts/ProxmoxVE/main/ct/openwebui.sh)"

Select App Defaults.

When prompted "Would you like to add Ollama?" type N.

Expected output should look similar to:

  Using App Defaults for Open WebUI
  GPU Passthrough: no
  CPU Cores: 4
  RAM Size: 4096 MiB
  Disk Size: 25 GB
  IPv4: 192.168.50.30/24
  ...
  Installed Open WebUI
      Would you like to add Ollama? <y/N> N
  ...
  http://192.168.50.30:42

For a one-shot run without touching files:

var_cpu=4 var_ram=4096 var_disk=25 var_gpu=no var_hostname=openwebui \
var_net=192.168.50.30/24 var_gateway=192.168.50.1 var_nesting=1 \
bash -c "$(curl -fsSL https://raw.githubusercontent.com/community-scripts/ProxmoxVE/main/ct/openwebui.sh)"

Verify Installation

pct enter 103
 
# Check OpenWebUI service is running
systemctl status open-webui
 
# Confirm no Ollama service exists
systemctl status ollama
# Expected: "Unit ollama.service could not be found"
 
# Check no GPU devices are present
ls /dev/nvidia* 2>/dev/null
# Expected: no output (no GPU devices)

Change The Listening Port

Open WebUI defaults to port 8080. The source layout moves it to 42 to reduce scanning noise and keep the UI guest distinct from the older combined examples.

Edit the systemd service file inside the container:

pct enter 103
nano /etc/systemd/system/open-webui.service

Find:

ExecStart=/root/.local/bin/open-webui serve

Change it to:

ExecStart=/root/.local/bin/open-webui serve --port 42

Reload and restart:

systemctl daemon-reload
systemctl restart open-webui

Verify:

ss -tlnp | grep 42
# Expected: open-webui listening on 0.0.0.0:42
 
curl -s http://localhost:42
# Expected: HTML response from OpenWebUI

Configure Remote Backends

Open WebUI can talk to remote Ollama by using the Ollama connection path, and to llama.cpp by using its OpenAI-compatible API path.¹⁴

Connect To Remote Ollama

Method 1 uses the container environment file:

pct enter 103
nano /root/.env

Add or update:

OLLAMA_BASE_URL=http://192.168.50.40:58008

Restart Open WebUI:

systemctl restart open-webui

Method 2 uses the admin UI:

Open http://192.168.50.30:42 in your browser.
Go to Admin Panel -> Settings -> Connections.
Under Ollama API, set the URL to http://192.168.50.40:58008.
Save.

Once saved in the admin UI, the database-backed setting overrides the environment variable. To force environment values to win again, set ENABLE_PERSISTENT_CONFIG=false in /root/.env.⁴

Verify the connection from inside the Open WebUI container:

curl -s http://192.168.50.40:58008/api/tags | python3 -m json.tool

Connect To Remote llama.cpp

llama.cpp exposes an OpenAI-compatible API, so it goes under the OpenAI API section instead of the Ollama section.

Open http://192.168.50.30:42.
Go to Admin Panel -> Settings -> Connections.
Under OpenAI API, add a connection with these values:
- URL: http://192.168.50.45:65535/v1
- API Key: your llama-server --api-key value
Save.

Verify llama.cpp:

curl -s -H "Authorization: Bearer <your-api-key>" \
  http://192.168.50.45:65535/v1/models | python3 -m json.tool

If the lab already uses llama.cpp Router Mode On Proxmox, keep the same OpenAI connection flow and substitute that endpoint instead. If the lab already uses llama.cpp Router Mode On Proxmox, keep the same OpenAI connection flow and substitute that endpoint instead.

Useful Environment Variables

Variable	Default	Recommended	Purpose
`OLLAMA_BASE_URL`	`http://localhost:11434`	`http://192.168.50.40:58008`	Remote Ollama endpoint
`ENABLE_OLLAMA_API`	`True`	`True`	Enable Ollama API integration
`AIOHTTP_CLIENT_TIMEOUT`	`300`	`300`	Timeout in seconds for inference requests
`AIOHTTP_CLIENT_TIMEOUT_MODEL_LIST`	`10`	`3`	Timeout for model-list fetch
`ENABLE_BASE_MODELS_CACHE`	`False`	`True`	Cache model lists
`MODELS_CACHE_TTL`	`300`	`300`	Cache TTL in seconds

After editing /root/.env, restart:

systemctl restart open-webui

Migrate Data From An Existing Container

If the lab started with the combined deployment from Open WebUI And Ollama On Proxmox, you can move the interface data rather than starting over.

Path	Contains
`/root/.open-webui/`	SQLite database, users, chat history, settings, RAG data
`/root/.env`	Runtime configuration

Run from the Proxmox host:

# 1. Stop OpenWebUI on the NEW container to avoid conflicts
pct exec 103 -- systemctl stop open-webui
 
# 2. Copy the data directory from OLD container (CT 100) to host
pct pull 100 /root/.open-webui/ /tmp/open-webui-backup/ --recursive
 
# 3. Push the data directory from host to NEW container (CT 103)
pct push 103 /tmp/open-webui-backup/ /root/.open-webui/ --recursive
 
# 4. Copy the env file
pct pull 100 /root/.env /tmp/open-webui-env-backup
pct push 103 /tmp/open-webui-env-backup /root/.env
 
# 5. Update the env file on the NEW container to point to remote backends
pct exec 103 -- sed -i 's|OLLAMA_BASE_URL=.*|OLLAMA_BASE_URL=http://192.168.50.40:58008|' /root/.env
 
# 6. Start OpenWebUI on the NEW container
pct exec 103 -- systemctl start open-webui

If pct pull and pct push with --recursive are unavailable on your Proxmox version, the source notes also allow the fallback scp path:

# From inside CT 103
scp -r root@192.168.50.30:/root/.open-webui/ /root/.open-webui/
scp root@192.168.50.30:/root/.env /root/.env

Migration checklist:

Chat history is visible in the new instance.
User accounts and admin credentials still work.
Uploaded RAG documents are available.
Connection settings point to remote backends instead of localhost.
The old container no longer auto-starts Open WebUI.

Verify And Access

From inside CT 103:

# 1. OpenWebUI service is running
systemctl is-active open-webui
# Expected: active
 
# 2. No GPU devices present (expected — we don't need them)
ls /dev/nvidia* 2>/dev/null && echo "GPU found (unexpected)" || echo "No GPU (correct)"
 
# 3. Remote Ollama is reachable
curl -s http://192.168.50.40:58008/api/tags
# Expected: JSON with model list
 
# 4. Remote llama.cpp is reachable
curl -s -H "Authorization: Bearer <your-api-key>" http://192.168.50.45:65535/v1/models
# Expected: JSON with model list
 
# 5. OpenWebUI port is listening
ss -tlnp | grep 42
# Expected: open-webui listening on 0.0.0.0:42

Then open:

http://192.168.50.30:42

Verify that the login page loads, the model dropdown shows the remote backends, and chat requests return answers without local GPU errors.

Management Commands

Service Control

Inside the container:

# Status
systemctl status open-webui
 
# Restart
systemctl restart open-webui
 
# Check installed version
uv tool list | grep open-webui
 
# View logs (live)
journalctl -u open-webui -f
 
# View recent logs
journalctl -u open-webui --since "10 minutes ago"

Container Control

From the host:

# Start / Stop
pct start 103
pct stop 103
 
# Enter container shell
pct enter 103
 
# Quick command execution
pct exec 103 -- systemctl status open-webui

Update Open WebUI

Open WebUI updates are small when nothing migrates and annoying when the database changes under you. Treat every update as a data operation first and a package operation second.

Before updating, check the release notes for breaking changes or database migrations.⁵ If a release runs a one-way migration, a package downgrade alone will not put the SQLite schema back.

Check the current installed version from the host:

pct exec 103 -- /root/.local/bin/uv tool list | grep open-webui

uv lives in /root/.local/bin/, which is not always in PATH for non-login pct exec shells. Use the full path when calling it from the Proxmox host.

Back Up First

Run this from the Proxmox host before every Open WebUI update:

# Stop the service to ensure a clean SQLite snapshot
pct exec 103 -- systemctl stop open-webui
 
# Back up data directory and env file (timestamped)
pct exec 103 -- bash -c "cp -a /root/.open-webui /root/.open-webui-backup-$(date +%Y%m%d)"
pct exec 103 -- bash -c "cp /root/.env /root/.env-backup-$(date +%Y%m%d) 2>/dev/null || true"
 
# Confirm backup was created
pct exec 103 -- ls -la /root/ | grep open-webui
 
# Restart before updating (update will restart again after)
pct exec 103 -- systemctl start open-webui

For larger jumps, take a Proxmox snapshot too. That captures the Python environment as well as the app database:

# From the Proxmox host shell
pct snapshot 103 pre-update-$(date +%Y%m%d) --description "Before OpenWebUI update"

Run The Update

The Community-Scripts installer uses uv tool install --force with a numba>=0.60 constraint. Use the same shape for upgrades. Do not use uv tool upgrade; it does not apply the constraint from the install path.

pct enter 103
 
# Update to latest via uv (mirrors community script install method)
uv tool install --force --python 3.12 --constraint <(echo "numba>=0.60") open-webui[all]
 
# Restart the service
systemctl restart open-webui

You can also use the Community-Scripts update path by re-running the script from the Proxmox host. It detects the existing Open WebUI container and invokes its update function:

bash -c "$(curl -fsSL https://raw.githubusercontent.com/community-scripts/ProxmoxVE/main/ct/openwebui.sh)"
# Select CT 103 when prompted

Verify The Update

Run these from inside CT 103:

# 1. Confirm the installed package version (uv metadata — not the running version)
uv tool list | grep open-webui
 
# 2. Restart the service to load the newly installed binary
systemctl restart open-webui
 
# 3. Confirm the RUNNING version from the startup banner
journalctl -u open-webui -n 30 --no-pager | grep "v0\."
# Expected: v0.9.x - building the best AI user interface.
 
# 4. Confirm UI is responding
curl -s http://localhost:42 | head -5
# Expected: HTML response from OpenWebUI

uv tool list shows what is installed on disk. The running version is the startup banner in journalctl. If the banner still shows the old version after systemctl restart, rerun the uv tool install --force command and restart again.

After updating, hard-refresh the browser tab so stale frontend assets do not make the old UI look like it survived the upgrade.

Roll Back Without A Migration

If no database migration ran, pin the previous package version and restart:

pct enter 103
 
# Downgrade to a specific version
uv tool install --force --python 3.12 --constraint <(echo "numba>=0.60") "open-webui[all]==0.5.20"
 
systemctl restart open-webui

Replace 0.5.20 with the version you recorded before updating.

Restore After A Failed Migration

If the update migrated the database and the new release fails, restore the pre-update data backup and reinstall the previous package:

pct enter 103
 
# Stop the service
systemctl stop open-webui
 
# Remove the migrated database and restore from backup
# Replace YYYYMMDD with your backup date
rm -rf /root/.open-webui
cp -a /root/.open-webui-backup-YYYYMMDD /root/.open-webui
 
# Restore env file if needed
cp /root/.env-backup-YYYYMMDD /root/.env
 
# Downgrade the package to the pre-update version
uv tool install --force --python 3.12 --constraint <(echo "numba>=0.60") "open-webui[all]==<pre-update-version>"
 
# Restart
systemctl restart open-webui

If you took a Proxmox snapshot first, use the snapshot rollback instead:

# From the Proxmox host — stop, restore snapshot, start
pct stop 103
pct rollback 103 pre-update-YYYYMMDD
pct start 103

Choose A Secure Exposure Path

The UI is reachable on http://192.168.50.30:42, but the HTTPS decision should still live in the shared exposure guides rather than being re-explained differently on every workload page.

Secure Service Exposure On Proxmox - the decision hub for how services should leave the house.
Cloudflare Tunnel On Proxmox - the shortest secure path when you want outbound-only exposure.
Nginx Reverse Proxy LXC On Proxmox - the self-hosted reverse-proxy path when the lab wants one front door for multiple services.

If you do expose this through Cloudflare Tunnel, update the ingress rule to the new internal port:

ingress:
  - hostname: openwebui.yourdomain.com
    service: http://192.168.50.30:42

Troubleshooting

Open WebUI Cannot Reach Remote Ollama

Run this from inside the Open WebUI container:

curl -v http://192.168.50.40:58008/api/tags

Common causes:

Ollama is not listening on 0.0.0.0.
The Ollama container is stopped.
A firewall is blocking the port.
OLLAMA_BASE_URL points at the wrong IP.

Models Load Slowly Or Time Out

Increase the inference timeout in /root/.env if needed:

AIOHTTP_CLIENT_TIMEOUT=600

Use a shorter model-list timeout for faster failure when a backend is unhealthy:

AIOHTTP_CLIENT_TIMEOUT_MODEL_LIST=3

RAG Embedding Is Slow

That is expected in this shape. The default embedding model runs on CPU in the UI container. The tradeoff is slower first-time indexing in exchange for not tying Open WebUI to a GPU-enabled guest.

High Memory Usage

If 4 GB is too tight, increase it:

# From Proxmox host (container must be stopped)
pct set 103 -memory 6144
pct start 103

Open WebUI And Ollama On Proxmox - the combined path first, plus the dedicated Ollama follow-on when you still want Open WebUI living with the runtime.
llama.cpp Inference On Proxmox - the lower-level OpenAI-compatible backend when you want tighter runtime control.
llama.cpp Router Mode On Proxmox - the cleaner multi-model backend once one model no longer feels like enough.
OpenClaw On Proxmox - the assistant layer when the same local model estate should power Telegram, Discord, and scheduled workflows instead of only a browser UI.

Open WebUI documents both the bundled-Ollama path and the "Ollama on a different server" path, and notes that the first account created becomes the administrator for that instance: Open WebUI Quick Start. ↩ ↩²
The Community-Scripts build framework defines the install menu, App Defaults precedence, and the variables used by each application entrypoint: Community-Scripts build.func. ↩ ↩²
The current Open WebUI Community-Scripts entrypoint installs Open WebUI with uv, creates open-webui.service, stores data in /root/.open-webui, and exposes /root/.env for runtime overrides: Community-Scripts openwebui.sh. ↩ ↩²
Open WebUI documents the environment variables used for backend connection behaviour, including persistent config and request timeouts: Open WebUI Environment Variables. ↩ ↩²
Open WebUI publishes release notes for version changes, breaking changes, and migration warnings: Open WebUI Releases. ↩

Comments