llama.trnz22.com /v1/chat/completions

Self-hosted inference console

Qwen2.5-7B-Instruct via llama.cpp, behind Caddy + gateway