Move llama-cpp to generic fastagent slot

2026-05-12 15:07:00 -04:00
parent 8c95173705
commit b2fc398782
2 changed files with 11 additions and 6 deletions

@@ -95,7 +95,7 @@ Committed to the repo. Contains LLM provider settings and explicit model capabil
declarations.
```yaml
-default_model: openai.Qwen3.5-35B-A3B-UD-Q4_K_XL.gguf
+default_model: generic.Qwen3.5-35B-A3B-UD-Q4_K_XL.gguf
model_capabilities:
vision: false
@@ -249,6 +249,7 @@ sudo systemctl status iolaus
- **Python 3.13** required (`fast-agent-mcp` pins `>=3.13`)
- **Runtime:** [Pallas](https://git.helu.ca/r/pallas) — `pallas-mcp @ git+ssh://git@git.helu.ca:22022/r/pallas.git`
- **Transport:** StreamableHTTP (`/mcp`) throughout — not SSE
-- **LLM:** OpenAI-compatible API at `http://nyx.helu.ca:22079/v1` (personal Qwen model)
+- **LLM:** Local Qwen via fast-agent's Generic (OpenAI-compatible) provider at
+  `http://nyx.helu.ca:22079/v1`
- **Logging:** Console output — stdout → syslog → Alloy → Loki in production
- **Port scheme:** registry at 24000, personal agents 24001–24049, sub-agents 24050–24099
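
The two conventions touched by this change, the `provider.model` prefix in `default_model` and the port scheme in the notes, can be sketched as follows. This is only an illustration: `split_model` and `port_role` are hypothetical helper names, the split-on-first-dot rule is an assumption about how the prefix is read (fast-agent's own parsing may handle more cases), and the port ranges are taken as inclusive.

```python
# Hypothetical helpers; not part of fast-agent or this repository.

def split_model(spec: str) -> tuple[str, str]:
    """Split 'provider.model' on the FIRST dot only.

    The model name itself contains dots (e.g. 'Qwen3.5-...gguf'),
    so everything after the first dot is kept as the model name.
    """
    provider, sep, model = spec.partition(".")
    if not sep or not provider or not model:
        raise ValueError(f"expected 'provider.model', got {spec!r}")
    return provider, model


def port_role(port: int) -> str:
    """Map a port to its role under the scheme in the notes (ranges assumed inclusive)."""
    if port == 24000:
        return "registry"
    if 24001 <= port <= 24049:
        return "personal-agent"
    if 24050 <= port <= 24099:
        return "sub-agent"
    raise ValueError(f"port {port} is outside the 24000-24099 scheme")


if __name__ == "__main__":
    print(split_model("generic.Qwen3.5-35B-A3B-UD-Q4_K_XL.gguf"))
    print(port_role(24001))
```

Splitting on the first dot only is what lets the same model file name move between the `openai.` and `generic.` prefixes without any other change to the string.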