Move llama-cpp to generic fastagent slot
This commit is contained in:
@@ -95,7 +95,7 @@ Committed to the repo. Contains LLM provider settings and explicit model capabil
|
||||
declarations.
|
||||
|
||||
```yaml
|
||||
default_model: openai.Qwen3.5-35B-A3B-UD-Q4_K_XL.gguf
|
||||
default_model: generic.Qwen3.5-35B-A3B-UD-Q4_K_XL.gguf
|
||||
|
||||
model_capabilities:
|
||||
vision: false
|
||||
@@ -249,6 +249,7 @@ sudo systemctl status iolaus
|
||||
- **Python 3.13** required (`fast-agent-mcp` pins `>=3.13`)
|
||||
- **Runtime:** [Pallas](https://git.helu.ca/r/pallas) — `pallas-mcp @ git+ssh://git@git.helu.ca:22022/r/pallas.git`
|
||||
- **Transport:** StreamableHTTP (`/mcp`) throughout — not SSE
|
||||
- **LLM:** OpenAI-compatible API at `http://nyx.helu.ca:22079/v1` (personal Qwen model)
|
||||
- **LLM:** Local Qwen via fast-agent's Generic (OpenAI-compatible) provider at
|
||||
`http://nyx.helu.ca:22079/v1`
|
||||
- **Logging:** Console output — stdout → syslog → Alloy → Loki in production
|
||||
- **Port scheme:** registry at 24000, personal agents 24001–24049, sub-agents 24050–24099
|
||||
|
||||
Reference in New Issue
Block a user