chore(ansible): update model endpoints and enable Rommie deployment
- Bump Qwen model from 3.5 to 3.6 and update inference endpoints (nyx:22079→22072, pan:22078→22076) for caliban and puck hosts - Add Rommie MCP server deployment to site.yml - Update Rommie docs to reflect new port (20361), model versions, and health check accepting 200/406 status codes
This commit is contained in:
@@ -24,11 +24,11 @@ rommie_port: 20361
|
||||
rommie_host: "0.0.0.0"
|
||||
rommie_display: ":10"
|
||||
rommie_allowed_hosts: "caliban.incus,rommie.ouranos.helu.ca"
|
||||
rommie_model: Qwen3.5-35B-A3B-UD-Q4_K_XL.gguf
|
||||
rommie_model_url: "http://nyx.helu.ca:22079"
|
||||
rommie_model: Qwen3.6-35B-A3B-UD-Q4_K_XL.gguf
|
||||
rommie_model_url: "http://nyx.helu.ca:22072"
|
||||
rommie_provider: "openai"
|
||||
rommie_ground_provider: "huggingface"
|
||||
rommie_ground_url: "http://pan.helu.ca:22078"
|
||||
rommie_ground_url: "http://pan.helu.ca:22076"
|
||||
rommie_ground_model: "UI-TARS-7B-DPO-Q6_K_L.gguf"
|
||||
rommie_grounding_width: 1024
|
||||
rommie_grounding_height: 1024
|
||||
|
||||
Reference in New Issue
Block a user