pallas

r/pallas

Author	SHA1	Message	Date
Robert Helewka	ca7d714a31	docs(pallas): document sampling parameters and Prometheus metrics Add two new sections to the Pallas documentation: - Sampling parameters: explain that temperature/top_p/top_k are configured via the fast-agent decorator's `request_params`, with a provider support matrix and a note on Claude Opus 4.7 stripping these params in favor of `output_config.effort`. - Metrics: document the Prometheus `/metrics` endpoint exposed on the registry port, including scrape config, full metrics reference table, and notes on where each metric is captured.	2026-05-23 07:49:21 -04:00
Robert Helewka	6fcdb509df	Release 0.2.1 fixes LLM API Status Error	2026-05-17 19:09:34 -04:00
Robert Helewka	75d529cf16	docs: update Mantle setup to reflect automatic shim detection	2026-05-12 11:16:22 -04:00
Robert Helewka	95fa6e6fc0	feat!: stateless per-request agents; add history + conversation_id to send_message Make Pallas truly stateless per the 'Pallas is ephemeral' contract. BREAKING (behavioural, not API): * instance_scope changes from 'shared' to 'request' in pallas.server. Each MCP tools/call now acquires a freshly-created fast-agent instance via the existing create_instance / dispose_instance factories and disposes it immediately after the response. With 'shared' mode: * Every MCP caller saw the same agent.message_history, so different Daedalus conversations leaked into each other. * Mid-chat context was silently truncated once the model window filled. * Restarting the Pallas process wiped all in-flight conversation state, even though Daedalus had it persisted in Postgres. With 'request' mode the Pallas process holds no per-conversation state; the caller (Daedalus) owns history and reseeds it on every turn. send_message gains two optional arguments: * history: list[{role, content, images?}] in chronological order, converted to PromptMessageExtended and seeded onto the fresh instance's message_history before agent.send(). * conversation_id: opaque string, logged for trace correlation only — Pallas never interprets or persists it. Malformed history entries (bad role, missing image data/mime_type, etc.) are skipped with a warning rather than raising, so a single bad row cannot wipe a whole conversation. The {agent}_history MCP prompt is still registered under 'request' scope for backward compatibility but always returns []; history lives on the client. Version bumped to 0.2.0.	2026-04-27 08:16:59 -04:00
Robert Helewka	0cea5ece3a	feat: add /healthz and /metrics endpoints, replace print with logging - Add /healthz endpoint returning LLM provider validation status - Add /metrics endpoint serving Prometheus metrics via prometheus_client - Replace all print() calls in health.py with proper logging module - Remove _PREFIX variable in favor of structured logger context	2026-04-10 11:22:26 +00:00
Robert Helewka	9092afb532	Initial commit: pallas package extracted from mentor	2026-04-02 12:41:53 +00:00

6 Commits