- Add PLATFORM query type that bypasses RAG and uses a lighter model - Introduce OLLAMA_MODEL_NAME_CONVERSATIONAL env var to route CONVERSATIONAL and PLATFORM queries to a separate (smaller) Ollama model - Replace raw message history in classifier with compact intent history (classify_history) to eliminate anchoring bias in small models - Add <history_rule> and <platform_priority_rule> to classifier prompt so the model evaluates each message independently while still resolving ambiguous references from prior turns - Add fast-path detection for known platform-injected prompt prefixes - Add PLATFORM_PROMPT for account/metrics/usage responses - Persist classify_history in classify_history_store alongside session_store - Document decisions in ADR-0008 Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com> |
||
|---|---|---|
| .. | ||
| ADR-0001-grpc-primary-interface.md | ||
| ADR-0002-two-phase-streaming.md | ||
| ADR-0003-hybrid-retrieval-rrf.md | ||
| ADR-0004-claude-eval-judge.md | ||
| ADR-0005-embedding-model-selection.md | ||
| ADR-0006-reward-algorithm-dataset-synthesis.md | ||
| ADR-0007-mandatory-syntactic-validation-layer.md | ||
| ADR-0008-adaptive-query-routing-intent-history.md | ||