Request: Model Profile and Ollama Comparator Parity (2026-05-30)
Request Summary
Before self-review execution loops, ensure the local runtime has a reliable model-comparison baseline:
- a trained-model profile intended to remain the default in chat
- an identical stock
qwen3.5:4b24k comparator profile - Ollama tag parity so both profiles are executable
Why This Request Exists
Repeated training and A/B sessions were paying setup tax because model profile defaults and local Ollama tags drift across environments (repo run vs published artifacts run). The comparison objective needs deterministic profile wiring and deterministic local tags.
Evidence
MemorySmith.App/Services/ChatModelProfileService.cspersists model profiles and default profile id to resolved settings override.MemorySmith.App/Services/MemorySmithConfigurationPaths.csresolves override path across repo and artifacts surfaces.- Local Ollama inspection on 2026-05-30 showed stock
qwen3.5:4bonly; no localmemorysmith-athena:latesttag was present. - Created local stock comparator tag
qwen3.5:4b-24k-stockand updated override profile wiring in: MemorySmith.App/appsettings.LocalOverrides.jsonartifacts/MemorySmith.App/appsettings.LocalOverrides.json
Recommendations
1. Add automated bootstrap for model profile parity
- Classification:
Now - Impact: high
- Effort: medium
- Confidence: 92%
- Recommendation: add a script hook that resolves active settings path, verifies required Ollama tags, creates stock 24k comparator tag when missing, and writes consistent profile/default ids.
2. Gate trained-default promotion on local tag presence
- Classification:
Now - Impact: high
- Effort: low
- Confidence: 91%
- Recommendation: if trained tag is missing, warn and keep default on an executable stock profile instead of setting a broken default.
3. Add one-command verification snapshot
- Classification:
Next - Impact: medium
- Effort: low
- Confidence: 89%
- Recommendation: add a script output with
ollama listsummary, resolved settings path, profile ids, default id, and context windows for fast audit evidence.
4. Fold parity checks into periodic self-review cadence
- Classification:
Later - Impact: medium
- Effort: low
- Confidence: 84%
- Recommendation: include this verification in monthly self-review to avoid recurrence.
Status
InProgress
Tracking
- Implementation task:
TSK-0253 - Related request pages:
self-review-and-skill-governance-20260529.mdskill-small-improvements-batch-20260529.md