Vector Search Research Index
This page is the coordination hub for the current MemorySmith vector-search and semantic-search research stream. It consolidates benchmark evidence, deep audits, implemented hardening changes, and mapped task records.
Current Papers And Audits
- Codebase Vector Search Whitepaper Notes
- Code Search Benchmark Breakdown: CPU Fallback vs CUDA
- Code Search Benchmark: E5 vs Nomic Embed Text v1.5
- Audit #6 Validation Notes
- Audit #5 Validation Notes
- Deep Research Prompt: MemorySmith Codebase Vector Search And Embedding Stack
- Vector Search General Codebase Report (Research Copy, 2026-05-28)
Benchmarks
Latest Snapshot
- CPU fallback cold rebuild: about
418,824 ms(225files /2,388chunks) - CUDA cold rebuild: about
525,224 mson the same corpus (about25.4%slower than CPU fallback in that run) - CUDA batch sweep (
228files /2,417chunks): - batch
1:116,992 ms - batch
2:125,565 ms - batch
4:140,630 ms - batch
8:147,988 ms - batch
16:159,699 ms
Interpretation
- Current end-to-end rebuild performance does not support fixed-size batch defaults above
1on the measured host/corpus. - Provider-level micro-bench wins are not sufficient for default tuning decisions.
- Current optimization focus should remain evidence-first: candidate reduction and batching-shape strategy, not speculative complexity.
Implemented Hardening (2026-05-28)
- Unsupported execution providers now fail fast before ONNX native initialization, fixing CI instability for unsupported-provider diagnostics.
- Code-search document embedding failures now fail closed at file scope instead of silently persisting empty vectors.
- Code-search query timing telemetry now supports explicit disable, sampling interval throttling, and slow-query warning thresholds.
Task Mapping
Completed:
- TSK-0196 Fail fast on unsupported ONNX execution provider before native initialization.
- TSK-0197 Add throttled code-search query timing telemetry.
Backlog from deep audit:
- TSK-0198 Candidate prefilter before code-search vector scoring.
- TSK-0199 Semantic-search candidate index and ANN migration plan.
- TSK-0200 Surface semantic fallback and hybrid provider metadata in default API responses.
- TSK-0211 Harden code-search SQLite storage and query hot path.
- TSK-0212 Harden chat tool-call parsing and buffer fallbacks.
- TSK-0213 Tighten operational diagnostics redaction boundaries.
Validated cross-cutting owners from the audit family:
- TSK-0205 already closed clipboard external-fetch, Mermaid restriction, CSP, and markdown sanitization hardening that earlier audit copies still called open.
- TSK-0042 remains the broader readability/decomposition owner for
ChatServicesafter the narrower security claims were re-checked.
Next High-ROI Validation
- Run CI and confirm the unsupported-provider path remains deterministic across runner environments.
- Capture query timing samples under real
/mcpand/api/search/codeload with telemetry disabled and enabled to quantify overhead. - Prototype candidate prefiltering for code-search vector ranking and compare latency and relevance against current brute-force baseline.