zphinx/tai

Commit Graph


Author

SHA1

Message

Date

zphinx

e943e84bd2

feat(rag): harden Tier 1 retrieval observability and stability

CI / test (push) Failing after 15s

- Add --rag-debug flag to show retrieved chunk names and similarity scores
- Add explicit fallback notices when RAG indexing/query embedding fails
- Log RAG index/query metrics (duration, scores, top hit, token estimate)
- Normalize and cap chunk content for more stable prompt shape on small models
- Add hypothesis-continuity instruction for follow-up prompts
- Add retrieval scoring API and new tests for truncation/fallback/debug paths

2026-05-04 19:13:57 +02:00
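
The "normalize and cap chunk content" item in e943e84bd2 could look roughly like the sketch below. This is only an illustration under stated assumptions: the function name `normalize_and_cap`, the `max_chars` default, and the truncation marker are all hypothetical and are not taken from the repository.

```python
import re


def normalize_and_cap(text: str, max_chars: int = 1200,
                      marker: str = " [truncated]") -> str:
    """Collapse whitespace and cap length (hypothetical helper, not the repo's API)."""
    # Collapse every run of whitespace (newlines, tabs, spaces) into one space.
    collapsed = re.sub(r"\s+", " ", text).strip()
    if len(collapsed) <= max_chars:
        return collapsed
    # If the cap is too small to fit the marker, fall back to a hard cut.
    if max_chars <= len(marker):
        return collapsed[:max_chars]
    # Reserve room for the marker so the result never exceeds max_chars.
    keep = max_chars - len(marker)
    return collapsed[:keep].rstrip() + marker


if __name__ == "__main__":
    print(normalize_and_cap("disk  usage:\n\n  93%\t(/var)"))
    print(normalize_and_cap("x" * 50, max_chars=20))
```

Capping by characters is the simplest choice; a token-based cap would track the model's context budget more closely but needs a tokenizer.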

zphinx

be181c2d7f

feat(rag): implement Tier 1 in-memory RAG for interactive follow-ups

CI / test (push) Failing after 15s

- Add embed() to AIClient using Ollama nomic-embed-text via /v1/embeddings
- Add DEFAULT_EMBED_MODEL and embed_model field to AIConfig
- New rag_retriever.py: chunk_report(), EmbeddedChunk, retrieve() (pure-Python cosine)
- prompt_builder: add build_message_with_chunks() for RAG-aware follow-up prompts
- cli: add --no-rag flag, embed report chunks after collection, retrieve top-5 per question
- Graceful fallback to full-context if embedding model unavailable
- 16 new tests in test_rag_retriever.py (67 total, all passing)
- Add chromadb>=0.5 as optional [rag] dep in pyproject.toml
- README: add step 3 (pull nomic-embed-text), update Suggested Tooling table

2026-05-04 18:36:12 +02:00
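
The "pure-Python cosine" retrieval and "top-5 per question" behaviour described in be181c2d7f can be sketched as follows. This is a minimal illustration, not the code in `rag_retriever.py`: the `Chunk` dataclass stands in for the commit's `EmbeddedChunk`, and its fields, the `top_k` function, and its signature are assumptions.

```python
import math
from dataclasses import dataclass


@dataclass
class Chunk:
    # Hypothetical stand-in for EmbeddedChunk; the real fields are not shown in the log.
    name: str
    text: str
    vector: list[float]


def cosine(a: list[float], b: list[float]) -> float:
    """Cosine similarity of two equal-length vectors; 0.0 if either is all zeros."""
    if len(a) != len(b):
        raise ValueError("vectors must have the same length")
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def top_k(query: list[float], chunks: list[Chunk],
          k: int = 5) -> list[tuple[float, Chunk]]:
    """Return up to k (score, chunk) pairs, highest similarity first."""
    scored = [(cosine(query, c.vector), c) for c in chunks]
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return scored[:k]


if __name__ == "__main__":
    chunks = [
        Chunk("disk", "disk usage section", [1.0, 0.0]),
        Chunk("net", "network section", [0.0, 1.0]),
        Chunk("mixed", "summary section", [1.0, 1.0]),
    ]
    for score, chunk in top_k([1.0, 0.0], chunks, k=2):
        print(f"{chunk.name}: {score:.3f}")
```

A linear scan like this is adequate for an in-memory index of a single report; returning the scores alongside the chunks also makes a debug listing of chunk names and similarity scores straightforward.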

2 Commits