RAG
Every post tagged "RAG" · articles, case studies, guides.
- 22 April 2026 · AI solutions · Website & online shoppgvector at 10M+ rows · index choice, query patterns, real performance numberspgvector at 10M rows is not scary · if you pick the right index. HNSW vs IVFFlat, filter patterns, real numbers.
- 22 April 2026 · AI solutionsLLM prompt caching in production · a 60-80% cost cutPrompt caching is the single biggest LLM cost lever in 2026. 4 patterns, real savings numbers, 2 gotchas worth knowing.
- 22 April 2026 · AI solutionsLLM evals-as-code · the CI gate we run on every RAG deployAn eval that's not in CI is not an eval. Here's the evals-as-code workflow we run on every RAG project.
- 20 April 2026 · AI solutionsHow to ship a production AI chatbot in 14 daysFourteen days from zero to a live AI chatbot your company can actually use. The schedule we follow on every client project, down to what happens on each day.
- 08 April 2026 · AI solutionsShipping AI agents that actually work in productionFrom demo to live system: the retrieval, eval, guardrails and cost control we run on every AI project we ship.
- 22 January 2026 · AI solutions · Website & online shopPicking a vector DB in 2026: pgvector, Pinecone, WeaviateThree serious vector DBs, three very different DNA. Here's the decision framework that held up across our 2026 projects.