LLM
Every post tagged "LLM" · articles, case studies, guides.
- 01→
AI agent pricing 2026: what an autonomous agent costs
An AI agent is not a chatbot with extra steps: it takes actions, and that changes the bill. Here are the real 2026 ranges and what drives them.
AI solutions
- 02→
Self-hosted AI or the API? When to run your own LLM in 2026
Calling the OpenAI or Anthropic API is the right default for most AI features. But data sensitivity, steady high volume, or strict EU residency can flip the answer. Here's the honest decision.
AI solutions
- 03→
RAG's three failure modes (and the diagnostic table)
Three failure modes, one table. 30 minutes of diagnosis, then you know what to fix. Stop guessing.
AI solutions
- 04→
Build an LLM Eval Harness in 200 Lines of TS
Frameworks are great until they get in the way. Here is a 200-line TS eval harness that runs in CI, blocks regressions and prints a diff.
AI solutions
- 05→
Why your AI agent leaks money: 6 prompt-cache wins
Six prompt-cache patterns. Real before/after numbers. Most agents leave 60-80% on the table. Fix it this week.
AI solutions
- 06→
OWASP LLM Top 10 v2 · what changed and what to ship
v2 of the LLM Top 10 reorganised around how teams actually get hit. Here is what moved, what is new, and the default controls we ship.
Cybersecurity
- 07→
On-device LLMs in 2026: Gemini Nano vs Apple Intelligence
On-device LLMs are finally usable for production features. Where Gemini Nano wins, where Apple Intelligence wins, and the Hungarian-language gap.
AI solutions · Mobile apps · iPhone & Android
- 08→
LLM prompt caching in production · a 60-80% cost cut
Prompt caching is the single biggest LLM cost lever in 2026. 4 patterns, real savings numbers, 2 gotchas worth knowing.
AI solutions
- 09→
Agentic AI · the safe tool-use pattern we ship by default
Agentic AI that can send email and move money is not just a chatbot. Here's the safe tool-use pattern we ship.
AI solutions · Cybersecurity
- 10→
LLM evals-as-code · the CI gate we run on every RAG deploy
An eval that's not in CI is not an eval. Here's the evals-as-code workflow we run on every RAG project.
AI solutions
- 11→
What an AI security audit actually checks in 2026
AI security isn't a checkbox. Here's the nine-point audit we run on every LLM system we ship, plus which bugs turn up most often on systems we didn't build.
AI solutions · Cybersecurity
- 12→
LLM prompt injection playbook · the 2026 attack surface
The prompt injection surface is not a single bug; it's five categories, each with a distinct defence. Here's our playbook.
AI solutions · Cybersecurity
- 13→
MCP (Model Context Protocol): what it means for LLM agents
MCP is the most important agent standard of the past year. What it means in practice, where we use it, and why to bet on it in 2026.
AI solutions
- 14→
Shipping AI agents that actually work in production
From demo to live system: the retrieval, eval, guardrails and cost control we run on every AI project we ship.
AI solutions