Skip to content
Blog

Notes on engineering AI

Hard-won lessons on RAG, agents, fine-tuning and getting AI to behave in production — no hype, just what works.

RAG that actually works: evaluation before vibes
RAGEvaluation

RAG that actually works: evaluation before vibes

Most retrieval-augmented systems fail not at retrieval, but at proving they're right. Here's the engineering discipline that turns a flaky RAG demo into something your customers can trust.

12 min read
Read article
Shipping AI agents to production without the chaos
AgentsMLOps

Shipping AI agents to production without the chaos

Autonomous agents are thrilling in a notebook and terrifying in production. Guardrails, observability and gradual rollout are the boring engineering that turns one into the other.

12 min read
The real economics of LLM inference: cost without compromise
InferenceInfrastructure

The real economics of LLM inference: cost without compromise

An AI feature that delights in a pilot can quietly become unaffordable at scale. The levers that cut inference cost by an order of magnitude — without cutting the quality your users notice.

12 min read

Let's build the AI that moves your business.

Tell us the problem. We'll propose the smallest first step that proves real value — usually within a week.