All Posts

700ms to 2ms: What a Cluster Fire Taught Me About Embedding

700ms. That was the number that haunted my Kubernetes cluster, slowly burning it to the ground. Every alert the cluster generated, every log line it processed for AI-driven feedback, triggered an embedding operation.

The AI That Monitored Your Cluster Just Brought It Down

“Why can’t I see the new photos?” That’s how the outage started. Not with a PagerDuty alert or a Grafana dashboard turning red, but with a casual question from my wife.

€200 Claude.ai bill in one week — so I built a cheaper alternative

April 2026 — one week of intensive AI-assisted work, one surprising bill, and one decision to do something about it. The Claude.ai usage screen showed €169.

What the forum posts don't tell you: fifteen years with a self-built heating system

April 2026 — on the gap between a well-researched plan and a wet crawl space. The design article made it sound rational. The build article made it sound competent.

cluster-shepherd: The AI Ops Agent That Actually Knows Your Cluster

April 2026 — what happens when you stop treating AI as a search engine and start treating it as a co-pilot with real cluster access

When Gemini Says Nothing: Two Silent Failure Modes in MCP + LibreChat

April 2026 — field notes from wiring a Kubernetes SRE agent to Gemini 2.5 Flash. I spent the better part of two days debugging an AI agent that would reliably respond with… nothing.