Weekly / Every 10 Days

Posts

Notes on Kubernetes, AI systems, and the craft of reliable software. Expect concise walkthroughs, incident postmortems, and platform patterns.

Get new posts by email RSS

May 5, 2026 7 min read Kubernetes

How to run GPU-bound inference sidecars safely on multi-tenant clusters, including resource isolation, eviction controls, and rollout strategies.

Read →

Apr 25, 2026 5 min read AI Systems

A quick checklist for tracing vector search, caching layers, and LLM calls with minimal vendor lock-in.

Read →

Apr 14, 2026 6 min read SRE

Lightweight runbooks that keep deploys predictable without slowing down a team of three engineers.

Read →

Mar 31, 2026 4 min read Career

Mapping impact for platform roles—what “leveling up” looks like beyond ticket throughput.

Read →