Weekly / Every 10 Days

Posts

Notes on Kubernetes, AI systems, and the craft of reliable software. Expect concise walkthroughs, incident postmortems, and platform patterns.

Topics:
May 5, 2026 7 min read Kubernetes

Designing Sidecar Patterns for ML Inference

How to run GPU-bound inference sidecars safely on multi-tenant clusters, including resource isolation, eviction controls, and rollout strategies.

Read →
Apr 25, 2026 5 min read AI Systems

Observability for Retrieval Pipelines

A quick checklist for tracing vector search, caching layers, and LLM calls with minimal vendor lock-in.

Read →
Apr 14, 2026 6 min read SRE

Runbook-Driven Deploys on a Small Team

Lightweight runbooks that keep deploys predictable without slowing down a team of three engineers.

Read →
Mar 31, 2026 4 min read Career

Career Ladders for Platform Engineers

Mapping impact for platform roles—what “leveling up” looks like beyond ticket throughput.

Read →