Writing about SRE, infrastructure as code, and the things I learn along the way.
The story of how we built recovery tooling to patch a Celery broker cluster without downtime, and what AI-supported cross-team work looks like in practice.