Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

I think it's uncharitable to jump to the conclusion that just because there was a config-based outage they don't do phased config rollouts. And even more uncharitable to compare them to crowdstrike.


I have read several cloudflare postmortems and my confidence in their systems is pretty low. They used to run their entire control plane out of a single datacenter which is amateur hour for a tech company that has over $60 billion in market cap.

I also don’t understand how it is uncharitable to compare them to crowdstrike as both companies run critical systems that affect a large number of people’s lives, and both companies seem to have outages at a similar rate (if anything, cloudflare breaks more often than crowdstrike).


https://blog.cloudflare.com/18-november-2025-outage/

> The larger-than-expected feature file was then propagated to all the machines that make up our network

> As a result, every five minutes there was a chance of either a good or a bad set of configuration files being generated and rapidly propagated across the network.

I was right. Global config rollout with bad data. Basically the same failure mode of crowdstrike.


It seem fairly logical to me? If a config change causes services to crash then rollout stops … at least in every phased rollout system i’ve ever built…




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: