How an innocent OS upgrade triggered a cascade of issues and forced us into tracing Linux networking internals.
How Cloudflare was able to save hundreds of gigabits of network bandwidth and terabytes of storage from Kafka.
We use Salt to manage our ever-growing global fleet of machines. Salt is great for managing configurations and serving as the source of truth. We use it for remote command execution and for network automation tasks. It allows us to grow our infrastructure quickly with minimal human intervention.
The following blog post describes a debugging adventure on Cloudflare's Mesos-based cluster. This internal cluster is primarily used to process log file information so that Cloudflare customers have analytics, and for our systems that detect and respond to attacks. The problem encountered didn't have any effect on our customers, but