A Byzantine failure in the real world
November 27, 2020 12:00PM
When we review design documents at Cloudflare, we are always on the lookout for Single Points of Failure (SPOFs). In this post, we present a timeline of a real-world incident, and how an interesting failure mode known as a Byzantine fault played a role in a cascading series of events....
Continue reading »
Cloudflare outage on July 17, 2020
July 18, 2020 2:22AM
Today a configuration error in our backbone network caused an outage for Internet properties and Cloudflare services that lasted 27 minutes. We saw traffic drop by about 50% across our network....
Using data science and machine learning for improved customer support
June 15, 2020 12:00PM
In this blog post we’ll explore three tricks that can be used for data science that helped us solve real problems for our customer support group and our customers. Two for natural language processing in a customer support context and one for identifying attack Internet attack traffic....
Helping sites get back online: the origin monitoring intern project
April 13, 2020 12:00PM
Life @ Cloudflare
Over the course of ten weeks, our team of three interns (two engineering, one product management) went from a problem statement to a new feature, which is still working in production for all Cloudflare customers....
Internship Experience: Cryptography Engineer
April 09, 2020 12:00PM
Back in the summer of 2017 I was an intern at Cloudflare. During the scholastic year I was a grad student working on automorphic forms and computational Langlands at Berkeley....