Outage and Security Postmortems

January 24, 2013

There are always things to learn about distributed systems, especially how they can turn against you. Companies that publish postmortems are doing us a great favor, there is a jackpot of system design and operations knowledge to be gleaned from studying as many of these as you can get your hands on.

I’ve collected [now over 250] outage and security related postmortems in this Pinboard feed.

Read More

Keep a Small Surface – Webapp Isolation

November 11, 2010

Web application developers, in particular ones on a small team, are usually focused on the next feature or getting MVPs out, not security.

When security does come up, the focus is usually mitigating direct webapp attacks. We rely on Django or RoR‘s mechanisms for XSS/CSRF protection and password hashing. We turn to App Engine, Heroku, or traditional hosts for DDoS protection. And so on.

All of that is important and worth doing your due diligence on, but what’s the plan if/when your webapp gets entirely owned?

Here’s a way to mitigate the damages, something that is doable even when you are on a small team or working alone. There are nice side effects, too.

Read More


November 10, 2010

I am no longer adding anything to gridvm.org, that was probably obvious at some point last year. A young child and increasing work responsibilities will do that to you. But a bigger issue with that site than lack of free time was that the topic did not feel right anymore.

Read More