20221123-1230 - HangOps conversation discussing some of the origins of Google SRE

pwnguin Nov 18th at 20:47 https://www.usenix.org/publications/loginonline/oncall-equal-opportunity-waste-time 204 replies nhruby  5 days ago this is a weird article? nhruby  5 days ago It sort of meanders around the fact that a lot of plac…
Read more →

SREcon21

SREcon21, a gathering of engineers who care deeply about site reliability, systems engineering, and working with complex distributed systems at scale, will be held as a virtual event for the global SREcon community on October 12–14, 2021. — https…
Read more →

Toil

In Site Reliability Engineering, we want to spend time on long-term engineering project work instead of operational work. Because the term operational work may be misinterpreted, Google came up with a specific word for it: Toil. Toil is roughly quant…
Read more →

What is Site Reliability Engineering?

The term Site Reliability Engineering (SRE) originates from Google and describes the methodology and management practices Google uses to run its infrastructure. In the years since Google first started publishing and talking about its methods, SRE has…
Read more →

SREcon19 EMEA

Notes on talks I attended while visiting SRECon in Dublin, 2019.
Read more →

Top-10 talks of SREcon18 Europe

It's been a month since I attended SREcon18 Europe and the majority of talks is now available online. In this article I look at the ten talks which stuck with me the most in the days and weeks following the conference. Summaries are provided to highl…
Read more →