20221123-1230 - HangOps conversation discussing some of the origins of Google SRE
pwnguin (Nov 18th at 20:47): https://www.usenix.org/publications/loginonline/oncall-equal-opportunity-waste-time (204 replies) nhruby (5 days ago): this is a weird article? nhruby (5 days ago): It sort of meanders around the fact that a lot of plac…
SREcon21
SREcon21, a gathering of engineers who care deeply about site reliability, systems engineering, and working with complex distributed systems at scale, will be held as a virtual event for the global SREcon community on October 12–14, 2021. — https…
Toil
In Site Reliability Engineering, we want to spend time on long-term engineering project work instead of operational work. Because the term operational work may be misinterpreted, Google came up with a specific word for it: Toil. Toil is roughly quant…
What is Site Reliability Engineering?
The term Site Reliability Engineering (SRE) originates from Google and describes the methodology and management practices Google uses to run its infrastructure. In the years since Google first started publishing and talking about its methods, SRE has…
Top-10 talks of SREcon18 Europe
It's been a month since I attended SREcon18 Europe, and most of the talks are now available online. In this article I look at the ten talks that stuck with me the most in the days and weeks following the conference. Summaries are provided to highl…