File tree Expand file tree Collapse file tree 1 file changed +6
-4
lines changed Expand file tree Collapse file tree 1 file changed +6
-4
lines changed Original file line number Diff line number Diff line change @@ -10,13 +10,15 @@ team: Core Platform
10
10
---
11
11
12
12
Nobody likes to be woken up in the middle of the night, but if you've got to do
13
- it, make sure you pick the right person. Scribd has long used
13
+ it, make sure you pick the right person to solve the problem . Scribd has long used
14
14
[ PagerDuty] ( https://pagerduty.com ) for managing on-call rotations, but only
15
15
within the "Core Infrastructure" team. All production incidents were routed to
16
- a single group of infrastructure engineers. Clearly not a good idea. To help
16
+ a single group of infrastructure engineers rather than developers who were
17
+ committing code to the service. Clearly not a good idea. To help
17
18
with our migration to AWS, we recognized the need to move to a more
18
- _ distributed_ model of incident response, and the Core Platform team ended up
19
- being a suitable test subject.
19
+ _ distributed_ model of incident response, with developers taking on more
20
+ responsibility. We needed to try something different and the Core Platform team
21
+ ended up being a suitable test subject for our experiments.
20
22
21
23
22
24
The idea of transitioning from "nobody is on-call" to "everybody is on-call"
You can’t perform that action at this time.
0 commit comments