- Administer production jobs
- Understand debugging info
- “Drain” traffic away from a cluster
- Roll back a bad software push
- Block or rate-limit unwanted traffic
- Bring up additional serving capacity
- Use the monitoring systems (for alerting and dashboards)
- Proven work experience as a Site Engineer or similar role
- Collaborate and communicate asynchronously
- Document all the things so you don’t need to learn the same thing twice
- Have an enthusiastic, go-for-it attitude
- Relevant training and/or certifications as a Site Reliability Engineer