Job Summary
A company is looking for a Manager of Site Reliability Engineering, Observability.
Key Responsibilities
- Create and drive strategic organization-wide observability initiatives
- Manage day-to-day operations of the team and contribute to the SRE roadmap for observability initiatives
- Guide teams to build and maintain observable systems and support end-users with training on observability tools
Required Qualifications
- Hands-on experience managing an SRE or Observability team
- Hands-on coding/scripting experience with Go, Python, etc
- Deep understanding of observability systems and tools such as APM, Splunk, and Terraform
- Background in leading complex engineering projects in a Scrum environment
- Direct exposure to cloud infrastructure and SaaS solutions