Job Summary
A company is looking for an Incident Manager to coordinate real-time incident response and maintain the integrity of production changes.
Key Responsibilities
- Coordinate live responses to incidents impacting data systems, applications, and infrastructure
- Lead post-incident reviews to identify root causes and track remediation progress
- Review and validate production change requests to ensure compliance with operational standards
Required Qualifications
- 2-4 years of experience in incident management or production operations
- Strong background in data, application, infrastructure, or automation platforms
- Familiarity with observability tools and incident management platforms
- Ability to analyze logs and identify issues using monitoring data
- Experience applying ITIL, SRE, or DevOps principles in real-time operations