Posted in

Manager – Incident Management

Manager – Incident Management

CompanyCaseware
LocationToronto, ON, Canada
Salary$Not Provided – $Not Provided
TypeFull-Time
Degrees
Experience LevelSenior

Requirements

  • Prior in a similar role, preferably within a software or technology company.
  • Strong technical background with experience in incident management and response.
  • Ability to drive teams to resolve incidents quickly.
  • Understanding of software landscape and system integration.
  • Excellent written and verbal communication skills.
  • Ability to work effectively under pressure and manage multiple priorities.

Responsibilities

  • Build and lead an incident management team to respond to software incidents.
  • Manage a 24/7 on-call rotation to ensure timely incident response.
  • Leverage systems such as PagerDuty, NewRelic, AWS, Microsoft Teams, and Slack to monitor and manage incidents.
  • Drive teams to resolve incidents quickly and efficiently.
  • Understand our software landscape and how systems and teams integrate.
  • Ensure the right people are involved in an active incident to facilitate rapid recovery.
  • Communicate effectively with cross-functional teams and end-user stakeholders to provide updates and resolutions.
  • Develop and implement incident management processes and best practices.
  • Track and provide uptime metrics to internal and external stakeholders, ensuring transparency in system reliability and incident recovery performance.
  • Organize and run post-mortem meetings following major incidents. Document root causes, lessons learned, and actionable steps to improve processes. Follow up on action items to ensure their completion and track progress to prevent recurrence of similar incidents.
  • Implement proactive strategies and tools to mitigate risks and strengthen system resilience.

Preferred Qualifications

    No preferred qualifications provided.