Cloud Reliability SRE: Incident Management & Observability
I
IBM
📍 markham, wales, United-Kingdom
Job Description
IBM is seeking an expert-level Reliability Engineer to enhance system reliability within a global multi-cloud environment. This position involves analyzing failure patterns, improving tooling, and coordinating incident management practices across engineering teams. Candidates must have over 10 years of experience in SRE or incident management, strong cloud skills with AWS, GCP, or Azure, and proficiency with tools like Rootly and PagerDuty. Join IBM to shape the reliability practices that power digital transformation.
#J-18808-Ljbffr
#J-18808-Ljbffr