SRE

Insight Global

📍 Toronto, ON, Canada

Full-time other-general Posted June 06, 2026

Job Description

Job Description
We are looking for a Site Reliability Engineer (SRE) to support and scale cutting-edge real-time transcription and summarization platforms powered by Large Language Models (LLMs). These tools operate in branch environments and must reliably handle a variety of real-time communication formats (e.g., phone, Webex), creating complex operational and reliability challenges.
This role will focus on ensuring high availability, performance, and resilience of distributed systems that process and summarize live communication data. Day-to-day responsibilities include monitoring system health, improving observability, managing incident response, optimizing infrastructure scalability, and implementing automation to maintain reliable, low-latency data pipelines across multiple platforms.

We are a company committed to creating diverse and inclusive environments where people can bring their full, authentic selves to work every day. We are an equal opportunity/affirmative...