Job Description
We are looking for a Site Reliability Engineer to devise and develop automations and provision scripts and environments, assess and maintain systems, as well as brainstorm possible improvements that can be made to our systems in the future
Responsibilities:
Maximize system uptime and infrastructure availability, ensuring functional and performance SLAs.
Establish end-to-end monitoring and alerting on all critical aspects.
Solve complex problems for critical services and build automation to prevent problem recurrence.
Influence and create new designs, architectures, standards, and methods for supporting the platform.
Initiate and lead scripting and automation to streamline system updates and upgrades.
Set up critical infrastructure, tools, and framework to streamline the deployment cycle.
Work cross-functionally with Services and Engineering teams.
Qualifications:
Bachelor’s degree in a Technol...