Job Description
Job Title: Reliability Engineer
This role focuses on ensuring fleet-scale reliability, availability, and performance of large-scale robotic systems. You will diagnose and resolve complex system-level issues across software, hardware, controls, and infrastructure, while driving continuous improvements in robustness, fault tolerance, and scalability. The position combines hands-on debugging, data-driven performance optimization, and close collaboration with cross-functional and field teams to keep thousands of deployed robots operating reliably in production environments.
Responsibilities
+ Identify, triage, and determine root causes of system-level issues impacting large-scale robotic fleets.
+ Drive improvements in system reliability, availability, and performance across thousands of deployed robots.
+ Define, implement, and monitor system performance guardrails tied to customer KPIs such as throughput, error rates, recovery time, and upt...