HPC Infrastructure Reliability Engineer (Cequeliños)
Tata Consultancy Services
📍 Arbo, Galicia, Spain
Job Description
Are you an HPC Infrastructure Reliability Engineer looking for an interesting new challenge? If so, keep reading: this role could be just what you’re looking for.
WHAT WILL YOU DO?
- Manage and optimize high-performance physical infrastructure (servers, GPUs, and advanced networking)
- Ensure availability, performance, and reliability of HPC and AI environments
- Drive infrastructure automation (IaC) and enable zero-touch provisioning
- Oversee the full hardware lifecycle (capacity planning, deployment, and decommissioning)
- Work with tools such as HPE OneView, Lenovo XClarity, and ServiceNow CMDB
- Collaborate with R&D, science, and engineering teams to design optimal infrastructure solutions
- Optimize resource utilization (CPU/GPU) and improve overall infrastructure efficiency
WHAT ARE WE LOOKING FOR?
- 5–7+ years of experience in Data Center Engineering, Bare Metal,...