Principal Machine Learning Infrastructure Researcher

Lightmatter

📍 toronto, on, Canada

Full-time Other-General Posted June 04, 2026

Job Description

Requirements

  • PhD in Computer Science, Electrical Engineering, or a related field and at least 10 years of industry or research experience in the field
  • Solid background in Machine Learning systems research and experience with performance evaluation and performance modeling
  • Hands‑on experience with machine learning / deep learning methods and extensive knowledge of Large Language Model training and inference frameworks
  • Academic publications in relevant journals and industry conferences
  • Strong knowledge of AI systems, including compute (GPUs, TPUs, etc.) and networking (Ethernet, Infiniband, NVLink, etc.) technologies, and their various topologies, paradigms, and trade‑offs
  • Proven track record leading a team to deliver published results
  • (Desirable) Demonstrated ability to creatively problem‑solve and apply first‑principles thinking to generate novel ideas, particularly in ambiguous, evolving environments