Pre-Training Data ML Engineer — Build Scalable Pipelines (Remote)

Cohere

📍 Toronto, ON, Canada

Full-time, Other-General, Posted February 28, 2026

Job Description

A leading AI research organization is seeking a Machine Learning Engineer specializing in pretraining data to design and manage robust data pipelines. The role focuses on ingesting, cleaning, and optimizing diverse datasets to improve AI model performance. Ideal candidates will have strong software engineering skills, proficiency in Python, and experience with large-scale datasets. The position is part of a collaborative, mission-driven team working to advance AI capabilities.