RL Engineer: LLMs & Code Gen - Hybrid

Code Metal

📍 boston, davao oriental, Philippines

Full-time Engineering Posted June 06, 2026

Job Description

Code Metal in Boston, Davao Oriental, Philippines is seeking a skilled professional to bridge production and research roles in AI. You will be responsible for building distributed training systems using PyTorch and developing scalable data curation pipelines.

The ideal candidate has strong expertise in reinforcement learning and will engage with frontier research to apply RLHF to Large Language Models, particularly in code generation tasks. Benefits include comprehensive health care, a 401k with matching, and a flexible hybrid work arrangement.

#J-18808-Ljbffr