Reinforcement Learning Engineer
A
Appit LLC
📍 montreal (administrative region), qc, Canada
Job Description
APPIT Software Solutions is hiring a Reinforcement Learning Engineer in Montreal, Canada . Design reinforcement learning systems at APPIT Software in Montreal, building adaptive AI agents for optimization, autonomous decision-making, and RLHF alignment of large language models.
Responsibilities
- Design and implement reinforcement learning algorithms for enterprise optimization problems
- Build RLHF and reward modeling pipelines for LLM alignment and fine-tuning
- Develop simulation environments for training and evaluating RL agents
- Implement multi-agent reinforcement learning systems for complex coordination tasks
- Optimize RL training stability and sample efficiency using state-of-the-art techniques
- Collaborate with research teams to translate RL advances into production applications
Requirements
- 5+ years of ML experience with 2+ years focused on reinforceme...