Remote RLHF Specialist: AI Alignment & Safety
LA RÉUNION, FRANCE
il y a 26 jours
A leading AI consulting firm is seeking an RLHF Specialist to improve AI models using Reinforcement Learning from Human Feedback methodologies. The role involves generating preference data, designing prompts to test models, and collaborating with engineers to enhance model performance. Required qualifications include 2+ years in data annotation or model evaluation, strong skills in Python and deep learning frameworks, and experience fine-tuning open-source models. This position offers remote work opportunities.
#J-18808-Ljbffr
Entreprise
Odixcity Consulting
Plateforme de publication
WHATJOBS
Offres pouvant vous intéresser
LA RÉUNION, FRANCE
il y a 26 jours
PARIS, 75
il y a 26 jours
PARIS, 75
il y a 22 jours
FRANCE
il y a 26 jours