Remote RLHF Specialist: AI Alignment & Safety

LA RÉUNION, FRANCE

26 days ago

A leading AI consulting firm is seeking an RLHF Specialist to improve AI models using Reinforcement Learning from Human Feedback (RLHF). The role involves generating preference data, designing prompts to test models, and collaborating with engineers to enhance model performance. Required qualifications include 2+ years of experience in data annotation or model evaluation, strong skills in Python and deep learning frameworks, and experience fine-tuning open-source models. The position is remote.

Company

Odixcity Consulting

Posting platform

WHATJOBS
