Signaler

Research Engineer, Model Inference & Serving - London

PARIS, 75

il y a 1 jour

Research Engineer, Model Inference & Serving

About H: H exists to push the boundaries of superintelligence with agentic AI. By automating complex, multi-step tasks typically performed by humans, AI agents will help unlock full human potential. H is hiring the world’s best AI talent, seeking those who are dedicated as much to building safely and responsibly as to advancing disruptive agentic capabilities. We promote a mindset of openness, learning, and collaboration, where everyone has something to contribute.

About the Team: The Inference team builds and operates the systems that serve H’s foundational models in production. We focus on multimodal inference and serving for Computer Use Agents, optimizing across both the inference engine layer (e.g., vLLM, SGLang) and the model serving layer (e.g., disaggregated inference, intelligent routing). Agentic inference brings constraints around context length, multimodality, and tool calls, which we address by co-designing with the Models team on training-time choices and with the agent teams on how models are deployed. We operate at the intersection of research and production, translating cutting-edge inference techniques into the systems that power H’s next generation of agents. We are looking for strong engineers excited about inference to join the team and help shape the systems behind superintelligent AI.

Key Responsibilities:

Build and operate the inference stack that serves H’s multimodal agentic models
Improve latency, throughput, and cost of model serving across the stack
Research and implement inference techniques tailored to agent workloads
Co-design with the Models team on training-time decisions that affect inference
Collaborate with cross-functional teams to integrate inference into agentic AI products
Evaluate inference, serving, and hardware platforms, and communicate findings to stakeholders
Stay current with advancements in inference, model serving, and accelerator technology

Requirements:

Technical skills:
Research skills:
Soft skills:
Preferred qualifications:

Location:

Paris or London.
This role is hybrid, and you are expected to be in the office 3 days a week on average.
Please expect some travel between offices on a reasonable cadence (e.g., every 4-6 weeks).

What We Offer:

Join the exciting journey of shaping the future of AI
Collaborate with a fun, dynamic and multicultural team, working alongside world-class AI talent in a highly collaborative environment
Enjoy a competitive salary
Unlock opportunities for professional growth, continuous learning, and career development

#J-18808-Ljbffr

Entreprise

Dormont Manufacturing Co

Plateforme de publication

WHATJOBS

Offres pouvant vous intéresser

AI Engineer (Audio)

PARIS, 75

il y a 8 jours

AI Engineer (Audio)

PARIS, 75

il y a 8 jours

Research Engineer, Model Inference & Serving - Paris

PARIS, 75

il y a 8 jours

Research Engineer / Scientist, Post-training - London

PARIS, 75

il y a 6 jours