Hugging Face is looking for a Research Engineer focused on efficient ML to work on model compression, quantization, and optimization techniques. You'll contribute to our open-source libraries (PEFT, bitsandbytes, Optimum) and help make state-of-the-art models accessible to everyone, including those without access to expensive hardware.
Your work will directly impact millions of developers who rely on Hugging Face to run models efficiently.
Anthropic
OpenAI
Hugging Face
Scale AI