featherlessai5mo ago

AI Researcher — Distillation

(world)Remotefull-timemid

OtherAi Researcher

3 views0 saves0 applied

Apply Now

Quick Summary

Overview

About the Role We’re looking for an AI Researcher focused on model distillation to help us push the frontier of efficient, high-performance models. You’ll work on turning large, expensive models into smaller, faster, and more deployable systems—while maintaining or improving quality.

Requirements Summary

Experience distilling large language models Work on efficiency-focused research (latency, memory, throughput) Experience with long-context models or non-Transformer architectures Open-source contributions in ML or research tooling Prior startup or…

Technical Tools

pytorchab-testingdeep-learningmachine-learning

About the Role

~1 min read

We’re looking for an AI Researcher focused on model distillation to help us push the frontier of efficient, high-performance models. You’ll work on turning large, expensive models into smaller, faster, and more deployable systems—while maintaining or improving quality.

This role is ideal for someone who enjoys publishing research, working close to real systems, and seeing their ideas move from papers → code → production.

Design and evaluate model distillation techniques (teacher–student training, self-distillation, layer-wise distillation, representation matching, etc.)
Research tradeoffs between model size, latency, memory, and accuracy
Develop novel distillation approaches for:
- Large language models
- Long-context or specialized architectures
- Inference-constrained environments
Run large-scale experiments and ablations; analyze results rigorously
Collaborate with engineers to productionize research outcomes
Write and submit research papers to top-tier venues (NeurIPS, ICML, ICLR, COLM, etc.)
Contribute to internal research notes, technical blogs, and open-source projects when appropriate

Strong background in machine learning research
Hands-on experience with model distillation or closely related topics (compression, pruning, quantization, representation learning)