inference
inference4mo ago
New
$250K – $350K • Offers Equity/yr

Machine Learning Researcher

San Franciscofull-timemid
OtherMachine Learning Researcher
0 views0 saves0 applied

Quick Summary

Overview

Help us push the boundaries of what's possible in LLM post-training. If you love training models, exploring new architectures, running experiments, and turning research insights into products that ship, we'd love to meet you.

Key Responsibilities

Research and experiment with new model architectures to improve quality, efficiency, or capability Explore methods to decrease inference latency and improve serving efficiency Run experiments with new learning methods, including novel approaches to…

Requirements Summary

3+ years of experience training AI models using PyTorch Deep understanding of transformer architectures, attention mechanisms, and model internals Hands-on experience with post-training LLMs using SFT, RLHF, DPO, or other alignment techniques…

Technical Tools
huggingfacepytorchmachine-learning

Help us push the boundaries of what's possible in LLM post-training. If you love training models, exploring new architectures, running experiments, and turning research insights into products that ship, we'd love to meet you.

About the Role

~1 min read

You will be responsible for conducting research into experimental models, training systems, and modalities to create novel products for our customers. Your work will span from exploring new architectures and learning methods to optimizing latency and efficiency, with the goal of delivering better models to customers.

Your north star is pushing the frontier of what's possible in LLM post-training. You'll explore new techniques, run rigorous experiments, and when something works, help bring it into production with the help of your teammates. This includes training models for customers and running evaluations as part of validating your research. This role reports directly to the founding team. You'll have the autonomy, a large compute budget / GPU reservation, and technical support to explore ambitious ideas and ship the ones that work.

Responsibilities

~1 min read
  • Research and experiment with new model architectures to improve quality, efficiency, or capability

  • Explore methods to decrease inference latency and improve serving efficiency

  • Run experiments with new learning methods, including novel approaches to SFT, RLHF, DPO, and other post-training techniques

  • Perform reinforcement learning research to improve model alignment and capability

  • Develop and improve our distillation pipeline for training high-quality models from frontier teachers

  • Train models for clients and run evaluations to validate research findings in production settings

  • Create robust benchmarks and evaluation frameworks that ensure custom models match or exceed frontier performance

  • Stay current with ML research and identify techniques that can improve our platform

  • Collaborate with applied engineers to bring successful research into production systems

  • Document findings and share knowledge with the team

Requirements

~1 min read
  • 3+ years of experience training AI models using PyTorch

  • Deep understanding of transformer architectures, attention mechanisms, and model internals

  • Hands-on experience with post-training LLMs using SFT, RLHF, DPO, or other alignment techniques

  • Experience with LLM-specific training frameworks (e.g., Hugging Face Transformers, DeepSpeed, Megatron, TRL, or similar)

  • Strong experimental methodology, including ability to design, run, and analyze rigorous experiments

  • Track record of implementing ideas from recent ML papers

  • Experience training on NVIDIA GPUs at scale

  • Strong foundation in ML fundamentals: optimization, loss functions, regularization, generalization

  • Publications in ML venues

  • Experience with model distillation or knowledge transfer

  • Experience with LLM speed optimization techniques

  • Familiarity with vision encoders, multimodal models, or other modalities

  • Experience with distributed training and infrastructure at scale

  • Contributions to open-source ML projects

You don't need to tick every box. Curiosity and the ability to learn quickly matter more.

What We Offer

~1 min read

We offer competitive compensation, equity in a high-growth startup, and comprehensive benefits. The base salary range for this role is $250,000 - $350,000, plus equity and benefits, depending on experience.

Inference.net is an equal opportunity employer. We welcome applicants from all backgrounds and don't discriminate based on race, color, religion, gender, sexual orientation, national origin, genetics, disability, age, or veteran status.

If you're excited about pushing the boundaries of custom AI research, we'd love to hear from you. Please send your resume and GitHub to amar@inference.net and/or here on Ashby.

Location & Eligibility

Where is the job
San Francisco
On-site at the office
Who can apply
Same as job location

Listing Details

Posted
January 5, 2026
First seen
May 6, 2026
Last seen
May 8, 2026

Posting Health

Days active
0
Repost count
0
Trust Level
25%
Scored at
May 6, 2026

Signal breakdown

freshnesssource trustcontent trustemployer trust
Newsletter

Stay ahead of the market

Get the latest job openings, salary trends, and hiring insights delivered to your inbox every week.

A
B
C
D
Join 12,000+ marketers

No spam. Unsubscribe at any time.

inferenceMachine Learning Researcher$250K – $350K • Offers Equity