Profluent3mo ago

USD 200000-330000/yr

Machine Learning Research Engineer

United States·EmeryvilleHybridmid

OtherMachine Learning EngineerDataMachine Learning Research Engineer

5 views0 saves0 applied

Apply Now

Quick Summary

Overview

Profluent is an AI-first protein design company. Founded in 2022, we develop deep generative models to design and validate novel, functional proteins to revolutionize biomedicine.

Key Responsibilities

Build robust, reproducible and user-friendly pipelines for automated model fine-tuning, alignment and evaluation Design and implement modular, easy-to-maintain, multi-model pipelines for protein design.

Requirements Summary

You're comfortable taking ownership and working independently in a fast-moving environment You're an execution-oriented engineer who maintains high standards, and focuses on the highest-impact work You're comfortable owning the full stack of your…

Technical Tools

awsazuredockergcpkubernetespythonpytorchetlmachine-learning

Profluent is an AI-first protein design company. Founded in 2022, we develop deep generative models to design and validate novel, functional proteins to revolutionize biomedicine. Based in Emeryville, CA, we are backed by leading investors including Altimeter Capital, Bezos Expeditions, Spark Capital, Insight Partners, Air Street Capital, AIX Ventures, and Convergent Ventures, and have raised over $150M to date.

We're looking for an experienced Machine Learning Engineer to build and improve the models and ML systems that drive our protein design efforts. In this role, you'll deploy and optimize large-scale generative models for protein design, and develop the surrounding infrastructure and tooling that enable our ML and protein design scientists to work faster and more confidently. As an early member of a small, fast-moving engineering team, you'll have significant ownership over our ML stack and the opportunity to shape how our platform evolves.

Responsibilities

~1 min read

→Build robust, reproducible and user-friendly pipelines for automated model fine-tuning, alignment and evaluation
→Design and implement modular, easy-to-maintain, multi-model pipelines for protein design.
→Develop highly scalable ETL pipelines to process petabyte-scale protein data for model pretraining
→Optimize model training and inference code to maximize throughput and resource utilization when deployed at scale
→Develop software and infrastructure that enable the ML team to work quickly and frictionlessly in distributed and multi-cloud environments
→Partner with ML and protein design scientists to prototype research ideas and bring them into production

You're comfortable taking ownership and working independently in a fast-moving environment
You're an execution-oriented engineer who maintains high standards, and focuses on the highest-impact work
You're comfortable owning the full stack of your work, from training code to the infrastructure it runs on
You care deeply about model quality, efficiency, and reliability
You're willing to step beyond your core responsibilities when the team needs it

Building hyperparameter search frameworks for SFT and Alignment workflows
Increasing protein language model throughput during long context generation
Updating existing model architectures to work and run efficiently on new GPU hardware
Implementing a protein design pipeline that integrates prompt retrieval, sequence generation, attribute prediction, and structure prediction
Establishing an ETL pipeline for sampling and tokenizing training datasets from an internal database of billions of sequences
Developing a benchmarking and evaluation system for newly trained sequence generation models
Contributing to the development of an internal service that provides transparent multi-node job submission for ML scientists

Requirements

~1 min read

BS or MS in Computer Science, Machine Learning, or a related field
3+ years of hands-on experience building and training ML models in PyTorch
Strong Python and software engineering fundamentals, including testing, code quality, and version control
Experience profiling, benchmarking, and optimizing ML model training and inference
Experience implementing or optimizing transformer-based architectures
Familiarity with cloud infrastructure and containerization (GCP, AWS, Azure, Kubernetes, Docker)
Strong fundamentals in ML, statistics, and/or linear algebra

Familiarity with protein language models or computational biology
Experience with GPU-level optimization (CUDA, Triton)
Experience with distributed training (DDP, FSDP, multi-node GPU clusters)
Experience with databases and data processing pipelines
Experience orchestrating multi-step ML workflows
Experience building backend systems that serve ML models in production
Contributions to open source ML projects or published research

What We Offer

~1 min read

✓High-growth opportunity with meaningful impact on the future of protein design

✓Competitive compensation package with equity participation

✓401(k) with a strong employer match

✓Comprehensive benefits including health/dental/vision insurance

✓Generous PTO policy and commitment to work-life balance

✓Professional development opportunities in a cutting-edge field at the intersection of AI and biology

Legal authorization to work in the United States is required. In compliance with federal law, all persons hired must verify their identity and work eligibility and complete the required employment verification form upon hire.

Hiring Salary Range

$200,000—$330,000 USD

Location & Eligibility

Where is the job

Emeryville, United States

Hybrid — some on-site time required

Who can apply

Listing Details

Posted: April 27, 2026
First seen: April 27, 2026
Last seen: July 27, 2026

Posting Health

Days active: 90
Repost count: 0
Trust Level: 44%
Scored at: July 26, 2026

Signal breakdown

freshnesssource trustcontent trustemployer trust

Apply for this position

Profluent

greenhouse

Profluent is an AI-first protein design company focused on developing generative models to create novel proteins for transformative applications in biomedicine.

Employees

125

Founded

2022

Domain

profluent.bio

Jobs

View 20 jobs

View company profile

Salary

USD 200000-330000

per year