
ML Runtime Optimization Engineer

Sunnyvale · full-time · mid
Optimization Engineer · Data & AI


Technical Tools
pytorch, deep-learning, performance-optimization

About the Role


We are looking for a software engineer with deep experience in optimizing ML models and deploying them on production-grade embedded runtime environments. You’ll work across the entire ML framework stack (e.g. PyTorch, JAX, ONNX, TensorRT, CUDA, XLA, Triton).

  • Drive ML performance optimization on multiple technologies for on-road and off-road ADAS / AD stacks targeting deployment on a variety of embedded compute platforms 

  • Develop compute usage strategies to optimize efficiency and latency of model inference for compute boards selected by our customers

  • Work on model pruning and quantization, and support deployment on memory-constrained platforms

  • Collaborate closely with ML engineers and software developers to find and optimize efficient model architectures

  • Set up methodologies to profile the model performance on target embedded compute platforms and identify performance bottlenecks as part of stack integration 

Requirements

  • Bachelor's degree (B.Sc. or equivalent) in Electrical Engineering, Computer Science, Mathematics, Physics, or a related field

  • 3+ years of experience with ML accelerators, GPUs, CPUs, and SoC architecture and microarchitecture

  • Strong software development skills with a focus on embedded programming

  • Experience profiling and optimizing model performance on embedded compute platforms

  • Experience working with deep learning frameworks (e.g., PyTorch, JAX, ONNX)

Nice to Have

  • M.Sc. or PhD in an ML-related area

  • Has built an ML optimization framework from scratch

  • Has deployed ML solutions to embedded chips for real-time robotics applications

Compensation at Applied Intuition for eligible roles includes base salary, equity, and benefits. Base salary is a single component of the total compensation package, which may also include equity in the form of options and/or restricted stock units, comprehensive health, dental, vision, life and disability insurance coverage, 401(k) retirement benefits with employer match, learning and wellness stipends, and paid time off. Note that benefits are subject to change and may vary based on jurisdiction of employment.

Applied Intuition pay ranges reflect the minimum and maximum intended target base salary for new hire salaries for the position. The actual base salary offered to a successful candidate will additionally be influenced by a variety of factors including experience, credentials & certifications, educational attainment, skill level requirements, interview performance, and the level and scope of the position.

Please reference the job posting’s subtitle for where this position will be located. For pay transparency purposes, the base salary range for this full-time position in the location listed is: $159,053 - $199,295 USD annually. 

Location & Eligibility

Where is the job
Sunnyvale
On-site at the office
Who can apply
Same as job location

Listing Details

Posted
March 5, 2025
First seen
May 5, 2026
Last seen
May 8, 2026

Posting Health

Days active
0
Repost count
0
Trust Level
14%
Scored at
May 6, 2026
