Software Engineer, AI Compute Infrastructure
Quick Summary
Design and implement mechanisms to aggressively optimize GPU and cluster utilization across thousands of devices for inference, training,
At HeyGen, our mission is to make visual storytelling accessible to all. Over the last decade, visual content has become the preferred method of information creation, consumption, and retention. But the ability to create such content, in particular videos, continues to be costly and challenging to scale. Our ambition is to build technology that equips more people with the power to reach, captivate, and inspire audiences.
Learn more at www.heygen.com. Visit our Mission and Culture doc here.
We are seeking a seasoned Software Engineer to build and scale the foundational compute infrastructure that powers our state-of-the-art AI models—from multimodal training data pipelines to high-throughput, low-latency video generation.
Responsibilities
~1 min readYou will be the core engineer responsible for building the robust, efficient, and scalable platform that enables our research and production teams to rapidly iterate on HeyGen's generative video models. Your contributions will directly impact model performance, developer productivity, and the final quality of every AI-generated video.
- →
Requirements
~1 min readStrong proficiency in Python and a high-performance language such as C++ for developing core infrastructure components.
Deep understanding and hands-on experience with modern orchestration and distributed computing frameworks such as Kubernetes and Ray.
Experience with core ML frameworks such as PyTorch, TensorFlow, or JAX.
Requirements
~1 min read-
Master's or PhD in Computer Science or a related technical field.
-
Demonstrated Tech Lead experience, driving projects from conceptual design through to production deployment across cross-functional teams.
-
Prior experience building infrastructure specifically for Generative AI models (e.g., diffusion models, GANs, or large language models) where cost and latency are critical.
-
Proven background in building and operating large-scale data infrastructure (e.g., Ray, Apache Spark) to manage petabytes of multi-modal data (video, audio, text).
- Expertise in GPU acceleration and deep familiarity with low-level compute programming, including CUDA, NCCL, or similar technologies for efficient inter-GPU communication.
- Competitive salary and benefits package.
- Dynamic and inclusive work environment.
- Opportunities for professional growth and advancement.
- Collaborative culture that values innovation and creativity.
- Access to the latest technologies and tools.
HeyGen is an Equal Opportunity Employer. We celebrate diversity and are committed to creating an inclusive environment for all employees.
Listing Details
- Posted
- February 11, 2026
- First seen
- March 26, 2026
- Last seen
- April 17, 2026
Posting Health
- Days active
- 21
- Repost count
- 0
- Trust Level
- 45%
- Scored at
- April 17, 2026
Signal breakdown

HeyGen is an AI-powered video generation platform that enables businesses and individuals to create professional-quality videos with AI avatars and voices, supporting localization in numerous languages.
View company profilePlease let Heygen know you found this job on Jobera.
4 other jobs at Heygen
View all →Explore open roles at Heygen.
Similar Software Engineer, AI Compute Infrastructure jobs
Stay ahead of the market
Get the latest job openings, salary trends, and hiring insights delivered to your inbox every week.
No spam. Unsubscribe at any time.