Coupang

Staff Software Engineer - AI Traffic & Inference Infrastructure

Bengaluru, Lead

Quick Summary

Requirements Summary

Education: Bachelor’s or Master’s degree in Computer Science, Engineering, or a related technical field.
Experience: 8–12 years of progressive software engineering experience,

As a Staff Engineer on the Coupang Intelligent Cloud (CIC) Infrastructure team, you will design and scale the intelligent nervous system of our CIC Cloud AI platform. You won't just be moving packets; you’ll be building the orchestration and routing layers that keep our LLMs and foundation models highly available, low-latency, and cost-efficient. You will own the end-to-end lifecycle of traffic management, from global load balancing to hardware-aware request routing across thousands of accelerators.

Responsibilities

  • Intelligent Routing: Design and implement sophisticated load-balancing algorithms tailored for AI workloads (training, inference), optimizing request distribution based on model availability and accelerator health.

  • Inference Orchestration: Architect and evolve our inference infrastructure to support seamless model deployment, auto-scaling, and multi-AZ failover. 

  • Performance Engineering: Drive initiatives to minimize tail latency (P95/P99) and maximize throughput using advanced batching, caching, and streaming token delivery techniques.

  • Fleet Automation: Build robust infrastructure-as-code and CI/CD pipelines to manage dynamic compute fleets, ensuring they automatically scale to meet production and research demands. 

  • Observability & Optimization: Leverage deep telemetry data to tune system performance and hardware-agnostic scheduling across diverse GPU/TPU environments. 

  • Technical Leadership: Lead cross-functional initiatives across infrastructure, software, and ML teams, providing mentorship and setting the long-term technical roadmap for traffic management.

 

Requirements

  • Education: Bachelor’s or Master’s degree in Computer Science, Engineering, or a related technical field. 

  • AI/ML Domain Knowledge: Prior experience building infrastructure specifically for LLM inference or large-scale training clusters. 

  • Low-Level Optimization: Familiarity with inference optimization, including mixed precision, kernel tuning, or custom hardware accelerators.

  • Public/Private Cloud: Experience managing hybrid-cloud or multi-AZ deployments across AWS, Azure, or GCP. 

  • Compliance: Experience operating in regulated environments with strict security and compliance requirements. 

 

Type of work model 

  • Hybrid

Details to consider 

  • Those eligible for employment protection (recipients of veteran’s benefits, the disabled, etc.) may receive preferential treatment for employment in accordance with applicable laws. 
     

Location & Eligibility

Where is the job: Bengaluru (on-site at the office)
Who can apply: Same as job location

Listing Details

Posted: May 8, 2026

Coupang is a U.S.-headquartered e-commerce company, operating primarily in South Korea, known for its fast delivery services and commitment to customer satisfaction.

Employees: 5k+
Founded: 2010