dolby
dolby22h ago
New

Staff Operational Support Engineer

lead
Customer SupportSupport Engineer
0 views0 saves0 applied

Quick Summary

Key Responsibilities

* Terraform * Helm * Kubernetes manifests * GitOps workflows * CI/CD and deployment pipelines • Use IaC as the primary mechanism for safe, auditable,

Requirements Summary

Dolby is proud to be an equal opportunity employer. Our success depends on the combined skills and talents of all our employees. We are committed to making employment decisions without regard to race,

Technical Tools
Customer SupportSupport Engineer
Join the leader in entertainment innovation and help us design the future. At Dolby, science meets art, and high tech means more than computer code. As a member of the Dolby team, you’ll see and hear the results of your work everywhere, from movie theaters to smartphones. We continue to revolutionize how people create, deliver, and enjoy entertainment worldwide. To do that, we need the absolute best talent. We’re big enough to give you all the resources you need, and small enough so you can make a real difference and earn recognition for your work. We offer a collegial culture, challenging projects, and excellent compensation and benefits, not to mention a Flex Work approach that is truly flexible to support where, when, and how you do your best work. The Dolby Cloud Solutions organization builds technologies and innovations that easily integrate into service providers’ infrastructure to make content experiences more effective, meaningful, and engaging for consumers. Dolby OptiView is building a dedicated Operational Support (L2) team responsible for the stability, availability, and operational excellence of our 24/7 live video streaming, ads, player, and real-time delivery platforms. As an Operational Support Engineer (L2), you take end-to-end ownership of customer-impacting production incidents once they are triaged by Level 1 support. You operate directly on production systems, lead live incident resolution, and act as the operational bridge between Support, Engineering, DevOps, and customers, particularly during high-impact live events. This is a hands-on, customer-facing role focused on incident ownership, production operations, automation, and operational scalability, not just reactive troubleshooting. Key Responsibilities Incident & Operational Support • Take ownership of escalated customer issues from Level 1 Support and drive them to resolution. • Troubleshoot and resolve complex, high-impact production incidents affecting live streams, VOD playback, ad insertion, DRM, and real-time WebRTC services. • Operate directly on production environments, including configuration changes, CDN adjustments, and corrective actions, following established operational procedures, including executing mitigations and emergency changes during live incidents when customer impact requires immediate action. • Lead or actively contribute to live incident bridges involving customers, internal teams, and partners. • Provide clear, timely communication during incidents, including status updates and customer- facing explanations. Infrastructure as Code & Production Operations • Work fluently with Infrastructure as Code (IaC) to understand, troubleshoot, and safely modify production environments • Leverage tools and frameworks such as: * Terraform * Helm * Kubernetes manifests * GitOps workflows * CI/CD and deployment pipelines • Use IaC as the primary mechanism for safe, auditable, and repeatable operational changes • Collaborate with Engineering and DevOps to improve deployment reliability and operational safety • Validate and execute infrastructure or configuration changes through codified workflows AI-Driven Operations & Automation. • Leverage AI tools and automation to enhance operational efficiency and incident response. Contribute to and use: * AI-assisted incident triage and classification * Automated runbook execution * AI-based pattern detection across incidents * Intelligent alert correlation and noise reduction Use AI to: * Generate or improve incident communications * Accelerate troubleshooting workflows * Identify recurring patterns and systemic issues • Drive adoption of automation-first and AI-augmented operational practices Pre-Event Planning & Operational Readiness • Participate in pre-event readiness planning for critical customer events Validate system readiness through: * Runbook checks * Monitoring coverage validation * Risk identification and mitigation planning • Define and rehearse incident response strategies for high-risk scenarios • Collaborate with customers and internal teams to ensure smooth event execution On-Call & 24/7 Operations • Participate in a 24/7 on-call rotation, including nights, weekends, and holidays, as part of a global support model • Ensure smooth handovers between shifts and regions • Respond to critical alerts within defined SLAs for stream health, player errors, and delivery infrastructure. Root Cause & Continuous Improvement • Perform or contribute to root cause analysis (RCA) for production incidents • Document findings, corrective actions, and preventive measures • Identify recurring issues and work with Engineering and Product teams to eliminate them permanently • Contribute to and improve runbooks, operational playbooks, and knowledge bases for all OptiView products (Player, ads, live and real time streaming) Collaboration & Engineering Feedback Loop • Work closely with Engineering teams to escalate defects, validate fixes, and support production deployments • Provide feedback on system observability, tooling gaps, and operational risks • Act as the operational voice during post-incident reviews Required Skills & Experience Technical Skills • 5+ years of relevant experience in operational, support, or similar customer‑facing roles • Proven ability to own complex problems end‑to‑end and operate with a high degree of autonomy • Strong experience supporting production video streaming platforms, OTT services, live systems • Solid troubleshooting skills across distributed systems (APIs, microservices, cloud infrastructure) • Familiarity with HLS, DASH, CMAF, WebRTC, DRM and CDN architectures • Experience working with monitoring, alerting, and logs to diagnose live incidents (Grafana, Kibana/ELK, Prometheus, Loki) • Correlate backend streaming metrics, player telemetry, and CDN signals to diagnose live customer issues end-to-end. • Comfort performing controlled changes in production environments • Working knowledge of incident management and on-call operations Operational Mindset • Proven ability to remain calm, structured, and decisive during high-pressure incidents • Strong sense of ownership and accountability for customer outcomes • Excellent written and verbal communication skills, including customer-facing communication during incidents #LI-GW! The Atlanta Area base salary range for this full-time position is $136,500-$187,400, which can vary if outside this location, plus bonus, benefits, and some roles may also include equity. Our salary ranges are determined by role, level, and location. Within the range, individual pay is determined by work location and additional factors, including job-related skills, competencies, experience, market demands, internal parity, and relevant education or training. Your recruiter can share more about the specific salary range and perks and benefits for your location during the hiring process. Dolby will consider qualified applicants with criminal histories in a manner consistent with the requirements of San Francisco Police Code, Article 49, and Administrative Code, Article 12 Equal Employment Opportunity: Dolby is proud to be an equal opportunity employer. Our success depends on the combined skills and talents of all our employees. We are committed to making employment decisions without regard to race, religious creed, color, age, sex, sexual orientation, gender identity, national origin, religion, marital status, family status, medical condition, disability, military service, pregnancy, childbirth and related medical conditions or any other classification protected by federal, state, and local laws and ordinances.

Location & Eligibility

Where is the job
Location terms not specified
Who can apply
Open to applicants worldwide

Listing Details

Posted
May 28, 2026
First seen
May 29, 2026
Last seen
May 29, 2026

Posting Health

Days active
0
Repost count
0
Trust Level
49%
Scored at
May 29, 2026

Signal breakdown

freshnesssource trustcontent trustemployer trust

3 other jobs at dolby

View all →

Explore open roles at dolby.

Newsletter

Stay ahead of the market

Get the latest job openings, salary trends, and hiring insights delivered to your inbox every week.

A
B
C
D
Join 12,000+ marketers

No spam. Unsubscribe at any time.

dolbyStaff Operational Support Engineer