C
USD 194000-257300/yr

Manager, Engineering (Production Orchestration)

United StatesUnited States·New Yorkmid
OtherManager
0 views0 saves0 applied

Quick Summary

Overview

Category-defining tech. Career-defining work. Lots of tech companies disrupt. But, many fail when they try to scale. We're different. CockroachDB makes it easier for companies to build and scale apps.

Technical Tools
OtherManager
 

Lots of tech companies disrupt. But, many fail when they try to scale. We're different. CockroachDB makes it easier for companies to build and scale apps. This is how and why we're helping some of the most innovative companies on the planet. We tackle problems head-on and focus on solutions that create lasting impact. 

 


At the heart of CockroachDB is our Production Orchestration team- the stewards of availability, reliability, and scalability across our cloud offerings and beyond. Built on a foundation of SRE principles and carrying forward years of operational practice, our core commitment is clear: ensuring our customers have a secure, reliable, and performant production service at scale.

We're looking for an Engineering Manager to lead our Production Orchestration team as part of a global Production Engineering organization. You'll drive foundational architectural changes to how we operate our fleet, champion AI-driven approaches to both development and operations, and foster a culture of operational excellence, ensuring CockroachDB meets and exceeds our SLAs while keeping pace with rapid growth.

You'll report to Tom Schmidt, Director of Production Engineering, who has led this team for 4+ years and will continue to be deeply involved in its technical direction. You'll be responsible for the growth and development of the team's engineers, day-to-day execution, and operational health, while bringing your own leadership and ideas to the table.

  • Lead the Production Orchestration team, focused on the reliability, availability, and scalability of CockroachDB in production. 
  • Own operational excellence. Ensure the team is meeting or exceeding our SLAs, running effective incident response, and continuously improving our operational posture. Every incident is treated as a learning opportunity.
  • Partner across the global Production Engineering organization to align on shared goals, ensure smooth coordination across time zones, and drive cohesive execution.
  • Drive automation and tooling. Relentlessly reduce operational toil by building systems that improve observability and scale our fleet without scaling headcount linearly.
  • Leverage AI to improve how the team builds and operates. Help the team adopt AI-assisted development practices and identify applied AI opportunities to improve operational workflows, from alert triage to capacity planning to incident response.
  • Contribute to foundational architecture. The team is building a new architectural initiative that will reshape how we operate our fleet. You'll help lead execution on this work and ensure the team has the space and support to deliver.
  • Coach and develop your engineers. Provide direct, constructive feedback. Guide personal development and career growth beyond just technical skills. Managing performance and ensuring engineers are achieving their goals is essential to retaining a high-performing team.
  • Partner with engineering and product leadership to shape the roadmap for CockroachDB's operational capabilities and future products.
  • Collaborate across teams to build and establish the tools and processes that empower everyone to make our customers successful.

Responsibilities

~2 min read

Tom leads Cockroach Labs' Production Engineering org, responsible for the operational reliability and scalability of CockroachDB. He joined Cockroach Labs in August 2022 as manager of Site Reliability Engineering and has since taken responsibility for the broader production engineering organization. Before CRL, Tom spent 15 years at IBM, initially in technical leadership roles spanning compiler development, test frameworks, and CI/CD, before dedicating the latter half of his career to championing SRE across the organization. An enthusiastic advocate of the discipline, Tom has presented at conferences, developed certification curriculum, secured multiple patents, and was recognized as one of IBM's first three SRE Thought Leaders. Outside of work, Tom is a proud father of a 5-year-old boy and enjoys hiking, camping, and gaming.


Cockroach Labs is proud to be an Equal Opportunity Employer building a diverse and inclusive workforce. If you need additional accommodations to feel comfortable during your interview process, please email us at accessibility@cockroachlabs.com.

Cockroach Labs has a hybrid work model, with Roachers that are local to one of our offices coming in on Mondays, Tuesdays, and Thursdays and working flexibly the rest of the week. While we’ve learned valuable lessons working remotely, nothing can replace the connection, creativity, and fun that occurs when Roachers get together and we are committed to fostering a workplace that encourages collaboration and allows us all to do our best work.


What We Offer

~2 min read
Stock Options
Medical Insurance
Vision Insurance
Dental Insurance
Life and Disability Insurance
Professional Development Funds
Flexible Time Off
Paid Holidays
Paid Sick Days
Paid Parental Leave
Retirement Benefits
Mental Wellbeing Benefits
And more!

Location & Eligibility

Where is the job
New York, United States
On-site at the office
Who can apply
US

Listing Details

Posted
May 28, 2026
First seen
May 28, 2026
Last seen
May 28, 2026

Posting Health

Days active
0
Repost count
0
Trust Level
71%
Scored at
May 28, 2026

Signal breakdown

freshnesssource trustcontent trustemployer trust
Newsletter

Stay ahead of the market

Get the latest job openings, salary trends, and hiring insights delivered to your inbox every week.

A
B
C
D
Join 12,000+ marketers

No spam. Unsubscribe at any time.

C
Manager, Engineering (Production Orchestration)USD 194000-257300