Firstup

Director of Cloud Operations

United States | Remote | Full-time | Executive
Other | Director | Cloud Operations Engineer | Infrastructure & Cloud

Technical Tools
aws, circleci, datadog, kubernetes, terraform, ci-cd, distributed-systems, microservices, performance-optimization, saas, system-design

Who We Are

At Firstup, our mission is to improve the employee experience at every moment that matters, large and small. As the communication pipeline for the world's workforce, we now serve 40 of the Fortune 100 companies, reaching and connecting more than 17 million employees daily.

Our employees are experts in the employee experience, workforce communications and technology. 
Joining Firstup means joining a movement to make work better for every worker. As the world’s first intelligent communication platform, Firstup meaningfully engages employees at every moment from hire to retire, and delivers engagement insights to help companies support, promote and retain their talent. Our movement has taken root and is evident in our world-class customer base. Now we need your help. Ready to make a difference in the world?


We are seeking a Director of Cloud Operations (CloudOps) to lead and evolve our cloud infrastructure and operational practices across a globally distributed SaaS platform. This is a hands-on leadership role responsible for ensuring the reliability, scalability, and efficiency of our systems running across multiple AWS regions in the United States and Europe.

As part of the senior leadership team, you will partner closely with Engineering, Security, and Product to strengthen operational excellence, enhance system observability, and drive continuous improvement in how we build and run services. You will lead a distributed team of engineers across the US and UK, fostering a high-performing, collaborative, and growth-oriented environment.

This role is ideal for a leader who combines deep technical expertise with a pragmatic approach to improving systems, processes, and team capabilities.

  • Own the availability, performance, and resilience of our multi-region AWS platform.

  • Drive improvements in system reliability through well-defined SLIs/SLOs, error budgets, and proactive engineering practices.

  • Lead efforts to reduce MTTR and improve incident response effectiveness across the organization.

  • Guide architecture decisions for microservices, Kubernetes (EKS), and serverless workloads to ensure scalability and fault tolerance.

  • Advance our observability strategy using Datadog, ensuring actionable insights across infrastructure and applications.

  • Establish and refine incident management practices, including on-call processes, escalation paths, and post-incident reviews.

  • Act as an incident commander for critical events and contribute to the on-call rotation.

  • Elevate operational standards through automation, standardization, and adoption of modern best practices.

  • Drive cost optimization initiatives across AWS environments without compromising performance or reliability.

  • Leverage AI and automation to improve operational efficiency, accelerate root cause analysis, and enhance system insights.

  • Continuously improve CI/CD pipelines (CircleCI) and infrastructure-as-code practices (Terraform).

  • Lead, mentor, and support a distributed team of CloudOps engineers across the US and UK.

  • Foster a culture of accountability, learning, and continuous improvement.

  • Provide technical guidance while enabling the team to grow in ownership and capability.

  • Ensure stability and support for existing customers while maintaining clear operational boundaries with the cloud platform.

Experience

  • 10+ years in cloud infrastructure, SRE, or DevOps roles, including 3+ years of experience leading CloudOps/SRE teams.

  • Proven track record of leading operational or platform transformations in a SaaS environment.

  • Experience operating multi-region, customer-facing systems at scale.

  • Strong hands-on experience with:
      • AWS (multi-region architectures)
      • Kubernetes (EKS) and containerized environments
      • Infrastructure as Code (Terraform preferred)
      • CI/CD pipelines (CircleCI or similar)
      • Observability platforms (Datadog or equivalent)

  • Solid understanding of microservices and distributed systems design.

  • Familiarity with serverless architectures and modern cloud-native patterns.

  • Deep experience with incident management, on-call operations, and reliability engineering practices.

  • Strong understanding of SLO/SLI frameworks, monitoring strategies, and performance optimization.

  • Demonstrated ability to balance hands-on technical work with team leadership.

  • Collaborative, pragmatic leader who can influence across teams and functions.

  • Passion for building and supporting high-performing teams.

  • Focus on continuous improvement, with a bias toward measurable outcomes.

Location & Eligibility

Where is the job: United States (remote within one country)
Who can apply: Open to applicants worldwide
Listed under: United States

Listing Details

Posted: April 17, 2026
First seen: April 17, 2026
Last seen: May 6, 2026

Posting Health

Days active: 18
Repost count: 0
Trust Level: 37%
Scored at: May 6, 2026


Firstup is fundamentally changing how organizations communicate and is the backbone of the entire Digital Employee Experience.

Employees: 125
Founded: 2008