Staff Site Reliability Engineer
Quick Summary
7+ years of experience with cloud computing platforms. Strong multi-cloud expertise required with AWS and Azure.
Staff Site Reliability Engineer / Cloud SME
As the Staff SRE/Cloud SME, you will be a critical technical leader driving the rearchitecting of our existing monolithic system into a resilient, cloud-native architecture. This role requires deep expertise across multiple cloud platforms (Azure and AWS) and container orchestration (Kubernetes) to ensure the next-generation platform meets the highest standards of scalability, reliability, and security.
Responsibilities
Architecture & Transformation Leadership
- Lead the technical rearchitecting efforts, transforming a large-scale monolithic system into a modern microservices-based, cloud-native application.
- Collaborate with cross-functional teams (Engineering, Architecture, Product) to define and implement the new system architecture using domain-driven design (DDD) principles.
- Conduct technology evaluations and provide recommendations for new tools, frameworks, and cloud services to enhance our infrastructure.
Reliability Engineering & Cloud Operations
- Use Kubernetes (K8s) for container orchestration and management, ensuring the scalability, reliability, and high availability of the system.
- Implement robust, highly resilient, and highly available components for the system.
- Develop and implement comprehensive monitoring, logging, and alerting mechanisms to ensure optimal system performance and availability.
- Drive the adoption of DevOps principles and practices throughout the software development lifecycle, ensuring seamless integration and continuous deployment processes.
Technical Expertise & Mentorship
- Stay up to date with emerging technologies, frameworks, and industry trends related to systems and cloud computing.
- Mentor and provide technical guidance to junior team members, fostering a culture of continuous learning and professional growth.
Requirements
- Cloud Platforms: 7+ years of experience with cloud computing platforms. Strong multi-cloud expertise required with AWS and Azure.
- Cloud-Native Transformation: 7+ years of experience in rearchitecting large-scale monolithic applications to cloud-native architectures.
- Container Orchestration: Strong expertise in Kubernetes (K8s) is required, including hands-on experience with both AKS (Azure Kubernetes Service) and EKS (Amazon Elastic Kubernetes Service).
- Networking: Strong experience with Cloud Networking, with the ability to design and resolve complex cloud networking architecture problems.
- IaC: Expert knowledge of Terraform for infrastructure-as-code deployment and management.
- Security: Must possess strong knowledge of security best practices for containers and Kubernetes clusters.
- Education: Bachelor's or Master's degree in Computer Science, Software Engineering, or a related field.
- Bonus Knowledge: Knowledge of load balancing algorithms.
Listing Details
- Posted: March 9, 2026
- First seen: May 21, 2026
- Last seen: May 21, 2026
Please let ascendingdc know you found this job on Jobera.