Qgenda
Qgenda3h ago
New

Senior Site Reliability Engineer

United StatesUnited States·Atlantasenior
EngineeringDevops Engineer
0 views0 saves0 applied

Quick Summary

Key Responsibilities

System Reliability & Performance: Design, implement, and manage scalable systems that ensure high availability, fault tolerance, and optimal performance.

Technical Tools
EngineeringDevops Engineer

QGenda is redefining healthcare workforce management everywhere care is delivered. We're on a mission to empower the healthcare industry to better onboarding, deploy, and manage their workforce. Over 4,500 healthcare organizations have trusted us to help them make strategic workforce decisions through our unified software platform. With more than 800 employees across the US, we are united in our vision and culture to make a difference for our customers, while enjoying the day-to-day. 

At QGenda, we value our employees and their contributions toward the success of the business. We strive to create a dynamic work environment that fosters growth, innovation, and collaboration, where employees can be proud of the work they do and the impact it has on the healthcare industry. 

QGenda is headquartered in Atlanta. 

To learn more about QGenda, visit us at qgenda.com or follow us on Instagram or LinkedIn

As a Senior Site Reliability Engineer, you will work with our Infrastructure and Product Development Teams to increase the scalability, reliability, and performance of our systems and services.  You will build and extend existing automation for configuration and monitoring of our AWS hosted applications.  You will have the opportunity to evaluate new AWS services and tools to determine if they could be utilized in our environments. You’ll bring a focus to platform health and monitoring to allow us to deliver the best possible experience for our customers. This is an excellent opportunity to have a significant impact on the stability of our systems and contribute to the evolution of our technology stack.

As a Senior Site Reliability Engineer, you will work with our Infrastructure and Product Development Teams to increase the scalability, reliability, and performance of our systems and services.  You will build and extend existing automation for configuration and monitoring of our AWS hosted applications.  You will have the opportunity to evaluate new AWS services and tools to determine if they could be utilized in our environments. You’ll bring a focus to platform health and monitoring to allow us to deliver the best possible experience for our customers. This is an excellent opportunity to have a significant impact on the stability of our systems and contribute to the evolution of our technology stack.

Responsibilities

~1 min read
    • Design, implement, and manage scalable systems that ensure high availability, fault tolerance, and optimal performance.
    • Continuously monitor and enhance system health and performance through data analysis and metrics.
    • Develop and advocate for automation tools to eliminate repetitive manual processes and improve efficiency.
    • Build and enhance CI/CD pipelines to streamline software delivery and deployments.
    • Participate in on-call rotation to respond to incidents, troubleshoot problems, and minimize downtime.
    • Conduct root cause analyses and implement permanent solutions to recurring issues.
    • Manage our cloud-based infrastructure environment in AWS.
    • Optimize costs and resources while maintaining robust and scalable systems.
    • Serve as a technical advisor to engineering teams on infrastructure and operations best practices.
    • Actively contribute to fostering an SRE culture within the organization by promoting observability, retrospectives, and continuous improvement.
  • Curiosity-driven mindset with a desire to continuously learn and improve systems
  • Strong sense of ownership — you see problems through to resolution, not just escalation
  • Comfortable navigating ambiguity and making pragmatic tradeoffs under pressure
  • Availability for off-hours deployment and upgrades of production systems during release and maintenance windows
  • Strong problem-solving skills and ability to work effectively under pressure.
  • Excellent communication skills for cross-functional collaboration as well as documentation creation.
  • B.S. in Computer Science, Computer Information Systems, or Computer Engineering from a major U.S. university or equivalent industry experience
  • 7+ years of experience as a DevOps, SRE or Systems Engineer 
  • Advanced proficiency with at least one scripting or programming language
  • Experience with Docker and container orchestration tools such as AWS ECS and EKS/Kubernetes
  • Hands-on experience building infrastructure and supporting applications in AWS using services such as Lambda, EC2, ECS, S3, SNS, SQS, RDS, Redshift, and Elasticache
  • Strong understanding of networking and DNS
  • Strong experience with Terraform for infrastructure provisioning and module development, along with configuration management and infrastructure as code (IaC) practices
  • Firm understanding and experience with Agile and Scrum SDLC processes 
  • Using distributed version control system experience (Git preferred) to check-in code, branching, merging, pull request, code review, etc
  • Knowledge of CI/CD best practices and tools such as AWS CodeBuild, Jenkins and/or TeamCity
  • Experience using AI-assisted coding tools (e.g., Claude, GitHub Copilot) to accelerate IaC development, scripting, and operational workflows
  • Familiarity with AI/ML-driven approaches to observability, anomaly detection, log analysis, or incident triage
  • Experience designing and delivering secure, high performance and highly available cloud services
  • Experience with observability platforms (e.g., Datadog, CloudWatch, PagerDuty) for monitoring, alerting, and incident response
  • Awareness of cloud security best practices including IAM policies, network segmentation, and secrets management

 

Applicants for this position must be authorized to work for any employer in the United States (U.S.), including being located in the US. We are unable to sponsor, take over sponsorship of, or hire candidates with an employment visa at this time. 

We offer a comprehensive total rewards package to support our full-time employees and their family’s day-to-day needs, well-being and major life events, which includes: 

  • Fully company-paid options for medical (both in-person and virtual), dental and vision insurance
  • Generous paid time off (PTO) policy to enjoy periods of uninterrupted rest and relaxation for a healthy work/life balance
  • Paid parental leave for birth, adoption or permanent placement
  • 401(k) with company match 
  • Options to work in a hybrid-working model or remotely from home, depending on the position
  • Annual Costco membership, cell phone stipend, commuter benefits, in-office perks and more 

QGenda delivers technology solutions to improve how healthcare is delivered and increase access — for everyone. We can only succeed by bringing together diverse minds, thoughts, ideas and team members to create better solutions for our customers and make us a better company as a whole. We are committed to creating a culture of embracing diversity, inclusion and equity for all. 

QGenda is an Equal Employment Opportunity employer and makes all employment decisions without regard to race, color, religion, creed, gender, sex (including pregnancy), sexual orientation, gender identity or expression, natural origin, ancestry, age, marital status, disability or genetic information, military status, status as a disabled or protected veteran or any other protected status under applicable law. 

If you require accommodations or assistance to complete the online application process, please contact recruiting@qgenda.com and identify the type of accommodation or assistance you are requesting. Do not include any medical or health information in this email. We will respond to your email promptly. 

Location & Eligibility

Where is the job
Atlanta, United States
On-site at the office
Who can apply
US

Listing Details

Posted
June 9, 2026
First seen
June 9, 2026
Last seen
June 9, 2026

Posting Health

Days active
0
Repost count
0
Trust Level
67%
Scored at
June 9, 2026

Signal breakdown

freshnesssource trustcontent trustemployer trust
Qgenda
Qgenda
greenhouse
Employees
750
Founded
2006
View company profile
Newsletter

Stay ahead of the market

Get the latest job openings, salary trends, and hiring insights delivered to your inbox every week.

A
B
C
D
Join 12,000+ marketers

No spam. Unsubscribe at any time.

QgendaSenior Site Reliability Engineer