Deuna
Deuna9mo ago

Site Reliability Engineer

MexicoMexico·Ciudad De MéxicoRemoteJornada Completamid
EngineeringDevOps & InfrastructureSite Reliability EngineerDevops EngineerInfrastructure & Cloud
2 views0 saves0 applied

Quick Summary

Overview

About DEUNA 🧡 DEUNA is a rapidly growing startup revolutionizing global commerce with ATHIA, our AI-powered orchestration and payments platform that helps large enterprises boost approval rates,

Technical Tools
EngineeringDevOps & InfrastructureSite Reliability EngineerDevops EngineerInfrastructure & Cloud
About DEUNA 🧡
DEUNA is a rapidly growing startup revolutionizing global commerce with ATHIA, our AI-powered orchestration and payments platform that helps large enterprises boost approval rates, reduce costs, and unlock new revenue. Built by the team behind DEUNA—the fastest-growing Commerce OS in Latin America—ATHIA combines payment intelligence, checkout optimization, and data orchestration in one powerful solution.

With deep integrations across 300+ PSPs and alternative payment methods, and over 20% of Mexico’s digital economy running through our platform, we simplify global payments through a single integration and centralized reconciliation.
We are a rapidly growing startup expanding into the U.S. to meet the urgent needs of large retailers, marketplaces, airlines, and QSRs. Join us to shape the future of payments! 🚀

Visit https://www.deuna.com/ to learn more about us!

Role Overview
As a Mid SRE at Deuna, you’ll ensure the reliability, scalability, and performance of our AWS-based platform by integrating observability, automation, and SRE best practices across the software lifecycle. You will work closely with development teams to improve uptime, provide observability tooling, and ensure we scale efficiently and securely.
 
Key Responsibilities
- Design, define, and maintain observability and monitoring for our AWS infrastructure.
- Define and track SLIs, SLOs, and SLAs for critical systems.
- Improve system uptime, latency, and fault tolerance across the platform.
- Provide internal libraries and toolsets to developers for diagnostics and debugging.
- Manage scaling, performance, and resilience efforts related to system reliability.
- Collaborate with technical teams on capacity planning, load testing, and scaling policies.
- Improve production operations by defining and evolving deployment strategies and conducting disaster recovery (DR) testing.
 
Technical Skills:
- Expertise with Prometheus, Grafana, OpenTelemetry, AWS CloudWatch, or other observability tools.
- Experience designing dashboards, alerts, and log aggregation pipelines.
- Deep understanding of AWS services: ECS, Lambda, RDS, CodePipeline.
- Strong proficiency in Go programming language.
- Skilled at defining SLIs, SLOs, error budgets, and improving Mean Time to Recovery (MTTR).
- Experience conducting failure drills (e.g., Chaos Monkey, Gremlin) to ensure system resilience.
 
Soft Skills:
- Excellent communication and collaboration skills.
- Adaptability to thrive in dynamic, fast-paced environments.
- Strong time management and task prioritization.
- Proficiency in English.

Location & Eligibility

Where is the job
Ciudad De México, Mexico
Remote within one country
Who can apply
MX
Listed under
Mexico

Listing Details

Posted
August 5, 2025
First seen
March 30, 2026
Last seen
May 2, 2026

Posting Health

Days active
33
Repost count
0
Trust Level
32%
Scored at
May 2, 2026

Signal breakdown

freshnesssource trustcontent trustemployer trust
Deuna
Deuna
lever

DEUNA is a payment orchestration platform that provides a one-click checkout solution for e-commerce businesses in Latin America, aiming to increase conversion rates and reduce fraud.

Employees
125
Founded
2020
Domain
deuna.com
View company profile
Newsletter

Stay ahead of the market

Get the latest job openings, salary trends, and hiring insights delivered to your inbox every week.

A
B
C
D
Join 12,000+ marketers

No spam. Unsubscribe at any time.

DeunaSite Reliability Engineer