Senior Site Reliability Engineer
Quick Summary
WHO WE ARE 🌍 We help creators get more out of every conversation with Instagram-focused automations and support for other channels like Messenger, WhatsApp, and TikTok. The result? Better engagement,
We help creators get more out of every conversation with Instagram-focused automations and support for other channels like Messenger, WhatsApp, and TikTok. The result? Better engagement, more sales, and real, sustainable growth.
With a diverse team of 350+ people spread across three continents, we’re building the leading Chat Marketing platform that is used — and loved — by more than 1.5 million customers worldwide.
Responsibilities
~1 min read- →Maintain and harden AWS infrastructure (EC2, ALB/NLB, WAF, IAM, CloudWatch)
- →Operate and evolve our EKS clusters powering Python-based AI services
- →Migrate existing services to Kubernetes using Terraform and Helm
- →Codify infrastructure with Terraform and manage host-level automation via Ansible
- →Build and improve CI/CD pipelines with GitHub Actions
- →Own observability efforts: Prometheus, Grafana, alerting, and on-call readiness
- →Support OS-level patching, certs, WAF rules, and general infra hygiene
- →Partner with engineers to guide best practices and drive platform reliability
- →Create clean, maintainable infrastructure documentation and playbooks
- →Occasionally support rare off-hours incidents (don’t worry, really rare)
- 5+ years of experience managing Linux in production (Ubuntu, Amazon Linux)
- Strong experience with Kubernetes (ideally EKS), Helm, and Terraform
- Comfort with running and debugging Python workloads in containers
- Solid understanding of networking, IAM, and cloud security best practices
- Hands-on Nginx experience (Ingress and reverse proxy setups)
- Excellent communication skills; you can explain complex infra to devs clearly
Nice to Have
~1 min read- Strong Ansible skills beyond the basics
- PostgreSQL or Amazon RDS tuning and operations experience
- Deep understanding of observability tools (Prometheus, Grafana, Loki, etc.)
- Familiarity with PHP production environments
- Experience with TDD, CI/CD best practices, and agile development
- Any previous SRE-like exposure such as building resilience, automation, or incident tooling
What We Offer
~2 min readListing Details
- Posted
- March 25, 2026
- First seen
- April 3, 2026
- Last seen
- April 26, 2026
Posting Health
- Days active
- 23
- Repost count
- 0
- Trust Level
- 31%
- Scored at
- April 26, 2026
Signal breakdown
ManyChat is a global Chat Marketing platform that enables businesses to automate conversations and drive sales on messaging apps like Instagram, WhatsApp, and Facebook Messenger. Founded in 2015, it serves over a million businesses worldwide with its user-friendly chatbot builder and automation tools.
View company profilePlease let Manychat know you found this job on Jobera.
4 other jobs at Manychat
View all →Explore open roles at Manychat.
Similar Devops Engineer jobs
View all →Browse Similar Jobs
Stay ahead of the market
Get the latest job openings, salary trends, and hiring insights delivered to your inbox every week.
No spam. Unsubscribe at any time.