Optimove

Site Reliability Engineer

Tel Aviv · Mid-level
Engineering · DevOps & Infrastructure · Site Reliability Engineer · DevOps Engineer · Infrastructure & Cloud

At Optimove, we believe people are capable of more than a single job description. You’re not hired just to fill a position; you’re empowered to shape it, grow it, and make it your own.
We call this being Positionless.
And Positionless isn’t just our culture. It’s our product.
Optimove is the creator of Positionless Marketing, an AI-powered platform that gives every marketer the power to analyze, create, launch, and optimize independently. The result is faster execution, deeper personalization, and 88% greater campaign efficiency.
Recognized as a Visionary in Gartner’s Magic Quadrant, we partner with leading brands like Sephora, Staples, and Entain. Today, more than 550 Optimovers across NYC, London, Tel Aviv, Scotland, Brazil, Estonia, and beyond are building the future of marketing together. Our environment actively encourages ownership and growth, with two out of every three managers promoted from within.
If you’re looking for a place where you can do more and be more, come grow with us.


Are you passionate about ensuring system reliability, scalability, and performance? Do you thrive in a dynamic environment where automation and operational excellence are key?
Optimove is looking for a Site Reliability Engineer (SRE) to join our team and play a crucial role in designing, implementing, and maintaining our cloud-based infrastructure. In this role, you will collaborate across teams to drive automation, improve system resilience, and optimize performance while fostering a culture of reliability.

Responsibilities:
  • System Reliability: Ensure high availability and performance of services through effective monitoring, incident management, and root cause analysis.
  • Automation & Tooling: Develop and maintain automation for infrastructure provisioning, configuration management, and application deployment.
  • Performance Optimization: Analyze and enhance system performance, including load balancing, caching, and database tuning. Conduct regular capacity planning.
  • Incident Response & Troubleshooting: Lead incident response efforts, participate in on-call rotations, and troubleshoot complex infrastructure issues.
  • Security & Compliance: Collaborate with security teams to implement best practices and ensure compliance with relevant standards (ISO 27001, SOC 2, etc.).
  • Collaboration & Mentorship: Work closely with developers, DevOps, Support, and product teams to enhance application reliability and implement SRE best practices.

Requirements:
  • 4+ years in Site Reliability Engineering, DevOps, or related roles.
  • Proven experience managing large-scale, cloud-based infrastructure in GCP, AWS, or Azure.
  • Expertise in container orchestration (Kubernetes, Docker) and microservices architecture.
  • Strong proficiency in scripting and programming languages (Python, Go, Bash, etc.).
  • Experience with CI/CD pipelines, infrastructure as code (Terraform, CloudFormation), and configuration management (Ansible, Puppet, Chef).
  • Hands-on experience with monitoring and observability tools (Datadog, Prometheus, Grafana, ELK Stack).
  • Experience using AI tools to enhance SRE processes, such as intelligent monitoring, incident prediction, and automation of incident response.
  • Deep understanding of networking concepts, DNS, load balancing, and distributed systems.
  • Strong problem-solving skills, excellent communication, and a proactive mindset.

Advantages:
  • Certifications: AWS Certified Solutions Architect, GCP Professional Cloud Architect, or Kubernetes certifications (CKA, CKAD).

Why Join Us?
In this role, you will have the opportunity to work on cutting-edge technology, solve challenging problems, and make a tangible impact on the reliability and scalability of our systems. Join a team that values collaboration, innovation, and continuous learning, and be part of an exciting journey as we scale our platform to new heights!

 

Location & Eligibility

Where is the job: Tel Aviv (on-site at the office)
Who can apply: Same as job location
Listed under: Worldwide

Listing Details

Posted: March 23, 2026
First seen: April 3, 2026
Last seen: April 27, 2026

Optimove is a customer-led marketing platform that helps brands scale their CRM marketing by using AI to foster customer loyalty and increase lifetime value.

Employees: 750
Founded: 2012