Site Reliability Engineer

PortugalPortugalmid
EngineeringDevOps & InfrastructureSite Reliability EngineerDevops EngineerInfrastructure & Cloud
0 views0 saves0 applied

Quick Summary

Requirements Summary

Experience with monitoring & Observability stacks such as Grafana and Prometheus; Kubernetes, Cloud and Hashicorp experience is valued; Knowledge or experience with AWS or GCP.

Technical Tools
EngineeringDevOps & InfrastructureSite Reliability EngineerDevops EngineerInfrastructure & Cloud

Feedzai is the world’s first RiskOps platform for financial risk management, and the market leader in safeguarding global commerce with today’s most advanced cloud-based risk management platform, powered by machine learning and artificial intelligence. Feedzai is securing the transition to a cashless world while enabling digital trust in every transaction and payment type. The world’s largest banks, processors, and retailers trust Feedzai to protect trillions of dollars and manage risk while improving the customer experience for everyday users, without compromising privacy. Feedzai is a Series D company and has raised $282M to date. With a valuation of $2 billion, our technology protects 1 billion consumers and 90 billion transactions each year.

With Cloud at its core, the Platform Engineering area supports our product development life cycle, from development through testing and deployment to operations and maintenance, enabling a DevOps way of working. Formed by engineers and managed by engineers, at Feedzai, you will find one of the most talented teams out there, from junior to senior engineers.

While building the best value for our customers, you will work with a wide range of technical challenges. Such as building distributed systems that need to operate 24/7 with ultra-low latencies, plus cooperating with other teams towards high performance and reliability. 

We are fast-paced and provide a safe, open, and collaborative environment that encourages us to lean in, try new things and discover our potential with continuous learning for everyone.  

 

If you are passionate about distributed systems, performance, reliability on cloud environments and like challenges of low latencies and high throughput systems, this may be the job for you.

You’ll be part of Feedzai Platform Engineering Performance & Reliability team. Much of our software development focuses on optimizing existing systems, building infrastructure and eliminating work through automation. As part of this team you’ll have the opportunity to manage the complex challenges of scale which are unique to Feedzai Fraud detection mission, while working with talented platform engineers in complexity analysis and large-scale system design, developing the automation, tooling and platforms that support Feedzai top notch cloud service.

  • Provide recommendations about capacity allocation considering cost, resilience and performance.
  • Work together with product teams to support best practices and drive improvements on systems performance and reliability before and after they go live;
  • Development with Go, Python or similar languages;
  • Automate all aspects of cloud infrastructure and incident response;
  • Develop playbooks related to actionable alerts;
  • Participate in incident response, root cause investigation and resolution;
  • Maintain and develop our infrastructure as code (IaC) to manage and operate end-to-end lifecycle operations (monitoring, alerting, security, cost optimization, configuration, backup, etc.) in production environments;
  • Utilize your experience and problem solving skills to help prevent and investigate production issues.
  • A bachelor's degree in Computer Science, Information Systems, or the equivalent combination of education, experience, and training;
  • Programming skills (Go, Python or similar languages);
  • 3+ years of experience in data structures, algorithms, programming, asynchronous & multithreaded designs
  • 3+ years of experience with building scalable and distributed cloud services
  • 3+ years operating production environments
  • 2+ years of experience in cross team collaboration within a supportive role
  • Self-driven & motivated, with a strong work ethic and a passion for problem solving;
  • Systematic problem-solving approach, coupled with effective verbal and written communication skills.
  • Experience being oncall.

Requirements

~1 min read
  • Experience with monitoring & Observability stacks such as Grafana and Prometheus;
  • Kubernetes, Cloud and Hashicorp experience is valued;
  • Knowledge or experience with AWS or GCP.

#LI-Remote #LI-LS1


You will be immersed in our brand with training, connections, and one-on-one time with your manager. You may shadow your colleagues virtually or onsite at an office depending on where you work as you are supported through your Feedzai journey. In addition, you will have access to a ton of information to give you history, context, and all the knowledge you can handle about Feedzai and the team. Finally, you will start working on projects and collaborating on work currently being done. We can't wait to have you join the team!

Life at Feedzai Instagram

Feedzai Culture


 

Listing Details

First seen
April 3, 2026
Last seen
April 26, 2026

Posting Health

Days active
23
Repost count
0
Trust Level
31%
Scored at
April 26, 2026

Signal breakdown

freshnesssource trustcontent trustemployer trust
Feedzai
Feedzai
greenhouse

Feedzai is a global leader in AI-driven fraud prevention, dedicated to protecting financial institutions and their customers from fraud and financial crime.

Employees
350
Founded
2013
View company profile
Newsletter

Stay ahead of the market

Get the latest job openings, salary trends, and hiring insights delivered to your inbox every week.

A
B
C
D
Join 12,000+ marketers

No spam. Unsubscribe at any time.

FeedzaiSite Reliability Engineer