Quick Summary
Key Responsibilities
• Design, implement and enhance system observability and monitoring tools• Monitor system performance, create incident response plans,
Technical Tools
EngineeringDevops Engineer
About Us:
Versana is an industry-backed data and technology company on a mission to make the syndicated
loan market better. By digitally capturing agent banks’ data on a real-time basis and centralizing it
onto a single platform, Versana provides unprecedented transparency into loan level details and
portfolio positions, bringing efficiency and velocity to the entire market. Through our platform,
participants can rest assured they are accessing the loan market’s most credible source of deal
information.
About You:
Versana is seeking a motivated SRE/DevOps Engineer with strong observability experience to join
our growing Platform Engineering squad. The squad’s goal is to manage public cloud, improve
DevOps practices, and monitor Versana’s real-time syndicated loan data platform. The ideal
candidate will have a deep understanding of cloud-native applications, distributed computing,
CI/CD implementation, observability tools and practices.
Key Responsibilities:
• Design, implement and enhance system observability and monitoring tools
• Monitor system performance, create incident response plans, and implement observability
practices to gain insights into system behavior.
• Implement and monitor service-level objectives (SLOs) and indicators.
• Improve system reliability and resiliency.
• Conduct post-incident reviews and implement necessary changes to prevent system
failures.
• Assist teams in implementing observability tools and leveraging available telemetry data to
troubleshoot and resolve incidents and problems.
• Leverage observability and event management to improve key incident management
metrics, such as mean time to detect and mean time to restore services.
• Continually optimize systems and workflows by improving architecture, infrastructure,
automation, CI/CD, and observability.
• Collaborate with developers to ensure applications are designed with DevOps best
practices in mind.
• Participate in a rotating on-call schedule for weekend releases and being available to
respond to production issues outside of regular working hours, including weekends and
holidays.
Must Have:
• 5+ years of experience as a Site Reliability Engineer or similar role.
• 3+ years of work experience with public cloud (Azure, AWS or GCP).
• 3+ years of direct experience with observability tools like Datadog, Elasticsearch, and
Grafana Labs, etc.
• 3+ years of experience with containerization and orchestration technologies like Docker
and Kubernetes.
• 2+ years of experience in development and management of CI/CD pipelines (e.g., Azure
DevOps, Gitlab CI/CD, Github Actions, Jenkins, etc).
• 2+ years of experience with Infrastructure-as-code tools like Terraform, Azure Bicep, Cloud
Formation, etc.
• 1+ years of experience with site reliability tools like Gremlin, Chaos Mesh, or similar.
• Proven track record leveraging core observability concepts, end-user monitoring, and
infrastructure monitoring with SaaS solutions.
• Experience with messaging services like Kafka or Azure Event Hubs.
• Good understanding of the Linux operating system.
Nice to Have:
• Experience in at least one coding language such as Java, JavaScript, Python, GoLang, or .NET.
• Certifications in cloud technologies.
• Experience with Azure cloud or Azure DevOps.
• Experience with Datadog or similar modern observability tools.
Equal Opportunity Employer:
We are committed to providing equal employment opportunities to all employees and applicants
for employment and prohibit discrimination and harassment of any type without regard to race,
color, religion, age, sex, national origin, disability status, genetics, protected veteran status, sexual
orientation, gender identity or expression, or any other characteristic protected by federal, state or
local laws.
This policy applies to all terms and conditions of employment, including recruiting, hiring,
placement, promotion, termination, layoff, recall, transfer, leaves of absence, compensation and
training
Versana is an industry-backed data and technology company on a mission to make the syndicated
loan market better. By digitally capturing agent banks’ data on a real-time basis and centralizing it
onto a single platform, Versana provides unprecedented transparency into loan level details and
portfolio positions, bringing efficiency and velocity to the entire market. Through our platform,
participants can rest assured they are accessing the loan market’s most credible source of deal
information.
About You:
Versana is seeking a motivated SRE/DevOps Engineer with strong observability experience to join
our growing Platform Engineering squad. The squad’s goal is to manage public cloud, improve
DevOps practices, and monitor Versana’s real-time syndicated loan data platform. The ideal
candidate will have a deep understanding of cloud-native applications, distributed computing,
CI/CD implementation, observability tools and practices.
Key Responsibilities:
• Design, implement and enhance system observability and monitoring tools
• Monitor system performance, create incident response plans, and implement observability
practices to gain insights into system behavior.
• Implement and monitor service-level objectives (SLOs) and indicators.
• Improve system reliability and resiliency.
• Conduct post-incident reviews and implement necessary changes to prevent system
failures.
• Assist teams in implementing observability tools and leveraging available telemetry data to
troubleshoot and resolve incidents and problems.
• Leverage observability and event management to improve key incident management
metrics, such as mean time to detect and mean time to restore services.
• Continually optimize systems and workflows by improving architecture, infrastructure,
automation, CI/CD, and observability.
• Collaborate with developers to ensure applications are designed with DevOps best
practices in mind.
• Participate in a rotating on-call schedule for weekend releases and being available to
respond to production issues outside of regular working hours, including weekends and
holidays.
Must Have:
• 5+ years of experience as a Site Reliability Engineer or similar role.
• 3+ years of work experience with public cloud (Azure, AWS or GCP).
• 3+ years of direct experience with observability tools like Datadog, Elasticsearch, and
Grafana Labs, etc.
• 3+ years of experience with containerization and orchestration technologies like Docker
and Kubernetes.
• 2+ years of experience in development and management of CI/CD pipelines (e.g., Azure
DevOps, Gitlab CI/CD, Github Actions, Jenkins, etc).
• 2+ years of experience with Infrastructure-as-code tools like Terraform, Azure Bicep, Cloud
Formation, etc.
• 1+ years of experience with site reliability tools like Gremlin, Chaos Mesh, or similar.
• Proven track record leveraging core observability concepts, end-user monitoring, and
infrastructure monitoring with SaaS solutions.
• Experience with messaging services like Kafka or Azure Event Hubs.
• Good understanding of the Linux operating system.
Nice to Have:
• Experience in at least one coding language such as Java, JavaScript, Python, GoLang, or .NET.
• Certifications in cloud technologies.
• Experience with Azure cloud or Azure DevOps.
• Experience with Datadog or similar modern observability tools.
Equal Opportunity Employer:
We are committed to providing equal employment opportunities to all employees and applicants
for employment and prohibit discrimination and harassment of any type without regard to race,
color, religion, age, sex, national origin, disability status, genetics, protected veteran status, sexual
orientation, gender identity or expression, or any other characteristic protected by federal, state or
local laws.
This policy applies to all terms and conditions of employment, including recruiting, hiring,
placement, promotion, termination, layoff, recall, transfer, leaves of absence, compensation and
training
Location & Eligibility
Where is the job
New York, United States
Hybrid — some on-site time required
Who can apply
US
Listing Details
- Posted
- May 22, 2026
- First seen
- May 22, 2026
- Last seen
- May 23, 2026
Posting Health
- Days active
- 0
- Repost count
- 0
- Trust Level
- 62%
- Scored at
- May 22, 2026
Signal breakdown
freshnesssource trustcontent trustemployer trust
External application · ~5 min on Versana's site
Please let Versana know you found this job on Jobera.
3 other jobs at Versana
View all →Explore open roles at Versana.
Browse Similar Jobs
DevOps & Infrastructure3.2kSecurity2.4kEngineering Manager1.6kData Engineering1.3kFullstack Developer1.3kBackend Engineering1.3kBackend Developer1.2kSoftware Architect1kQa Engineer1kFrontend Developer977Mechanical Engineer947Frontend Engineering898Mobile Developer880Security Engineer833Electrical Engineer722Project Engineer569IT & Administration544Design Engineer529Automation Engineer350Mobile Development345
Newsletter
Stay ahead of the market
Get the latest job openings, salary trends, and hiring insights delivered to your inbox every week.
A
B
C
D
No spam. Unsubscribe at any time.
