Sonatype is the software supply chain security company. We provide the world’s best end-to-end software supply chain security solution, combining the only proactive protection against malicious open source, the only enterprise grade SBOM management and the leading open source dependency management platform. This empowers enterprises to create and maintain secure, quality, and innovative software at scale.
As founders of Nexus Repository and stewards of Maven Central, the world’s largest repository of Java open-source software, we are software pioneers and our open source expertise is unmatched. We empower innovation with an unparalleled commitment to build faster, safer software and harness AI and data intelligence to mitigate risk, maximize efficiencies, and drive powerful software development.
More than 2,000 organizations, including 70% of the Fortune 100 and 15 million software developers, rely on Sonatype to optimize their software supply chains.
The Opportunity
● We’re looking for a Senior Data Scientist to join our growing AI & Data Science team.
● You’ll operate as an internal AI consultant and technical lead, helping multiple teams across Sonatype apply machine learning and generative AI to real-world problems.
● You’ll explore complex datasets, design experiments, build models, and collaborate closely with product engineering, and security experts to turn research ideas into practical, scalable solutions.
● This role is ideal for someone who thrives on autonomy, loves translating ambiguous ideas into working systems, and enjoys working across boundaries rather than in a single product lane.
Lead applied AI projects from concept to impact — prototype, validate, and help teams deploy practical ML and GenAI solutions.
Collaborate cross-functionally: Partner with product, engineering, and research teams to scope problems, identify opportunities, and co-develop solutions.
Act as an internal consultant: Advise teams on ML/AI best practices, model evaluation, and productive use of generative technologies.
Design robust experiments and establish evaluation pipelines for model reliability, accuracy, and business impact.
Bridge research and production: Package research insights into usable APIs, tools, or workflows for other teams.
Explore new techniques (e.g., LLMs, embeddings models, retrieval-augmented generation, agentic workflows) to enhance developer and security experiences.
Share knowledge and mentor peers, helping elevate the organization’s AI literacy and capabilities.
6+ years of experience in applied data science, machine learning, or AI research
Strong Python skills and hands-on experience with ML/AI libraries and platforms such as Databricks, OpenAI API, and Scikit-learn
Comfortable working with large, messy, or unstructured datasets — you know how to turn chaos into features, insights, and beautiful visualizations
Deep familiarity with LLMs and GenAI ecosystems (e.g. OpenAI, Claude, Hugging Face): skilled in prompt engineering, parameter tuning, and evaluating model behavior against ground truth
Experience taking ML or GenAI systems from prototype to production, even if small-scale or incremental
Strong analytical thinking, experimentation skills, and appreciation for trustworthy, data-driven evaluation
Proficiency with Git and collaborative code workflows (GitHub or similar)
A balanced mindset — equally comfortable exploring research ideas and implementing production-ready systems
Proactive and self-directed: you don’t wait for perfect specs; you find meaningful problems and drive them to completion
Experience with AI-assisted coding tools (Copilot, Claude Code, Codex, etc.)
Familiarity with agentic workflows, Model Context Protocol (MCP), and tool-use integrations
Exposure to cybersecurity, anomaly detection, or code analysis
Understanding of MLOps practices (MLflow, AWS SageMaker, model serving, or monitoring)
-
2025 AI Compliance Solution of the Year - AI Breakthrough Awards
-
2025 DEVIES Award to our SBOM Manager new product for its innovation and impact in developer technology
-
2024 Industry Leader in Forrester-Wave for Software Composition Analysis (2024 Q4 report)
-
2023 Fast Company Best Places for Innovators
-
2023 Gartner's Magic Quadrant
-
2023 Software Report's Top 100 Software Companies
-
2023 BuiltIn Best Places to Work
-
2022 Frost & Sullivan Technology Innovation Leader Award
-
2022 PeerSpot Silver Peer Award in Software Composition Analysis
-
2022 Tech Ascension Best DevOps Security Solution Award
-
2022 NVCT Cyber Company of the Year
-
Company Wellness Week - We shut down company operations for a week to enable all employees to pursue personal growth and enjoy a much-needed and deserved rest.
-
Paid Volunteer Time Off (VTO)
-
Expansion of Sonatype’s India Innovation Hub in Hyderabad, reflecting our continued growth, commitment to innovation, and investment in talent to advance AI-driven software security globally