ironcladhq
ironcladhq11d ago
New
$245K – $295K • Offers Equity/yr

Senior Staff Data Scientist - AI

San Franciscofull-timesenior
OtherStaff Data Scientist
0 views0 saves0 applied

Quick Summary

Overview

Ironclad is the leading AI contracting platform that transforms agreements into assets. Contracts move faster, insights surface instantly, and agents push work forward, all with you in control.

Requirements Summary

Bachelor's or Master's degree in a quantitative field (e.g., Statistics, Computer Science, Data Science, Applied Math). 8+ years of experience in applied ML or data science, preferably in NLP or LLM-based applications.

Technical Tools
pandaspythonsqlab-testingproject-managementsystem-design

Ironclad is the leading AI contracting platform that transforms agreements into assets. Contracts move faster, insights surface instantly, and agents push work forward, all with you in control. Whether you’re buying or selling, Ironclad unifies the entire process on one intelligent platform, providing leaders with the visibility they need to stay one step ahead. That’s why the world’s most transformative organizations, from Rivian to the World Health Organization and the Associated Press, trust Ironclad to accelerate their business.

About the Role

~1 min read

Ironclad is accelerating its investment in AI to redefine how legal teams manage and understand contracts. As part of this effort, we are hiring an AI Evaluation Engineer to work within our AI Pillar. This role is focused on unlocking insights from our training data, designing feedback loops, and ensuring the continuous improvement of our agentic and ML or LLM-based systems through data-driven evaluation and iteration.

You’ll partner closely with AI Engineers and Product Managers to drive better model quality through systematic analysis, experimentation, and the curation of high-leverage datasets. Your work will directly impact the effectiveness of features like Smart Import, contract understanding, and agentic workflows.

Responsibilities

~1 min read
  • Analyze training and evaluation datasets to identify distributional gaps, labeling inconsistencies, and long-tail opportunities.

  • Design and execute labeling campaigns, including development of golden datasets and annotation guidelines.

  • Build and maintain dashboards that track model accuracy, regression trends, and product-specific KPIs like success rate or answer helpfulness.

  • Investigate failure modes via prompt clustering, error taxonomy development, and user intent classification.

  • Operationalize feedback loops: mine product telemetry and human-in-the-loop reviews for signal, then translate into data-driven model improvement strategies.

  • Partner with engineers and PMs to run structured A/B tests and human evaluations for new models or features.

  • Support the development of scalable data and evaluation infrastructure for LLMs and agents.

  • Work with product, engineering and legal to create clear & transparent processes for the handling of customer data in AI training, fine-tuning and evaluation

  • Bachelor's or Master's degree in a quantitative field (e.g., Statistics, Computer Science, Data Science, Applied Math).

  • 8+ years of experience in applied ML or data science, preferably in NLP or LLM-based applications.

  • Strong SQL and Python skills; experience with Jupyter, Pandas, and experiment tracking tools.

  • Comfortable navigating ambiguity, slicing large datasets, and communicating insights clearly to cross-functional stakeholders.

  • Experience with prompt analysis, clustering, or user behavior modeling is a plus.

  • Bonus: familiarity with LLM eval techniques, Reinforcement Learning from Human Feedback (RLHF), or agentic system design.Experience with program management.

AI is critical to the value Ironclad customers get from their contracts, allowing their business to manage risk, close revenue faster and operate more effectively. None of this is possible without reliable and accurate data. This role will lead these efforts, becoming a key contributor to the development of AI solutions in an industry that is likely to be transformed by the new generation of models.

  • Bias for action and data curiosity

  • Ownership mindset and team-first attitude

  • Comfort in fast-paced, iterative environments

  • Passion for building AI products that solve real-world customer problems

What We Offer

~1 min read
100% health coverage for employees (medical, dental, and vision), and 75% coverage for dependents with buy-up plan options available
Market-leading leave policies, including gender-neutral parental leave and compassionate leave
Family forming support through Maven for you and your partner
Paid time off - take the time you need, when you need it
Monthly stipends for wellbeing, hybrid work, and (if applicable) cell phone use
Mental health support through Modern Health, including therapy, coaching, and digital tools
Pre-tax commuter benefits (US Employees)
401(k) plan with Fidelity with employer match (US Employees)
Regular team events to connect, recharge, and have fun
And most importantly: the opportunity to help build the company you want to work at

Location & Eligibility

Where is the job
San Francisco
Hybrid — some on-site time required
Who can apply
Same as job location

Listing Details

Posted
April 28, 2026
First seen
May 6, 2026
Last seen
May 8, 2026

Posting Health

Days active
0
Repost count
0
Trust Level
43%
Scored at
May 6, 2026

Signal breakdown

freshnesssource trustcontent trustemployer trust
Newsletter

Stay ahead of the market

Get the latest job openings, salary trends, and hiring insights delivered to your inbox every week.

A
B
C
D
Join 12,000+ marketers

No spam. Unsubscribe at any time.

ironcladhqSenior Staff Data Scientist - AI$245K – $295K • Offers Equity