Sauce Labs, 10d ago

Senior Data Scientist

India, Gurugram, Senior
Data Science

Sauce Labs is the world’s largest full-lifecycle test automation platform and the company behind Selenium. Trusted by 80% of the world’s top ten largest financial institutions and over 300,000 enterprise users, Sauce Labs provides the only AI platform capable of turning business intent into autonomous testing and quality assurance. With a proprietary dataset of 8.7 billion test runs, Sauce Labs empowers the Fortune 2000 to bridge the gap between AI-driven code generation and enterprise-grade software quality. Learn more at saucelabs.com.

At Sauce Labs, we’re looking for a Data Scientist / GenAI Engineer to join our team and work directly with our engineering crew on the next generation of AI-powered products. You’ll be right in the mix of building, evaluating, and refining our new AI Assistant, helping our customers unlock deeper, smarter insights from their testing data. If you love collaborating across teams to turn complex data into helpful AI features, we’d love to meet you!

Responsibilities

  • Collaborate with the engineering team to execute experiments and provide insights
    • Prompt engineering and optimization for accuracy, relevance, and hallucination reduction
    • Research new use cases for AI-powered features
    • Monitor the accuracy of AI solutions over time
  • Collect and analyze data across Sauce Labs
    • Manage the data directory across Sauce Labs in collaboration with the data engineering team
    • Analyze time-series testing datasets to identify patterns and insights
    • Analyze telemetry data for performance and usage patterns
    • Analyze logs and traces for root cause analysis
    • Discover actionable insights from the data
  • Evaluate model performance using GenAI evaluation frameworks
    • Design and maintain golden datasets for GenAI evaluation
    • Build evaluation pipelines using MLflow and LLM-as-judge frameworks
    • Develop deterministic and LLM-based scoring rubrics for answer validation

Required Skills

  • 5+ years of experience
  • Strong Python skills (Pandas, data manipulation, LLM frameworks)
  • Experience with GenAI evaluation metrics (recall@k, MRR, faithfulness, F1)
  • Proficiency in prompt engineering (few-shot, grounding, structured outputs)
  • Familiarity with RAG techniques (hybrid retrieval, re-ranking, chunking strategies)
  • SQL proficiency (Snowflake or PostgreSQL)
  • Understanding of LLM-as-judge evaluation and scoring rubrics
  • Knowledge of data governance (bronze/silver/gold data tiers)
  • Experience with experiment tracking tools (MLflow, Weights & Biases, LangSmith)
  • Experience with agentic frameworks (MCP, tool calling, ReAct patterns)

Nice to Have

  • Knowledge of fine-tuning techniques (SFT, LoRA, DPO)
  • Familiarity with vector databases (Pinecone, Weaviate, Chroma)
  • Understanding of LLM security (prompt injection defense, tool safety)
  • Experience with advanced RAG (Graph-RAG, Self-RAG, Corrective RAG)
  • Knowledge of Snowflake Cortex AI features

Please note our privacy terms when applying for a job at Sauce Labs.

Sauce Labs is proud to be an Equal Opportunity employer and values diversity at our company. We do not discriminate on the basis of race, religion, color, national origin, gender identity/expression/status, sexual orientation, age, marital status, veteran status or disability status.

Security Responsibilities

At Sauce, we are committed to supporting the health and safety of employees and properties, partnering with internal stakeholders to learn and act on ever-evolving security protocols and procedures. You’ll be expected to fully comply with all security policies and procedures at the department and org-wide levels and to take a ‘security first’ approach to how we design, build, and run our products and services.

Listing Details

Posted: April 8, 2026
First seen: March 26, 2026
Last seen: April 18, 2026

Posting Health

Days active: 22
Repost count: 0
Trust Level: 50%
Scored at: April 18, 2026
