See all roles

Data Scientist (AI Quality & Evaluation)

Work from home Full-time role Hiring

About the Role

We're looking for a Data Scientist to own the quality, reliability, and trustworthiness of our clinical AI outputs. You'll build the systems that ensure our AI "knows what it doesn't know" — developing evaluation frameworks, calibrated confidence scoring, and automated quality assurance that physicians can actually trust.

What You'll Do

  • Design and implement automated evaluation pipelines that assess AI output quality, accuracy, and safety at scale
  • Develop uncertainty quantification systems where confidence scores meaningfully correlate with accuracy
  • Build comprehensive evaluation frameworks combining automated assessment with clinician-validated test cases
  • Implement feedback loops that continuously improve model outputs based on validation signals
  • Establish scalable quality gates that catch errors before they reach end users
  • Contribute to model alignment and fine-tuning efforts

Qualifications

Required

  • Strong foundation in deep learning frameworks (PyTorch) and LLM architectures
  • Experience with model evaluation, benchmarking, and quality metrics
  • Proficiency in Python and modern ML development tools
  • Strong statistical foundations
  • Ability to read, implement, and extend research papers
  • Excellent communication skills

Preferred

  • Master's degree in Computer Science, Machine Learning, Statistics, or related quantitative field (PhD preferred)
  • Publications in top ML/AI venues (NeurIPS, ICML, ICLR, ACL)
  • Experience with RLHF, DPO, or preference optimization techniques
  • Background in healthcare AI or regulated industries
  • Experience building evaluation systems for production LLM applications

Apply tot his job Apply To this Job

You might like

Staff Product Data Scientist - Slack

Work from home Full-time role

Data Scientist 1

Work from home Full-time role

Data Scientist (AI) Time-series forecast

Work from home Full-time role

4055-Senior Healthcare Data Scientist

Work from home Full-time role

Digital Customer Engagement AI Data Scientist

Work from home Full-time role

Lead Data Scientist, Property Catastrophe Modeling & Geo-Analytics

Work from home Full-time role

Member of Technical Staff (Data Scientist, Evals)

Work from home Full-time role

Product Data Scientist

Work from home Full-time role

Senior Data Scientist - Data Analyst job at Jahnel Group in Schenectady, NY

Work from home Full-time role

Data Scientist/Engineer - Junior (Remote)/Junior Sofware Engineer (Remote)

Work from home Full-time role

Manager - Remote Patient Services

Work from home Full-time role

Experienced Part-Time Remote Data Entry Clerk – Flexible Work Schedule and Career Growth Opportunities at arenaflex

Work from home Full-time role

Experienced Full Stack Content & Customer Experience Specialist – Web & Cloud Application Development

Work from home Full-time role

Experienced Customer Experience Concierge – Remote Chat Professional at arenaflex

Work from home Full-time role

Experienced Data Analyst and Online Chat Assistant – Remote Opportunity at arenaflex

Work from home Full-time role

Consultant Sr Lead, Professional Services

Work from home Full-time role

Senior Director - Strategic Partnerships Americas/EMEA

Work from home Full-time role

Digital Solutions - Center of Excellence - Manager (Project Manager/Reporting Analyst) (Location: India)

Work from home Full-time role

Experienced Data Entry Specialist (Remote) - Part-Time at arenaflex - Join Our Dynamic Team

Work from home Full-time role

VIP Host, Alberta

Work from home Full-time role