See all roles

Manufacturing Expert - Quality Evaluator

Work from home Full-time role Hiring

• *About The Job

  • *Mercor

connects elite creative and technical talent with leading AI research labs. Headquartered in San Francisco, our investors include

  • *Benchmark**

,

  • *General Catalyst**

,

  • *Peter Thiel**

,

  • *Adam D'Angelo**

,

  • *Larry Summers**

, and

  • *Jack Dorsey**

.

  • *Position:**

AI Model Evaluation Specialist

  • *Type:
  • *Contract
  • Compensation:
  • $25–$35/hour
  • *Commitment:
  • *20 hours/week
  • *Role Responsibilities
  • Write realistic prompts reflecting professional and consumer domain-specific guidance.
  • Evaluate AI-generated responses for factual accuracy and practical usefulness.
  • Identify fabricated claims and misleading reasoning in model outputs.
  • Score and rank model responses using structured rubrics.
  • Provide written justifications with specific evidence for evaluations.
  • *Qualifications
  • *Must-Have
  • Professional experience applying domain expertise in a practitioner or advisory capacity.
  • Familiarity with industry-specific standards, regulations, or clinical guidelines.
  • Strong written communication and critical reasoning skills.
  • *Application Process (Takes 20–30 mins to complete)
  • Submit your resume to begin.
  • Complete the Model Response Evaluation assessment.
  • *Resources & Support**

• For details about the interview process and platform information, please check: https://talent.docs.mercor.com/welcome/welcome

  • For any help or support, reach out to: [email protected]
  • PS: Our team reviews applications daily. Please complete your AI interview and application steps to be considered for this opportunity.*

, Apply tot his job Apply To this Job

You might like

Senior Product Owner, IaaS (Remote)

Work from home Full-time role

Staff Product Owner (Oracle Retail)

Work from home Full-time role

Educational Technology AI Rater & Evaluator

Work from home Full-time role

Vocational Evaluator

Work from home Full-time role

AI Decision & Response Analyst

Work from home Full-time role

NURSE EVALUATOR III, HEALTH SERVICES

Work from home Full-time role

Finance Model Prompt Evaluator

Work from home Full-time role

AI Quality Evaluator (Polish)

Work from home Full-time role

Healthcare Research Evaluator (STEM) | $30/hr Remote

Work from home Full-time role

Generative AI Evaluator (Russian) | $15/hr Remote

Work from home Full-time role

Experienced Bilingual Licensed Customer Service Representative – Insurance Policy Support and Customer Experience

Work from home Full-time role

Senior Software Engineer - Timeseries - Elasticsearch

Work from home Full-time role

Experienced Entry-Level Data Entry Specialist (Remote) – Flexible Work Arrangements at arenaflex

Work from home Full-time role

Experienced Customer Service Representative – Night Shift Work From Home Opportunity

Work from home Full-time role

AI Expert - Turkmen - Remote

Work from home Full-time role

Part-Time Data Entry Specialist (Night Shifts) – arenaflex

Work from home Full-time role

Experienced Amazon Virtual Assistant/Data Entry Specialist – Remote Part-Time Opportunity

Work from home Full-time role

Accounts Receivable Manager - Insurance Collections

Work from home Full-time role

Experienced Remote Customer Service Representative – Deliver Exceptional Service from the Comfort of Your Own Home

Work from home Full-time role

Experienced Full Stack Data Entry Specialist – Web & Cloud Application Development at arenaflex

Work from home Full-time role