See all roles

Senior/Staff Research Scientist, Frontier Benchmarks

Work from home Full-time role Hiring

ABOUT THE ROLE We're looking for a Staff or Senior Research Scientist to collaborate with partners and lead the development of the next frontier benchmarks and datasets. This is a highly visible, customer-facing role at the intersection of research, company strategy, and go-to-market. You'll design datasets taking into account frontier model performance and work with our academic partners, and then partner with delivery, product and go-to-market to scale out production. You will also serve as a credible technical partner for our customers, prospects, and drive results that impact the broader research community. This role reports directly to the Head of Research and is ideal for someone who is energized by cross-functional work and wants to understand how startups operate across research, data operations, and commercial teams. MAIN RESPONSIBILITIES

  • Design state of the art datasets that drive frontier model training and evaluation based on current model performance and academic partnerships
  • Translate benchmark insights into clear, compelling narratives that articulate the ROI of expert-curated data for customer-facing presentations, technical reports, and go-to-market materials.
  • Work cross-functionally with data operations, product, engineering, and strategy to surface research findings that inform the company roadmap.
  • Stay at the frontier of LLM evaluation research and bring best practices into Snorkel's workflows
  • Represent Snorkel's research externally through publications, blog posts, conference talks, and customer engagements that advance the conversation around data-centric AI

PREFERRED QUALIFICATIONS

  • Strong research background in AI/ML evaluation, NLP, or related fields, with a track record of rigorous experimental design - especially around measuring the impact of training and evaluation data on model behavior.
  • Exceptional communication skills - able to present complex technical findings clearly to both technical and non-technical audiences
  • Comfort operating in a fast-moving, cross-functional environment with ambiguous problem spaces
  • Genuine interest in GTM strategy, startup dynamics, and the commercial side of AI data services.
  • Ph.D. in machine learning, NLP, or a related field preferred; equivalent industry or research lab experience considered.

Salary Range $220,000-$320,000 USD Apply tot his job Apply To this Job

You might like

Lead AI Research Scientist - NLP

Work from home Full-time role

Staff ML Research Scientist, Co-Folding and Affinity

Work from home Full-time role

AI Research Scientist, Biological Foundation Models

Work from home Full-time role

LLM - Applied AI Research Scientist (USA & LATAM Remote)

Work from home Full-time role

Research Scientist in Radiopharmaceutical Imaging and Dosimetry

Work from home Full-time role

Assistant Research Scientist/Research Scientist 1 32423

Work from home Full-time role

Computational Research Scientist/Sr. Scientist

Work from home Full-time role

Senior Research Scientist, Model Evaluation

Work from home Full-time role

Lead Bioinformatics Scientist, NGS

Work from home Full-time role

AI Training - Research Scientist (PST)

Work from home Full-time role

Adjunct Earth Science Professor (Online – Summer Quarter)

Work from home Full-time role

Experienced Full Stack Customer Service Representative – Amazon Online Chat Support (Part-Time)

Work from home Full-time role

Experienced Remote Data Entry Specialist – Support arenaflex Operations with Precision and Efficiency

Work from home Full-time role

Experienced Technical Program Manager – Cloud Infrastructure and Data Analytics

Work from home Full-time role

HCC Certified Coder

Work from home Full-time role

Non-Tenure Track Faculty, Social Work

Work from home Full-time role

Roadway Engineer II

Work from home Full-time role

Experienced Distribution Customer Service Representative – Delivering Exceptional Customer Experiences at arenaflex

Work from home Full-time role

Bilingual Portuguese‑English Customer Support Consultant – Remote SaaS & Blockchain Solutions

Work from home Full-time role

Functional Consultant/Senior Consultant (HCM Implementation) - US Markets

Work from home Full-time role