See all roles

Principal AI Governance Scientist - Artificial Intelligence

Work from home Full-time role Hiring

Position Purpose: Leads the technical evaluation and assurance efforts within our AI Governance team. Establishes enterprise-grade, decision-relevant methodologies for red teaming, testing, and evaluating AI systems across traditional ML, Generative AI, and Agentic AI applications, ensuring evaluations directly inform AI governance decisions, deployment readiness, and ongoing oversight.

  • Develops reproducible frameworks to measure AI value, user impact, and broader outcomes to support responsible scaling, risk acceptance, and investment decisions
  • Designs rigorous evaluation methodologies for assessing AI system performance, safety, reliability, and alignment with intended use across the AI lifecycle, from development through deployment and monitoring
  • Develops criteria and benchmarks to determine whether existing evaluations are adequate and sufficient for different AI applications and risk profiles
  • Designs and executes comprehensive red team exercises to identify vulnerabilities, failure modes, and unintended behaviors across diverse AI systems and devise solutions to address them
  • Develops rigorous evaluation methodologies and criteria to assess whether existing evaluations are adequate and sufficient for different AI applications and risk profiles
  • Establishes standards for evaluation coverage, rigor, and documentation across the AI lifecycle
  • Establishes reproducible methodologies for measuring business value, user impact, and societal outcomes of AI systems using causal inference and experimental design
  • Advances the scientific understanding of AI evaluation and safety through white papers and trainings
  • Provides technical leadership and mentorship to scientists, engineers, and compliance professionals while building organizational evaluation capabilities
  • Stays at the forefront of AI safety research and identify novel risks emerging from advanced AI capabilities, particularly in frontier models
  • Translates complex technical findings into actionable recommendations for leadership, governance boards, and cross-functional teams
  • Collaborates with external researchers, institutions, and industry partners to advance evaluation methodology and contribute to the broader AI safety community
  • Performs other duties as assigned
  • Complies with all policies and standards

Education/Experience: Bachelor's Degree Computer Science, Machine Learning, Statistics, or related quantitative field; or equivalent experience required. Master's Degree preferred Technical Skills:

  • 6+ years AI/ML research, including 3+ years focused on model evaluation, safety, or robustness required. 8+ years preferred
  • Deep technical expertise in modern AI systems with hands-on experience evaluating large language models, generative AI, and/or agentic systems required
  • Proven track record designing rigorous evaluation methodologies and publication record required
  • Strong foundation in statistical methods, experimental design, causal inference, and excellent Python programming skills with ML frameworks required
  • Familiarity with Python-based AI/ML stack using PyTorch and Databricks, with agentic AI frameworks (LangChain, LlamaIndex, LangGraph, AutoGen, CrewAI) for single- and multi-agent systems. Strong focus on LLM observability, MLOps, and evaluation using LangSmith, MLflow, Weights & Biases, Datadog, OpenTelemetry, and testing frameworks like DeepEval and LangTest

Pay Range: $134,600.00 - $249,000.00 per year Centene offers a comprehensive benefits package including: competitive pay, health insurance, 401K and stock purchase plans, tuition reimbursement, paid time off plus holidays, and a flexible approach to work with remote, hybrid, field or office work schedules. Actual pay will be adjusted based on an individual's skills, experience, education, and other job-related factors permitted by law, including full-time or part-time status. Total compensation may also include additional forms of incentives. Benefits may be subject to program eligibility. Apply tot his job Apply To this Job

You might like

AI Solutions Architect 2026 - US at Aimpoint Digital

Work from home Full-time role

Applied AI - Senior Staff Software Engineer, Lead job at Sprinter Health in San Francisco, CA, Menlo Park, CA

Work from home Full-time role

Field Application Engineer, AI Systems & Solutions

Work from home Full-time role

Corporate Legal AI Client Development & Transformation Consultant (JD Required)

Work from home Full-time role

Algo Trading Quant

Work from home Full-time role

Virtual Assistant for Airbnb Host with 5 listings

Work from home Full-time role

Engineering Manager, Guest Displays and Platforms

Work from home Full-time role

Healthcare Advocate job at Alorica in HI

Work from home Full-time role

Algorithm Developer

Work from home Full-time role

Remote Amazon Data Entry Jobs - No Experience - Part-Time

Work from home Full-time role

Telemedicine Physician

Work from home Full-time role

Experienced Customer Support Specialist – Back Office Email & Chat Process for arenaflex

Work from home Full-time role

American Express - Work From Home Analyst - Marketing; Paid

Work from home Full-time role

Experienced Business Analysis Manager – Global Reporting and Insights

Work from home Full-time role

Sr. Process Engineer - Upstream

Work from home Full-time role

[Remote] Sales Specialist - NetSuite

Work from home Full-time role

Experienced Part-Time Remote Data Entry Clerk – Entry-Level Opportunity with arenaflex

Work from home Full-time role

Senior Network Engineer (REMOTE)

Work from home Full-time role

[FULL TIME Remote] Remote Client Service Specialist

Work from home Full-time role

Delta Airlines Remote Jobs Customer Service Agent

Work from home Full-time role