See all roles

[Remote] Software Engineer – AI Coding Evaluation

Work from home Full-time role Hiring

Note: The job is a remote job and is open to candidates in USA. MillionLogics is a global leader in IT solutions specializing in Data & AI, Cloud Solutions, and IT Consulting. They are seeking experienced Software Engineers to evaluate and improve the coding capabilities of frontier AI models by assessing AI-generated code and developing high-quality evaluation datasets and benchmarks.

Responsibilities

  • Review and evaluate AI-generated code for correctness, efficiency, maintainability, and adherence to requirements
  • Analyze software engineering tasks and validate whether proposed solutions meet expected outcomes
  • Debug code, reproduce issues, and verify fixes across different programming environments
  • Assess model-generated explanations, reasoning, and implementation approaches for technical accuracy
  • Create, refine, and maintain evaluation datasets, benchmarks, and grading rubrics for coding tasks
  • Identify edge cases, failure modes, and areas where AI systems struggle with software engineering problems
  • Document findings clearly and provide structured feedback to improve evaluation quality and consistency
  • Collaborate with project teams to establish quality standards and evaluation methodologies

Skills

  • Bachelor's or Master's degree in Computer Science, Software Engineering, or a related technical field
  • 3+ years of professional software engineering experience
  • Strong proficiency in one or more of the following languages: Python, Java, C/C++, Go, Swift, Objective-C, PHP, or SQL
  • Strong understanding of data structures, algorithms, software design principles, and debugging methodologies
  • Experience performing code reviews and evaluating code quality in production or large-scale codebases
  • Ability to analyze complex technical problems and assess solution correctness with minimal supervision
  • Familiarity with version control systems (e.g., Git) and modern software development workflows
  • Strong written communication skills and attention to detail
  • Experience with AI/ML data annotation, NLP, prompt engineering, model evaluation, or LLM-related projects
  • Experience evaluating AI-generated code, benchmark creation, or software quality assessment

Benefits

  • Mode of Work: Remote
  • Contract: 12 months
  • Commitments Required: At least 4 hours per day and minimum 20 hours per week with overlap of 4 hours with PST
  • Engagement type : Contractor assignment (no medical/paid leave)

Company Overview

  • As a trusted Oracle Partner, MillionLogics is more than just an IT solutions provider - it's a global powerhouse blending innovation, expertise, and strategic vision. It was founded in 2020, and is headquartered in London, United Kingdom, GB, with a workforce of 51-200 employees. Its website is https://www.millionlogics.com.
  • Apply To This Job

    You might like

    [Remote] Client Sales Manager

    Work from home Full-time role

    [Remote] Data Engineer

    Work from home Full-time role

    [Remote] Account Executive (Enterprise) - West Region

    Work from home Full-time role

    [Remote] Staff/Senior Machine Learning Scientist - Pricing/Forecasting (Open to Remote)

    Work from home Full-time role

    [Remote] Technical Account Executive

    Work from home Full-time role

    [Remote] Senior Project Manager, South Carolina

    Work from home Full-time role

    [Remote] Senior Fullstack Software Engineer, AI Agents

    Work from home Full-time role

    [Remote] Senior Machine Learning Engineer, GenAI Security

    Work from home Full-time role

    [Remote] Sales Development Manager - AI Frontier Labs

    Work from home Full-time role

    [Remote] Director of Business Development- Lifeycle Support and Solutions (Job ID: 4314)

    Work from home Full-time role

    Remote Special Needs Educator

    Work from home Full-time role

    Principal Software Engineer

    Work from home Full-time role

    Mental Health/Cloquet/Full Time DSP - $18.00-No Mandates

    Work from home Full-time role

    Video Content Creator - Hybrid

    Work from home Full-time role

    Urgently Hiring: National Forum on Second Language Literacy

    Work from home Full-time role

    Registered Nurse (RN) Care Management Specialty – Remote in Deerfield, IL

    Work from home Full-time role

    Experienced Remote Customer Service Representative – Delivering Exceptional Support and Building Strong Relationships with Clients at blithequark

    Work from home Full-time role

    Experienced Full-Time Data Entry Specialist - Remote/Home-Based Opportunity with Competitive Salary, Performance Incentives, and Career Growth

    Work from home Full-time role

    Customer Advocate, Remote

    Work from home Full-time role

    Experienced Data Entry Specialist – Unlock Your Potential in a Remote Role at arenaflex

    Work from home Full-time role