See all roles

Software Engineer, Hardware Enablement (Ruyman Reyes, L5)

Work from home Full-time role Hiring

About the position ML developers today face significant friction in taking trained models into deployment. They work in a highly fragmented space, with incomplete and patchwork solutions that require significant performance tuning and non-generalizable, model-specific enhancements. At Modular, we are building the next generation AI platform that will radically improve the way developers build and deploy AI models. As part of our mission to build AI's unified compute layer, we are expanding the Modular software stack to a variety of new and exciting hardware platforms. We are looking for a motivated engineer to join the Hardware Enablement team at Modular. In this role you will work across the Modular software stack — from Mojo kernels and the graph compiler to MAX model serving — to bring up and optimize support for new accelerator platforms. You'll collaborate closely with internal teams and external hardware partners, and you'll develop deep expertise in novel architectures while contributing to our portability story. LOCATION: We welcome candidates who are based in and have work authorization in the United Kingdom, Norway, or the United States (Eastern Time Zone). To support growth and collaboration, those in earlier career stages work in a hybrid capacity at our Edinburgh, UK or Boston, MA office (minimum 2 days per week on-site). Onboarding for new hires is conducted in-person.

Responsibilities

  • Implement and validate support for new hardware architectures across the

Modular stack, working under the guidance of senior engineers on the team

  • Write and optimize Mojo kernels targeting novel accelerator architectures,

with a focus on correctness first and performance iteration

  • Contribute to cross-team efforts improving portability infrastructure,

tooling, and debugging workflows for new target hardware

  • Collaborate with hardware vendor engineers to understand target platforms,

build integration tests, and triage platform-specific issues

  • Develop working knowledge of new hardware platforms — including ISA

documentation, memory hierarchies, and vendor toolchains — and share findings with the team through demos and write-ups

  • Participate in company events such as on-sites and hackathons, contributing

to a collaborative and open engineering culture

Requirements

  • 5+ years of experience in high-performance computing, compiler engineering,

or related domains in industry or research

  • Familiarity with how AI operators are implemented at a low level (e.g.,

experience writing or modifying GPU kernels, custom operators, or working with frameworks like PyTorch at the C++ layer)

  • Proficiency in C++ and experience working in complex, multi-component

software systems

  • Hands-on experience with at least one heterogeneous programming model (CUDA,

SYCL, OpenCL, or similar), either as a user or contributor

  • Some exposure to non-GPU accelerator architectures (DSPs, NPUs, or other

hardware accelerators) is a strong plus

  • Curiosity and willingness to learn new hardware platforms quickly,

comfortable reading architecture manuals and vendor documentation

  • A collaborative, team-oriented attitude and alignment with our culture

Nice-to-haves

  • Experience with GPU DSLs/DSELs such as Triton, CUTLASS, or CuTe
  • Familiarity with MLIR or LLVM compiler infrastructure
  • Experience working directly with hardware vendor teams or on platform

bring-up efforts

  • Exposure to model serving or inference optimization workflows

Benefits

  • Amazing Team.

We are a progressive and agile team with some of the industry’s best engineering and product leaders.

  • World-class Benefits.

In order to attract the best, we need to offer the best. Premier insurance plans, up to 5% 401k matching, flexible paid time off, and more are available to you. Please note that specific benefit packages may vary based on your location.

  • Competitive Compensation.

We offer very strong compensation packages, including stock options. We want people to be focused on their best work and believe in tailoring compensation plans to meet the needs of our workforce.

  • Team Building Events. We organize regular team onsites and local meetups in

Los Altos, CA as well as different cities. Traveling 2-4 times a year is expected for all roles. Apply tot his job Apply To this Job

You might like

AI Engineering Manager​/Delivery Lead; Remote

Work from home Full-time role

Technical Lead / Manager (TLM) - Contract AI

Work from home Full-time role

AI Engineering Manager

Work from home Full-time role

Google Cloud Architect

Work from home Full-time role

Customer AI Strategist

Work from home Full-time role

Hardware Engineering Manager

Work from home Full-time role

Sr. Project Manager Data and AI

Work from home Full-time role

AI Scientist

Work from home Full-time role

Senior/Principal R&D AI Researcher, Onsite

Work from home Full-time role

Sr. AI Solution Architect required for a Cont to Hire Remote Project

Work from home Full-time role

Entry-Level Data Entry Clerk – Part-Time Remote Opportunity for Detail-Oriented Individuals to Join arenaflex and Enhance Data Management Skills

Work from home Full-time role

Evening driver/courier | Bethlehem, PA

Work from home Full-time role

Interior Designer - New York

Work from home Full-time role

Remote Customer Service Representative – Healthcare Member & Provider Support – Full‑Time Work‑From‑Home (US)

Work from home Full-time role

Analyst, PGIM Global Marketing and Distribution Compliance

Work from home Full-time role

Selling Assistant, Women's RTW

Work from home Full-time role

Experienced Data Entry Specialist – Remote Opportunity at arenaflex

Work from home Full-time role

Sr. Technical Recruiter (12-month contract)

Work from home Full-time role

Strategic Account Executive Provider Market - Remote in New York

Work from home Full-time role

QA- Automation Engineer, Cypress

Work from home Full-time role