
SRE

Work from home | Full-time role | Hiring

Summary

The Purple Platform Engineer – SRE is a hybrid engineering role combining Site Reliability Engineering excellence, cloud-native software engineering expertise, and deep knowledge of our internal Purple Platform, HealthEquity’s cloud-native application delivery ecosystem.

Responsibilities

You will design, build, and operate highly reliable systems while enabling product teams to self-serve, deploy, and operate applications securely and efficiently, aligned with the platform’s core tenets: GitOps integration and cloud-native operational excellence. This role requires an engineer who thrives in modern DevOps environments, understands distributed systems deeply, writes high-quality code, and can translate platform guardrails and policies into a world-class developer experience.

Requirements

Cloud-Native Ecosystem: Strong Kubernetes expertise (workloads, scaling, networking, operators, CRDs). Advanced containerization practices (Docker multi-stage builds, security hardening). Hands-on experience implementing service mesh (Istio) and API gateways. Infrastructure-as-Code (Terraform). Understanding of, and ability to configure and troubleshoot, MongoDB collections, Redis Cache, Azure Service Bus, Azure Document Storage, etc.

Software Engineering Core: Strong background in C#, Python, and/or Node.js. Ability to build highly reliable distributed applications and automation tools. Experience building CI/CD pipelines. Experience with AI-assisted development to improve quality and productivity.

GitOps & Platform Delivery: Deep understanding of declarative deployment workflows (Argo CD, Flux). Expertise in Helm, Kustomize, deployment manifests, and environment modeling. Experience integrating automated tests, scans, and policy controls into Git workflows, supporting the platform’s “shift-left feedback and shift-right enforcement” model.

Observability & Monitoring: Strong experience configuring and using Dynatrace for observability, and setting up OpenTelemetry integrations and App Insights. Competence using Kusto (KQL) and analyzing logs, distributed traces, and performance metrics. Incident response leadership, postmortem writing, and error budget management.

Security & Governance: Familiarity with container scanning, supply chain security, and SBOM tools. Experience applying and troubleshooting security policies and using secure secret management (Vault/KMS). Configuring and implementing Managed Identities for secure authentication. Understanding of compliance frameworks relevant to healthcare systems.

Developer Tooling & Automation: Building internal tools, CLIs, templates, and plug-ins that improve velocity. Strong scripting skills (Bash, PowerShell, Python, Go utilities). Knowledge of Backstage or internal developer portals is a plus.

Preferred Qualifications

3+ years of experience with large-scale, enterprise-grade cloud-native platforms. Previous work in SRE, Platform Engineering, DevOps, or Production Engineering roles. Experience with self-service portals and cloud resource orchestration. Familiarity with classification-driven policy models and governance automation.

Selection Process

1. Screening with the Resilient Co. team.
2. First technical interview.
3. Client interview.
4. Manager interview.
