Sr. Staff Site Reliability Engineer

Work from home Full-time role Hiring

Description:

  • Define and drive the company-wide reliability strategy across services.
  • Establish end-to-end system visibility frameworks for observability, detection, and resilience.
  • Partner with DevOps and Platform Engineering leadership to standardize SLI/SLOs and improve reliability practices across teams.
  • Serve as a technical escalation expert for reliability issues and incident response.
  • Build intelligent detection systems, including anomaly detection and connector health models.
  • Enable self-service observability for engineering teams.
  • Define and evolve a tiered incident communication strategy.
  • Lead postmortems and improve incident response practices to strengthen customer trust.
  • Contribute hands-on to system design, monitoring, and debugging across distributed systems and data pipelines.

Requirements:

  • 5+ years of experience in SRE, Production Engineering, or a related role.
  • 3+ years of experience operating at a senior or technical leadership level, such as Staff scope or equivalent.
  • Deep expertise with AWS and/or GCP.
  • Experience with Kubernetes and Helm.
  • Experience with observability stacks such as Prometheus and Grafana, or equivalent tools.
  • Experience with CI/CD systems such as GitLab CI/CD and ArgoCD, or similar tools.
  • Proven experience designing and scaling reliability systems for multi-tenant SaaS platforms.
  • Strong debugging and systems thinking across distributed microservices and legacy systems.
  • Demonstrated ability to lead initiatives that improve incident detection, response, and system resilience.
  • Hands-on engineering approach with a track record of building reliability systems, not just configuring them.
  • Experience in B2B SaaS serving enterprise or financial customers, preferred.
  • Familiarity with third-party SaaS connector architectures and ingestion patterns, preferred.
  • Experience building anomaly detection or intelligent alerting systems, preferred.
  • Experience designing customer-facing status pages and incident communication frameworks, preferred.

Benefits:

  • Competitive compensation with equity and 401(k).
  • Comprehensive healthcare with dental and vision coverage.
  • Flexible paid time off and paid holidays.
  • 12 weeks of new parent or family leave.
  • Personal and professional development resources.
  • Base salary range of $232,000 to $263,000 USD.
  • Eligibility for equity awards and possible sales commission or incentive compensation, depending on role or function.

