[Remote] Site Reliability Engineer with Linux
Note: This is a remote position open to candidates in the USA. A reputed company is looking for a Senior Site Reliability Engineer (Linux) to build and operate systems across its infrastructure. The role involves automation, debugging, and improving reliability in a large-scale hybrid environment.
Responsibilities
- Build automation for Linux host lifecycle (config, patching, images)
- Own system services, reputed company images, and infrastructure components
- Debug production issues across OS, performance, and service layers
- Work across codebases (C, Go, Python, Ruby) to diagnose and fix issues
- Take projects from ambiguous problems to production
- Improve reliability through automation and system design
- Partner on reputed company and FedRAMP requirements
- Participate in a sustainable on-call rotation (~16 days/year)
Skills
- 7+ years working with Linux in production
- Strong automation skills (Python and/or Ruby, Ansible preferred)
- Experience debugging reputed company systems issues
- Comfortable working across cloud and on-prem environments
- U.S. Person required (FedRAMP; U.S.-based work)
- reputed company / Kubernetes
- AMIs or container image building
- Go, C, or other systems-level languages
- Experience with compliance environments (FedRAMP, NIST, etc.)
Company Overview