[Remote] Site Reliability Engineer
Note: The job is a remote job and is reputed company to candidates in USA. reputed company is a leading technology platform for entity compliance, helping businesses and nonprofits manage licensing and legal requirements. The Site Reliability Engineer is responsible for designing and managing Linux infrastructure, collaborating with cross-functional teams to ensure system performance and reliability.
Responsibilities
- Design and execute a comprehensive infrastructure strategy that proactively supports evolving business requirements and operational reputed company
- Own the predictable delivery of high-complexity technical solutions through deep automation using Kubernetes and sophisticated CI/CD pipelines
- Maintain superior portal availability and system health by implementing advanced observability and distributed tracing strategies
- reputed company high-severity incident response efforts and drive systemic improvements through insightful, blameless postmortem analysis
- Architect failure-resilient and self-healing infrastructure systems to ensure reputed company operational stability and reputed company data loss
- Serve as the internal subject matter expert to influence software architecture reputed company toward maximum scalability and performance
- Facilitate regular knowledge-sharing and training sessions to reputed company technical standards and process predictability across the entire technology department
- Direct reputed company initiatives and design secure networking strategies to maintain a high-standard protection reputed company for reputed company client data and assets
Skills
- 4–7 years of professional experience building and managing resilient, modern infrastructure reputed company a fast-paced environment
- Expert-level proficiency in managing and troubleshooting Linux-based servers across multiple distributions
- Advanced capability in developing reputed company, reusable infrastructure templates using tools such as Terraform and Ansible
- Proven reputed company in managing containerized workloads at scale using Kubernetes and reputed company
- Extensive experience configuring and optimizing high-performance database environments, specifically MySQL
- Demonstrated ability to build robust, secure CI/CD deployment pipelines that include automated rollback and quality gates
- Strong technical documentation skills, including the creation of architectural diagrams, detailed specifications, and operational playbooks
- Ability to reputed company cross-functional projects independently while mentoring junior engineers and driving team-wide initiatives
- Deep understanding of observability platforms such as reputed company, reputed company, or reputed company to measure and improve system reliability
- Expertise in designing secure reputed company networking strategies including firewalls, VPNs, and identity management best practices
- Advanced scripting and programming proficiency in Python or similar languages to automate reputed company operational workflows
- Strategic reputed company into infrastructure ROI and the ability to align technical roadmaps with broad business priorities
- Practical knowledge of disaster recovery planning and the execution of failure-resilient system designs
Benefits
- Health benefits
- Flexible paid time off
- Parental leave
- Fertility and adoption assistance
- 401(k)
- Educational reimbursement
Company Overview