Senior Site Reliability Engineer - Remote
Note: This is a remote position open to candidates in the USA. Kforce Inc is seeking a remote Senior Site Reliability Engineer to serve as a leading member of a team working with a diverse range of technologies. The role involves delivering resilient application stacks, monitoring critical applications, and collaborating with various teams to ensure system availability.
Responsibilities
- Delivery of resilient application stacks via "Infrastructure as Code" and other DevOps practices
- Monitoring and ongoing support of critical, high-revenue business applications
- Diagnosis and resolution of complex system and application issues
- Working with diverse technical and non-technical teams, including Development, QA, IT Operations, Customer Operations and Project Management teams
- Writing and maintaining system and application documentation for technical and non-technical audiences
Skills
- BSc Engineering/Computer Science or relevant experience
- 5+ years of SRE experience
- Proven background working in a technical, IT-related position
- Experience with Configuration Management tools - e.g. Ansible, Puppet, Chef or equivalents
- Professional experience of working within the public cloud - Azure, AWS or GCP
- Hands-on experience with Linux and Windows Server, including support and troubleshooting
- System and application monitoring - e.g. Prometheus, Grafana, Nagios, CloudWatch
- Familiarity with common source control tools - e.g. Git, SVN
- Cloud Architecture and system design to solve key business problems and facilitate team goals
- Strong and enthusiastic technologist, able to demonstrate a broad technical knowledge
- Excellent oral and written communication skills
- Ability to act as a point of expertise, advise others in the team on best practices and impart knowledge
- Azure/AWS certifications
- Experience using orchestration tools such as Terraform, Ansible or CloudFormation
- Experience migrating applications from on-premises to public cloud
- Familiarity with Blue-Green deployment methodologies
- Experience with Continuous Integration/Delivery tools such as GitLab or Jenkins
- Experience working with containerized workloads such as Docker
- Familiarity with Log Management tools - e.g. Elastic Stack, Graylog or Splunk
- Experience working with an enterprise RDBMS such as MySQL and/or Microsoft SQL Server
- Knowledge of change control and associated procedures
- Use of Secret Management services - e.g. HashiCorp Vault
- Familiarity with any high-level programming language
Benefits
- Medical/dental/vision insurance
- HSA
- FSA
- 401(k)
- Life, disability & AD&D insurance for eligible employees
- Salaried personnel receive paid time off
- Hourly employees are not eligible for paid time off unless required by law
- Hourly employees on a Service Contract Act project are eligible for paid sick leave