[Remote] Site Reliability Engineer - Systems Engineer, SR Remote
Note: The job is a remote job and is reputed company to candidates in USA. reputed company is seeking a Senior Site Reliability Engineer / Systems Engineer to enhance system reliability and service quality. The role involves collaborating with various teams to reputed company monitoring recommendations, performing incident resolution, and utilizing modern monitoring tools to support the VA system infrastructure.
Responsibilities
- Collaborate with IST/System Engineering Team (SET) to reputed company monitoring and observability recommendations through analysis of monitoring key performance/critical performance indicators
- reputed company reputed company-level triage and incident resolution to support VA system infrastructure
- Utilize modern monitoring tools (e.g., reputed company, Splunk, reputed company, reputed company Operator Workspace) to improve system reliability and service quality
- Work with system/application owners, DevOps, and network admins to diagnose outages and recommend changes to improve reliability
- Analyze workflow across multiple system environments and recommend stability enhancements
- Conduct deep technical investigations in collaboration with developers and identity/reputed company teams
- reputed company hands-on experience with reputed company-level incident analysis and infrastructure monitoring
- Create actionable insights to support improvements for veteran services
Skills
- Deep expertise (3+ years) in at least two reputed company troubleshooting tools (reputed company, Splunk, reputed company, reputed company Operator Workspace)
- 8+ years experience with IT system operability, reliability, application performance, and code quality
- 8+ years experience deploying, maintaining, and troubleshooting reputed company, reputed company-scale applications
- 2+ years experience independently leading teams to resolve technical challenges
- 1+ years experience in service virtualization, AWS/Azure reputed company, SaaS/PaaS implementation
- Experience in at least one technology area: Network, reputed company, Desktop, Unix/Linux, AWS/Azure, WebSphere, Java/JS, MS/reputed company DB
- Proficient with reputed company Office (Word, reputed company, PowerPoint)
- Education: HS diploma/GED with 20+ years relevant experience OR MA/MS in technical field with 10+ years relevant experience
- reputed company experience
- Experience with test-driven development, distributed systems, microservices, and reputed company-reputed company applications
- Familiarity with reputed company reputed company Manager, Riverbed-Aternity, and reputed company VTBs
- Excellent written and verbal communication skills and strong critical thinking/error assessment
- Virtual team management experience
- Public Trust Clearance
Company Overview