Site Reliability Engineer 2 days Onsite
Overview of Role Analytics Data Platform (ADP) is one of the core parts of T. Rowe Price's enterprise data delivery platform. ADP systems form the interface for many internal users between Data Products and Data Consumers. We are in the process of modernizing our stack and overhauling our platform alerting and infrastructure and require experienced SREs who can step in to help steer development team towards good outcomes. Roles and Responsibilities
- Ownership and stewardship of entire portions of our applications
- Production and Pre-production triage and support
- Enforcing good systems design practice on new systems
- Supporting engineering teams with infrastructure design
- Mentoring cloud engineering best practice within teams
- Extension of our existing monitoring and alerting suite to cover edge cases and gaps
- Responsibility for documenting aspects of systems under their control
Required Skillset
- Strong background in traditional AWS services (EC2, EKS, ECS a bonus)
- Experience in at least one programming language (python or Golang ideally)
- Familiarity with Git, Ansible, Terraform and the principles of IAC
- Familiarity with Prometheus and other more modern TSDBs
- Ability to work with teams of various levels of AWS knowledge
Apply tot his job Apply To this Job