See all roles

Specialist - Software Engineering (MX)

Work from home Full-time role Hiring

Job Title

Site Reliability Engineer (SRE)

Role Description

We are seeking an experienced Site Reliability Engineer (SRE) with strong DevOps and automation expertise to ensure the reliability, scalability, and performance of distributed systems. This role focuses on CI/CD automation, monitoring, observability, and system troubleshooting across cloud-native and Kubernetes-based environments.

You will play a critical role in building and maintaining monitoring platforms, automating operational processes, and improving system reliability across multiple application domains.

Key Responsibilities

  • Apply Site Reliability Engineering (SRE) and DevOps best practices to improve system availability, performance, and scalability.
  • Design, build, and maintain CI/CD pipelines with a strong focus on automation.
  • Implement and manage metrics collection, monitoring, and alerting across platforms.
  • Perform system troubleshooting and problem-solving across infrastructure and application layers.
  • Create, operate, and maintain Prometheus and Grafana clusters for monitoring Kubernetes environments.
  • Implement and support observability standards, including OpenTelemetry.
  • Develop and maintain automation tools and scripts using Python, Groovy, and Shell.
  • Collaborate with engineering and platform teams to improve reliability, deployment processes, and operational efficiency.

Required Skills & Qualifications

  • Hands-on experience in Site Reliability Engineering (SRE) and DevOps roles.
  • Strong expertise in CI/CD pipelines, automation, and deployment strategies.
  • Experience with metrics collection, monitoring, and alerting systems.
  • Proven ability in system troubleshooting and root cause analysis across platforms and applications.
  • Hands-on experience managing Prometheus and Grafana for Kubernetes cluster monitoring.
  • Strong automation and scripting skills using:
    • Python
    • Shell scripting
    • Groovy
  • Experience working with OpenTelemetry for distributed tracing and observability.

Key Skills

  • SRE experience managing Google Cloud services and accounts.
  • Strong Prometheus and Grafana querying and dashboarding skills.
  • Observability and monitoring best practices.
  • Automation-first mindset with strong scripting capabilities.
  • Kubernetes monitoring and cloud-native operations experience.
Apply To This Job

You might like

Overseas Contractor (BR)

Work from home Full-time role

Ingénieur data - CDI - Montréal, Canada

Work from home Full-time role

Software Engineer - Java - CDI - Bangalore, Inde

Work from home Full-time role

Orchard - Strategy Director Animal/Human Health - New Jersey 3 days

Work from home Full-time role

Business Development Manager (Massachusetts, US)

Work from home Full-time role

Business Development Manager Nordics (f/m/d) (DE)

Work from home Full-time role

Business Development Manager Ireland (f/m/d) (DE)

Work from home Full-time role

Sales Representative

Work from home Full-time role

Senior Proposal Specialist

Work from home Full-time role

Agentic Systems Architect

Work from home Full-time role

Insurance Sales - Training Provided

Work from home Full-time role

Experienced Online Data Entry Specialist for Teens – Flexible Work Arrangements and Competitive Compensation at blithequark

Work from home Full-time role

Research Scientist - Complex Systems & Network Modeling (5720) Remote / Telecommute Jobs

Work from home Full-time role

Experienced Virtual Travel Customer Reservationist – Transforming Passion into a Rewarding Career with arenaflex

Work from home Full-time role

Director - Data Governance

Work from home Full-time role

[Remote] New Orders Specialist

Work from home Full-time role

Vehicle Protection Agent

Work from home Full-time role

Experienced Part-Time Amazon E-commerce Data Entry Clerk - Remote Work Opportunity with Flexible Hours and Growth Potential

Work from home Full-time role

VP, Transaction Banking

Work from home Full-time role

Community Care Manager

Work from home Full-time role