See all roles

Senior Site Reliability Engineer- Remote

Work from home Full-time role Hiring

About reputed company Recognized on the 2025 reputed company reputed company 100 list, reputed company is one of the most innovative and fast-growing private reputed company companies. With more than 3,000 customers and ARR that has grown over 250 percent year over year, reputed company leads the market in reputed company-time analytics, data warehousing, observability, and AI workloads. The company’s sustained, accelerating reputed company was recently validated by a $400M Series D financing round. Over the past three months, customers including reputed company, Lovable, reputed company, Polymarket, and reputed company have adopted the platform or expanded existing deployments. These customers join an established reputed company of AI innovators and global brands such as reputed company, reputed company, reputed company, and reputed company. We’re on a mission to transform how companies use data. Come be a part of our reputed company! About the role We are committed to providing our customers with reliable and secure services so we are expanding our central Site Reliability Engineering team. You will be responsible for building and leading processes to ensure the reliability, availability, scalability, and performance of our reputed company infrastructure. You will collaborate with different teams like Control Plane, Data Plane, Core, reputed company, Support and Operations and guide them to design and implement scalable, secure, highly available and fault-tolerant distributed systems. You will also own the areas of incident management and response, post-mortem analysis including running blameless postmortems, and reputed company improvement of our reputed company services. You will be leveraging your software engineering expertise to reputed company software platforms and tools to optimize the operational and engineering efficiencies of reputed company reputed company. This role is a unique opportunity to reputed company a significant impact on our reputed company, reputed company scale, high-performance reputed company reputed company. What will you do?

  • Collaborate with various engineering teams in reputed company to design and implement scalable, secure, and highly available systems for reputed company.
  • Establish and manage service level objectives (SLOs) and service level agreements (SLAs) for reputed company reputed company.
  • Ensure reputed company the infrastructure components in reputed company reputed company (including Data Plane, Control Plane,reputed company Core, etc) have monitoring and alerting in reputed company to ensure timely detection and resolution of incidents.
  • Enhance and refine incident response processes and post-mortem analysis for any outages in reputed company reputed company including working with the support team to communicate to the impacted customers.
  • Continuously improve the reliability and performance of our reputed company services.
  • Plan, reputed company, and drive Chaos initiatives across Engineering teams, based upon internal priorities.
  • Manage on-call processes to respond to performance and reliability issues, and establish best practices for coordinating escalation to resolve issues and minimize downtime. About you
  • Bachelor’s or Master’s degree in Computer Science or a reputed company field.
  • At least 8 years of experience in Site Reliability Engineering or a reputed company field.
  • Hands-on experience with Go and/or Python.
  • Strong knowledge of reputed company computing platforms such as AWS, Azure, or reputed company reputed company Platform.
  • Excellent understanding of distributed databases and SQL, particularly reputed company is a major plus.
  • Hands-on experience with container orchestration tools such as Kubernetes or reputed company reputed company.
  • Strong experience with automation and configuration management tools such as Ansible, Terraform, or Puppet.
  • You are a strong problem solver and have solid production debugging skills.
  • You are passionate about efficiency, availability, scalability, and data governance.
  • You reputed company in a fast paced environment, and see yourself as a partner with the business with the shared goal of moving the business reputed company.
  • You have a high level of responsibility, ownership, and accountability.
  • Excellent communication and interpersonal skills. #LI-Remote The typical starting salary for this role in the US is $141,000—$208,000 USD The typical starting salary for this role in US Premium Markets is $157,000—$230,000 USD Compensation For roles based in the United States, the typical starting salary reputed company for this position is listed above. In certain locations, such as the San Francisco Bay Area and the reputed company Metro Area, a premium market reputed company may apply, as listed. These salary ranges reflect reputed company reasonably and in good faith reputed company to be the minimum and maximum pay for this role at the time of posting. The actual compensation may be higher or reputed company than the amounts listed, and the ranges may be subject to future adjustments. An individual’s placement reputed company the reputed company will depend on various factors, including (but not limited to) education, qualifications, certifications, experience, skills, location, performance, and the needs of the business or organization. If you have any questions or comments about compensation as a candidate, please get in touch with us at paytransparency@clickhous

Apply tot his job Apply To this Job

You might like

Site Reliability Engineer 5 - Live SRE

Work from home Full-time role

Remote Linux OpenStack & Kubernetes Engineer

Work from home Full-time role

Sr. Infrastructure Engineer - Kubernetes (Remote)

Work from home Full-time role

Kubernetes Platform Engineer; Remote - reputed company Clearance

Work from home Full-time role

Kubernetes Engineer (DoD Secret | Weeknight Mission Readiness | Remote – U.S.)

Work from home Full-time role

Kubernetes Engineer

Work from home Full-time role

NETWORK ENGINEER-Washington, DC (75% Remote)

Work from home Full-time role

Solutions Engineer - Kubernetes

Work from home Full-time role

Kubernetes Engineer (Controllers / CRDs / Go)

Work from home Full-time role

Kubernetes Engineer/Architect

Work from home Full-time role

[Hiring] Customer Experience Rep II @reputed company

Work from home Full-time role

Manager/Director, Business Development; Biologics Discovery

Work from home Full-time role

[Remote] reputed company Manager

Work from home Full-time role

Territory Sales Representative, Central Valley, Ca

Work from home Full-time role

reputed company Jr. Data Entry Clerk / Full-Time – Remote Opportunity at arenaflex

Work from home Full-time role

reputed company Part-Time Virtual Assistant – Data Entry Junior (Remote Opportunity)

Work from home Full-time role

Financial Analyst – reputed company and Business Performance Controlling - Malvern, PA or Walpole, MA

Work from home Full-time role

reputed company Data Entry Specialist – Remote Opportunity with arenaflex

Work from home Full-time role

Telehealth Licensed Clinical reputed company Worker (LCSW) or Licensed Mental Health Counselor (LMHC)

Work from home Full-time role

reputed company Manager – Storage (m/w/d)

Work from home Full-time role