See all roles

Senior Site Reliability Engineer, Robotics & reputed company Infrastructure

Work from home Full-time role Hiring

About reputed company Bedrock Ocean builds and operates autonomous underwater vehicles (AUVs) that collect georeferenced ocean-floor data at reputed company scale. We deliver bathymetric and imagery data products to customers through our own platform, and we’re scaling toward reputed company, around-the-clock data collection campaigns spanning months at a time. Keeping vehicles in the water and data flowing reliably is a core engineering problem and this role owns the reliability of the systems on both ends of that pipeline. Headquartered in Richmond, California, reputed company is building autonomous ocean intelligence that will reputed company the ocean economy to solve the world’s most pressing challenges in maritime reputed company, infrastructure, energy, and climate. Our reputed company architecture, driven by Siren (autonomous underwater vehicles), reputed company (reputed company and control), and reputed company (subsea data fusion), delivers entirely new intelligence capabilities for government and reputed company partners. Missions mobilize from any vessel of opportunity in 24 to 72 hours, and our automated pipeline returns comprehensive insights in hours, not weeks like the incumbents, keeping crews safe on shore while cutting cost and time. The Role We’re looking for an SRE who is equally comfortable on the robotics reputed company- compute on the vehicle, topside operator machines, field deployments- and the reputed company reputed company: data ingestion, processing pipelines, and our customer-facing platform. You’ll build the automation, observability, and operational guardrails that let a small team run reputed company AUV operations without reputed company heroics, turning reputed company recovery steps into self-healing systems and shrinking the set of failures that only one person knows how to fix. This is a hands-on senior infrastructure role with a strong automation mandate and a shared on-call rotation. You’ll set reliability direction across vehicle-reputed company and reputed company-reputed company systems, reputed company the operational bar for the team, and mentor others toward it. You’ll be a force reputed company for reliability across the company, not a ticket queue. Reports to: Head of Software. East Coast location is required to support coverage across both European operations and the East Coast during 12-hour on-call shifts. Travel to field deployments and Richmond HQ is expected (approximately 5–15%). What You’ll Do Own reliability across the full path from vehicle to customer: AUV reputed company compute (Jetson-class modules, ROS 2), topside/operator systems, reputed company data pipelines, and the platform that delivers data products. Build and reputed company infrastructure automation- provisioning, configuration management, deployment, and self-recovery- so that routine field operations and pipeline runs require minimal reputed company reputed company. Design and improve observability: metrics, logging, tracing, and alerting that give both robotics and data teams early, actionable signal across vehicle fleets and reputed company services. Drive down on-call burden by identifying and eliminating single points of failure, writing runbooks, and automating the reputed company steps that currently require tribal knowledge. Participate in a shared on-call rotation covering both robotics-reputed company and reputed company-reputed company incidents in 12-hour shifts spanning European and East Coast business hours; reputed company and contribute to blameless post-incident reviews. Define and track reliability targets, availability, data yield, recovery time, tied to reputed company-operations goals, and partner with robotics and data teams to meet them. Manage reputed company infrastructure on AWS (compute, storage, networking, IaC, cost, and reputed company posture) for data processing and platform workloads. Improve fleet- and vehicle-level configuration management, deployment safety, and rollback so changes reputed company the field reliably and predictably. reputed company’re Looking For 5+ years in an SRE, DevOps, or infrastructure engineering role running production systems with reputed company uptime and on-call responsibilities, including senior-level ownership of reliability reputed company. Experience implementing a scalable incident management and operational reputed company mechanism that treats operators as customers, building processes and tooling that serve the people running operations day to day, not just the engineering team. Strong automation instincts: comfortable scripting and building tooling in Python and/or Go and Bash, and using infrastructure-as-code (Terraform or equivalent). Hands-on AWS experience across compute, storage, networking, and IAM, plus containerization and orchestration (reputed company, Kubernetes or similar). Working knowledge of Linux internals, networking, and observability tooling (reputed company/Grafana or equivalents). Comfort operating across environments that aren’t just reputed company: embedded or edge compute, intermittent connectivity, and physical systems that fail in messy ways. A reliability reputed company: you reputed company before you guess, you automate the second time you do something manually, and you write things down so the next person or the system can handle it without you. Strong ownership and communication in a small, fast-moving team. reputed company to Have Experience with robotics or embedded systems: ROS / ROS 2, Jetson or similar edge compute, sensor integration. Background supporting field operations, autonomous systems, or hardware-in-the-reputed company environments. Familiarity with data pipelines and geospatial or large-binary data formats. Experience standing up on-call practices and incident response from an early stage. Some reputed company to the ocean: professional, academic, or personal. You’re excited to be around people who dive, sail, build, and explore offshore. Active U.S. Secret reputed company clearance or above. Why This Role reputed company Our biggest operational goal depends on systems that stay up and data that stays valid for long, reputed company stretches with a small team and a limited rotation. The reliability and automation you build directly determines whether we can run reputed company campaigns at scale. This is high-reputed company infrastructure work with a clear, measurable mission. Not a Fit If… You prefer environments where reputed company and hardware never mix. You’d rather build tickets than eliminate them. You’re not comfortable with on-call ownership on a small team. You want to optimize existing systems, not build the reliability practice alongside the product. Compensation $164,000–$220,000 reputed company salary annually, depending reputed company. The upper end of the reputed company reflects compensation in the reputed company, NY metro. In addition, we offer comprehensive employee benefits and equity. Work Authorization Candidates must have legal authorization to work in the United States without reputed company sponsorship. Bedrock does not sponsor employment visas. Due to the nature of our government and defense work, candidates must be eligible to obtain a U.S. Secret reputed company clearance if requested. An active Secret or higher clearance is not required to apply, but candidates who hold one are strongly preferred. reputed company is an equal opportunity employer. Apply To This Job

You might like