Browse 40 exciting jobs hiring in Site Reliability Engineer now. Check out companies hiring such as UChicago, Trimble, Intel in Garland, Arlington, Indianapolis.
The University of Chicago's CTDS is hiring a Senior Platform Engineer to lead production support, CI/CD pipelines, monitoring, and security automation across hybrid cloud and on‑prem translational data science platforms.
Trimble is seeking a Site Reliability Engineer to strengthen and scale Vista Cloud infrastructure for enterprise AECO customers by delivering automation, robust monitoring, and deep technical support.
Be the engineer who designs and operates large-scale Linux infrastructure, CI/CD pipelines, and automation to power Intel's architecture modeling and simulation workflows.
Ro is hiring a Senior Site Reliability Engineer to strengthen and scale our AWS-based infrastructure, improve uptime and MTTR, and help embed reliability practices across the engineering organization.
SpaceX is hiring a Site Reliability Engineer to build and operate mission-critical application infrastructure that accelerates and secures vehicle and satellite software delivery.
Nabla seeks a senior SRE/Backend engineer to drive platform reliability and scalability for its clinical AI systems supporting clinicians across the US and EU.
Lead PlayStation's Service Reliability Engineering team to own global uptime, stability, and operational excellence for FTG's cloud gaming infrastructure.
Hammerhead is hiring a Site Reliability Engineer to establish and run the reliability function for an AI-driven power orchestration platform deployed across cloud and on-prem data centers.
Stitch Fix is hiring a Platform Engineer to enhance cloud-native infrastructure, developer tooling, and CI/CD workflows to improve developer experience across the company.
Lead the architecture and operation of production-scale GPU clusters at Andromeda, partnering with customers to maximize distributed training reliability and performance.
Anduril's Discovery team is hiring a Site Reliability Engineer to design and operate scalable, secure deployments that integrate cloud, robotics, and mesh networking for mission-critical systems.
Kochava is hiring a Senior Site Reliability Engineer to develop and operate scalable, highly available infrastructure and tooling across cloud and on-prem environments.
ServiceNow seeks a Staff Site Reliability Engineer to drive performance troubleshooting, incident escalation, and availability improvements across its cloud platform while working directly with customers and engineering teams.
HomeVision is hiring an Associate Site Reliability Engineer to help scale its AWS/Terraform platform, improve reliability and observability, and support IT and product initiatives in a fully remote environment.
Lead site reliability and platform engineering efforts at WGU as a Senior Software Engineer, building scalable, cloud-aware systems that power the university's online learning platform.
Crusoe is hiring a Software Engineer to help design and scale highly available distributed systems and build platform tools that power sustainable AI infrastructure.
Medtronic is hiring a Principal Software Cloud Engineer to architect and implement cloud-native microservices for CRM Software at its Minneapolis site.
Workday Government is hiring an SRE-focused software engineer to operate, troubleshoot, and harden large-scale cloud services for U.S. federal customers, requiring U.S. citizenship and clearance eligibility.
Lead ServiceNow CMDB and ETL engineering efforts at Visa to design, build, and operate reliable discovery, ingestion, and data pipelines supporting enterprise CMDB and ITOM capabilities.
Lead Site Reliability Engineer needed to own SLO-driven reliability, Infrastructure as Code, and observability for athenahealth's hybrid cloud infrastructure while mentoring SRE teams.
Lead the Consumer Lending domain's SRE efforts at Toyota Financial Services to drive observability, automation, and high availability for mission-critical applications.
Senior Software Engineer (remote) to develop and operate a full-stack observability platform for a high-growth SaaS company focused on reliability and user-centered solutions.
Sysdig is hiring a Senior Software Engineer for the Data Platform team to architect and implement scalable Go-based data pipelines and drive technical direction for cloud-scale telemetry and analytics.
Lead the design and operation of secure, scalable cloud infrastructure for Anduril's Corporate Technology team as a Senior Site Reliability Engineer focused on reliability, automation, and observability.
Experienced reliability engineer needed to drive automation, observability, incident response, and SLO-driven operations for mission-critical cloud and hybrid systems supporting a U.S. Air Force program.
Work on core software and infrastructure at Dimensional to shape scalable, reliable systems that power general-purpose robotics.
Bluefish seeks a Senior Data Acquisition Engineer to design, operate, and scale production-grade web scraping and ingestion systems that power AI-driven marketing insights.
Lead reliability and security for a distributed GPU marketplace, driving SLOs, incident response, capacity automation, and secure rollouts to ensure 24/7 platform availability.
Pismo (part of Visa) is hiring a Senior Network Platform SRE to design, automate, and operate secure, resilient hybrid and multi-cloud network topologies with a focus on Azure.
Provide platform reliability and incident ownership for DSN's AWS-based customer services, driving operational improvements and cross-team coordination.
CyberArk is hiring a Senior Production Engineer to architect and operate highly available, secure cloud infrastructure and CI/CD pipelines for its machine identity security platform.
Senior Site Reliability Engineer (Azure) to design and deliver production-ready, scalable Azure infrastructure and automation for a growing distributed systems platform.
Senior Systems Engineer (DSP) to engineer and operate highly available, large-scale infrastructure supporting Basis’ DSP platform across cloud and on-prem environments.
MLabs is seeking a Senior Site Reliability Engineer to design and operate secure, scalable Azure infrastructure and automation for an enterprise distributed systems platform.
DriveWarealth seeks a Senior Site Reliability Engineer to build automation, observability, and resilient cloud-native platforms that support global brokerage operations.
Senior Backend Engineer for Commure's RCM team to build scalable, production-grade Python services that transform revenue cycle management for healthcare providers.
Senior individual contributor SRE role at Pismo (a Visa company) to lead architecture, reliability, and operational excellence across cloud-native payment platforms in a hybrid Austin role.
Lead CI/CD and infrastructure automation efforts as a Staff Site Reliability Engineer at Pismo (Visa) to strengthen platform resilience and mentor engineering teams.
Lead the Site Reliability Engineering efforts for NG911 and other mission-critical systems, driving HA architecture, automation, and incident excellence at Motorola Solutions in Chicago.
Rula is looking for a Staff Software Engineer — Platform Infrastructure to lead reliability, observability, and platform automation across a remote-first engineering organization.
Below 50k*
0
|
50k-100k*
0
|
Over 100k*
25
|