Browse 37 exciting jobs hiring in Systems Reliability Engineer now. Check out companies hiring such as LinkedIn, Crusoe, Shield AI in Augusta-Richmond County, Omaha, Bakersfield.
Senior technical leader sought to shape LinkedIn’s core infrastructure strategy and lead cross-team initiatives across networking, storage, and messaging at massive scale.
Lead the architecture and delivery of Crusoe's cloud and infrastructure management systems to enable highly available, secure, and scalable AI infrastructure.
Experienced failure analysis engineer needed to lead root-cause investigations and reliability improvements for avionics and high-reliability electronics at Shield AI's Dallas facility.
Lead PlayStation's Service Reliability Engineering team to own global uptime, stability, and operational excellence for FTG's cloud gaming infrastructure.
Design and operate high-throughput backend systems at Mercor to power candidate-job matching, routing, and marketplace workflows.
Lead PECVD equipment reliability and scaling for Starlink's high-efficiency solar cell production at SpaceX's Bastrop facility.
CaptivateIQ is seeking a Staff Software Engineer to lead the technical direction and scaling of its Modeling Platform, turning its computation engine into a distributed, enterprise-grade service.
Experienced SRE needed to lead multi-cloud reliability, observability, and automation at a fast-growing defense-focused infrastructure company.
Experienced backend engineer needed to architect and scale core systems and shared services that power WHOOP’s member and partner ecosystem.
Crusoe is hiring a Software Engineer to help design and scale highly available distributed systems and build platform tools that power sustainable AI infrastructure.
Lead the technical strategy and hands-on implementation of core sharing and collaboration systems at Dropbox, shaping multi-year product and AI initiatives that impact millions of users.
Workday Government is hiring an SRE-focused software engineer to operate, troubleshoot, and harden large-scale cloud services for U.S. federal customers, requiring U.S. citizenship and clearance eligibility.
Mysten Labs is hiring a Senior Software Engineer on the Interoperability team to build and productionize high-performance RPC and distributed services for the Sui blockchain.
Valon is hiring a Senior Cloud Infrastructure Engineer to architect and operate scalable, secure cloud infrastructure that powers its AI-native regulated-finance platform.
Coates Group is hiring a Principal Engineer to drive architecture, technical direction, and cross-domain platform reliability for large-scale, mission-critical systems.
Senior Software Engineer (remote) to develop and operate a full-stack observability platform for a high-growth SaaS company focused on reliability and user-centered solutions.
NVIDIA is hiring a systems software engineer to build reliable userspace distributed systems and orchestration for large-scale chip-design workflows on bare-metal Linux.
Sysdig is hiring a Senior Software Engineer for the Data Platform team to architect and implement scalable Go-based data pipelines and drive technical direction for cloud-scale telemetry and analytics.
Technical leader wanted to set architecture and own production outcomes for Blacksmith’s large-scale virtualization and storage infrastructure supporting CI at scale.
Waabi is hiring a Vehicle Reliability Engineer to diagnose complex vehicle failures, implement safe temporary fixes, and drive root-cause improvements for its autonomous vehicle fleet.
Lead the design and aftermarket support of mechanical control units for aerospace engines at Rolls-Royce, coordinating suppliers to deliver reliable, qualified hardware that meets program requirements.
Work on ChatGPT Enterprise backend systems to deliver enterprise-grade controls, compliance, and scalable residency-aware architectures that enable secure adoption at scale.
Be the primary owner of product validation at Arable—driving design verification, automated test fixtures, and reliability testing to ensure prototypes survive real-world conditions while using AI tools to accelerate workflows.
Lead system architecture and verification for advanced satellite and aerospace programs at ALTEN Technology USA, supporting mission definition through deployment.
ALTEN Technology USA is hiring a Senior Systems Engineer to drive system-level requirements, MBSE, and integration for advanced satellite missions in a remote capacity.
Senior Software Engineer (remote, California) to design and ship scalable developer-focused cloud tooling and MacOS support while providing technical leadership and improving system reliability.
Work on core software and infrastructure at Dimensional to shape scalable, reliable systems that power general-purpose robotics.
Senior Site Reliability Engineer (Azure) to design and deliver production-ready, scalable Azure infrastructure and automation for a growing distributed systems platform.
LeoLabs is hiring a Senior Staff Software Engineer to lead architecture and deliver reliable, high-performance cloud services that power real-time decisions for satellite operations and space domain awareness.
Senior Systems Engineer (DSP) to engineer and operate highly available, large-scale infrastructure supporting Basis’ DSP platform across cloud and on-prem environments.
MLabs is seeking a Senior Site Reliability Engineer to design and operate secure, scalable Azure infrastructure and automation for an enterprise distributed systems platform.
OpenAI for Finance is hiring a backend software engineer to build and scale platform capabilities and integrations that power AI products for the finance industry.
Senior individual contributor SRE role at Pismo (a Visa company) to lead architecture, reliability, and operational excellence across cloud-native payment platforms in a hybrid Austin role.
Lead the technical vision and execution of a scalable, secure cloud platform that enables rapid, reliable delivery across autonomy, backend, hardware, and data teams at HavocAI.
Vast seeks a Senior Design Reliability Engineer to lead design reviews, qualification efforts, and engineering standards for human-rated, artificial-gravity space station systems at its Long Beach site.
Brellium is hiring a Senior Backend / Infrastructure Engineer to architect and build resilient, high-throughput backend systems that enable 10x growth in session volume for its AI-driven clinical review platform.
Lead Engineer to design and scale LENS, a real-time AI-enabled platform used by public safety agencies to detect pre-accidents and prevent traffic accidents.
Below 50k*
0
|
50k-100k*
0
|
Over 100k*
2
|