Browse 37 exciting jobs hiring in Gpu Infrastructure now. Check out companies hiring such as Harvard University, The San Francisco Compute Company, NVIDIA in Greensboro, Fremont, Brownsville.
Harvard Kennedy School is hiring a Research Infrastructure and Software Engineer to architect and implement reproducible, secure research computing and data solutions that support the School's research and compliance needs.
Join a small SF startup as a Platform Engineer to build and own a high-fidelity pre-production EKS environment, internal developer tooling, and deployment pipelines that bridge dev and prod.
Lead a global campaigns team to plan and execute high-visibility data center and AI infrastructure marketing programs that drive awareness and pipeline for NVIDIA and its ecosystem partners.
Sygaldry is seeking an ML Infrastructure Engineer to design and operate multi‑cloud GPU orchestration, research compute tooling, and CI/CD pipelines that enable reproducible, scalable ML and simulation workloads.
Prime Intellect is hiring a Senior Security Engineer to define and lead security for a frontier-scale RL training platform and distributed GPU infrastructure.
Experienced Technical Project Manager needed to drive engineering projects and sprint planning for Vultr's AI and cloud infrastructure teams in a remote US role.
Lead and scale Andromeda's partnerships function, turning strategic relationships across GPU providers, AI labs, VCs, and data centers into durable commercial and operational engines.
Drive the infrastructure that enables frontier research by building scalable, high-performance distributed training systems and experiment tooling used across thousands of GPUs.
Andromeda is hiring a Compute Trader to source, negotiate, and match GPU compute supply with customer demand across global providers to maximize utilization and revenue.
TRI's Future Factory team is hiring a Senior Research Engineer to design scalable training/evaluation infrastructure and high-performance geometry and physics-aware tooling that translate research into production-grade systems.
Deepgram is hiring an ML Ops Infrastructure Engineer to design and operate scalable model deployment, CI/CD, and monitoring systems that deliver production-grade voice AI at scale.
Lead development of verification tooling and CI infrastructure to accelerate High-Speed IO ASIC verification for NVIDIA's GPU teams.
Senior Director of Product Marketing to lead customer-driven product positioning, competitive strategy, and GTM for Crusoe Cloud, engaging deeply technical AI and infrastructure audiences.
Lead product marketing for Crusoe Cloud by shaping technical messaging, market intelligence, and GTM execution to drive adoption among AI-native companies and infrastructure practitioners.
Work with research teams to productionize large-scale generative models, build GPU inference infrastructure, and ensure reliable deployment and observability for production ML workloads.
A Research Engineer role focused on GPU/kernel and distributed-training optimizations to scale and accelerate real-time world-model AI.
Lead and build True Anomaly’s AI platform and engineering team to deliver production-grade model hosting, agent infrastructure, and enterprise AI tooling that embed AI across the company.
Contribute to KIOXIA's AI infrastructure strategy by researching AI platforms/systems, storage implementations, and future storage requirements as an Engineering Management intern.
Lead the design and operation of multi-cluster scheduling and orchestration systems to automate placement, maximize accelerator utilization, and keep large-scale ML training reliable and fast.
Fundamental is hiring a Model Serving Engineer to build and optimize production inference infrastructure for NEXUS, focusing on Triton-based pipelines, GPU efficiency, and low-latency, high-throughput serving.
TensorWave is hiring a hands-on Data Center Manager to own 24/7 operations, uptime, and team development at our high-density GPU facility in New Kensington.
Wyetech is seeking an experienced Software Engineer 2 to productionize ML research into high-performance, containerized systems for federal customers while working hybrid from Laurel, MD.
Armada is hiring a cleared Senior Mission Success Engineer to lead federal deployments, drive authorization and operational stability, and serve as the primary technical owner for mission-critical AI infrastructure.
Crusoe Cloud seeks a Staff Network Deployment Engineer to lead lab network build-outs, validation, and advanced diagnosis for high-performance GPU compute clusters in San Francisco.
General Robotics seeks an ML Systems Engineer in Redmond to productionize and optimize real-time, GPU-accelerated model serving and ML infrastructure for autonomous robotics.
Senior Staff TPM role leading portfolio-level IaaS and GPU-generation programs, shaping NPI frameworks, and coaching TPMs at a rapidly scaling AI infrastructure company.
Lead North American GTM as a hands-on Head of Marketing to drive PLG adoption of GPU instances and LLM APIs for a fast-growing AI infrastructure startup.
Drive production-quality integrations of NVIDIA Grove into Dynamo and leading open-source AI frameworks, delivering adapters, runtime components, and developer tooling for scalable training and inference.
Decagon is hiring a Senior ML Infrastructure Engineer to design and scale distributed training and multi-provider inference platforms for LLMs and multimodal models.
Prime Intellect seeks a Research Engineer to build and optimize scalable RL training and orchestration infrastructure that powers frontier agentic models.
Andromeda Cluster is hiring an Infrastructure Manager to scale global GPU compute supply and demand matching by sourcing suppliers, optimizing utilization, and negotiating commercial terms.
Lead the design and implementation of scalable ML backend systems and sensor-data pipelines to enable production-grade robotics and autonomous manufacturing at Nidus in New York City.
Lead LinkedIn’s physical infrastructure strategy and execution as a Staff TPM, overseeing data center buildouts, AI/GPU deployments, capacity planning, and executive-level program governance.
Lead cross-functional data center and AI infrastructure programs at LinkedIn, driving strategy, execution, and executive alignment to deliver large-scale, reliable physical infrastructure.
Crusoe is hiring a New Grad Software Engineer to build AI-driven automation and observability tooling for large-scale GPU fleets in San Francisco.
Lead enterprise sales into regulated industries for Baseten, owning full-cycle deals, strategic account expansion, and technical evaluations to establish industry lighthouse customers.
Runpod seeks a Customer Marketing Manager to build and scale a repeatable customer storytelling and reference program that drives pipeline and strengthens market credibility.
Below 50k*
0
|
50k-100k*
0
|
Over 100k*
2
|