Browse 110 exciting jobs hiring in Gpu now. Check out companies hiring such as RunPod, Inc., LinkedIn, Harvard University in Anaheim, Scottsdale, Vancouver.
Runpod is seeking a senior Developer Relations & Community Manager to create technical content, run large-scale community operations (Discord primary), and drive developer adoption of its AI infrastructure platform.
LinkedIn seeks a Staff Technical Program Manager in Mountain View to lead high-impact, AI/LLM-driven programs across Talent Marketplace Engineering and partner orgs to drive roadmap, execution, and measurable outcomes.
Harvard Kennedy School is hiring a Research Infrastructure and Software Engineer to architect and implement reproducible, secure research computing and data solutions that support the School's research and compliance needs.
Lead platform partnerships across energy and utilities to drive grid modernization and AI-driven operational outcomes using NVIDIA's full-stack solutions.
Join a small SF startup as a Platform Engineer to build and own a high-fidelity pre-production EKS environment, internal developer tooling, and deployment pipelines that bridge dev and prod.
Lead a global campaigns team to plan and execute high-visibility data center and AI infrastructure marketing programs that drive awareness and pipeline for NVIDIA and its ecosystem partners.
Work across engine, driver, and hardware teams to profile, optimize, and validate Unreal Engine features on Intel platforms while partnering closely with Epic Games and external studios.
Sygaldry is seeking an ML Infrastructure Engineer to design and operate multi‑cloud GPU orchestration, research compute tooling, and CI/CD pipelines that enable reproducible, scalable ML and simulation workloads.
Lead the reliability and observability strategy for Crusoe’s cloud infrastructure, shaping SLOs, incident response, and platform tooling across compute, storage, and networking.
Prime Intellect is hiring a Senior Security Engineer to define and lead security for a frontier-scale RL training platform and distributed GPU infrastructure.
Technical leader sought to own mechanical cooling strategy and programmatic design for FluidStack’s rapidly scaling, multi-site AI data center portfolio.
Hammerhead is hiring a Site Reliability Engineer to establish and run the reliability function for an AI-driven power orchestration platform deployed across cloud and on-prem data centers.
Experienced Technical Project Manager needed to drive engineering projects and sprint planning for Vultr's AI and cloud infrastructure teams in a remote US role.
Lead system- and hardware-focused optimizations for LinkedIn’s AI inference platform, improving GPU utilization, compiler workflows, and low-latency model serving at scale.
Lead performance engineering for FSI-focused AI and HPC workloads at NVIDIA, optimizing parallel algorithms and GPU/CPU systems to unlock world-class performance.
WeRide is hiring new-grad software engineers to develop real-time in-vehicle systems, backend/cloud infrastructure, and performance-focused ML deployment tooling for autonomous vehicles.
Epoch AI is hiring remote Researchers and Senior Researchers to conduct data-driven investigations, build benchmarks, and forecast AI capabilities and trends.
Lead the architecture and operation of production-scale GPU clusters at Andromeda, partnering with customers to maximize distributed training reliability and performance.
Lead and scale Andromeda's partnerships function, turning strategic relationships across GPU providers, AI labs, VCs, and data centers into durable commercial and operational engines.
DreamWorks Animation is hiring a Software Engineer I to build and maintain Houdini-based artist tools and pipeline systems that streamline production workflows for feature animation.
K1X is hiring a hands-on Machine Learning Operations Engineer to design and operate scalable ML infrastructure, pipelines, and production inference systems for a fully remote, Midwest-preferred startup.
NVIDIA's ADI team seeks a Senior Software Engineer to design and implement high-performance C++/CUDA libraries for accelerating GPU data processing and contribute to major open-source projects.
Lead the architecture and delivery of real-time RF sensor software at STR, transitioning algorithms to optimized C/C++ implementations and driving open-system integration across distributed platforms.
Drive the infrastructure that enables frontier research by building scalable, high-performance distributed training systems and experiment tooling used across thousands of GPUs.
Lead design and implementation of real-time computer vision and ML algorithms for minimally invasive robotic surgery at a market-leading medical robotics company.
Lead cross-functional engineering programs on NVIDIA's Deep Learning Software Team to deliver Gen AI models and scalable software solutions for advanced AI research.
Andromeda is hiring a Compute Trader to source, negotiate, and match GPU compute supply with customer demand across global providers to maximize utilization and revenue.
Lead development and performance optimization of high-performance communication libraries for Intel's HPC/AI runtimes targeting modern CPUs and GPUs.
Render Network Foundation is hiring a UI Frontend Engineer to craft performant React/TypeScript interfaces and real-time 3D experiences for neural rendering and creator workflows.
Crusoe is hiring a Technology Scout to establish and run an R&D scouting function that identifies, evaluates, and advances emerging technologies for AI data center infrastructure.
Build and maintain scalable test automation and stress-testing frameworks to validate distributed AI training and inference infrastructure as our Software Development Engineer in Test.
Lead development of ML-based combinatorial optimization and design-space-exploration tools to optimize LLM training and inference across GPU/CPU clusters and high-performance networking at datacenter scale.
Twelve Labs is hiring a senior Machine Learning Engineer to optimize and scale multimodal video foundation models for deployment across cloud and data platforms.
Contribute to mission-critical radar systems at LeoLabs by developing deployment, testing, observability, and real-time software tools that enable safe and reliable space operations.
TRI's Future Factory team is hiring a Senior Research Engineer to design scalable training/evaluation infrastructure and high-performance geometry and physics-aware tooling that translate research into production-grade systems.
NVIDIA is looking for an experienced System Software Engineer to design and implement GPU system software focused on power, performance, and low-level platform integration.
Deepgram is hiring an ML Ops Infrastructure Engineer to design and operate scalable model deployment, CI/CD, and monitoring systems that deliver production-grade voice AI at scale.
Help build and optimize the real-time on-vehicle software stack for Humble Robotics' autonomous hauling system, working across sensors, inference, tooling, and fleet integrations.
Design and optimize distributed software and low-level system components to support foundation-model training at large scale in a research-focused HPC environment.
Lead development of verification tooling and CI infrastructure to accelerate High-Speed IO ASIC verification for NVIDIA's GPU teams.
Tavus is hiring a Multimodal AI Model Optimization Research Engineer to convert cutting-edge multimodal models into efficient, low-latency production systems.
Senior Director of Product Marketing to lead customer-driven product positioning, competitive strategy, and GTM for Crusoe Cloud, engaging deeply technical AI and infrastructure audiences.
Lead product marketing for Crusoe Cloud by shaping technical messaging, market intelligence, and GTM execution to drive adoption among AI-native companies and infrastructure practitioners.
NVIDIA is seeking a Senior System Software Engineer to architect and implement CUDA driver features for Windows, advancing GPU computing across AI, graphics, and system workloads.
Lead and scale NVIDIA's embedded AI software go-to-market and partner co‑sales to accelerate ISV, OEM, and system integrator adoption of NVIDIA's platform.
Work with research teams to productionize large-scale generative models, build GPU inference infrastructure, and ensure reliable deployment and observability for production ML workloads.
A Research Engineer role focused on GPU/kernel and distributed-training optimizations to scale and accelerate real-time world-model AI.
Lead and build True Anomaly’s AI platform and engineering team to deliver production-grade model hosting, agent infrastructure, and enterprise AI tooling that embed AI across the company.
Sandisk seeks a senior technologist to found and lead an AI Systems & Performance Lab in Milpitas, driving workload‑driven performance analysis and influencing next‑generation system and silicon direction.
Contribute to KIOXIA's AI infrastructure strategy by researching AI platforms/systems, storage implementations, and future storage requirements as an Engineering Management intern.
Below 50k*
0
|
50k-100k*
0
|
Over 100k*
2
|