Browse 60 exciting CUDA jobs hiring now. Check out companies hiring such as Intel, NVIDIA, and Woven by Toyota in Portland, Houston, and Detroit.
Lead development of Intel's neuromorphic AI compiler and runtime to enable production-grade, high-performance physical AI applications across hardware and software ecosystems.
Lead developer advocacy for NVIDIA's Newton and Warp toolchains, partnering with industry, academia, and ISVs to drive GPU-accelerated, differentiable simulation adoption across robotics.
Lead the design and delivery of production-ready online sensor calibration algorithms for Toyota’s autonomous driving systems while optimizing for accuracy, robustness, and constrained runtime environments.
Drive production-ready model optimization, custom kernel development, and edge deployment to enable real-time inference of large-scale models on vehicle SoCs for Zoox's Perception team.
Encord is hiring a Machine Learning Engineer to research, adapt, and productionize cutting-edge computer vision and deep learning methods within a fast-growing AI infrastructure startup.
Lead system- and hardware-focused optimizations for LinkedIn’s AI inference platform, improving GPU utilization, compiler workflows, and low-latency model serving at scale.
Lead performance engineering for FSI-focused AI and HPC workloads at NVIDIA, optimizing parallel algorithms and GPU/CPU systems to unlock world-class performance.
Lead the architecture and operation of production-scale GPU clusters at Andromeda, partnering with customers to maximize distributed training reliability and performance.
NVIDIA's ADI team seeks a Senior Software Engineer to design and implement high-performance C++/CUDA libraries for accelerating GPU data processing and contribute to major open-source projects.
Lead the architecture and delivery of real-time RF sensor software at STR, transitioning algorithms to optimized C/C++ implementations and driving open-system integration across distributed platforms.
Lead design and implementation of real-time computer vision and ML algorithms for minimally invasive robotic surgery at a market-leading medical robotics company.
Lead development of ML-based combinatorial optimization and design-space-exploration tools to optimize LLM training and inference across GPU/CPU clusters and high-performance networking at datacenter scale.
Take a leading role developing state-of-the-art visual intelligence models and systems at an ambitious AI research-focused company based in Palo Alto.
ClearEdge is hiring an HPC Software Engineer III to lead development and performance optimization of compute-intensive, parallel/distributed software for high-impact DoD programs.
Tavus is hiring a Multimodal AI Model Optimization Research Engineer to convert cutting-edge multimodal models into efficient, low-latency production systems.
NVIDIA is seeking a Senior System Software Engineer to architect and implement CUDA driver features for Windows, advancing GPU computing across AI, graphics, and system workloads.
Applied Research Scientist role to design and implement cutting-edge computer vision and generative models that move research from prototype to production in creative simulation tools.
Lead and scale NVIDIA's embedded AI software go-to-market and partner co‑sales to accelerate ISV, OEM, and system integrator adoption of NVIDIA's platform.
Work with research teams to productionize large-scale generative models, build GPU inference infrastructure, and ensure reliable deployment and observability for production ML workloads.
A Research Engineer role focused on GPU/kernel and distributed-training optimizations to scale and accelerate real-time world-model AI.
Lead the development of production-ready software for SpaceX’s metal 3D printing systems, driving controls, data acquisition, and in-process monitoring to improve printed hardware outcomes.
Work on advanced graph neural network models and 3D reconstruction pipelines to power AI-first generative design and BIM generation at an early-stage startup focused on transforming construction design and estimation.
A systems researcher/engineer role focused on prototyping, benchmarking, and system-level analysis of AI and data-center workloads to drive Intel's next-generation architecture and product decisions.
Lead the architecture and delivery of NVIDIA’s Retail & CPG product platform, blending agentic AI and accelerated computing to enable scalable retail, supply chain, and commerce solutions.
Fundamental is hiring a Model Serving Engineer to build and optimize production inference infrastructure for NEXUS, focusing on Triton-based pipelines, GPU efficiency, and low-latency, high-throughput serving.
Lead developer strategy and partnerships to drive adoption of NVIDIA's core CUDA Math Libraries, with a focus on mixed-precision enablement and high-performance numerical computing.
General Robotics seeks an ML Systems Engineer in Redmond to productionize and optimize real-time, GPU-accelerated model serving and ML infrastructure for autonomous robotics.
Anduril is hiring a Senior FPGA Engineer in Costa Mesa to lead Xilinx-based FPGA design, verification, and bring-up for next-generation software-defined radios and EW platforms.
Senior Staff TPM role leading portfolio-level IaaS and GPU-generation programs, shaping NPI frameworks, and coaching TPMs at a rapidly scaling AI infrastructure company.
Drive production-quality integrations of NVIDIA Grove into Dynamo and leading open-source AI frameworks, delivering adapters, runtime components, and developer tooling for scalable training and inference.
KLA is seeking an experienced AI Software Engineer to build and maintain scalable Generative AI and LLM solutions deployed to cloud production environments.
Join vCluster Labs as an AI Infrastructure Specialist to lead technical pre-sales and production deployments of GPU-powered Kubernetes on bare metal, turning early customer engagements into scalable playbooks.
Lumafield is hiring a Senior Embedded Systems Engineer to design and ship high-performance firmware and Linux-based edge software for next-generation CT scanning products in San Francisco.
Lead the development of state estimation and localization algorithms for SandboxAQ’s MagNav team, applying expertise in C++, sensor integration, and navigation theory to novel GNSS-alternative systems.
Toyota Research Institute is hiring a Senior Machine Learning Engineer to build ML infrastructure, integrate and fine-tune LLMs, and operationalize multimodal research workflows for robotics, autonomy, energy, and materials programs.
Work on cutting-edge embedded graphics and interaction software for intraoperative navigation and guidance within a leading surgical-robotics company.
Metamorphic is hiring an ML Research Engineer (Performance Engineering) to implement and optimize GPU kernels, low-precision training, and MoE systems for next-generation foundation models.
Lead NVIDIA’s embedded AI software go-to-market and partner co-sales to drive broad ISV, OEM, and system integrator adoption of NVIDIA AI platforms.
Meshy is hiring an AI 3D Dataset Engineer to design and operate scalable 3D data pipelines, tooling, and quality systems that enable high-performance generative 3D models.
Work on training and deploying large-scale ML systems for physical robots while building the infrastructure and pipelines to operate them in production.
NVIDIA is looking for a Senior Software Engineer to architect and implement CUDA driver features that unlock peak GPU performance across AI, scientific, and graphics workloads.
NVIDIA seeks a seasoned Developer Relations Manager to partner with hyperscaler AI teams, provide hands-on technical enablement for NVIDIA AI software, and drive developer adoption and feedback into the product roadmap.
Lead a cross-functional engineering team building scalable post-training and alignment infrastructure for LLMs at LinkedIn's Mountain View office.
Drive adoption of NVIDIA accelerated computing by advising AI-native startups on architecture, optimization, and scaling of agentic, multimodal, and LLM-powered applications.
NVIDIA seeks a Developer Relations Manager to partner with research labs and universities to drive adoption of its AI and HPC platforms and expand academic AI education.
Adaptive ML is hiring a Performance Engineer (Rust) to develop high-performance, production-grade systems that power scalable RLOps for enterprise LLM deployments.
Lead NVIDIA’s HPC Compiler team to advance high-performance compiler technology and GPU code generation across Python, C++, and Fortran.
Build and optimize real-time XR and AI-driven medical visualization systems at IHC, applying C++/C#/Python engineering and GPU expertise to ship production-grade immersive tools for clinical use.
Work at the kernel layer to design, profile, and ship custom CUDA/ROCm kernels that maximize performance across NVIDIA and AMD GPUs for inference and training workloads.
Work as an Inference Engine Engineer at FriendliAI to design high-performance GPU kernels and core runtime components that power latency-critical, production-scale generative AI systems.
Below 50k*: 0 | 50k-100k*: 0 | Over 100k*: 60