Browse 18 exciting TensorRT jobs hiring now. Check out companies hiring, such as Zoox, LinkedIn, and NVIDIA, in Nashville-Davidson, Santa Ana, and Detroit.
Drive production-ready model optimization, custom kernel development, and edge deployment to enable real-time inference of large-scale models on vehicle SoCs for Zoox's Perception team.
Lead system- and hardware-focused optimizations for LinkedIn’s AI inference platform, improving GPU utilization, compiler workflows, and low-latency model serving at scale.
Lead performance engineering for FSI-focused AI and HPC workloads at NVIDIA, optimizing parallel algorithms and GPU/CPU systems to unlock world-class performance.
NVIDIA is hiring a Principal Software Engineer to lead architecture, reliability, and production hardening of enterprise agentic AI applications and shared platform services.
Deepgram is hiring an ML Ops Infrastructure Engineer to design and operate scalable model deployment, CI/CD, and monitoring systems that deliver production-grade voice AI at scale.
Tavus is hiring a Multimodal AI Model Optimization Research Engineer to convert cutting-edge multimodal models into efficient, low-latency production systems.
Fundamental is hiring a Model Serving Engineer to build and optimize production inference infrastructure for NEXUS, focusing on Triton-based pipelines, GPU efficiency, and low-latency, high-throughput serving.
Lead and grow a high-performing edge software engineering team to build and scale AI-enabled IoT solutions deployed across distributed devices for a fast-growing intelligent site technology company.
Wyetech is seeking an experienced Software Engineer 2 to productionize ML research into high-performance, containerized systems for federal customers while working hybrid from Laurel, MD.
Metamorphic is hiring an ML Research Engineer (Performance Engineering) to implement and optimize GPU kernels, low-precision training, and MoE systems for next-generation foundation models.
Help build the ML platform powering enterprise agentic automation by owning production AI features end-to-end at Sola’s NYC headquarters.
Drive adoption of NVIDIA accelerated computing by advising AI-native startups on architecture, optimization, and scaling of agentic, multimodal, and LLM-powered applications.
Lead automation and release engineering for NVIDIA DRIVE OS, combining CI/CD, embedded platform expertise, and LLM-driven developer tooling to streamline builds, tests, and public library releases.
Lead development of multi-sensor fusion and 3D perception models to improve obstacle understanding and enable robust autonomous driving for Zoox's vehicle fleet.
Lead federal-focused developer relations to drive integration and adoption of NVIDIA’s GPU-accelerated AI stack across ISVs, defense contractors, and public-sector platforms.
Lead NVIDIA's developer relations strategy for financial services, driving adoption of GPU-accelerated AI across top capital markets ISVs and platform partners.
Lead technical strategy and global developer engagement for manufacturing at NVIDIA, driving adoption of AI and GPU-accelerated platforms across ISVs and developer communities.
Lead the design and scaling of distributed GPU infrastructure and production computer vision pipelines to bring state-of-the-art models from research into reliable, low-latency cloud deployments for Claryo's warehouse AI platform.
Below 50k*: 0 | 50k-100k*: 0 | Over 100k*: 16