Browse 8 exciting jobs hiring in Tensorrt Llm now. Check out companies hiring such as Zoox, LinkedIn, NVIDIA in Omaha, Philadelphia, San Antonio.
Drive production-ready model optimization, custom kernel development, and edge deployment to enable real-time inference of large-scale models on vehicle SOCs for Zoox's Perception team.
Lead system- and hardware-focused optimizations for LinkedIn’s AI inference platform, improving GPU utilization, compiler workflows, and low-latency model serving at scale.
Lead performance engineering for FSI-focused AI and HPC workloads at NVIDIA, optimizing parallel algorithms and GPU/CPU systems to unlock world-class performance.
NVIDIA is hiring a Principal Software Engineer to lead architecture, reliability, and production hardening of enterprise agentic AI applications and shared platform services.
Tavus is hiring a Multimodal AI Model Optimization Research Engineer to convert cutting-edge multimodal models into efficient, low-latency production systems.
Help build the ML platform powering enterprise agentic automation by owning production AI features end-to-end at Sola’s NYC headquarters.
Drive adoption of NVIDIA accelerated computing by advising AI-native startups on architecture, optimization, and scaling of agentic, multimodal, and LLM-powered applications.
Lead automation and release engineering for NVIDIA DRIVE OS, combining CI/CD, embedded platform expertise, and LLM-driven developer tooling to streamline builds, tests, and public library releases.