Browse 12 exciting jobs hiring in Nccl now. Check out companies hiring such as Jobgether, Andromeda Cluster, NVIDIA in Fontana, Colorado Springs, Oklahoma City.
Lead performance and scalability improvements for LLM inference by optimizing runtime components, multi-GPU execution, and open-source serving frameworks at scale.
Lead the architecture and operation of production-scale GPU clusters at Andromeda, partnering with customers to maximize distributed training reliability and performance.
Lead development of ML-based combinatorial optimization and design-space-exploration tools to optimize LLM training and inference across GPU/CPU clusters and high-performance networking at datacenter scale.
Design and optimize distributed software and low-level system components to support foundation-model training at large scale in a research-focused HPC environment.
Work with research teams to productionize large-scale generative models, build GPU inference infrastructure, and ensure reliable deployment and observability for production ML workloads.
A Research Engineer role focused on GPU/kernel and distributed-training optimizations to scale and accelerate real-time world-model AI.
Lead the design and operation of multi-cluster scheduling and orchestration systems to automate placement, maximize accelerator utilization, and keep large-scale ML training reliable and fast.
Metamorphic is hiring an ML Research Engineer (Performance Engineering) to implement and optimize GPU kernels, low-precision training, and MoE systems for next-generation foundation models.
Lead a cross-functional engineering team building scalable post-training and alignment infrastructure for LLMs at LinkedIn's Mountain View office.
Crusoe is hiring a New Grad Software Engineer to build AI-driven automation and observability tooling for large-scale GPU fleets in San Francisco.
NVIDIA seeks a Senior Software Developer, AI Networking to benchmark, profile, and optimize distributed LLM workloads across GPUs, CPUs, and high-performance networking stacks.
Lead design and optimization of NIC and networking software for next-generation GPU and NIC platforms in NVIDIA’s hyperscale engineering team.
Below 50k*
0
|
50k-100k*
0
|
Over 100k*
22
|