Browse 24 exciting jobs hiring in Distributed Training now. Check out companies hiring such as Flock, ClickHouse, Workday in Newark, Yonkers, Columbus.
Lead and grow a multimodal ML engineering team to build embedding-based retrieval, cross-modal search, and moderation systems for Flock's safety platform.
ClickHouse is hiring a Curriculum Developer & Instructor to create and deliver technical training and certification content for database engineers and data practitioners across North America.
Workday is hiring a Machine Learning Engineer III to develop and productionize large-scale ML and GenAI solutions that improve payroll processing for enterprise customers.
Lawrence Livermore National Laboratory is hiring a Machine Learning Bioengineer to design, train, and evaluate protein and genome language models that support computational protein design and national security missions.
Drive the infrastructure that enables frontier research by building scalable, high-performance distributed training systems and experiment tooling used across thousands of GPUs.
Help scale production ML infrastructure and retrieval systems at Foxglove to enable high-performance semantic search and data mining over multimodal robotics data.
Lead development of ML-based combinatorial optimization and design-space-exploration tools to optimize LLM training and inference across GPU/CPU clusters and high-performance networking at datacenter scale.
Apply your generative-model and large-scale ML engineering experience to build and productionize world models that drive Waabi’s simulation and autonomous-driving stack.
A Research Engineer role focused on GPU/kernel and distributed-training optimizations to scale and accelerate real-time world-model AI.
Bosch RTC-NA seeks an AI Systems Engineering Intern to prototype scalable ML systems, optimize ML workload performance, and translate research into production-ready solutions for autonomous systems and related domains.
Serve Robotics seeks a Lead Machine Learning Engineer to design and optimize distributed training pipelines and model architectures that power high-performance autonomy for sidewalk delivery robots.
Toyota Research Institute is hiring a Senior Machine Learning Engineer to build ML infrastructure, integrate and fine-tune LLMs, and operationalize multimodal research workflows for robotics, autonomy, energy, and materials programs.
Metamorphic is hiring a Research Engineer to build and operate high-throughput dataloading systems that feed multimodal foundation-model training at scale.
Decagon is hiring a Senior ML Infrastructure Engineer to design and scale distributed training and multi-provider inference platforms for LLMs and multimodal models.
Prime Intellect seeks a Research Engineer to build and optimize scalable RL training and orchestration infrastructure that powers frontier agentic models.
Help scale Creatify’s production ML stack and build state-of-the-art models for recommendation, ranking, and generative video ad workflows as a Machine Learning Engineer on our Mountain View team.
Work on training and deploying large-scale ML systems for physical robots while building the infrastructure and pipelines to operate them in production.
NVIDIA is looking for a Senior Software Engineer to architect and implement CUDA driver features that unlock peak GPU performance across AI, scientific, and graphics workloads.
Contribute as a Software Engineer Intern on the ML Platform team to build scalable, reliable MLOps and training infrastructure that accelerates autonomy research and production at Woven by Toyota.
Lead a cross-functional engineering team building scalable post-training and alignment infrastructure for LLMs at LinkedIn's Mountain View office.
Founding Robot Learning Research Engineer to build end-to-end learning pipelines that turn large-scale factory-collected data into dexterous robot policies and production-ready automation.
Baseten is hiring a Post-Training Research Engineer in San Francisco to build scalable tooling and optimize distributed training and inference pipelines for post-trained transformer models.
Senior technical leader needed to define and drive the architecture and roadmap for LinkedIn’s AI and data infrastructure, ensuring scalable, reliable systems for ML training, inference, and observability.
Lead the design, training, and production deployment of large-scale ML models at Absentia Labs to turn complex scientific data into actionable machine intelligence.
Below 50k*
0
|
50k-100k*
2
|
Over 100k*
19
|