Browse 18 Model Optimization jobs hiring now. Companies hiring include Brillio, Zoox, and LinkedIn, with roles in Aurora, Newport News, and Omaha.
Experienced technical product leader needed to own prioritization, quality, and stakeholder alignment for LLM-driven products while staying hands-on with architecture, code reviews, and AI cost optimization.
Drive production-ready model optimization, custom kernel development, and edge deployment to enable real-time inference of large-scale models on vehicle SoCs for Zoox's Perception team.
Lead system- and hardware-focused optimizations for LinkedIn’s AI inference platform, improving GPU utilization, compiler workflows, and low-latency model serving at scale.
Lead the design and optimization of LLM and RAG systems that power personalized, data-driven insights for athletes and coaches at Texas Sports Academy.
Lead the design and delivery of a closed-loop intelligence layer that enables an autonomous trading fleet to learn from real-time outcomes and improve profitability.
Twelve Labs is hiring a senior Machine Learning Engineer to optimize and scale multimodal video foundation models for deployment across cloud and data platforms.
Lead the design and deployment of low-latency, production ML systems for voice, audio, and agentic control at an early-stage hardware and software startup in New York City.
Tavus is hiring a Multimodal AI Model Optimization Research Engineer to convert cutting-edge multimodal models into efficient, low-latency production systems.
Applied Research Scientist role to design and implement cutting-edge computer vision and generative models that move research from prototype to production in creative simulation tools.
Work across modeling, systems, and product to design, optimize, and ship production-grade AI systems for real-world users.
Lead the development of custom quantization algorithms and low-precision techniques to maximize model performance on Quadric's Chimera GPNPU from our Burlingame engineering office.
Join ADI's Embedded AI Tooling Team to build end-to-end model deployment, optimization, and compilation tooling that unlocks AI on heterogeneous embedded SoCs.
Decagon is hiring a Senior ML Infrastructure Engineer to design and scale distributed training and multi-provider inference platforms for LLMs and multimodal models.
Prime Intellect seeks a Research Engineer to build and optimize scalable RL training and orchestration infrastructure that powers frontier agentic models.
Wizard AI is hiring a Senior MLOps Engineer to own and scale the production ML lifecycle for a real-time inference platform behind a conversational shopping agent.
A PhD internship at Intel to contribute to cutting-edge AI software, focusing on computer vision, model optimization, and hardware/software integration while gaining mentorship and real-world deployment experience.
Project Lion seeks a US-based Prompt Engineer to drive template-to-autorater migrations, optimize prompts using APG/APO tooling, and validate autorater quality versus human baselines.
Lead the design and scaling of distributed GPU infrastructure and production computer vision pipelines to bring state-of-the-art models from research into reliable, low-latency cloud deployments for Claryo's warehouse AI platform.
Below 50k*: 0 | 50k-100k*: 0 | Over 100k*: 9