Browse 9 exciting jobs hiring in Model Quantization now. Check out companies hiring such as Micron, webAI, Zoox in Tempe, Toledo, Portland.
Micron seeks an experienced Staff Data Scientist to lead the architecture and production deployment of agentic AI and LLM-based systems that accelerate semiconductor design and manufacturing workflows.
Senior Machine Learning Engineer needed to transform prototype AI models into optimized, production-ready systems for secure, distributed public sector and edge deployments.
Drive production-ready model optimization, custom kernel development, and edge deployment to enable real-time inference of large-scale models on vehicle SOCs for Zoox's Perception team.
Lead system- and hardware-focused optimizations for LinkedIn’s AI inference platform, improving GPU utilization, compiler workflows, and low-latency model serving at scale.
Tavus is hiring a Multimodal AI Model Optimization Research Engineer to convert cutting-edge multimodal models into efficient, low-latency production systems.
Lead the development of custom quantization algorithms and low-precision techniques to maximize model performance on Quadric's Chimera GPNPU from our Burlingame engineering office.
Join ADI's Embedded AI Tooling Team to build end-to-end model deployment, optimization, and compilation tooling that unlocks AI on heterogeneous embedded SoCs.
Toyota Research Institute is hiring a Senior Machine Learning Engineer to build ML infrastructure, integrate and fine-tune LLMs, and operationalize multimodal research workflows for robotics, autonomy, energy, and materials programs.
Senior-level embedded AI engineer role at Renesas to lead development of model translation tooling and high-performance inference for resource-constrained MCUs/MPUs.