Inference Optimization Jobs

Browse 14 exciting jobs hiring in Inference Optimization now. Check out companies hiring such as Zoox, LinkedIn, MLabs in Brownsville, Indianapolis, Newark.

VIEW COMPANIES

AI Inference Engineer - Model Optimization & Deployment

Zoox Hybrid No location specified

VIEW

Posted 5 days ago

Drive production-ready model optimization, custom kernel development, and edge deployment to enable real-time inference of large-scale models on vehicle SOCs for Zoox's Perception team.

Sr. Staff Software Engineer, Systems Infrastructure

LinkedIn Hybrid Mountain View, CA

VIEW

Posted 6 days ago

Lead system- and hardware-focused optimizations for LinkedIn’s AI inference platform, improving GPU utilization, compiler workflows, and low-latency model serving at scale.

Staff AI Engineer

MLabs Hybrid No location specified

VIEW

Posted 9 days ago

Lead the design and delivery of a closed-loop intelligence layer that enables an autonomous trading fleet to learn from real-time outcomes and improve profitability.

Machine Learning Engineer, Platform Integrations

TwelveLabs Hybrid San Francisco

VIEW

Posted 10 days ago

Twelve Labs is hiring a senior Machine Learning Engineer to optimize and scale multimodal video foundation models for deployment across cloud and data platforms.

Senior ML/AI Engineer

Sandbar Hybrid New York City

VIEW

Posted 12 days ago

Lead the design and deployment of low-latency, production ML systems for voice, audio, and agentic control at an early-stage hardware and software startup in New York City.

Multimodal AI Model Optimization Research Engineer

Tavus Hybrid No location specified

VIEW

Posted 12 days ago

Tavus is hiring a Multimodal AI Model Optimization Research Engineer to convert cutting-edge multimodal models into efficient, low-latency production systems.

Member of Technical Staff

Awesome Motive Hybrid San Francisco

VIEW

Posted 13 days ago

Work across modeling, systems, and product to design, optimize, and ship production-grade AI systems for real-world users.

Data Scientist - Model Optimization

Quadric, Inc Hybrid No location specified

VIEW

Posted 14 days ago

Lead the development of custom quantization algorithms and low-precision techniques to maximize model performance on Quadric's Chimera GPNPU from our Burlingame engineering office.

Senior Software Engineer, ML Infrastructure

Decagon Hybrid San Francisco

VIEW

Posted 21 days ago

Decagon is hiring a Senior ML Infrastructure Engineer to design and scale distributed training and multi-provider inference platforms for LLMs and multimodal models.

ML Research Engineer (Performance Engineering)

Awesome Motive Hybrid Palo Alto

VIEW

Posted 21 days ago

Metamorphic is hiring an ML Research Engineer (Performance Engineering) to implement and optimize GPU kernels, low-precision training, and MoE systems for next-generation foundation models.

Senior ML Ops Engineer

Wizard Hybrid Remote - USA

VIEW

Posted 22 days ago

Wizard AI is hiring a Senior MLOps Engineer to own and scale the production ML lifecycle for a real-time inference platform behind a conversational shopping agent.

AI Engineer

Varick Agents Hybrid No location specified

VIEW

Posted 23 days ago

Varick seeks an AI Engineer to architect and ship production-grade agent systems, evaluation pipelines, and retrieval-driven context strategies for enterprise AI deployments.

Developer Advocate Engineer

Dexmate Hybrid Santa Clara

VIEW

Posted 26 days ago

Lead developer-facing content and sample projects that help ML engineers train, fine-tune, and deploy models on Dexmate humanoid robots while shipping production-quality code weekly.

Staff Software Engineer - Computer Vision Deployment

Awesome Motive Hybrid San Francisco

VIEW

Posted last month

Lead the design and scaling of distributed GPU infrastructure and production computer vision pipelines to bring state-of-the-art models from research into reliable, low-latency cloud deployments for Claryo's warehouse AI platform.

Employment type

Remote/Onsite

Application Type

Date Posted

Department

Work Experience

Industries

Skills

Company size

Funding

Company Culture

Benefits & Perks

Company Rating

Salary (USD)

Only show jobs with salary info

Keywords to Exclude

Reset filters

How much do inference optimization jobs pay?

Below 50k* 0 0%
50k-100k* 0 0%
Over 100k* 5 100%

*average yearly salary (USD)

Inference Optimization Jobs

AI Inference Engineer - Model Optimization & Deployment

Sr. Staff Software Engineer, Systems Infrastructure

Staff AI Engineer

Machine Learning Engineer, Platform Integrations

Senior ML/AI Engineer

Multimodal AI Model Optimization Research Engineer

Member of Technical Staff

Data Scientist - Model Optimization

Senior Software Engineer, ML Infrastructure

ML Research Engineer (Performance Engineering)

Senior ML Ops Engineer

AI Engineer

Developer Advocate Engineer

Staff Software Engineer - Computer Vision Deployment

How much do inference optimization jobs pay?

Top companies hiring for inference optimization jobs

Best cities to find inference optimization jobs

Inference Optimization Jobs

How much do inference optimization jobs pay?

Top companies hiring for inference optimization jobs

Best cities to find inference optimization jobs

Sign up for our weekly newsletter of fresh jobs

Sign up for our weekly
newsletter of fresh jobs