Browse 21 exciting jobs hiring in Llm Optimization now. Check out companies hiring such as Zencore, Jobgether, Darkroom in Tulsa, St. Paul, Austin.
Lead design and delivery of secure, scalable, production-grade AI/ML solutions as Zencore’s Principal Architect, advising clients and shaping cloud-native architectures.
Lead performance and scalability improvements for LLM inference by optimizing runtime components, multi-GPU execution, and open-source serving frameworks at scale.
Darkroom seeks an SEO Specialist to own SEO and GEO strategy across high-growth consumer brands, optimizing for both traditional search and LLM-driven discovery.
Character.AI is seeking a Product Marketing Manager to lead GTM strategy, own ASO across stores, and craft conversion-focused in-product copy for a high-growth consumer AI platform.
Experienced technical product leader needed to own prioritization, quality, and stakeholder alignment for LLM-driven products while staying hands-on with architecture, code reviews, and AI cost optimization.
Drive production-ready model optimization, custom kernel development, and edge deployment to enable real-time inference of large-scale models on vehicle SOCs for Zoox's Perception team.
Fortune Brands is hiring an SEO Specialist to lead SEO, GEO and AI-driven search optimizations for the Master Lock brand across eCommerce and owned websites.
Lead system- and hardware-focused optimizations for LinkedIn’s AI inference platform, improving GPU utilization, compiler workflows, and low-latency model serving at scale.
Lead performance engineering for FSI-focused AI and HPC workloads at NVIDIA, optimizing parallel algorithms and GPU/CPU systems to unlock world-class performance.
Lead the design and optimization of LLM and RAG systems that power personalized, data-driven insights for athletes and coaches at Texas Sports Academy.
Lead the design and delivery of a closed-loop intelligence layer that enables an autonomous trading fleet to learn from real-time outcomes and improve profitability.
Lead development of ML-based combinatorial optimization and design-space-exploration tools to optimize LLM training and inference across GPU/CPU clusters and high-performance networking at datacenter scale.
Instrument is hiring a Senior AI Engineer to design and implement the core multi-agent intelligence, context management, and evals infrastructure for a large-scale, stateful generative-AI simulation project.
Lead the design and deployment of low-latency, production ML systems for voice, audio, and agentic control at an early-stage hardware and software startup in New York City.
Tavus is hiring a Multimodal AI Model Optimization Research Engineer to convert cutting-edge multimodal models into efficient, low-latency production systems.
Work across modeling, systems, and product to design, optimize, and ship production-grade AI systems for real-world users.
Lead the design and implementation of Slate's unified AI backend platform to make model integrations reliable, cost‑efficient, and production-ready at scale.
Concentrate is hiring a hands-on Forward Deployed AI Engineer to combine customer-facing problem solving with engineering work to improve multi-provider LLM routing, reliability, observability, and cost efficiency.
Decagon is hiring a Senior ML Infrastructure Engineer to design and scale distributed training and multi-provider inference platforms for LLMs and multimodal models.
Varick seeks an AI Engineer to architect and ship production-grade agent systems, evaluation pipelines, and retrieval-driven context strategies for enterprise AI deployments.
Project Lion seeks a US-based Prompt Engineer to drive template-to-autorater migrations, optimize prompts using APG/APO tooling, and validate autorater quality versus human baselines.
Below 50k*
0
|
50k-100k*
0
|
Over 100k*
1
|