Rise Jobs & Careers icon Llm Inference Jobs

Browse 20 exciting jobs hiring in Llm Inference now. Check out companies hiring such as Jobgether, Sunday, Awesome Motive in Plano, Huntington Beach, San Antonio.

Photo of the Rise User

Lead performance and scalability improvements for LLM inference by optimizing runtime components, multi-GPU execution, and open-source serving frameworks at scale.

Photo of the Rise User

Contribute to state-of-the-art robot learning and on-robot deployment at a fast-moving consumer robotics startup focused on dexterous home manipulation.

Photo of the Rise User
Posted 4 days ago

Aviator Health seeks a Technical Ex‑Founder to lead 0→1 consumer product development and build autonomous agent systems that navigate real healthcare workflows from our NYC office.

Photo of the Rise User

Drive production-ready model optimization, custom kernel development, and edge deployment to enable real-time inference of large-scale models on vehicle SOCs for Zoox's Perception team.

Photo of the Rise User

Lead system- and hardware-focused optimizations for LinkedIn’s AI inference platform, improving GPU utilization, compiler workflows, and low-latency model serving at scale.

Photo of the Rise User
Bosch Group Hybrid 2555 Smallman St, Pittsburgh, PA 15222, USA
Posted 9 days ago

Lead cutting-edge research on multimodal foundation models and efficient GenAI at Bosch Research Pittsburgh, translating innovations into industrial and product impact while publishing at top-tier venues.

MLabs Hybrid No location specified
Posted 9 days ago

Lead the design and delivery of a closed-loop intelligence layer that enables an autonomous trading fleet to learn from real-time outcomes and improve profitability.

Posted 12 days ago

Lead the design and deployment of low-latency, production ML systems for voice, audio, and agentic control at an early-stage hardware and software startup in New York City.

Photo of the Rise User
Posted 13 days ago

Tavus is hiring a Multimodal AI Model Optimization Research Engineer to convert cutting-edge multimodal models into efficient, low-latency production systems.

Photo of the Rise User
Posted 13 days ago

Work across modeling, systems, and product to design, optimize, and ship production-grade AI systems for real-world users.

Photo of the Rise User
Posted 15 days ago

Pluralsight seeks an experienced Data Scientist to design, validate, and deploy machine learning and NLP solutions that drive product and business impact.

Photo of the Rise User
ASAPP Hybrid No location specified
Posted 19 days ago

Lead the Core GenerativeAgent team to design, build, and deploy low-latency, enterprise-grade conversational voice AI combining LLMs with speech-to-text, text-to-speech, and real-time streaming pipelines.

FriendliAI Hybrid San Francisco
Posted 20 days ago

Shape and own the QA strategy for FriendliAI’s inference platform, covering backend, frontend, model deployments, and novel validation for LLM inference quality.

Photo of the Rise User

Senior technical role focused on researching, engineering, and scaling privacy-preserving ML and LLM alignment solutions across LinkedIn's platforms.

Photo of the Rise User

Decagon is hiring a Senior ML Infrastructure Engineer to design and scale distributed training and multi-provider inference platforms for LLMs and multimodal models.

Varick Agents Hybrid No location specified
Posted 24 days ago

Varick seeks an AI Engineer to architect and ship production-grade agent systems, evaluation pipelines, and retrieval-driven context strategies for enterprise AI deployments.

Photo of the Rise User
Posted 24 days ago
Customer-Centric
Mission Driven
Inclusive & Diverse
Rise from Within
Diversity of Opinions
Work/Life Harmony
Growth & Learning
Transparent & Candid
Medical Insurance
Paid Time-Off
Maternity Leave
Mental Health Resources
Equity
Child Care stipend
Paternity Leave
WFH Reimbursements
Flex-Friendly
Dental Insurance
Vision Insurance
Life insurance
Health Savings Account (HSA)
Flexible Spending Account (FSA)
401K Matching
Military leave

Drive adoption of NVIDIA accelerated computing by advising AI-native startups on architecture, optimization, and scaling of agentic, multimodal, and LLM-powered applications.

Photo of the Rise User

Help shape Baseten's model ecosystem by combining hands-on engineering, developer education, and product thinking to improve model discovery, evaluation, and adoption.

Photo of the Rise User
Posted 27 days ago

Senior Staff AI Engineer to lead research and productionization of privacy-preserving ML (differential privacy, federated learning, secure computation) and LLM alignment across LinkedIn’s AI platforms.

Photo of the Rise User

Lead the design, training, and production deployment of large-scale ML models at Absentia Labs to turn complex scientific data into actionable machine intelligence.

Employment type
Remote/Onsite
Application Type
Date Posted
Department
Work Experience
Industries
Skills
Company size
Funding
Company Culture
Benefits & Perks
Company Rating
Salary (USD)
Keywords to Exclude

How much do llm inference jobs pay?

Below 50k*
0
0%
50k-100k*
0
0%
Over 100k*
1
100%
*average yearly salary (USD)

Top companies hiring for llm inference jobs

Best cities to find llm inference jobs