Inference Jobs

Browse 61 Inference jobs hiring now at companies such as FriendliAI, Spotify, and webAI, in locations including Huntington Beach, Cape Coral, and Atlanta.

Posted 21 hours ago

Develop and productionize agent systems and the Friendli Agent API at FriendliAI to enable developers to build reliable, high-impact AI agent applications.

Posted yesterday
Inclusive & Diverse
Empathetic
Take Risks
Transparent & Candid
Feedback Forward
Mission Driven
Collaboration over Competition
Work/Life Harmony
Maternity Leave
Paternity Leave
Snacks
Medical Insurance
Dental Insurance
Vision Insurance
Mental Health Resources
Life insurance
401K Matching
Paid Sick Days
Paid Time-Off
Paid Volunteer Time

Drive product decisions for Spotify Premium as a Data Scientist focused on experimentation, AI-enabled analytics, and insights that increase conversion and retention.

webAI Hybrid No location specified
Posted yesterday

Senior Machine Learning Engineer needed to transform prototype AI models into optimized, production-ready systems for secure, distributed public sector and edge deployments.

Lead performance and scalability improvements for LLM inference by optimizing runtime components, multi-GPU execution, and open-source serving frameworks at scale.

Contribute to state-of-the-art robot learning and on-robot deployment at a fast-moving consumer robotics startup focused on dexterous home manipulation.

Posted 5 days ago

Aviator Health seeks a Technical Ex‑Founder to lead 0→1 consumer product development and build autonomous agent systems that navigate real healthcare workflows from our NYC office.

Drive production-ready model optimization, custom kernel development, and edge deployment to enable real-time inference of large-scale models on vehicle SOCs for Zoox's Perception team.

Multi Media LLC Hybrid No location specified
Posted 8 days ago

Multi Media LLC is hiring a Senior Data Scientist to lead rigorous statistical analyses and measurement efforts that drive product and business decisions for a high-traffic live streaming platform.

Lead system- and hardware-focused optimizations for LinkedIn’s AI inference platform, improving GPU utilization, compiler workflows, and low-latency model serving at scale.

Lead DeepWalk’s computer vision platform as a Staff Software Engineer, driving the architecture and productionization of ML systems that process millions of images for sidewalk inspection and city infrastructure decisions.

Posted 10 days ago

Lead a small analytics team to drive causal, hypothesis-driven investigations into network reliability and subscriber experience for a major communications client while producing executive-ready insights and recommendations.

Posted 10 days ago

pureIntegration is hiring a Mid-Level Data Analyst to analyze large-scale datasets, produce dashboards and reports, and deliver actionable insights to improve network reliability and subscriber experience on a remote contract.

Bosch Group Hybrid 2555 Smallman St, Pittsburgh, PA 15222, USA
Posted 10 days ago

Lead cutting-edge research on multimodal foundation models and efficient GenAI at Bosch Research Pittsburgh, translating innovations into industrial and product impact while publishing at top-tier venues.

MLabs Hybrid No location specified
Posted 10 days ago

Lead the design and delivery of a closed-loop intelligence layer that enables an autonomous trading fleet to learn from real-time outcomes and improve profitability.

Posted 11 days ago

Help scale production ML infrastructure and retrieval systems at Foxglove to enable high-performance semantic search and data mining over multimodal robotics data.

Twelve Labs is hiring a senior Machine Learning Engineer to optimize and scale multimodal video foundation models for deployment across cloud and data platforms.

Solace Hybrid No location specified
Posted 12 days ago

Solace is seeking a hands-on Marketing Analytics Manager to build and own attribution, incrementality testing, and measurement infrastructure that drives data-informed growth decisions for a fast-scaling healthcare startup.

Deepgram is hiring an ML Ops Infrastructure Engineer to design and operate scalable model deployment, CI/CD, and monitoring systems that deliver production-grade voice AI at scale.

Posted 13 days ago

Lead the design and deployment of low-latency, production ML systems for voice, audio, and agentic control at an early-stage hardware and software startup in New York City.

Posted 14 days ago

Tavus is hiring a Multimodal AI Model Optimization Research Engineer to convert cutting-edge multimodal models into efficient, low-latency production systems.

Posted 14 days ago
Inclusive & Diverse
Collaboration over Competition
Fast-Paced
Growth & Learning

Work with research teams to productionize large-scale generative models, build GPU inference infrastructure, and ensure reliable deployment and observability for production ML workloads.

Posted 14 days ago

Work across modeling, systems, and product to design, optimize, and ship production-grade AI systems for real-world users.

Inclusive & Diverse
Collaboration over Competition
Fast-Paced
Growth & Learning

A Research Engineer role focused on GPU/kernel and distributed-training optimizations to scale and accelerate real-time world-model AI.

Posted 15 days ago

Lead and build True Anomaly’s AI platform and engineering team to deliver production-grade model hosting, agent infrastructure, and enterprise AI tooling that embed AI across the company.

Posted 15 days ago

Lead the development of custom quantization algorithms and low-precision techniques to maximize model performance on Quadric's Chimera GPNPU from our Burlingame engineering office.

Inclusive & Diverse
Rise from Within
Mission Driven
Diversity of Opinions
Work/Life Harmony
Customer-Centric
Fast-Paced
Growth & Learning
Medical Insurance
Dental Insurance
401K Matching
Paid Time-Off
Maternity Leave
Paternity Leave
Mental Health Resources
Flex-Friendly

Drive the design and implementation of experimentation methodologies, inference pipelines, and production tooling as a Full‑Stack Data Scientist on Netflix’s Experimentation Platform.

Posted 16 days ago

Fundamental is hiring a Model Serving Engineer to build and optimize production inference infrastructure for NEXUS, focusing on Triton-based pipelines, GPU efficiency, and low-latency, high-throughput serving.

Lead Blackbird’s analytics layer to translate product and customer data into strategic decisions that accelerate growth and retention.

Posted 16 days ago

Pluralsight seeks an experienced Data Scientist to design, validate, and deploy machine learning and NLP solutions that drive product and business impact.

Triumph Hybrid San Francisco
Posted 17 days ago

Triumph is hiring a Data Scientist to build pricing, risk, and behavior models that drive monetization and retention for a high-growth real-money gaming platform.

Posted 18 days ago

Dentsu is hiring a VP of Data Science to lead and productize advanced measurement science (MMM, RBA, Bayesian methods) and scale a distributed team to deliver client-facing analytics products.

Lead ML-driven improvements to ad auction performance by building scalable models, running experiments, and partnering with engineering and product teams at a fast-paced ad tech organization.

Develop and optimize high-performance C++ AI and computer-vision software for embedded camera systems used in mission-critical public safety and security applications at Motorola Solutions.

Laurel Hybrid No location specified
Posted 19 days ago

Lead the design and productionization of mission-critical NLP and LLM-powered features at Laurel, shaping the AI platform that returns time to professional services firms.

ASAPP Hybrid No location specified
Posted 20 days ago

Lead the Core GenerativeAgent team to design, build, and deploy low-latency, enterprise-grade conversational voice AI combining LLMs with speech-to-text, text-to-speech, and real-time streaming pipelines.

Posted 21 days ago

Lead product strategy and discovery for Kamiwaza’s on-prem enterprise AI orchestration platform, turning customer problems into coherent, outcome-driven releases.

Inclusive & Diverse
Rise from Within
Mission Driven
Diversity of Opinions
Work/Life Harmony
Transparent & Candid
Growth & Learning
Fast-Paced
Collaboration over Competition
Take Risks
Friends Outside of Work
Passion for Exploration
Customer-Centric
Reward & Recognition
Feedback Forward
Rapid Growth
Medical Insurance
Paid Time-Off
Maternity Leave
Mental Health Resources
Equity
Paternity Leave
Fully Distributed
Flex-Friendly
Some Meals Provided
Snacks
Social Gatherings
Pet Friendly
Company Retreats
Dental Insurance
Life insurance
Health Savings Account (HSA)

Amazon Security seeks a Senior Security Engineer to lead offensive operations and research against AI systems, scaling automated threat emulation across the AI portfolio.

FriendliAI Hybrid San Francisco
Posted 21 days ago

Shape and own the QA strategy for FriendliAI’s inference platform, covering backend, frontend, model deployments, and novel validation for LLM inference quality.

Posted 22 days ago

Senior-level embedded AI engineer role at Renesas to lead development of model translation tooling and high-performance inference for resource-constrained MCUs/MPUs.

Senior technical role focused on researching, engineering, and scaling privacy-preserving ML and LLM alignment solutions across LinkedIn's platforms.

Decagon is hiring a Senior ML Infrastructure Engineer to design and scale distributed training and multi-provider inference platforms for LLMs and multimodal models.

Work on FriendliAI's core developer experience by owning the Python SDK and CLI, packaging pipelines, and internal dev tools that enable reliable integrations with our inference and agent platform.

Metamorphic is hiring an ML Research Engineer (Performance Engineering) to implement and optimize GPU kernels, low-precision training, and MoE systems for next-generation foundation models.

SoFi Hybrid (CA - San Francisco; NY - New York City; UT - Salt Lake City; FL, Jacksonville; TX - Frisco)
Posted 23 days ago

Lead a high-performing data science team to build and govern next-generation portfolio management and loss mitigation models for a regulated, consumer-focused fintech.

Posted 24 days ago

Work on training and deploying large-scale ML systems for physical robots while building the infrastructure and pipelines to operate them in production.

Posted 24 days ago

Wizard AI is hiring a Senior MLOps Engineer to own and scale the production ML lifecycle for a real-time inference platform behind a conversational shopping agent.

Andromeda Cluster is hiring an Infrastructure Manager to scale global GPU compute supply and demand matching by sourcing suppliers, optimizing utilization, and negotiating commercial terms.

Customer-Centric
Mission Driven
Inclusive & Diverse
Rise from Within
Diversity of Opinions
Work/Life Harmony
Growth & Learning
Transparent & Candid
Medical Insurance
Paid Time-Off
Maternity Leave
Mental Health Resources
Equity
Child Care stipend
Paternity Leave
WFH Reimbursements
Flex-Friendly
Dental Insurance
Vision Insurance
Life insurance
Health Savings Account (HSA)
Flexible Spending Account (FSA)
401K Matching
Military leave

NVIDIA seeks a seasoned Developer Relations Manager to partner with hyperscaler AI teams, provide hands-on technical enablement for NVIDIA AI software, and drive developer adoption and feedback into the product roadmap.

Varick Agents Hybrid No location specified
Posted 25 days ago

Varick seeks an AI Engineer to architect and ship production-grade agent systems, evaluation pipelines, and retrieval-driven context strategies for enterprise AI deployments.

Posted 25 days ago
Customer-Centric
Mission Driven
Inclusive & Diverse
Rise from Within
Diversity of Opinions
Work/Life Harmony
Growth & Learning
Transparent & Candid
Medical Insurance
Paid Time-Off
Maternity Leave
Mental Health Resources
Equity
Child Care stipend
Paternity Leave
WFH Reimbursements
Flex-Friendly
Dental Insurance
Vision Insurance
Life insurance
Health Savings Account (HSA)
Flexible Spending Account (FSA)
401K Matching
Military leave

Drive adoption of NVIDIA accelerated computing by advising AI-native startups on architecture, optimization, and scaling of agentic, multimodal, and LLM-powered applications.

How much do inference jobs pay?

Below 50k*: 0 (0%)
50k-100k*: 0 (0%)
Over 100k*: 26 (100%)

*average yearly salary (USD)

Best cities to find inference jobs