56 Ai Evaluation Jobs Hiring Now (April 2026)

Director of AI Engineering

Cover Whale Hybrid No location specified

Posted 20 hours ago

Lead and build the agentic AI platform that enables pods of engineers and AI agents to safely and reliably deliver production software at scale.

L

AI Engineer

LanguageWire Hybrid No location specified

VIEW

Posted yesterday

LanguageWire is hiring an AI Engineer to design and productionize LLM-based translation workflows and bridge ML experimentation with production engineering.

E

AI Engineer

EQL Tech Hybrid No location specified

VIEW

Posted 2 days ago

Work on a mission-driven fintech team to build and ship core AI products (LLM/VLM and evaluation pipelines) that power eligibility and compliance for education savings accounts.

Engineering Manager, Applied AI

Mercor Hybrid No location specified

VIEW

Posted 2 days ago

Lead and grow an Applied AI engineering team at Mercor to build scalable evaluation and data systems that measurably improve frontier model performance.

k

AI Product Engineer, Clinical Tools

knownwell Hybrid Remote

VIEW

Posted 4 days ago

Lead the product vision and engineering for clinician-facing AI tools at knownwell, building and operating RAG-based clinical decision support with full product ownership and direct clinician partnership.

Senior AI Technical Product Manager - R01563914

Brillio Hybrid New York, New York, United States

VIEW

Posted 4 days ago

Experienced technical product leader needed to own prioritization, quality, and stakeholder alignment for LLM-driven products while staying hands-on with architecture, code reviews, and AI cost optimization.

Machine Learning Engineer, AI Agent Platform

Arta Finance Hybrid Mountain View

VIEW

Posted 4 days ago

Help build and deploy production AI agent platforms that power personalized financial advisory workflows for institutional clients at Arta.

VP, Product (AI & Search) - Slack

Salesforce Hybrid California - San Francisco

VIEW

Posted 6 days ago

Inclusive & Diverse

Rise from Within

Mission Driven

Diversity of Opinions

Work/Life Harmony

Feedback Forward

Take Risks

Collaboration over Competition

Medical Insurance

Dental Insurance

Vision Insurance

Paid Time-Off

Maternity Leave

Paternity Leave

Mental Health Resources

Life insurance

Disability Insurance

Health Savings Account (HSA)

Flexible Spending Account (FSA)

Employee Resource Groups

Lead Slack's search and AI platform as VP Product to set strategy, drive model and infrastructure decisions, and deliver reliable, scalable AI-powered search and knowledge services for enterprise users.

Forward Deployed Engineer

NICE Hybrid USA - Remote

VIEW

Posted 6 days ago

NiCE is hiring a Forward Deployed Engineer to design, ship, and operate production-scale conversational AI agents that solve high-impact enterprise problems.

V

AI Writing Evaluators (Domain Experts) - English Expertise

Volga Partners Hybrid No location specified

VIEW

Posted 6 days ago

Experienced domain experts in Business Operations & Communications or Education and Academic Research are needed for a remote, retainer-based 2‑week role evaluating and crafting prompts for AI writing models with US-contextual standards.

A

Founding Forward Deployed Engineer

Artificial Intelligence Underwriting Company Hybrid San Francisco

VIEW

Posted 6 days ago

Join an early-stage AI safety startup as a founding Forward Deployed Engineer to design rigorous AI evals, lead customer implementations, and shape product strategy for certification of real-world AI agents.

E

(Senior) Researcher

Epoch AI Hybrid Remote

VIEW

Posted 7 days ago

Epoch AI is hiring remote Researchers and Senior Researchers to conduct data-driven investigations, build benchmarks, and forecast AI capabilities and trends.

Product Analyst - Generative AI Platform

Visa Hybrid Austin, TX

VIEW

Posted 7 days ago

Visa is hiring a Product Analyst to define and scale generative AI platform capabilities, combining product analytics, prototyping, and cross-functional collaboration to deliver responsible, enterprise-grade AI solutions.

AI Engineering Intern

Colibri Group Hybrid 1 Remote

VIEW

Posted 7 days ago

Colibri Group is hiring an AI Engineering Intern to help design and evaluate AI-driven educational tools, focusing on model behavior, alignment, and responsible AI practices under senior mentorship.

U

AI Engineer - Public Sector

Unstructured Technologies Inc. Hybrid No location specified

VIEW

Posted 9 days ago

Unstructured is hiring an AI Engineer to architect and ship production-grade RAG and agentic systems that process messy multimodal data for high-impact government and military contracts.

W

Generalist - English & Hindi

Weekday AI Hybrid No location specified

VIEW

Posted 11 days ago

Contract opportunity to evaluate and improve LLM conversational responses in Hindi and English by performing fact-checking, annotation, and qualitative assessment.

Staff Software Engineer, Applied AI

Valence Hybrid San Francisco

VIEW

Posted 12 days ago

Lead the design and production of LLM-driven coaching systems at Valence, applying deep ML and engineering expertise to build enterprise-grade, context-aware AI experiences.

Software Engineer, Machine Learning

LinkedIn Hybrid Sunnyvale, CA

VIEW

Posted 13 days ago

LinkedIn seeks a Hybrid Machine Learning Engineer to build and deploy scalable relevance and evaluation models for recommender systems and generative/NLP-driven product features.

ServiceNow AI.Accelerate Bootcamp

ServiceNow Hybrid Building A,B,C 2225 Lawson Lane, Santa Clara, CALIFORNIA, United States

VIEW

Posted 13 days ago

Inclusive & Diverse

Mission Driven

Rise from Within

Diversity of Opinions

Work/Life Harmony

Empathetic

Feedback Forward

Take Risks

Collaboration over Competition

Medical Insurance

Dental Insurance

Vision Insurance

Mental Health Resources

Life insurance

Disability Insurance

Health Savings Account (HSA)

Flexible Spending Account (FSA)

Conferences Stipend

Paid Time-Off

Maternity Leave

Equity

A selective, eight-week (mostly virtual) unpaid bootcamp at ServiceNow for undergraduate students to learn agentic AI, build and evaluate agents, and present a capstone project during an in-person finale.

Technical Assistance Consultant, Employment and Economic Opportunity

American Institutes for Research Hybrid US-Remote

VIEW

Posted 15 days ago

AIR is hiring a Technical Assistance Consultant to develop and deliver workforce-focused TA, training, and capacity-building to advance economic mobility, workforce development, and future-of-work strategies including AI integration.

AI Product Portfolio Director

ServiceNow Hybrid 15725 Dallas Pkwy, Addison, TX 75001, USA

VIEW

Posted 15 days ago

Inclusive & Diverse

Mission Driven

Rise from Within

Diversity of Opinions

Work/Life Harmony

Empathetic

Feedback Forward

Take Risks

Collaboration over Competition

Medical Insurance

Dental Insurance

Vision Insurance

Mental Health Resources

Life insurance

Disability Insurance

Health Savings Account (HSA)

Flexible Spending Account (FSA)

Conferences Stipend

Paid Time-Off

Maternity Leave

Equity

Lead the strategic integration of AI across ServiceNow marketing by owning the MarTech and agentic product portfolio to drive adoption, efficiency, and measurable business impact.

D

Applied AI & Agent Engineering Lead - Vice President

DB Hybrid Cary, 3000 CentreGreen Way

VIEW

Posted 16 days ago

Senior engineering leader to design, evaluate and productionize agentic AI systems, prompt architectures and multi-agent orchestration for critical banking workflows at Deutsche Bank in Cary, NC.

W

Generative AI Data Analyst - USA (Remote)

Welo Global Hybrid United States

VIEW

Posted 17 days ago

Generative AI Analyst at Welocalize to craft prompts, annotate and evaluate LLM outputs, and lead labeling workflows in a remote full-time role.

AI Architect

Cambium Learning Group Hybrid Remote

VIEW

Posted 17 days ago

Lead the design and implementation of secure, scalable Generative AI and ML architectures for an EdTech organization focused on building production-ready RAG, retrieval, and MLOps solutions.

AI Developer Experience Engineer

Crosby Hybrid New York City

VIEW

Posted 17 days ago

Build the internal tooling and evaluation infrastructure that empowers engineers and researchers to iterate quickly and reliably on Crosby’s LLM-powered legal platform.

N

Recruiting Coordinator - Fully Remote

Neighbors Bank Hybrid Remote - United States

VIEW

Posted 18 days ago

Neighbors Bank is looking for a decisive, process-improvement focused Recruiting Coordinator to manage hiring pipelines, conduct candidate evaluations, and help evolve recruiting practices in a fully remote role.

ML Research Scientist

Handshake Hybrid Remote

VIEW

Posted 18 days ago

Dental Insurance

Disability Insurance

Flexible Spending Account (FSA)

Health Savings Account (HSA)

Vision Insurance

Sabbatical

Paid Holidays

Handshake is hiring an ML Research Scientist to drive open scientific research, create public benchmarks, and collaborate with top AI labs to advance data and evaluation methods for frontier models.

M

Senior AI Data Scientist

MLabs Hybrid No location specified

VIEW

Posted 19 days ago

Lead the design and evaluation of agentic LLM systems that power a fintech's financial intelligence platform, ensuring correctness, scalability, and production reliability.

Instructional Designer/eLearning Developer - IT Initiatives

Awesome Motive Hybrid Remote

VIEW

Posted 20 days ago

SweetRush is hiring an Instructional Designer/eLearning Developer to create and deliver IT-focused learning solutions (AI, cybersecurity, workplace apps) for a global enterprise in a remote, Eastern Time–preferred contract role.

Senior Software Engineer

Awesome Motive Hybrid Chicago

VIEW

Posted 20 days ago

Experienced software engineers with strong system-design and ML/LLM experience are needed to build and productionize LLM-powered agents, evaluation pipelines, and scalable AI infrastructure at Permute.

Staff, Machine Learning Engineer

Fullscript Hybrid No location specified

VIEW

Posted 20 days ago

Fullscript is looking for a Staff Machine Learning Engineer to architect and ship production LLM-driven clinical features that improve clinician workflows and patient outcomes.

Sr. AI Engineer (24 months fixed-term)

Khan Academy Hybrid Remote

VIEW

Posted 21 days ago

Inclusive & Diverse

Diversity of Opinions

Growth & Learning

Mission Driven

Social Impact Driven

Empathetic

Dental Insurance

Flexible Spending Account (FSA)

Health Savings Account (HSA)

Vision Insurance

Performance Bonus

Family Medical Leave

Paid Holidays

Khan Academy is hiring a Senior AI Engineer (24-month fixed-term) to lead integration, evaluation, and quality improvements of generative AI features that support learning at scale.

Medical Image Analyst - AI Trainer

Handshake Hybrid Remote

VIEW

Posted 21 days ago

Dental Insurance

Disability Insurance

Flexible Spending Account (FSA)

Health Savings Account (HSA)

Vision Insurance

Sabbatical

Paid Holidays

Handshake seeks experienced 3D Slicer users to remotely evaluate AI-generated medical imaging content and provide expert feedback on segmentation, DICOM workflows, and clinical research relevance.

VIdeo Content Producer - AI Trainer

Handshake Hybrid Remote

VIEW

Posted 21 days ago

Dental Insurance

Disability Insurance

Flexible Spending Account (FSA)

Health Savings Account (HSA)

Vision Insurance

Sabbatical

Paid Holidays

Handshake seeks experienced Shotcut users to evaluate AI-generated video edits and create tool-focused assessment materials on a flexible, remote, hourly contract basis.

AI Product Portfolio Director- Marketing

ServiceNow Hybrid 15725 Dallas Pkwy, Addison, TX 75001, USA

VIEW

Posted 22 days ago

Inclusive & Diverse

Mission Driven

Rise from Within

Diversity of Opinions

Work/Life Harmony

Empathetic

Feedback Forward

Take Risks

Collaboration over Competition

Medical Insurance

Dental Insurance

Vision Insurance

Mental Health Resources

Life insurance

Disability Insurance

Health Savings Account (HSA)

Flexible Spending Account (FSA)

Conferences Stipend

Paid Time-Off

Maternity Leave

Equity

Lead the AI product portfolio for marketing to turn enterprise AI strategy into a cohesive MarTech roadmap, measurable productivity gains, and durable automation at scale.

AI Product Portfolio Director- Martech

ServiceNow Hybrid 275 Wyman St 2nd floor, Waltham, MA 02451, USA

VIEW

Posted 22 days ago

Inclusive & Diverse

Mission Driven

Rise from Within

Diversity of Opinions

Work/Life Harmony

Empathetic

Feedback Forward

Take Risks

Collaboration over Competition

Medical Insurance

Dental Insurance

Vision Insurance

Mental Health Resources

Life insurance

Disability Insurance

Health Savings Account (HSA)

Flexible Spending Account (FSA)

Conferences Stipend

Paid Time-Off

Maternity Leave

Equity

Lead the AI MarTech product portfolio at ServiceNow to convert AI strategy into scalable agentic workflows, measurable productivity gains, and sustained marketing leverage.

AI Agent Engineer - San Francisco Only

TRM Labs Hybrid San Fracisco

VIEW

Posted 23 days ago

Work on TRM’s AI Engineering team to design and ship agentic LLM systems and scalable infrastructure that augment investigations and ensure safe, auditable behavior in high-sensitivity environments.

a

Sr Lead, Research & Evaluation

aiEDU.org Hybrid San Francisco

VIEW

Posted 23 days ago

aiEDU is hiring a Senior Lead, Research & Evaluation to design and run impact measurement, lead research strategy, and build data systems that inform program decisions across the organization.

V

AI Engineer

Varick Agents Hybrid No location specified

VIEW

Posted 23 days ago

Varick seeks an AI Engineer to architect and ship production-grade agent systems, evaluation pipelines, and retrieval-driven context strategies for enterprise AI deployments.

Principal Software Developer, Applied AI

Savvas Learning Company Hybrid Remote

VIEW

Posted 24 days ago

Lead the design, production deployment, and continual improvement of AI-powered features for Savvas's flagship K-12 platform, applying deep LLM, cloud, and software engineering expertise to improve student learning at scale.

R

Decision Intelligence Analyst

Rwazi, Inc. Hybrid United States

VIEW

Posted 24 days ago

Rwazi is hiring a Decision Intelligence Analyst to validate and improve AI-driven decision outputs by identifying failure modes, formalizing evaluation rubrics, and refining judgment frameworks.

AI Product Portfolio Director

ServiceNow Hybrid 15725 Dallas Pkwy, Addison, TX 75001, USA

VIEW

Posted 24 days ago

Inclusive & Diverse

Mission Driven

Rise from Within

Diversity of Opinions

Work/Life Harmony

Empathetic

Feedback Forward

Take Risks

Collaboration over Competition

Medical Insurance

Dental Insurance

Vision Insurance

Mental Health Resources

Life insurance

Disability Insurance

Health Savings Account (HSA)

Flexible Spending Account (FSA)

Conferences Stipend

Paid Time-Off

Maternity Leave

Equity

Lead the AI product portfolio for marketing at ServiceNow, defining and delivering a unified MarTech and agentic roadmap that drives measurable productivity and enterprise-scale adoption.

Principal Software Engineer (AI) ( US)

PointClickCare Hybrid Remote - US

VIEW

Posted 25 days ago

Lead architecture and delivery of scalable, secure AI and agentic systems at PointClickCare to drive measurable clinical and operational outcomes across the platform.

[Urgent] US English Text Quality Review

CrowdGen by Appen Hybrid United States

VIEW

Posted 25 days ago

Contract reviewers are needed to compare AI-generated English text pairs, choose the clearer response, and provide concise explanations to help improve model output quality.

V

AI Product Testing Engineer

Virtue AI Hybrid San Francisco

VIEW

Posted 26 days ago

Virtue AI is seeking a hands-on Testing Engineer to lead product and backend QA, automate system testing, and perform model red-teaming for a cutting-edge AI security platform.

Principal AI Engineer - Nexus Black

IFS Hybrid Itasca, United States

VIEW

Posted 27 days ago

Lead architecture and delivery of enterprise-scale LLMs, agent orchestration, and retrieval systems to build safe, scalable AI workflows for IFS Nexus Black.

AI Research Engineer

TRM Labs Hybrid San Fracisco

VIEW

Posted 28 days ago

TRM Labs is hiring a Senior AI Research Engineer to drive model evaluation, fine-tuning, and production orchestration for large-scale LLM and ML systems that power blockchain intelligence.

Physics Expert

Albert Hybrid Remote

VIEW

Posted 29 days ago

Handshake AI seeks Physics PhDs to perform flexible, hourly contract work evaluating AI-generated physics content for scientific accuracy and physical reasoning.

Math Expert

Albert Hybrid Remote

VIEW

Posted 29 days ago

Handshake seeks Math PhDs for flexible, remote hourly contracts to design domain-relevant math questions and evaluate AI-generated mathematical reasoning and proofs.

Biology Expert

Albert Hybrid Remote

VIEW

Posted 29 days ago

Handshake seeks doctoral-level biology experts to review and critique AI-generated biological content on a flexible, remote, hourly contract basis.

Below 50k* 2 15%
50k-100k* 1 8%
Over 100k* 10 77%

Ai Evaluation Jobs

How much do ai evaluation jobs pay?

Top companies hiring for ai evaluation jobs

Best cities to find ai evaluation jobs

Sign up for our weekly newsletter of fresh jobs

Sign up for our weekly
newsletter of fresh jobs