Model Evaluation Jobs

Browse 24 jobs in Model Evaluation that are hiring now. Companies hiring include thomsonreuters, sonyglobal, and Iambic Therapeutics, Inc, in locations such as San Antonio, Arlington, and Anaheim.

Staff Software Engineer / Architect - AI, CoCounsel FDE

thomsonreuters Hybrid United States of America, Frisco, Texas

Posted 14 hours ago

Lead the design and delivery of scalable, secure AI-native systems for sophisticated legal customers as a Staff Software Engineer / Architect on Thomson Reuters' CoCounsel FDE team.

Research Intern - AI Ethics

sonyglobal Hybrid Remote - California

Posted 17 hours ago

Sony AI’s Research Ethics team is hiring a remote Research Intern to work on generative AI ethics, evaluation, and harm-mitigation research with opportunities for publication.

Software Engineer II, Machine Learning Systems & Productization

Iambic Therapeutics, Inc Hybrid San Diego

Posted 4 days ago

Iambic Therapeutics seeks a Software Engineer II to co-develop and harden ML training, evaluation, and productization workflows that enable AI-driven drug discovery.

Engineering Manager, Applied AI

Mercor Hybrid No location specified

Posted 4 days ago

Lead and grow an Applied AI engineering team at Mercor to build scalable evaluation and data systems that measurably improve frontier model performance.

Senior AI Technical Product Manager - R01563914

Brillio Hybrid New York, New York, United States

Posted 5 days ago

Experienced technical product leader needed to own prioritization, quality, and stakeholder alignment for LLM-driven products while staying hands-on with architecture, code reviews, and AI cost optimization.

VP, Product (AI & Search) - Slack

Salesforce Hybrid California - San Francisco

Posted 8 days ago

Company culture: Inclusive & Diverse, Rise from Within, Mission Driven, Diversity of Opinions, Work/Life Harmony, Feedback Forward, Take Risks, Collaboration over Competition

Benefits & perks: Medical Insurance, Dental Insurance, Vision Insurance, Paid Time-Off, Maternity Leave, Paternity Leave, Mental Health Resources, Life Insurance, Disability Insurance, Health Savings Account (HSA), Flexible Spending Account (FSA), Employee Resource Groups

Lead Slack's search and AI platform as VP Product to set strategy, drive model and infrastructure decisions, and deliver reliable, scalable AI-powered search and knowledge services for enterprise users.

Founding Forward Deployed Engineer

Artificial Intelligence Underwriting Company Hybrid San Francisco

Posted 8 days ago

Join an early-stage AI safety startup as a founding Forward Deployed Engineer to design rigorous AI evals, lead customer implementations, and shape product strategy for certification of real-world AI agents.

Product Analyst - Generative AI Platform

Visa Hybrid Austin, TX

Posted 8 days ago

Visa is hiring a Product Analyst to define and scale generative AI platform capabilities, combining product analytics, prototyping, and cross-functional collaboration to deliver responsible, enterprise-grade AI solutions.

AI Engineering Intern

Colibri Group Hybrid 1 Remote

Posted 9 days ago

Colibri Group is hiring an AI Engineering Intern to help design and evaluate AI-driven educational tools, focusing on model behavior, alignment, and responsible AI practices under senior mentorship.

AI Engineer - Public Sector

Unstructured Technologies Inc. Hybrid No location specified

Posted 10 days ago

Unstructured is hiring an AI Engineer to architect and ship production-grade RAG and agentic systems that process messy multimodal data for high-impact government and military contracts.

Applied ML Engineer

Foxglove Hybrid San Francisco

Posted 11 days ago

Help scale production ML infrastructure and retrieval systems at Foxglove to enable high-performance semantic search and data mining over multimodal robotics data.

Generalist - English & Hindi

Weekday AI Hybrid No location specified

Posted 13 days ago

Contract opportunity to evaluate and improve LLM conversational responses in Hindi and English by performing fact-checking, annotation, and qualitative assessment.

Staff Software Engineer, Applied AI

Valence Hybrid San Francisco

Posted 13 days ago

Lead the design and production of LLM-driven coaching systems at Valence, applying deep ML and engineering expertise to build enterprise-grade, context-aware AI experiences.

Data Scientist

Crosby Hybrid New York City

Posted 18 days ago

Crosby AI is hiring a Data Scientist to develop NLP/LLM models, evaluation frameworks, and data strategies that power its AI-driven legal platform.

AI Architect

Cambium Learning Group Hybrid Remote

Posted 18 days ago

Lead the design and implementation of secure, scalable Generative AI and ML architectures for an EdTech organization focused on building production-ready RAG, retrieval, and MLOps solutions.

AI Developer Experience Engineer

Crosby Hybrid New York City

Posted 19 days ago

Build the internal tooling and evaluation infrastructure that empowers engineers and researchers to iterate quickly and reliably on Crosby’s LLM-powered legal platform.

Senior Software Engineer

Awesome Motive Hybrid Chicago

Posted 21 days ago

Experienced software engineers with strong system-design and ML/LLM experience are needed to build and productionize LLM-powered agents, evaluation pipelines, and scalable AI infrastructure at Permute.

Medical Image Analyst - AI Trainer

Handshake Hybrid Remote

Posted 22 days ago

Benefits & perks: Dental Insurance, Disability Insurance, Flexible Spending Account (FSA), Health Savings Account (HSA), Vision Insurance, Sabbatical, Paid Holidays

Handshake seeks experienced 3D Slicer users to remotely evaluate AI-generated medical imaging content and provide expert feedback on segmentation, DICOM workflows, and clinical research relevance.

AI Agent Engineer - San Francisco Only

TRM Labs Hybrid San Francisco

Posted 24 days ago

Work on TRM’s AI Engineering team to design and ship agentic LLM systems and scalable infrastructure that augment investigations and ensure safe, auditable behavior in high-sensitivity environments.

Decision Intelligence Analyst

Rwazi, Inc. Hybrid United States

Posted 26 days ago

Rwazi is hiring a Decision Intelligence Analyst to validate and improve AI-driven decision outputs by identifying failure modes, formalizing evaluation rubrics, and refining judgment frameworks.

Principal Software Engineer (AI) (US)

PointClickCare Hybrid Remote - US

Posted 26 days ago

Lead architecture and delivery of scalable, secure AI and agentic systems at PointClickCare to drive measurable clinical and operational outcomes across the platform.

AI Product Testing Engineer

Virtue AI Hybrid San Francisco

Posted 27 days ago

Virtue AI is seeking a hands-on Testing Engineer to lead product and backend QA, automate system testing, and perform model red-teaming for a cutting-edge AI security platform.

Software Engineer - Model Developer Ecosystem

Baseten Hybrid San Francisco

Posted 28 days ago

Help shape Baseten's model ecosystem by combining hands-on engineering, developer education, and product thinking to improve model discovery, evaluation, and adoption.

AI Research Engineer

TRM Labs Hybrid San Francisco

Posted 29 days ago

TRM Labs is hiring a Senior AI Research Engineer to drive model evaluation, fine-tuning, and production orchestration for large-scale LLM and ML systems that power blockchain intelligence.

How much do model evaluation jobs pay?

Salary range*    Jobs    Share
Below 50k        1       5%
50k-100k         2       10%
Over 100k        18      86%

*average yearly salary (USD)
