Browse 49 Data & Evaluation jobs hiring now. Companies hiring include Foster America, Iambic Therapeutics, Inc, and Mercor, with roles in Laredo, Tampa, and Virginia Beach.
Serve as Foster America's South Carolina Site Lead to coordinate partners, drive implementation of the OPT-In initiative, and translate learning into sustained local impact for families.
Iambic Therapeutics seeks a Software Engineer II to co-develop and harden ML training, evaluation, and productization workflows that enable AI-driven drug discovery.
Lead and grow an Applied AI engineering team at Mercor to build scalable evaluation and data systems that measurably improve frontier model performance.
Experienced technical product leader needed to own prioritization, quality, and stakeholder alignment for LLM-driven products while staying hands-on with architecture, code reviews, and AI cost optimization.
Welo Data is building a flexible, remote contributor network of native English speakers to annotate, evaluate, and create prompts that improve AI systems.
Lead and develop a remote evaluation team in WGU’s School of Technology to ensure accurate, scalable competency-based assessment and continuous improvement for Electrical and Computer Engineering programs.
Epoch AI is hiring remote Researchers and Senior Researchers to conduct data-driven investigations, build benchmarks, and forecast AI capabilities and trends.
Visa is hiring a Product Analyst to define and scale generative AI platform capabilities, combining product analytics, prototyping, and cross-functional collaboration to deliver responsible, enterprise-grade AI solutions.
Colibri Group is hiring an AI Engineering Intern to help design and evaluate AI-driven educational tools, focusing on model behavior, alignment, and responsible AI practices under senior mentorship.
Experienced analytics professional needed to perform human capital program evaluations and deliver data-driven reporting and dashboards in support of federal HR modernization efforts.
BryceTech seeks an experienced Data Analyst to support DHS intelligence performance programs by turning complex data into actionable insights, reports, and training materials.
KBR is seeking an experienced DoD Technical Writer with an active Secret clearance to create and maintain documentation for Big Data systems and DoD test-range programs.
A technical, hands-on Senior Application Analyst role to manage and automate third-party business applications integrated with Salesforce at a fast-growing health-tech company.
Serve as a Workforce Research Analyst supporting human capital analytics and workforce planning for federal clients, leveraging data analysis, program evaluation, and executive-level briefing materials to inform HR modernization.
ACS' Family Services Division is hiring a Program Strategy & Data Undergraduate Intern to support data management, develop reports and process maps, and assist with event coordination during the summer internship program.
TRI's Future Factory team is hiring a Senior Research Engineer to design scalable training/evaluation infrastructure and high-performance geometry and physics-aware tooling that translate research into production-grade systems.
Contract opportunity to evaluate and improve LLM conversational responses in Hindi and English by performing fact-checking, annotation, and qualitative assessment.
Lead the design and production of LLM-driven coaching systems at Valence, applying deep ML and engineering expertise to build enterprise-grade, context-aware AI experiences.
LinkedIn seeks a Hybrid Machine Learning Engineer to build and deploy scalable relevance and evaluation models for recommender systems and generative/NLP-driven product features.
Guidehouse is hiring a Logistics IT and Acquisitions Consultant to lead contract development, supply-chain data strategy, and stakeholder coordination for a DoD weapon-system program office.
Drive content valuation and international sales forecasting as Manager, Sales Optimization to support greenlight decisions and sales performance tracking across NBCUniversal’s global TV portfolio.
Crosby AI is hiring a Data Scientist to develop NLP/LLM models, evaluation frameworks, and data strategies that power its AI-driven legal platform.
Welocalize is hiring a Generative AI Analyst to craft prompts, annotate and evaluate LLM outputs, and lead labeling workflows in a remote, full-time role.
Lead the design and implementation of secure, scalable Generative AI and ML architectures for an EdTech organization focused on building production-ready RAG, retrieval, and MLOps solutions.
WestEd seeks a Research Associate to support the SEPP team with quantitative educational research, project coordination, and report and proposal production to advance outcomes for students with disabilities.
WestEd seeks a Research Associate in Teaching and Learning to support special education and learner variability research through proposal writing, project coordination, and mixed-methods data analysis.
Build the internal tooling and evaluation infrastructure that empowers engineers and researchers to iterate quickly and reliably on Crosby’s LLM-powered legal platform.
Experienced electrical/traffic engineer to evaluate traffic signal equipment, produce technical analyses, and support the Traffic Electronics Center for statewide highway projects.
Lead and grow a cross-functional Evaluation team to design pipelines, tools, and metrics that characterize autonomy performance and support safe development of Waabi's self-driving systems.
Constellation Schools seeks an experienced, instructional-focused Principal to lead a Cleveland campus, improve student achievement, and build a strong school culture for 2026–2027.
Serve as a field-based Outreach Representative for the NYC Health Department, delivering educational detailing visits to health care practices to improve chronic disease prevention and clinical systems.
Lead the design and evaluation of agentic LLM systems that power a fintech's financial intelligence platform, ensuring correctness, scalability, and production reliability.
The Evaluation Associate will support bureau-wide monitoring, evaluation, data management, and analysis efforts using tools like R, SAS, and SQL to inform behavioral health programs and reporting.
Khan Academy is hiring a Senior AI Engineer (24-month fixed-term) to lead integration, evaluation, and quality improvements of generative AI features that support learning at scale.
Work on evaluation infrastructure and tooling to measure and analyze autonomy system performance at scale for a fast-growing Physical AI startup.
Lean In and the Sandberg Bernthal Family Foundation are hiring a Data Scientist to turn data into actionable insights that shape products, programs, and community impact for millions of users.
Lead HSH’s Planning, Performance, and Strategy team to develop and implement equity-driven strategic plans, evaluations, and cross-department initiatives that advance the City’s efforts to prevent and end homelessness.
Support visitor research and program evaluation at the Exploratorium by conducting interviews, managing audio/video data, and assisting with data entry and coding on an on-call basis.
Lead the AI product portfolio for marketing to turn enterprise AI strategy into a cohesive MarTech roadmap, measurable productivity gains, and durable automation at scale.
aiEDU is hiring a Senior Lead, Research & Evaluation to design and run impact measurement, lead research strategy, and build data systems that inform program decisions across the organization.
The OMB Community Development Unit is hiring a Summer Graduate Intern to support CDBG-DR compliance, research federal requirements, and manage program data for resiliency initiatives.
The NYC Health Department's Harm Reduction Unit is hiring a Data Associate to manage datasets, produce reports and dashboards, and support evaluation and quality-improvement for alcohol and drug use programs.
Associate, Social Impact at Lever for Change: support network programming, storytelling, small awards administration, and impact measurement for a global nonprofit network.
Handshake seeks doctoral-level biology experts to review and critique AI-generated biological content on a flexible, remote, hourly contract basis.
Handshake seeks Chemistry PhDs for flexible, remote hourly contracts to evaluate and improve AI-generated chemistry content and reasoning.
Lead BADU’s Community Initiatives team to manage harm reduction program evaluation, community-facing operations, and cross-agency collaboration to improve syringe service programs and related public health outcomes in NYC.
Work as a bilingual English–Mandarin evaluator to assess, fact-check, and annotate LLM responses for a remote contract role covering Taiwan, Malaysia, and the USA.
Project Lion seeks a US-based Prompt Engineer to drive template-to-autorater migrations, optimize prompts using APG/APO tooling, and validate autorater quality versus human baselines.
Macmillan Audio is hiring an entry-level Editorial Assistant to support editorial operations, research audiobook opportunities, and help manage launch and licensing workflows for a leading publishing group.
Below 50k*: 0 | 50k-100k*: 0 | Over 100k*: 2