Browse 6 exciting jobs hiring in Benchmarks now. Check out companies hiring such as Polymath, Epoch AI, Variance in Worcester, Fayetteville, Oxnard.
Polymath seeks a Software Engineer to build high-fidelity simulation environments, long-horizon tasks, and robust verifiers to benchmark and improve autonomous agents.
Epoch AI is hiring remote Researchers and Senior Researchers to conduct data-driven investigations, build benchmarks, and forecast AI capabilities and trends.
At Variance, you will design and implement domain-specific benchmarks and evaluation systems that reveal failure modes and drive improvements in ML and agent behavior for fraud, identity, and risk workflows.
Handshake is hiring an ML Research Scientist to drive open scientific research, create public benchmarks, and collaborate with top AI labs to advance data and evaluation methods for frontier models.
Lead the market definition and GTM for Deepgram's TTS offering, shaping narrative, launches, and sales enablement to make Deepgram the default for teams building voice agents.
Mercor is hiring a Product Marketing Manager to own positioning, messaging, and launches for APEX and establish the company’s product marketing practice for technical AI researchers and buyers.