AI Engineers at Varick own the intelligence layer. You design, build, and optimize the agent systems that run inside enterprise operations — processing thousands of transactions, making classification decisions, routing exceptions, and learning from human feedback.
This role is for engineers with deep, hands-on experience in LLMs, agent architectures, and evaluation systems. You’ve built agentic workflows that run in production, not just demos. You understand prompt engineering, retrieval, tool calling, multi-agent orchestration, and the evaluation infrastructure required to ship AI systems that enterprises trust.
• Design and build agent architectures for complex enterprise workflows (multi-step reasoning, tool calling, exception handling)
• Build and maintain evaluation systems for agent quality, accuracy, safety, and groundedness
• Design prompt systems, retrieval pipelines, and context engineering strategies for reliable agent behavior
• Build the feedback loops that allow agents to learn from human corrections and improve over time
• Optimize inference cost and latency for production workloads
• Define best practices for agent reliability, observability, and governance
• Stay current with the latest models, frameworks, and research — and ship what matters into production
• 3+ years of software engineering with at least 1–2 years focused on LLM applications or AI systems in production
• Hands-on experience building agentic workflows with tool calling, retrieval, and multi-step reasoning
• Deep understanding of prompt engineering, context engineering, and how to get reliable behavior from LLMs
• Experience building evaluation and quality systems for AI outputs
• Strong Python skills and backend engineering fundamentals
• You’ve shipped AI features to real users and dealt with the messy parts: hallucinations, edge cases, accuracy degradation, cost management
• Based in San Francisco
• Agent frameworks: LangGraph, CrewAI, Claude Code/Codex patterns, or custom orchestration
• Retrieval systems: vector databases (Qdrant, pgvector, Pinecone), reranking, hybrid search
• MCP, tool-calling protocols, and third-party API integrations
• Fine-tuning, LoRA, or other model adaptation methods
• Evaluation frameworks and continuous quality monitoring
• Experience with enterprise AI deployments (compliance, audit trails, governance)
• Prior work at AI labs, AI-native startups, or applied ML teams
• Ship to production, not to demos. Every system you build runs inside real enterprise operations. 100% deployment rate.
• Early enough to shape everything. Your work defines the product, the platform, and the company.
• Compounding impact. Every client deployment feeds the pattern library and makes the next one faster. You’re building leverage, not doing the same thing twice.
• Work with operators, not committees. You talk directly to the people who run the business — CFOs, COOs, ops leads — not procurement layers.
We are an equal opportunity employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability, or protected veteran status. Employment is subject to a standard confidentiality and non-disclosure agreement.