We’re seeking an Agent Engineer to design and build agentic features in our platform, including document understanding, advanced RAG, and customer support automation. In this role, you will develop not only the agent components themselves, but also the Friendli Agent API, which serves as the core developer interface for building and extending agent applications. You will also build agent applications as production-ready examples of how agents can solve real-world problems.
These applications will be primarily written in Python and will serve as reference implementations for our customers and community. We are looking for a hands-on engineer who is passionate about building agent systems and making AI easy for developers to adopt. The ideal candidate is comfortable creating agent applications that showcase what is possible, is curious about and experienced with open-source models, and enjoys turning them into reliable, high-impact features.
Design, build, and maintain agent APIs and applications that deliver document understanding and other high-value features
Evaluate and integrate open-source models to power production-ready agent features where possible
Develop reference agent applications to showcase workflows and accelerate customer adoption
Collaborate with backend and infrastructure teams to integrate agents with deployment, orchestration, and monitoring systems
Ensure APIs are robust, developer-friendly, and enterprise-ready through strong design principles and documentation
Continuously improve the reliability, scalability, and performance of agent features in production
3+ years of experience in software engineering, preferably in backend, ML systems, or API development
Bachelor’s or Master's degree in Computer Science, Computer Engineering, or equivalent
Strong programming skills in Python; experience with various Python frameworks
Solid understanding of LLM workflows, agent patterns, or tool invocation systems
Experience designing and delivering production APIs
Familiarity with open-source LLMs and multimodal models (HuggingFace, LangChain, LlamaIndex, etc.)
Strong foundations in cloud-native development
Experience with document understanding pipelines (e.g., OCR, RAG, summarization, structured extraction)
Familiarity with Kubernetes or container orchestration in production
Built or contributed to agent frameworks, SDKs, or CLIs
Have worked in a startup or fast-paced environments with ownership and ambiguity
Passion for developer experience and enabling AI adoption
Flexible working hours
Daily lunch and dinner provided; unlimited snacks and beverages
Supportive and highly collaborative work environment
Health check-up support and top-tier equipment/hardware support
A front-row seat to the generative AI infrastructure revolution
Competitive compensation, startup equity, health insurance, and other benefits.
FriendliAI is building the world’s best AI inference platform that makes large language and multi-modal models fast, efficient, and deployable at scale. We power high-throughput, low-latency AI workloads for organizations worldwide and integrate directly with Hugging Face, giving developers instant access to over 500,000 open-source models.
We are a small, fast-moving team doing work that matters at one of the most exciting moments in the history of technology. With our world-class inference engine, we are building a platform that the AI industry can actually rely on.
If an employer mentions a salary or salary range on their job, we display it as an "Employer Estimate". If a job has no salary data, Rise displays an estimate if available.
Help architect and operate FriendliAI’s enterprise inference platform as a Senior Backend Engineer focused on APIs, multi-tenant SaaS features, and data/system reliability at scale.
Hands-on internship building real-time 3D interfaces and visualizations for industrial equipment with Hypertherm’s Kent software team.
Lead the software architecture for Shield AI’s XBAT program, defining safe, secure, and scalable system designs that enable high‑assurance airborne and ground software development.
Help build the Console control plane at Poolside — the operational backbone that powers secure, enterprise deployments of cutting-edge AI.
Lead the design, automation, and security of Novo's cloud infrastructure and developer platform as a senior individual contributor driving reliability and velocity.
Lead the engineering organization for Skylar AI—shaping platform architecture, teams, and processes to deliver scalable, responsible AI for enterprise IT operations.
Edgesource is hiring an RPA Developer SME I to lead design and deployment of scalable RPA solutions and establish best practices for federal and commercial automation programs.
Help architect and operate FriendliAI’s enterprise inference platform as a Senior Backend Engineer focused on APIs, multi-tenant SaaS features, and data/system reliability at scale.
Ironclad is seeking a hands-on Technical Engineering Manager to lead and grow an engineering team building AI-powered metadata extraction agents for enterprise contract workflows.
Work on the Client Infrastructure team to improve performance, reliability, and architecture of Superhuman's desktop email client using React, TypeScript, and Electron.
Hudu is hiring an experienced DevOps Engineer to operate and optimize its Rails-based SaaS infrastructure on AWS and Kubernetes, focusing on reliability, security, and performance.
Experienced backend-focused full‑stack engineer wanted to lead and architect critical AI-native security features for Abnormal AI’s Adaptive Classifications Team.
Lead a remote engineering team at Tremendous to own technical quality, coach engineers, and drive product outcomes at a profitable company that sends payouts globally.
Work on core cloud platform infrastructure and analytics at UiPath, building backend systems, Kubernetes-based primitives, and production-grade observability to power AI-driven automation at scale.