Senior Backend Engineer
LiteLLM is the world's most popular AI Gateway, trusted by top companies like Adobe, Netflix, and NASA. Our platform empowers developers by providing secure, reliable access to LLMs and adjacent services, and we're looking for a Senior Backend Engineer to help us build rock-solid guardrails and observability tooling at scale.
About The Role
You’ll focus on making our guardrails and logging world-class. You will own the backend code that ensures all guardrail calls are consistently logged, errors are surfaced to users (not silently swallowed), and our observability instrumentation holds up under real-world, high-volume traffic. Your attention to detail in areas like latency metrics, logging traceability, and backend guardrail registration will directly impact user trust in our security and compliance features.
Responsibilities
Build and scale our product, ensuring performance, reliability, and continuous improvement.
Ensure all guardrail and policy enforcement calls (e.g., applyguardrail) are properly logged and traceable through our SpendLogs and relevant database tables
Design and build CPU-level guardrails to cover common attacks on LLM APIs, MCP servers, and agents
Identify and fix areas where silent failures occur in guardrail creation, registration, and policy application—ensuring robust error handling and transparency to end users
Work with observability integrations, including Datadog, Splunk, Prometheus, and OpenTelemetry, to maintain accurate, configurable, and usable monitoring and logging for backend systems
Enhance observability integrations to handle 1B+ requests per month with minimal latency overhead and no memory leaks (e.g. due to high cardinality of Prometheus metrics)
Collaborate cross-functionally on backend engineering priorities (performance, reliability, security)
What We’re Looking For
Bachelor’s or Master’s in Computer Science or related field
4+ years of experience with Python and backend frameworks (e.g. FastAPI, Flask)
Understanding of logging best practices, error handling, and secure backend development
Exposure to monitoring, logging, or metrics platforms (Datadog, Splunk, Prometheus, OpenTelemetry)
Familiarity with database integration and troubleshooting (PostgreSQL, Redis, etc.)
Driven to deliver high-quality backend code with strong guardrails, auditing, and debugging capabilities
Eagerness to tackle hard bugs and ensure system transparency for end users
Why Join LiteLLM?
High-impact, mission-critical work on the core of compliance and reliability
Contribute directly to features used by enterprise customers at global scale
Fast-paced growth environment with room for technical ownership
Competitive salary, health, dental, and vision benefits
About LiteLLM
LiteLLM (https://github.com/BerriAI/litellm) is a Python SDK and Proxy Server enabling seamless calls to 100+ LLM APIs in the OpenAI format, trusted by industry leaders worldwide.
Ready to shape the future of secure, observable AI infrastructure? Apply now!