Let’s get started
By clicking ‘Next’, I agree to the Terms of Service
and Privacy Policy, and consent to receive emails from Rise
Jobs / Job page
Founding Platform & Reliability Engineer image - Rise Careers
Job details

Founding Platform & Reliability Engineer

Founding Platform & Reliability Engineer

🎨 About OpenArt

OpenArt is an AI Storytelling and Visual Creation Platform used by millions worldwide. We’re building the next generation of creative tools powered by cutting-edge AI, enabling anyone to create videos, visuals, characters, and stories with unprecedented speed and imagination. We believe the future of creativity is AI-native, and we're shaping that future.

🚀 Why Join OpenArt

  • Small team, massive surface area, senior engineers own real systems, not slices.

  • Ship at real scale, your work goes to millions of users, fast.

  • Founder-led engineering culture, both founders are technical and deeply involved in product and architecture.

  • AI-native product, you’ll design how cutting-edge AI models are exposed as real user experiences.

  • High ownership, low process, we value judgment, clarity, and speed over bureaucracy.

  • 7-10X growth in revenue for the past 2 years. Now you’ll play a critical role in helping the company scale to the next stage.

🎯 About the Role

We’re looking for a Founding Platform & Reliability Engineer who can own the design, scalability, and reliability of our entire infrastructure stack end-to-end, from high-level architecture decisions to hands-on implementation, observability, and cost optimization.

This is NOT a role for traditional operators or narrow DevOps specialists. You should be comfortable working across cloud infrastructure, distributed systems, backend services, and developer tooling, making pragmatic decisions that balance product velocity, system reliability, and cost efficiency—especially in a fast-evolving, AI-native environment.

You will work closely with the founders and product engineers to design and evolve the platform that powers OpenArt, shaping key decisions such as serverless vs. containerized architecture, multi-provider AI reliability, and scaling systems to millions of users—while acting as a force multiplier for the entire engineering team.

🛠 What You’ll Do

  • Define and operationalize SLOs/SLIs across critical user journeys (generation, editing, payments/credits, uploads, etc.), and use them to drive prioritization (including error budgets)

  • Participate in an on-call rotation and lead incident response improvements (alert quality, runbooks, escalation paths). Establish blameless postmortems and ensure action items are implemented.

  • Implement reliability patterns at external boundaries, and build mechanisms for per-vendor “health” measurement and routing/fallback policies

  • Stand up end-to-end observability: structured logs, metrics, traces, and dashboards that let engineers answer “what broke” and “why now” quickly.

  • Build deploy safety practices: automated rollbacks, canarying, feature-flag patterns, and reliable CI/CD gates.

  • Own the direction of our infrastructure architecture, including defining when serverless is the right approach versus when we should evolve toward containerized or more managed systems, and guiding the team through those transitions as we scale.

  • Build cost observability and cost-control primitives: per-request cost attribution, caching strategies, capacity planning, and budget alerts.

  • Act as a senior technical voice, influencing architecture, tooling, engineering best practices, and raising the overall engineering bar.

🧑‍💻 What We’re Looking For

Core Requirements

  • 5+ years building and operating production systems where reliability and scaling are core.

  • Strong software engineering skills (you can ship production code, not just configure tools).

  • Cloud-native experience (AWS or GCP), ideally with serverless/event-driven systems and at least one container path (Fargate/ECS/Cloud Run/Kubernetes).

  • Deep knowledge of observability practices: dashboards, alerting, distributed tracing, and incident response maturity.

  • Ability to design resilient interactions with external dependencies (timeouts, retries/backoff/jitter, circuit breakers, idempotency).

  • Can communicate tradeoffs to non-infra peers clearly

  • Ability to operate with ambiguity and define problems before solving them.

Nice to Have

  • Have designed an internal platform abstraction (e.g., API gateway / workflow engine / job orchestration) that enabled multiple product teams to ship faster with fewer incidents.

  • Have shipped concrete reliability outcomes: e.g., reduced MTTR, improved SLO attainment, lowered p95 latency, or reduced infra/unit costs

  • Prior startup experience or experience owning large surface-area features.

Tech Stack You’ll Work With

GCP, Cloud Run, Modal, Upstash, Sentry, Amplitude, Firebase, Redis, React / Next.js, Node.js, TypeScript, Python, etc.

💰 Compensation

  • Competitive base salary and bonus program

  • Equity - meaningful ownership in what you build

  • High autonomy, high growth environment

🌍 Work Setup

  • Bay Area preferred (hybrid allowed)

  • Visa sponsorship available

  • We’ll consider remote

Average salary estimate

$205000 / YEARLY (est.)
min
max
$170000K
$240000K

If an employer mentions a salary or salary range on their job, we display it as an "Employer Estimate". If a job has no salary data, Rise displays an estimate if available.

Similar Jobs
Photo of the Rise User
Pinterest Hybrid San Francisco, CA, US; Palo Alto, CA, US
Posted 49 minutes ago

Lead cross-team engineering to build scalable catalog, integration, and AI-native merchant systems that improve onboarding, catalog health, and merchant growth at Pinterest.

Photo of the Rise User

Senior technical leader sought to shape LinkedIn’s core infrastructure strategy and lead cross-team initiatives across networking, storage, and messaging at massive scale.

Photo of the Rise User
PayPal Hybrid San Jose, California, United States of America
Posted 16 hours ago

Experienced backend-focused Staff Software Engineer needed to lead architecture and delivery of scalable Node.js/React services for PayPal's commerce platform.

Photo of the Rise User
Dental Insurance
Disability Insurance
Flexible Spending Account (FSA)
Health Savings Account (HSA)
Vision Insurance
Performance Bonus
Family Medical Leave
Paid Holidays

GoodLeap is hiring a Senior Full-Stack Software Engineer/Tech Lead to drive frontend-focused, full-stack initiatives and build scalable, AI-enabled finance platform features while mentoring teammates.

Posted 13 hours ago

Lead modernization and secure identity/access efforts for enterprise applications at M&T Bank, driving cloud migrations, containerization, and engineering best practices.

Photo of the Rise User
Parloa Hybrid Remotely in the USA
Posted 14 hours ago

Design and build AI‑enabled internal systems and integrations to scale Parloa’s Go‑To‑Market operations using TypeScript, Python, and modern AI tooling.

Photo of the Rise User

As a Senior Frontend Software Engineer on ActiveCampaign's DUX team, you will drive frontend architecture, build scalable design-system components, and improve the developer and user experience across a micro-frontend platform.

Photo of the Rise User
AVEVA Hybrid San Leandro, California, United States of America
Posted 13 hours ago

AVEVA is hiring a Distinguished AI Tech Lead to shape and operationalize frontier AI capabilities across industrial products, bridging advanced research and production delivery.

Photo of the Rise User

Design and deliver full-stack, production-grade AI agent features at Workday—building scalable front-end and backend solutions that simplify HR and finance workflows for millions of users.

Photo of the Rise User
Posted 18 hours ago

Lead design and implementation of manufacturing software and diagnostics to assure kinematic performance and safety for next-generation surgical robotic instruments at a market-leading medical robotics company.

Photo of the Rise User
Dental Insurance
Disability Insurance
Flexible Spending Account (FSA)
Health Savings Account (HSA)
Vision Insurance
Family Medical Leave
Paid Holidays

Lead the design and implementation of LaunchDarkly's statistically rigorous, warehouse-native experimentation platform—building engines for hypothesis testing, adaptive bandit allocation, and large-scale analysis across customer data warehouses.

Posted 6 hours ago

K2 Space is hiring a Senior Embedded Firmware Engineer to design, implement, and validate low-level firmware and bring-up for custom high-performance SoCs used in next-generation satellites.

Photo of the Rise User

Lead and mentor a software engineering team to design and deliver manufacturing software and tooling that enables production of next‑generation surgical robotics.

MATCH
Calculating your matching score...
FUNDING
SENIORITY LEVEL REQUIREMENT
TEAM SIZE
No info
HQ LOCATION
No info
EMPLOYMENT TYPE
Full-time, hybrid
DATE POSTED
March 26, 2026
Risa star 🔮 Hi, I'm Risa! Your AI
Career Copilot
Want to see a list of jobs tailored to
you, just ask me below!