As vCluster’s AI Infrastructure Specialist, you will work directly with customers at the earliest and most critical stage of their journey: from bare metal GPU nodes through to a production-ready deployment. This is not a traditional professional services role; you operate pre-sale as part of a proof of value engagement scoped to reach production. You will be one of the first team members a neocloud or AI Factory engages with at a technical depth, and the playbooks you develop will scale the motion for the next hire and customer.
vCluster is gaining rapid traction with GPU AI Clouds and enterprises building AI Factories: organizations that need to offer Kubernetes as a managed service on bare metal GPU infrastructure, and need to do it fast. This role exists to make that happen.
As an AI Infrastructure Engineer, your role will include:
Lead Technical Deployments: Drive end-to-end technical deployments for GPU neocloud and AI Factory customers, from initial bare metal configuration to a validated vCluster environment.
Infrastructure Optimization: Configure and troubleshoot bare metal GPU node infrastructure, including CNI configuration, GPU Operator setup, distributed storage backends, and RDMA/InfiniBand.
Validation: Deploy and validate Kubernetes and vCluster to provide GPU-powered managed K8s.
Knowledge Transfer: Work alongside customer teams to build self-sufficiency, ensuring they can operate and grow the platform independently.
Scaling through Documentation: Document reusable playbooks and deployment architectures so your learnings become the next customer's head start.
Feedback Loop: Collaborate with Engineering and Product to surface recurring infrastructure challenges, acting as a direct feedback loop from the field into the roadmap.
Strategic Partnering: Join Sales in the pre-sales process where deep infrastructure work is required to achieve a meaningful proof of value.
This role could be a fit for you if you bring:
Production K8s Mastery: 5+ years of experience deploying and operating Kubernetes in production, ideally on bare metal or in high-complexity environments.
GPU Fluency: Practical knowledge of NVIDIA GPU Operators, CUDA tooling, and systems-level configuration for GPU nodes.
Networking Fundamentals: Deep understanding of CNI plugins, overlay networks, load balancing, and connectivity diagnosis in layered environments.
Storage Expertise: Experience with persistent volume configuration, CSI drivers, and distributed systems like Ceph, Rook, Weka, or Longhorn.
Operational Agility: Comfort operating in ambiguous, fast-moving environments where you are often writing the playbook in real time.
Modern Tech Mindset: You thrive in environments that reject legacy tech and prefer a modern stack where you can solve a variety of problems from pipelines to internal services.
Automation Skills: Experience writing automation scripts with Bash, Python, or Go.
Kubernetes Depth: Relevant certifications such as CKA (Certified Kubernetes Administrator) or experience writing Kubernetes Operators.
AI/ML Familiarity: Experience with inference serving, GPU scheduling, and the tooling around LLM deployment.
Documentation: Experience building AI Automation in documentation to contribute to a shared knowledge base.
We are a venture-backed tech startup striving to be the leading force in enabling platform engineers. We raised +$30M from top-tier VCs such as Khosla Ventures (first investor in OpenAI, GitLab, Stripe, Doordash) and are in a hyper-growth phase looking for motivated people to complement our team. Our headquarters are in San Francisco (Salesforce Tower), but our team is distributed around the globe and we have a remote-first work culture.
We're the company behind vCluster, an open-source technology for virtualizing Kubernetes (+10k GitHub stars). Open source is part of our DNA.
The adoption of our commercial product based on vCluster has grown extremely fast (multi-million dollar revenue) and our customer base includes some of the biggest companies in the world, including 6 Global Fortune 500 companies as well as some of the fastest-growing tech unicorns.
Benefits
We offer the following benefits:
Competitive Salary: We offer a competitive compensation package, including equity.
Platinum-Level Insurance: Health, dental, vision, and life Insurance, including plans for you and eligible dependents (benefits vary depending on country).
Flexible Working Schedule: You have a doctor’s appointment or need to head to the supermarket to get groceries at 2pm? We won’t have an issue with that. To us, results matter more than clocking in and out at the same time every day.
Workplace Flexibility: We’re very flexible about where you work. We know things can change in life and we’re happy to adjust the work environment for you along the way.
At vCluster Labs, we value and stand for:
Open Source, Open Mind: We are actively contributing to and maintaining open-source projects. Internally, we foster meritocracy — the strongest ideas win, no matter who or where they come from.
Build Tomorrow’s Standards, Intentionally: We don't just ship software; we define the state-of-the-art of tomorrow. We are fearless in tearing down old approaches to build something better, but we are disciplined in how we do it because we know our users rely on our technology to run mission-critical infrastructure platforms.
Create Wow: We measure success by the experience we generate, both inside and outside the company. For our customers, this means impressive speed and intuitive experiences. For our team, this means going the extra mile to support one another and to continuously drive each other to new heights.
Own the Outcome: We understand that our responsibility doesn't end when a task is checked off; it ends when the value is delivered. We connect our daily individual actions to the broader success of the company and our customers.
If an employer mentions a salary or salary range on their job, we display it as an "Employer Estimate". If a job has no salary data, Rise displays an estimate if available.
MasterBrand Cabinets is hiring an Engineering Technician to support CNC operations, manage Microvellum nesting and tooling, and drive manufacturing improvements at the Arthur, IL plant.
Intel is hiring a Sr. Facilities Engineer to lead mechanical system ownership, reliability engineering, and cross-discipline coordination for critical data center and lab environments at its Mission Campus.
Experienced voice communications engineer needed to design, integrate, and sustain analog and digital voice systems for NASA operations at Kennedy Space Center.
Lead ODOT’s Bridge Preservation Unit to develop policy, direct statewide preservation programs, and manage staff and resources to keep Oregon’s bridges safe and functional.
AECOM is hiring a Hydromechanical Engineer to design and deliver large-scale mechanical systems for water infrastructure projects across the United States.
AECOM seeks experienced Project Engineers to provide design review, field engineering, and construction coordination for the California High‑Speed Rail program, starting in Sacramento and relocating to Fresno.
Olea Kiosks seeks a SolidWorks-savvy Design Engineer with sheet metal and manufacturing experience to develop production-ready kiosk enclosures at its Cerritos facility.
Support and maintain Intuitive’s surgical systems in the Dallas area by performing installation, diagnostics, repairs, and customer training on a weekend-focused field service shift.
Notre Dame is hiring a Nanofabrication Engineering Specialist to operate, maintain, and develop cleanroom processes while training and consulting with academic and external users.
Wabtec is hiring a Production Support Engineer in Erie, PA to troubleshoot build issues, maintain engineering documentation, and drive change management in support of manufacturing operations.
Tooling Engineer II to own design and fabrication of assembly, transport, inspection, and TPS tooling for flight hardware at a fast-paced Playa Vista aerospace startup.
Skyloom (an IonQ company) is hiring a hands-on Electrical Engineer to test and validate PCBAs and multi-board electronic subsystems for space-grade optical communications in Broomfield, CO.
Experienced mechanical design engineer needed to lead development and validation of small mechanisms and disposable instrument components for Intuitive's minimally invasive surgical systems.