Job details

Senior Software Engineer, AI Networking

NVIDIA seeks a senior software engineer to join the AI Networking co-design and benchmark R&D team. In this pivotal role, the candidate is responsible for building and productizing machine learning tools. These include tools that use ML-based combinatorial optimization and build space exploration (DSE) techniques. These tools will be employed to optimize AI workloads across large GPU and CPU clusters, thereby ensuring the most efficient and productive utilization of system resources at data center scale. The role involves working on distributed Deep Learning, particularly within LLM training and inference stacks. A strong passion for collective communication and networking is desirable. The candidate will interact with diverse hardware and platforms, such as Host Channel Adapters (HCAs), Switches, CPUs, GPUs, and complete Systems. Furthermore, the role requires engagement across multiple software layers, including LLM applications, machine learning frameworks, and communication and computing libraries. The candidate will develop tools and methodologies using Machine Learning (ML) for comprehensive performance analysis and optimization, potentially incorporating learning-based agentic techniques. This work involves deep-diving across the software stack, from LLM applications and ML frameworks down to communication and computing libraries. This position offers a distinct opportunity to make significant contributions to the core infrastructure powering the next generation of large-scale AI systems.

What you'll be doing:

Design and implement resource allocation and combinatorial optimization techniques (e.g., reinforcement learning, LLM agents for DSE, Bayesian optimization and other multi-objective optimization techniques) to optimize LLM models at datacenter scale.
Research, develop, and deploy AI/ML techniques to optimize large-scale Deep Learning (LLM) training and inference on NVIDIA supercomputers and distributed systems. This includes a focus on high-performance networking and NVIDIA communication libraries.
Build and productionize ML-based tools for performance prediction and optimization, with a strong emphasis on networking aspects.
Develop and deploy a scalable, reliable data curation pipeline capable of handling complex data types, such as time series and PyTorch model graphs, to effectively support the training of high-performance Machine Learning models.
Collaborate across hardware and software teams to deliver valuable performance analysis insights.
Lead performance test planning, establish performance targets for new technologies and solutions, and drive efforts to achieve those performance goals.

What we need to see:

PhD or Master's degree in Computer Science, Software Engineering, or equivalent experience.
4+ years of experience applying machine learning techniques to computer architecture and system optimization problems. Desired experience involves leveraging ML at the intersection of at least two of the following areas: HPC, networking, and AI applications.
Hands-on experience developing and deploying various learning algorithms (e.g., reinforcement learning, offline RL, supervised learning) to tackle optimization challenges within computer architecture, system design, or networking domains.
Proficiency in building and using ML models with leading frameworks such as PyTorch or TensorFlow, or JAX.
Proven ability to apply GNNs/transformers-based optimization to PyTorch model graph and Kineto execution traces.
Expertise combining knowledge of NVIDIA GPUs, the CUDA library, and deep learning frameworks (TensorFlow/PyTorch) with networking concepts, including collective communication libraries (like NCCL) and protocols (such as RoCE and RDMA).
Strong programming capabilities in Python, Bash, and C++.
A collaborative teammate with effective communication and interpersonal abilities.

Ways to stand out from the crowd:

In-depth knowledge and experience with machine learning/reinforcement learning and frameworks.
Comprehensive understanding of computer architecture, system architecture and networking.
Extensive experience in applying machine learning techniques such as GNNs or related graph-based models.
Knowledge in PyTorch, CUDA, and NCCL libraries.
Proven software engineering/development skills

With competitive salaries and a comprehensive benefits package, NVIDIA is widely regarded as one of the most desirable technology employers in the world. Our teams are composed of some of the most forward‑thinking and driven engineers in the industry, and we continue to grow rapidly. If you are a senior data engineer passionate about building large‑scale, high‑impact data platforms, we’d love to hear from you.

Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. The base salary range is 152,000 USD - 241,500 USD for Level 3, and 184,000 USD - 287,500 USD for Level 4.

You will also be eligible for equity and benefits.

Applications for this job will be accepted at least until April 10, 2026.

This posting is for an existing vacancy.

NVIDIA uses AI tools in its recruiting processes.

NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.

Senior Software Engineer LLM Distributed Training Performance Engineering GPU CUDA NCCL PyTorch TensorFlow JAX GNN Transformers Reinforcement Learning Bayesian Optimization Kineto RDMA RoCE Datacenter C++ Python

NVIDIA Glassdoor Company Review

4.6

NVIDIA DE&I Review

No rating

CEO of NVIDIA

Jensen Huang

Approve of CEO

Average salary estimate

$219750 / YEARLY (est.)

min

max

$152000K

$287500K

If an employer mentions a salary or salary range on their job, we display it as an "Employer Estimate". If a job has no salary data, Rise displays an estimate if available.

Similar Jobs

Senior Software Architect, DriveOS

NVIDIA Hybrid US, CA, Santa Clara

VIEW

Posted 18 hours ago

Customer-Centric

Mission Driven

Inclusive & Diverse

Rise from Within

Diversity of Opinions

Work/Life Harmony

Growth & Learning

Transparent & Candid

Medical Insurance

Paid Time-Off

Maternity Leave

Mental Health Resources

Equity

Child Care stipend

Paternity Leave

WFH Reimbursements

Flex-Friendly

Dental Insurance

Vision Insurance

Life insurance

Health Savings Account (HSA)

Flexible Spending Account (FSA)

401K Matching

Military leave

Lead DriveOS architecture at NVIDIA to design integrated, safety- and security-certified system software solutions for self-driving vehicles and other regulated intelligent machines.

Senior Staff Business Systems Analyst

NVIDIA Hybrid US, CA, Santa Clara

VIEW

Posted 14 hours ago

Customer-Centric

Mission Driven

Inclusive & Diverse

Rise from Within

Diversity of Opinions

Work/Life Harmony

Growth & Learning

Transparent & Candid

Medical Insurance

Paid Time-Off

Maternity Leave

Mental Health Resources

Equity

Child Care stipend

Paternity Leave

WFH Reimbursements

Flex-Friendly

Dental Insurance

Vision Insurance

Life insurance

Health Savings Account (HSA)

Flexible Spending Account (FSA)

401K Matching

Military leave

Lead strategic, large-scale business systems initiatives at NVIDIA, shaping architecture and process modernization to support global growth and AI-driven enterprise solutions.

Staff Software Engineer, QualityOS

Anduril Industries Hybrid Costa Mesa, California, United States

VIEW

Posted 23 hours ago

Lead the technical architecture and implementation of distributed systems for QualityOS, driving reliability, scalability, and cross-team alignment for Anduril’s manufacturing execution platform.

Senior Staff Software Engineer - Agentic Automation

NVIDIA Hybrid US, CA, Santa Clara

VIEW

Posted 1 hour ago

Customer-Centric

Mission Driven

Inclusive & Diverse

Rise from Within

Diversity of Opinions

Work/Life Harmony

Growth & Learning

Transparent & Candid

Medical Insurance

Paid Time-Off

Maternity Leave

Mental Health Resources

Equity

Child Care stipend

Paternity Leave

WFH Reimbursements

Flex-Friendly

Dental Insurance

Vision Insurance

Life insurance

Health Savings Account (HSA)

Flexible Spending Account (FSA)

401K Matching

Military leave

NVIDIA is hiring a Senior Staff Software Engineer to design agentic AI automation and build integrations to transform enterprise IT operations and prevent problems at scale.

Full Stack Developer, Senior

Bah Hybrid Fort Meade, MD

VIEW

Posted 30 minutes ago

Lead a small cross-functional team to build production-ready full stack solutions, advanced visualizations, and data-driven systems for Booz Allen’s clients requiring TS/SCI with polygraph.

Senior Software Engineer I, L3 Applications Team

CareMessage Hybrid Remote - USA

VIEW

Posted 9 hours ago

CareMessage is hiring a Senior Software Engineer I (L3) to lead full‑stack development and technical direction for its Rails + React core application serving safety‑net clinics.

Software Engineer, Backend (New-York)

Mistral AI Hybrid New York, NY

VIEW

Posted 19 hours ago

Mistral AI is hiring a Backend Engineer in New York to build scalable, high-performance backend services and APIs for its enterprise AI platform and consumer-facing products.

Sr. Software Engineer - REMOTE

Jobgether Hybrid Minnesota

VIEW

Posted 18 hours ago

Build and maintain scalable backend and full-stack features for an industry-leading solar design and sales platform as a senior engineer on a fully remote team.

Associate Applications Developer

osu Hybrid Columbus Campus

VIEW

Posted 21 hours ago

The SAI Research Lab is hiring an Associate Applications Developer to build and maintain research-grade software using C, C++, and Python in Linux-based, edge-to-cloud and AI-enabled environments.

SRE Architect

QODE Hybrid No location specified

VIEW

Posted 8 hours ago

Lead the design and scaling of enterprise-grade, reliable cloud platforms as an SRE Architect working with cross-functional teams in a hybrid Austin, TX environment.

Senior Software Engineer, DevOps and Infrastructure

Axle Health Hybrid Santa Monica

VIEW

Posted 10 hours ago

Lead the design and operation of Axle Health's secure, scalable AWS infrastructure and CI/CD pipelines to support enterprise-grade, HIPAA-compliant in-home healthcare software.

Platform Engineer – Senior Tech (Platform)

Jobgether Hybrid Germany

VIEW

Posted 19 hours ago

Senior Platform Engineer sought to design and build scalable, API-first backend systems and configuration-driven tooling that power progression, rewards, leaderboards and account services for high-scale game experiences.

Principal Engineer, Software Architect (R4562)

Shield AI Hybrid Dallas, Texas

VIEW

Posted 21 hours ago

Lead the software architecture for Shield AI’s XBAT program, defining safe, secure, and scalable system designs that enable high‑assurance airborne and ground software development.

Senior Software Architect, DriveOS

NVIDIA Hybrid US, CA, Santa Clara

VIEW

Posted 18 hours ago

Customer-Centric

Mission Driven

Inclusive & Diverse

Rise from Within

Diversity of Opinions

Work/Life Harmony

Growth & Learning

Transparent & Candid

Medical Insurance

Paid Time-Off

Maternity Leave

Mental Health Resources

Equity

Child Care stipend

Paternity Leave

WFH Reimbursements

Flex-Friendly

Dental Insurance

Vision Insurance

Life insurance

Health Savings Account (HSA)

Flexible Spending Account (FSA)

401K Matching

Military leave

Lead DriveOS architecture at NVIDIA to design integrated, safety- and security-certified system software solutions for self-driving vehicles and other regulated intelligent machines.

Cloud Software Engineer

Bah Hybrid Fort Meade, MD

VIEW

Posted 9 hours ago

Lead full-stack cloud software development at Booz Allen’s Fort Meade program, building production-ready data-driven systems and mentoring an engineering team under TS/SCI clearance.

NVIDIA

NVIDIA is a publicly traded, multinational technology company headquartered in Santa Clara, California. NVIDIA's invention of the GPU in 1999 sparked the growth of the PC gaming market, redefined computer graphics, and ignited the era of modern AI.

67 jobs

MATCH

Calculating your matching score...

BADGES