NVIDIA Dynamo is an innovative, open-source platform focused on efficient, scalable inference for large language and reasoning models in distributed GPU environments. By bringing to bear sophisticated techniques in serving architecture, GPU resource management, and intelligent request handling, Dynamo achieves high-performance AI inference for demanding applications. Our team is addressing the most challenging issues in distributed AI infrastructure, and we’re searching for engineers enthusiastic about building the next generation of scalable AI systems.

As an Applied AI Research Software Engineering intern on the Dynamo project, you will address some of the most sophisticated and high-impact challenges in distributed inference, including the Dynamo Kubernetes serving platform, disaggregated serving, dynamic GPU scheduling, intelligent routing, and distributed KV cache management.

What you'll be doing:

Collaborate on the design and development of the Dynamo Kubernetes stack.
Introduce new features to the Dynamo Python SDK and Dynamo Rust Runtime Core Library; design, implement, and optimize distributed inference components in Rust and Python.
Contribute to the development of disaggregated serving for Dynamo-supported inference engines (vLLM, SGLang, TRT-LLM, llama.cpp, mistral.rs).
Improve intelligent routing and KV-cache management subsystems.
Contribute to open-source repositories, participate in code reviews, and assist with issue triage on GitHub. Work closely with the community to address issues, capture feedback, and evolve the framework’s APIs and architecture.

What We Need To See:

Pursuing a Bachelor's or Master's degree in Computer Science or a related field
Excellent Golang, Rust, and/or Python programming and software design skills, including debugging, performance and service health analysis, and test design
Good understanding of algorithms and data structures, and solid knowledge of RESTful APIs
Highly motivated, dedicated, and curious about new technologies. You take pride in your work, strive to achieve incredible results, and possess excellent communication, planning, and problem-solving skills.

Ways To Stand Out From The Crowd:

Understanding of machine learning or NLP concepts
Experience in software shipping cycles (dev, deploy, release, CI) and open-source software development
Experience working with inference engines such as vLLM, SGLang, TensorRT-LLM, and similar
Experience building and deploying containers in Kubernetes environments

NVIDIA is widely considered to be one of the technology world’s most desirable employers. We have some of the most forward-thinking and hardworking people in the world working for us. If you're creative and autonomous, we want to hear from you!

The hourly rate for our interns is 18 USD - 71 USD. Our internship hourly rates are standard pay rates determined by the position and your location, year in school, degree, and experience.

You will also be eligible for Intern benefits. NVIDIA accepts applications on an ongoing basis.

NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.

Confirmed 19 hours ago. Posted 3 days ago.

Applied AI Research Engineering Intern - Fall 2025

NVIDIA
