Senior Software Engineer, DGX Cloud Lepton Marketplace

NVIDIA

Education
Benefits
Skills

One of DGX Cloud’s top priorities is to build a two-side marketplace, connecting AI start-ups and ISVs to NCPs and CSPs via the DGX Cloud Lepton opinionated PaaS platform. This is an exciting mission that supports NVIDIA’s larger push to grow the AI ecosystem and support sovereign AI buildouts around the world. We are looking to expand this team to accelerate the buildout and integration of NCPs and CSPs into this marketplace.

As a software engineer on the DGX Cloud Lepton Marketplace team, you’ll play a key role in building integrations between NVIDIA Cloud Partners (NCPs) and our platform, enabling seamless access to GPU-optimized virtual machines for developers worldwide. We expect you to have significant software engineering experience with kubernetes including cluster operations, operator development, node health monitoring and working with GPU resource scheduling.

What you will be doing:

  • Be part of a DGX Cloud Lepton team responsible for developing the two-side marketplace, including integration of compute providers and developing discovery and bidding experiences to match supply with demand.
  • Design and implement IaaS API integrations, including collaborating with external engineering teams to ensure reliable, scalable, and consistent connectivity across a diverse set of cloud environments
  • Shape integration strategies, develop stateful workflow orchestration, and drive improvements in testing, observability, and automation to ensure high quality, fault-tolerant solutions

What we need to see:

  • 12+ years of experience in developing software infrastructure for large scale AI systems.
  • Direct experience in a software engineering role within a highly technical organization with demonstrable impact from your work. Software development experience with kubernetes APIs and frameworks.
  • Familiarity with setting up cloud infrastructure environments (VMaaS, VPCs, RDMA, shared file-systems)
  • Proven track record with 3rd party API integrations: communicating with external teams, writing API clients, and improving integration reliability
  • Comfortable working in a fast-paced environment and collaborating with external engineering teams to test and debug integrations
  • Technical knowledge, including a systems programming language (strong preference for experience writing production code in Go) and a solid understanding of software design patterns for stateful workflow orchestration
  • BS in Computer Science, Engineering, Physics, Mathematics or a comparable Degree or equivalent experience.
  • 2+ years in similar role and experience on large-scale production systems. Experience with common software engineering principles, tools and techniques.

If you are excited about deepening your experience in cloud infrastructure, kubernetes, distributed systems, and API development and love working in dynamic, fast-moving teams, please apply!

The base salary range is 224,000 USD - 425,500 USD. Your base salary will be determined based on your location, experience, and the pay of employees in similar positions.

You will also be eligible for equity and benefits. NVIDIA accepts applications on an ongoing basis.

NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.

Read Full Description
Confirmed 18 hours ago. Posted a day ago.

Discover Similar Jobs

Suggested Articles