Senior Solutions Architect, HPC and Generative AI Deployment

NVIDIA

NVIDIA is seeking outstanding Solutions Architects to assist and support customers that are building solutions with our newest High Performance Computing (HPC) and Artificial Intelligence (AI) technologies. At NVIDIA, our solutions architects work across different teams and enjoy helping customers with the latest Accelerated Computing and Deep Learning software and hardware platforms. We're looking to grow our company, and build our teams with the smartest people in the world. Would you like to join us at the forefront of technological advancement? You will become a trusted technical advisor with our customers and work on exciting projects focused on HPC and GenAI. You will also collaborate with a diverse set of scientific researchers and developers at Universities and Research Institutions. You should be comfortable working in a dynamic environment, and have experience with HPC, GenAI and GPU technologies. This role is an excellent opportunity to work in an interdisciplinary team at NVIDIA!

What You Will Be Doing

  • Partnering with other solution architects, engineering, product and business teams. Understanding their strategies and technical needs and helping define high-value solutions
  • Dynamically engaging with developers, scientific researchers, and data scientists, gaining experience across a range of technical areas
  • Strategically partnering with lighthouse customers and researchers to help them adopt and build creative solutions using NVIDIA technology
  • Analyzing performance and power efficiency of AI workloads on Kubernetes
  • Some travel to conferences and customers is required for this role

What We Need To See

  • BS, MS, or PhD in Computer Science, Electrical/Computer Engineering, Physics, Mathematics, other Engineering or related fields (or equivalent experience)
  • 8+ years of hands-on experience with accelerated computing and deep Learning frameworks such as PyTorch
  • Experience porting and/or optimizing scientific applications targeting GPUs
  • Strong fundamentals in programming and software design, especially in Python and C++
  • Experience with containerization and orchestration technologies, monitoring, and observability solutions for AI deployments
  • Excellent knowledge of theory and practice of AI at scale
  • Excellent presentation, communication and collaboration skills

Ways To Stand Out From The Crowd

  • Experience with NVIDIA GPUs and parallel programming libraries, such as CUDA, OpenMP, OpenACC, communication libraries and runtime (MPI, NCCL, UCX, NVSHMEM)
  • Excellent C/C++ programming skills, including debugging, profiling, code optimization, performance analysis, and test design
  • Experience working with academic research community supporting HPC or AI
  • Familiarity with distributed computing platforms, containers and scheduling tools
  • Prior experience with DL training at scale, deploying or optimizing DL inference in production

Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. The base salary range is 184,000 USD - 287,500 USD.

You will also be eligible for equity and benefits.

Applications for this job will be accepted at least until July 29, 2025.

NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.

Read Full Description
Confirmed 3 hours ago. Posted a day ago.

Discover Similar Jobs

Suggested Articles