Deep Learning Performance Architect - Intern - 2026 job opportunity at NVIDIA.



DateMore Than 30 Days Ago bot
NVIDIA Deep Learning Performance Architect - Intern - 2026
Experience: General
Pattern: full-time
apply Apply Now
Salary:
Status:

Job

Copy Link Report
degreeGeneral
loacation China, Shanghai, China
loacation China, Shangha..........China

NVIDIA is developing processor and system architectures that accelerate deep learning and high-performance computing applications. We are looking for an intern deep learning system performance architect to join our AI performance modelling, analysis and optimization efforts. In this position, you will have a chance to work on DL performance modelling, analysis, and optimization on state-of-the-art hardware architectures for various LLM workloads. You will make your contributions to our dynamic technology focused company. What you’ll be doing: Analyze state of the art DL networks (LLM etc.), identify and prototype performance opportunities to influence SW and Architecture team for NVIDIA's current and next gen inference products. Develop analytical models for the state of the art deep learning networks and algorithm to innovate processor and system architectures design for performance and efficiency. Specify hardware/software configurations and metrics to analyze performance, power, and accuracy in existing and future uni-processor and multiprocessor configurations. Collaborate across the company to guide the direction of next-gen deep learning HW/SW by working with architecture, software, and product teams. What we need to see: BS or higher degree in a relevant technical field (CS, EE, CE, Math, etc.). Strong programming skills in Python, C, C++. Strong background in computer architecture. Experience with performance modeling, architecture simulation, profiling, and analysis. Prior experience with LLM or generative AI algorithms. Ways to stand out from the crowd: GPU Computing and parallel programming models such as CUDA and OpenCL. Architecture of or workload analysis on other deep learning accelerators. Deep neural network training, inference and optimization in leading frameworks (e.g. Pytorch, TensorRT-LLM, vLLM, etc.). Open-source AI compilers (OpenAI Triton, MLIR, TVM, XLA, etc.). NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.

Other Ai Matches

Senior Deep Learning Engineer - Model Evaluation & AI Systems Applicants are expected to have a solid experience in handling Job related tasks
Senior Software Engineer - Networking Applicants are expected to have a solid experience in handling Job related tasks
Manufacturing Test Engineer Applicants are expected to have a solid experience in handling Job related tasks
Senior Memory System Engineer Applicants are expected to have a solid experience in handling Job related tasks
Senior Automotive Sensor Ecosystem Engineer - Autonomous Vehicles Applicants are expected to have a solid experience in handling Job related tasks
Senior System Software Engineer - Data Engineering Applicants are expected to have a solid experience in handling Job related tasks
Global Head of Business Development, Digital Health Applicants are expected to have a solid experience in handling Digital Health related tasks
Firmware Manager - NVLink Applicants are expected to have a solid experience in handling Job related tasks
Senior Lab Engineer Applicants are expected to have a solid experience in handling Job related tasks
Senior Solution Architect, HPC - NVIS Applicants are expected to have a solid experience in handling HPC - NVIS related tasks
ASIC Physical Design Engineer Applicants are expected to have a solid experience in handling Job related tasks
Senior Deep Learning Performance Architect Applicants are expected to have a solid experience in handling Job related tasks
Senior CAD Engineer - DFX Software Applicants are expected to have a solid experience in handling Job related tasks
Senior Software Test Development Engineer Applicants are expected to have a solid experience in handling Job related tasks
Senior SRAM Co-Design Engineer Applicants are expected to have a solid experience in handling Job related tasks
Senior HPC and AI Networking Performance Research and Analysis Engineer Applicants are expected to have a solid experience in handling Job related tasks
Senior GenAI Technical Lead, Partner Platforms Applicants are expected to have a solid experience in handling Partner Platforms related tasks
Global Commodity Manager, Semiconductors Applicants are expected to have a solid experience in handling Semiconductors related tasks
Senior Software Engineer Applicants are expected to have a solid experience in handling Job related tasks
remote-jobserver Remote
Senior Software Architect - Deep Learning and HPC Communications Applicants are expected to have a solid experience in handling Job related tasks
Senior Counsel, Product Legal – Software Applicants are expected to have a solid experience in handling Product Legal – Software related tasks
Senior Manufacturing Development Engineer Applicants are expected to have a solid experience in handling Job related tasks
Software QA Automation Infrastructure Engineer Applicants are expected to have a solid experience in handling Job related tasks