Senior Software Engineer, Deep Learning Inference job opportunity at NVIDIA.



DateMore Than 30 Days Ago bot
NVIDIA Senior Software Engineer, Deep Learning Inference
Experience: 5-years
Pattern: full-time
apply Apply Now
Salary:
Status:

Deep Learning Inference

Copy Link Report
degreeGeneral
loacation Israel, Tel Aviv, Israel
loacation Israel, Tel Av..........Israel

NVIDIA has been at the forefront of the deep learning revolution, pioneering innovations that have transformed the entire field. As the leading provider of GPUs and AI computing platforms, NVIDIA has empowered researchers and engineers worldwide to accelerate breakthroughs in artificial intelligence. We seek a versatile Senior Software Engineer who is passionate about performance optimization and generative AI. Our team builds software solutions that enable efficient inference on the latest and greatest generative AI models. We tackle problems on all levels of the stack—from server-level request batching to GPU kernel fusion—and collaborate with teams across diverse disciplines to push Nvidia's hardware to its full potential. What you’ll be doing: Cooperate with research teams to onboard new LLMs and VLMs into Nvidia's opensource AI runtimes Optimize inference workloads using sophisticated profiling and simulation tools Build SOLID, extendable inference software systems, and refine robust APIs Implement and debug low-level GPU code to harness the latest HW features Own end-to-end inference acceleration features and work with teams around the world to deliver production-grade products What we need to see: B.Sc., M.Sc. or equivalent experience in Computer Science or Computer Engineering 5+ years of relevant hands-on software engineering experience Profound knowledge of software design principles Strong proficiency in at least one system and one scripting language Strong grasp of machine learning concepts People person with excellent communication skills that enjoys collaboration and teamwork. Ways to stand out from the crowd: Familiarity with Nvidia's DL software stack, e.g. Triton Inference Server , TensorRT-LLM , and Model Optimizer Proven track record of performance modeling, profiling, debugging, and development in a performance-critical setting with Nvidia's accelerators. Familiarity with LLM quantization, fine-tunning, and caching algorithms Proficiency in GPU kernel programming (CUDA or OpenCL) Prior experience working on a large software project with 50+ contributors NVIDIA is widely considered one of the world’s most desirable employers in the technology field. We have some of the most forward-thinking and hardworking people working for us. If you're creative and autonomous, we want to hear from you! We are committed to fostering a diverse work environment and are proud to be an equal-opportunity employer. We highly value diversity in our current and future employees. We do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status, or any other characteristic protected by law.

Other Ai Matches

Senior Digital Design Engineer Applicants are expected to have a solid experience in handling Job related tasks
Formal Verification Engineer - New College Grad 2026 Applicants are expected to have a solid experience in handling Job related tasks
System Software Engineer, Tegra SoC Products Applicants are expected to have a solid experience in handling Tegra SoC Products related tasks
Senior Software Engineer, Profiling Services Applicants are expected to have a solid experience in handling Profiling Services related tasks
Senior Circuit Design Engineer - Noise Applicants are expected to have a solid experience in handling Job related tasks
Business Development Leader – Cloud Partner Strategy and Enablement Applicants are expected to have a solid experience in handling Job related tasks
Developer Relations Manager, Inception Program Startups Applicants are expected to have a solid experience in handling Inception Program Startups related tasks
Senior Silicon and System Product Lead Applicants are expected to have a solid experience in handling Job related tasks
remote-jobserver Remote
Senior Software Engineer Aerial Platform Applicants are expected to have a solid experience in handling Job related tasks
remote-jobserver Remote
Inception Community Manager Applicants are expected to have a solid experience in handling Job related tasks
Senior Formal Verification Engineer Applicants are expected to have a solid experience in handling Job related tasks
Solutions Architect Applicants are expected to have a solid experience in handling Job related tasks
Senior Software Architect, Humanoid Robotics Applicants are expected to have a solid experience in handling Humanoid Robotics related tasks
ASIC Design Verification Engineer - New College Grad 2026 Applicants are expected to have a solid experience in handling Job related tasks
Server Factory Planner Applicants are expected to have a solid experience in handling Job related tasks
Manager, Large Language Model Inference Applicants are expected to have a solid experience in handling Large Language Model Inference related tasks
Solutions Architect, Generative AI Applicants are expected to have a solid experience in handling Generative AI related tasks
Photonic Design Engineer Intern - Summer 2026 Applicants are expected to have a solid experience in handling Job related tasks
Senior Technical Program Manager – VLSI Applicants are expected to have a solid experience in handling Job related tasks
Manager, Hardware Offensive Security - Silicon Architecture Applicants are expected to have a solid experience in handling Hardware Offensive Security - Silicon Architecture related tasks
Datacenter GPU Power Architect Applicants are expected to have a solid experience in handling Job related tasks
Senior AI and ML HPC Cluster Engineer Applicants are expected to have a solid experience in handling Job related tasks
System Software Engineer - Secure Cryptographic Services Applicants are expected to have a solid experience in handling Job related tasks