post ads here
AI Engineer & Researcher, Inference - Portland, USA
The mission of Speechify is to make sure that reading is never a barrier to learning. Over 50 million people use Speechify’s text-to-speech products to turn whatever they’re reading – PDFs, books, Google Docs, news articles, websites – into audio, so they can read faster, read more, and remember mor......
Hiring In Portland, USA
full-time Sourced PhD General Speechify... United States Of America
AI Engineer & Researcher, Inference - Ithaca, USA
The mission of Speechify is to make sure that reading is never a barrier to learning. Over 50 million people use Speechify’s text-to-speech products to turn whatever they’re reading – PDFs, books, Google Docs, news articles, websites – into audio, so they can read faster, read more, and remember mor......
Hiring In Ithaca, USA
full-time Sourced PhD General Speechify... United States Of America
AI Engineer & Researcher, Inference - Boise, USA
The mission of Speechify is to make sure that reading is never a barrier to learning. Over 50 million people use Speechify’s text-to-speech products to turn whatever they’re reading – PDFs, books, Google Docs, news articles, websites – into audio, so they can read faster, read more, and remember mor......
Hiring In Boise, USA
full-time Sourced PhD General Speechify... United States Of America
Staff Software Engineer, Inference
About Anthropic Anthropic’s mission is to create reliable, interpretable, and steerable AI systems. We want AI to be safe and beneficial for our users and for society as a whole. Our team is a quickly growing group of committed researchers, engineers, policy experts, and business leaders working tog......
Hiring In Dublin, IE
full-time Sourced High School (S.S.C.E) General Anthropic...
Walk In
Principal Machine Learning Engineer, Distributed vLLM Inference
Job Summary At Red Hat we believe the future of AI is open and we are on a mission to bring the power of open-source LLMs and vLLM to every enterprise. Red Hat Inference team accelerates AI for the enterprise and brings operational simplicity to GenAI deployments. As leading developers, maintainers ......
Hiring In Boston
full-time Sourced Associate General Red Hat, ... United States Of America
DatePosted 22 Days Ago
Senior Software Engineer, Deep Learning Inference - TensorRT
We are now looking for a Senior Software Engineer for Deep Learning Inference! Would you like to make a big impact in Deep Learning by helping build a state-of-the-art inference framework for accelerating Deep Learning models, especially Large Language Models, on NVIDIA GPUs? We are now welcoming ex......
Hiring In US, CA, Santa Clara
full-time Sourced PhD 3-years NVIDIA United States Of America
Software Engineer, Inference - Multi Modal
About the Team OpenAI’s Inference team powers the deployment of our most advanced models - including our GPT models, 4o Image Generation, and Whisper - across a variety of platforms. Our work ensures these models are available, performant, and scalable in production, and we partner closely with Rese......
Hiring In San Francisco
full-time Sourced Bachelor's (B.Sc.) General OpenAI United States Of America
Senior Backend Engineer, Inference Platform
Not enough description was found for this Job.....
Hiring In San Francisco, California
full-time Sourced Bachelor's (B.Sc.) 5+years Together ... United States Of America
DatePosted 8 Days Ago
Principal Software Engineer - AI Inference
NVIDIA is the platform for every new AI-powered application. We seek a Principal Software Engineer - AI Inference to advance open-source LLM serving. This role involves contributing to upstream inference engines like vLLM and SGLang. You will ensure they run outstandingly on NVIDIA GPUs and systems.......
Hiring In US, CA, Santa Clara
full-time Sourced High School (S.S.C.E) 15-years NVIDIA United States Of America
DateMore Than 30 Days Ago
Manager, Large Language Model Inference
At NVIDIA, we aren't just powering the AI revolution—we're accelerating it. The TensorRT inference platform is the backbone of modern AI, delivering the industry's fastest and most efficient deployment of cutting-edge deep learning models on every NVIDIA GPU. With demand for AI exploding, particular......
Hiring In US, CA, Santa Clara
full-time Sourced PhD 3-years NVIDIA United States Of America
Senior Performance and Scale Engineer - Distributed LLM Inference
Job Summary The Red Hat Performance and Scale Engineering team is looking for a Senior Performance Engineer to join us in the PSAP (Performance and Scale for AI Platforms) team, driving the performance and scalability of distributed inference for Large Language Models (LLMs) Serving modern LLMs for ......
Hiring In Raanana
full-time Sourced Associate 3-years Red Hat, ... Israel
Inference Technical Lead, Sora
About the Team The Sora team is pioneering multimodal capabilities for OpenAI’s foundation models. We’re a hybrid research and product team focused on integrating multimodal functionalities into our AI products, ensuring they are reliable, user-friendly, and aligned with our mission of broad societa......
Hiring In San Francisco
full-time Sourced Bachelor's (B.Sc.) General OpenAI United States Of America
DateMore Than 30 Days Ago
Senior Software Engineer, Deep Learning Inference
NVIDIA has been at the forefront of the deep learning revolution, pioneering innovations that have transformed the entire field. As the leading provider of GPUs and AI computing platforms, NVIDIA has empowered researchers and engineers worldwide to accelerate breakthroughs in artificial intelligence......
Hiring In Israel, Tel Aviv
full-time Sourced General 5-years NVIDIA Israel
AI Engineer & Researcher, Inference - Chicago, USA
The mission of Speechify is to make sure that reading is never a barrier to learning. Over 50 million people use Speechify’s text-to-speech products to turn whatever they’re reading – PDFs, books, Google Docs, news articles, websites – into audio, so they can read faster, read more, and remember mor......
Hiring In Chicago, USA
full-time Sourced PhD General Speechify... United States Of America
Experimentation & Causal Inference Intern, Summer 2026
Netflix is one of the world's leading entertainment services, with over 300 million paid memberships in over 190 countries enjoying TV series, films and games across a wide variety of genres and languages. Members can play, pause and resume watching as much as they want, anytime, anywhere, and can c......
Hiring In Los Gatos,California
Onsite Sourced Bachelor's (B.Sc.) General Netflix I... United States Of America
DateMore Than 30 Days Ago
Senior Deep Learning Software Engineer, Inference
NVIDIA seeks a Senior Software Engineer specializing in Deep Learning Inference for our growing team. As a key contributor, you will help design, build, and optimize the GPU-accelerated software that powers today’s most sophisticated AI applications. Our team is responsible for developing and mainta......
Hiring In US, CA, Santa Clara
full-time Sourced PhD 5-years NVIDIA United States Of America
Software Engineer, Networking - Inference
About the Team Our Inference team brings OpenAI’s most capable research and technology to the world through our products. We empower consumers, enterprises and developers alike to use and access our state-of-the-art AI models, allowing them to do things that they’ve never been able to before. We foc......
Hiring In San Francisco
full-time Sourced Bachelor's (B.Sc.) General OpenAI United States Of America
Software Engineer, Inference – AMD GPU Enablement
About the Team Our Inference team brings OpenAI’s most capable research and technology to the world through our products. We empower consumers, enterprises and developers alike to use and access our state-of-the-art AI models, allowing them to do things that they’ve never been able to before. We foc......
Hiring In San Francisco
full-time Sourced Bachelor's (B.Sc.) General OpenAI United States Of America
AI Inference Engineer (London)
Develop APIs for AI inference that will be used by both internal and external #customers __ Benchmark and address bottlenecks throughout our inference stack __ Improve the reliability and observability of our #systems and respond to system outages __ Explore novel #research and implement LLM i......
Hiring In Cogency Global
full-time Sourced Technical Certificate 1-year Perplexit... United Kingdom
DateMore Than 30 Days Ago
Senior GenAI Algorithms Engineer — Model Optimizations for Inference
NVIDIA is at the forefront of the generative AI revolution! The Algorithmic Model Optimization Team specifically focuses on optimizing generative AI models such as large language models (LLM) and diffusion models for maximal inference efficiency using techniques ranging from quantization, speculativ......
Hiring In US, CA, Santa Clara
full-time Sourced OND 5-years NVIDIA United States Of America
DateMore Than 30 Days Ago
Senior Deep Learning Software Engineer, Inference
NVIDIA seeks a Senior Software Engineer specializing in Deep Learning Inference for our growing team. As a key contributor, you will help design, build, and optimize the GPU-accelerated software that powers today’s most sophisticated AI applications. Our team is responsible for developing and mainta......
Hiring In Netherlands, Remote
Remote Sourced PhD 5-years NVIDIA Netherlands
Date2025-12-17T08:54:03.371Z
Embedded Computer Vision Engineer (Edge Inference)
Not enough description was found for this Job.....
Hiring In Singapore
Full-time Sourced General 8-years Rapsodo Singapore
AI Engineer & Researcher, Inference
The mission of Speechify is to make sure that reading is never a barrier to learning. Over 50 million people use Speechify’s text-to-speech products to turn whatever they’re reading – PDFs, books, Google Docs, news articles, websites – into audio, so they can read faster, read more, and remember mor......
Hiring In Remote
Remote Sourced PhD General Speechify...
Walk In
Senior Principal MLOps Engineer, AI Inference
At Red Hat we believe the future of AI is open and we are on a mission to bring the power of open-source LLMs and vLLM to every enterprise. Red Hat Inference team accelerates AI for the enterprise and brings operational simplicity to GenAI deployments. As leading developers, maintainers of the vLLM ......
Hiring In Boston
full-time Sourced Associate 10-years Red Hat, ... United States Of America
AI Engineer & Researcher, Inference - Columbus, USA
The mission of Speechify is to make sure that reading is never a barrier to learning. Over 50 million people use Speechify’s text-to-speech products to turn whatever they’re reading – PDFs, books, Google Docs, news articles, websites – into audio, so they can read faster, read more, and remember mor......
Hiring In Columbus, USA
full-time Sourced PhD General Speechify... United States Of America
LLM Inference Frameworks and Optimization Engineer
Not enough description was found for this Job.....
Hiring In Singapore
full-time Sourced Vocational 3-years Together ... Singapore
Member of technical staff (Inference)
About H: H exists to push the boundaries of superintelligence with agentic AI. By automating complex, multi-step tasks typically performed by humans, AI agents will help unlock full human potential. H is hiring the world’s best AI talent, seeking those who are dedicated as much to building safely an......
Hiring In Paris
full-time Sourced Bachelor's (B.Sc.) General H Company France
Machine Learning Engineer - Inference
Not enough description was found for this Job.....
Hiring In San Francisco
full-time Sourced Other 3-years Together ... United States Of America
Senior Data Scientist - Inference, Global Markets
Airbnb was born in 2007 when two hosts welcomed three guests to their San Francisco home, and has since grown to over 5 million hosts who have welcomed over 2 billion guest arrivals in almost every country across the globe. Every day, hosts offer unique stays and experiences that make it possible fo......
Hybrid Sourced OND 5-years Airbnb In... China
Engineering Manager, Cloud Inference Azure
About Anthropic Anthropic’s mission is to create reliable, interpretable, and steerable AI systems. We want AI to be safe and beneficial for our users and for society as a whole. Our team is a quickly growing group of committed researchers, engineers, policy experts, and business leaders working tog......
Hiring In San Francisco, CA | Seattle, WA; Seattle, WA
full-time Sourced Bachelor's (B.A.) 10-years Anthropic... United States Of America
DateMore Than 30 Days Ago
Solutions Architect, Inference Deployments
We’re forming a team of innovators to roll out and enhance AI inference solutions at scale, demonstrating NVIDIA’s GPU technology and Kubernetes. As a Solutions Architect (Inference Focus), you’ll collaborate closely with our engineering, DevOps, and customer success teams to foster enterprise AI ad......
Hiring In US, CA, Santa Clara
full-time Sourced General General NVIDIA United States Of America
Staff Data Scientist, Platform (Inference/Payments)
Airbnb was born in 2007 when two hosts welcomed three guests to their San Francisco home, and has since grown to over 5 million hosts who have welcomed over 2 billion guest arrivals in almost every country across the globe. Every day, hosts offer unique stays and experiences that make it possible fo......
Hybrid Sourced PhD 9-years Airbnb In... United States Of America
DateMore Than 30 Days Ago
Senior Inference Technical Product Marketing Manager - Accelerated Computing
We are looking for a Senior Technical Product Marketing Manager. This role will be located in our rapidly growing data center business and pivotal in our inference marketing. You will be focused on working with engineering to understand the technical capabilities of our inference stack from GPUs, CP......
Hiring In US, CA, Santa Clara
full-time Sourced OND 6-years NVIDIA United States Of America
DateMore Than 30 Days Ago
Senior Technical Marketing Engineer - AI Inference at Scale
Modern data centers are transforming into AI factories, and NVIDIA accelerated computing is the engine of artificial intelligence. Our data center platforms integrate CPUs, GPUs, DPUs, networking, and a full-stack software ecosystem to power AI at scale. We are looking for a Senior Technical Marketi......
Hiring In US, CA, Santa Clara
full-time Sourced OND 7-years NVIDIA United States Of America
DatePosted 14 Days Ago
Senior Software Engineer – Inference Platform Infrastructure
NVIDIA is the platform upon which every new AI‑powered application is built. We are seeking a Sr. Software Engineer – Inference Platform Infrastructure to help build and automate the foundations that keep NVIDIA’s inference services running smoothly—so they are reliable, scalable, and easy to operat......
Hiring In US, CA, Santa Clara
full-time Sourced General 5-years NVIDIA United States Of America
Distributed ML Systems Engineer- Inference
Design and build large-scale, distributed #machine learning systems that are fault-tolerant and high-performance. __ Develop and optimize distributed processing frameworks and storage systems. __ Collaborate with researchers, #engineers, and product managers to integrate ML systems into our infr......
Hiring In San Francisco
full-time Sourced Technical Certificate 3-years Together ... United States Of America
Engineering Manager, Inference
About Anthropic Anthropic’s mission is to create reliable, interpretable, and steerable AI systems. We want AI to be safe and beneficial for our users and for society as a whole. Our team is a quickly growing group of committed researchers, engineers, policy experts, and business leaders working tog......
Hiring In San Francisco, CA | New York City, NY | Seat......
full-time Sourced Bachelor's (B.A.) 1-years Anthropic... United States Of America
Senior/Staff Software Engineer, Inference
About Anthropic Anthropic’s mission is to create reliable, interpretable, and steerable AI systems. We want AI to be safe and beneficial for our users and for society as a whole. Our team is a quickly growing group of committed researchers, engineers, policy experts, and business leaders working tog......
Hiring In New York City, NY; San Francisco, CA | New Y......
full-time Sourced Bachelor's (B.A.) General Anthropic... United States Of America
Machine Learning Engineer, vLLM Inference - Tool Calling and Structured Output
About the Job At Red Hat, we believe the future of AI is open, and we are on a mission to bring the power of open-source LLMs and vLLM to every enterprise. The Red Hat Inference team accelerates AI for the enterprise and brings operational simplicity to GenAI deployments. As leading contributors and......
Hiring In Boston
full-time Sourced Associate General Red Hat, ... United States Of America
Staff Product Manager, Managed Inference (SF/Sunnyvale)
Crusoe's mission is to accelerate the abundance of energy and intelligence. We’re crafting the engine that powers a world where people can create ambitiously with AI — without sacrificing scale, speed, or sustainability. Be a part of the AI revolution with sustainable technology at Crusoe. Here, you......
Hiring In San Francisco, CA - US
full-time Sourced Bachelor's (B.Sc.) General Crusoe United States Of America
Senior Software Engineer - vLLM Inference
At Red Hat we believe the future of AI is open and we are on a mission to bring the power of open-source LLMs and vLLM to every enterprise. Red Hat Inference team accelerates AI for the enterprise and brings operational simplicity to GenAI deployments. As leading developers, maintainers of the vLLM ......
Hiring In Boston
full-time Sourced Associate 2-years Red Hat, ... United States Of America
AI Inference Engineer
Develop #APIs for AI inference that will be used by both internal and external customers __ Benchmark and address bottlenecks throughout our #inference stack __ Improve the reliability and observability of our systems and respond to #system outages __ Explore novel research and implement #LLM ......
Hiring In San Francisco
full-time Sourced Professional Certificate 1-year Perplexit... United States Of America
DateMore Than 30 Days Ago
Senior DL Algorithms Engineer - Inference Performance
We are now looking for a Senior DL Algorithms Engineer! NVIDIA is seeking senior engineers who are mindful of performance analysis and optimization to help us squeeze every last clock cycle out of Deep Learning workloads. If you are unafraid to work across all layers of the hardware/software stack f......
Hiring In US, CA, Santa Clara
full-time Sourced PhD 3-years NVIDIA United States Of America
Senior Engineering Manager, Model Inference & Serving, Machine Learning Platform
Netflix is one of the world's leading entertainment services, with over 300 million paid memberships in over 190 countries enjoying TV series, films and games across a wide variety of genres and languages. Members can play, pause and resume watching as much as they want, anytime, anywhere, and can c......
Hiring In United States Of America
Remote Sourced Bachelor's (B.Sc.) General Netflix I... United States Of America
DateMore Than 30 Days Ago
Research Manager, Center for Causal Inference (Biostatistics Division)
University Overview The University of Pennsylvania, the largest private employer in Philadelphia, is a world-renowned leader in education, research, and innovation. This historic, Ivy League school consistently ranks among the top 10 universities in the annual U.S. News & World Report survey. Penn h......
Hiring In Blockley Hall
full-time Sourced OND 7-years Universit... United States Of America
Machine Learning Engineer - Inference
Responsibilities Design and build the production systems that power the Together AI inference engine, enabling reliability and performance at scale. #Develop and optimize runtime inference services for large-scale AI applications. Collaborate with researchers, #engineers, product managers, and ......
Hiring In San Francisco
full-time Sourced Bachelor's (B.Sc.) 3-years Together ... United States Of America
DateMore Than 30 Days Ago
Distributed Training & Inference Optimization Engineer (LLM) - GPU Optimization Department (GPUOD)
Job Description: Business Overview AI & Data Division (AIDD) spearheads data science & AI initiatives by leveraging data from Rakuten Group. We build a platform for large-scale field experimentations using cutting-edge technologies to provide critical insights that enable faster and better and faste......
Hiring In Tokyo, Japan
full-time Sourced High School (S.S.C.E) 3-years Rakuten I... Japan
AI Engineer & Researcher, Inference - Boston, USA
The mission of Speechify is to make sure that reading is never a barrier to learning. Over 50 million people use Speechify’s text-to-speech products to turn whatever they’re reading – PDFs, books, Google Docs, news articles, websites – into audio, so they can read faster, read more, and remember mor......
Hiring In Boston, USA
full-time Sourced PhD General Speechify... United States Of America
Staff ML Engineer, Inference Platform
Job Description Hybrid This role is categorized as hybrid. This means the successful candidate is expected to report to the Sunnyvale Tecnical Center, CA at least three times per week, at minimum or other frequency dictated by the business. This job is eligible for relocation assistance. About th......
Hiring In Sunnyvale, California, United States of Amer......
full-time Sourced General 8-years General M... United States Of America
DateMore Than 30 Days Ago
Senior Deep Learning Software Engineer, Inference and Model Optimization
NVIDIA is at the forefront of the generative AI revolution! The Algorithmic Model Optimization Team specifically focuses on optimizing generative AI models such as large language models (LLM) and diffusion models for maximal inference efficiency using techniques ranging from neural architecture sear......
Hiring In US, CA, Santa Clara
full-time Sourced OND 5-years NVIDIA United States Of America
Research Scientist (L4) - Machine Learning and Inference Research, LLM Post-Training
Netflix is one of the world's leading entertainment services, with over 300 million paid memberships in over 190 countries enjoying TV series, films and games across a wide variety of genres and languages. Members can play, pause and resume watching as much as they want, anytime, anywhere, and can c......
Hiring In Los Gatos,California
Onsite Sourced Bachelor's (B.Sc.) General Netflix I... United States Of America
AI Engineer & Researcher, Inference - Champaign-Urbana, USA
The mission of Speechify is to make sure that reading is never a barrier to learning. Over 50 million people use Speechify’s text-to-speech products to turn whatever they’re reading – PDFs, books, Google Docs, news articles, websites – into audio, so they can read faster, read more, and remember mor......
Hiring In Champaign-Urbana, USA
full-time Sourced PhD General Speechify... United States Of America
Technical Program Manager, Inference
About Anthropic Anthropic’s mission is to create reliable, interpretable, and steerable AI systems. We want AI to be safe and beneficial for our users and for society as a whole. Our team is a quickly growing group of committed researchers, engineers, policy experts, and business leaders working tog......
Hiring In San Francisco, CA | Seattle, WA
full-time Sourced High School (S.S.C.E) General Anthropic... United States Of America
AI Engineer & Researcher, Inference - Ann Arbor, USA
The mission of Speechify is to make sure that reading is never a barrier to learning. Over 50 million people use Speechify’s text-to-speech products to turn whatever they’re reading – PDFs, books, Google Docs, news articles, websites – into audio, so they can read faster, read more, and remember mor......
Hiring In Ann Arbor, USA
full-time Sourced PhD General Speechify... United States Of America
Forward Deployed Engineer, AI Inference (vLLM and Kubernetes)
The vLLM and LLM-D Engineering team at Red Hat is looking for a customer obsessed developer to join our team as a Forward Deployed Engineer . In this role, you will not just build software; you will be the bridge between our cutting-edge inference platform ( LLM-D , and vLLM ) and our customers' m......
Hiring In Boston
full-time Sourced Associate General Red Hat, ... United States Of America
DatePosted 8 Days Ago
Senior Compiler Engineer, AI Inference Platforms
NVIDIA's invention of the GPU 1999 sparked the growth of the PC gaming market, redefined modern computer graphics, and revolutionized parallel computing. More recently, GPU deep learning ignited modern AI — the next era of computing — with the GPU acting as the brain of computers, robots, and self-d......
Hiring In US, CA, Santa Clara
full-time Sourced General 3-years NVIDIA United States Of America
Inference Runtime, Engineering Manager
About the Team Our Inference team brings OpenAI’s most capable research and technology to the world through our products. We empower consumers, enterprise and developers alike to use and access our start-of-the-art AI models, allowing them to do things that they’ve never been able to before. We focu......
Hiring In San Francisco
full-time Sourced Bachelor's (B.Sc.) General OpenAI United States Of America
AI Engineer & Researcher, Inference - Seattle, USA
The mission of Speechify is to make sure that reading is never a barrier to learning. Over 50 million people use Speechify’s text-to-speech products to turn whatever they’re reading – PDFs, books, Google Docs, news articles, websites – into audio, so they can read faster, read more, and remember mor......
Hiring In Seattle, USA
full-time Sourced PhD General Speechify... United States Of America
Senior Principal Machine Learning Engineer, vLLM Inference
Job Summary At Red Hat we believe the future of AI is open and we are on a mission to bring the power of open-source LLMs and vLLM to every enterprise. Red Hat Inference team accelerates AI for the enterprise and brings operational simplicity to GenAI deployments. As leading developers, maintainers ......
Hiring In Remote US MA
Remote Sourced Associate General Red Hat, ... United States Of America
AI Engineer & Researcher, Inference - Philadelphia, USA
The mission of Speechify is to make sure that reading is never a barrier to learning. Over 50 million people use Speechify’s text-to-speech products to turn whatever they’re reading – PDFs, books, Google Docs, news articles, websites – into audio, so they can read faster, read more, and remember mor......
Hiring In Philadelphia, USA
full-time Sourced PhD General Speechify... United States Of America
AI Engineer & Researcher, Inference - San Francisco, USA
The mission of Speechify is to make sure that reading is never a barrier to learning. Over 50 million people use Speechify’s text-to-speech products to turn whatever they’re reading – PDFs, books, Google Docs, news articles, websites – into audio, so they can read faster, read more, and remember mor......
Hiring In San Francisco, USA
full-time Sourced PhD General Speechify... United States Of America
Member of Technical Staff, Applied Inference
About xAI xAI’s mission is to create AI systems that can accurately understand the universe and aid humanity in its pursuit of knowledge. Our team is small, highly motivated, and focused on engineering excellence. This organization is for individuals who appreciate challenging themselves and thrive ......
Hiring In Palo Alto, CA; San Francisco, CA
full-time Sourced OND General xAI United States Of America
DatePosted 22 Days Ago
Principal Software Engineer - Inference as a Service
NVIDIA has been transforming computer graphics, PC gaming, and accelerated computing for more than 25 years. It’s a unique legacy of innovation that’s fueled by great technology—and amazing people. Today, we’re tapping into the unlimited potential of AI to define the next era of computing. An era in......
Hiring In US, CA, Santa Clara
full-time Sourced PhD Highly Experienced NVIDIA United States Of America
DateMore Than 30 Days Ago
Senior Software Engineer, AI Inference Systems
We are seeking highly skilled and motivated software engineers to join us and build AI inference systems that serve large-scale models with extreme efficiency. You’ll architect and implement high-performance inference stacks, optimize GPU kernels and compilers, drive industry benchmarks, and scale w......
Hiring In Canada, Toronto
full-time Sourced OND 7-years NVIDIA Canada
DateMore Than 30 Days Ago
Product Manager MBA Intern, AI Platform Inference - Summer 2026
Our work at NVIDIA is dedicated towards a computing model focused on visual and AI computing. For two decades, NVIDIA has pioneered visual computing, the art and science of computer graphics, with our invention of the GPU. The GPU has also shown to be spectacularly effective at solving some of the m......
Hiring In US, CA, Santa Clara
full-time Sourced MBA General NVIDIA United States Of America
Date2025-12-17T09:06:15.834Z
Senior Software Engineer, Inference Platform
About AION AION is building an interoperable AI cloud platform by transforming the future of high-performance computing (HPC) through its decentralized AI cloud. Purpose-built for bare-metal performance, AION democratizes access to compute and provides managed services, aiming to be an end-to-end AI......
Hiring In Bengaluru
full-time Sourced OND 4-years AION India
AI Engineer & Researcher, Inference - Atlanta, USA
The mission of Speechify is to make sure that reading is never a barrier to learning. Over 50 million people use Speechify’s text-to-speech products to turn whatever they’re reading – PDFs, books, Google Docs, news articles, websites – into audio, so they can read faster, read more, and remember mor......
Hiring In Atlanta, USA
full-time Sourced PhD General Speechify... United States Of America
Principal Machine Learning Engineer, AI Inference
Job Summary At Red Hat we believe the future of AI is open and we are on a mission to bring the power of open-source LLMs and vLLM to every enterprise. Red Hat Inference team accelerates AI for the enterprise and brings operational simplicity to GenAI deployments. As leading developers, maintainers ......
Hiring In Boston
full-time Sourced Associate General Red Hat, ... United States Of America
AI Engineer & Researcher, Inference - Detroit-Ann Arbor, USA
The mission of Speechify is to make sure that reading is never a barrier to learning. Over 50 million people use Speechify’s text-to-speech products to turn whatever they’re reading – PDFs, books, Google Docs, news articles, websites – into audio, so they can read faster, read more, and remember mor......
Hiring In Detroit-Ann Arbor, USA
full-time Sourced PhD General Speechify... United States Of America
Senior Hardware/Software ML Inference IP and Compiler Developer
Job Details: Job Description: Altera is one of the world’s leading providers of programmable logic solutions. With a renewed focus on agility and hardware‑accelerated innovation, Altera is redefining the future of computing through flexible, high‑performance FPGA technology. Our products power nex......
Hiring In Toronto, Ontario, Canada
full-time Sourced OND 10-years Altera Co... Canada
AI Engineer & Researcher, Inference - Madison, USA
The mission of Speechify is to make sure that reading is never a barrier to learning. Over 50 million people use Speechify’s text-to-speech products to turn whatever they’re reading – PDFs, books, Google Docs, news articles, websites – into audio, so they can read faster, read more, and remember mor......
Hiring In Madison, USA
full-time Sourced PhD General Speechify... United States Of America
Senior Software Engineer, Inference
About Anthropic Anthropic’s mission is to create reliable, interpretable, and steerable AI systems. We want AI to be safe and beneficial for our users and for society as a whole. Our team is a quickly growing group of committed researchers, engineers, policy experts, and business leaders working tog......
Hiring In Dublin, IE
full-time Sourced Bachelor's (B.A.) General Anthropic...
Walk In
AI Engineer & Researcher, Inference - Minneapolis-St. Paul, USA
The mission of Speechify is to make sure that reading is never a barrier to learning. Over 50 million people use Speechify’s text-to-speech products to turn whatever they’re reading – PDFs, books, Google Docs, news articles, websites – into audio, so they can read faster, read more, and remember mor......
Hiring In Minneapolis-St. Paul, USA
full-time Sourced PhD General Speechify... United States Of America
DateMore Than 30 Days Ago
Senior Deep Learning Inference Performance Architect
We are now looking for a Senior Deep Learning Inference Performance Architect! NVIDIA is seeking a Senior Performance Architect - a creative engineer who loves to squeeze out every cycle of performance from deep learning software. The Inference Architecture team does groundbreaking hardware-softwar......
Hiring In US, NC, Durham
full-time Sourced PhD 5-years NVIDIA United States Of America
Member of Technical Staff, Inference
About xAI xAI’s mission is to create AI systems that can accurately understand the universe and aid humanity in its pursuit of knowledge. Our team is small, highly motivated, and focused on engineering excellence. This organization is for individuals who appreciate challenging themselves and thrive ......
Hiring In Palo Alto, CA; San Francisco, CA
full-time Sourced OND General xAI United States Of America
Rust Systems Engineer - Inference
Together AI is seeking a Rust Systems Engineer to join our Inference Engine team, focusing on optimizing and enhancing the performance of our AI inference #systems.  If you are passionate about developing high-performance systems, we want to hear from you. This position offers the chance to collabor......
Hiring In San Francisco
full-time Sourced Professional Certificate 1-year Together ... United States Of America
DatePosted 13 Days Ago
Senior System Software Engineer - Dynamo-Triton Inference Server
We are looking for a Senior System Software Engineer to work on Dynamo-Triton Inference Server . NVIDIA is hiring software engineers for its GPU-accelerated deep learning software team. Academic and commercial groups around the world are using GPUs to power a revolution in AI, enabling breakthrough......
Hiring In US, CA, Santa Clara
full-time Sourced PhD 5-years NVIDIA United States Of America
Software Engineer, Model Inference
About the Team Our Inference team brings OpenAI’s most capable research and technology to the world through our products. We empower consumers, enterprise and developers alike to use and access our start-of-the-art AI models, allowing them to do things that they’ve never been able to before. We focu......
Hiring In San Francisco
full-time Sourced Bachelor's (B.Sc.) General OpenAI United States Of America
AI Engineer & Researcher, Inference - Austin, USA
The mission of Speechify is to make sure that reading is never a barrier to learning. Over 50 million people use Speechify’s text-to-speech products to turn whatever they’re reading – PDFs, books, Google Docs, news articles, websites – into audio, so they can read faster, read more, and remember mor......
Hiring In Austin, USA
full-time Sourced PhD General Speechify... United States Of America
DatePosted 21 Days Ago
Senior Software Engineer - Inference as a Service
NVIDIA has been transforming computer graphics, PC gaming, and accelerated computing for more than 25 years. It’s a unique legacy of innovation that’s fueled by great technology—and amazing people. Today, we’re tapping into the unlimited potential of AI to define the next era of computing. An era in......
Hiring In US, CA, Santa Clara
full-time Sourced PhD Highly Experienced NVIDIA United States Of America
Machine Learning Intern - Dynamic KV-Cache Modeling for Efficient LLM Inference
At d-Matrix , we are focused on unleashing the potential of generative AI to power the transformation of technology. We are at the forefront of software and hardware innovation, pushing the boundaries of what is possible. Our culture is one of respect and collaboration. We value humility and believe......
Hiring In Santa Clara
Intern Sourced Bachelor's (B.Sc.) General d-Matrix United States Of America
Senior AI Inference Compiler Engineer
This position is posted by Jobgether on behalf of a partner company. We are currently looking for a Senior AI Inference Compiler Engineer in the United States.This role offers the opportunity to advance the performance and efficiency of AI inference engines across GPUs, personal devices, robotics, a......
Hiring In United States Of America
Remote Sourced Bachelor's (B.Sc.) 3 Years Jobgether United States Of America
AI Engineer & Researcher, Inference - Salt Lake City, USA
The mission of Speechify is to make sure that reading is never a barrier to learning. Over 50 million people use Speechify’s text-to-speech products to turn whatever they’re reading – PDFs, books, Google Docs, news articles, websites – into audio, so they can read faster, read more, and remember mor......
Hiring In Salt Lake City, USA
full-time Sourced PhD General Speechify... United States Of America
AI Engineer & Researcher, Inference - Denver, USA
The mission of Speechify is to make sure that reading is never a barrier to learning. Over 50 million people use Speechify’s text-to-speech products to turn whatever they’re reading – PDFs, books, Google Docs, news articles, websites – into audio, so they can read faster, read more, and remember mor......
Hiring In Denver, USA
full-time Sourced PhD General Speechify... United States Of America
DateMore Than 30 Days Ago
Senior System Software Engineer - AI Data Platform - Inference Factory Optimization
Our team is building the foundational infrastructure that powers NVIDIA's cutting-edge innovations in AI and high-performance computing. We are seeking a Senior Software Engineer to design, build, and optimize highly scalable and reliable automation systems that ensure the peak performance and seaml......
Hiring In Vietnam, Hanoi
full-time Sourced Bachelor's (B.A.) 5-years NVIDIA Vietnam
Software Engineer, Load Balancing - Inference
About the Team Our Inference team brings OpenAI’s most capable research and technology to the world through our products. We empower consumers, enterprises and developers alike to use and access our state-of-the-art AI models, allowing them to do things that they’ve never been able to before. We foc......
Hiring In San Francisco
full-time Sourced Bachelor's (B.Sc.) General OpenAI United States Of America
Staff Applied Scientist - Causal Inference
Ready to be pushed beyond what you think you’re capable of? At Coinbase, our mission is to increase economic freedom in the world. It’s a massive, ambitious opportunity that demands the best of us, every day, as we build the emerging onchain platform — and with it, the future global financial system......
Hiring In Remote - USA
Remote Sourced OND 8-years Coinbase ... United States Of America
Senior Principal Machine Learning Engineer, Distributed vLLM Inference with Kubernetes
Job Summary At Red Hat we believe the future of AI is open and we are on a mission to bring the power of open-source LLMs and vLLM to every enterprise. Red Hat Inference team accelerates AI for the enterprise and brings operational simplicity to GenAI deployments. As leading developers, maintainers ......
Hiring In Boston
full-time Sourced Associate General Red Hat, ... United States Of America
DatePosted 13 Days Ago
Senior DL Algorithms Engineer - Inference Performance
We are now looking for a Senior DL Algorithms Engineer! NVIDIA is seeking senior engineers who are mindful of performance analysis and optimization to help us squeeze every last clock cycle out of Deep Learning workloads. If you are unafraid to work across all layers of the hardware/software stack f......
Hiring In US, CA, Santa Clara
full-time Sourced PhD 5-years NVIDIA United States Of America
AI Engineer & Researcher, Inference - Raleigh-Durham, USA
The mission of Speechify is to make sure that reading is never a barrier to learning. Over 50 million people use Speechify’s text-to-speech products to turn whatever they’re reading – PDFs, books, Google Docs, news articles, websites – into audio, so they can read faster, read more, and remember mor......
Hiring In Raleigh-Durham, USA
full-time Sourced PhD General Speechify... United States Of America
Senior Research Engineer, TikTok AI Search (LLM Pretraining/Alignment/Inference)
Responsibilities About the team On the TikTok Search Team, you will have the opportunity to develop and apply cutting edge machine learning technologies in real-time large-scale systems, which serve billions of search requests every day. Via advanced NLP and multi-modal models, our projects impa......
Hiring In San Jose
full-time Sourced Bachelor's (B.Sc.) 5-years Tiktok United States Of America
LLM Inference Deployment Engineer
EnCharge AI is a leader in advanced AI hardware and software systems for edge-to-cloud computing. EnCharge’s robust and scalable next-generation in-memory computing technology provides orders-of-magnitude higher compute efficiency and density compared to today’s best-in-class solutions. The high-per......
Hiring In U.S., Canada, Germany, Norway
full-time Sourced OND General Encharge ... United States Of America
DateMore Than 30 Days Ago
Senior Software Engineer, AI Inference Systems
We are seeking highly skilled and motivated software engineers to join us and build AI inference systems that serve large-scale models with extreme efficiency. You’ll architect and implement high-performance inference stacks, optimize GPU kernels and compilers, drive industry benchmarks, and scale w......
Hiring In US, CA, Santa Clara
full-time Sourced OND 7-years NVIDIA United States Of America
Senior/Staff Software Engineer - Machine Learning Platform (Inference)
Snowflake is about empowering enterprises to achieve their full potential — and people too. With a culture that’s all in on impact, innovation, and collaboration, Snowflake is the sweet spot for building big, moving fast, and taking technology — and careers — to the next level. Build the future of d......
Hiring In US-CA-Menlo Park
full-time Sourced Bachelor's (B.Sc.) General Snowflake... United States Of America
Staff Data Scientist, Inference - Customer Support
Airbnb was born in 2007 when two hosts welcomed three guests to their San Francisco home, and has since grown to over 5 million hosts who have welcomed over 2 billion guest arrivals in almost every country across the globe. Every day, hosts offer unique stays and experiences that make it possible fo......
Hybrid Sourced PhD 9-years Airbnb In... United States Of America
post ads here