Requiring Inference Experience Jobs & Careers

AI Engineer & Researcher, Inference - Portland, USA

The mission of Speechify is to make sure that reading is never a barrier to learning. Over 50 million people use Speechify’s text-to-speech products to turn whatever they’re reading – PDFs, books, Google Docs, news articles, websites – into audio, so they can read faster, read more, and remember mor......

Hiring In Portland, USA

full-time

Sourced

PhD

General

Speechify...

AI Engineer & Researcher, Inference - Ithaca, USA

Hiring In Ithaca, USA

full-time

Sourced

PhD

General

Speechify...

AI Engineer & Researcher, Inference - Boise, USA

Hiring In Boise, USA

full-time

Sourced

PhD

General

Speechify...

Staff Software Engineer, Inference

About Anthropic Anthropic’s mission is to create reliable, interpretable, and steerable AI systems. We want AI to be safe and beneficial for our users and for society as a whole. Our team is a quickly growing group of committed researchers, engineers, policy experts, and business leaders working tog......

Hiring In Dublin, IE

full-time

Sourced

High School (S.S.C.E)

General

Anthropic...

Walk In

Principal Machine Learning Engineer, Distributed vLLM Inference

Job Summary At Red Hat we believe the future of AI is open and we are on a mission to bring the power of open-source LLMs and vLLM to every enterprise. Red Hat Inference team accelerates AI for the enterprise and brings operational simplicity to GenAI deployments. As leading developers, maintainers ......

Hiring In Boston

full-time

Sourced

Associate

General

Red Hat, ...

_{Posted 22 Days Ago}

Senior Software Engineer, Deep Learning Inference - TensorRT

We are now looking for a Senior Software Engineer for Deep Learning Inference! Would you like to make a big impact in Deep Learning by helping build a state-of-the-art inference framework for accelerating Deep Learning models, especially Large Language Models, on NVIDIA GPUs? We are now welcoming ex......

Hiring In US, CA, Santa Clara

full-time

Sourced

PhD

3-years

NVIDIA

Software Engineer, Inference - Multi Modal

About the Team OpenAI’s Inference team powers the deployment of our most advanced models - including our GPT models, 4o Image Generation, and Whisper - across a variety of platforms. Our work ensures these models are available, performant, and scalable in production, and we partner closely with Rese......

Hiring In San Francisco

full-time

Sourced

Bachelor's (B.Sc.)

General

OpenAI

Senior Backend Engineer, Inference Platform

Not enough description was found for this Job.....

Hiring In San Francisco, California

full-time

Sourced

Bachelor's (B.Sc.)

5+years

Together ...

_{Posted 8 Days Ago}

Principal Software Engineer - AI Inference

NVIDIA is the platform for every new AI-powered application. We seek a Principal Software Engineer - AI Inference to advance open-source LLM serving. This role involves contributing to upstream inference engines like vLLM and SGLang. You will ensure they run outstandingly on NVIDIA GPUs and systems.......

Hiring In US, CA, Santa Clara

full-time

Sourced

High School (S.S.C.E)

15-years

NVIDIA

_{More Than 30 Days Ago}

Manager, Large Language Model Inference

At NVIDIA, we aren't just powering the AI revolution—we're accelerating it. The TensorRT inference platform is the backbone of modern AI, delivering the industry's fastest and most efficient deployment of cutting-edge deep learning models on every NVIDIA GPU. With demand for AI exploding, particular......

Hiring In US, CA, Santa Clara

full-time

Sourced

PhD

3-years

NVIDIA

Senior Performance and Scale Engineer - Distributed LLM Inference

Job Summary The Red Hat Performance and Scale Engineering team is looking for a Senior Performance Engineer to join us in the PSAP (Performance and Scale for AI Platforms) team, driving the performance and scalability of distributed inference for Large Language Models (LLMs) Serving modern LLMs for ......

Hiring In Raanana

full-time

Sourced

Associate

3-years

Red Hat, ...

Inference Technical Lead, Sora

About the Team The Sora team is pioneering multimodal capabilities for OpenAI’s foundation models. We’re a hybrid research and product team focused on integrating multimodal functionalities into our AI products, ensuring they are reliable, user-friendly, and aligned with our mission of broad societa......

Hiring In San Francisco

full-time

Sourced

Bachelor's (B.Sc.)

General

OpenAI

_{More Than 30 Days Ago}

Senior Software Engineer, Deep Learning Inference

NVIDIA has been at the forefront of the deep learning revolution, pioneering innovations that have transformed the entire field. As the leading provider of GPUs and AI computing platforms, NVIDIA has empowered researchers and engineers worldwide to accelerate breakthroughs in artificial intelligence......

Hiring In Israel, Tel Aviv

full-time

Sourced

General

5-years

NVIDIA

AI Engineer & Researcher, Inference - Chicago, USA

Hiring In Chicago, USA

full-time

Sourced

PhD

General

Speechify...

Experimentation & Causal Inference Intern, Summer 2026

Netflix is one of the world's leading entertainment services, with over 300 million paid memberships in over 190 countries enjoying TV series, films and games across a wide variety of genres and languages. Members can play, pause and resume watching as much as they want, anytime, anywhere, and can c......

Hiring In Los Gatos,California

Onsite

Sourced

Bachelor's (B.Sc.)

General

Netflix I...

_{More Than 30 Days Ago}

Senior Deep Learning Software Engineer, Inference

NVIDIA seeks a Senior Software Engineer specializing in Deep Learning Inference for our growing team. As a key contributor, you will help design, build, and optimize the GPU-accelerated software that powers today’s most sophisticated AI applications. Our team is responsible for developing and mainta......

Hiring In US, CA, Santa Clara

full-time

Sourced

PhD

5-years

NVIDIA

Software Engineer, Networking - Inference

About the Team Our Inference team brings OpenAI’s most capable research and technology to the world through our products. We empower consumers, enterprises and developers alike to use and access our state-of-the-art AI models, allowing them to do things that they’ve never been able to before. We foc......

Hiring In San Francisco

full-time

Sourced

Bachelor's (B.Sc.)

General

OpenAI

Software Engineer, Inference – AMD GPU Enablement

Hiring In San Francisco

full-time

Sourced

Bachelor's (B.Sc.)

General

OpenAI

AI Inference Engineer (London)

Develop APIs for AI inference that will be used by both internal and external #customers __ Benchmark and address bottlenecks throughout our inference stack __ Improve the reliability and observability of our #systems and respond to system outages __ Explore novel #research and implement LLM i......

Hiring In Cogency Global

full-time

Sourced

Technical Certificate

1-year

Perplexit...

_{More Than 30 Days Ago}

Senior GenAI Algorithms Engineer — Model Optimizations for Inference

NVIDIA is at the forefront of the generative AI revolution! The Algorithmic Model Optimization Team specifically focuses on optimizing generative AI models such as large language models (LLM) and diffusion models for maximal inference efficiency using techniques ranging from quantization, speculativ......

Hiring In US, CA, Santa Clara

full-time

Sourced

OND

5-years

NVIDIA

_{More Than 30 Days Ago}

Senior Deep Learning Software Engineer, Inference

Hiring In Netherlands, Remote

Remote

Sourced

PhD

5-years

NVIDIA

_{2025-12-17T08:54:03.371Z}

Embedded Computer Vision Engineer (Edge Inference)

Not enough description was found for this Job.....

Hiring In Singapore

Full-time

Sourced

General

8-years

Rapsodo

AI Engineer & Researcher, Inference

Hiring In Remote

Remote

Sourced

PhD

General

Speechify...

Walk In

Senior Principal MLOps Engineer, AI Inference

At Red Hat we believe the future of AI is open and we are on a mission to bring the power of open-source LLMs and vLLM to every enterprise. Red Hat Inference team accelerates AI for the enterprise and brings operational simplicity to GenAI deployments. As leading developers, maintainers of the vLLM ......

Hiring In Boston

full-time

Sourced

Associate

10-years

Red Hat, ...

AI Engineer & Researcher, Inference - Columbus, USA

Hiring In Columbus, USA

full-time

Sourced

PhD

General

Speechify...

LLM Inference Frameworks and Optimization Engineer

Not enough description was found for this Job.....

Hiring In Singapore

full-time

Sourced

Vocational

3-years

Together ...

Member of technical staff (Inference)

About H: H exists to push the boundaries of superintelligence with agentic AI. By automating complex, multi-step tasks typically performed by humans, AI agents will help unlock full human potential. H is hiring the world’s best AI talent, seeking those who are dedicated as much to building safely an......

Hiring In Paris

full-time

Sourced

Bachelor's (B.Sc.)

General

H Company

Machine Learning Engineer - Inference

Not enough description was found for this Job.....

Hiring In San Francisco

full-time

Sourced

Other

3-years

Together ...

Senior Data Scientist - Inference, Global Markets

Airbnb was born in 2007 when two hosts welcomed three guests to their San Francisco home, and has since grown to over 5 million hosts who have welcomed over 2 billion guest arrivals in almost every country across the globe. Every day, hosts offer unique stays and experiences that make it possible fo......

Hybrid

Sourced

OND

5-years

Airbnb In...

Engineering Manager, Cloud Inference Azure

Hiring In San Francisco, CA | Seattle, WA; Seattle, WA

full-time

Sourced

Bachelor's (B.A.)

10-years

Anthropic...

_{More Than 30 Days Ago}

Solutions Architect, Inference Deployments

We’re forming a team of innovators to roll out and enhance AI inference solutions at scale, demonstrating NVIDIA’s GPU technology and Kubernetes. As a Solutions Architect (Inference Focus), you’ll collaborate closely with our engineering, DevOps, and customer success teams to foster enterprise AI ad......

Hiring In US, CA, Santa Clara

full-time

Sourced

General

NVIDIA

Staff Data Scientist, Platform (Inference/Payments)

Hybrid

Sourced

PhD

9-years

Airbnb In...

_{More Than 30 Days Ago}

Senior Inference Technical Product Marketing Manager - Accelerated Computing

We are looking for a Senior Technical Product Marketing Manager. This role will be located in our rapidly growing data center business and pivotal in our inference marketing. You will be focused on working with engineering to understand the technical capabilities of our inference stack from GPUs, CP......

Hiring In US, CA, Santa Clara

full-time

Sourced

OND

6-years

NVIDIA

_{More Than 30 Days Ago}

Senior Technical Marketing Engineer - AI Inference at Scale

Modern data centers are transforming into AI factories, and NVIDIA accelerated computing is the engine of artificial intelligence. Our data center platforms integrate CPUs, GPUs, DPUs, networking, and a full-stack software ecosystem to power AI at scale. We are looking for a Senior Technical Marketi......

Hiring In US, CA, Santa Clara

full-time

Sourced

OND

7-years

NVIDIA

_{Posted 14 Days Ago}

Senior Software Engineer – Inference Platform Infrastructure

NVIDIA is the platform upon which every new AI‑powered application is built. We are seeking a Sr. Software Engineer – Inference Platform Infrastructure to help build and automate the foundations that keep NVIDIA’s inference services running smoothly—so they are reliable, scalable, and easy to operat......

Hiring In US, CA, Santa Clara

full-time

Sourced

General

5-years

NVIDIA

Distributed ML Systems Engineer- Inference

Design and build large-scale, distributed #machine learning systems that are fault-tolerant and high-performance. __ Develop and optimize distributed processing frameworks and storage systems. __ Collaborate with researchers, #engineers, and product managers to integrate ML systems into our infr......

Hiring In San Francisco

full-time

Sourced

Technical Certificate

3-years

Together ...

Engineering Manager, Inference

Hiring In San Francisco, CA | New York City, NY | Seat......

full-time

Sourced

Bachelor's (B.A.)

1-years

Anthropic...

Senior/Staff Software Engineer, Inference

Hiring In New York City, NY; San Francisco, CA | New Y......

full-time

Sourced

Bachelor's (B.A.)

General

Anthropic...

Machine Learning Engineer, vLLM Inference - Tool Calling and Structured Output

About the Job At Red Hat, we believe the future of AI is open, and we are on a mission to bring the power of open-source LLMs and vLLM to every enterprise. The Red Hat Inference team accelerates AI for the enterprise and brings operational simplicity to GenAI deployments. As leading contributors and......

Hiring In Boston

full-time

Sourced

Associate

General

Red Hat, ...

Staff Product Manager, Managed Inference (SF/Sunnyvale)

Crusoe's mission is to accelerate the abundance of energy and intelligence. We’re crafting the engine that powers a world where people can create ambitiously with AI — without sacrificing scale, speed, or sustainability. Be a part of the AI revolution with sustainable technology at Crusoe. Here, you......

Hiring In San Francisco, CA - US

full-time

Sourced

Bachelor's (B.Sc.)

General

Crusoe

Senior Software Engineer - vLLM Inference

Hiring In Boston

full-time

Sourced

Associate

2-years

Red Hat, ...

AI Inference Engineer

Develop #APIs for AI inference that will be used by both internal and external customers __ Benchmark and address bottlenecks throughout our #inference stack __ Improve the reliability and observability of our systems and respond to #system outages __ Explore novel research and implement #LLM ......

Hiring In San Francisco

full-time

Sourced

Professional Certificate

1-year

Perplexit...

_{More Than 30 Days Ago}

Senior DL Algorithms Engineer - Inference Performance

We are now looking for a Senior DL Algorithms Engineer! NVIDIA is seeking senior engineers who are mindful of performance analysis and optimization to help us squeeze every last clock cycle out of Deep Learning workloads. If you are unafraid to work across all layers of the hardware/software stack f......

Hiring In US, CA, Santa Clara

full-time

Sourced

PhD

3-years

NVIDIA

Senior Engineering Manager, Model Inference & Serving, Machine Learning Platform

Hiring In United States Of America

Remote

Sourced

Bachelor's (B.Sc.)

General

Netflix I...

_{More Than 30 Days Ago}

Research Manager, Center for Causal Inference (Biostatistics Division)

University Overview The University of Pennsylvania, the largest private employer in Philadelphia, is a world-renowned leader in education, research, and innovation. This historic, Ivy League school consistently ranks among the top 10 universities in the annual U.S. News & World Report survey. Penn h......

Hiring In Blockley Hall

full-time

Sourced

OND

7-years

Universit...

Machine Learning Engineer - Inference

Responsibilities Design and build the production systems that power the Together AI inference engine, enabling reliability and performance at scale. #Develop and optimize runtime inference services for large-scale AI applications. Collaborate with researchers, #engineers, product managers, and ......

Hiring In San Francisco

full-time

Sourced

Bachelor's (B.Sc.)

3-years

Together ...

_{More Than 30 Days Ago}

Distributed Training & Inference Optimization Engineer (LLM) - GPU Optimization Department (GPUOD)

Job Description: Business Overview AI & Data Division (AIDD) spearheads data science & AI initiatives by leveraging data from Rakuten Group. We build a platform for large-scale field experimentations using cutting-edge technologies to provide critical insights that enable faster and better and faste......

Hiring In Tokyo, Japan

full-time

Sourced

High School (S.S.C.E)

3-years

Rakuten I...

AI Engineer & Researcher, Inference - Boston, USA

Hiring In Boston, USA

full-time

Sourced

PhD

General

Speechify...

Staff ML Engineer, Inference Platform

Job Description Hybrid This role is categorized as hybrid. This means the successful candidate is expected to report to the Sunnyvale Tecnical Center, CA at least three times per week, at minimum or other frequency dictated by the business. This job is eligible for relocation assistance. About th......

Hiring In Sunnyvale, California, United States of Amer......

full-time

Sourced

General

8-years

General M...

_{More Than 30 Days Ago}

Senior Deep Learning Software Engineer, Inference and Model Optimization

Hiring In US, CA, Santa Clara

full-time

Sourced

OND

5-years

NVIDIA

Research Scientist (L4) - Machine Learning and Inference Research, LLM Post-Training

Hiring In Los Gatos,California

Onsite

Sourced

Bachelor's (B.Sc.)

General

Netflix I...

AI Engineer & Researcher, Inference - Champaign-Urbana, USA

Hiring In Champaign-Urbana, USA

full-time

Sourced

PhD

General

Speechify...

Technical Program Manager, Inference

Hiring In San Francisco, CA | Seattle, WA

full-time

Sourced

High School (S.S.C.E)

General

Anthropic...

AI Engineer & Researcher, Inference - Ann Arbor, USA

Hiring In Ann Arbor, USA

full-time

Sourced

PhD

General

Speechify...

Forward Deployed Engineer, AI Inference (vLLM and Kubernetes)

The vLLM and LLM-D Engineering team at Red Hat is looking for a customer obsessed developer to join our team as a Forward Deployed Engineer . In this role, you will not just build software; you will be the bridge between our cutting-edge inference platform ( LLM-D , and vLLM ) and our customers' m......

Hiring In Boston

full-time

Sourced

Associate

General

Red Hat, ...

_{Posted 8 Days Ago}

Senior Compiler Engineer, AI Inference Platforms

NVIDIA's invention of the GPU 1999 sparked the growth of the PC gaming market, redefined modern computer graphics, and revolutionized parallel computing. More recently, GPU deep learning ignited modern AI — the next era of computing — with the GPU acting as the brain of computers, robots, and self-d......

Hiring In US, CA, Santa Clara

full-time

Sourced

General

3-years

NVIDIA

Inference Runtime, Engineering Manager

About the Team Our Inference team brings OpenAI’s most capable research and technology to the world through our products. We empower consumers, enterprise and developers alike to use and access our start-of-the-art AI models, allowing them to do things that they’ve never been able to before. We focu......

Hiring In San Francisco

full-time

Sourced

Bachelor's (B.Sc.)

General

OpenAI

AI Engineer & Researcher, Inference - Seattle, USA

Hiring In Seattle, USA

full-time

Sourced

PhD

General

Speechify...

Senior Principal Machine Learning Engineer, vLLM Inference

Hiring In Remote US MA

Remote

Sourced

Associate

General

Red Hat, ...

AI Engineer & Researcher, Inference - Philadelphia, USA

Hiring In Philadelphia, USA

full-time

Sourced

PhD

General

Speechify...

AI Engineer & Researcher, Inference - San Francisco, USA

Hiring In San Francisco, USA

full-time

Sourced

PhD

General

Speechify...

Member of Technical Staff, Applied Inference

About xAI xAI’s mission is to create AI systems that can accurately understand the universe and aid humanity in its pursuit of knowledge. Our team is small, highly motivated, and focused on engineering excellence. This organization is for individuals who appreciate challenging themselves and thrive ......

Hiring In Palo Alto, CA; San Francisco, CA

full-time

Sourced

OND

General

xAI

_{Posted 22 Days Ago}

Principal Software Engineer - Inference as a Service

NVIDIA has been transforming computer graphics, PC gaming, and accelerated computing for more than 25 years. It’s a unique legacy of innovation that’s fueled by great technology—and amazing people. Today, we’re tapping into the unlimited potential of AI to define the next era of computing. An era in......

Hiring In US, CA, Santa Clara

full-time

Sourced

PhD

Highly Experienced

NVIDIA

_{More Than 30 Days Ago}

Senior Software Engineer, AI Inference Systems

We are seeking highly skilled and motivated software engineers to join us and build AI inference systems that serve large-scale models with extreme efficiency. You’ll architect and implement high-performance inference stacks, optimize GPU kernels and compilers, drive industry benchmarks, and scale w......

Hiring In Canada, Toronto

full-time

Sourced

OND

7-years

NVIDIA

_{More Than 30 Days Ago}

Product Manager MBA Intern, AI Platform Inference - Summer 2026

Our work at NVIDIA is dedicated towards a computing model focused on visual and AI computing. For two decades, NVIDIA has pioneered visual computing, the art and science of computer graphics, with our invention of the GPU. The GPU has also shown to be spectacularly effective at solving some of the m......

Hiring In US, CA, Santa Clara

full-time

Sourced

MBA

General

NVIDIA

_{2025-12-17T09:06:15.834Z}

Senior Software Engineer, Inference Platform

About AION AION is building an interoperable AI cloud platform by transforming the future of high-performance computing (HPC) through its decentralized AI cloud. Purpose-built for bare-metal performance, AION democratizes access to compute and provides managed services, aiming to be an end-to-end AI......

Hiring In Bengaluru

full-time

Sourced

OND

4-years

AION

AI Engineer & Researcher, Inference - Atlanta, USA

Hiring In Atlanta, USA

full-time

Sourced

PhD

General

Speechify...

Principal Machine Learning Engineer, AI Inference

Hiring In Boston

full-time

Sourced

Associate

General

Red Hat, ...

AI Engineer & Researcher, Inference - Detroit-Ann Arbor, USA

Hiring In Detroit-Ann Arbor, USA

full-time

Sourced

PhD

General

Speechify...

Senior Hardware/Software ML Inference IP and Compiler Developer

Job Details: Job Description: Altera is one of the world’s leading providers of programmable logic solutions. With a renewed focus on agility and hardware‑accelerated innovation, Altera is redefining the future of computing through flexible, high‑performance FPGA technology. Our products power nex......

Hiring In Toronto, Ontario, Canada

full-time

Sourced

OND

10-years

Altera Co...

AI Engineer & Researcher, Inference - Madison, USA

Hiring In Madison, USA

full-time

Sourced

PhD

General

Speechify...

Senior Software Engineer, Inference

Hiring In Dublin, IE

full-time

Sourced

Bachelor's (B.A.)

General

Anthropic...

Walk In

AI Engineer & Researcher, Inference - Minneapolis-St. Paul, USA

Hiring In Minneapolis-St. Paul, USA

full-time

Sourced

PhD

General

Speechify...

_{More Than 30 Days Ago}

Senior Deep Learning Inference Performance Architect

We are now looking for a Senior Deep Learning Inference Performance Architect! NVIDIA is seeking a Senior Performance Architect - a creative engineer who loves to squeeze out every cycle of performance from deep learning software. The Inference Architecture team does groundbreaking hardware-softwar......

Hiring In US, NC, Durham

full-time

Sourced

PhD

5-years

NVIDIA

Member of Technical Staff, Inference

Hiring In Palo Alto, CA; San Francisco, CA

full-time

Sourced

OND

General

xAI

Rust Systems Engineer - Inference

Together AI is seeking a Rust Systems Engineer to join our Inference Engine team, focusing on optimizing and enhancing the performance of our AI inference #systems. If you are passionate about developing high-performance systems, we want to hear from you. This position offers the chance to collabor......

Hiring In San Francisco

full-time

Sourced

Professional Certificate

1-year

Together ...

_{Posted 13 Days Ago}

Senior System Software Engineer - Dynamo-Triton Inference Server

We are looking for a Senior System Software Engineer to work on Dynamo-Triton Inference Server . NVIDIA is hiring software engineers for its GPU-accelerated deep learning software team. Academic and commercial groups around the world are using GPUs to power a revolution in AI, enabling breakthrough......

Hiring In US, CA, Santa Clara

full-time

Sourced

PhD

5-years

NVIDIA

Software Engineer, Model Inference

Hiring In San Francisco

full-time

Sourced

Bachelor's (B.Sc.)

General

OpenAI

AI Engineer & Researcher, Inference - Austin, USA

Hiring In Austin, USA

full-time

Sourced

PhD

General

Speechify...

_{Posted 21 Days Ago}

Senior Software Engineer - Inference as a Service

Hiring In US, CA, Santa Clara

full-time

Sourced

PhD

Highly Experienced

NVIDIA

Machine Learning Intern - Dynamic KV-Cache Modeling for Efficient LLM Inference

At d-Matrix , we are focused on unleashing the potential of generative AI to power the transformation of technology. We are at the forefront of software and hardware innovation, pushing the boundaries of what is possible. Our culture is one of respect and collaboration. We value humility and believe......

Hiring In Santa Clara

Intern

Sourced

Bachelor's (B.Sc.)

General

d-Matrix

Senior AI Inference Compiler Engineer

This position is posted by Jobgether on behalf of a partner company. We are currently looking for a Senior AI Inference Compiler Engineer in the United States.This role offers the opportunity to advance the performance and efficiency of AI inference engines across GPUs, personal devices, robotics, a......

Hiring In United States Of America

Remote

Sourced

Bachelor's (B.Sc.)

3 Years

Jobgether

AI Engineer & Researcher, Inference - Salt Lake City, USA

Hiring In Salt Lake City, USA

full-time

Sourced

PhD

General

Speechify...

AI Engineer & Researcher, Inference - Denver, USA

Hiring In Denver, USA

full-time

Sourced

PhD

General

Speechify...

_{More Than 30 Days Ago}

Senior System Software Engineer - AI Data Platform - Inference Factory Optimization

Our team is building the foundational infrastructure that powers NVIDIA's cutting-edge innovations in AI and high-performance computing. We are seeking a Senior Software Engineer to design, build, and optimize highly scalable and reliable automation systems that ensure the peak performance and seaml......

Hiring In Vietnam, Hanoi

full-time

Sourced

Bachelor's (B.A.)

5-years

NVIDIA

Software Engineer, Load Balancing - Inference

Hiring In San Francisco

full-time

Sourced

Bachelor's (B.Sc.)

General

OpenAI

Staff Applied Scientist - Causal Inference

Ready to be pushed beyond what you think you’re capable of? At Coinbase, our mission is to increase economic freedom in the world. It’s a massive, ambitious opportunity that demands the best of us, every day, as we build the emerging onchain platform — and with it, the future global financial system......

Hiring In Remote - USA

Remote

Sourced

OND

8-years

Coinbase ...

Senior Principal Machine Learning Engineer, Distributed vLLM Inference with Kubernetes

Hiring In Boston

full-time

Sourced

Associate

General

Red Hat, ...

_{Posted 13 Days Ago}

Senior DL Algorithms Engineer - Inference Performance

Hiring In US, CA, Santa Clara

full-time

Sourced

PhD

5-years

NVIDIA

AI Engineer & Researcher, Inference - Raleigh-Durham, USA

Hiring In Raleigh-Durham, USA

full-time

Sourced

PhD

General

Speechify...

Senior Research Engineer, TikTok AI Search (LLM Pretraining/Alignment/Inference)

Responsibilities About the team On the TikTok Search Team, you will have the opportunity to develop and apply cutting edge machine learning technologies in real-time large-scale systems, which serve billions of search requests every day. Via advanced NLP and multi-modal models, our projects impa......

Hiring In San Jose

full-time

Sourced

Bachelor's (B.Sc.)

5-years

Tiktok

LLM Inference Deployment Engineer

EnCharge AI is a leader in advanced AI hardware and software systems for edge-to-cloud computing. EnCharge’s robust and scalable next-generation in-memory computing technology provides orders-of-magnitude higher compute efficiency and density compared to today’s best-in-class solutions. The high-per......

Hiring In U.S., Canada, Germany, Norway

full-time

Sourced

OND

General

Encharge ...

_{More Than 30 Days Ago}

Senior Software Engineer, AI Inference Systems

Hiring In US, CA, Santa Clara

full-time

Sourced

OND

7-years

NVIDIA

Senior/Staff Software Engineer - Machine Learning Platform (Inference)

Snowflake is about empowering enterprises to achieve their full potential — and people too. With a culture that’s all in on impact, innovation, and collaboration, Snowflake is the sweet spot for building big, moving fast, and taking technology — and careers — to the next level. Build the future of d......

Hiring In US-CA-Menlo Park

full-time

Sourced

Bachelor's (B.Sc.)

General

Snowflake...

Staff Data Scientist, Inference - Customer Support

Hybrid

Sourced

PhD

9-years

Airbnb In...

Thank You For Visiting Jobserver