AI Infra Engineer job opportunity at Perplexity AI.



bot
Perplexity AI AI Infra Engineer
Experience: 5-years
Pattern: full-time
apply Apply Now
Salary:
Status:

Engineer

Copy Link Report
degreeProfessional Certificate
loacation San Francisco, United States Of America
loacation San Francisco....United States Of America

Design, deploy, and maintain scalable Kubernetes clusters for #AI model inference and training workloads __ Manage and optimize Slurm-based #HPC environments for distributed training of large language #models __ Develop robust APIs and orchestration systems for both training pipelines and inference services __ Implement resource scheduling and job management systems across heterogeneous compute environments __ Benchmark system performance, diagnose bottlenecks, and implement improvements across both training and inference infrastructure __ Build monitoring, alerting, and observability solutions tailored to ML workloads running on #Kubernetes and Slurm __ Respond swiftly to system outages and collaborate across teams to maintain high uptime for critical training runs and inference services __ Optimize cluster utilization and implement autoscaling strategies for dynamic workload demands

Other Ai Matches

Search Golang Engineer Applicants are expected to have a solid experience in handling Engineer related tasks
Search DevOps Engineer (London, Belgrade, Berlin) Applicants are expected to have a solid experience in handling Engineer related tasks
Senior Machine Learning Engineer - Search Applicants are expected to have a solid experience in handling Engineer related tasks
Senior Java Developer - Search Core Applicants are expected to have a solid experience in handling developer related tasks
Cloud Security Engineer Applicants are expected to have a solid experience in handling security engineer related tasks
AI Research Lead Applicants are expected to have a solid experience in handling Research Lead related tasks
Search Machine Learning Research Engineer (Berlin) Applicants are expected to have a solid experience in handling Engineer related tasks
Backend Software Engineer - Mobile Applicants are expected to have a solid experience in handling Software Engineer related tasks
Product Quality Assurance Lead Applicants are expected to have a solid experience in handling Assurance Lead related tasks
Frontend Software Engineer Applicants are expected to have a solid experience in handling Engineer related tasks
Growth Product Manager — Conversion Applicants are expected to have a solid experience in handling Product Management related tasks
Product Designer - Growth Applicants are expected to have a solid experience in handling Product Designer related tasks
Detection & Response Engineer Applicants are expected to have a solid experience in handling Engineer related tasks
Senior/Staff Engineer - Reliability (SRE) Applicants are expected to have a solid experience in handling Engineer related tasks
AI Inference Engineer Applicants are expected to have a solid experience in handling Engineer related tasks
Senior Backend/Infrastructure Engineer - Search Applicants are expected to have a solid experience in handling backend engineer related tasks
Engineering Site Lead - London Applicants are expected to have a solid experience in handling Engineering related tasks
Application Security Engineer Applicants are expected to have a solid experience in handling security engineer related tasks
Developer Relations Manager - API Platform Applicants are expected to have a solid experience in handling Relations Manager related tasks
IT Systems Administrator Applicants are expected to have a solid experience in handling Administrator related tasks
Desktop Browser Product Manager (London, Belgrade, Remote) Applicants are expected to have a solid experience in handling Software Development related tasks
Analytics Engineer Applicants are expected to have a solid experience in handling Engineer related tasks
Research Resident Applicants are expected to have a solid experience in handling resident related tasks