Machine Learning Operations Lead job opportunity at Together AI.



Together AI Machine Learning Operations Lead
Experience: 6 years
Employment type: Full-time

Machine Learning

Degree: Bachelor's (B.Sc.)
Location: San Francisco, United States of America

You will be in charge of designing and scaling our ML processes and tooling at production scale, optimizing operations to ensure availability and reliability for our services across differing tenants and user loads and in a multi-cluster deployment.

You will serve as a passionate advocate for internal and external customers, providing feedback to the wider engineering and infrastructure teams to improve our systems and core business metrics.

If you thrive in a collaborative, problem-solving environment and are driven to deliver operational excellence, we encourage you to apply for this exciting opportunity.

- Own availability and performance SLAs for production inference and fine-tuning services across serverless and dedicated deployments
- Own and improve testing, deployment, configuration management, and monitoring practices for multi-cluster ML infrastructure, partnering closely with Infra SREs
- Build self-serve tooling and automation to reduce operational toil and enable internal users (MLOps, customer experience) and self-serve offerings
- Define and enforce configuration best practices for inference engines (vLLM, tvLLM, Pulsar) to prevent runtime issues
- Lead incident response, conduct postmortems, and drive reliability improvements
- Hire, mentor, and grow an MLOps engineering team
- Partner with infrastructure and ML engineering teams to improve system reliability and cost efficiency
