Member of Technical Staff - RL Infrastructure [data, evals, agent] job opportunity at xAI.



bot
xAI Member of Technical Staff - RL Infrastructure [data, evals, agent]
Experience: General
Pattern: full-time
apply Apply Now
Salary:
Status:

Foundation Model

Copy Link Report
degreeOND
loacation Palo Alto, CA; San Francisco, CA, United States Of America
loacation Palo Alto, CA;..........United States Of America

About xAI xAI’s mission is to create AI systems that can accurately understand the universe and aid humanity in its pursuit of knowledge. Our team is small, highly motivated, and focused on engineering excellence. This organization is for individuals who appreciate challenging themselves and thrive on curiosity. We operate with a flat organizational structure. All employees are expected to be hands-on and to contribute directly to the company’s mission. Leadership is given to those who show initiative and consistently deliver excellence. Work ethic and strong prioritization skills are important. All employees are expected to have strong communication skills. They should be able to concisely and accurately share knowledge with their teammates.  About the Role xAI is seeking experienced software engineers to create robust data pipelines, comprehensive evaluations for benchmarking LLMs, and automation frameworks to increase the productivity of researchers and engineers. Focus Creating and maintaining frameworks for agent, data, and model evaluation tasks. Building environments for AI agents. Tools for automating common workflows. Improving alerts, metrics and error handling on large scale RL jobs. Refactoring existing agent, data, eval, training frameworks for better modularity. Designing operation procedures and coding standards to streamline the transition from small scale experimentation to large scale RL training.  Writing unit tests, CI/CD frameworks to support rapid development cycles. Ideal Experience Experience building and maintaining frameworks that are used by many engineers. Experience in building high-performance sandboxes, virtual machines, and simulations. Experience building full-stack apps for automating workflows and data visualization. Experience in rapid iteration of research to production cycles. Experience in test automation, CI/CD. Typical problems you will deal with We have a new agentic model capability that we’d like to improve. How do we design an efficient and robust environment for the agent to perform actions in? Evaluations and observability are a core part of knowing what we need to improve in our models. What new features can we add into our evaluation framework to ease the workflow of researchers & engineers and increase observability? A new open-source evaluation dataset has been released and researchers would like to track our models performance on it. How should we onboard it into our internal evaluation framework? Datasets have been collected that require complex pre-processing to prepare it for large-scale RL training. How do we standardize our preprocessing pipelines to minimize dataset onboarding time? A researcher on the team has an idea for how to augment a dataset to produce additional training data. How should we go about creating the data augmentation pipeline? Tech Stack Python / Rust / C++ Typescript / React Location The role is based in the Bay Area [San Francisco and Palo Alto]. Candidates are expected to be located in the Bay Area or open to relocation. Interview Process After submitting your application, the team reviews your CV and statement of exceptional work. If your application passes this stage, you will be invited to a 15 minute interview (“phone interview”) during which a member of our team will ask some basic questions. If you clear the initial phone interview, you will enter the main process, which consists of four technical interviews: Coding assessment in a language of your choice. Two systems hands-on: Demonstrate practical skills in live problem-solving sessions that involve both system design and coding. Meet the Team: Present your past exceptional work and your vision with xAI to a small audience. Our goal is to finish the main process within one week. All interviews will be conducted via Google Meet. Annual Salary Range $180,000 - $440,000 USD Benefits Base salary is just one part of our total rewards package at xAI, which also includes equity, comprehensive medical, vision, and dental coverage, access to a 401(k) retirement plan, short & long-term disability insurance, life insurance, and various other discounts and perks.xAI is an equal opportunity employer. For details on data processing, view our Recruitment Privacy Notice.

Other Ai Matches

Manager, Law Enforcement Response Team Applicants are expected to have a solid experience in handling Legal related tasks
Software Engineer - MacOS Applicants are expected to have a solid experience in handling Product related tasks
Member of Technical Staff, RL Training Framework Applicants are expected to have a solid experience in handling Foundation Model related tasks
Member of Technical Staff, Recommendation Systems Applicants are expected to have a solid experience in handling Product related tasks
Senior Legal Counsel Applicants are expected to have a solid experience in handling Legal related tasks
remote-jobserver Remote
Investment Banking Expert - DCM Applicants are expected to have a solid experience in handling Human Data related tasks
Member of Technical Staff, Pre-training Data Infrastructure Applicants are expected to have a solid experience in handling Foundation Model related tasks
Facilities Operations Manager Applicants are expected to have a solid experience in handling Data Center Operations related tasks
Backend Engineer - Grok Chat Applicants are expected to have a solid experience in handling Product related tasks
Member of Technical Staff, Pre-training Data Scaling Applicants are expected to have a solid experience in handling Foundation Model related tasks
Director, Revenue Ops Applicants are expected to have a solid experience in handling Finance related tasks
Senior Backend Engineer - Starfleet Applicants are expected to have a solid experience in handling Engineering related tasks
remote-jobserver Remote
Model Behavior Tutor - Style, Taste & Aesthetics Applicants are expected to have a solid experience in handling Human Data related tasks
Member of Technical Staff - Macrohard Applicants are expected to have a solid experience in handling Engineering related tasks
Network Engineer - Edge Applicants are expected to have a solid experience in handling Engineering related tasks
Software Engineer - Infrastructure/Supercomputing Applicants are expected to have a solid experience in handling Infrastructure related tasks
Software Engineer - Networking Software and Services Applicants are expected to have a solid experience in handling Infrastructure related tasks
remote-jobserver Remote
Civil Engineering Tutor Applicants are expected to have a solid experience in handling Human Data related tasks
Member of Technical Staff - Coding Agents, Post Training - RL, Evals Applicants are expected to have a solid experience in handling Foundation Model related tasks
Application Security Engineer Applicants are expected to have a solid experience in handling Information Security related tasks
remote-jobserver Remote
Finance Expert - Quantitative Trading Applicants are expected to have a solid experience in handling Human Data related tasks
Member of Technical Staff, Product Safety Applicants are expected to have a solid experience in handling Product related tasks
Member of Technical Staff, Applied Inference Applicants are expected to have a solid experience in handling Foundation Model related tasks