Member of Technical Staff - Enterprise Model Evaluation job opportunity at xAI.



bot
xAI Member of Technical Staff - Enterprise Model Evaluation
Experience: General
Pattern: full-time
apply Apply Now
Salary:
Status:

Engineering

Copy Link Report
degreeOND
loacation Palo Alto, CA, United States Of America
loacation Palo Alto, CA....United States Of America

About xAI xAI’s mission is to create AI systems that can accurately understand the universe and aid humanity in its pursuit of knowledge. Our team is small, highly motivated, and focused on engineering excellence. This organization is for individuals who appreciate challenging themselves and thrive on curiosity. We operate with a flat organizational structure. All employees are expected to be hands-on and to contribute directly to the company’s mission. Leadership is given to those who show initiative and consistently deliver excellence. Work ethic and strong prioritization skills are important. All employees are expected to have strong communication skills. They should be able to concisely and accurately share knowledge with their teammates.  About the Role The Model Evaluations team aims to design and implement xAI’s evaluations that shapes how we understand, measure and improve our model’s capabilities. You will work at the intersection of research and product to develop and implement model evaluations that give us high signal into merging model capabilities and robust evaluation infrastructure that enables fast iterations of our models. Your work will be essential to xAI’s mission of understanding the universe. You will collaborate closely with the training and product teams to ensure our models meet the highest standards before deployment. This is a technical leadership role where you will be expected to drive both the vision and implementation of our model evaluations. Responsibilities Design and implement next-generation evaluation suites beyond traditional benchmarks, creating frameworks that capture real-world utility and performance of Grok in production environments. Coordinate model evaluation efforts and collaborations to ensure comprehensive coverage and fast iterations. Integrate Grok into production systems, gain deep insights into real-world environments, and ensure alignment with user needs and business objectives. Partner with research teams to translate cutting-edge techniques and Grok models into production-ready implementations, optimizing for performance and impact. Exceptional candidates may have: Proven expertise in designing and implementing sophisticated evaluation frameworks for machine learning models, especially LLMs. Experience with statistical analysis, experimental design, and benchmarking AI systems in real-world settings. Location The role is based in Palo Alto. Our team usually works from the office 5 days a week but allow work-from-home days when required. Candidates are expected to be located near Palo Alto or open to relocation. Interview Process After submitting your application, the team reviews your CV and statement of exceptional work. If your application passes this stage, you will be invited to a 15 minute interview (“phone interview”) during which a member of our team will ask some basic questions. If you clear the initial phone interview, you will enter the main process, which consists of four technical interviews: Coding assessment (2): in a language of your choice to solve a logical, algorithmic problem meant to be solved in 40 minutes setting. System design: designing a scalable, reliable system for one of the real world problem demonstrating first-principles thinking by breaking down requirements, justifying architectural choices, and addressing challenges. Meet the Team: Present your past exceptional work and your vision with xAI to a small audience. Our goal is to finish the main process within one week. Final interviews will be conducted in person. Annual Salary Range $180,000 - $440,000 USD Benefits Base salary is just one part of our total rewards package at xAI, which also includes equity, comprehensive medical, vision, and dental coverage, access to a 401(k) retirement plan, short & long-term disability insurance, life insurance, and various other discounts and perks.xAI is an equal opportunity employer. For details on data processing, view our Recruitment Privacy Notice.

Other Ai Matches

Member of Technical Staff, Multimodal Understanding (Visual / Audio) Applicants are expected to have a solid experience in handling Foundation Model related tasks
remote-jobserver Remote
Pure Math Tutor Applicants are expected to have a solid experience in handling Human Data related tasks
remote-jobserver Remote
Finance Expert - Macro Research Analyst Applicants are expected to have a solid experience in handling Human Data related tasks
Software Engineer, Legal Operations Applicants are expected to have a solid experience in handling Engineering related tasks
remote-jobserver Remote
Personality & Behavior Tutor Applicants are expected to have a solid experience in handling Human Data related tasks
Client Partner Applicants are expected to have a solid experience in handling Sales related tasks
Member of Technical Staff, Voice (Enterprise) Applicants are expected to have a solid experience in handling Product related tasks
Manager, Law Enforcement Response Team Applicants are expected to have a solid experience in handling Legal related tasks
Mission Manager - International Government Applicants are expected to have a solid experience in handling Engineering related tasks
remote-jobserver Remote
Safety Tutor Applicants are expected to have a solid experience in handling Human Data related tasks
Operations & Maintenance (O&M) Lead Applicants are expected to have a solid experience in handling Data Center Operations related tasks
Talent Engineer Applicants are expected to have a solid experience in handling Engineering related tasks
Member of Technical Staff, Video Generation Applicants are expected to have a solid experience in handling Foundation Model related tasks
remote-jobserver Remote
Biology Tutor Applicants are expected to have a solid experience in handling Human Data related tasks
Backend Engineer Applicants are expected to have a solid experience in handling Product related tasks
remote-jobserver Remote
Economics Expert Applicants are expected to have a solid experience in handling Human Data related tasks
remote-jobserver Remote
Multilingual Tutor Applicants are expected to have a solid experience in handling Human Data related tasks
Site Reliability Engineer - Storage Applicants are expected to have a solid experience in handling Infrastructure related tasks
remote-jobserver Remote
Space Science Tutor Applicants are expected to have a solid experience in handling Human Data related tasks
Member of Technical Staff, World Model Applicants are expected to have a solid experience in handling Foundation Model related tasks
Site Coordinator Applicants are expected to have a solid experience in handling Data Center Operations related tasks
Engineering Technical Lead Manager (TLM) - Enterprise Applicants are expected to have a solid experience in handling Engineering related tasks
remote-jobserver Remote
Occupational Math Tutor Applicants are expected to have a solid experience in handling Human Data related tasks