RL Environments Specialist job opportunity at xAI.



xAI RL Environments Specialist
Experience: General
Pattern: Remote

Human Data


About xAI

xAI’s mission is to create AI systems that can accurately understand the universe and aid humanity in its pursuit of knowledge. Our team is small, highly motivated, and focused on engineering excellence. This organization is for individuals who appreciate challenging themselves and thrive on curiosity. We operate with a flat organizational structure. All employees are expected to be hands-on and to contribute directly to the company’s mission. Leadership is given to those who show initiative and consistently deliver excellence. Work ethic and strong prioritization skills are important. All employees are expected to have strong communication skills. They should be able to concisely and accurately share knowledge with their teammates.

About the Role

We need talented engineers who will create full RL environments (UI, backend, programmatic task generation, and validation) for training computer-use agents. This means you will take ownership of the entire task creation process for a given environment.

In this role, you will:
Build sandbox UIs that our agents and RL actors will interact with.
Create tasks for built environments and programmatically validate task completion.
Enjoy working remotely.

Qualifications

Strong professional experience with React.js (hooks, modern state management, TypeScript preferred) (required)
Strong professional experience building backend services in Python (FastAPI, Flask, or Django) (required)
Hands-on experience with containerization (Docker required; Docker Compose/Kubernetes a plus)
Strong front-end design skills and exceptionally high taste in UI/UX, polish, and visual detail
Proven ability to design a relational database schema in Python and populate it with large-scale, realistic mock data
Experience creating and exposing clean, well-documented API endpoints (REST or GraphQL)
Exceedingly high standards for code quality, readability, testing, and front-end craftsmanship
Extensive day-to-day experience using coding agents / AI assistants as a power user (Cursor, Claude, Copilot, Grok, Aider, etc.)
Good understanding of the Reinforcement Learning paradigm (RLHF, PPO, DPO, reward modeling, etc.)

Preferred Qualifications

Possesses strong logical reasoning skills, is detail-oriented, and thrives in a fast-paced work environment.
Eager to teach and learn from teammates.
Enthusiastic about collaboratively building the best truth-seeking AI out there!

Interview Process

Technical hands-on live coding round
Hiring Manager / Final interview round

Location & Other Expectations

Tutor roles may be offered as full-time, part-time, or contractor positions, depending on role needs and candidate fit. They may be performed remotely from any location worldwide, subject to legal eligibility, time-zone compatibility, and role-specific needs.
For US-based candidates, please note we are unable to hire in the states of Wyoming and Illinois at this time.
We are unable to provide visa sponsorship.
For those who will be working from a personal device, your computer must be a Chromebook, a Mac with macOS 11.0 or later, or a PC with Windows 10 or later.

Compensation

US-based candidates: $35/hour - $100/hour, depending on factors including relevant experience, skills, education, geographic location, and qualifications.
International candidates: Information will be provided to you during the recruitment process.

Benefits

Benefits vary based on employment type, location, and jurisdiction. Benefits for eligible U.S.-based positions include health insurance, a 401(k) plan, and paid sick leave. Specific details and role-specific information will be provided to you during the interview process.

xAI is an equal opportunity employer. For details on data processing, view our Recruitment Privacy Notice.
