Site Reliability Engineer (SRE) job opportunity at xAI.



bot
xAI Site Reliability Engineer (SRE)
Experience: General
Pattern: full-time
apply Apply Now
Salary:
Status:

Product

Copy Link Report
degreeOND
loacation London, UK, United Kingdom
loacation London, UK....United Kingdom

About xAI xAI’s mission is to create AI systems that can accurately understand the universe and aid humanity in its pursuit of knowledge. Our team is small, highly motivated, and focused on engineering excellence. This organization is for individuals who appreciate challenging themselves and thrive on curiosity. We operate with a flat organizational structure. All employees are expected to be hands-on and to contribute directly to the company’s mission. Leadership is given to those who show initiative and consistently deliver excellence. Work ethic and strong prioritization skills are important. All employees are expected to have strong communication skills. They should be able to concisely and accurately share knowledge with their teammates.  About the team You will work on the team that is responsible for the backend services that power our products such as grok.com and the API. We focus on writing and maintaining highly scalable and reliable services that can efficiently process tens of thousands of queries per second. The services are hosted on a number of Kubernetes clusters (on-prem & cloud). About the role An ideal candidate meets at least the following requirements: Expert knowledge of Kubernetes. Expert knowledge of continuous deployment systems such as Buildkite and ArgoCD. Expert knowledge of monitoring technologies such as Prometheus, Grafana, and PagerDuty. Expert knowledge of infrastructure as code technologies such as Pulumi or Terraform. Familiarity with a systems programming language like Rust, C++ or Go Experience with traffic management and HTTP proxies such as nginx and envoy. Location This position is in-person in London, UK. We usually work from the office 5 days a week but allow for work-from-home days when required. Candidates must be willing to attend late meetings at least once a week to coordinate with the rest of our team in Palo Alto. Interview process After submitting your application, the team reviews your statement of exceptional work and CV. If your application passes this stage, the interview process is as follows: Initial technical screening during which a member of our team will ask some basic technical questions (15 minutes) Coding interview (45 minutes) Distributed System Design interview (45 minutes) Final stage with founding engineer Toby Pohlen (30 minutes) All interviews will be conducted via Google Meet. Benefits Base salary is just one part of our total rewards package at xAI, which also includes equity, comprehensive medical, vision, and dental coverage, access to our Aviva pension plan, short & long-term disability insurance, life insurance, and various other discounts and perks. Privacy PolicyxAI is an equal opportunity employer. For details on data processing, view our Recruitment Privacy Notice.

Other Ai Matches

remote-jobserver Remote
Biology Tutor Applicants are expected to have a solid experience in handling Human Data related tasks
Senior Sourcing Specialist- Indirect Applicants are expected to have a solid experience in handling Finance related tasks
Backend Engineer - Product Safety Applicants are expected to have a solid experience in handling Product related tasks
Member of Technical Staff, Grokipedia - Synthetic Data & Epistemics Applicants are expected to have a solid experience in handling Foundation Model related tasks
Site Ops Lead Applicants are expected to have a solid experience in handling Data Center Operations related tasks
Member of Technical Staff, Web Scale Video Data Applicants are expected to have a solid experience in handling Foundation Model related tasks
Infrastructure Engineer - US Government Applicants are expected to have a solid experience in handling Product related tasks
remote-jobserver Remote
Model Behavior Tutor - Epistemic Rigor & Truthfulness Applicants are expected to have a solid experience in handling Human Data related tasks
Security Engineer, Detection and Response Applicants are expected to have a solid experience in handling Information Security related tasks
Site Reliability Engineer - Cybersecurity Applicants are expected to have a solid experience in handling Engineering related tasks
Manager, Law Enforcement Response Team Applicants are expected to have a solid experience in handling Legal related tasks
Senior Backend Engineer - Starfleet Applicants are expected to have a solid experience in handling Engineering related tasks
Member of Technical Staff, Interpretability Applicants are expected to have a solid experience in handling Engineering related tasks
People Operations Specialist Applicants are expected to have a solid experience in handling People related tasks
Member of Technical Staff, Product Safety Applicants are expected to have a solid experience in handling Product related tasks
Dispute Analyst, X Payments Applicants are expected to have a solid experience in handling Product related tasks
remote-jobserver Remote
Finance Expert - Private Credit Applicants are expected to have a solid experience in handling Human Data related tasks
Senior Accountant - Revenue Applicants are expected to have a solid experience in handling Finance related tasks
remote-jobserver Remote
AI Tutor - Image Specialist Applicants are expected to have a solid experience in handling Human Data related tasks
remote-jobserver Remote
Model Behavior Tutor - Wit & Conversation Applicants are expected to have a solid experience in handling Human Data related tasks
Fullstack Engineer - Education (Spanish Bilingual) Applicants are expected to have a solid experience in handling Engineering related tasks
Member of Technical Staff - Reasoning Post-training Applicants are expected to have a solid experience in handling Foundation Model related tasks
Member of Technical Staff - Infrastructure Reliability Applicants are expected to have a solid experience in handling Infrastructure related tasks