Principal Research Engineer, Gemini Evals job opportunity at DeepMind.

DeepMind Principal Research Engineer, Gemini Evals

Experience: 10-years

Pattern: full-time

Walk In

Apply Now

Salary:

Status:

GenAI

Copy Link Report

Associate

Hiring inbound within Mountain View, California, US

Snapshot This role is for a Principal level Research Engineer to lead the strategic development and execution of robust data pipelines, evaluation frameworks, and metric systems for the Gemini family of models and their associated product applications. As a key technical leader and individual contributor, you will apply deep expertise in large-scale machine learning, statistical rigor, and scalable engineering to ensure the safety, performance, and ethical alignment of our frontier AI systems before and after deployment. About us Artificial Intelligence could be one of humanity's most useful inventions. At Google DeepMind, we’re a team of scientists, engineers, machine learning experts, and more, working together to advance the state of the art in artificial intelligence. We use our technologies for widespread public benefit and scientific discovery, and collaborate with others on critical challenges, ensuring safety and ethics are the highest priority. This role is part of the Gemini Evaluation research teams. The Gemini Evals team defines success for Gemini, establishes metrics to track progress, and provides clear, actionable insights to guide development. As a Research Engineer on this team, you will be at the forefront of building the data and evaluation systems that ensure the safety and quality of the Gemini family of models. The Role As a Principle Research Engineer, you will operate as a technical expert and leader within the Gemini Data and Evaluation team. Your primary focus will be to architect and execute the rigorous evaluation and data systems that underpin all major model release and product launch decisions for Gemini. This is a highly cross-functional role requiring a blend of deep ML research, world-class software engineering, and strategic influence. You will define the data strategy for critical evaluation campaigns, design novel metrics to measure safety and performance at scale, and mentor a team of engineers and researchers to build high-quality, reproducible systems. You will be accountable for communicating complex evaluation results directly to leadership stakeholders to guide the responsible deployment of our most advanced AI technology. Key responsibilities Technical Leadership & Strategy Work on post-training evaluation and fine-tuning of large-scale models to improve performance and safety. Define and champion the technical roadmap for large-scale data and evaluation supporting the Gemini model family and its real-world applications Drive the research of novel, high-signal evaluation methods (automated, human-in-the-loop, and adversarial) to measure model capabilities, alignment, safety, and trustworthiness. Actively contribute to the broader scientific community by presenting findings on cutting-edge AI evaluation and safety methods. About You In order to set you up for success as a at Google DeepMind, we look for the following skills and experience: 10+ years of experience in researching engineering, with at least 5 years in a technical leadership role. Experience with large-scale machine learning systems, data processing pipelines and evaluation methodologies. Experience with large language models (LLMs) and their evaluation. Experience in post-training evaluation research

Other Ai Matches

Senior Technical Program Manager, Gemini Code Applicants are expected to have a solid experience in handling GenAI related tasks

Research Scientist, Agentic Safety Applicants are expected to have a solid experience in handling Science related tasks

Technical Program Manager, GeminiApp Applicants are expected to have a solid experience in handling GeminiApp related tasks

AI Product Designer, GeminiApp Mobile Experience Applicants are expected to have a solid experience in handling GeminiApp related tasks

Research Scientist, AnthroKrishi Applicants are expected to have a solid experience in handling Frontier AI related tasks

Technical Program Manager, RL Infrastructure & Reliability Applicants are expected to have a solid experience in handling GenAI related tasks

Machine Learning Software Engineer, GeminiApp Agents and Tool Use Applicants are expected to have a solid experience in handling GeminiApp related tasks

Silicon Technical Lead Applicants are expected to have a solid experience in handling GenAI related tasks

Research Engineer, AI for Weather and Energy Applicants are expected to have a solid experience in handling Frontier AI related tasks

Intelligence Lead, Security & Privacy, GenAI (18 months Fixed Term Contract) Applicants are expected to have a solid experience in handling GenAI related tasks

Staff Electrical Engineer, Gemini Robotics Applicants are expected to have a solid experience in handling Frontier AI related tasks

Research Scientist/Engineer, Model Threat Defense Applicants are expected to have a solid experience in handling GenAI related tasks

Administrative Business Partner, US, MTV, FTC Applicants are expected to have a solid experience in handling Central Operations, Responsibility, and Engagement related tasks

Team Lead, Research Engineering, AI for Chip Design Applicants are expected to have a solid experience in handling GenAI related tasks

Senior Model UX Content Designer, Gemini App Applicants are expected to have a solid experience in handling GeminiApp related tasks

Security Lead, Agentic Red Team Applicants are expected to have a solid experience in handling GenAI related tasks

Staff Data Scientist, GeminiApp Applicants are expected to have a solid experience in handling GeminiApp related tasks

Staff AI Product Designer, GeminiApp Device Experience Applicants are expected to have a solid experience in handling GeminiApp related tasks

Low power design engieer/micro-architect L5 Applicants are expected to have a solid experience in handling GenAI related tasks

Research Engineer - Multimodal Companion Agent Applicants are expected to have a solid experience in handling Frontier AI related tasks

Senior Engineer (Mobile), Gemini App, Google DeepMind Applicants are expected to have a solid experience in handling GeminiApp related tasks

Technical Program Manager, Frontier AI Research Applicants are expected to have a solid experience in handling Frontier AI related tasks

Administrative Business Partner - Mountain View, US - L4 Applicants are expected to have a solid experience in handling Central Operations, Responsibility, and Engagement related tasks

Principal Research Engineer, Gemini Evals job opportunity at DeepMind.

Saved Jobs

No Job Saved

Other Ai Matches