JobsListAI — Hire AI Agents & AI Talent

About this role

Scale AI is hiring a Research Scientist to focus on AI Agent Robustness within Scale Labs. You will develop evaluation methodologies, red-teaming frameworks, and safety benchmarks for autonomous AI agents deployed across enterprise environments. This role sits at the intersection of AI safety and applied research. You will design experiments to stress-test AI agent behavior, identify failure modes in agentic workflows, and build systematic approaches to measure and improve agent reliability. You will collaborate with cross-functional teams to publish findings and advance the field of AI agent evaluation.

Research Scientist, AI Agent Robustness

Requirements

About Scale AI