Scale AI - San Francisco, CA
posted 4 days ago
As a Research Scientist focused on Frontier Risk Evaluations, you will design and create evaluation measures, harnesses and datasets for measuring the risks posed by frontier AI systems. For example, you might do any or all of the following: Design and build harnesses to test AI agents for dangerous capabilities such as hacking or exploiting security vulnerabilities; Develop and run human-in-the-loop tests of AI capabilities to deceive, manipulate, blackmail, or otherwise engage in social engineering; Work with government agencies or other labs to collectively scope and design evaluations to measure and mitigate risks posed by advanced AI systems.
Match and compare your resume to any job description
Start Matching