The Alignment Research Center (ARC) is a nonprofit research organisation that aims to help align future machine learning systems with human interests. They focus on conceptual work, developing alignment strategies that may be promising directions for empirical work.
They also have an evaluations team focused on evaluating the capabilities (and in the future, alignment) of advanced machine learning models. We have a separate page where you can see more information and open roles on this project.
No active listings