Exactly the point I wanted to raise. Meta's own blog acknowledged that evaluation awareness may affect model behavior on a small subset of alignment evaluations. They said it was not a blocking concern for release, which is one way to characterize it. Warranting further research while simultaneously shipping to billions of users is another way.
