The authors argue that generative AI introduces a new class of alignment risks because interaction itself becomes a mechanism of influence. Humans adapt their behavior in response to AI outputs, ...
(A) An example of the screen in the experimental video. The aircraft and carrier in the red box are targets, which are hidden in the layer, wave, and island. (B) Experimental paradigm. (C) ...
Some results have been hidden because they may be inaccessible to you
Show inaccessible results