RevengeBench: Reverse Engineering Code-Space Policies from Behavioral Experiments
Babak Rahmani +4
For most of scientific history, researchers studying behavior could only infer hidden mechanisms from outward actions: an inverse problem that becomes more tractable when observation is augmented by targeted intervention.
arXiv→