Tag
1 article
A small set of attention heads can steer a VLM to describe a chosen image region without retraining.