Tag
1 articles
ART tunes frozen multimodal LLMs by optimizing a single image instead of model weights.