
Multimodal Research Hub - Vision-Language Models (VLMs)
A living resource for Vision-Language Models & multimodal learning

MATS, a behavioral audit for vision-language models, identifies systematic failures in spatial consistency and suggests repair paths through activation patching.