Looking to Learn: Token-wise Dynamic Gating for Low-Resource Vision-Language Modelling
Paper
• 2510.08470 • Published
• 1
Cambridge NLIP Multimodal BabyLM 2025: decoder with token-wise dynamic gating, feature modulation, and channel attention