Cool new dataset from @isidentical: https://huggingface.co/datasets/isidentical/moondream2-coyo-5M-captions
The VeCLIP paper showed a +3% gain while only using 14% of the data by synthetically captioning like this. You get diversity from the alt text (middle column) without having to deal with all of the noise.