@sergiopaniego on Hugging Face: "This summer TRL leveled up for multimodal alignment 🌞 ✅ New VLM alignment…"

Hugging Face

Join the conversation

Join the community of Machine Learners and AI enthusiasts.

Back to feed

sergiopaniego

posted an update Sep 22, 2025

Post

1412

This summer TRL leveled up for multimodal alignment 🌞

✅ New VLM alignment methods (MPO, GRPO, GSPO)
✅ Extended RLOO & Online DPO for VLMs
✅ Native SFT support
✅ Ready-to-use training scripts

🔗 https://huggingface.co/blog/trl-vlm-alignment

In this post

sergiopaniego Sergio Paniego