FaceLLM Collection A multimodal large language model trained specifically for facial image understanding. Project page: https://www.idiap.ch/paper/facellm β’ 3 items β’ Updated Jul 23, 2025 β’ 4
InternVL3.5: Advancing Open-Source Multimodal Models in Versatility, Reasoning, and Efficiency Paper β’ 2508.18265 β’ Published Aug 25, 2025 β’ 218
Running on Zero MCP 405 Multimodal OCR π 405 Nanonets / olmOCR / RolmOCR / Aya-Vision / Qwen2-VL-OCR
Paused Agents 9 Flux Quantized or Original? π 9 Generate and compare quantized images from prompts
Running 241 MedGemma - Radiology Explainer Demo π©Ί 241 Radiology Image & Report Explainer Demo. Built with MedGemma
view changelog Hugging Face Changelog Filter by MCP compatibility available in HF Spaces May 21, 2025 β’ 79
Runtime error Agents Featured 128 OctoTools π 128 An Agentic Framework with Tools for Complex Reasoning
AIMO Progress Prize Collection Models and datasets used in the winning solution to the AIMO 1st Progress Prize β’ 7 items β’ Updated Jul 19, 2024 β’ 14
Sleeping Questionnanswer Document π Upload document and ask question about it β Groq will answer