Huixin Zhang's picture

7 2

Huixin Zhang

ZhangHuixin

·

ZhangHuixin1103

AI & ML interests

None yet

Recent Activity

upvoted a paper about 2 months ago

VIDEOP2R: Video Understanding from Perception to Reasoning

upvoted a paper 4 months ago

InternVL3.5: Advancing Open-Source Multimodal Models in Versatility, Reasoning, and Efficiency

updated a model 5 months ago

ZhangHuixin/llama-3.1-8b-math

View all activity

Organizations

None yet

upvoted a paper about 2 months ago

VIDEOP2R: Video Understanding from Perception to Reasoning

Paper • 2511.11113 • Published Nov 14, 2025 • 112

upvoted a paper 4 months ago

InternVL3.5: Advancing Open-Source Multimodal Models in Versatility, Reasoning, and Efficiency

Paper • 2508.18265 • Published Aug 25, 2025 • 211

upvoted 2 papers 6 months ago

MMHU: A Massive-Scale Multimodal Benchmark for Human Behavior Understanding

Paper • 2507.12463 • Published Jul 16, 2025 • 26

4KAgent: Agentic Any Image to 4K Super-Resolution

Paper • 2507.07105 • Published Jul 9, 2025 • 105

upvoted 2 papers 7 months ago

MoDoMoDo: Multi-Domain Data Mixtures for Multimodal LLM Reinforcement Learning

Paper • 2505.24871 • Published May 30, 2025 • 23

DINO-R1: Incentivizing Reasoning Capability in Vision Foundation Models

Paper • 2505.24025 • Published May 29, 2025 • 27

upvoted an article 10 months ago

Article

Llama can now see and run on your device - welcome Llama 3.2

+5

Sep 25, 2024

•

191