CiteVQA: Benchmarking Evidence Attribution for Trustworthy Document Intelligence Paper • 2605.12882 • Published 13 days ago • 266
longtermrisk/Qwen3-8B-selectivesftjob-b2c5751cecde-plain-g0.0-b0.0-d0.0-s1234 Updated 12 days ago • 1
PrefixGuard: From LLM-Agent Traces to Online Failure-Warning Monitors Paper • 2605.06455 • Published 19 days ago • 3
Leveraging Verifier-Based Reinforcement Learning in Image Editing Paper • 2604.27505 • Published 26 days ago • 57
Context-Value-Action Architecture for Value-Driven Large Language Model Agents Paper • 2604.05939 • Published Apr 7 • 10
Beyond Hard Negatives: The Importance of Score Distribution in Knowledge Distillation for Dense Retrieval Paper • 2604.04734 • Published Apr 6 • 13
UniRecGen: Unifying Multi-View 3D Reconstruction and Generation Paper • 2604.01479 • Published Apr 1 • 7
CARLA-Air: Fly Drones Inside a CARLA World -- A Unified Infrastructure for Air-Ground Embodied Intelligence Paper • 2603.28032 • Published Mar 30 • 342
FIPO: Eliciting Deep Reasoning with Future-KL Influenced Policy Optimization Paper • 2603.19835 • Published Mar 20 • 351