MRAG-Bench: Vision-Centric Evaluation for Retrieval-Augmented Multimodal Models Paper • 2410.08182 • Published Oct 10, 2024
Persistent Personas? Role-Playing, Instruction Following, and Safety in Extended Interactions Paper • 2512.12775 • Published Dec 14, 2025 • 2
ImpliRet: Benchmarking the Implicit Fact Retrieval Challenge Paper • 2506.14407 • Published Jun 17, 2025 • 2
Collapse of Dense Retrievers: Short, Early, and Literal Biases Outranking Factual Evidence Paper • 2503.05037 • Published Mar 6, 2025 • 4
Collapse of Dense Retrievers: Short, Early, and Literal Biases Outranking Factual Evidence Paper • 2503.05037 • Published Mar 6, 2025 • 4
NoLiMa: Long-Context Evaluation Beyond Literal Matching Paper • 2502.05167 • Published Feb 7, 2025 • 16