Text2Zinc: A Cross-Domain Dataset for Modeling Optimization and Satisfaction Problems in MiniZinc Paper • 2503.10642 • Published Feb 22, 2025 • 2
Follow-Up Differential Descriptions: Language Models Resolve Ambiguities for Image Classification Paper • 2311.07593 • Published Nov 10, 2023
Beyond Contrastive Learning: Synthetic Data Enables List-wise Training with Multiple Levels of Relevance Paper • 2503.23239 • Published Mar 29, 2025 • 1
What Do VLMs NOTICE? A Mechanistic Interpretability Pipeline for Noise-free Text-Image Corruption and Evaluation Paper • 2406.16320 • Published Jun 24, 2024 • 3
Forgotten Polygons: Multimodal Large Language Models are Shape-Blind Paper • 2502.15969 • Published Feb 21, 2025 • 2
Towards System 2 Reasoning in LLMs: Learning How to Think With Meta Chain-of-Though Paper • 2501.04682 • Published Jan 8, 2025 • 99
IsoScore: Measuring the Uniformity of Embedding Space Utilization Paper • 2108.07344 • Published Aug 16, 2021 • 1
PERSONA: A Reproducible Testbed for Pluralistic Alignment Paper • 2407.17387 • Published Jul 24, 2024 • 20
Planetarium: A Rigorous Benchmark for Translating Text to Structured Planning Languages Paper • 2407.03321 • Published Jul 3, 2024 • 20
Learning to Generate Instruction Tuning Datasets for Zero-Shot Task Adaptation Paper • 2402.18334 • Published Feb 28, 2024 • 12
Suppressing Pink Elephants with Direct Principle Feedback Paper • 2402.07896 • Published Feb 12, 2024 • 11
Neural MMO 2.0: A Massively Multi-task Addition to Massively Multi-agent Learning Paper • 2311.03736 • Published Nov 7, 2023 • 12
Diffusion Models as Artists: Are we Closing the Gap between Humans and Machines? Paper • 2301.11722 • Published Jan 27, 2023