Dataset - a JuanRafap Collection

JuanRafap 's Collections

Fondation model

Dataset

updated Apr 14

DeepDistill: Enhancing LLM Reasoning Capabilities via Large-Scale Difficulty-Graded Data Training

Paper • 2504.17565 • Published Apr 24, 2025 • 2
AI-MO/NuminaMath-1.5

Viewer • Updated Jan 29 • 896k • 7.28k • 191
PrimeIntellect/synthetic-code-understanding

Viewer • Updated Feb 15, 2025 • 60.6k • 141 • 20
Go to Zero: Towards Zero-shot Motion Generation with Million-scale Data

Paper • 2507.07095 • Published Jul 9, 2025 • 56
VeriGUI: Verifiable Long-Chain GUI Dataset

Paper • 2508.04026 • Published Aug 6, 2025 • 164
allenai/CoSyn-400K

Viewer • Updated Feb 28, 2025 • 408k • 3.39k • 51
nvidia/Granary

Viewer • Updated about 1 month ago • 113M • 3.03k • 212
jupyter-agent/jupyter-agent-dataset

Viewer • Updated Sep 10, 2025 • 95.8k • 1.57k • 170
HuggingFaceM4/FineVision

Viewer • Updated Oct 21, 2025 • 24.2M • 161k • 507
PersonaX: Multimodal Datasets with LLM-Inferred Behavior Traits

Paper • 2509.11362 • Published Sep 14, 2025 • 5
ScaleCUA: Scaling Open-Source Computer Use Agents with Cross-Platform Data

Paper • 2509.15221 • Published Sep 18, 2025 • 111
MARS2 2025 Challenge on Multimodal Reasoning: Datasets, Methods, Results, Discussion, and Outlook

Paper • 2509.14142 • Published Sep 17, 2025 • 10
MMAT-1M: A Large Reasoning Dataset for Multimodal Agent Tuning

Paper • 2507.21924 • Published Jul 29, 2025 • 1
ScaleAI/SWE-bench_Pro

Benchmark • Updated Feb 23 • 731 • 69.7k • 157
nvidia/NitroGen

Updated Jan 12 • 1.12k • 217
Hierarchical Dataset Selection for High-Quality Data Sharing

Paper • 2512.10952 • Published Dec 11, 2025 • 2
FiNERweb: Datasets and Artifacts for Scalable Multilingual Named Entity Recognition

Paper • 2512.13884 • Published Dec 15, 2025 • 15
RedBench: A Universal Dataset for Comprehensive Red Teaming of Large Language Models

Paper • 2601.03699 • Published Jan 7 • 8
Extreme Multi-Label Skill Extraction Training using Large Language Models

Paper • 2307.10778 • Published Jul 20, 2023
tencent/CL-bench

Viewer • Updated Feb 6 • 1.9k • 993 • 145
DeepVision-103K: A Visually Diverse, Broad-Coverage, and Verifiable Mathematical Dataset for Multimodal Reasoning

Paper • 2602.16742 • Published Feb 18 • 12
SWE-rebench V2: Language-Agnostic SWE Task Collection at Scale

Paper • 2602.23866 • Published Feb 27 • 92
open-index/hacker-news

Updated 5 minutes ago • 25.4k • 338
nvidia/SPEED-Bench

Viewer • Updated Apr 28 • 8.56k • 10.9k • 36
Neural Computers

Paper • 2604.06425 • Published Apr 7 • 32