AmanPriyanshu/regularizer-250K-from-reasoning-and-tool-use-sft-4M-random-compilation Viewer • Updated 2 days ago • 250k • 1
AmanPriyanshu/regularizer-250K-from-reasoning-sft-3M-random-compilation Viewer • Updated 6 days ago • 250k • 31
AmanPriyanshu/tool-reasoning-sft-RESEARCH-rlvr-env-retrieval-source Viewer • Updated 14 days ago • 156k • 33
AmanPriyanshu/tool-reasoning-sft-RESEARCH-openresearcher-dataset-sft-deep-research-agent-data-cleaned Updated 15 days ago • 470 • 1
AmanPriyanshu/tool-reasoning-sft-RESEARCH-OpenHands-CodeScout_Training_Rollouts Viewer • Updated 15 days ago • 56.8k • 30
AmanPriyanshu/tool-reasoning-sft-RESEARCH-OpenSeeker-v1-Data Viewer • Updated 15 days ago • 7.19k • 18
AmanPriyanshu/tool-reasoning-sft-RESEARCH-REDSearcher_SFT_10K Viewer • Updated 15 days ago • 9.05k • 20
AmanPriyanshu/reasoning-sft-poor-quality-reasoning-sample-mix Viewer • Updated 23 days ago • 150k • 22
AmanPriyanshu/reasoning-sft-minimax-microsoft-orca-agentinstruct-1M-v1 Viewer • Updated 24 days ago • 945k • 89 • 1
AmanPriyanshu/reasoning-sft-minimax-stratified-kmeans-diverse-reasoning-842K-only Viewer • Updated 24 days ago • 843k • 42
AmanPriyanshu/tool-reasoning-sft-TOOLS-toucan-1.5m-sft-tool-use-data-cleaned-rectified-333k Viewer • Updated 25 days ago • 566k • 44
AmanPriyanshu/RLVR-Env-Retrieval-Source-Retrieval-Synthetic-NVDocs-v1 Viewer • Updated 25 days ago • 100k • 44
AmanPriyanshu/tool-reasoning-sft-CODING-nvidia-Nemotron-Agentic-v1 Viewer • Updated 26 days ago • 331k • 27
AmanPriyanshu/reasoning-sft-Nemotron-Instruction-Following-Chat-v1 Viewer • Updated 26 days ago • 158k • 34
AmanPriyanshu/tool-reasoning-sft-RESEARCH-grill-lab-browsecomp-plus-runs-data-cleaned-rectified Viewer • Updated 29 days ago • 49.9k • 65
AmanPriyanshu/reasoning-sft-Edge-Agent-Reasoning-WebSearch-260K Viewer • Updated 29 days ago • 262k • 34
AmanPriyanshu/tool-reasoning-sft-CODING-allenai-SERA-data-cleaned-rectified Viewer • Updated 29 days ago • 211k • 37
AmanPriyanshu/tool-reasoning-sft-TOOLS-hermes-reasoning-tool-style-data-cleaned-rectified-115k Viewer • Updated 29 days ago • 115k • 39
AmanPriyanshu/RLVR-Env-Retrieval-Source-code-search-net-python Viewer • Updated 30 days ago • 100k • 45
AmanPriyanshu/RLVR-Env-Retrieval-Source-code-search-net-javascript Viewer • Updated 30 days ago • 100k • 27
AmanPriyanshu/tool-reasoning-sft-CODING-CoVe-12k-data-cleaned-rectified Viewer • Updated Mar 7 • 12k • 33