Finetune data - a Deventhedude Collection

Deventhedude 's Collections

computer use

Terminus

grounding data

TRADING/FINANCES DATASETS

Benchmarks

Safety Training

Embedding Fintune Datasets

updated Dec 11, 2025

Upvote

Two Minds Better Than One: Collaborative Reward Modeling for LLM Alignment

Paper • 2505.10597 • Published May 15, 2025
COIG-P: A High-Quality and Large-Scale Chinese Preference Dataset for Alignment with Human Values

Paper • 2504.05535 • Published Apr 7, 2025 • 44
nvidia/HelpSteer3

Viewer • Updated Nov 16, 2025 • 133k • 9.57k • 110
nvidia/Nemotron-RL-instruction_following

Preview • Updated Jan 12 • 263 • 15
nvidia/Nemotron-RL-knowledge-web_search-mcqa

Viewer • Updated Jan 12 • 2.93k • 126 • 15
nvidia/Nemotron-RL-agent-workplace_assistant

Viewer • Updated Feb 26 • 1.8k • 609 • 24
nvidia/Nemotron-RL-instruction_following-structured_outputs

Viewer • Updated Jan 12 • 9.95k • 294 • 37
nvidia/Nemotron-RL-knowledge-mcqa

Viewer • Updated Dec 12, 2025 • 686k • 842 • 12
nvidia/Nemotron-RL-math-OpenMathReasoning

Viewer • Updated Jan 12 • 113k • 537 • 17
nvidia/Nemotron-RL-knowledge-openqa

Viewer • Updated Dec 12, 2025 • 136k • 279 • 10
nvidia/Nemotron-RL-math-advanced_calculations

Viewer • Updated Dec 12, 2025 • 6k • 149 • 11
nvidia/Nemotron-AIQ-Agentic-Safety-Dataset-1.0

Viewer • Updated Dec 6, 2025 • 10.8k • 1.03k • 16
nvidia/Nemotron-VLM-Dataset-v2

Viewer • Updated Dec 18, 2025 • 4.58M • 16k • 90
nvidia/ProfBench

Viewer • Updated Mar 4 • 40 • 784 • 29
google/code_x_glue_cc_code_completion_token

Viewer • Updated Jan 24, 2024 • 178k • 622 • 9
google/code_x_glue_cc_cloze_testing_all

Viewer • Updated Jan 24, 2024 • 176k • 358 • 6
google/code_x_glue_cc_clone_detection_big_clone_bench

Viewer • Updated Jan 24, 2024 • 1.73M • 726 • 22
google/code_x_glue_ct_code_to_text

Viewer • Updated Jan 24, 2024 • 1.01M • 8.5k • 80
google/code_x_glue_tc_nl_code_search_adv

Viewer • Updated Jan 24, 2024 • 281k • 506 • 11
TeichAI/claude-sonnet-4.5-high-reasoning-250x

Viewer • Updated Oct 31, 2025 • 247 • 176 • 37
Idea2Plan: Exploring AI-Powered Research Planning

Paper • 2510.24891 • Published Oct 28, 2025
TGPR: Tree-Guided Policy Refinement for Robust Self-Debugging of LLMs

Paper • 2510.06878 • Published Oct 8, 2025 • 1
FML-bench: A Benchmark for Automatic ML Research Agents Highlighting the Importance of Exploration Breadth

Paper • 2510.10472 • Published Oct 12, 2025 • 9
Scientific Algorithm Discovery by Augmenting AlphaEvolve with Deep Research

Paper • 2510.06056 • Published Oct 7, 2025 • 6
RECODE-H: A Benchmark for Research Code Development with Interactive Human Feedback

Paper • 2510.06186 • Published Oct 7, 2025
AlphaResearch: Accelerating New Algorithm Discovery with Language Models

Paper • 2511.08522 • Published Nov 11, 2025 • 18
Anthropic/hh-rlhf

Viewer • Updated May 26, 2023 • 169k • 39.9k • 1.74k
open-thoughts/OpenThoughts3-1.2M

Viewer • Updated Jun 9, 2025 • 1.2M • 21.5k • 230
nex-agi/agent-sft

Preview • Updated Dec 9, 2025 • 362 • 106
Open-Bee/Honey-Data-15M

Viewer • Updated Mar 10 • 14.8M • 38k • 118
Jina Embeddings 2: 8192-Token General-Purpose Text Embeddings for Long Documents

Paper • 2310.19923 • Published Oct 30, 2023 • 15
nvidia/Nemotron-PII

Viewer • Updated Dec 17, 2025 • 200k • 4.03k • 100
HuggingFaceFW/fineweb

Viewer • Updated Jul 11, 2025 • 52.5B • 1.03M • 2.83k
rl-research/dr-tulu-sft-data

Viewer • Updated Nov 25, 2025 • 13.1k • 287 • 29
HuggingFaceFW/fineweb-2

Viewer • Updated Oct 27, 2025 • 4.48B • 55.7k • 807
miromind-ai/MiroVerse-v0.1

Viewer • Updated Jan 16 • 228k • 7.41k • 235
nvidia/Llama-Nemotron-Post-Training-Dataset

Viewer • Updated May 8, 2025 • 3.91M • 4.76k • 668
wikimedia/wikipedia

Viewer • Updated Jan 9, 2024 • 61.6M • 248k • 1.23k
HuggingFaceH4/MATH-500

Viewer • Updated Dec 15, 2025 • 500 • 176k • 310
nick007x/github-code-2025

Viewer • Updated Apr 1 • 148M • 1.24k • 117
GLM-4.5: Agentic, Reasoning, and Coding (ARC) Foundation Models

Paper • 2508.06471 • Published Aug 8, 2025 • 212
nvidia/ToolScale

Viewer • Updated Dec 17, 2025 • 4.06k • 1.6k • 198
natolambert/GeneralThought-430K-filtered

Viewer • Updated Mar 26, 2025 • 338k • 1.22k • 35
RJT1990/GeneralThoughtArchive

Viewer • Updated Sep 5, 2025 • 431k • 1.11k • 73
open-thoughts/OpenThoughts-114k

Viewer • Updated Aug 31, 2025 • 228k • 113k • 846
open-r1/OpenR1-Math-Raw

Viewer • Updated Feb 24, 2025 • 516k • 5.63k • 76
PrimeIntellect/SYNTHETIC-1

Viewer • Updated Feb 21, 2025 • 1.99M • 4.86k • 61
PrimeIntellect/synthetic-code-understanding

Viewer • Updated Feb 15, 2025 • 60.6k • 32 • 20
PrimeIntellect/INTELLECT-3-SFT

Viewer • Updated Nov 28, 2025 • 6.98M • 663 • 4
openbmb/InfLLM-V2-data-5B

Viewer • Updated Oct 25, 2025 • 7.19M • 348 • 33
kenhktsui/open-react-retrieval-multi-neg-result-new-kw

Viewer • Updated Aug 7, 2023 • 25.2k • 24 • 3
alwaysfurther/tiny-agent-with-tools

Viewer • Updated Dec 3, 2025 • 27 • 76
tiny-agents/tiny-agents

Viewer • Updated Apr 26 • 9 • 393 • 38
PleIAs/SYNTH

Viewer • Updated 21 days ago • 68M • 20.2k • 266
TuringEnterprises/Turing-Open-Reasoning

Viewer • Updated Dec 6, 2025 • 50 • 203 • 192
TeichAI/claude-4.5-opus-high-reasoning-250x

Viewer • Updated Nov 28, 2025 • 250 • 1.59k • 390
PrimeIntellect/INTELLECT-3-RL

Viewer • Updated 13 days ago • 70.7k • 5.56k • 7
PrimeIntellect/Reverse-Text-RL

Viewer • Updated Aug 12, 2025 • 1k • 8.76k • 2
PrimeIntellect/Reverse-Text-SFT

Viewer • Updated Aug 12, 2025 • 1k • 1.21k • 3
PrimeIntellect/SYNTHETIC-2-Base-Code

Viewer • Updated Jun 23, 2025 • 57.3k • 75
PrimeIntellect/SYNTHETIC-2-Base-Math

Viewer • Updated Jun 23, 2025 • 105k • 22 • 1
PrimeIntellect/SYNTHETIC-2-Base

Viewer • Updated Jun 23, 2025 • 465k • 55 • 9
PrimeIntellect/SYNTHETIC-2-Base-General-Reasoning

Viewer • Updated Jun 23, 2025 • 165k • 13 • 1
PrimeIntellect/SYNTHETIC-2-SFT-verified

Viewer • Updated Jul 10, 2025 • 105k • 441 • 11
PrimeIntellect/SYNTHETIC-2-Base-Answer-Critique

Viewer • Updated Jun 23, 2025 • 50k • 12 • 2
PrimeIntellect/SYNTHETIC-2-Base-Instruction-Following

Viewer • Updated Jun 23, 2025 • 87.5k • 13
PrimeIntellect/SYNTHETIC-2

Viewer • Updated Jul 10, 2025 • 51.6k • 279 • 15
PrimeIntellect/AIME-24

Viewer • Updated Jun 24, 2025 • 30 • 30
PrimeIntellect/AIME-25

Viewer • Updated Jun 24, 2025 • 30 • 37
PrimeIntellect/MATH-500

Viewer • Updated Jun 24, 2025 • 500 • 144
PrimeIntellect/LiveCodeBench-v5

Viewer • Updated Jun 25, 2025 • 279 • 242
arcee-ai/bfcl_v4_web_search

Viewer • Updated Sep 13, 2025 • 100 • 230 • 6
arcee-ai/EvolKit-75K

Viewer • Updated Dec 5, 2024 • 74.2k • 150 • 37
arcee-ai/general-dpo-datasets

Viewer • Updated Jul 4, 2024 • 91.6k • 114
arcee-ai/synthetic-data-gen

Viewer • Updated Sep 21, 2023 • 999k • 308 • 2
arcee-ai/DAM

Viewer • Updated Nov 25, 2024 • 10.4k • 177
arcee-ai/EvolKit-20k-vi

Viewer • Updated Nov 7, 2024 • 15.4k • 139 • 7
arcee-ai/reasoning-sharegpt

Viewer • Updated Jul 5, 2024 • 29.9k • 157 • 23
arcee-ai/agent-data

Viewer • Updated Jul 22, 2024 • 486k • 288 • 64
arcee-ai/infini-instruct-top-500k

Viewer • Updated Jun 30, 2024 • 500k • 123 • 6
arcee-ai/cleaned-mlabonne-distilabel-truthy-dpo-v0.1-filtered

Viewer • Updated Jun 18, 2024 • 663 • 11
Nanbeige/ToolMind

Viewer • Updated Jan 9 • 369k • 2.66k • 153
Salesforce/APIGen-MT-5k

Viewer • Updated Oct 10, 2025 • 5k • 1.85k • 99
Team-ACE/ToolACE

Viewer • Updated Sep 4, 2024 • 11.3k • 10.5k • 178
glaiveai/glaive-function-calling-v2

Viewer • Updated Sep 27, 2023 • 113k • 63.2k • 508
nvidia/When2Call

Viewer • Updated Apr 29, 2025 • 28k • 1.45k • 46
Salesforce/xlam-function-calling-60k

Viewer • Updated Jan 24, 2025 • 60k • 29.5k • 623
HuggingFaceFW/fineweb-edu

Viewer • Updated Jul 11, 2025 • 3.5B • 634k • 1.1k
Skywork-R1V4: Toward Agentic Multimodal Intelligence through Interleaved Thinking with Images and DeepResearch

Paper • 2512.02395 • Published Dec 2, 2025 • 52
MATRIX: Multimodal Agent Tuning for Robust Tool-Use Reasoning

Paper • 2510.08567 • Published Oct 9, 2025
Scaling Agentic Reinforcement Learning for Tool-Integrated Reasoning in VLMs

Paper • 2511.19773 • Published Nov 24, 2025 • 10
ToolScope: An Agentic Framework for Vision-Guided and Long-Horizon Tool Use

Paper • 2510.27363 • Published Oct 31, 2025 • 23
Ariadne: A Controllable Framework for Probing and Extending VLM Reasoning Boundaries

Paper • 2511.00710 • Published Nov 1, 2025 • 5
VLA-R1: Enhancing Reasoning in Vision-Language-Action Models

Paper • 2510.01623 • Published Oct 2, 2025 • 13
DeepEyesV2: Toward Agentic Multimodal Model

Paper • 2511.05271 • Published Nov 7, 2025 • 47
DeepMMSearch-R1: Empowering Multimodal LLMs in Multimodal Web Search

Paper • 2510.12801 • Published Oct 14, 2025 • 14
DeepAgent: A General Reasoning Agent with Scalable Toolsets

Paper • 2510.21618 • Published Oct 24, 2025 • 103
Open Multimodal Retrieval-Augmented Factual Image Generation

Paper • 2510.22521 • Published Oct 26, 2025 • 31
smolagents/android-control

Viewer • Updated May 9, 2025 • 15.3k • 2.53k • 14
smolagents/guiact-web-single

Viewer • Updated Aug 5, 2025 • 13.3k • 211 • 1
smolagents/tool-scraping

Viewer • Updated Sep 16, 2025 • 1.89k • 62 • 6
smolagents/hermes-function-calling-v1-formatted-code-agent

Viewer • Updated Jun 30, 2025 • 9k • 258 • 3
smolagents/aguvis-stage-1

Viewer • Updated Aug 5, 2025 • 459k • 1.92k • 17
smolagents/aguvis-stage-2

Viewer • Updated Sep 5, 2025 • 784k • 4.04k • 29
beyoru/ToolCalll_fusion

Viewer • Updated Sep 15, 2025 • 10.5k • 30 • 1
beyoru/ToolCall_synthetic_qwen3

Viewer • Updated Jul 20, 2025 • 60k • 17 • 10
rogue-security/mcp-tool-use-quality-benchmark

Viewer • Updated Sep 25, 2025 • 5k • 26 • 3
mlx-community/hermes-reasoning-tool-use

Viewer • Updated Jul 29, 2025 • 51k • 70 • 5
TeichAI/gemini-3-pro-preview-high-reasoning-1000x

Viewer • Updated Dec 10, 2025 • 1.02k • 244 • 78
openbmb/Ultra-FineWeb

Viewer • Updated Dec 10, 2025 • 1.29B • 52.2k • 345
allenai/Dolci-Instruct-SFT-Tool-Use

Viewer • Updated Jan 5 • 228k • 355 • 16
nvidia/Nemotron-Content-Safety-Reasoning-Dataset

Preview • Updated Nov 26, 2025 • 311 • 10
ai-safety-institute/AgentHarm

Viewer • Updated Dec 19, 2024 • 468 • 4k • 56
Voxel51/ScreenSpot-v2

Viewer • Updated Jun 25, 2025 • 1.27k • 2.63k • 2
rootsautomation/ScreenSpot

Viewer • Updated Apr 10, 2024 • 1.27k • 3.5k • 48
microsoft/WebTailBench

Preview • Updated 15 days ago • 345 • 16
DeepShop/DeepShop

Viewer • Updated May 13, 2025 • 150 • 32 • 4
osunlp/Online-Mind2Web

Viewer • Updated 4 days ago • 300 • 1.57k • 25
zai-org/T1

Preview • Updated Mar 2, 2025 • 44 • 11
zai-org/LongBench-v2

Viewer • Updated Dec 20, 2024 • 503 • 35.2k • 47

Upvote

Collection guide
Browse collections