Somshubra Majumdar

smajumdar94

AI & ML interests

None yet

Recent Activity

upvoted a paper 13 days ago

Rubric-based On-policy Distillation

upvoted a paper 17 days ago

Unmasking On-Policy Distillation: Where It Helps, Where It Hurts, and Why

liked a model about 1 month ago

moonshotai/Kimi-K2.6

View all activity

Organizations

upvoted a paper 13 days ago

Rubric-based On-policy Distillation

Paper • 2605.07396 • Published 23 days ago • 41

upvoted a paper 17 days ago

Unmasking On-Policy Distillation: Where It Helps, Where It Hurts, and Why

Paper • 2605.10889 • Published 20 days ago • 5

liked a model about 1 month ago

moonshotai/Kimi-K2.6

Image-Text-to-Text • 1.1T • Updated 12 days ago • 3.01M • • 1.37k

upvoted a paper about 2 months ago

SKILL0: In-Context Agentic Reinforcement Learning for Skill Internalization

Paper • 2604.02268 • Published Apr 2 • 101

updated a collection 2 months ago

Nemotron-Post-Training-v3

Collection

Collection of datasets used in the post-training phase of Nemotron Nano and Super v3. • 28 items • Updated 1 day ago • 141

published a dataset 2 months ago

nvidia/Nemotron-SFT-OpenCode-v1

Preview • Updated Mar 23 • 2.73k • 45

updated a dataset 2 months ago

nvidia/Nemotron-SFT-OpenCode-v1

Preview • Updated Mar 23 • 2.73k • 45

upvoted a paper 2 months ago

daVinci-Env: Open SWE Environment Synthesis at Scale

Paper • 2603.13023 • Published Mar 13 • 30

published a dataset 3 months ago

smajumdar94/Nemotron-SFT-OpenCode-v1

Updated Mar 13 • 8

upvoted a collection 3 months ago

Nemotron-Post-Training-v3

Collection

Collection of datasets used in the post-training phase of Nemotron Nano and Super v3. • 28 items • Updated 1 day ago • 141

upvoted a paper 3 months ago

BeyondSWE: Can Current Code Agent Survive Beyond Single-Repo Bug Fixing?

Paper • 2603.03194 • Published Mar 3 • 57

liked a dataset 3 months ago

nebius/SWE-rebench-V2

Viewer • Updated 18 days ago • 32.1k • 22.8k • 48

upvoted an article 3 months ago

Article

Custom Kernels for All from Codex and Claude

burtenshaw, sayakpaul, ariG23498, evalstate

•

Feb 13

• 80

upvoted 2 articles 4 months ago

Article

Forge: Scalable Agent RL Framework and Algorithm

MiniMax-AI

•

Feb 13

• 155

Article

We Got Claude to Build CUDA Kernels and teach open models!

burtenshaw, evalstate, merve, pcuenq

•

Jan 28

• 156

liked a model 5 months ago

nvidia/nemotron-speech-streaming-en-0.6b

Automatic Speech Recognition • Updated 27 days ago • 7.87k • 549

New activity in nvidia/NVIDIA-Nemotron-3-Nano-30B-A3B-BF16 5 months ago

Unexpected... "Performance"?

👍 2

#15 opened 5 months ago by

ponzles

Upload @PlaySafeGirls - 1.png

#24 opened 5 months ago by

Playsafegirls

liked a model 5 months ago

zai-org/GLM-4.7

Text Generation • 358B • Updated Jan 29 • 58.4k • • 2.04k

Somshubra Majumdar

AI & ML interests

Recent Activity

Organizations

smajumdar94's activity

Custom Kernels for All from Codex and Claude

Forge: Scalable Agent RL Framework and Algorithm

We Got Claude to Build CUDA Kernels and teach open models!

Unexpected... "Performance"?

Upload @PlaySafeGirls - 1.png