127 646 1

Michael Barry

MichaelBarryUK

AI & ML interests

None yet

Recent Activity

commented on a paper 16 days ago

State over Tokens: Characterizing the Role of Reasoning Tokens

upvoted a paper 20 days ago

State over Tokens: Characterizing the Role of Reasoning Tokens

commented on a paper 20 days ago

State over Tokens: Characterizing the Role of Reasoning Tokens

View all activity

Organizations

None yet

commented a paper 16 days ago

State over Tokens: Characterizing the Role of Reasoning Tokens

Paper • 2512.12777 • Published 22 days ago • 3 •

upvoted a paper 20 days ago

State over Tokens: Characterizing the Role of Reasoning Tokens

Paper • 2512.12777 • Published 22 days ago • 3

commented a paper 20 days ago

State over Tokens: Characterizing the Role of Reasoning Tokens

Paper • 2512.12777 • Published 22 days ago • 3 •

commented 4 papers about 1 month ago

commented a paper 4 months ago

Sharing is Caring: Efficient LM Post-Training with Collective RL Experience Sharing

Paper • 2509.08721 • Published Sep 10, 2025 • 660 •

upvoted a paper 4 months ago

Sharing is Caring: Efficient LM Post-Training with Collective RL Experience Sharing

Paper • 2509.08721 • Published Sep 10, 2025 • 660

commented a paper 4 months ago

Why Language Models Hallucinate

Paper • 2509.04664 • Published Sep 4, 2025 • 195 •

upvoted a paper 4 months ago

Why Language Models Hallucinate

Paper • 2509.04664 • Published Sep 4, 2025 • 195

commented a paper 4 months ago

Why Language Models Hallucinate

Paper • 2509.04664 • Published Sep 4, 2025 • 195 •

upvoted a paper 4 months ago

TiKMiX: Take Data Influence into Dynamic Mixture for Language Model Pre-training

Paper • 2508.17677 • Published Aug 25, 2025 • 14

commented 2 papers 5 months ago

On-Policy RL Meets Off-Policy Experts: Harmonizing Supervised Fine-Tuning and Reinforcement Learning via Dynamic Weighting

Paper • 2508.11408 • Published Aug 15, 2025 • 8 •

On-Policy RL Meets Off-Policy Experts: Harmonizing Supervised Fine-Tuning and Reinforcement Learning via Dynamic Weighting

Paper • 2508.11408 • Published Aug 15, 2025 • 8 •

upvoted a paper 5 months ago

NVIDIA Nemotron Nano 2: An Accurate and Efficient Hybrid Mamba-Transformer Reasoning Model

Paper • 2508.14444 • Published Aug 20, 2025 • 39

commented 2 papers 5 months ago

NVIDIA Nemotron Nano 2: An Accurate and Efficient Hybrid Mamba-Transformer Reasoning Model

Paper • 2508.14444 • Published Aug 20, 2025 • 39 •

Is Chain-of-Thought Reasoning of LLMs a Mirage? A Data Distribution Lens

Paper • 2508.01191 • Published Aug 2, 2025 • 238 •

upvoted 2 papers 5 months ago

Cyber-Zero: Training Cybersecurity Agents without Runtime

Paper • 2508.00910 • Published Jul 29, 2025 • 8

Exploitation Is All You Need... for Exploration

Paper • 2508.01287 • Published Aug 2, 2025 • 6

Michael Barry

AI & ML interests

Recent Activity

Organizations

MichaelBarryUK's activity