Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
5
1
3
Sunny Sanyal
Sunny111
Follow
21world's profile picture
ryanmarten's profile picture
thomwolf's profile picture
4 followers
ยท
5 following
https://sites.google.com/view/sunnysanyal/home
SunnySanyal9
sanyalsunny111
AI & ML interests
Efficient Training Recipes of Large Models (mostly LLMs)
Recent Activity
posted
an
update
about 4 hours ago
Are you familiar with reverse residual connections or looping in language models? Excited to share my Looped-GPT blog post and codebase ๐ https://github.com/sanyalsunny111/Looped-GPT TL;DR: looping during pre-training improves generalization. Plot shows GPT2 LMs pre-trained with 15.73B OWT tokens P.S. This is my first post here โ I have ~4 followers and zero expectations for reach ๐
upvoted
a
paper
29 days ago
Pre-training Small Base LMs with Fewer Tokens
liked
a model
about 1 month ago
GuminiResearch/Gumini-1.5B-Base
View all activity
Organizations
Sunny111
's models
1
Sort:ย Recently updated
Sunny111/LLM-Inheritune
Updated
Sep 21, 2025