
Sunny Sanyal

Sunny111
https://sites.google.com/view/sunnysanyal/home
  • SunnySanyal9
  • sanyalsunny111

AI & ML interests

Efficient Training Recipes of Large Models (mostly LLMs)

Recent Activity

posted an update about 4 hours ago
Are you familiar with reverse residual connections or looping in language models? Excited to share my Looped-GPT blog post and codebase 🚀 https://github.com/sanyalsunny111/Looped-GPT TL;DR: looping during pre-training improves generalization. The plot shows GPT2 LMs pre-trained with 15.73B OWT tokens. P.S. This is my first post here; I have ~4 followers and zero expectations for reach 😄
upvoted a paper 29 days ago
Pre-training Small Base LMs with Fewer Tokens
liked a model about 1 month ago
GuminiResearch/Gumini-1.5B-Base

Organizations

  • University of Texas at Austin
  • ML Foundations
  • Institute for Foundations of Machine Learning

Sunny111's models (1)

Sunny111/LLM-Inheritune

Updated Sep 21, 2025