AI & ML interests
NLP
Organizations
upvoted a paper 8 months ago: DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge
upvoted an article about 1 year ago: Deploy LLMs with Hugging Face Inference Endpoints