<think> So let's replace this phrase with insult... </think> Lessons learned from generation of toxic texts with LLMs Paper ⢠2509.08358 ⢠Published Sep 10, 2025 ⢠13
AgentGym-RL: Training LLM Agents for Long-Horizon Decision Making through Multi-Turn Reinforcement Learning Paper ⢠2509.08755 ⢠Published Sep 10, 2025 ⢠57
view article Article Welcome EmbeddingGemma, Google's new efficient embedding model +4 Sep 4, 2025 ⢠273
Qwen/Qwen3-30B-A3B-Thinking-2507 Text Generation ⢠31B ⢠Updated Aug 17, 2025 ⢠424k ⢠⢠355