Kristaller486's picture

Kristaller486

kristaller486

·

AI & ML interests

NLP, Machine Translation

Recent Activity

upvoted a collection 2 days ago

liked a model 2 days ago

rednote-hilab/dots.mocr

liked a dataset 3 days ago

zai-org/ZClawBench

View all activity

Organizations

upvoted a collection 2 days ago

dots.mocr

Multimodal OCR: Parse Anything from Documents • 2 items • Updated 2 days ago • 5

upvoted a paper 2 months ago

VIBE: Visual Instruction Based Editor

Paper • 2601.02242 • Published Jan 5 • 64

upvoted a paper 3 months ago

Gamayun's Path to Multilingual Mastery: Cost-Efficient Training of a 1.5B-Parameter LLM

Paper • 2512.21580 • Published Dec 25, 2025 • 8

upvoted 4 collections 3 months ago

T-lite-2.1

3 items • Updated 19 days ago • 4

T-pro-2.1

3 items • Updated Dec 23, 2025 • 6

Kandinsky 5.0 Video Pro Diffusers

Kandinsky 5.0 Video Pro is a 19B model that generates high-quality HD videos from English and Russian prompts with controllable camera motion. • 4 items • Updated Dec 14, 2025 • 12

NeMo Gym

Collection of RL verifiable data for NeMo Gym • 22 items • Updated 1 day ago • 52

upvoted a paper 3 months ago

T-pro 2.0: An Efficient Russian Hybrid-Reasoning Model and Playground

Paper • 2512.10430 • Published Dec 11, 2025 • 118

upvoted a changelog 3 months ago

Hugging Face Changelog

Featured Spaces are now easier to spot

Nov 25, 2025

• 67

upvoted a paper 5 months ago

When Models Lie, We Learn: Multilingual Span-Level Hallucination Detection with PsiloQA

Paper • 2510.04849 • Published Oct 6, 2025 • 117

upvoted a collection 5 months ago

Nanonets-OCR2

2 items • Updated Oct 13, 2025 • 25

upvoted a collection 7 months ago

DeepSeek-V3.1

3 items • Updated 19 days ago • 261

upvoted 2 papers 7 months ago

Sample More to Think Less: Group Filtered Policy Optimization for Concise Reasoning

Paper • 2508.09726 • Published Aug 13, 2025 • 15

SONAR-LLM: Autoregressive Transformer that Thinks in Sentence Embeddings and Speaks in Tokens

Paper • 2508.05305 • Published Aug 7, 2025 • 47

upvoted a paper 8 months ago

Falcon-H1: A Family of Hybrid-Head Language Models Redefining Efficiency and Performance

Paper • 2507.22448 • Published Jul 30, 2025 • 70

upvoted a collection 8 months ago

T-pro-2.0

Hybrid reasoning model based on Qwen3 32B • 7 items • Updated 19 days ago • 29

upvoted a collection 9 months ago

Skywork-Reward-V2

Scaling preference data curation to the extreme • 9 items • Updated Jul 4, 2025 • 26

upvoted a paper 9 months ago

Geopolitical biases in LLMs: what are the "good" and the "bad" countries according to contemporary language models

Paper • 2506.06751 • Published Jun 7, 2025 • 71

upvoted 2 papers 10 months ago

Exploring the Latent Capacity of LLMs for One-Step Text Generation

Paper • 2505.21189 • Published May 27, 2025 • 61

Quartet: Native FP4 Training Can Be Optimal for Large Language Models

Paper • 2505.14669 • Published May 20, 2025 • 78