meituan

company

Verified

Recent Activity

DadaCloud01 submitted a paper 7 days ago

TRUST-SQL: Tool-Integrated Multi-Turn Reinforcement Learning for Text-to-SQL over Unknown Schemas

wanggd-meituan updated a dataset 20 days ago

meituan/LIBERO-X

wanggd-meituan published a dataset 20 days ago

meituan/LIBERO-X

Papers

TRUST-SQL: Tool-Integrated Multi-Turn Reinforcement Learning for Text-to-SQL over Unknown Schemas

SEAD: Self-Evolving Agent for Multi-Turn Service Dialogue

authored 3 papers 5 days ago

Rediscovering Entropy Regularization: Adaptive Coefficient Unlocks Its Potential for LLM Reinforcement Learning

Paper • 2510.10959 • Published Oct 13, 2025 • 2

Step 3.5 Flash: Open Frontier-Level Intelligence with 11B Active Parameters

Paper • 2602.10604 • Published Feb 11 • 193

Reasoner for Real-World Event Detection: Scaling Reinforcement Learning via Adaptive Perplexity-Aware Sampling Strategy

Paper • 2507.01327 • Published Jul 2, 2025 • 1

authored a paper 7 days ago

TRUST-SQL: Tool-Integrated Multi-Turn Reinforcement Learning for Text-to-SQL over Unknown Schemas

Paper • 2603.16448 • Published 7 days ago • 56

submitted a paper to Daily Papers 7 days ago

TRUST-SQL: Tool-Integrated Multi-Turn Reinforcement Learning for Text-to-SQL over Unknown Schemas

Paper • 2603.16448 • Published 7 days ago • 56

updated a dataset 20 days ago

meituan/LIBERO-X

Viewer • Updated 20 days ago • 172k • 1.14k

published a dataset 20 days ago

meituan/LIBERO-X

Viewer • Updated 20 days ago • 172k • 1.14k

updated a model 21 days ago

meituan/MemOCR-7B

Visual Question Answering • 8B • Updated 21 days ago • 54 • 7

authored 5 papers 25 days ago

WebGuard: Building a Generalizable Guardrail for Web Agents

Paper • 2507.14293 • Published Jul 18, 2025 • 1

Exploratory Memory-Augmented LLM Agent via Hybrid On- and Off-Policy Optimization

Paper • 2602.23008 • Published 26 days ago • 36

World Models with Hints of Large Language Models for Goal Achieving

Paper • 2406.07381 • Published Jun 11, 2024 • 1

ADG: Ambient Diffusion-Guided Dataset Recovery for Corruption-Robust Offline Reinforcement Learning

Paper • 2505.23871 • Published May 29, 2025 • 1

Multi-Agent Coordination via Multi-Level Communication

Paper • 2209.12713 • Published Sep 26, 2022 • 2

published a model 29 days ago

meituan/MemOCR-7B

Visual Question Answering • 8B • Updated 21 days ago • 54 • 7

authored a paper about 1 month ago

When to Memorize and When to Stop: Gated Recurrent Memory for Long-Context Reasoning

Paper • 2602.10560 • Published Feb 11 • 30

submitted a paper to Daily Papers about 1 month ago

ScaleEnv: Scaling Environment Synthesis from Scratch for Generalist Interactive Tool-Use Agent Training

Paper • 2602.06820 • Published Feb 6 • 13

submitted a paper to Daily Papers about 1 month ago

SEAD: Self-Evolving Agent for Multi-Turn Service Dialogue

Paper • 2602.03548 • Published Feb 3 • 4

authored 2 papers about 1 month ago

LongCat-Flash-Thinking-2601 Technical Report

Paper • 2601.16725 • Published Jan 23 • 178

ScaleEnv: Scaling Environment Synthesis from Scratch for Generalist Interactive Tool-Use Agent Training

Paper • 2602.06820 • Published Feb 6 • 13

authored a paper about 2 months ago

Reasoning-Enhanced Large Language Models for Molecular Property Prediction

Paper • 2510.10248 • Published Oct 11, 2025 • 2