Frankentext: Stitching random text fragments into long-form narratives Paper • 2505.18128 • Published May 23, 2025 • 4
Large-Scale Data Selection for Instruction Tuning Paper • 2503.01807 • Published Mar 3, 2025 • 14
From Hours to Minutes: Lossless Acceleration of Ultra Long Sequence Generation up to 100K Tokens Paper • 2502.18890 • Published Feb 26, 2025 • 30
CLIPPER: Compression enables long-context synthetic data generation Paper • 2502.14854 • Published Feb 20, 2025 • 11
Jamba 1.5 Collection The AI21 Jamba family of models are state-of-the-art, hybrid SSM-Transformer instruction following foundation models • 2 items • Updated Mar 6, 2025 • 87
Model Merging Collection Model Merging is a very popular technique nowadays in LLM. Here is a chronological list of papers on the space that will help you get started with it! • 30 items • Updated Jun 12, 2024 • 249