thunlp/Qwen2-7B-Instruct-FR-Spec


This repository provides token frequency statistics computed on SlimPajama-627B, for use with FR-Spec (https://arxiv.org/abs/2502.14856). See https://github.com/thunlp/FR-Spec for more details.
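To illustrate what a frequency-ranked token list is, here is a minimal sketch that counts token occurrences in an already-tokenized corpus and keeps the k most frequent ones. The function name `top_k_tokens` and the toy input are hypothetical; this is not the authors' pipeline, which lives in the FR-Spec repository.

```python
from collections import Counter


def top_k_tokens(token_id_stream, k):
    """Return the k most frequent token ids, most frequent first.

    Hypothetical helper for illustration only; the real statistics
    were computed by the FR-Spec authors on SlimPajama-627B.
    """
    counts = Counter(token_id_stream)
    # Sort by descending count; break ties by token id so the result is deterministic.
    ranked = sorted(counts.items(), key=lambda kv: (-kv[1], kv[0]))
    return [token_id for token_id, _ in ranked[:k]]


# Toy corpus of token ids (made up for this example).
toy_corpus = [3, 1, 3, 2, 1, 3]
high_freq = top_k_tokens(toy_corpus, k=2)
```

On a real corpus the stream would come from running the model's tokenizer over the text, and k would match the size encoded in the file name (for example 32768).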

freq_32768.pt can be loaded with torch.load(); it contains a list of high-frequency tokens.
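A minimal loading sketch, assuming the file has been downloaded locally and holds a plain list (or, as a precaution, a 1-D tensor) of tokens. The helper name `load_high_freq_tokens` is hypothetical and not part of FR-Spec.

```python
import torch


def load_high_freq_tokens(path="freq_32768.pt"):
    """Load the high-frequency token list from a .pt file.

    Hypothetical helper: the card only states that the file is a list
    loadable with torch.load(); the tensor branch is a defensive assumption.
    """
    # weights_only=True restricts unpickling to tensors and basic Python containers.
    obj = torch.load(path, map_location="cpu", weights_only=True)
    if isinstance(obj, torch.Tensor):
        obj = obj.tolist()
    return list(obj)
```

With the file in the working directory, `tokens = load_high_freq_tokens("freq_32768.pt")` should return the list, and `len(tokens)` gives the number of entries.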

config.json and pytorch_model.bin are identical to the files in https://huggingface.co/yuhuili/EAGLE-Qwen2-7B-Instruct and can be downloaded from that repository.


Model tree for thunlp/Qwen2-7B-Instruct-FR-Spec

Base model: Qwen/Qwen2-7B
Finetuned: Qwen/Qwen2-7B-Instruct
Finetuned: this model


Paper for thunlp/Qwen2-7B-Instruct-FR-Spec

FR-Spec: Accelerating Large-Vocabulary Language Models via Frequency-Ranked Speculative Sampling (arXiv 2502.14856, published Feb 20, 2025)