Zhihao Jia

zhihaojia
  • jiazhihao

AI & ML interests

None yet

Organizations

None yet

authored 5 papers over 1 year ago

SpecInfer: Accelerating Generative LLM Serving with Speculative Inference and Token Tree Verification

Paper • 2305.09781 • Published May 16, 2023 • 4

GradSign: Model Performance Inference with Theoretical Insights

Paper • 2110.08616 • Published Oct 16, 2021

Accelerating Retrieval-Augmented Language Model Serving with Speculation

Paper • 2401.14021 • Published Jan 25, 2024 • 2

Quantized Side Tuning: Fast and Memory-Efficient Tuning of Quantized Large Language Models

Paper • 2401.07159 • Published Jan 13, 2024 • 1

Towards Efficient Generative Large Language Model Serving: A Survey from Algorithms to Systems

Paper • 2312.15234 • Published Dec 23, 2023 • 3