Nothing Works for HiDream

by qpqpqpqpqpqp - opened 20 days ago

20 days ago

calcuis' GGUF CLIP Loader:
llama3.1-encoder-q3_k_m.gguf
RuntimeError: Error(s) in loading state_dict for Llama2:
size mismatch for model.embed_tokens.weight: copying a param with shape torch.Size([128320, 4096]) from checkpoint, the shape in current model is torch.Size([128256, 4096]).
I had to download a Llama quant from someone else.

calcuis

20 days ago

haven't tested that model for a long time; guess something changed over time; have you found one that works for this?

calcuis

20 days ago

edited 20 days ago

oh, i see; they changed to the old scheme for this for some reason
token_embd.weight [4096, 128256]

https://huggingface.co/kaetemi/Meta-Llama-3.1-8B-Q4_K_M-GGUF-old/blob/main/meta-llama-3.1-8b-q4_k_m.gguf

and this one should also work:
https://huggingface.co/chatpig/encoder/blob/main/llama-hidream-q2_k.gguf
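
For anyone who wants to sanity-check a file before loading it, here is a rough sketch of the comparison behind the error message above. The helper names are hypothetical and not part of any loader; the shapes are typed in by hand from the error and from the tensor listing. It assumes the GGUF listing shows dimensions in the reverse order of torch, which matches the two shapes quoted in this thread.

```python
# Hypothetical helpers for illustration only; not part of any GGUF loader.

def gguf_to_torch_shape(gguf_shape):
    """Flip a GGUF-style shape listing into torch order (rows, columns)."""
    return tuple(reversed(gguf_shape))


def embedding_mismatch(checkpoint_shape, model_shape):
    """Return None if the shapes match, else a short description of the difference."""
    ckpt_vocab, ckpt_dim = checkpoint_shape
    model_vocab, model_dim = model_shape
    if (ckpt_vocab, ckpt_dim) == (model_vocab, model_dim):
        return None
    if ckpt_dim != model_dim:
        return f"hidden size differs: {ckpt_dim} vs {model_dim}"
    extra = ckpt_vocab - model_vocab
    return f"vocab size differs by {extra} rows: {ckpt_vocab} vs {model_vocab}"


# Shapes copied from the RuntimeError in the first post (torch order).
failing = embedding_mismatch((128320, 4096), (128256, 4096))
print(failing)

# Shape copied from the token_embd.weight listing (GGUF order), flipped first.
working = embedding_mismatch(gguf_to_torch_shape((4096, 128256)), (128256, 4096))
print(working)
```

So the failing file carries 64 more embedding rows than the loader expects, while a file whose token_embd.weight has 128256 rows lines up.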
