Alexa Mariann Barta
Aly87
ยท
AI & ML interests
None yet
Organizations
None yet
Does this model support MLA or only the flash version does?
4
#41 opened about 2 months ago
by
Aly87
Request to Add AWQ Quantization Model
โค๏ธ 1
1
#13 opened 6 months ago
by
wunu
How much Vram needed for the full context length?
6
#31 opened 7 months ago
by
Aly87
Can we get the base model?
#5 opened 7 months ago
by
Aly87
Are you guys not going to release base models anymore?
๐ 2
#8 opened 7 months ago
by
Aly87
No base model
๐ 15
8
#2 opened 7 months ago
by
ricardo-rei
NF4 for inference?
#32 opened 8 months ago
by
Aly87