AraMix is a SOTA Arabic pretraining dataset
Sultan Alrashed PRO
SultanR
AI & ML interests
Smol language modelling and Arabic!
Recent Activity
published
a dataset
about 5 hours ago
AdaMLLab/ThaiMix
updated
a dataset
2 days ago
AdaMLLab/TurMix
updated
a dataset
2 days ago
AdaMLLab/AraMix