Pretrained models from the paper "Predicting the Order of Upcoming Tokens Improves Language Modeling"
Zayd Muhammad Kawakibi Zuhri PRO
zaydzuhri
AI & ML interests
I really like watching loss go down
Recent Activity
updated a model 5 days ago
zaydzuhri/tasklets_tokenizer_64 published a model 5 days ago
zaydzuhri/tasklets_tokenizer_64 updated a dataset 5 days ago
zaydzuhri/noisy-recall-madOrganizations
None yet