pinned
Running
45
Multilingual Leaderboards π
π
Generative Evaluation for Global South
Generative AI, Arabic NLP
Generative Evaluation for Global South
Generative Tasks Evaluation of Arabic LLMs
Generate heatmaps for model metrics comparison
Study the Extreme threats of Frontier Large Language Models