Umair Abbas
umairsiddiquie
AI & ML interests
None yet
Recent Activity
published a model 8 days ago
umairsiddiquie/statecraft reacted to alibidaran's post with โค๏ธ 5 months ago
This shared notebook comprises the MMLU benchmark evaluating task for my latest reasoning model for the sociology field. The results show that using Few-shot prompting in the system prompt can significantly improve the model's performance at answering questions.
Model's link:
https://huggingface.co/alibidaran/GRPO_LLAMA3-instructive_reasoning1
Notebook evaluation:
https://www.kaggle.com/code/alibidaran/mmlu-socialogy-thinking-evals?scriptVersionId=277240033 upvoted a paper 5 months ago
Tongyi DeepResearch Technical Report