AI & ML interests
LLM/RAG/Agents/LSTM/CNN
Recent Activity
Organizations
view article Simplifying Alignment: From RLHF to Direct Preference Optimization (DPO)
view article makeMoE: Implement a Sparse Mixture of Experts Language Model from Scratch
view article How to generate text: using different decoding methods for language generation with Transformers