Llama baseline checkpoints (0.6B, 1.3B)
Chunyuan Deng
CharlesDDDD
AI & ML interests
Architecture, Interpretability.
Recent Activity
updated a collection about 12 hours ago: looped_transformer
updated a model about 12 hours ago: CharlesDDDD/looped_transformer_loop_count_4
published a model about 12 hours ago: CharlesDDDD/looped_transformer_loop_count_4