arxiv:2604.27263
théo gigant
AI & ML interests
multimodal
Recent Activity
authored a paper 10 days ago
Decoupling the Benefits of Subword Tokenization for Language Model Training via Byte-level Simulation submitted a paper 11 days ago
Decoupling the Benefits of Subword Tokenization for Language Model Training via Byte-level Simulation