Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing
    • Website
      • Tasks
      • HuggingChat
      • Collections
      • Languages
      • Organizations
    • Community
      • Blog
      • Posts
      • Daily Papers
      • Learn
      • Discord
      • Forum
      • GitHub
    • Solutions
      • Team & Enterprise
      • Hugging Face PRO
      • Enterprise Support
      • Inference Providers
      • Inference Endpoints
      • Storage Buckets

  • Log In
  • Sign Up
théo gigant's picture
32 65 205

théo gigant

gigant
NousResearch
bloc97's profile picture neurohacker352's profile picture Tonic's profile picture
·
https://giganttheo.github.io/
  • gigant_theo
  • giganttheo
  • theo-gigant

AI & ML interests

multimodal

Recent Activity

authored a paper 10 days ago
Decoupling the Benefits of Subword Tokenization for Language Model Training via Byte-level Simulation
upvoted a paper 11 days ago
Decoupling the Benefits of Subword Tokenization for Language Model Training via Byte-level Simulation
submitted a paper 11 days ago
Decoupling the Benefits of Subword Tokenization for Language Model Training via Byte-level Simulation
View all activity

Organizations

Flax Community's profile picture NousResearch's profile picture Speech Recognition Community Event Version 2's profile picture HugGAN Community's profile picture Gradio-Blocks-Party's profile picture BigLAM: BigScience Libraries, Archives and Museums's profile picture Blog-explorers's profile picture open/ acc's profile picture HF x fal x BFL 's profile picture
gigant 's papers 5
arxiv:2604.27263
arxiv:2605.06546
arxiv:2504.10049
arxiv:2211.05100
arxiv:2206.15076
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs