- BitNet: Scaling 1-bit Transformers for Large Language Models
  Paper • 2310.11453 • Published • 107
- Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone
  Paper • 2404.14219 • Published • 262
- LayerSkip: Enabling Early Exit Inference and Self-Speculative Decoding
  Paper • 2404.16710 • Published • 82
- Advancing Open-source World Models
  Paper • 2601.20540 • Published • 135
Mayor PRO
Eric111
Recent Activity
liked a model about 8 hours ago: apple/CLaRa-7B-Instruct
liked a model about 8 hours ago: mudler/LFM2.5-8B-A1B-APEX-GGUF
liked a model about 8 hours ago: w-ahmad/LFM2.5-8B-A1B-GGUF-MoQ