Article 14 "Anemll-style" Root-Mean-Square (RMS) Normalization on the Apple Neural Engine: A Simple Hack
Google Gemma-3 for ANE ANEMLL conversion of Google's models anemll/anemll-google-gemma-3-1b-it-ctx4096_0.3.4 Updated 5 days ago • 54 • 1 anemll/anemll-google-gemma-3-4b-it-qat-int4-unquantized-ctx1024_0.3.5 Updated 5 days ago • 21 anemll/anemll-google-gemma-3-4b-it-qat-int4-unquantized-ctx4096_0.3.5 Updated 5 days ago • 40 anemll/anemll-google-gemma-3-270m-it-ctx512-monolithic_0.3.5 Updated 4 days ago • 41
ANEMLL-0.3.4 Models build with 0.3.4, improved quality and bug fixes anemll/anemll-Qwen-Qwen3-0.6B-ctx512_0.3.4 Updated Jul 7, 2025 • 17 • 1 anemll/anemll-meta-llama-Llama-3.2-1B-Instruct-ctx1024_0.3.4 Updated Jul 3, 2025 • 6 anemll/anemll-Qwen-Qwen3-0.6B-LUT888-ctx512_0.3.4 Updated Jul 7, 2025 • 9
Google Gemma-3 for ANE ANEMLL conversion of Google's models anemll/anemll-google-gemma-3-1b-it-ctx4096_0.3.4 Updated 5 days ago • 54 • 1 anemll/anemll-google-gemma-3-4b-it-qat-int4-unquantized-ctx1024_0.3.5 Updated 5 days ago • 21 anemll/anemll-google-gemma-3-4b-it-qat-int4-unquantized-ctx4096_0.3.5 Updated 5 days ago • 40 anemll/anemll-google-gemma-3-270m-it-ctx512-monolithic_0.3.5 Updated 4 days ago • 41
ANEMLL-0.3.4 Models build with 0.3.4, improved quality and bug fixes anemll/anemll-Qwen-Qwen3-0.6B-ctx512_0.3.4 Updated Jul 7, 2025 • 17 • 1 anemll/anemll-meta-llama-Llama-3.2-1B-Instruct-ctx1024_0.3.4 Updated Jul 3, 2025 • 6 anemll/anemll-Qwen-Qwen3-0.6B-LUT888-ctx512_0.3.4 Updated Jul 7, 2025 • 9
Runtime error 3 On-Device LLM Throughput Calculator 🚀 Generate throughput plots for LLMs on devices