Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
1054.0
TFLOPS
8
4
5
Jon Doe
bullpoint
Follow
memas-d's profile picture
id-2's profile picture
Krasyliv's profile picture
6 followers
ยท
17 following
AI & ML interests
None yet
Recent Activity
liked
a model
about 18 hours ago
AQ-MedAI/Kimi-K25-eagle3
new
activity
8 days ago
nvidia/Qwen3.5-397B-A17B-NVFP4:
Getting nvidia/Qwen3.5-397B-A17B-NVFP4 running with SGLang (requires transformers v5) on RTX PRO 6000 (blackwell) CUDA 12.9
upvoted
a
collection
9 days ago
Qwen3.5
View all activity
Organizations
None yet
bullpoint
's activity
All
Models
Datasets
Spaces
Papers
Collections
Community
Posts
Upvotes
Likes
Articles
New activity in
nvidia/Qwen3.5-397B-A17B-NVFP4
8 days ago
Getting nvidia/Qwen3.5-397B-A17B-NVFP4 running with SGLang (requires transformers v5) on RTX PRO 6000 (blackwell) CUDA 12.9
๐ฅ
1
5
#1 opened 10 days ago by
bullpoint
New activity in
vincentzed-hf/Qwen3.5-397B-A17B-NVFP4
14 days ago
Anyone try this on 4x RTX 6000 Pro yet?
52
#1 opened 16 days ago by
zenmagnets
New activity in
bullpoint/Qwen3-Coder-Next-AWQ-4bit
27 days ago
vllm says it's not AWQ
4
#2 opened 27 days ago by
jcowles
New activity in
bullpoint/GLM-4.6-AWQ
3 months ago
endless response
7
#4 opened 3 months ago by
ramidahbash
GLM-4.6-FP8 - 55 tokens/sec on 4x RTX 6000 PRO
6
#2 opened 5 months ago by
festr2
Speed measured on 2 x H200 / 4 x RTX 6000 Pro / 4 x A100
1
#3 opened 3 months ago by
HristoTodorov
New activity in
bullpoint/GLM-4.6-AWQ
5 months ago
llm-compressor version
1
#1 opened 5 months ago by
bullerwins
New activity in
microsoft/phi-4
about 1 year ago
Suggested tokenizer changes by Unsloth.ai
๐ฅ
13
7
#21 opened about 1 year ago by
gugarosa