Jon Doe

bullpoint

AI & ML interests

None yet

Recent Activity

liked a model about 18 hours ago

AQ-MedAI/Kimi-K25-eagle3

new activity 8 days ago

nvidia/Qwen3.5-397B-A17B-NVFP4: Getting nvidia/Qwen3.5-397B-A17B-NVFP4 running with SGLang (requires transformers v5) on RTX PRO 6000 (blackwell) CUDA 12.9

upvoted a collection 9 days ago

Organizations

None yet

New activity in nvidia/Qwen3.5-397B-A17B-NVFP4 8 days ago

Getting nvidia/Qwen3.5-397B-A17B-NVFP4 running with SGLang (requires transformers v5) on RTX PRO 6000 (blackwell) CUDA 12.9

#1 opened 10 days ago by

New activity in vincentzed-hf/Qwen3.5-397B-A17B-NVFP4 14 days ago

Anyone try this on 4x RTX 6000 Pro yet?

#1 opened 16 days ago by

New activity in bullpoint/Qwen3-Coder-Next-AWQ-4bit 27 days ago

vllm says it's not AWQ

#2 opened 27 days ago by

New activity in bullpoint/GLM-4.6-AWQ 3 months ago

endless response

#4 opened 3 months ago by

GLM-4.6-FP8 - 55 tokens/sec on 4x RTX 6000 PRO

#2 opened 5 months ago by

Speed measured on 2 x H200 / 4 x RTX 6000 Pro / 4 x A100

#3 opened 3 months ago by

New activity in bullpoint/GLM-4.6-AWQ 5 months ago

llm-compressor version

#1 opened 5 months ago by

New activity in microsoft/phi-4 about 1 year ago

Suggested tokenizer changes by Unsloth.ai

#21 opened about 1 year ago by