Considering releasing the FP8 quantization model?
#1 by a463724055 - opened
The weights are too large. (Cry)
You can quantize it yourself
OK, enabling CPU offload lets it run on 16 GB of VRAM.
a463724055 changed discussion status to closed