Request - int8 version
#23
by erosdiffusion - opened
Would it be possible to have the stack at int8. It seems 30xx (eg 3080) can benefit from that (lower size, faster inference)
erosdiffusion changed discussion title from Request - int8 fersion to Request - int8 version