Heroes

#5
by Espionage-IV - opened

Big fan of you all at Cohere, just wanted to drop you a line and say I'll be putting North through its paces in the coming weeks. Found out about Cohere because of Transcribe and was blown away at how good that model is, so no doubt you'll do great in this space too!

Getting 80 tok/s on low context, 45tok/s once context gets loaded in at 256k full window setup on 3080Ti with 22 moe offloaded to CPU. It's very snappy for this model class on this hardware and I can tell it's already a bit of a bash god. Will put it through its paces and report back. Thanks for all that you guys do.

Sign up or log in to comment