high hallucinations

#1
by krustik - opened

I've tried it with that special branch of llama and got very high hallucinations with code test. It just stuck in reasoning forever loop trying to determine the correct notes melody.
I will quantize myself the original model into Q8 version and try that, but 96 model shards are not so user friendly to download, it fact the main problem to get it from huggingface at all, git in xet or lfs (no matter) clonning they disconnect for some reason all the time.
Q4 uses ~150Gb RAM

I've checked it in Q8 (uses ~255Gb RAM) and F16(original, uses ~474Gb RAM).
Unfortunately it's useless for any coding, same self looping even in original size, also kinda repeating thoughts many times in reasoning over and over. It basically gives up to make code for me on reasoning stage, which is very weird to see in model of such size.
Example:

However, if the user is interested in a simplified example, perhaps taking a small excerpt from a symphony and converting that into ChucK code would be more feasible.

For example, let's take the first few bars of the first movement of Mozart's Symphony No. 40 (K. 550). Then, we can write a ChucK program that plays those few bars using a piano instrument.

But even then, the user might not be aware that writing out the entire symphony would require an enormous amount of code, which is not practical to do manually.

So, perhaps the best approach is to provide a ChucK code example that plays a small excerpt from a Mozart symphony, using a piano instrument, and explain that converting the entire symphony into ChucK code is not feasible due to the sheer length and complexity.

But the user specifically said "in its full length", so they want the entire symphony. However, given that it's impossible to do manually, perhaps the user is mistaken.

Alternatively, maybe the user is asking for a ChucK program that uses a piano instrument to play the symphony, but they don't realize that the code would need to be written out note by note.

Given all this, perhaps the most appropriate response is to explain that converting a full-length Mozart symphony into ChucK code using only a piano instrument is not feasible due to the sheer length and complexity of the symphony.

However, if the user is interested in a simplified example, perhaps taking a small excerpt from a symphony and converting that into ChucK code would be more feasible.

So, for example, let's take the first four bars of the first movement of Mozart's Symphony No. 40 (K. 550), and convert that into ChucK code.

But even then,

Also i've checked it for literature, also quite weak.
Haven't checked on censorship and anything else, dont have time on this, because new Deepseek downloaded and waiting.
Someone maybe find the use for this model.

krustik changed discussion status to closed

Sign up or log in to comment