How to get longer outputs?

#49

by Apps - opened Aug 26, 2023

Aug 26, 2023

I'm using Inference Endpoints to do QA over docs:

Prompt:

CONTEXT: 
<document chunk 1>
<document chunk 2>
<document chunk 3>
<document chunk 4>
<document chunk 5>

QUESTION: What is the answer to life, the universe and everything?
ANSWER:

I usually get very short outputs of one or two words. How can I get longer outputs?

Thanks

Muennighoff

BigScience Workshop org Aug 27, 2023

You can force a minimum generation length by setting the min_new_tokens kwarg, e.g. to 100.
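As a rough sketch of how that kwarg could be passed to an endpoint: the snippet below builds a request body with min_new_tokens under "parameters". It assumes the endpoint runs the default transformers text-generation handler, which forwards "parameters" to generation as kwargs; other serving backends may not accept min_new_tokens. The helper names build_payload and query, the max_new_tokens value of 256, and the endpoint URL and token arguments are all illustrative, not taken from this thread.

```python
import json
import urllib.request


def build_payload(prompt, min_new_tokens=100, max_new_tokens=256):
    """Build a request body that asks for at least `min_new_tokens` new tokens.

    Hypothetical helper: the parameter names mirror the transformers
    generate() kwargs; the default values are assumptions.
    """
    if min_new_tokens > max_new_tokens:
        raise ValueError("min_new_tokens must not exceed max_new_tokens")
    return {
        "inputs": prompt,
        "parameters": {
            "min_new_tokens": min_new_tokens,
            "max_new_tokens": max_new_tokens,
        },
    }


def query(endpoint_url, token, payload):
    """POST the payload to the endpoint and return the decoded JSON response.

    Hypothetical helper: `endpoint_url` and `token` are placeholders for
    your own endpoint URL and access token.
    """
    request = urllib.request.Request(
        endpoint_url,
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {token}",
            "Content-Type": "application/json",
        },
    )
    with urllib.request.urlopen(request) as response:
        return json.load(response)


prompt = (
    "CONTEXT:\n<document chunk 1>\n<document chunk 2>\n\n"
    "QUESTION: What is the answer to life, the universe and everything?\n"
    "ANSWER:"
)
payload = build_payload(prompt)
```

When running the model locally with transformers instead, the same two values can be passed directly as keyword arguments to generate(). Setting max_new_tokens alongside min_new_tokens keeps the minimum from being cut off by a lower default length limit.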
