How does the "Target length (tokens)" work? #769

Closed Answered by LostRuins
GaelicThunder asked this question in Q&A

Then I think such a feature does not exist in stock KoboldCpp. I am not sure how the SillyTavern implementation works, but in Kobold a generation will continue until one of the following happens:

  1. The max length is reached
  2. An EOS (end-of-sequence) token is hit
  3. A stopping sequence is hit

Normally EOS will be hit long before the max length is hit. Combined with "Trim Sentences" and a decently long max length, it should get you close to what you'd expect.
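
To make the three stop conditions and the sentence-trimming step concrete, here is a rough Python sketch. It is not KoboldCpp's actual code: the `EOS` marker, the `generate` and `trim_to_last_sentence` helpers, and the idea of feeding tokens in as plain strings are all assumptions made up for illustration.

```python
import re
from typing import Iterable, List, Tuple

EOS = "</s>"  # hypothetical end-of-sequence marker, not a KoboldCpp constant


def generate(tokens: Iterable[str], max_length: int,
             stop_sequences: List[str]) -> Tuple[str, str]:
    """Consume tokens until a stop condition fires; return (text, reason)."""
    text = ""
    count = 0
    for tok in tokens:
        # Condition 2: the model emitted EOS, so it considers itself done.
        if tok == EOS:
            return text, "eos"
        text += tok
        count += 1
        # Condition 3: a stopping sequence appeared; cut the text before it.
        for seq in stop_sequences:
            idx = text.find(seq)
            if idx != -1:
                return text[:idx], "stop_sequence"
        # Condition 1: the token budget is used up.
        if count >= max_length:
            return text, "max_length"
    return text, "exhausted"


def trim_to_last_sentence(text: str) -> str:
    """Drop any trailing partial sentence (a simplified 'Trim Sentences')."""
    matches = list(re.finditer(r"[.!?]", text))
    if not matches:
        return text  # no sentence end found, leave the text alone
    return text[: matches[-1].end()]


if __name__ == "__main__":
    stream = ["Hello", " world", ".", " And", " then", " more"]
    raw, reason = generate(stream, max_length=5, stop_sequences=["\nUser:"])
    print(reason, repr(trim_to_last_sentence(raw)))
```

Note that none of these conditions aims for a target length: max length is only an upper bound, and trimming just tidies up a reply that was cut off mid-sentence.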

Answer selected by GaelicThunder