Can we control the level of thinking?
#170 · opened by kalashshah19
I am using llama-server directly from llama.cpp. Is there a parameter or some other way to set the level of thinking, like Low, Medium, or High? Many times the model thinks too much, uses up all the remaining tokens in the context length, and never returns the final message.
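
For example, something like this is what I have in mind (purely hypothetical sketch; the `reasoning_effort` field here is borrowed from the OpenAI Chat Completions API, and I'm not sure llama-server supports anything equivalent):

```python
import requests

# Hypothetical example of the kind of control I'm looking for:
# an OpenAI-style "reasoning_effort" field ("low" / "medium" / "high")
# sent to llama-server's OpenAI-compatible endpoint.
# I don't know whether llama-server actually honors such a parameter.
response = requests.post(
    "http://localhost:8080/v1/chat/completions",
    json={
        "model": "my-model",  # placeholder model name
        "messages": [
            {"role": "user", "content": "Explain quicksort briefly."}
        ],
        "reasoning_effort": "low",  # the knob I'm hoping exists
        "max_tokens": 1024,
    },
    timeout=600,
)
print(response.json()["choices"][0]["message"]["content"])
```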