Update app.py
app.py CHANGED

```diff
@@ -64,12 +64,11 @@ with gr.Blocks() as demo:
         In order to reduce the response time on this hardware, `max_new_tokens` has been set to `21` in the text generation pipeline. With this default configuration, it takes approximately `60 seconds` for the response to start being generated, and streamed one word at a time. Use the slider below to increase or decrease the length of the generated text.
         """)
 
-
+    tokens_slider = gr.Slider(8, 128, value=21, render=False, label="Maximum new tokens", info="A larger `max_new_tokens` parameter value gives you longer text responses but at the cost of a slower response time.")
+
     chatbot = gr.ChatInterface(
         fn=generate,
-        additional_inputs=[
-            gr.Slider(8, 128, value=21, label="Maximum new tokens", info="A larger `max_new_tokens` parameter value gives you longer text responses but at the cost of a slower response time.")
-        ],
+        additional_inputs=[tokens_slider],
         stop_btn=None,
         examples=[["Who is Leonhard Euler?"]]
     )
```