Spaces:

tnt306
/

finetuned-llm-demo-app

Sleeping

App Files Files Community

finetuned-llm-demo-app / run_local_llm_server.py

Commit History

Use HuggingFace persisent storage

c7f1f5a

tnt306 commited on Apr 27

Change the way to get cpu count

8d33281

tnt306 commited on Apr 25

Fix error when starting streamlit; start original first

2cc9f75

tnt306 commited on Apr 23

Add threads dynamically

3359734

tnt306 commited on Apr 23

Support getting LOCAL LLM API KEY from environment

5f62773

tnt306 commited on Apr 23

Support Tools

b9aed24

tnt306 commited on Apr 18

remove verbose log

1cf4cc4

tnt306 commited on Apr 17

Support RAG

93d7da6

tnt306 commited on Apr 17

Use mlock to force system to keep model in RAM

2ee09a5

tnt306 commited on Apr 15

Start Finetuned model first

38cf468

tnt306 commited on Apr 15

Stable version

ef9997f

tnt306 commited on Apr 14

Add print and print flush

519f144

tnt306 commited on Apr 11

Temporary upload

d304f41

tnt306 commited on Apr 10

Initial Version (Qwen2.5-7B-Instruct-1M-q4_k_m Original & Finetuned; Chat Web UI)

b77991a

tnt306 commited on Apr 10

Commit History

Use HuggingFace persisent storage c7f1f5a

Change the way to get cpu count 8d33281

Fix error when starting streamlit; start original first 2cc9f75

Add threads dynamically 3359734

Support getting LOCAL LLM API KEY from environment 5f62773

Support Tools b9aed24

remove verbose log 1cf4cc4

Support RAG 93d7da6

Use mlock to force system to keep model in RAM 2ee09a5

Start Finetuned model first 38cf468

Stable version ef9997f

Add print and print flush 519f144

Temporary upload d304f41

Initial Version (Qwen2.5-7B-Instruct-1M-q4_k_m Original & Finetuned; Chat Web UI) b77991a

Use HuggingFace persisent storage

c7f1f5a

Change the way to get cpu count

8d33281

Fix error when starting streamlit; start original first

2cc9f75

Add threads dynamically

3359734

Support getting LOCAL LLM API KEY from environment

5f62773

Support Tools

b9aed24

remove verbose log

1cf4cc4

Support RAG

93d7da6

Use mlock to force system to keep model in RAM

2ee09a5

Start Finetuned model first

38cf468

Stable version

ef9997f

Add print and print flush

519f144

Temporary upload

d304f41

Initial Version (Qwen2.5-7B-Instruct-1M-q4_k_m Original & Finetuned; Chat Web UI)

b77991a