Text Generation
Transformers
Safetensors
German
English
mistral
Merge
mergekit
lazymergekit
flemmingmiguel/NeuDist-Ro-7B
johannhartmann/Brezn3
ResplendentAI/Flora_DPO_7B
conversational
text-generation-inference
Instructions to use cstr/Spaetzle-v8-7b with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use cstr/Spaetzle-v8-7b with Transformers:
# Use a pipeline as a high-level helper from transformers import pipeline pipe = pipeline("text-generation", model="cstr/Spaetzle-v8-7b") messages = [ {"role": "user", "content": "Who are you?"}, ] pipe(messages)# Load model directly from transformers import AutoTokenizer, AutoModelForCausalLM tokenizer = AutoTokenizer.from_pretrained("cstr/Spaetzle-v8-7b") model = AutoModelForCausalLM.from_pretrained("cstr/Spaetzle-v8-7b") messages = [ {"role": "user", "content": "Who are you?"}, ] inputs = tokenizer.apply_chat_template( messages, add_generation_prompt=True, tokenize=True, return_dict=True, return_tensors="pt", ).to(model.device) outputs = model.generate(**inputs, max_new_tokens=40) print(tokenizer.decode(outputs[0][inputs["input_ids"].shape[-1]:])) - Inference
- Notebooks
- Google Colab
- Kaggle
- Local Apps Settings
- vLLM
How to use cstr/Spaetzle-v8-7b with vLLM:
Install from pip and serve model
# Install vLLM from pip: pip install vllm # Start the vLLM server: vllm serve "cstr/Spaetzle-v8-7b" # Call the server using curl (OpenAI-compatible API): curl -X POST "http://localhost:8000/v1/chat/completions" \ -H "Content-Type: application/json" \ --data '{ "model": "cstr/Spaetzle-v8-7b", "messages": [ { "role": "user", "content": "What is the capital of France?" } ] }'Use Docker
docker model run hf.co/cstr/Spaetzle-v8-7b
- SGLang
How to use cstr/Spaetzle-v8-7b with SGLang:
Install from pip and serve model
# Install SGLang from pip: pip install sglang # Start the SGLang server: python3 -m sglang.launch_server \ --model-path "cstr/Spaetzle-v8-7b" \ --host 0.0.0.0 \ --port 30000 # Call the server using curl (OpenAI-compatible API): curl -X POST "http://localhost:30000/v1/chat/completions" \ -H "Content-Type: application/json" \ --data '{ "model": "cstr/Spaetzle-v8-7b", "messages": [ { "role": "user", "content": "What is the capital of France?" } ] }'Use Docker images
docker run --gpus all \ --shm-size 32g \ -p 30000:30000 \ -v ~/.cache/huggingface:/root/.cache/huggingface \ --env "HF_TOKEN=<secret>" \ --ipc=host \ lmsysorg/sglang:latest \ python3 -m sglang.launch_server \ --model-path "cstr/Spaetzle-v8-7b" \ --host 0.0.0.0 \ --port 30000 # Call the server using curl (OpenAI-compatible API): curl -X POST "http://localhost:30000/v1/chat/completions" \ -H "Content-Type: application/json" \ --data '{ "model": "cstr/Spaetzle-v8-7b", "messages": [ { "role": "user", "content": "What is the capital of France?" } ] }' - Docker Model Runner
How to use cstr/Spaetzle-v8-7b with Docker Model Runner:
docker model run hf.co/cstr/Spaetzle-v8-7b
Delete requirements.txt with huggingface_hub
Browse files- requirements.txt +0 -114
requirements.txt
DELETED
|
@@ -1,114 +0,0 @@
|
|
| 1 |
-
accelerate @ file:///home/conda/feedstock_root/build_artifacts/accelerate_1710334587919/work
|
| 2 |
-
aiohttp @ file:///croot/aiohttp_1707342283163/work
|
| 3 |
-
aiosignal @ file:///tmp/build/80754af9/aiosignal_1637843061372/work
|
| 4 |
-
appdirs==1.4.4
|
| 5 |
-
asttokens @ file:///home/conda/feedstock_root/build_artifacts/asttokens_1698341106958/work
|
| 6 |
-
attrs @ file:///croot/attrs_1695717823297/work
|
| 7 |
-
bitsandbytes==0.43.0
|
| 8 |
-
Bottleneck @ file:///croot/bottleneck_1707864210935/work
|
| 9 |
-
Brotli @ file:///work/ci_py311/brotli-split_1676830125088/work
|
| 10 |
-
cachetools==5.3.3
|
| 11 |
-
certifi @ file:///home/conda/feedstock_root/build_artifacts/certifi_1707022139797/work/certifi
|
| 12 |
-
cffi @ file:///croot/cffi_1700254295673/work
|
| 13 |
-
charset-normalizer @ file:///tmp/build/80754af9/charset-normalizer_1630003229654/work
|
| 14 |
-
click @ file:///croot/click_1698129812380/work
|
| 15 |
-
comm @ file:///home/conda/feedstock_root/build_artifacts/comm_1710320294760/work
|
| 16 |
-
datasets @ file:///home/conda/feedstock_root/build_artifacts/datasets_1709395865330/work
|
| 17 |
-
debugpy @ file:///croot/debugpy_1690905042057/work
|
| 18 |
-
decorator @ file:///home/conda/feedstock_root/build_artifacts/decorator_1641555617451/work
|
| 19 |
-
dill @ file:///croot/dill_1692271232022/work
|
| 20 |
-
docker-pycreds @ file:///Users/ktietz/demo/mc3/conda-bld/docker-pycreds_1630654474270/work
|
| 21 |
-
einops==0.7.0
|
| 22 |
-
exceptiongroup @ file:///home/conda/feedstock_root/build_artifacts/exceptiongroup_1704921103267/work
|
| 23 |
-
executing @ file:///home/conda/feedstock_root/build_artifacts/executing_1698579936712/work
|
| 24 |
-
filelock @ file:///croot/filelock_1700591183607/work
|
| 25 |
-
flash-attn==2.5.6
|
| 26 |
-
frozenlist @ file:///croot/frozenlist_1698702560391/work
|
| 27 |
-
fsspec==2023.4.0
|
| 28 |
-
gitdb @ file:///tmp/build/80754af9/gitdb_1617117951232/work
|
| 29 |
-
GitPython @ file:///croot/gitpython_1696936983078/work
|
| 30 |
-
gmpy2 @ file:///work/ci_py311/gmpy2_1676839849213/work
|
| 31 |
-
huggingface-hub @ file:///croot/huggingface_hub_1708634519519/work
|
| 32 |
-
idna @ file:///work/ci_py311/idna_1676822698822/work
|
| 33 |
-
importlib_metadata @ file:///home/conda/feedstock_root/build_artifacts/importlib-metadata_1709821103657/work
|
| 34 |
-
ipykernel @ file:///home/conda/feedstock_root/build_artifacts/ipykernel_1708996548741/work
|
| 35 |
-
ipython @ file:///home/conda/feedstock_root/build_artifacts/ipython_1709559745751/work
|
| 36 |
-
jedi @ file:///home/conda/feedstock_root/build_artifacts/jedi_1696326070614/work
|
| 37 |
-
Jinja2==3.1.2
|
| 38 |
-
jupyter_client @ file:///home/conda/feedstock_root/build_artifacts/jupyter_client_1710255804825/work
|
| 39 |
-
jupyter_core @ file:///home/conda/feedstock_root/build_artifacts/jupyter_core_1710257359434/work
|
| 40 |
-
MarkupSafe @ file:///croot/markupsafe_1704205993651/work
|
| 41 |
-
matplotlib-inline @ file:///home/conda/feedstock_root/build_artifacts/matplotlib-inline_1660814786464/work
|
| 42 |
-
mkl-fft @ file:///croot/mkl_fft_1695058164594/work
|
| 43 |
-
mkl-random @ file:///croot/mkl_random_1695059800811/work
|
| 44 |
-
mkl-service==2.4.0
|
| 45 |
-
mpmath @ file:///croot/mpmath_1690848262763/work
|
| 46 |
-
multidict @ file:///croot/multidict_1701096859099/work
|
| 47 |
-
multiprocess @ file:///croot/multiprocess_1692294385131/work
|
| 48 |
-
nest_asyncio @ file:///home/conda/feedstock_root/build_artifacts/nest-asyncio_1705850609492/work
|
| 49 |
-
networkx==3.2.1
|
| 50 |
-
ninja==1.11.1.1
|
| 51 |
-
numexpr @ file:///croot/numexpr_1696515281613/work
|
| 52 |
-
numpy @ file:///croot/numpy_and_numpy_base_1708638617955/work/dist/numpy-1.26.4-cp311-cp311-linux_x86_64.whl#sha256=5f96f274d410a1682519282ae769c877d32fdbf171aa8badec7bf5e1d3a1748a
|
| 53 |
-
nvidia-cublas-cu11==11.11.3.6
|
| 54 |
-
nvidia-cuda-cupti-cu11==11.8.87
|
| 55 |
-
nvidia-cuda-nvrtc-cu11==11.8.89
|
| 56 |
-
nvidia-cuda-runtime-cu11==11.8.89
|
| 57 |
-
nvidia-cudnn-cu11==8.7.0.84
|
| 58 |
-
nvidia-cufft-cu11==10.9.0.58
|
| 59 |
-
nvidia-curand-cu11==10.3.0.86
|
| 60 |
-
nvidia-cusolver-cu11==11.4.1.48
|
| 61 |
-
nvidia-cusparse-cu11==11.7.5.86
|
| 62 |
-
nvidia-ml-py==12.535.133
|
| 63 |
-
nvidia-nccl-cu11==2.19.3
|
| 64 |
-
nvidia-nvtx-cu11==11.8.86
|
| 65 |
-
nvitop==1.3.2
|
| 66 |
-
packaging @ file:///croot/packaging_1693575174725/work
|
| 67 |
-
pandas @ file:///croot/pandas_1709590491089/work/dist/pandas-2.2.1-cp311-cp311-linux_x86_64.whl#sha256=0a2793a31a0135a35735e1431d453a06186a3a7c607d9b441d9bd5f0fe4ded31
|
| 68 |
-
parso @ file:///home/conda/feedstock_root/build_artifacts/parso_1638334955874/work
|
| 69 |
-
pathtools @ file:///Users/ktietz/demo/mc3/conda-bld/pathtools_1629713893697/work
|
| 70 |
-
pexpect @ file:///home/conda/feedstock_root/build_artifacts/pexpect_1706113125309/work
|
| 71 |
-
pickleshare @ file:///home/conda/feedstock_root/build_artifacts/pickleshare_1602536217715/work
|
| 72 |
-
pillow==10.2.0
|
| 73 |
-
platformdirs @ file:///home/conda/feedstock_root/build_artifacts/platformdirs_1706713388748/work
|
| 74 |
-
prompt-toolkit @ file:///home/conda/feedstock_root/build_artifacts/prompt-toolkit_1702399386289/work
|
| 75 |
-
protobuf==3.20.3
|
| 76 |
-
psutil @ file:///work/ci_py311_2/psutil_1679337388738/work
|
| 77 |
-
ptyprocess @ file:///home/conda/feedstock_root/build_artifacts/ptyprocess_1609419310487/work/dist/ptyprocess-0.7.0-py2.py3-none-any.whl
|
| 78 |
-
pure-eval @ file:///home/conda/feedstock_root/build_artifacts/pure_eval_1642875951954/work
|
| 79 |
-
pyarrow @ file:///croot/pyarrow_1707330824290/work/python
|
| 80 |
-
pyarrow-hotfix @ file:///home/conda/feedstock_root/build_artifacts/pyarrow-hotfix_1700596371886/work
|
| 81 |
-
pycparser @ file:///tmp/build/80754af9/pycparser_1636541352034/work
|
| 82 |
-
Pygments @ file:///home/conda/feedstock_root/build_artifacts/pygments_1700607939962/work
|
| 83 |
-
PySocks @ file:///work/ci_py311/pysocks_1676822712504/work
|
| 84 |
-
python-dateutil @ file:///tmp/build/80754af9/python-dateutil_1626374649649/work
|
| 85 |
-
pytz @ file:///croot/pytz_1695131579487/work
|
| 86 |
-
PyYAML @ file:///croot/pyyaml_1698096049011/work
|
| 87 |
-
pyzmq @ file:///croot/pyzmq_1705605076900/work
|
| 88 |
-
regex @ file:///croot/regex_1696515298636/work
|
| 89 |
-
requests @ file:///croot/requests_1707355572290/work
|
| 90 |
-
safetensors @ file:///croot/safetensors_1708633833937/work
|
| 91 |
-
sentry-sdk @ file:///work/ci_py311/sentry-sdk_1676862120883/work
|
| 92 |
-
setproctitle @ file:///work/ci_py311/setproctitle_1676838789127/work
|
| 93 |
-
six @ file:///tmp/build/80754af9/six_1644875935023/work
|
| 94 |
-
smmap @ file:///tmp/build/80754af9/smmap_1611694433573/work
|
| 95 |
-
stack-data @ file:///home/conda/feedstock_root/build_artifacts/stack_data_1669632077133/work
|
| 96 |
-
sympy @ file:///croot/sympy_1701397643339/work
|
| 97 |
-
termcolor==2.4.0
|
| 98 |
-
tokenizers @ file:///croot/tokenizers_1708633814160/work
|
| 99 |
-
torch==2.2.1+cu118
|
| 100 |
-
torchaudio==2.2.1+cu118
|
| 101 |
-
torchvision==0.17.1+cu118
|
| 102 |
-
tornado @ file:///croot/tornado_1696936946304/work
|
| 103 |
-
tqdm @ file:///croot/tqdm_1679561862951/work
|
| 104 |
-
traitlets @ file:///home/conda/feedstock_root/build_artifacts/traitlets_1710254411456/work
|
| 105 |
-
transformers @ file:///home/conda/feedstock_root/build_artifacts/transformers_1709308155748/work
|
| 106 |
-
triton==2.2.0
|
| 107 |
-
typing_extensions==4.8.0
|
| 108 |
-
tzdata @ file:///croot/python-tzdata_1690578112552/work
|
| 109 |
-
urllib3 @ file:///croot/urllib3_1707770551213/work
|
| 110 |
-
wandb @ file:///home/conda/feedstock_root/build_artifacts/wandb_1707246480133/work
|
| 111 |
-
wcwidth @ file:///home/conda/feedstock_root/build_artifacts/wcwidth_1704731205417/work
|
| 112 |
-
xxhash @ file:///work/ci_py311/python-xxhash_1676842384694/work
|
| 113 |
-
yarl @ file:///croot/yarl_1701105127787/work
|
| 114 |
-
zipp @ file:///home/conda/feedstock_root/build_artifacts/zipp_1695255097490/work
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|