Text Generation
Transformers
Safetensors
PyTorch
llama
facebook
meta
llama-3
text-generation-inference
Instructions to use meta-llama/Llama-3.2-1B with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use meta-llama/Llama-3.2-1B with Transformers:
# Use a pipeline as a high-level helper
from transformers import pipeline

pipe = pipeline("text-generation", model="meta-llama/Llama-3.2-1B")

# Load model directly
from transformers import AutoTokenizer, AutoModelForCausalLM

tokenizer = AutoTokenizer.from_pretrained("meta-llama/Llama-3.2-1B")
model = AutoModelForCausalLM.from_pretrained("meta-llama/Llama-3.2-1B")
- Inference
- Notebooks
- Google Colab
- Kaggle
- Local Apps
- vLLM
How to use meta-llama/Llama-3.2-1B with vLLM:
Install from pip and serve model
# Install vLLM from pip:
pip install vllm

# Start the vLLM server:
vllm serve "meta-llama/Llama-3.2-1B"

# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:8000/v1/completions" \
  -H "Content-Type: application/json" \
  --data '{
    "model": "meta-llama/Llama-3.2-1B",
    "prompt": "Once upon a time,",
    "max_tokens": 512,
    "temperature": 0.5
  }'
Use Docker
docker model run hf.co/meta-llama/Llama-3.2-1B
- SGLang
How to use meta-llama/Llama-3.2-1B with SGLang:
Install from pip and serve model
# Install SGLang from pip:
pip install sglang

# Start the SGLang server:
python3 -m sglang.launch_server \
  --model-path "meta-llama/Llama-3.2-1B" \
  --host 0.0.0.0 \
  --port 30000

# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:30000/v1/completions" \
  -H "Content-Type: application/json" \
  --data '{
    "model": "meta-llama/Llama-3.2-1B",
    "prompt": "Once upon a time,",
    "max_tokens": 512,
    "temperature": 0.5
  }'
Use Docker images
docker run --gpus all \
  --shm-size 32g \
  -p 30000:30000 \
  -v ~/.cache/huggingface:/root/.cache/huggingface \
  --env "HF_TOKEN=<secret>" \
  --ipc=host \
  lmsysorg/sglang:latest \
  python3 -m sglang.launch_server \
    --model-path "meta-llama/Llama-3.2-1B" \
    --host 0.0.0.0 \
    --port 30000

# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:30000/v1/completions" \
  -H "Content-Type: application/json" \
  --data '{
    "model": "meta-llama/Llama-3.2-1B",
    "prompt": "Once upon a time,",
    "max_tokens": 512,
    "temperature": 0.5
  }'
- Docker Model Runner
How to use meta-llama/Llama-3.2-1B with Docker Model Runner:
docker model run hf.co/meta-llama/Llama-3.2-1B
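The curl calls shown for vLLM (port 8000) and SGLang (port 30000) can also be issued from Python. The following is a minimal sketch using only the standard library. The helper names `build_completion_request` and `complete` are hypothetical, and the code assumes the server returns the usual OpenAI-style completions response, with the generated text under `choices[0].text`.

```python
import json
import urllib.request


def build_completion_request(base_url, model, prompt, max_tokens=512, temperature=0.5):
    """Build a POST request for an OpenAI-compatible /v1/completions endpoint.

    The payload mirrors the JSON body of the curl examples above.
    """
    payload = {
        "model": model,
        "prompt": prompt,
        "max_tokens": max_tokens,
        "temperature": temperature,
    }
    return urllib.request.Request(
        url=base_url.rstrip("/") + "/v1/completions",
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def complete(base_url, model, prompt, **kwargs):
    """Send the request and return the first completion's text.

    Assumption: the response follows the OpenAI completions shape,
    i.e. {"choices": [{"text": ...}, ...]}.
    """
    request = build_completion_request(base_url, model, prompt, **kwargs)
    with urllib.request.urlopen(request) as response:
        body = json.load(response)
    return body["choices"][0]["text"]


# Example call (requires a server started as shown above):
#   complete("http://localhost:8000", "meta-llama/Llama-3.2-1B", "Once upon a time,")
# For SGLang, use "http://localhost:30000" as the base URL instead.
```

Building the request separately from sending it keeps the payload easy to inspect without a running server.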
Access request rejected – could you please reset my request?
#359 opened 15 days ago by feitianyujian
Mine Gating Group Collection approvals on Hugging Face is still pending.
2
#358 opened 16 days ago by ajit-782
Access request rejected – please reset
#357 opened about 1 month ago by atw996
Access Request
#356 opened about 1 month ago by aurelius-1
[Apply for reset the access request which has been rejected]
#355 opened about 1 month ago by deleted
Access Request
#354 opened about 1 month ago by macrez181122
Access Request
#352 opened about 2 months ago by 1i1i1i
Access Request
#351 opened 2 months ago by Rebecca0876
Access request
1
#350 opened 2 months ago by pbozeman
course-project
#348 opened 2 months ago by Owos
Access-Request (National Taiwan University)
#347 opened 2 months ago by hankb09901
fix: set `clean_up_tokenization_spaces` to `false`
#346 opened 2 months ago by maxsloef
Access Request
#345 opened 2 months ago by dancinlife
Access Request
#344 opened 3 months ago by yaemikooO3
Access request
#343 opened 3 months ago by john-qq
Install & run meta-llama/Llama-3.2-1B easily using llmpm
#342 opened 3 months ago by sarthak-saxena
Access Request
#341 opened 3 months ago by xuyangwq
Access Request
#340 opened 3 months ago by mikumifa
testing
#339 opened 3 months ago by Terminator-2026
Access Request
1
#338 opened 3 months ago by limingwanggrant
Access request was rejected, accidentally not filling the "affiliate" section
#337 opened 3 months ago by Katariinahe
Access Application Re-review Request
#336 opened 3 months ago by Warfarin09
Access Rejection Reconsideration
#335 opened 3 months ago by sdjordjevicTT
Discussion with empty pull_request parameter test
#333 opened 4 months ago by 121tester
Reject
#332 opened 4 months ago by Wangliang11
Request: DOI
#331 opened 4 months ago by wanderingnomadd
Request Denied
#329 opened 6 months ago by w12-1
Reject
#328 opened 6 months ago by guangfawang
Request access for llama3.2 models
#327 opened 6 months ago by pha000
Request reject
#326 opened 6 months ago by zzc2313
Request Rejected
1
#325 opened 6 months ago by RB1998
Request: DOI
#324 opened 6 months ago by LunarLIN
request for help of access approval
#323 opened 6 months ago by fmyuan
Request for access after rejected
#322 opened 7 months ago by Ugine-16
Request Access
#321 opened 7 months ago by IIAlexII
Model access
#320 opened 7 months ago by acambece25
request the access permission
#319 opened 7 months ago by Stephen9969
Request Access
#318 opened 7 months ago by mcgravity
Add link to Neuron-optimized version
#316 opened 7 months ago by badaoui
Request access
#315 opened 7 months ago by cmy11a8b
Request: DOI
#314 opened 8 months ago by razkalian84
Request: DOI
#313 opened 8 months ago by mbort1
Request to revoke my access to the model
#312 opened 8 months ago by deleted
Your request to access model meta-llama/Llama-3.2-1B has been rejected by the repo's authors.
#311 opened 8 months ago by leisirone
Could you please help me grant pull permissions?
#310 opened 8 months ago by leisirone
Request: DOI
#308 opened 8 months ago by prateekrai436
Access Token?
#307 opened 8 months ago by stonedspirit
Re-request rejected access
#306 opened 8 months ago by SiwyTorpeda