https://huggingface.co/docs/text-generation-inference/basic_tutorials/consuming_tgi
Text Generation Inference
2024/7/18 22:39:00
podman run --gpus all --shm-size 1g -p 9389:80 -v $PWD/data:/data ghcr.io/huggingface/text-generation-inference:2.1.1 --model-id Qwen/Qwen2-0.5B
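
Once the container is up, the server can be queried over HTTP. The following is a minimal sketch, not taken from the linked page verbatim: it assumes the server is reachable on localhost at the host port 9389 mapped in the command above, and that it exposes TGI's `/generate` route. The names `TGI_URL`, `build_payload`, and `generate` are hypothetical helpers introduced only for illustration.

```python
import json
import urllib.request

# Assumption: the container from the command above is running locally,
# with host port 9389 mapped to the server's port 80.
TGI_URL = "http://localhost:9389/generate"


def build_payload(prompt: str, max_new_tokens: int = 20) -> dict:
    """Build the JSON body for the /generate route."""
    return {"inputs": prompt, "parameters": {"max_new_tokens": max_new_tokens}}


def generate(prompt: str, max_new_tokens: int = 20, url: str = TGI_URL) -> str:
    """POST the prompt to the server and return the generated text."""
    body = json.dumps(build_payload(prompt, max_new_tokens)).encode("utf-8")
    request = urllib.request.Request(
        url,
        data=body,
        headers={"Content-Type": "application/json"},
        method="POST",
    )
    with urllib.request.urlopen(request, timeout=60) as response:
        return json.loads(response.read())["generated_text"]


if __name__ == "__main__":
    # Qwen/Qwen2-0.5B is a base model, so a plain completion prompt is used.
    print(generate("Deep learning is"))
```

Only the standard library is used here so the sketch runs without extra packages; the first request may take a while if the model weights are still downloading into the mounted data directory.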