consciousness

History

Kent Overstreet 49ccdf87e1 Add vllm provisioning script for RunPod GPU instances Sets up vllm with Qwen 2.5 27B Instruct, prefix caching enabled, Hermes tool call parser for function calling support. Configurable via environment variables (MODEL, PORT, MAX_MODEL_LEN). Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-03-18 23:13:04 -04:00
..
provision-vllm.sh	Add vllm provisioning script for RunPod GPU instances	2026-03-18 23:13:04 -04:00

Kent Overstreet 49ccdf87e1 Add vllm provisioning script for RunPod GPU instances

Sets up vllm with Qwen 2.5 27B Instruct, prefix caching enabled,
Hermes tool call parser for function calling support. Configurable
via environment variables (MODEL, PORT, MAX_MODEL_LEN).

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

2026-03-18 23:13:04 -04:00

provision-vllm.sh

Add vllm provisioning script for RunPod GPU instances

2026-03-18 23:13:04 -04:00