You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
I think you are missing the --n-gpu-layers option so that model layers are offloaded to the GPU:
-ngl N, --n-gpu-layers N: When compiled with GPU support, this option allows offloading some layers to the GPU for computation. Generally results in increased performance.
Name and Version
version: 4393 (d79d8f3)
built with x86_64-conda-linux-gnu-cc (conda-forge gcc 14.2.0-1) 14.2.0 for x86_64-conda-linux-gnu
Operating systems
Linux
GGML backends
CUDA
Hardware
NVIDIA GeForce RTX 4090
Models
Qwen2-VL-7B-Instruct-Q5_K_M.gguf
Problem description & steps to reproduce
I use the following command:
observe
How to load clip_model_load to CUDA
First Bad Commit
No response
Relevant log output
The text was updated successfully, but these errors were encountered: