-
I'm using whisper.cpp with great results on an AMD GPU:
My questions are:
I'm a bit confused about which flags are optimal for this case: an AMD GPU and an Intel CPU. Thanks!
-
I have it set to 4 threads whether I run on CPU or GPU, but I don't think the thread count has much impact when the GPU is used. You could try timing a few runs to check. Since the 1.5.0 release and its updated CUDA support, CPU usage is minimal. I don't think there is a way to split the work across both CPU and GPU; with the library or the program it's one or the other.
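If you want to try the timing idea, here is a rough sketch of how it could be scripted. It assumes the whisper.cpp CLI is built as `./main` and accepts `-m`, `-f` and `-t` for model, audio file and thread count; the binary name and both paths are placeholders for your own setup, and the helper names are made up for this example.

```python
import statistics
import subprocess
import time


def build_cmd(threads, binary="./main",
              model="models/ggml-base.en.bin", audio="samples/jfk.wav"):
    """Build a whisper.cpp command line for a given thread count.

    The binary name and the model/audio paths are assumptions; adjust them.
    """
    return [binary, "-m", model, "-f", audio, "-t", str(threads)]


def time_command(cmd, runs=3):
    """Run cmd several times and return the median wall-clock time in seconds."""
    samples = []
    for _ in range(runs):
        start = time.perf_counter()
        subprocess.run(cmd, check=True,
                       stdout=subprocess.DEVNULL, stderr=subprocess.DEVNULL)
        samples.append(time.perf_counter() - start)
    return statistics.median(samples)


def compare_thread_counts(thread_counts, make_cmd=build_cmd, runs=3):
    """Map each thread count to its median run time."""
    return {n: time_command(make_cmd(n), runs=runs) for n in thread_counts}
```

Calling `compare_thread_counts([1, 2, 4, 8])` returns a dict of median times per thread count. If the numbers barely move on a GPU build, that would support the guess that the thread count doesn't matter much there.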
-
It could become feasible in the future, once the scheduler is fully implemented in the `ggml` backend.

Starting from version `1.5.0`, the majority of the graph processing has been shifted to the GPU. As a result, the CPU threads spend most of their time idle, simply waiting for data from the GPU.

In the latest version of whisper.cpp, the CPU mainly performs two functions: it computes the log-mel spectrogram, and during sampling it determines the most suitable next token based on the model's output. For shorter audio files, setting the thread count to …
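To give a feel for how small the sampling step is, here is a simplified sketch of greedy token selection: pick the highest-scoring entry from the model's output for the current step. This is only an illustration, not whisper.cpp's actual code (which also offers beam search), and the function name and example values are made up.

```python
import math


def greedy_next_token(logits):
    """Return (token_id, probability) for the highest-scoring token.

    `logits` is a list of raw scores, one per vocabulary entry.
    """
    best = max(range(len(logits)), key=lambda i: logits[i])
    # Softmax probability of the chosen token, shifted by the peak for stability.
    peak = logits[best]
    total = sum(math.exp(x - peak) for x in logits)
    return best, 1.0 / total
```

For example, `greedy_next_token([0.1, 2.0, -1.0, 0.5])` selects token 1. The work is a single pass over the vocabulary per generated token, which is tiny compared with the graph evaluation that runs on the GPU.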