Replies: 2 comments 3 replies
-
Without hardware acceleration, e.g. CUDA, Metal, BLAS, etc., yes, that's a totally expected amount of time. |
Beta Was this translation helpful? Give feedback.
3 replies
-
Cloud service providers often engage in oversubscription. Although they advertise it as a 4-core processor, in reality, it consists of 4 threads and competes with other users for resources. Therefore, this speed may be considered normal. I suggest you run Geekbench once to test the performance and see how it performs. |
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
Greetings,
I am trying whisper.cpp because I do not have a GPU on my dreamhost web server, which has 4 vCPUs and 8GB RAM. I believe I have everything installed and I'm using base.en as the model. Here are the steps I've taken.
Step 1) Downloaded a random 4 minute Presidential speech from Wikipedia and upload it to my server: https://upload.wikimedia.org/wikipedia/commons/9/99/Confidence_in_Government_%28James_M._Cox%29.ogg
Step 2) Convert .ogg file to 16Khz .wav file using FFMPEG (assuming whisper.cpp can only work on 16Khz .wav files??):
ffmpeg -i speech.ogg -ar 16000 speech.wav
Step 3) Run the following whisper.cpp command:
I added "--no-fallback true --max-context 0" to command after viewing a separate post that suggest that this might speed things up, but it made no difference on our end.
I noticed in the output above when running the command: "error: input file not found 'true'", however the file is found and is transcribing correctly, but extremely long.
Let me know if I'm doing something wrong, or if 50 minutes is actually an expected time for transcribing 4 minute video.
Thanks
Beta Was this translation helpful? Give feedback.
All reactions