fix building & UTF8 output with MSVC + SDL2 #2826

foldl · 2025-02-19T06:47:16Z

This PR could also solve the issue mentioned in #2804, in which, TDM64 (gcc compiler) is used because MSVC failed to build.

Note: example lsp talks to vim script through std IO, so not updated.

rabidcopy · 2025-02-20T02:09:11Z

#2804 is 99.99% just SDL2.dll not being with the binaries. So I'm a little confused by this PR's naming.

foldl · 2025-02-20T03:38:03Z

#2804 is 99.99% just SDL2.dll not being with the binaries. So I'm a little confused by this PR's naming.

With this PR, MSVC can be used to build with SDL2. This is an issue mentioned in #2804, in which, since MSVC failed to build, TDM64 (gcc compiler) is used.

ggerganov

The CI errors are unrelated to this change. If the errors persist, we can apply the same fix that we did in llama.cpp: ggml-org/llama.cpp#11895

tokyovigilante · 2025-02-20T09:10:17Z

Thanks, this fixes the whisper-stream build for me on Win10 with MSVC, but the app is exiting immediately after launch with no indication of why:

C:\Users\ryan\Speech\whisper.cpp>.\build\bin\Release\whisper-stream.exe -m .\ggml-large-v3.bin
init: found 2 capture devices:
init:    - Capture device #0: 'Headset Microphone (Logitech Mono H650e)'
init:    - Capture device #1: 'Headset (Ryan’s AirPods Pro - Find My Hands-Free AG Audio)'
init: attempt to open default capture device ...
init: obtained spec for input device (SDL Id = 2):
init:     - sample rate:       16000
init:     - format:            33056 (required: 33056)
init:     - channels:          1 (required: 1)
init:     - samples per frame: 1024
whisper_init_from_file_with_params ...
whisper_init_from_file_with_params_no_state: loading model from '.\ggml-large-v3.bin'
whisper_init_with_params_no_state: use gpu    = 1
whisper_init_with_params_no_state: flash attn = 0
whisper_init_with_params_no_state: gpu_device = 0
whisper_init_with_params_no_state: dtw        = 0
ggml_vulkan: Found 1 Vulkan devices:
ggml_vulkan: 0 = AMD Radeon RX 7900 XT (AMD proprietary driver) | uma: 0 | fp16: 1 | warp size: 64 | matrix cores: KHR_coopmat
whisper_init_with_params_no_state: devices    = 2
whisper_init_with_params_no_state: backends   = 2
whisper_model_load: loading model
whisper_model_load: n_vocab       = 51866
whisper_model_load: n_audio_ctx   = 1500
whisper_model_load: n_audio_state = 1280
whisper_model_load: n_audio_head  = 20
whisper_model_load: n_audio_layer = 32
whisper_model_load: n_text_ctx    = 448
whisper_model_load: n_text_state  = 1280
whisper_model_load: n_text_head   = 20
whisper_model_load: n_text_layer  = 32
whisper_model_load: n_mels        = 128
whisper_model_load: ftype         = 1
whisper_model_load: qntvr         = 0
whisper_model_load: type          = 5 (large v3)
whisper_model_load: adding 1609 extra tokens
whisper_model_load: n_langs       = 100

C:\Users\ryan\Speech\whisper.cpp>

foldl · 2025-02-20T09:16:20Z

@tokyovigilante You need to copy SDL2.dll to C:\Users\ryan\Speech\whisper.cpp\build\bin\Release, or add it to PATH.

tokyovigilante · 2025-02-20T09:18:49Z

Thanks for looking, I think this should be ok?

Directory of C:\Users\ryan\Speech\whisper.cpp\build\bin\Release

20/02/2025  10:05 pm    <DIR>          .
20/02/2025  10:05 pm    <DIR>          ..
20/02/2025  09:40 pm            27,136 bench.exe
20/02/2025  09:40 pm            27,136 command.exe
20/02/2025  09:40 pm           487,936 ggml-base.dll
20/02/2025  09:40 pm           322,048 ggml-cpu.dll
20/02/2025  09:40 pm        20,034,048 ggml-vulkan.dll
20/02/2025  09:40 pm            75,264 ggml.dll
20/02/2025  10:05 pm           184,832 lsp.exe
20/02/2025  09:40 pm            27,136 main.exe
20/02/2025  09:40 pm           107,520 quantize.exe
20/02/2025  09:11 pm         1,664,000 SDL2.dll
20/02/2025  09:40 pm            27,136 stream.exe
20/02/2025  10:05 pm           165,888 wchess.exe
20/02/2025  09:40 pm            20,480 whisper-bench.exe
20/02/2025  09:40 pm           183,808 whisper-cli.exe
20/02/2025  10:05 pm           167,936 whisper-command.exe
20/02/2025  10:05 pm           407,040 whisper-server.exe
20/02/2025  10:05 pm            81,920 whisper-stream.exe
20/02/2025  10:05 pm         1,200,128 whisper-talk-llama.exe
20/02/2025  09:40 pm           438,272 whisper.dll
              19 File(s)     25,649,664 bytes
               2 Dir(s)  33,750,409,216 bytes free

[EDIT]
Hmm, seems to run on Windows for me with the -ng flag, despite having the Vulkan SDK installed (7900 XT). Have just tried on Linux with the mainline code whiich works fine with my GPU. I really need to use this with a Windows app unfortunately, but this is clearly not the issue. Sorry for the noise.

Judd added 2 commits February 19, 2025 14:43

fix building with MSVC + SDL2

b3a6018

select utf8 codepage on windows

00ddb10

foldl changed the title ~~fix building with MSVC + SDL2~~ fix building & UTF8 output with MSVC + SDL2 Feb 19, 2025

ggerganov approved these changes Feb 20, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix building & UTF8 output with MSVC + SDL2 #2826

fix building & UTF8 output with MSVC + SDL2 #2826

foldl commented Feb 19, 2025 •

edited

Loading

rabidcopy commented Feb 20, 2025

foldl commented Feb 20, 2025 •

edited

Loading

ggerganov left a comment

tokyovigilante commented Feb 20, 2025

foldl commented Feb 20, 2025

tokyovigilante commented Feb 20, 2025 •

edited

Loading

fix building & UTF8 output with MSVC + SDL2 #2826

Are you sure you want to change the base?

fix building & UTF8 output with MSVC + SDL2 #2826

Conversation

foldl commented Feb 19, 2025 • edited Loading

rabidcopy commented Feb 20, 2025

foldl commented Feb 20, 2025 • edited Loading

ggerganov left a comment

Choose a reason for hiding this comment

tokyovigilante commented Feb 20, 2025

foldl commented Feb 20, 2025

tokyovigilante commented Feb 20, 2025 • edited Loading

foldl commented Feb 19, 2025 •

edited

Loading

foldl commented Feb 20, 2025 •

edited

Loading

tokyovigilante commented Feb 20, 2025 •

edited

Loading