fix(bindings/go): also look into stubs when CUBLAS is enabled #1973

mudler · 2024-03-18T11:30:49Z

In some CUDA installations libcuda.so is present only in the stubs folders.

Example log:

root@76d08ab315dc:/build/sources/whisper.cpp# WHISPER_CUBLAS=1 make -j libwhisper.so                                                                                                                                                         
I whisper.cpp build info:                                                                                                                                                                                                                    
I UNAME_S:  Linux                                                                                                                                                                                                                            
I UNAME_P:  x86_64                                                                                                                                                                                                                           
I UNAME_M:  x86_64                                                                                                                                                                                                                           
I CFLAGS:   -I.              -O3 -DNDEBUG -std=c11   -fPIC -D_XOPEN_SOURCE=600 -D_GNU_SOURCE -pthread -mavx -mavx2 -mfma -mf16c -msse3 -mssse3 -DGGML_USE_CUBLAS -I/usr/local/cuda/include -I/opt/cuda/include -I/targets/x86_64-linux/includ
e                                                                                                                                                                                                                                            
I CXXFLAGS: -I. -I./examples -O3 -DNDEBUG -std=c++11 -fPIC -D_XOPEN_SOURCE=600 -D_GNU_SOURCE -pthread -mavx -mavx2 -mfma -mf16c -msse3 -mssse3 -DGGML_USE_CUBLAS -I/usr/local/cuda/include -I/opt/cuda/include -I/targets/x86_64-linux/includ
e                                                                                                                                                                                                                                            
I LDFLAGS:  -lcuda -lcublas -lculibos -lcudart -lcublasLt -lpthread -ldl -lrt -L/usr/local/cuda/lib64 -L/opt/cuda/lib64 -L/targets/x86_64-linux/lib -L/usr/lib/wsl/lib                                                                       
I CC:       cc (Ubuntu 11.4.0-1ubuntu1~22.04) 11.4.0                                                                                                                                                                                         
I CXX:      g++ (Ubuntu 11.4.0-1ubuntu1~22.04) 11.4.0                                                                                                                                                                                        
                                                                                                                                                                                                                                             
nvcc --forward-unknown-to-host-compiler -arch=native -I. -I./examples -O3 -DNDEBUG -std=c++11 -fPIC -D_XOPEN_SOURCE=600 -D_GNU_SOURCE -pthread -mavx -mavx2 -mfma -mf16c -msse3 -mssse3 -DGGML_USE_CUBLAS -I/usr/local/cuda/include -I/opt/cu
da/include -I/targets/x86_64-linux/include -Wno-pedantic -c ggml-cuda.cu -o ggml-cuda.o                                                                                                                                                      
cc  -I.              -O3 -DNDEBUG -std=c11   -fPIC -D_XOPEN_SOURCE=600 -D_GNU_SOURCE -pthread -mavx -mavx2 -mfma -mf16c -msse3 -mssse3 -DGGML_USE_CUBLAS -I/usr/local/cuda/include -I/opt/cuda/include -I/targets/x86_64-linux/include   -c g
gml.c -o ggml.o                                                                                                                                                                                                                              
cc  -I.              -O3 -DNDEBUG -std=c11   -fPIC -D_XOPEN_SOURCE=600 -D_GNU_SOURCE -pthread -mavx -mavx2 -mfma -mf16c -msse3 -mssse3 -DGGML_USE_CUBLAS -I/usr/local/cuda/include -I/opt/cuda/include -I/targets/x86_64-linux/include   -c g
gml-alloc.c -o ggml-alloc.o                                                                                           
cc  -I.              -O3 -DNDEBUG -std=c11   -fPIC -D_XOPEN_SOURCE=600 -D_GNU_SOURCE -pthread -mavx -mavx2 -mfma -mf16c -msse3 -mssse3 -DGGML_USE_CUBLAS -I/usr/local/cuda/include -I/opt/cuda/include -I/targets/x86_64-linux/include   -c g
gml-backend.c -o ggml-backend.o                                                                                       
cc  -I.              -O3 -DNDEBUG -std=c11   -fPIC -D_XOPEN_SOURCE=600 -D_GNU_SOURCE -pthread -mavx -mavx2 -mfma -mf16c -msse3 -mssse3 -DGGML_USE_CUBLAS -I/usr/local/cuda/include -I/opt/cuda/include -I/targets/x86_64-linux/include   -c g
gml-quants.c -o ggml-quants.o                                                                                                                                                                                                                
g++ -I. -I./examples -O3 -DNDEBUG -std=c++11 -fPIC -D_XOPEN_SOURCE=600 -D_GNU_SOURCE -pthread -mavx -mavx2 -mfma -mf16c -msse3 -mssse3 -DGGML_USE_CUBLAS -I/usr/local/cuda/include -I/opt/cuda/include -I/targets/x86_64-linux/include -c whi
sper.cpp -o whisper.o                                                                                                                                                                                                                        
nvcc warning : Cannot find valid GPU for '-arch=native', default arch is used                                                                                                                                                                
g++ -I. -I./examples -O3 -DNDEBUG -std=c++11 -fPIC -D_XOPEN_SOURCE=600 -D_GNU_SOURCE -pthread -mavx -mavx2 -mfma -mf16c -msse3 -mssse3 -DGGML_USE_CUBLAS -I/usr/local/cuda/include -I/opt/cuda/include -I/targets/x86_64-linux/include -share
d -o libwhisper.so ggml-cuda.o ggml.o ggml-alloc.o ggml-backend.o ggml-quants.o whisper.o -lcuda -lcublas -lculibos -lcudart -lcublasLt -lpthread -ldl -lrt -L/usr/local/cuda/lib64 -L/opt/cuda/lib64 -L/targets/x86_64-linux/lib -L/usr/lib/
wsl/lib                                                                                                                                                                                                                                      
/usr/bin/ld: cannot find -lcuda: No such file or directory                                                            
collect2: error: ld returned 1 exit status                                                                                                                                                                                                   
make: *** [Makefile:374: libwhisper.so] Error 1

Try to fix: #155 (or at least it does, for me and LocalAI)

Also update the docs to mention out that it might be needed to tweak CGO_LDFLAGS during building of the golang binary (if using e.g. libwhisper.a).

Signed-off-by: mudler <mudler@localai.io>

mudler mentioned this pull request Mar 18, 2024

deps(whisper.cpp): update, fix cublas build mudler/LocalAI#1846

Merged

1 task

bindings/go: also look into stubs when CUBLAS is enabled

255431f

Signed-off-by: mudler <mudler@localai.io>

mudler force-pushed the go_binding_linking branch from af1b301 to 255431f Compare March 18, 2024 11:33

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix(bindings/go): also look into stubs when CUBLAS is enabled #1973

fix(bindings/go): also look into stubs when CUBLAS is enabled #1973

mudler commented Mar 18, 2024 •

edited

fix(bindings/go): also look into stubs when CUBLAS is enabled #1973

Are you sure you want to change the base?

fix(bindings/go): also look into stubs when CUBLAS is enabled #1973

Conversation

mudler commented Mar 18, 2024 • edited

mudler commented Mar 18, 2024 •

edited