Feature/fix metal #148

hlhr202 · 2024-05-06T17:49:51Z

This is a naive fix for Metal inference and the log trampoline.

sys/build.rs

hlhr202 · 2024-05-16T06:10:38Z

@tazz4843 reformatted code

tazz4843 · 2024-05-16T15:54:39Z

See review comments

hlhr202 · 2024-05-17T02:52:15Z

@tazz4843 Hey, I have found a possible new approach here.

whisper.cpp now supports a WHISPER_METAL_EMBED_LIBRARY build option that lets us embed the Metal library as a string in the build output. But we need to upgrade the whisper.cpp branch to a newer version.
See the CMakeLists in whisper.cpp here:
https://github.com/ggerganov/whisper.cpp/blob/08981d1bacbe494ff1c943af6c577c669a2d9f4d/CMakeLists.txt#L78C12-L78C39
ggerganov/whisper.cpp#2110

Maybe we should finalize #142 first, and then I can start implementing a new build option here?

tazz4843 · 2024-05-28T22:03:22Z

#142 has been merged, took me super long, sorry about that. Embedding the library is a much better idea imo and we should favour that.

thewh1teagle · 2024-05-28T23:14:54Z

I tried it using WHISPER_METAL_EMBED_LIBRARY=ON. It works: I can see in the logs that the Metal framework is loaded (and without the option it isn't), and it runs much faster.
I believe we should enable WHISPER_METAL_EMBED_LIBRARY by default if the metal feature is enabled.
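
A minimal sketch of what "enable it by default when the metal feature is on" could look like in a build script. This is not the code from this PR: the helper name `cmake_defines` is hypothetical, and `WHISPER_METAL` is assumed to be the CMake switch that turns the Metal backend on. `CARGO_FEATURE_METAL` is the environment variable Cargo sets for build scripts when a feature named `metal` is active.

```rust
use std::env;

/// Hypothetical helper: returns the CMake definitions to pass to the
/// whisper.cpp build for a given state of the `metal` Cargo feature.
fn cmake_defines(metal_enabled: bool) -> Vec<(&'static str, &'static str)> {
    let mut defines = Vec::new();
    if metal_enabled {
        // Assumed name of the switch that enables the Metal backend.
        defines.push(("WHISPER_METAL", "ON"));
        // Embed the Metal library in the build output so nothing has to be
        // copied next to the final binary.
        defines.push(("WHISPER_METAL_EMBED_LIBRARY", "ON"));
    } else {
        defines.push(("WHISPER_METAL", "OFF"));
    }
    defines
}

fn main() {
    // Cargo exposes enabled features to build scripts as CARGO_FEATURE_<NAME>.
    let metal = env::var_os("CARGO_FEATURE_METAL").is_some();
    for (key, value) in cmake_defines(metal) {
        // A real build script would forward these to the CMake invocation;
        // here they are only printed.
        println!("-D{key}={value}");
    }
}
```

Keeping the flag selection in a plain function makes it easy to check without a macOS machine, which matters when the maintainer cannot test Metal locally.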

hlhr202 · 2024-05-29T03:37:34Z

Updated. Please help check the new build config. I've also updated the Metal log callback setup function.
@tazz4843
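
For readers unfamiliar with the term, a log trampoline is a C-ABI function that receives log lines from native code and forwards them to Rust. The sketch below only illustrates the pattern and is not the code from this PR: the callback signature, the `LogLevel` alias, and both function names are assumptions made for the example.

```rust
use std::ffi::{c_char, c_void, CStr, CString};

// Hypothetical stand-in for the C library's log level type.
type LogLevel = u32;

/// Trampoline with a C ABI: native code would call this with a
/// NUL-terminated message and the user-data pointer given at registration.
unsafe extern "C" fn log_trampoline(
    level: LogLevel,
    text: *const c_char,
    user_data: *mut c_void,
) {
    if text.is_null() || user_data.is_null() {
        return;
    }
    // SAFETY: the caller guarantees `user_data` points to a live Vec<String>
    // and `text` points to a valid NUL-terminated string.
    let sink = unsafe { &mut *(user_data as *mut Vec<String>) };
    let message = unsafe { CStr::from_ptr(text) }.to_string_lossy();
    sink.push(format!("[{level}] {}", message.trim_end()));
}

/// Hypothetical driver that plays the role of the native library by
/// invoking the trampoline once, then returns what was collected.
fn collect_via_trampoline(level: LogLevel, message: &str) -> Vec<String> {
    let mut sink: Vec<String> = Vec::new();
    let c_message = CString::new(message).expect("message contains no NUL byte");
    unsafe {
        log_trampoline(
            level,
            c_message.as_ptr(),
            &mut sink as *mut Vec<String> as *mut c_void,
        );
    }
    sink
}

fn main() {
    let lines = collect_via_trampoline(2, "whisper_backend_init: using Metal backend\n");
    println!("{lines:?}");
}
```

In a real binding the trampoline would be registered once with the native library's log-setter function instead of being called directly.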

tazz4843 · 2024-05-30T20:43:51Z

I don't have macOS to test on, will wait for a positive test from someone with macOS before merging

hlhr202 · 2024-05-31T02:39:57Z

> I don't have macOS to test on, will wait for a positive test from someone with macOS before merging

I have tested it myself since I'm using it for a private project. But it's okay to wait until one more tester confirms it.

uohzxela · 2024-06-03T04:40:33Z

I have tested @hlhr202's latest changes by adding whisper-rs = { git = "https://github.com/hlhr202/whisper-rs.git", branch = "feature/fix-metal", features = ["metal"] } to my Cargo.toml and it works on my M1 Pro.

whisper_init_with_params_no_state: use gpu    = 1
whisper_init_with_params_no_state: flash attn = 0
whisper_init_with_params_no_state: gpu_device = 0
whisper_init_with_params_no_state: dtw        = 0
whisper_model_load: loading model
whisper_model_load: n_vocab       = 51864
whisper_model_load: n_audio_ctx   = 1500
whisper_model_load: n_audio_state = 768
whisper_model_load: n_audio_head  = 12
whisper_model_load: n_audio_layer = 12
whisper_model_load: n_text_ctx    = 448
whisper_model_load: n_text_state  = 768
whisper_model_load: n_text_head   = 12
whisper_model_load: n_text_layer  = 12
whisper_model_load: n_mels        = 80
whisper_model_load: ftype         = 1
whisper_model_load: qntvr         = 0
whisper_model_load: type          = 3 (small)
whisper_model_load: adding 1607 extra tokens
whisper_model_load: n_langs       = 99
whisper_backend_init: using Metal backend
whisper_model_load:    Metal total size =   487.00 MB
whisper_model_load: model size    =  487.00 MB
whisper_backend_init: using Metal backend
whisper_init_state: kv self size  =   56.62 MB
whisper_init_state: kv cross size =   56.62 MB
whisper_init_state: kv pad  size  =    4.72 MB
whisper_init_state: compute buffer (conv)   =   22.54 MB
whisper_init_state: compute buffer (encode) =  284.81 MB
whisper_init_state: compute buffer (cross)  =    6.31 MB
whisper_init_state: compute buffer (decode) =   97.40 MB

Would be great if we can merge this, and thanks @hlhr202 for your fix!

hlhr202 added 3 commits May 7, 2024 00:52

fix: metal

4bc5709

try fix metal log

a440e7c

try fix metal log

cd6a633

hlhr202 mentioned this pull request May 6, 2024

Unable to use Metal feature on Mac M1 Max (32 GB) #108

Open

tazz4843 reviewed May 15, 2024

fix: fmt

bf5a08d

Merge remote-tracking branch 'origin/master' into feature/fix-metal

357e122

hlhr202 added 2 commits May 29, 2024 10:25

Merge remote-tracking branch 'origin/master' into feature/fix-metal

3f27c17

optim: use build config instead of copying metal file to target folder

8872109

tazz4843 merged commit f1030ef into tazz4843:master Jun 3, 2024
13 checks passed