show benchmarks #5

laralove143 · 2024-01-31T19:28:36Z

the advantage of this project is that it uses CoreML for a performance gain, so showing benchmarks would solidify how much this advantage is

atiorh · 2024-01-31T22:49:35Z

Hi @laralove143, please give https://www.takeargmax.com/blog/whisperkit a read for the value proposition of WhisperKit.
That being said, performance is definitely a big part and we are working on a "Performance Benchmark Tab" in the example app. Will follow up here shortly.

laralove143 · 2024-02-01T02:52:22Z

that blog is very useful maybe it could be shown more clearly in the readme, for example talking about its contents

alternatively, some stuff from the blog could be included in the readme as well, like that demo video is very useful

atiorh · 2024-02-01T07:21:12Z

Thanks for the feedback! We will think about a better way to organize information about WhisperKit that is more accessible. We will definitely flesh out the README and docs more before stable release.

ZachNagengast · 2024-02-16T22:46:25Z

Tracking this here: #28

aehlke · 2024-02-23T21:10:09Z

My understanding from running llama.cpp on iOS/macOS (via Swift, including streaming) is that Metal is faster than CoreML or Metal+CoreML. There may be some other benefits to using CoreML. Maybe battery? I don't know myself

atiorh · 2024-03-03T01:14:09Z

Metal is faster than CoreML or Metal+CoreML

This is certainly possible in specific cases but can not be a generally true statement. For context, WhisperKit is currently tuned for mobile and lower-end Macs where the Neural Engine is much more powerful with respect to the GPU (that Metal can harness) and Core ML is the primary framework for deploying to the Neural Engine. That being said, we are actively working on a Metal backend to complement the Core ML backend.

aehlke · 2024-03-03T02:10:59Z

Thanks for the context. Would be great to see a benchmark later. I recall the Metal-only whispercpp being faster even on lower spec devices such as iPhone but can't find the numbers at the moment.

aehlke · 2024-03-03T02:17:16Z

Here are some numbers I hadn't seen before showing far better Metal performance on an iPhone using Metal instead of CoreML. So it looks like it holds true for mobile...

https://www.bjnortier.com/2023/11/17/Hello-Transcribe-3.2.html

It mentions some other downsides to CoreML such as the slow caching step and unpredictable cache ejection by the OS

ZachNagengast added feature New feature or request triaged This issue has been looked at and prioritized by a maintainer labels Feb 16, 2024

ZachNagengast mentioned this issue Feb 16, 2024

Benchmark for WhisperAX & CLI #28

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

show benchmarks #5

show benchmarks #5

laralove143 commented Jan 31, 2024

atiorh commented Jan 31, 2024 •

edited

laralove143 commented Feb 1, 2024

atiorh commented Feb 1, 2024

ZachNagengast commented Feb 16, 2024

aehlke commented Feb 23, 2024 •

edited

atiorh commented Mar 3, 2024 •

edited

aehlke commented Mar 3, 2024

aehlke commented Mar 3, 2024 •

edited

show benchmarks #5

show benchmarks #5

Comments

laralove143 commented Jan 31, 2024

atiorh commented Jan 31, 2024 • edited

laralove143 commented Feb 1, 2024

atiorh commented Feb 1, 2024

ZachNagengast commented Feb 16, 2024

aehlke commented Feb 23, 2024 • edited

atiorh commented Mar 3, 2024 • edited

aehlke commented Mar 3, 2024

aehlke commented Mar 3, 2024 • edited

atiorh commented Jan 31, 2024 •

edited

aehlke commented Feb 23, 2024 •

edited

atiorh commented Mar 3, 2024 •

edited

aehlke commented Mar 3, 2024 •

edited