The sentence is too long 句子过长 #727

Open
Liu8Can opened this issue Apr 23, 2024 · 4 comments
Comments

@Liu8Can

Liu8Can commented Apr 23, 2024

I really like Buzz, which is built directly on Whisper. It has helped me a lot, and I am very grateful to the cool developer!
The subtitles I get from Buzz's transcribe function are too long. The model I used is whisper-small, the language is English, and the generated subtitles are in SRT format. How can this be solved? Is it a matter of adjusting the parameters in advanced-temperature?
I would be very grateful if you could take time out of your busy schedule to answer.

@Liu8Can
Author

Liu8Can commented Apr 23, 2024

image
just like this, it's so long😂

@Barmaid1076

any fixes?

@raivisdejus
Collaborator

There is no easy fix for this problem. The length of the returned subtitles is determined by Whisper itself; Buzz can't change it.

One possible solution is to generate subtitles with word-level timestamps and then glue the words into sentences of whatever length you need.
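
A rough sketch of that gluing step, in case it helps. It assumes the word-level output has already been loaded as `(start_seconds, end_seconds, text)` tuples; that shape, the function names, and the 42-character limit are all illustrative assumptions, not anything Buzz or Whisper defines.

```python
from typing import List, Tuple

# Hypothetical input shape: one entry per word, times in seconds.
Cue = Tuple[float, float, str]


def _merge(words: List[Cue]) -> Cue:
    """Combine consecutive words into one cue spanning first start to last end."""
    return (words[0][0], words[-1][1], " ".join(w[2] for w in words))


def group_words(words: List[Cue], max_chars: int = 42) -> List[Cue]:
    """Glue word-level entries into cues of at most max_chars characters.

    A cue is also closed early when a word ends a sentence.
    A single word longer than max_chars still gets its own cue.
    """
    cues: List[Cue] = []
    current: List[Cue] = []
    for word in words:
        candidate = " ".join(w[2] for w in current + [word])
        if current and len(candidate) > max_chars:
            cues.append(_merge(current))
            current = []
        current.append(word)
        if word[2].endswith((".", "?", "!")):
            cues.append(_merge(current))
            current = []
    if current:
        cues.append(_merge(current))
    return cues


def fmt_time(seconds: float) -> str:
    """Format seconds as an SRT timestamp, HH:MM:SS,mmm."""
    ms = int(round(seconds * 1000))
    h, ms = divmod(ms, 3_600_000)
    m, ms = divmod(ms, 60_000)
    s, ms = divmod(ms, 1000)
    return f"{h:02d}:{m:02d}:{s:02d},{ms:03d}"


def to_srt(cues: List[Cue]) -> str:
    """Render cues as SRT text: index, time range, then the subtitle line."""
    blocks = []
    for index, (start, end, text) in enumerate(cues, start=1):
        blocks.append(f"{index}\n{fmt_time(start)} --> {fmt_time(end)}\n{text}\n")
    return "\n".join(blocks)
```

Calling `to_srt(group_words(words, max_chars=42))` would then give SRT text with shorter lines. Lowering `max_chars` makes the cues shorter still.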

Another solution is to use a separate tool to post-process the already generated subtitles. This tool seems to do something like that: https://github.com/peterk/srt_equalizer
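
To illustrate the general idea of post-processing, here is a minimal sketch that splits one long cue into shorter ones and divides its time span in proportion to the text length. This is only an assumption about how such a split could work, not the actual algorithm or API of srt_equalizer; `split_cue` and its parameters are hypothetical names.

```python
import textwrap
from typing import List, Tuple

Cue = Tuple[float, float, str]  # (start_seconds, end_seconds, text)


def split_cue(start: float, end: float, text: str, max_chars: int = 42) -> List[Cue]:
    """Split one cue into pieces of at most max_chars characters.

    Each piece gets a share of the original duration proportional to its
    character count. Words are never broken, so a single word longer than
    max_chars stays whole.
    """
    chunks = textwrap.wrap(text, width=max_chars, break_long_words=False) or [text]
    total_chars = sum(len(chunk) for chunk in chunks)
    duration = end - start

    pieces: List[Cue] = []
    cursor = start
    for i, chunk in enumerate(chunks):
        if i == len(chunks) - 1:
            chunk_end = end  # pin the last piece to the original end time
        else:
            chunk_end = cursor + duration * len(chunk) / total_chars
        pieces.append((cursor, chunk_end, chunk))
        cursor = chunk_end
    return pieces
```

The timing here is only an estimate, since the original SRT carries no per-word times. If exact timing matters, the word-level approach is the more reliable option.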

@Liu8Can
Author

Liu8Can commented May 18, 2024 via email
