You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
describe the feature you'd like to see
Download a specific portion of a given youtube video's audio. This specific part is delimited by start and end.
describe alternatives you've considered
I considered downloading a "portion" of a video to extract the audio but that is not possible right now.
additional context
I've build a summarisation pipeline that takes as an input a YouTube url (short or video) and summarises it.
To do so, I actually use Cobalt's API to request the video audio's and pass the computed audio url to a "speech-to-text" component (built on AssemblyAI api).
The issue here is that for any video that is longer than a few minutes, the audio file starts to be pretty relatively big for the downstream speech to text component.
This brings me to the current feature request that would help me divide (and conquer) through multithreading the computation of each small part of the audio into small parts of text that I would merge.
The text was updated successfully, but these errors were encountered:
describe the feature you'd like to see
Download a specific portion of a given youtube video's audio. This specific part is delimited by start and end.
describe alternatives you've considered
I considered downloading a "portion" of a video to extract the audio but that is not possible right now.
additional context
I've build a summarisation pipeline that takes as an input a YouTube url (short or video) and summarises it.
To do so, I actually use Cobalt's API to request the video audio's and pass the computed audio url to a "speech-to-text" component (built on AssemblyAI api).
The issue here is that for any video that is longer than a few minutes, the audio file starts to be pretty relatively big for the downstream speech to text component.
This brings me to the current feature request that would help me divide (and conquer) through multithreading the computation of each small part of the audio into small parts of text that I would merge.
The text was updated successfully, but these errors were encountered: