Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Canary model stuck in a loop? Just repeats the same phrases over and over. #9030

Open
bcnine opened this issue Apr 24, 2024 · 2 comments
Open
Assignees
Labels
bug Something isn't working

Comments

@bcnine
Copy link

bcnine commented Apr 24, 2024

I recorded a 5 minute audio segment on my iPhone 11, at 24 bit / 48Khz, using this as my json input:

{ "audio_filepath": "/ldir/test.wav", "duration": "None", "taskname": "asr", "source_lang": "en", "target_lang": "en", "pnc": "yes", "answer": "na" }

It was just a test recording of me reading from Vol 1 of The Art of Computer Programming. The return from the call to

canary_model.transcribe("input_manifest.json", batch_size=16)

looks like

['Historically, the first interpreters were built around machine like languages designed in a single word. In such a case, there is another way to pick out the appropriate interpretive language, which is designed in a single word. Such a programme may be considered as a longer interpretive language. In such a language, there is a language that is designed in a single word. Interpretive language. In such a language, there is a language designed in a single word design. Interpretive language designed in a single word design. In such a word design, there is a language designed interpreter designed in a single word design. In such a word designed interpreter design, designed in a single word design. In such a word designed interpreter design, designed in a single word designation design interpreter designation designation designed in a single word design designation designed interprect design design designation designed inter designation designation design designed inter designation designation designed inter designation designation designation designed inter interprect designation designation designed inter interprect designation designation designed in designed interprect designed interprection designed interprection designed interprect designed interprect designed interprection designed interprection designed inter designed inter']

You can see it starts off OK, but this it just gets stuck in a loop repeating the same word or phrase, and it's much shorter than the audio segment. I have seen this repeatedly in different tests, and I have no idea what's causing it.

I'm using the following to launch the container

docker run --gpus all --ipc=host --ulimit memlock=-1 --ulimit stack=67108864 -it --rm -v /mnt/c/Users/bcollins/local_directory:/ldir nvcr.io/nvidia/nemo:24.01.speech

on an RTX 4090 mobile chipset in a Windows laptop. VRAM is 16GB.

@bcnine bcnine added the bug Something isn't working label Apr 24, 2024
@Suma-Rajashankar
Copy link

I am facing a similar issue. Any help would be appreciated. Thank you.

@nithinraok
Copy link
Collaborator

I am afraid to say this is common with AED models, and this scenario is called hallucination.

Would it be possible for you to share the audio? @pzelasko FYI

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Something isn't working
Projects
None yet
Development

No branches or pull requests

4 participants