dtype mismatch in AttentiveStatisticsPooling with FP16 training mode #2544

Open
MM-0712 opened this issue May 9, 2024 · 0 comments
Labels
bug Something isn't working

Comments

MM-0712 commented May 9, 2024

Describe the bug

attn = torch.cat([x, mean, std], dim=1)

If the model is trained in FP16 or BF16 mode, this line raises a dtype mismatch error because mean and std do not share x's dtype. One possible fix is to add .to(x.dtype) to the statistics before concatenating.
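A minimal sketch of the suggested cast, assuming mean and std end up in FP32 while x stays in half precision (the shapes below are illustrative, not taken from the issue):

import torch

x = torch.randn(2, 80, 120, dtype=torch.float16)          # FP16 features (batch, channels, time)
mean = x.float().mean(dim=2, keepdim=True).expand_as(x)   # statistics assumed to be accumulated in FP32
std = x.float().std(dim=2, keepdim=True).expand_as(x)

# Cast the FP32 statistics back to x's dtype so every tensor passed to torch.cat matches.
attn = torch.cat([x, mean.to(x.dtype), std.to(x.dtype)], dim=1)

With the casts in place, torch.cat no longer receives mixed float16/float32 inputs.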

Expected behaviour

None

To Reproduce

None

Environment Details

No response

Relevant Log Output

No response

Additional Context

No response

MM-0712 added the bug label on May 9, 2024