The code and detailed implementation of Figure 4 and Figure 5 in the paper mPLUG-Owl2 #197

Open
Zlatan-Ibrahi opened this issue Dec 29, 2023 · 1 comment

Comments

@Zlatan-Ibrahi

I would like to analyze the attention maps of my own trained model, but some details are unclear to me. For example, are the attention maps averaged across the multiple heads? Could you provide the code for this?

@Zlatan-Ibrahi Zlatan-Ibrahi changed the title The code and detailed implementation of Figure 4 and FIgure 5 in the paper mPLUG-Owl2 The code and detailed implementation of Figure 4 and Figure 5 in the paper mPLUG-Owl2 Dec 29, 2023
@GasolSun36

Same question, any solutions?
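
For reference, averaging over heads is one common way to collapse a multi-head attention map into a single map per layer. The thread does not confirm whether the paper's figures were produced this way, so the sketch below is only an illustration under that assumption. The function names `average_heads` and `modality_mass`, the toy shapes, and the image/text token mask are all hypothetical; the random weights stand in for real attention weights of shape `(num_heads, query_len, key_len)`.

```python
import numpy as np


def average_heads(attn):
    """Collapse (num_heads, q_len, k_len) attention weights to (q_len, k_len)
    by taking the plain mean over the head axis (an assumed choice)."""
    attn = np.asarray(attn, dtype=np.float64)
    if attn.ndim != 3:
        raise ValueError("expected shape (num_heads, q_len, k_len)")
    return attn.mean(axis=0)


def modality_mass(avg_attn, image_mask):
    """For each query token, sum the attention placed on image keys and on
    text keys. image_mask is a boolean vector over key positions."""
    image_mask = np.asarray(image_mask, dtype=bool)
    image_mass = avg_attn[:, image_mask].sum(axis=1)
    text_mass = avg_attn[:, ~image_mask].sum(axis=1)
    return image_mass, text_mass


# Toy stand-in for one layer's attention: softmax over random logits.
rng = np.random.default_rng(0)
num_heads, seq_len = 4, 6
logits = rng.normal(size=(num_heads, seq_len, seq_len))
attn = np.exp(logits) / np.exp(logits).sum(axis=-1, keepdims=True)

avg = average_heads(attn)

# Hypothetical layout: first three key positions are image tokens.
image_mask = np.array([True, True, True, False, False, False])
img, txt = modality_mass(avg, image_mask)
```

Because each head's rows sum to 1, the head-averaged rows also sum to 1, so `img + txt` is 1 for every query token. With a Hugging Face style model, per-layer weights of shape `(batch, num_heads, seq_len, seq_len)` are typically returned when `output_attentions=True` is passed; whether this repository's model code exposes them the same way is an assumption to check.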
