Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[Q] Where is the rollout AttCAT method implemented? #2

Open
NightMachinery opened this issue Jul 29, 2023 · 0 comments
Open

[Q] Where is the rollout AttCAT method implemented? #2

NightMachinery opened this issue Jul 29, 2023 · 0 comments

Comments

@NightMachinery
Copy link

Where is AttCAT-aggregated-by-rollout implemented?

Looking through the code, I could only find the sum AttCAT method.

Mathematically speaking, AttCAT produces attributions of shape (batch, from_token) for each layer. So we cannot apply the rollout method on it at all, as rollout needs attributions with the shape (batch, to_token, from_token).

Here is the part of the paper that discusses these two methods:
image

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant