implement an argument to directly set ff_inner_dim #52

Open · wants to merge 1 commit into main
Conversation

chris-ha458

NVIDIA's nvidia/GPT-2B-001 implements a very PaLM-like model.

However, instead of an FFN multiplier like ff_mult, the ffn_hidden_size (comparable to ff_inner_dim in this codebase) is set directly to 5440.

This corresponds to an ff_mult of 2.65625. However, trying that value in this codebase does not work.

The error:

TypeError: empty() received an invalid combination of arguments - got (tuple, dtype=NoneType, device=NoneType), but expected one of:
 * (tuple of ints size, *, tuple of names names, torch.memory_format memory_format, torch.dtype dtype, torch.layout layout, torch.device device, bool pin_memory, bool requires_grad)
 * (tuple of ints size, *, torch.memory_format memory_format, Tensor out, torch.dtype dtype, torch.layout layout, torch.device device, bool pin_memory, bool requires_grad)
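The likely cause (my reading of the error, not confirmed against the codebase) is that a fractional ff_mult turns dim * ff_mult into a float, and torch.empty() only accepts integer sizes. A quick arithmetic sketch, with dim = 2048 assumed for GPT-2B-001 (the hidden size is not stated above):

```python
dim = 2048                # assumed hidden size for GPT-2B-001, for illustration
ff_mult = 2.65625         # the fractional multiplier from the description
ff_inner_dim = dim * ff_mult

print(ff_inner_dim)                   # 5440.0 — a float, not an int
print(isinstance(ff_inner_dim, int))  # False: torch.empty() needs int sizes
```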

So I implemented a way to directly set ff_inner_dim. Please take a look!

@chris-ha458
Author

Also, the formatter I use changes the layout a lot, so I had to modify the code manually.
What formatter does this repo use?

@GadiZimerman

@CodiumAI-Agent /review

@CodiumAI-Agent

PR Analysis

  • 🎯 Main theme: Adding an argument to directly set ff_inner_dim
  • 📌 Type of PR: Enhancement
  • 🧪 Relevant tests added: No
  • Focused PR: Yes, the PR is focused as it has a clear and coherent title and description, and all PR code diff changes are properly derived from the title and description.
  • 🔒 Security concerns: No, the changes made in this PR do not introduce any obvious security concerns.

PR Feedback

  • 💡 General PR suggestions: The PR is generally well-written and the changes are clear. However, it would be beneficial to include tests to ensure the new functionality works as expected. Additionally, it would be helpful to update the function's docstring to include the new parameter.

How to use

Tag me in a comment '@CodiumAI-Agent' and add one of the following commands:
/review - Request a review of the latest update to the PR.
/describe - Modify the PR title and description based on the contents of the PR.
/improve - Suggest improvements to the code in the PR. These will be provided as pull request comments, ready to commit.
/ask - Pose a question about the PR.

@@ -134,7 +135,8 @@ def __init__(
         self.norm = LayerNorm(dim)

         attn_inner_dim = dim_head * heads
-        ff_inner_dim = dim * ff_mult
+        # silently ignores ff_mult if ff_inner_dim is provided in the arguments
+        ff_inner_dim = dim * ff_mult if not ff_inner_dim else self.ff_inner_dim


Consider adding a check to ensure that ff_inner_dim is a positive integer if it is not None. This will prevent potential errors or unexpected behavior. [important]
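A minimal sketch of that validation, assuming the argument names from the diff (resolve_ff_inner_dim is a hypothetical helper, not part of the codebase):

```python
def resolve_ff_inner_dim(dim, ff_mult, ff_inner_dim=None):
    # Prefer an explicit ff_inner_dim; otherwise fall back to dim * ff_mult.
    if ff_inner_dim is not None:
        if not isinstance(ff_inner_dim, int) or ff_inner_dim <= 0:
            raise ValueError(
                f"ff_inner_dim must be a positive integer, got {ff_inner_dim!r}")
        return ff_inner_dim
    return int(dim * ff_mult)

print(resolve_ff_inner_dim(2048, 4))        # 8192 (default path)
print(resolve_ff_inner_dim(2048, 4, 5440))  # 5440 (explicit override wins)
```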

@@ -134,7 +135,8 @@ def __init__(
         self.norm = LayerNorm(dim)

         attn_inner_dim = dim_head * heads
-        ff_inner_dim = dim * ff_mult
+        # silently ignores ff_mult if ff_inner_dim is provided in the arguments
+        ff_inner_dim = dim * ff_mult if not ff_inner_dim else self.ff_inner_dim
         self.fused_dims = (attn_inner_dim, dim_head, dim_head, (ff_inner_dim * 2))


It would be beneficial to add a comment explaining why ff_inner_dim is multiplied by 2 in self.fused_dims. This would improve code readability and maintainability. [medium]
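For context (my understanding of PaLM-style blocks, not a quote from this codebase): the factor of 2 is most likely there because the SwiGLU feedforward needs two parallel projections of width ff_inner_dim — a value branch and a SiLU gate — fused into a single matmul. A pure-Python sketch of splitting that fused output:

```python
import math

def swiglu_from_fused(fused, ff_inner_dim):
    # fused has width ff_inner_dim * 2: the first half is the value branch,
    # the second half is the gate, passed through SiLU (x * sigmoid(x)).
    value, gate = fused[:ff_inner_dim], fused[ff_inner_dim:]
    silu = lambda x: x / (1.0 + math.exp(-x))
    return [v * silu(g) for v, g in zip(value, gate)]

out = swiglu_from_fused([1.0, 2.0, 0.5, -0.5], ff_inner_dim=2)
print(len(out))  # 2 — back to ff_inner_dim after gating
```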

@@ -511,4 +515,4 @@ def forward(
             return ret

         logits = rearrange(logits, 'b n c -> b c n')
         return F.cross_entropy(logits, labels, ignore_index = self.cross_entropy_ignore_index)


Consider adding a newline at the end of the file. This is a common convention that helps with file processing in various systems. [medium]
