Hi, when I look at the model/wte tensor (the token embedding weights) in the GPT-2 117M example, it has a shape of [768, 50257], which I take to be [embedding_dim, vocabulary_size].
Should the usual dimension order for wte be [vocabulary_size, embedding_dim] instead?
If so, why does a ggml tensor store the dimensions in reversed order?
Similarly, model/wpe (the positional encoding weights) has a shape of [768, 1024], which also seems reversed from the usual [1024, 768] order.
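For context, here is a minimal sketch of how I understand the shapes to arise. It is based on my reading of the gpt-2 example, not a copy of it; the hyperparameter constants, the context memory size, and the variable names are my own illustrative assumptions. The last two arguments of ggml_new_tensor_2d become ne[0] and ne[1], which is exactly the [768, 50257] / [768, 1024] I see when printing the shapes:

```cpp
#include <cstdio>
#include "ggml.h"

int main() {
    // Minimal ggml context, sized just large enough for this illustration
    // (the 256 MB figure is an assumption, not a value from the example).
    struct ggml_init_params params = {
        /*.mem_size   =*/ 256*1024*1024,
        /*.mem_buffer =*/ NULL,
        /*.no_alloc   =*/ false,
    };
    struct ggml_context * ctx = ggml_init(params);

    const int n_embd  = 768;    // embedding dimension of GPT-2 117M
    const int n_vocab = 50257;  // vocabulary size
    const int n_ctx   = 1024;   // maximum context length

    // model/wte: created with ne0 = n_embd, ne1 = n_vocab,
    // so its reported shape is [768, 50257]
    struct ggml_tensor * wte = ggml_new_tensor_2d(ctx, GGML_TYPE_F32, n_embd, n_vocab);

    // model/wpe: created with ne0 = n_embd, ne1 = n_ctx,
    // so its reported shape is [768, 1024]
    struct ggml_tensor * wpe = ggml_new_tensor_2d(ctx, GGML_TYPE_F32, n_embd, n_ctx);

    printf("wte: [%lld, %lld]\n", (long long) wte->ne[0], (long long) wte->ne[1]);
    printf("wpe: [%lld, %lld]\n", (long long) wpe->ne[0], (long long) wpe->ne[1]);

    ggml_free(ctx);
    return 0;
}
```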
Thanks,