Skip to content
This repository has been archived by the owner on Aug 19, 2023. It is now read-only.

Support Apple Neural Engine (ANE) Transformers #13

Open
LeiHao0 opened this issue Mar 25, 2023 · 1 comment
Open

Support Apple Neural Engine (ANE) Transformers #13

LeiHao0 opened this issue Mar 25, 2023 · 1 comment

Comments

@LeiHao0
Copy link

LeiHao0 commented Mar 25, 2023

I noticed Apple supports ANE Transformers.

According to their own words:

M1 or newer chip to achieve up to 10 times faster and 14 times lower peak memory

Does that mean running 30B or 65B will be possible on small-memory MacBooks?

Here are a few links
https://github.com/apple/ml-ane-transformers
https://machinelearning.apple.com/research/neural-engine-transformers

As this project is the top LLaMA that leverages Apple GPU, is it possible to support ANE too?

@philipturner
Copy link

I don't know whether that would provide much speedup for current LLM architectures, which are memory bound. Rather, it might be useful for Stable Diffusion (compute-bound) or MegaByte transformers.

Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants