You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
No clue if that's related. Presumably user config/overrides might be an alternative option for supplying that information (eg: JSON) or if it's usually a field supported in GGUF, those that want that could presumably patch the metadata directly?
I am not familiar with this interface, but from my experience implementing LongRope, I think they are using a bit of a hack to get this to work because GGUF does not contain the necessary data. We actually load the full long and short factors for non-GGUF phi3 so it is flexible - and guaranteed to be correct unlike theirs - for any sequence length. I have looked at the GGUF metadata, and it does not contain these long/short factors.
We could of course implement something like this. It would not be ideal, but it would provide a way for users to put the context length about where they need it.
The text was updated successfully, but these errors were encountered: