Pull requests: ggerganov/llama.cpp
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
Script to convert Grok-1 weights from raw JAX pickle files.
#7058
opened May 3, 2024 by
heiner
Loading…
BPE pretokenizer - add support for command-r-plus and command-r models
#7041
opened May 2, 2024 by
sealad886
Loading…
Bug fix for server crash if first token is the stop word and asking for logprobs
#7038
opened May 2, 2024 by
maor-ps
Loading…
tests : add test-tokenizer-0.sh
high priority
Very important issue
#7036
opened May 2, 2024 by
ggerganov
Loading…
convert-hf : reduce repeated boilerplate from write_tensors
need feedback
Testing and feedback with results are needed
refactoring
Refactoring
#7031
opened May 1, 2024 by
compilade
Loading…
3 of 18 tasks
convert.py: When --vocab-only is passed, generate false but valid params
#7027
opened May 1, 2024 by
20kdc
Loading…
docs: Fix typo and update description for --embeddings flag
#7026
opened May 1, 2024 by
louixs
Loading…
Update Server's README with undocumented options for RoPE, YaRN, and KV cache quantization
#7013
opened Apr 30, 2024 by
K-Mistele
Loading…
new tokenizer-verifier tool to check gguf tokenizer parameters
#6988
opened Apr 29, 2024 by
anisse
Loading…
Updated server_queue to delete tasks from queue when server is shutdown. Feature Request #6421
demo
Demonstrate some concept or idea, not intended to be merged
#6941
opened Apr 27, 2024 by
rahsuri
Loading…
Previous Next
ProTip!
Mix and match filters to narrow down what you’re looking for.