Releases · ggerganov/llama.cpp

29 May 19:51

975ec63

b3039

metal : add missing asserts (#7617)


29 May 18:50

github-actions

b3038

fb76ec3


ggml : fix YARN + add tests + add asserts (#7617)

* tests : add rope tests

ggml-ci

* ggml : fixes (hopefully)

ggml-ci

* tests : add non-cont tests

ggml-ci

* cuda : add asserts for rope/norm + fix DS2

ggml-ci

* ggml : assert contiguousness

* tests : reduce RoPE tests

ggml-ci


29 May 15:47

github-actions

b3037

cce3dcf


cuda : non-cont concat support (#7610)

* tests : add non-cont concat tests

* cuda : non-cont concat support

ggml-ci


29 May 15:24

github-actions

b3036

210d991


llama-bench : add support for the RPC backend (#7435)


29 May 15:23

github-actions

b3035

87bdf2a


ggml : use atomic_flag for critical section (#7598)

* ggml : use atomic_flag for critical section

* add windows shims


29 May 14:18

github-actions

b3033

2ab9772


sync : ggml


29 May 03:17

github-actions

b3030

504f0c3


ggml : fix typo in ggml.c (#7603)


28 May 23:45

github-actions

b3029

b864b50


[SYCL] Align GEMM dispatch (#7566)

* align GEMM dispatch


28 May 21:58

github-actions

b3028

02c1eca


Tokenizer WPM fixes (#7500)

* Update random test: add_bos_token.
* Update random test: add WPM models for testing.
* Build vocab.special_tokens_cache using vocab token types.
* Fix and improve WPM preprocessing.
  - Fix unicode edge case combinations.
  - Split by whitespace in the same pass.
* Discard all tokens when no matching found.


28 May 21:47

github-actions

b3027

6bd12ce


sycl : fix assert (#7563)
