Releases: ggerganov/llama.cpp
b1462
llama : implement YaRN RoPE scaling (#2268) Co-authored-by: cebtenzzre <cebtenzzre@gmail.com> Co-authored-by: Jeffrey Quesnelle <jquesnelle@gmail.com>
b1461
llm : fix llm_build_kqv taking unused tensor (benign, #3837)
b1460
llm : fix falcon norm after refactoring (#3837)
b1459
metal : multi-simd softmax (#3710) ggml-ci
b1458
common : minor (#3715)
b1457
llm : add llm_build_context (#3881)
* llm : add llm_build_context
* llm : deduce norm eps based on type + explicit max_alibi_bias, clamp_kqv
* llm : restore the non-graph llm_build_ functional API
ggml-ci
* llm : cleanup + comments
b1456
common : allow caller to handle help/argument exceptions (#3715)
* Allow caller to handle help/argument exceptions
* Prepend newline to usage output
* Add new gpt_params_parse_ex function to hide arg-parse impl
* Fix issue blocking success case
* exit instead of returning false
* Update common/common.h
* Update common/common.cpp
Co-authored-by: Georgi Gerganov <ggerganov@gmail.com>
b1455
log : make generating separate log files optional (#3787)
* impl --log-new, --log-append
* Update common/log.h
* Apply suggestions from code review
Co-authored-by: cebtenzzre <cebtenzzre@gmail.com>
b1454
sampling : null grammar field after reset (#3885)
b1453
ggml : fix UNUSED macro (#3762)