Releases: withcatai/node-llama-cpp

v2.8.11 (2024-05-24)

Bug Fixes

  • bump llama.cpp release used in prebuilt binaries (#223) (81a203e)

v3.0.0-beta.22 (2024-05-19)
Pre-release

Shipped with llama.cpp release b2929

To use the latest llama.cpp release available, run npx --no node-llama-cpp download --release latest. (learn more)

v3.0.0-beta.21 (2024-05-19)
Pre-release

Shipped with llama.cpp release b2929

To use the latest llama.cpp release available, run npx --no node-llama-cpp download --release latest. (learn more)

v3.0.0-beta.20 (2024-05-19)
Pre-release

Bug Fixes

  • improve binary compatibility detection on Linux (#217) (d6a0f43)

Features

  • init command to scaffold a new project from a template (with node-typescript and electron-typescript-react templates) (#217) (d6a0f43)
  • debug mode (#217) (d6a0f43)
  • load LoRA adapters (#217) (d6a0f43) (usage sketch after this list)
  • improve Electron support (#217) (d6a0f43)
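
As a rough idea of what the new LoRA support looks like, the sketch below loads a model and attaches an adapter. The exact option shape (`lora: {adapters: [{filePath}]}` passed to `createContext`) is an assumption based on this beta, not confirmed API from these notes, and the file paths are placeholders:

```ts
import {getLlama} from "node-llama-cpp";

const llama = await getLlama();
const model = await llama.loadModel({
    modelPath: "path/to/model.gguf" // placeholder path
});

// Assumed option shape for attaching a LoRA adapter at context creation;
// check the beta docs for the exact property names.
const context = await model.createContext({
    lora: {
        adapters: [{
            filePath: "path/to/adapter.gguf" // placeholder path
        }]
    }
});
```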

Shipped with llama.cpp release b2928

To use the latest llama.cpp release available, run npx --no node-llama-cpp download --release latest. (learn more)

v3.0.0-beta.19 (2024-05-12)
Pre-release

Shipped with llama.cpp release b2861

To use the latest llama.cpp release available, run npx --no node-llama-cpp download --release latest. (learn more)

v3.0.0-beta.18 (2024-05-09)
Pre-release

Bug Fixes

  • more efficient max context size finding algorithm (#214) (453c162)
  • make embedding-only models work correctly (#214) (453c162) (usage sketch after this list)
  • perform context shift on the correct token index on generation (#214) (453c162)
  • make context loading work for all models on Electron (#214) (453c162)
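
Since this release fixes embedding-only models, here is a minimal sketch of generating an embedding with the v3 beta API; the model path is a placeholder:

```ts
import {getLlama} from "node-llama-cpp";

const llama = await getLlama();
const model = await llama.loadModel({
    modelPath: "path/to/embedding-model.gguf" // placeholder path
});

// Create a context dedicated to embeddings and embed a string.
const embeddingContext = await model.createEmbeddingContext();
const embedding = await embeddingContext.getEmbeddingFor("Hello world");

console.log(embedding.vector.length);
```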

Shipped with llama.cpp release b2834

To use the latest llama.cpp release available, run npx --no node-llama-cpp download --release latest. (learn more)

v2.8.10 (2024-04-27)

v3.0.0-beta.17 (2024-04-24)
Pre-release

Bug Fixes

  • FunctionaryChatWrapper bugs (#205) (ef501f9)
  • function calling syntax bugs (#205) (ef501f9) (usage sketch after this list)
  • show GPU layers in the Model line in CLI commands (#205) (ef501f9)
  • refactor: rename LlamaChatWrapper to Llama2ChatWrapper
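
For context on what the function-calling fixes touch, here is a minimal sketch of prompting with a user-defined function in the v3 beta; `getCurrentTime` is a made-up example function and the model path is a placeholder:

```ts
import {getLlama, LlamaChatSession, defineChatSessionFunction} from "node-llama-cpp";

const llama = await getLlama();
const model = await llama.loadModel({modelPath: "path/to/model.gguf"}); // placeholder path
const context = await model.createContext();
const session = new LlamaChatSession({contextSequence: context.getSequence()});

const functions = {
    // Hypothetical function for illustration; the model can call it while answering.
    getCurrentTime: defineChatSessionFunction({
        description: "Returns the current time as an ISO string",
        handler() {
            return new Date().toISOString();
        }
    })
};

const answer = await session.prompt("What time is it right now?", {functions});
console.log(answer);
```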

Shipped with llama.cpp release b2717

To use the latest llama.cpp release available, run npx --no node-llama-cpp download --release latest. (learn more)

v3.0.0-beta.16 (2024-04-13)
Pre-release


Shipped with llama.cpp release b2665

To use the latest llama.cpp release available, run npx --no node-llama-cpp download --release latest. (learn more)

v3.0.0-beta.15 (2024-04-04)
Pre-release

Features

  • automatically adapt to current free VRAM state (#182) (35e6f50)
  • inspect gguf command (#182) (35e6f50)
  • inspect measure command (#182) (35e6f50)
  • readGgufFileInfo function (#182) (35e6f50) (usage sketch after this list)
  • GGUF file metadata info on LlamaModel (#182) (35e6f50)
  • JinjaTemplateChatWrapper (#182) (35e6f50)
  • use the tokenizer.chat_template header from the gguf file when available: use it to pick a better-matching specialized chat wrapper, or fall back to JinjaTemplateChatWrapper with that template (#182) (35e6f50)
  • simplify generation CLI commands: chat, complete, infill (#182) (35e6f50)
  • Windows on Arm prebuilt binary (#181) (f3b7f81)
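
As a quick sketch of the new readGgufFileInfo function: it parses GGUF metadata without loading the model weights. The metadata field names below are assumptions based on the GGUF spec, and the path is a placeholder:

```ts
import {readGgufFileInfo} from "node-llama-cpp";

// Parse only the GGUF header and metadata; no weights are loaded.
const ggufInfo = await readGgufFileInfo("path/to/model.gguf"); // placeholder path

// Assumed metadata layout; consult the beta docs for the exact fields.
console.log(ggufInfo.metadata?.general?.architecture);
```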

Shipped with llama.cpp release b2608

To use the latest llama.cpp release available, run npx --no node-llama-cpp download --release latest. (learn more)