对Ollama模型加载keep_alive参数的支持，让模型可以在显存内保留更长时间 #1536

SuperMaxine · 2024-05-11T08:11:52Z

加入调用Ollama模型时对keep_alive参数的设置选项

ollama默认在5分钟未使用后会将模型从显存中卸载，导致下次调用时会经历较长时间的加载过程，较为影响使用体验。

Ollama在api调用时提供了keep_alive选项可以设置保留时长或永不卸载选项，希望可以加入设置页面让用户自主设置。详见Ollama关于API调用的文档：
https://github.com/ollama/ollama/blob/c02db93243353855b983db2a1562a02b57e66db1/docs/faq.md#how-do-i-keep-a-model-loaded-in-memory-or-make-it-unload-immediately

No response

yetone · 2024-05-12T06:03:24Z

好的٩(•̤̀ᵕ•̤́๑)ᵒᵏᵎᵎᵎᵎ

yetone · 2024-05-12T10:28:26Z

最新版已支持嘻嘻

SuperMaxine · 2024-05-12T10:40:34Z

最新版已支持嘻嘻

!!!!!!!

大佬好快！！！！

大佬nb！！！

SuperMaxine added the enhancement New feature or request label May 11, 2024

yetone mentioned this issue May 12, 2024

feat: ollama keep alive #1540

Merged

yetone closed this as completed May 12, 2024

Provide feedback