Releases · li-plus/chatglm.cpp
v0.3.2
- Support P-Tuning v2 fine-tuned models for the ChatGLM family
- Fix convert.py for LoRA models & chatglm3-6b-128k (see the conversion sketch after this list)
- Fix RoPE theta config for 32k/128k sequence lengths
- Better CUDA CMake script that respects the nvcc version
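
As a reference for the LoRA fix above, here is a minimal conversion sketch. The convert.py flags shown (`-i`, `-l`, `-t`, `-o`) and the output filename are assumptions based on the script's usual interface; check `python3 chatglm_cpp/convert.py -h` for the authoritative options.

```python
# Convert a LoRA fine-tuned ChatGLM model to GGML format with convert.py
# (CLI flags are assumptions; <your_lora_path> is a placeholder):
#
#   python3 chatglm_cpp/convert.py -i THUDM/chatglm3-6b -l <your_lora_path> -t q4_0 -o chatglm3-lora-ggml.bin
#
# The merged, quantized model then loads like any other GGML file:
import chatglm_cpp

pipeline = chatglm_cpp.Pipeline("./chatglm3-lora-ggml.bin")
reply = pipeline.chat([chatglm_cpp.ChatMessage(role="user", content="Hello!")])
print(reply.content)
```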
v0.3.1
- Support function calling in the OpenAI API server (see the sketch after this list)
- Faster repetition penalty sampling
- Support the max_new_tokens generation option
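
A minimal sketch of exercising both features through the OpenAI-compatible server follows. The server address, the model name, and the `get_weather` tool are illustrative assumptions; `tools` and `max_tokens` are standard fields of the OpenAI chat completions API, with `max_tokens` playing the same role as the new max_new_tokens option.

```python
# Sketch: calling a locally running chatglm.cpp OpenAI-compatible server
# with the openai>=1.0 Python client (base URL and model name assumed).
from openai import OpenAI

client = OpenAI(base_url="http://127.0.0.1:8000/v1", api_key="unused")

tools = [
    {
        "type": "function",
        "function": {
            "name": "get_weather",  # hypothetical function, for illustration only
            "description": "Get the current weather in a given city",
            "parameters": {
                "type": "object",
                "properties": {"city": {"type": "string"}},
                "required": ["city"],
            },
        },
    }
]

response = client.chat.completions.create(
    model="chatglm3",  # a single-model local server typically ignores this field
    messages=[{"role": "user", "content": "What's the weather like in Beijing?"}],
    tools=tools,
    max_tokens=256,  # caps generated tokens, mirroring max_new_tokens
)
message = response.choices[0].message
print(message.tool_calls or message.content)
```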
v0.3.0
- Full ChatGLM3 functionality, including system prompts, function calling, and the code interpreter
- Brand-new OpenAI-style chat API (see the sketch after this list)
- Add token usage information to the OpenAI API server for compatibility with the LangChain frontend
- Fix conversion error for chatglm3-6b-32k
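
A minimal sketch of the new message-based chat API in the Python binding; the `ChatMessage` class and `chat()` signature are assumed from the binding's public interface, and the model path is a placeholder.

```python
# Sketch of the OpenAI-style chat API in the Python binding (v0.3.0+).
import chatglm_cpp

pipeline = chatglm_cpp.Pipeline("./chatglm3-ggml.bin")  # placeholder model path
messages = [
    chatglm_cpp.ChatMessage(role="system", content="You are a helpful assistant."),
    chatglm_cpp.ChatMessage(role="user", content="Write hello world in Python."),
]
reply = pipeline.chat(messages)  # returns a ChatMessage(role="assistant", ...)
print(reply.content)
```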
v0.2.10
- Support ChatGLM3 in conversation mode
- Coming soon: a new prompt format for system messages and function calls
v0.2.9
- Support InternLM 7B & 20B model architectures
v0.2.8
- Metal backend support for all models (ChatGLM, ChatGLM2, Baichuan-7B, Baichuan-13B)
- Fix GLM generation on CUDA for long contexts
v0.2.7
- Support the Baichuan-7B model architecture (works for both Baichuan v1 & v2)
- Minor bug fixes and enhancements
v0.2.6
- Support Baichuan-13B on CPU & CUDA backends
- Bug fixes for Windows and Metal
v0.2.5
- Optimize context computation (GEMM) for the Metal backend
- Support a repetition penalty option for generation (see the sketch after this list)
- Update the Dockerfile for CPU & CUDA backends with full functionality; images are hosted on GHCR
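
A minimal sketch of the repetition penalty option via the Python binding; the kwarg name follows the release note, and the v0.2.x list-of-strings chat signature is assumed.

```python
# Sketch: repetition penalty as a generation option (Python binding, v0.2.x).
# In this era of the API, chat() takes the conversation history as a list of
# strings; repetition_penalty > 1.0 penalizes already-generated tokens.
import chatglm_cpp

pipeline = chatglm_cpp.Pipeline("./chatglm2-ggml.bin")  # placeholder model path
output = pipeline.chat(
    ["Tell me a short story."],
    repetition_penalty=1.1,  # 1.0 disables the penalty
)
print(output)
```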
v0.2.4
- Python binding enhancement: support load-and-convert directly from original Hugging Face models, so intermediate GGML model files are no longer necessary (see the sketch after this list)
- Small fix for the CLI demo on Windows
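
A minimal sketch of load-and-convert: passing a Hugging Face model id straight to `Pipeline` is the feature this release describes, while the `dtype` keyword for on-the-fly quantization is an assumed parameter name.

```python
# Sketch: loading and converting a Hugging Face model in one step.
# The model id is resolved and converted on the fly, skipping the intermediate
# GGML file; the dtype kwarg (quantization type) is an assumption.
import chatglm_cpp

pipeline = chatglm_cpp.Pipeline("THUDM/chatglm2-6b", dtype="q4_0")
print(pipeline.chat(["Hello!"]))
```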