#

ggml

Here are 71 public repositories matching this topic...

llama.cpp

ggerganov / llama.cpp

LLM inference in C/C++

Updated May 3, 2024
C++

rustformers / llm

An ecosystem of Rust libraries for working with large language models

rust ai ml llm ggml

Updated Mar 23, 2024
Rust

xorbitsai / inference

Replace OpenAI GPT with another LLM in your app by changing a single line of code. Xinference gives you the freedom to use any LLM you need. With Xinference, you're empowered to run inference with any open-source language models, speech recognition models, and multimodal models, whether in the cloud, on-premises, or even on your laptop.

Updated May 3, 2024
Python

leejet / stable-diffusion.cpp

Stable Diffusion in pure C/C++

ai cplusplus image-generation diffusion text2image image2image img2img txt2img latent-diffusion stable-diffusion ggml

Updated May 3, 2024
C++

RWKV / rwkv.cpp

INT4/INT5/INT8 and FP16 inference on CPU for RWKV language model

machine-learning deep-learning quantization language-model llm rwkv ggml

Updated Apr 16, 2024
C++

guinmoon / LLMFarm

llama and other large language models on iOS and MacOS offline using GGML library.

macos swift ios ai llama gpt-2 rwkv ggml gptneox starcoder

Updated May 1, 2024
Swift

RahulSChand / gpu_poor

Calculate token/s & GPU memory requirement for any LLM. Supports llama.cpp/ggml/bnb/QLoRA quantization

gpu pytorch llama quantization language-model huggingface llm llamacpp ggml llama2

Updated Nov 4, 2023
JavaScript

abacaj / mpt-30B-inference

Run inference on MPT-30B using CPU

ggml ctransformers mpt-30b

Updated Jun 30, 2023
Python

Maknee / minigpt4.cpp

Port of MiniGPT4 in C++ (4bit, 5bit, 6bit, 8bit, 16bit CPU inference with GGML)

c machine-learning deep-learning cpp quantization multimodal ggml minigpt4

Updated Aug 8, 2023
C++

azkadev / whisper

Whisper Dart is a cross platform library for dart and flutter that allows converting audio to text / speech to text / inference from Open AI models

Updated May 2, 2024
C++

the-crypt-keeper / can-ai-code

Self-evaluating interview for AI coders

ai transformers humaneval llm langchain llama-cpp ggml

Updated May 1, 2024
Python

LLaMA-Cult-and-More

shm007g / LLaMA-Cult-and-More

Large Language Models for All, 🦙 Cult and More, Stay in touch !

tensorflow transformers pytorch llama gpt alpaca loralib vicuna deepspeed gpt4 llm chatgpt ggml gptq

Updated Jun 1, 2023
HTML

monatis / clip.cpp

CLIP inference in plain C/C++ with no extra dependencies

c cpp image-search clip multimodal ggml

Updated Feb 12, 2024
C

azkadev / bark

WIP Library Text To Speech From Suno AI's Bark in C/C++ for fast inference

dart machine-learning text-to-speech ai deep-learning clone neural-network voice fake tts bark ggml

Updated Apr 13, 2024
C++

azkadev / general_ai

GENERAL Ai Library For DART & Flutter

dart machine-learning library ai deep-learning ml artificial-intelligence flutter whisper piper azkadev stable-diffusion ggml

Updated Apr 13, 2024
C++

mayooear / private-chatbot-mpt30b-langchain

Chat with your data privately using MPT-30b

gpt llm langchain ggml

Updated Jun 29, 2023
Python

staghado / vit.cpp

Inference Vision Transformer (ViT) in plain C/C++ with ggml

c cpu ai computer-vision cpp image-classification edge-computing vision-transformer whisper-cpp llamacpp ggml

Updated Apr 11, 2024
C++

abacaj / replit-3B-inference

Run inference on replit-3B code instruct model using CPU

replit ggml ctransformers replit-code

Updated Jul 5, 2023
Python

chenhunghan / ialacol

🪶 Lightweight OpenAI drop-in replacement for Kubernetes

python kubernetes ai gpu helm cuda openai cloudnative llm langchain llm-serving llamacpp ggml gptq llm-inference

Updated Feb 5, 2024
Python

gotzmann / collider

Large Model Collider - The Platform for serving LLM models

openai llama gpt alpaca vicuna koboldai llm chatgpt open-assistant llamacpp llama-cpp vllm ggml stablelm wizardlm exllama oobabooga

Updated May 2, 2024
C++

Improve this page

Add a description, image, and links to the ggml topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the ggml topic, visit your repo's landing page and select "manage topics."