MTEB: Massive Text Embedding Benchmark
-
Updated
Jun 3, 2024 - Python
MTEB: Massive Text Embedding Benchmark
[ACL 2023] One Embedder, Any Task: Instruction-Finetuned Text Embeddings
SGPT: GPT Sentence Embeddings for Semantic Search
Code & data accompanying the KDD 2017 paper "KATE: K-Competitive Autoencoder for Text"
Train and Infer Powerful Sentence Embeddings with AnglE | 🔥 SOTA on STS and MTEB Leaderboard
Generative Representational Instruction Tuning
Rosette API Client Library for C#
Codebase for RetroMAE and beyond.
Rosette API Client Library for PHP
Rosette API Client Library for Ruby
Perform topic classification on news articles in several limited-labeled data regimes.
cUrl examples for the Rosette API
I have improved the demo by using Azure OpenAI’s Embedding model (text-embedding-ada-002), which has a powerful word embedding capability. This model can also vectorize product key phrases and recommend products based on cosine similarity, but with better results. You can find the updated repo here.
Universal-Sentence-Encoder-Multilingual-QA is a model developed by researchers at Google mainly for the purpose of question answering. You can use this template to import the model in Inferless.
This code repo demonstrates how to use the word embedding model from Azure OpenAI Service to perform a semantic search on a grocery store dataset. This enhanced/completed version used Streamlit to build a web user experience to semantic search and display the most relevant items
Image Steganography GUI | Easily Hide Text Files within Images with User-Friendly GUI | Pyton Tool
A Node-RED node that interacts with OpenAI machine learning models to generate text like ChatGPT
Simple script to compute CLIP-based scores given a DALL-e trained model.
Leveraged the power of Google Cloud's Vertex AI platform to develop advanced Large Language Models (LLMs). Utilizing the Python API provided by Google Cloud, this endeavor represents a significant stride in the realm of natural language processing and LLMs.
A simple Python script for transforming a corpus of documents into text vectors suitable for visualization
Add a description, image, and links to the text-embedding topic page so that developers can more easily learn about it.
To associate your repository with the text-embedding topic, visit your repo's landing page and select "manage topics."