Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Configurable Inference / Embedding Model & rate limit #5

Open
aymenfurter opened this issue Dec 23, 2023 · 1 comment
Open

Configurable Inference / Embedding Model & rate limit #5

aymenfurter opened this issue Dec 23, 2023 · 1 comment
Labels
enhancement New feature or request

Comments

@aymenfurter
Copy link
Owner

aymenfurter commented Dec 23, 2023

Introduce an environment variable for model selection and timeouts. This change will allow users to choose different LLM models, including free options, enhancing flexibility.

@aymenfurter aymenfurter added the enhancement New feature or request label Dec 23, 2023
@aymenfurter aymenfurter changed the title Configurable Model Selection via Environment Variables Configurable Model Selection Dec 23, 2023
@aymenfurter aymenfurter changed the title Configurable Model Selection Configurable LLM / Embedding Model Configuration Dec 23, 2023
@aymenfurter aymenfurter changed the title Configurable LLM / Embedding Model Configuration Configurable LLM / Embedding Model Dec 23, 2023
@aymenfurter aymenfurter changed the title Configurable LLM / Embedding Model Configurable Inference / Embedding Model Dec 23, 2023
@aymenfurter aymenfurter changed the title Configurable Inference / Embedding Model Configurable Inference / Embedding Model & rate limit Dec 24, 2023
@bearney74
Copy link
Contributor

how about something like:

OPENAI_EMBEDDING=text-embedding-ada-002
OPENAI_MODEL=gpt-4-1106-preview

I am currently putting these environment variables in a file named .env in the root directory (next to main.py). Then using dotenv module to load the environment variables. I removed the .env file and dotenv seems to silently fail, so if someone does not have a .env file (or chooses not to use a .env file, or a person sets environment variables some other way), things still seems to work.

I decided to purchase some tokens on openai so that I can use the more advanced models (like gpt-4). People that do not want to purchase openai tokens can try it out for free by setting the OPENAI_MODEL variable to gpt-3.5-turbo. That is what I used to test out some things before purchasing.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request
Projects
None yet
Development

No branches or pull requests

2 participants