Updated Ollama cost models to include LLaMa3 and Mistral/Mixtral Instruct series #3543
Conversation
Added Mistral and Mixtral Chat entries for Ollama.
@ishaan-jaff If you wouldn't mind reviewing this sometime soon, it would be helpful so that I don't have to keep working off a local fork of LiteLLM to use Ollama. Thanks!
model_prices_and_context_window.json
Outdated
"max_input_tokens": 8192, | ||
"max_output_tokens": 8192, | ||
"input_cost_per_token": 0.00000010, | ||
"output_cost_per_token": 0.00000010, |
does ollama have a hosted endpoint? why is there a cost here
Good catch, an oversight during copy&paste - I'll fix that.
fixed typo with ollama/llama3 token cost (now set to 0)
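For reference, a corrected entry for a locally served model would look roughly like this (illustrative sketch shown as a Python dict mirroring the JSON file; exact token limits may differ):

# Sketch of a zero-cost entry for a locally served Ollama model, mirroring
# the structure of model_prices_and_context_window.json.
ollama_llama3_entry = {
    "max_tokens": 8192,
    "max_input_tokens": 8192,
    "max_output_tokens": 8192,
    "input_cost_per_token": 0.0,   # no hosted endpoint, so no per-token cost
    "output_cost_per_token": 0.0,
    "litellm_provider": "ollama",
    "mode": "chat",
}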
litellm/utils.py
Outdated
@@ -6620,7 +6620,7 @@ def _get_max_position_embeddings(model_name):
             raise Exception()
     except:
         raise Exception(
-            "This model isn't mapped yet. Add it here - https://github.com/BerriAI/litellm/blob/main/model_prices_and_context_window.json"
+            f"Model {model} from provider {custom_llm_provider} isn't mapped yet. Add it here - https://github.com/BerriAI/litellm/blob/main/model_prices_and_context_window.json"
@kmheckel this would cause another error - if custom_llm_provider isn't found from 'get_llm_provider'
I'll remove the custom_llm_provider reference in that case.
changed error message
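The revised message presumably drops the provider reference, along these lines (a sketch of the direction discussed above, not necessarily the exact final wording):

# Sketch: error message including only the model name.
raise Exception(
    f"Model {model} isn't mapped yet. Add it here - "
    "https://github.com/BerriAI/litellm/blob/main/model_prices_and_context_window.json"
)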
Title
Updated Ollama cost models to include LLaMa3 and Mistral/Mixtral Instruct series
Relevant issues
None currently open.
Type
🆕 New Feature
Changes
Updated model_prices_and_context_window.json to include chat endpoints for LLaMa3 and Mistral/Mixtral Instruct.
This enables users to run any LLaMa3/Mistral/Mixtral finetune in chat mode, for example by doing
ollama cp <finetune name> <llama3/mistral-7B-Instruct-v0.1>
Also includes a small error message modification to include the model and provider name in the error message for custom LLM providers. This makes debugging easier for users, as they can see whether it's just a typo or whether they need to update the cost model file.
Testing
To test, instructions from this AutoGen tutorial mostly suffice: https://microsoft.github.io/autogen/docs/topics/non-openai-models/local-litellm-ollama/
ollama serve
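Beyond the proxy-based tutorial above, a minimal direct check might look like this (a sketch; assumes ollama serve is running on the default port 11434 and the llama3 model has been pulled):

import litellm

# Sketch: direct completion call against a local Ollama server.
response = litellm.completion(
    model="ollama/llama3",
    messages=[{"role": "user", "content": "Say hello."}],
    api_base="http://localhost:11434",
    stream=False,
)
print(response.choices[0].message.content)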
Notes
From my understanding, this pull request removes the need to specify custom pricing for these models via a config.yaml on the command line: https://docs.litellm.ai/docs/proxy/custom_pricing
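For example (a sketch, assuming the new mapping is picked up), per-token cost lookup should now return zero without any extra configuration:

import litellm

# Sketch: cost lookup uses the bundled mapping, no config.yaml override needed.
prompt_cost, completion_cost = litellm.cost_per_token(
    model="ollama/llama3",
    prompt_tokens=100,
    completion_tokens=50,
)
print(prompt_cost, completion_cost)  # expected: 0.0 0.0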
Tested locally to confirm functionality, with streaming set to False as part of the LiteLLM interface to DataDreamer.
https://github.com/datadreamer-dev/DataDreamer/blob/0.35.0/src/llms/_litellm.py
No concerns or substantial modifications; update simply adjusts metadata for newer Ollama models.
Pre-Submission Checklist (optional but appreciated):
OS Tests (optional but appreciated):
Not tested on other operating systems, but this change neither breaks nor fixes other open issues with the Ollama-->LiteLLM integration.