
Add support for Groq LLM #565

Open · andreibondarev opened this issue Apr 6, 2024 · 9 comments

Comments

@andreibondarev (Collaborator)

No description provided.

@mattlindsey (Contributor) commented Apr 22, 2024

I'll give it a try. This might already work using Langchain::LLM::OpenAI since the Groq technical FAQ says:

How to transition from OpenAI to Groq?
It's incredibly simple to transition! If you have an application already using OpenAI's API, all it takes is updating 3 variables: the API key, the endpoint, and the model!

Something like this, although this doesn't seem to work yet:

3.2.2 :019 > llm = Langchain::LLM::OpenAI.new(api_key: 'gsk_xxxx', llm_options: { uri_base: 'https://api.groq.com/openai/', model: 'mixtral-8x7b-32768' })
 =>
#<Langchain::LLM::OpenAI:0x0000000107b0ee08
...
3.2.2 :020 > llm.chat(messages: [{role: "user", content: "What is the meaning of life?"}]).completion
OpenAI HTTP Error (spotted in ruby-openai 6.4.0): {"error"=>{"message"=>"The model `gpt-3.5-turbo` does not exist or you do not have access to it.", "type"=>"invalid_request_error", "code"=>"model_not_found"}}
/Users/mattlindsey/.rvm/gems/ruby-3.2.2/gems/faraday-2.9.0/lib/faraday/response/raise_error.rb:22:in `on_complete': the server responded with status 404 (Faraday::ResourceNotFound)

Possibly the ruby-openai gem needs to support it somewhere around here: https://github.com/alexrudall/ruby-openai/blob/f8a4482f7012e27f791c9259bde6fb1cda191e82/lib/openai/http.rb#L87
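If it helps, ruby-openai already lets you point the client at another host via uri_base. A minimal sketch calling Groq through ruby-openai directly (bypassing langchainrb), assuming ruby-openai 6.x and that Groq's endpoint accepts the same request shape:

  require "openai"

  client = OpenAI::Client.new(
    access_token: ENV["GROQ_API_KEY"],
    uri_base: "https://api.groq.com/openai/"
  )

  response = client.chat(
    parameters: {
      # Must be set explicitly; the implicit default is an OpenAI-only model
      model: "mixtral-8x7b-32768",
      messages: [{role: "user", content: "What is the meaning of life?"}]
    }
  )
  puts response.dig("choices", 0, "message", "content")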

@andreibondarev (Collaborator, Author)

@mattlindsey There's no way they're hosting OpenAI models; other than GPT-2, none of them are open source. Groq only hosts open-source models: https://console.groq.com/docs/models.

@mattlindsey (Contributor) commented Apr 22, 2024

@andreibondarev Actually, the above example seems to work great with this change to the DEFAULTS:

  class OpenAI < Base
    DEFAULTS = {
      n: 1,
      temperature: 0.0,
      chat_completion_model_name: "mixtral-8x7b-32768", # was "gpt-3.5-turbo"
      embeddings_model_name: "text-embedding-3-small"
    }.freeze

Output:

3.2.2 :007 > llm = Langchain::LLM::OpenAI.new(api_key: 'gsk_xxxx', llm_options: { uri_base: 'https://api.groq.com/openai/', model: 'mixtral-8x7b-32768' })
 =>
#<Langchain::LLM::OpenAI:0x0000000109c1d518
...
3.2.2 :008 >
3.2.2 :009 > llm.chat(messages: [{role: "user", content: "What is the meaning of life?"}]).completion
 => "The meaning of life is a question that has puzzled philosophers, scientists, and thinkers for centuries. There is no one definitive answer, as the meaning of life is subjective and can vary greatly from person to person. However, here are some possible

So it should work once something is fixed: the initializer doesn't seem to be respecting the `model` param.
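A possible workaround that avoids editing the class, assuming Langchain::LLM::OpenAI's constructor accepts a default_options hash that is merged over DEFAULTS (a sketch, not verified here):

  llm = Langchain::LLM::OpenAI.new(
    api_key: ENV["GROQ_API_KEY"],
    llm_options: {uri_base: "https://api.groq.com/openai/"},
    default_options: {chat_completion_model_name: "mixtral-8x7b-32768"}
  )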

@andreibondarev (Collaborator, Author) commented Apr 22, 2024

@mattlindsey I don't think it's a good idea. It would be kind of odd to instantiate a Langchain::LLM::OpenAI class and then interface with the Mixtral model via Groq.

@mattlindsey (Contributor)

@andreibondarev That seems to be what they tell you to do in their FAQ, and it works (with a fix, it seems). But maybe you'd prefer a dedicated Groq LLM class.

@mattlindsey (Contributor) commented Apr 23, 2024

I discovered that passing the model per call, as follows, currently works, in case anyone needs it now:

llm = Langchain::LLM::OpenAI.new(api_key: 'your_groq_key_here', llm_options: { uri_base: 'https://api.groq.com/openai/' })

llm.chat(messages: [{role: "user", content: "Tell me a story?"}], model: 'mixtral-8x7b-32768').completion

This does not work, however (it gives `undefined method 'encode'` in `openai_validator.rb:73:in 'token_length'`):

llm.embed(text: "foo bar", model: 'mixtral-8x7b-32768').embedding
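The likely cause, judging only from the error location (an assumption, not confirmed in this thread): the OpenAI token-length validator looks up a tiktoken encoder by model name, no encoding is registered for Groq-hosted model names, so the lookup returns nil and the later .encode call fails. A sketch, assuming the validator uses tiktoken_ruby:

  require "tiktoken_ruby"

  Tiktoken.encoding_for_model("gpt-3.5-turbo")      # => an encoder; #encode works
  Tiktoken.encoding_for_model("mixtral-8x7b-32768") # => likely nil, hence `undefined method 'encode'`

(Note the model also wouldn't help here: mixtral-8x7b-32768 is a chat model, not an embedding model.)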

@drnic (Contributor) commented Apr 26, 2024

For now (for chat completion) it seems Groq Cloud is matching OpenAI's API, so the Langchain Groq adapter could use the existing ruby-openai gem dependency for the subset of API features currently available at https://api.groq.com/openai/v1.

The test/fixtures/vcr_cassettes folder of groq-ruby should give some examples of API calls that are available.


I'd suggest creating a Langchain Groq class (at least to explicitly document which features are known to be available) and trying ruby-openai or just low-level Faraday calls.
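For the low-level Faraday route, a minimal sketch (hypothetical, not the eventual adapter), assuming Groq's OpenAI-compatible chat-completions endpoint and Faraday 2.x's built-in JSON middleware:

  require "faraday"

  conn = Faraday.new(url: "https://api.groq.com") do |f|
    f.request :json   # encode request bodies as JSON
    f.response :json  # parse response bodies into a Hash
    f.headers["Authorization"] = "Bearer #{ENV["GROQ_API_KEY"]}"
  end

  resp = conn.post("/openai/v1/chat/completions", {
    model: "mixtral-8x7b-32768",
    messages: [{role: "user", content: "Say hello."}]
  })
  puts resp.body.dig("choices", 0, "message", "content")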

@mattlindsey (Contributor) commented Apr 26, 2024

@andreibondarev The following should work pretty easily, and I can do a PR if you want:

module Langchain::LLM
  # LLM interface for Groq's OpenAI-compatible service
  #
  # Usage:
  #    groq = Langchain::LLM::GroqOpenAI.new(
  #      api_key: ENV["GROQ_API_KEY"],
  #      llm_options: {},
  #      default_options: {}
  #    )
  class GroqOpenAI < OpenAI
    # ...
  end
end
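A hedged sketch of what the subclass body might contain — the endpoint default and DEFAULTS override below are illustrative assumptions, not code from this issue:

  module Langchain::LLM
    class GroqOpenAI < OpenAI
      DEFAULTS = OpenAI::DEFAULTS.merge(
        chat_completion_model_name: "mixtral-8x7b-32768" # a Groq-hosted model
      ).freeze

      def initialize(api_key:, llm_options: {}, default_options: {})
        # Point ruby-openai at Groq's OpenAI-compatible endpoint unless overridden,
        # and pass our DEFAULTS explicitly since the parent resolves its own constant
        super(
          api_key: api_key,
          llm_options: {uri_base: "https://api.groq.com/openai/"}.merge(llm_options),
          default_options: DEFAULTS.merge(default_options)
        )
      end
    end
  end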

P.S. The Assistant and the Weather tool appear to work well using the OpenAI class now with model: 'mixtral-8x7b-32768', since chat and (apparently) function calling are supported.

andreibondarev linked a pull request Apr 29, 2024 that will close this issue
@drnic (Contributor) commented May 7, 2024

For anyone else who just wants a quick Groq solution and might not know where the different defaults/overrides go (e.g. me), this seems to work:

  @groq = Langchain::LLM::OpenAI.new(
    api_key: ENV["GROQ_API_KEY"],
    llm_options: {
      uri_base: "https://api.groq.com/openai/"
    },
    default_options: {
      chat_completion_model_name: "llama3-70b-8192"
    }
  )

  messages = [
    {role: "system", content: "I like to solve maths problems."},
    {role: "user", content: "What is 2+2?"}
  ]
  @groq.chat(messages:).completion
  # => "That's an easy one! The answer is... 4!"
