chat(hash) to also return raw response + header object #9

drnic · 2024-04-26T20:39:11Z

Currently chat(messages) returns a hash {role: "assistant", content: "Welcome"}, and there is no way to access the raw response object, which contains usage data:

"usage":{"prompt_tokens":163,"prompt_time":0.059,"completion_tokens":481,"completion_time":1.591,"total_tokens":644,"total_time":1.65}

Nor does it return the HTTP header information that contains rate limit stats:

x-ratelimit-limit-requests: "14400"
x-ratelimit-limit-tokens: "7000"
x-ratelimit-remaining-requests: "14399"
x-ratelimit-remaining-tokens: "6818"
x-ratelimit-reset-requests: "6s"
x-ratelimit-reset-tokens: "1.559999999s"
x-request-id: "req_01hwe17kc9efg93xtwk30d5tx4"

Can we change chat() to return multiple response objects?

message, response, headers = @client.chat("Hello?")
# if you just want one
message, _ = @client.chat("Hello?")

Perhaps this mode could be enabled via a config option: @client.chat("Hello?", metadata: true)
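To make the proposal concrete, here is a minimal sketch of how an opt-in `metadata: true` flag might behave. It is not the gem's implementation: `SketchClient` and the injected `transport` lambda are hypothetical names, the `choices` layout of the response body is assumed, and the hashes use string keys (straight from `JSON.parse`) rather than the symbol keys shown above.

```ruby
require "json"

# Hypothetical sketch only; the real client may be structured differently.
# `transport` stands in for the HTTP layer and returns [body_string, headers_hash].
class SketchClient
  def initialize(transport)
    @transport = transport
  end

  # Returns only the message hash by default (today's behaviour),
  # or [message, response, headers] when metadata: true is passed.
  def chat(prompt, metadata: false)
    body, headers = @transport.call(prompt)
    response = JSON.parse(body)
    # Assumed response layout: the message lives under choices[0].message.
    message = response.fetch("choices").first.fetch("message")
    metadata ? [message, response, headers] : message
  end
end

# Stubbed transport so the example runs without network access.
stub = lambda do |_prompt|
  body = JSON.generate(
    "choices" => [{ "message" => { "role" => "assistant", "content" => "Welcome" } }],
    "usage" => { "prompt_tokens" => 163, "completion_tokens" => 481, "total_tokens" => 644 }
  )
  [body, { "x-ratelimit-remaining-tokens" => "6818" }]
end

client = SketchClient.new(stub)
message, response, headers = client.chat("Hello?", metadata: true)
puts message["content"]
puts response["usage"]["total_tokens"]
puts headers["x-ratelimit-remaining-tokens"]
```

Keeping the flag opt-in means existing callers that expect a single hash keep working, while callers that want usage data or rate limit headers can destructure the three values.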


inspire22 · 2024-05-14T04:07:58Z

I'd love this too! I'm currently struggling to stay inside their rate limits, and it's hard without a better way to see their headers. I think it should be enabled by default, or at least auto-logged with metadata: true, but it would be better to return it as the other result, IMO.
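If the headers were exposed, staying inside the limits could look roughly like the sketch below. The header names come from the issue text above. Everything else is an assumption: `parse_reset` and `wait_seconds` are hypothetical helpers, and the reset values are assumed to be durations built from h, m, s, and ms units, such as "6s" or "1.559999999s".

```ruby
# Hypothetical helpers; not part of any existing client.
UNIT_SECONDS = { "h" => 3600.0, "m" => 60.0, "s" => 1.0, "ms" => 0.001 }.freeze

# Converts a reset value such as "6s" or "1.559999999s" into seconds.
# Assumes only h, m, s and ms units; anything else raises ArgumentError.
def parse_reset(value)
  parts = value.scan(/(\d+(?:\.\d+)?)(ms|h|m|s)/)
  unless !parts.empty? && parts.map(&:join).join == value
    raise ArgumentError, "unrecognised duration: #{value.inspect}"
  end
  parts.sum { |number, unit| number.to_f * UNIT_SECONDS.fetch(unit) }
end

# Seconds to wait before the next call, or 0.0 if budget remains.
# `tokens_needed` is the caller's own estimate for the next request.
def wait_seconds(headers, tokens_needed: 1)
  waits = []
  if headers.fetch("x-ratelimit-remaining-requests", "1").to_i < 1
    waits << parse_reset(headers.fetch("x-ratelimit-reset-requests"))
  end
  if headers.fetch("x-ratelimit-remaining-tokens", tokens_needed.to_s).to_i < tokens_needed
    waits << parse_reset(headers.fetch("x-ratelimit-reset-tokens"))
  end
  waits.max || 0.0
end

# Header values copied from the issue description.
headers = {
  "x-ratelimit-remaining-requests" => "14399",
  "x-ratelimit-remaining-tokens" => "6818",
  "x-ratelimit-reset-requests" => "6s",
  "x-ratelimit-reset-tokens" => "1.559999999s"
}

puts wait_seconds(headers)                      # enough budget left, no wait
puts wait_seconds(headers, tokens_needed: 7000) # more tokens than remain, wait for the token reset
```

A caller could then `sleep(wait_seconds(headers, tokens_needed: estimate))` before the next request, which only works if chat() hands the headers back in the first place.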

bugloper mentioned this issue Apr 28, 2024

Multiple response objects #10

Closed
