
Disables token sampling when temperature set to 0 #200

Closed
wants to merge 10 commits into from

Conversation

@akelch11 commented Jan 22, 2024

Disable token sampling when temperature = 0, PR addressing Issue #197

Problem/Issue: This PR turns off token sampling in the next-token chooser classes (NextTokenChooser, HeterogeneousNextTokenChooser) when they are initialized with a temperature of 0, as outlined in #197. In that case LLM output should be deterministic and use greedy token selection, emitting the token with the highest log probability in the logit distribution at each step.

Solution: The sampling and do_sample flags in NextTokenChooser and HeterogeneousNextTokenChooser are set to False when the temperature is 0, so that greedy token selection is used and the results are deterministic. Input validation, which previously did not account for a temperature of 0 and rejected it, was also updated so that 0 is no longer treated as invalid. A simplified sketch of the intended behavior is shown below.
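The following is a minimal, hypothetical sketch of the greedy-vs-sampling toggle described above (the class name SimpleNextTokenChooser and its signature are illustrative only; the real NextTokenChooser in lorax_server takes many more parameters):

```python
# Illustrative sketch only -- not the actual lorax_server implementation.
import torch


class SimpleNextTokenChooser:
    def __init__(self, temperature: float = 1.0, do_sample: bool = False):
        # A temperature of 0 forces greedy decoding regardless of do_sample.
        self.do_sample = do_sample and temperature > 0
        self.temperature = temperature

    def __call__(self, logits: torch.Tensor) -> torch.Tensor:
        if not self.do_sample:
            # Greedy: always pick the token with the highest logit (deterministic).
            return torch.argmax(logits, dim=-1)
        # Sampling: scale logits by temperature and draw from the distribution.
        probs = torch.softmax(logits / self.temperature, dim=-1)
        return torch.multinomial(probs, num_samples=1).squeeze(-1)
```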

Testing: A new test was added that runs the default CausalLM with next-token choosers initialized with temperature = 0 and checks that each generated token's log probability is the maximum of its distribution. As of 1/22, all server tests pass except those that require authenticating with a HuggingFace token to access Llama 2; assistance with setting up that login would be appreciated.
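As a rough illustration of that per-token check (the helper name and arguments are placeholders, not the repo's actual fixtures):

```python
# Hypothetical helper illustrating the determinism check; `logits` is the
# model's distribution for one decoding step and `chosen_token` the token id
# that was generated at that step.
import torch


def assert_greedy_choice(logits: torch.Tensor, chosen_token: int) -> None:
    log_probs = torch.log_softmax(logits, dim=-1)
    # The chosen token's log probability must equal the maximum of the distribution.
    assert torch.isclose(log_probs[chosen_token], log_probs.max())
```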

Fixes #197

Before submitting

  • [ ] This PR fixes a typo or improves the docs (you can dismiss the other checks if that's the case).
  • Was this discussed/approved via a GitHub issue or the discord / slack channel? Please add a link
    to it if that's the case: Disable sampling when temperature=0 (#197)
  • Did you write any new necessary tests?

Who can review?

Anyone in the community is free to review the PR once the tests have passed. Feel free to tag
members/contributors who may be interested in your PR.

@akelch11 (Author) commented Jan 22, 2024

@tgaddair Could a maintainer please approve the running of the test workflows?

Also, I'd like to go ahead and write a test in test_tokens.py that shows the output is deterministic and matches the token with the highest logit in the distribution. I see some tests in `server/tests/models/test_causal_lm.py` that initialize models and generate tokens, but I don't see an easy place to set the temperature parameters. Would anyone familiar with this be able to provide input to help write this test?

@tgaddair (Contributor) commented:

Approved! :)

@akelch11 (Author) commented:

The failing tests have to do with not being able to log into HuggingFace to use the Llama model -- how would one go about giving the tests access to the HuggingFace login token that the repo uses?

@akelch11 mentioned this pull request Jan 23, 2024
@tgaddair (Contributor) commented:

Hey @akelch11, apologies for the failing test. What I need to do is disable those particular tests when run from a forked repo. It's mostly a GitHub Action change. For now, though, we can ignore those tests, as the others will have run.

@tgaddair (Contributor) left a comment:

Looks great!

Can you also update clients/python/lorax/types.py:91 to:

if v is not None and v < 0:
    raise ValidationError("`temperature` must be non-negative")

from lorax_server.utils.lora import AdapterBatchData
from lorax_server.pb import generate_pb2
from lorax_server.models.causal_lm import CausalLM, CausalLMBatch
from tests.models.test_causal_lm import default_causal_lm, default_causal_lm_batch

Contributor commented:

Imports from tests can be a little weird. The recommended thing to do here, since these are fixtures, would be to move them into conftest.py. Then you can pass them as arguments to your test functions without needing to import them (see usage of default_pb_parameters as an example).
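A minimal sketch of that conftest.py pattern (the fixture and test names here are hypothetical, chosen only to show the mechanics):

```python
# server/tests/conftest.py (hypothetical fixture name)
import pytest


@pytest.fixture
def zero_temperature():
    # Value shared by tests that exercise greedy decoding.
    return 0.0


# server/tests/models/test_tokens.py -- pytest injects the fixture by name,
# so no import from conftest.py (or from another test module) is needed:
def test_zero_temperature_fixture(zero_temperature):
    assert zero_temperature == 0.0
```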

Contributor commented:

Yes, looks like the current failing test is failing due to this issue.

Author commented:

Thanks for clarifying, will move test there and refactor it

@@ -88,7 +88,7 @@ def valid_seed(cls, v):

     @validator("temperature")
     def valid_temp(cls, v):
-        if v is not None and v <= 0:
+        if v is not None and v < 0:
Contributor commented:

Nit: please change validation error to read "temperature must be non-negative".
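Putting the two suggestions together, the updated validator would read roughly as follows (the trailing return follows the usual pydantic validator pattern and is assumed here):

```python
@validator("temperature")
def valid_temp(cls, v):
    # Allow 0 (greedy decoding); only negative temperatures are invalid.
    if v is not None and v < 0:
        raise ValidationError("`temperature` must be non-negative")
    return v
```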

@tgaddair (Contributor) commented:

@akelch11, server tests look good (just the expected permissions failures). Looks like one of the Python client tests needs to be updated to account for the new constraints.

@tgaddair (Contributor) commented:

Hey @akelch11, are you able to fix the remaining client test?

@akelch11 (Author) commented Jan 25, 2024 via email

@tgaddair (Contributor) commented:

Thanks @akelch11! No rush :)

@prd-tuong-nguyen commented May 13, 2024

@tgaddair any update on this PR? I really need this feature :( Do you have another way to disable sampling in the current version?

@tgaddair (Contributor) commented:

Hey @prd-tuong-nguyen, the contributor to this one went dark, but I can definitely pick this up and close it out.

For now, setting temperature to 1 (default) and keeping do_sample=False should make results deterministic.
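For example, a request along these lines should already behave greedily (a sketch only; it assumes the lorax Python client's generate call accepts do_sample and temperature keyword arguments, and the endpoint URL is a placeholder):

```python
from lorax import Client

# Placeholder endpoint; point this at your running LoRAX server.
client = Client("http://127.0.0.1:8080")

# With do_sample=False (the default) and temperature left at 1, decoding is
# greedy, so repeated calls return the same output for the same prompt.
response = client.generate(
    "Explain greedy decoding in one sentence.",
    max_new_tokens=32,
    do_sample=False,
    temperature=1,
)
print(response.generated_text)
```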

@prd-tuong-nguyen commented:

@tgaddair cool, thank u

@tgaddair (Contributor) commented:

@prd-tuong-nguyen this has now landed in #467.

@tgaddair closed this May 14, 2024