[stdlib] add b64decode #2364

mikowals · 2024-04-21T16:03:34Z

Followed the decode algorithm from the same paper used for b64encode.

The Llama3 tokenizer.model stores the tokens with base64 encoding so demand for this may increase. 😃

Signed-off-by: Michael Kowalski <1331470+mikowals@users.noreply.github.com>

JoeLoser

Awesome, thank you!

JoeLoser · 2024-04-27T14:52:08Z

stdlib/test/base64/test_base64.mojo

 from testing import assert_equal


 def test_b64encode():
+    print("== test_b64encode")


Suggestion Drop the print here and in the decode function below. The print(...) is only needed in FileCheck-style tests so the tool has something to latch onto for its regex-matching internally.

I went ahead and removed these prints when I imported the PR internally since it was trivial to change, so don't feel the need to do anything. 😄

JoeLoser · 2024-04-28T13:27:11Z

✅🟣 This contribution has been merged 🟣✅

Hey @mikowals,

Thanks so much for the contribution! 🎉

We're moving to a new infrastructure for merging contributions to Mojo (we're using a tool called Copybara), and your contribution has now been merged into our internal copy of the Mojo Standard Library. I've added the "merged-internally" label on this PR.

The changes in this PR will appear here in the mojo repo nightly branch when we do our next outbound synchronization at the time that the next Mojo nightly is released. That should happen on Monday (tomorrow).

Please let me know if you have any questions or concerns.

[External] [stdlib] add b64decode Followed the decode algorithm from the same paper used for `b64encode`. The Llama3 `tokenizer.model` stores the tokens with base64 encoding so demand for this may increase. 😃 ORIGINAL_AUTHOR=Michael Kowalski <1331470+mikowals@users.noreply.github.com> PUBLIC_PR_LINK=modularml#2364 --------- Co-authored-by: Michael Kowalski <1331470+mikowals@users.noreply.github.com> Closes modularml#2364 MODULAR_ORIG_COMMIT_REV_ID: de91cca69272570a52fcbf28a5c51c8d7fe75364

[External] [stdlib] add b64decode Followed the decode algorithm from the same paper used for `b64encode`. The Llama3 `tokenizer.model` stores the tokens with base64 encoding so demand for this may increase. 😃 ORIGINAL_AUTHOR=Michael Kowalski <1331470+mikowals@users.noreply.github.com> PUBLIC_PR_LINK=#2364 --------- Co-authored-by: Michael Kowalski <1331470+mikowals@users.noreply.github.com> Closes #2364 MODULAR_ORIG_COMMIT_REV_ID: de91cca69272570a52fcbf28a5c51c8d7fe75364

JoeLoser · 2024-04-30T01:05:25Z

Closing as this got merged into the latest nightly during our outbound sync today (4/29/24) - see decdd0c.

[External] [stdlib] add b64decode Followed the decode algorithm from the same paper used for `b64encode`. The Llama3 `tokenizer.model` stores the tokens with base64 encoding so demand for this may increase. 😃 ORIGINAL_AUTHOR=Michael Kowalski <1331470+mikowals@users.noreply.github.com> PUBLIC_PR_LINK=modularml#2364 --------- Co-authored-by: Michael Kowalski <1331470+mikowals@users.noreply.github.com> Closes modularml#2364 MODULAR_ORIG_COMMIT_REV_ID: de91cca69272570a52fcbf28a5c51c8d7fe75364

mikowals requested a review from a team as a code owner April 21, 2024 16:03

mikowals force-pushed the base64-decode branch 4 times, most recently from 5e7b1c5 to 5ceb47c Compare April 24, 2024 01:19

add b64decode

5c54d5a

Signed-off-by: Michael Kowalski <1331470+mikowals@users.noreply.github.com>

mikowals force-pushed the base64-decode branch from 7c0b99a to 5c54d5a Compare April 26, 2024 09:30

JoeLoser approved these changes Apr 27, 2024

View reviewed changes

JoeLoser added imported-internally Signals that a given pull request has been imported internally. merged-internally Indicates that this pull request has been merged internally labels Apr 27, 2024

JoeLoser closed this Apr 30, 2024

JoeLoser added the merged-externally Merged externally in public mojo repo label May 3, 2024

mikowals deleted the base64-decode branch May 7, 2024 20:53

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[stdlib] add b64decode #2364

[stdlib] add b64decode #2364

mikowals commented Apr 21, 2024

JoeLoser left a comment

JoeLoser Apr 27, 2024

JoeLoser Apr 28, 2024

JoeLoser commented Apr 28, 2024

JoeLoser commented Apr 30, 2024

[stdlib] add b64decode #2364

[stdlib] add b64decode #2364

Conversation

mikowals commented Apr 21, 2024

JoeLoser left a comment

Choose a reason for hiding this comment

JoeLoser Apr 27, 2024

Choose a reason for hiding this comment

JoeLoser Apr 28, 2024

Choose a reason for hiding this comment

JoeLoser commented Apr 28, 2024

JoeLoser commented Apr 30, 2024