Skip to content

Releases: huggingface/tokenizers

v0.19.1

17 Apr 21:37
Compare
Choose a tag to compare

What's Changed

Full Changelog: v0.19.0...v0.19.1

v0.19.0

17 Apr 08:51
Compare
Choose a tag to compare

What's Changed

Full Changelog: v0.15.2...v0.19.0

v0.19.0rc0

16 Apr 14:06
Compare
Choose a tag to compare
v0.19.0rc0 Pre-release
Pre-release

Bumping 3 versions because of this: https://github.com/huggingface/transformers/blob/60dea593edd0b94ee15dc3917900b26e3acfbbee/setup.py#L177

What's Changed

  • chore: Remove CLI - this was originally intended for local development by @bryantbiggs in #1442
  • [remove black] And use ruff by @ArthurZucker in #1436
  • Bump ip from 2.0.0 to 2.0.1 in /bindings/node by @dependabot in #1456
  • Added ability to inspect a 'Sequence' decoder and the AddedVocabulary. by @eaplatanios in #1443
  • 馃毃馃毃 BREAKING CHANGE 馃毃馃毃: (add_prefix_space dropped everything is using prepend_scheme enum instead) Refactor metaspace by @ArthurZucker in #1476
  • Add more support for tiktoken based tokenizers by @ArthurZucker in #1493
  • PyO3 0.21. by @Narsil in #1494
  • Remove 3.13 (potential undefined behavior.) by @Narsil in #1497
  • Bumping all versions 3 times (ty transformers :) ) by @Narsil in #1498

Full Changelog: v0.15.2...v0.19.0rc0

v0.15.2

12 Feb 02:35
Compare
Choose a tag to compare

What's Changed

Big shoutout to @rlrs for the fast replace normalizers PR. This boosts the performances of the tokenizers:
image

New Contributors

Full Changelog: v0.15.1...v0.15.2rc1

v0.15.1

22 Jan 16:49
Compare
Choose a tag to compare

What's Changed

New Contributors

Full Changelog: v0.15.0...v0.15.1

v0.15.1.rc0

18 Jan 16:34
888dd4b
Compare
Choose a tag to compare
v0.15.1.rc0 Pre-release
Pre-release

What's Changed

New Contributors

Full Changelog: v0.13.4.rc2...v0.15.1.rc0

v0.15.0

14 Nov 19:06
Compare
Choose a tag to compare

What's Changed

New Contributors

Full Changelog: v0.14.1...v0.15.0

v0.14.1

06 Oct 11:10
Compare
Choose a tag to compare

What's Changed

New Contributors

Full Changelog: v0.13.3...v0.14.1

v0.14.1rc1

05 Oct 13:56
Compare
Choose a tag to compare
v0.14.1rc1 Pre-release
Pre-release

What's Changed

New Contributors

Full Changelog: v0.13.4.rc2...v0.14.1rc1

v0.14.0

07 Sep 13:59
Compare
Choose a tag to compare

鈿狅笍 Reworks the release pipeline. Other breaking changes 鈿狅笍 :

  • #1335, AddedToken is reworked, is_special_token rename to special for consistency
  • feature http is now OFF by default, and depends on hf-hub instead of cached_path (updated cache directory, better sync implementation)
  • Removed SSL link on the python package, calling huggingface_hub directly instead.
  • New dependency : huggingface_hub (while we deprecate Tokenizer.from_pretrained(...) to Tokenizer.from_file(hugginngface_hub.hf_hub_download(MODEL_ID, "tokenizer.json")

What's Changed

New Contributors

Full Changelog: v0.13.3...v0.14.0