Broken collation order for Latin characters with diacritics #25219
Labels
Ignored rules
issue that does not follow the rules (no template, missing debug log, ...)
Triage: Needed
(managed by bot!) issue that was just created and needs someone looking at it
Bug report
Describe the bug
Here is a clear and concise description of what the problem is:
It seems that sorting order for movie titles that contain Latin characters with diacritics is buggy. I'll describe the problem I discovered with Polish language but this very likely applies to other Latin based scripts.
Expected Behavior
Here is a clear and concise description of what was expected to happen:
Expected character order for Polish language:
Actual Behavior
Actual order in movie title list (similar results in file manager)
Most diacritics are inversed with relation to their base letter and in addition letter
ł
is shifted to the endPossible Fix
As locale-aware sorting for Unicode has been a long solved problem I figure Kodi must be using an in house solution for whatever needs may be that requires fixing.
To Reproduce
Steps to reproduce the behavior:
To generate a test case I used the code:
Scan the above files with local NFO scraper and then verify order on movie title list.
Debuglog
The debuglog can be found here:
https://paste.kodi.tv/welogokofi.kodi
Screenshots
Here are some links or screenshots to help explain the problem:
Additional context or screenshots (if appropriate)
Here is some additional context or explanation that might help:
Your Environment
Used Operating system:
Android
iOS
tvOS
Linux
macOS
Windows
Windows UWP
Operating system version/name: Ubuntu 22.04, LibreELEC 12
Kodi version: 20.2, 21
Locale:
C.utf8
,en_US.utf8
,pl_PL.utf8
(same results with different locales)The text was updated successfully, but these errors were encountered: