Search/sort having trouble with special characters #258
A related bug from Radu talking about the registrations page:
Searching for "klo" doesn't find https://www.worldcubeassociation.org/results/p.php?i=2013KOSK01
I meant the search at persons.php
There's a pretty heated discussion about the Ł character on the MySQL bug tracker: https://bugs.mysql.com/bug.php?id=9604. There are a lot of confused or unsatisfying answers until you get to Peter Gulutzan's well-written explanation of what's going on here. The answer is that MySQL's utf8_unicode_ci collation uses the Unicode Collation Algorithm (UCA) 4.0.0, which doesn't treat Ł as level 1 equivalent to L (see http://www.unicode.org/Public/UCA/4.0.0/allkeys-4.0.0.txt). This was changed in UCA 4.1.0:
UCA 4.1.0 and above do treat Ł as level 1 equal to L (see jfly/translighterate#2 (comment) for some more information). One fix would be to change the collation of our tables to
Right now, our default table collation is ``: and the I think the right thing to do here would be to change all our table column COLLATEs to utf8_unicode_520_ci. @timhabermaas, @larspetrus, @FatBoyXPC do you have any experience making such a change? Anything we should worry about?
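To illustrate why Ł behaves differently from ordinary accented letters, here is a minimal Python sketch. It is not what MySQL's collations do internally; it only shows the analogous situation in Unicode normalization, where Ł has no decomposition into L plus a combining mark, so mark-stripping alone never maps it to L. The `EXTRA_FOLDS` table and the function names are hypothetical, introduced only for this example.

```python
import unicodedata

# Hypothetical table (an assumption for this sketch): letters that have no
# Unicode decomposition and therefore need an explicit mapping.
EXTRA_FOLDS = {"Ł": "L", "ł": "l"}


def strip_marks(text):
    """Decompose to NFD and drop combining marks. Ł passes through unchanged."""
    decomposed = unicodedata.normalize("NFD", text)
    return "".join(ch for ch in decomposed if not unicodedata.combining(ch))


def fold_for_search(text):
    """Apply the explicit folds, strip combining marks, then casefold."""
    replaced = "".join(EXTRA_FOLDS.get(ch, ch) for ch in text)
    return strip_marks(replaced).casefold()


if __name__ == "__main__":
    print(strip_marks("é") == "e")   # accent is a combining mark after NFD
    print(strip_marks("Ł") == "Ł")   # the stroke is not a combining mark
    print(fold_for_search("Łódź"))
```

A search that compares `fold_for_search(query)` against `fold_for_search(name)` would match an unaccented query against a name containing Ł, which is roughly the behavior the collation change is meant to give us on the database side.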
Unfortunately, it turns out our production version of MySQL is 5.5.49:
which doesn't support
It's hard for me to find out when support for EDIT: Updating our version of MySQL should be as simple as changing this one line in our Chef configuration: https://github.com/jfly/worldcubeassociation.org/blob/66a9530b6ab0e6dd23839a3f552862901363243c/chef/site-cookbooks/wca/recipes/default.rb#L74.
…code_520_ci`. This is part of thewca#258.
Note that thanks to #1489, we're actually using However, after reading this article, it sounds like we might want to switch to
Unsure if this is the same issue or should be a separate post, but searching for Denmark & Friends will show the comp with the name "Denmark & ; Friends" (except there's no space before the semicolon), so there seems to be an issue when un-escaping characters?
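One possible cause, offered only as a guess: the name is HTML-escaped twice somewhere on the way to the page. The Python sketch below assumes the text shown on the page is a literal `&amp;` entity, and it uses the standard library `html` module rather than anything from the site's own code. It shows how double escaping leaves a visible entity after the browser unescapes once.

```python
import html

name = "Denmark & Friends"

# Escaping once is what a template normally does before emitting HTML.
escaped_once = html.escape(name)

# Escaping the already-escaped string is the suspected bug (an assumption).
escaped_twice = html.escape(escaped_once)

# A browser unescapes exactly once when rendering the page.
shown_correct = html.unescape(escaped_once)
shown_buggy = html.unescape(escaped_twice)

if __name__ == "__main__":
    print(shown_correct)
    print(shown_buggy)
```

If this is what is happening, the fix would be to find the second escape (or a missing un-escape) rather than to change the search itself.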
https://www.worldcubeassociation.org/search?q=lapicki doesn't find https://www.worldcubeassociation.org/results/p.php?i=2006LAPI01.
Neither does https://www.worldcubeassociation.org/results/persons.php?eventId=&regionId=&pattern=lapicki&search=Search