Page MenuHomePhabricator

Links in emails: Parentheses ( or ) not recognized as part of URL - brackets should be URL-encoded
Open, LowPublic

Description

Author: nilbek

Description:
example e-mail

Links on emails are broken... Email is in English, I think bug is clear...


Version: unspecified
Severity: minor

Attached:

Picture1.png (844×1 px, 353 KB)

Details

Reference
bz38265

Event Timeline

bzimport raised the priority of this task from to Low.Nov 22 2014, 1:02 AM
bzimport added a project: MediaWiki-Email.
bzimport set Reference to bz38265.
bzimport added a subscriber: Unknown Object (MLST).

Which email client is that?
Could you attach such an email here as an attachment, please?

I'm closing this as INVALID as it is a bug in your email client.

nilbek wrote:

I'm using Outlook (live.com) as a client and as I know it is one of top clients in emailing.

Basically what nilbek is suggesting is that brackets should be URL encoded in email links. That seems sensible to me, so I'll reopen pending further discussion.

Makes sense. Thanks for correcting me!

Thunderbird email (shown below) and Gmail's website, both link the part of the string in brackets correctly.

5zVfolp.png (370×942 px, 63 KB)

So I think this would be considered a bug with "Outlook (live.com)", and I'm not sure where to file an upstream bug for that?

If we were to change the default URL encoding for all sent links with brackets,
so that instead of
https://meta.wikimedia.org/wiki/User:Quiddity_(WMF)/sandbox%C4%8Di%C4%87_(test)/Logo
we received a link to
https://meta.wikimedia.org/wiki/User:Quiddity_(WMF)/sandbox%C4%8Di%C4%87_%28test%29/Logo
that would make links with brackets harder to read for everyone that uses an email client that renders the links properly. I'm not sure if that is worth it, in balance.

However, @Dispenser has been putting together some great related research at https://en.wikipedia.org/wiki/User:Dispenser/Automatic_linking_problems so he might be able to comment on the more technical aspects/limitations, or provide better long-term recommendations?

See also: the since-resolved T13056: Closing bracket skipped in emailed links - I'm not sure if we just need to add < and > per https://gerrit.wikimedia.org/r/#/c/24291/ ?

Adding @Nemo_bis per the above gerrit patch.

Aklapper renamed this task from Links in emails: Parentheses ( not recognized as part of URL - brackets should be URL-encoded to Links in emails: Parentheses ( or ) not recognized as part of URL - brackets should be URL-encoded.Oct 19 2018, 9:27 PM

General advice: Append a _ when page name is terminated by . or ) or similar.

  • Then, after _, add a space or terminate.
  • When the system (many are recognizing and automatically linking URL) is identifying start of URL at http: and termination, they interprete . or ) and more as surrounding punctuation, not part of URL, not included in target.
  • If our URL is terminated by _ that one is taken as last character and part of generated link, if a space is following and will break URL identification.
  • If any wiki server is receiving _ that is taken as space, and will be stripped since there are no spaces at the end of page names etc.