[Libreoffice-ux-advise] [Bug 156507] Ability to remove non-printing/"atypical" characters in a stretch of text

bugzilla-daemon at bugs.documentfoundation.org
Thu Aug 24 09:15:53 UTC 2023


https://bugs.documentfoundation.org/show_bug.cgi?id=156507

--- Comment #7 from ⁨خالد حسني⁩ <khaled at libreoffice.org> ---
One person’s exotic character is another’s essential part of the text.
Removing ZWNJ from Persian text alters its meaning, removing ZWJ from emoji
sequences alters their meaning, and removing Unicode variation selectors from
CJK text alters its meaning. Removing BiDi control characters from text changes
its intended rendering and possibly its meaning.
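
To make this concrete, here is a minimal Python sketch of what a naive
"remove invisible characters" filter would do to such text. The helper name
strip_invisibles and the sample strings are assumptions made for illustration;
this is not a LibreOffice API or a proposed implementation.

```python
import unicodedata


def strip_invisibles(text: str) -> str:
    """Naively drop format characters (category Cf) and variation selectors.

    Hypothetical helper for illustration only; not a LibreOffice API.
    """
    return "".join(
        ch
        for ch in text
        if unicodedata.category(ch) != "Cf"
        and not unicodedata.name(ch, "").startswith("VARIATION SELECTOR")
    )


# Persian: prefix + ZWNJ (U+200C) + stem; the ZWNJ keeps the two parts unjoined.
persian = "\u0645\u06CC\u200C\u062E\u0648\u0627\u0647\u0645"
# Emoji: MAN + ZWJ (U+200D) + WOMAN + ZWJ + GIRL forms a single "family" sequence.
family = "\U0001F468\u200D\U0001F469\u200D\U0001F467"
# CJK: ideograph U+845B followed by VARIATION SELECTOR-17 (U+E0100).
cjk = "\u845B\U000E0100"
# BiDi: text wrapped in FIRST STRONG ISOLATE (U+2068) and
# POP DIRECTIONAL ISOLATE (U+2069).
bidi = "\u2068abc\u2069"

for label, sample in [("persian", persian), ("family", family),
                      ("cjk", cjk), ("bidi", bidi)]:
    cleaned = strip_invisibles(sample)
    # Every sample gets shorter: the filter removed code points that the
    # author of the text put there on purpose.
    print(label, len(sample), "->", len(cleaned), "code points")
```

In each case the filter silently turns the input into a different string: the
Persian word loses its non-joiner, the emoji sequence falls apart into three
separate emoji, the ideograph loses its variant selection, and the isolated
run loses its directional isolation.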

The notion here is fundamentally flawed: there is no such thing as exotic
characters in a multilingual and multicultural piece of software. This is a
very monolingual way of thinking.

Also, don’t copy text from PDF; that is your actual problem. PDF is not a text
exchange format, despite what PDF stakeholders want to sell people. If you get
“exotic” characters when copying text from any other source, there is more
than a 90% chance that they are an essential part of the text.
