[Poppler-bugs] [Bug 87215] evince can not find ü in attached PDF

bugzilla-daemon at freedesktop.org
Sun Jan 11 19:47:02 PST 2015


https://bugs.freedesktop.org/show_bug.cgi?id=87215

--- Comment #3 from Jason Crain <jason at aquaticape.us> ---
Created attachment 112107
  --> https://bugs.freedesktop.org/attachment.cgi?id=112107&action=edit
Remove combining characters from normalized text

This patch changes normalization so that combining characters are removed from
the normalized text.  This makes searching through TextPage::findText
insensitive to these characters.
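
As a rough illustration of the idea (not the patch itself, which is C++ and
uses poppler's own tables), the behaviour can be sketched in Python with the
standard unicodedata module. The names fold_for_search and find_folded are
hypothetical and exist only for this example.

```python
import unicodedata


def fold_for_search(text):
    """Compatibility-decompose text, then drop every combining character."""
    decomposed = unicodedata.normalize("NFKD", text)
    # combining() returns the canonical combining class; 0 means "not combining".
    return "".join(ch for ch in decomposed if unicodedata.combining(ch) == 0)


def find_folded(haystack, needle):
    """Search after folding both sides.

    The returned index refers to the folded haystack, not the original
    string; -1 means no match.
    """
    return fold_for_search(haystack).find(fold_for_search(needle))
```

With this folding, a precomposed "ü" (U+00FC), a decomposed "u" followed by
U+0308, and a plain "u" all compare equal, which is the sense in which the
search becomes insensitive to combining characters.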

It also renames unicodeNormalizeNFKC to unicodeNormalizeSearch to make it clear
that it no longer performs a regular NFKC normalization.

Renames decomp_compat to decomp_compat_base because it now strips combining
characters, leaving only base characters, in addition to compatibility
decomposition.
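
To show what "compatibility decomposition plus base characters only" means for
a single code point, here is a small Python sketch built on the decomposition
mappings exposed by the standard unicodedata module. It is an analogy under
stated assumptions, not the code in the patch: decomp_base is a hypothetical
name, and Hangul syllables, which decompose algorithmically rather than
through a mapping, are left untouched here.

```python
import unicodedata


def decomp_base(ch):
    """Expand one character through its decomposition mapping, recursively,
    and keep only the non-combining (base) characters."""
    mapping = unicodedata.decomposition(ch)
    if not mapping:
        # No further decomposition: keep base characters, drop combining ones.
        return "" if unicodedata.combining(ch) else ch
    parts = mapping.split()
    if parts[0].startswith("<"):
        # Compatibility mappings carry a tag such as <compat>; skip it.
        parts = parts[1:]
    return "".join(decomp_base(chr(int(code, 16))) for code in parts)
```

For example, the ligature U+FB01 expands to the two letters "fi", while
U+00FC and U+01D8 (u with diaeresis and acute) both reduce to a plain "u".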

Removes UnicodeCompTables.h and some compose functions.  They're no longer
needed since we're not recomposing the characters.

I'm not sure if UnicodeTypeTable.h and UnicodeCompTables.h are considered part
of the public interface.  They're included in the xpdf headers.  Albert, is it
OK to change these files in this way?
