[Poppler-bugs] [Bug 38739] Polish characters incorrectly extracted

bugzilla-daemon at freedesktop.org bugzilla-daemon at freedesktop.org
Sat Jan 31 12:02:41 PST 2015


https://bugs.freedesktop.org/show_bug.cgi?id=38739

--- Comment #7 from Urmas <davian818 at gmail.com> ---
The font used in the document has the following /Encoding:

/BaseEncoding /WinAnsiEncoding /Differences [ 1 /eogonek /zdot /zacute /cacute
/sacute /aogonek /Zdot /nacute /Sacute /Eogonek /Zacute ] /Type /Encoding

but it has the following /ToUnicode:

/CIDInit /ProcSet findresource begin
12 dict begin
begincmap
/CMapType 2 def
/CMapName/R1642 def
1 begincodespacerange
<00><ff>
endcodespacerange
11 beginbfrange
<01><01><0119>
<02><02><015c>
<03><03><017a>
<04><04><0107>
<05><05><015b>
<06><06><0105>
<07><07><015b>
<08><08><0144>
<09><09><015a>
<0a><0a><0118>
<0b><0b><0179>
endbfrange
endcmap
CMapName currentdict /CMap defineresource pop
end end

So why does your software interpret character #7 as 0x015B, and not as the
EXPLICITLY DEFINED /Zdot?
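
To make the disagreement between the two tables concrete, they can be compared
mechanically. The following is only a minimal sketch and is not Poppler code.
The /Differences array and the bfrange entries are copied from this comment;
the glyph-name table AGL is an assumption, typed in by hand from the Adobe
Glyph List, and the function names are made up for illustration.

```python
import re

# /Differences array body, as quoted in this comment.
differences = (
    "1 /eogonek /zdot /zacute /cacute /sacute /aogonek "
    "/Zdot /nacute /Sacute /Eogonek /Zacute"
)

# bfrange entries of the /ToUnicode CMap, as quoted in this comment.
bfrange = """
<01><01><0119>
<02><02><015c>
<03><03><017a>
<04><04><0107>
<05><05><015b>
<06><06><0105>
<07><07><015b>
<08><08><0144>
<09><09><015a>
<0a><0a><0118>
<0b><0b><0179>
"""

# Assumption: glyph name -> code point, copied by hand from the Adobe Glyph List.
AGL = {
    "eogonek": 0x0119, "zdot": 0x017C, "zacute": 0x017A, "cacute": 0x0107,
    "sacute": 0x015B, "aogonek": 0x0105, "Zdot": 0x017B, "nacute": 0x0144,
    "Sacute": 0x015A, "Eogonek": 0x0118, "Zacute": 0x0179,
}


def parse_differences(text):
    """Return {character code: glyph name} for a /Differences array body."""
    names = {}
    code = None
    for token in text.split():
        if token.startswith("/"):
            names[code] = token[1:]
            code += 1          # each name takes the next consecutive code
        else:
            code = int(token)  # a number restarts the code sequence
    return names


def parse_bfrange(text):
    """Return {character code: code point} for <lo><hi><dst> entries."""
    mapping = {}
    entry = r"<([0-9a-fA-F]+)><([0-9a-fA-F]+)><([0-9a-fA-F]+)>"
    for lo, hi, dst in re.findall(entry, text):
        start, end, base = int(lo, 16), int(hi, 16), int(dst, 16)
        for offset in range(end - start + 1):
            mapping[start + offset] = base + offset
    return mapping


def find_mismatches(diff_text, bfrange_text):
    """List codes where /ToUnicode disagrees with the glyph name."""
    names = parse_differences(diff_text)
    to_unicode = parse_bfrange(bfrange_text)
    result = []
    for code, name in sorted(names.items()):
        expected = AGL[name]
        actual = to_unicode.get(code)
        if actual != expected:
            result.append((code, name, expected, actual))
    return result


mismatches = find_mismatches(differences, bfrange)
for code, name, expected, actual in mismatches:
    got = f"U+{actual:04X}" if actual is not None else "no entry"
    print(f"code {code}: /{name} expects U+{expected:04X}, ToUnicode gives {got}")
```

With the data above this prints two lines:

code 2: /zdot expects U+017C, ToUnicode gives U+015C
code 7: /Zdot expects U+017B, ToUnicode gives U+015B

So under these assumptions the /ToUnicode table disagrees with the glyph names
for codes 2 and 7, and agrees with them for the other nine codes.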
