[Poppler-bugs] [Bug 52406] New: Wrong text extracted from attached example: 2012 extracted as 2512

bugzilla-daemon at freedesktop.org bugzilla-daemon at freedesktop.org
Mon Jul 23 16:55:47 PDT 2012


https://bugs.freedesktop.org/show_bug.cgi?id=52406

             Bug #: 52406
           Summary: Wrong text extracted from attached example: 2012
                    extracted as 2512
    Classification: Unclassified
           Product: poppler
           Version: unspecified
          Platform: Other
        OS/Version: All
            Status: NEW
          Severity: normal
          Priority: medium
         Component: general
        AssignedTo: poppler-bugs at lists.freedesktop.org
        ReportedBy: alevy at redhat.com


Created attachment 64556
  --> https://bugs.freedesktop.org/attachment.cgi?id=64556
wrong text extraction example: first page bottom left, copy year 2012 -> 2512

See the first page of the attached PDF. The document is in Hebrew (which is
surely related to the bug), but you don't need to understand Hebrew: the Hebrew
text is extracted correctly, and the problem is with the numeric text.

The bottom left of the first page contains the following text (typed by hand
from the document as correctly *rendered* in evince):

2012
בפברואר
15

However, copying the text with the cursor and pasting produces the following
text:
‫15 בפברואר 2512‬

The Hebrew text and the day are the same, but the year is different: 2012 is
extracted as 2512.
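
The mismatch can be checked mechanically. The following is only a minimal
sketch, not part of poppler: the helper names digit_runs and compare_numbers
are hypothetical, and the two strings are simply the rendered and pasted text
quoted in this report. It compares the digit runs in both strings while
ignoring their order, since bidirectional text may legitimately reorder runs.

```python
import re


def digit_runs(text):
    """Return the sorted list of ASCII digit runs found in text."""
    return sorted(re.findall(r"[0-9]+", text))


def compare_numbers(rendered, extracted):
    """Return (missing, unexpected) digit runs, ignoring run order."""
    want = digit_runs(rendered)
    got = digit_runs(extracted)
    missing = [n for n in want if n not in got]
    unexpected = [n for n in got if n not in want]
    return missing, unexpected


# Text as rendered on the page (typed by hand, one run per line).
rendered = "2012\nבפברואר\n15"
# Text as pasted after copying, including the bidi embedding marks
# (U+202B RIGHT-TO-LEFT EMBEDDING, U+202C POP DIRECTIONAL FORMATTING).
extracted = "\u202b15 בפברואר 2512\u202c"

missing, unexpected = compare_numbers(rendered, extracted)
print("missing:", missing)        # missing: ['2012']
print("unexpected:", unexpected)  # unexpected: ['2512']
```

The day (15) survives extraction, so only the year shows up in the two lists.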
