[Libreoffice-bugs] [Bug 124191] New: Text copied from a PDF exported using Linux Libertine G is missing characters.

bugzilla-daemon at bugs.documentfoundation.org bugzilla-daemon at bugs.documentfoundation.org
Tue Mar 19 00:13:37 UTC 2019


https://bugs.documentfoundation.org/show_bug.cgi?id=124191

            Bug ID: 124191
           Summary: Text copied from a PDF exported using Linux Libertine
                    G is missing characters.
           Product: LibreOffice
           Version: 6.2.0.3 release
          Hardware: All
                OS: All
            Status: UNCONFIRMED
          Severity: normal
          Priority: medium
         Component: Printing and PDF export
          Assignee: libreoffice-bugs at lists.freedesktop.org
          Reporter: fz1844 at gmail.com

Description:
I use the Linux Libertine G font extensively in LibreOffice Writer to prepare
PDF eBooks. A while ago, I noticed that while trying to copy a paragraph from
such a PDF, the resulting text was missing characters, or sometimes had
duplicated characters.

Steps to Reproduce:
1. Create a new document.
2. Type in the following line: "The fire flying coffee left Quickly."
3. Make sure this line is using Linux Libertine G font.
4. Export the document to PDF.
5. Open the PDF. The text looks fine.
6. Select the text line in the PDF, and paste into Writer, or into any text
editor.
7. You will see something like this: "The fir flying coffe lft Quiickl."

Actual Results:
The fir flying coffe lft Quiickl.


Expected Results:
The fire flying coffee left Quickly.


Reproducible: Always


User Profile Reset: No



Additional Info:
The underlying text in the PDF, or perhaps some kind of translation layer or
lookup table for ligatures, is causing strange problems in text copied out of a
PDF that was created using fonts that support ligatures.

I found a book that I created with LibreOffice 3, back in the days when
ligatures were not supported, and it exports fine.

I also tried the same test with Microsoft Word (with ligature support enabled)
and it worked fine.

The problems with the text get worse as the document size grows. I have one
book of about 600 pages where the copy/pasted text is quite awful, but if you
take just that one page out of the document and put it in a new document, the
copy/pasted text is significantly better (not perfect though).

So it seems to be a problem that compounds as the document size grows.

-- 
You are receiving this mail because:
You are the assignee for the bug.
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <https://lists.freedesktop.org/archives/libreoffice-bugs/attachments/20190319/bb1ca30f/attachment.html>


More information about the Libreoffice-bugs mailing list