[Libreoffice-bugs] [Bug 32249] When importing PDF with text in it , it will be better to have a easy and fluent option to edit the imported Text

bugzilla-daemon at bugs.documentfoundation.org bugzilla-daemon at bugs.documentfoundation.org
Thu Jun 27 23:56:42 UTC 2019


https://bugs.documentfoundation.org/show_bug.cgi?id=32249

--- Comment #19 from V Stuart Foote <vstuart.foote at utsa.edu> ---
(In reply to Justin L from comment #18)
> ... So I only see one reasonable option and that
> is a function that allows a user to combine selected textboxes into one
> textbox.
> 

Yes, agree that would be an acceptable way to handle PDF source text runs
extracted from BT/ET blocks, or where /ActualText annotation is present.

But why first extract the text runs into Draw Text boxes, and then merging them
into one or more non-formattable Draw Text boxes? Seems like a different filter
import of the PDF text runs is needed.

Dumping the strings out to a Writer Paragraph object, either in bulk or
interactively, would be more functional.  And text runs dumped into a Paragraph
object, would allow assignment of direct formatting or style, with text
validation and word and line break cleanup.

Probably more efficient UI could evolve if done as a pop-out dialog to pick the
Draw Text box snippets, but could spin up a full Writer session and do the
same.

More often than not, folks simply want to reflow the text strings back into
their lexicographically correct sequence without too great a concern as to
original formatting of the source document generating the PDF.

We can't do that with much fidelity to the original source--so why bother?

Our other 'pdfium' based "insert" filter provides the text runs to document
canvas with good fidelity to the original layout. Though the object "break"
there has similar issues to the 'poppler' based import filter for text
handling.

-- 
You are receiving this mail because:
You are the assignee for the bug.
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <https://lists.freedesktop.org/archives/libreoffice-bugs/attachments/20190627/ee9f80d2/attachment.html>


More information about the Libreoffice-bugs mailing list