[Libreoffice-bugs] [Bug 32249] When importing PDF with text in it , it will be better to have a easy and fluent option to edit the imported Text
bugzilla-daemon at bugs.documentfoundation.org
bugzilla-daemon at bugs.documentfoundation.org
Thu Jun 27 18:38:15 UTC 2019
https://bugs.documentfoundation.org/show_bug.cgi?id=32249
--- Comment #18 from Justin L <jluth at mail.com> ---
Created attachment 152450
--> https://bugs.documentfoundation.org/attachment.cgi?id=152450&action=edit
PDF_import_testDoc.odg: exploring what combining textboxes could look like
I agree with Stuart's conclusion that monkeying with import to make larger
textboxes would be disastrous. So I only see one reasonable option and that is
a function that allows a user to combine selected textboxes into one textbox.
However, the results won't be pretty. Each character attribute change (size,
bold, font, etc.) becomes a separate textbox, and there is no way to identify
whether that ends the paragraph or not, although some content analysis
guesswork could approximate the majority of cases I guess. In any case, a LOT
of cleanup would be needed to reformat the text, since each character run is
treated as a separate paragraph and all paragraph spacing information is
missing.
The other option is to force the user to create their own textbox and
copy/paste the text from the PDF itself, but in that case all the character
properties are lost. So there does still seem to be an advantage of
consolidating textboxes into one, even if many excess paragraph markers need to
be deleted.
--
You are receiving this mail because:
You are the assignee for the bug.
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <https://lists.freedesktop.org/archives/libreoffice-bugs/attachments/20190627/3670c647/attachment-0001.html>
More information about the Libreoffice-bugs
mailing list