[Libreoffice-bugs] [Bug 137588] New: Wrong HTML copy/paste from webpage

bugzilla-daemon at bugs.documentfoundation.org bugzilla-daemon at bugs.documentfoundation.org
Mon Oct 19 09:22:25 UTC 2020


https://bugs.documentfoundation.org/show_bug.cgi?id=137588

            Bug ID: 137588
           Summary: Wrong HTML copy/paste from webpage
           Product: LibreOffice
           Version: 7.0.2.2 release
          Hardware: x86-64 (AMD64)
                OS: Linux (All)
            Status: UNCONFIRMED
          Severity: normal
          Priority: medium
         Component: LibreOffice
          Assignee: libreoffice-bugs at lists.freedesktop.org
          Reporter: alepoc69 at gmail.com

Description:
Garbled text when pasting from HTML pages into Libreoffice Writer/Calc 7. 

Steps to Reproduce:
1. Enter www.deepl.com
2. Choose English to German translation
3. Enter the sentence "The trees are big". You read "die Bäume sind Groß".
4. Copy (Ctrl+V) the German translation and paste it in Libreoffice Writer or
Calc.
4a. Alternate: Paste Special -> 
    Writer: it reads "Unknown Source" instead of HTML and it doesn't offer "As
HTML" but only "Unformatted text"
    Calc: it also provides "Use text import dialog", but I cannot find any
Character set that provides a correct encoding.


Actual Results:
The pasted text contains garbled chars: "die Bäume sind groÃ" 


Expected Results:
The correct text should be "die Bäume sind groß"



Reproducible: Always


User Profile Reset: No



Additional Info:
EASY SOLUTION: When the text source is unknown it would be better to leave ALL
the possible options to choose from, instead of only providing "unformatted
text".

Copy/paste from other pages in German (i.e. spiegel.de) are correctly
recognized as HTML and pasted accordingly into Writer/Calc.

It looks like a problem of deepl.com, but we should take care of these
anomalous pages and provide a viable solution. Copying from HTML is quite
common and it is possible deepl.com is not the only page where this happens. 

BTW, pasting this same text in any other text editor in Ubuntu works correctly.
Only Libreoffice doesn't manage it correctly.

-- 
You are receiving this mail because:
You are the assignee for the bug.
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <https://lists.freedesktop.org/archives/libreoffice-bugs/attachments/20201019/e78ff62a/attachment.htm>


More information about the Libreoffice-bugs mailing list