[poppler] pdftohtml -xml arabic pdf

Djari Imene djariimene03 at gmail.com
Thu Jun 6 13:46:42 UTC 2019


I have a problem when i use pdftohtml -xml with a arabic pdf that give me
one character in each line , how can i fix this problem ?
<text top="270" left="2245" width="5" height="12" font="3"><b>¡</b></text>
<text top="270" left="2242" width="3" height="12" font="3"><b>É </b></text>
<text top="270" left="2236" width="3" height="12" font="3"><b>GC</b></text>
<text top="270" left="2231" width="5" height="12" font="3"><b>e</b></text>
<text top="270" left="2225" width="6" height="12" font="3"><b>ù</b></text>
<text top="270" left="2217" width="8" height="12" font="3"><b>¢</b></text>
<text top="270" left="2215" width="2" height="12" font="3"><b> G</b></text>
<text top="270" left="2208" width="4" height="12" font="3"><b>d</b></text>
<text top="270" left="2202" width="6" height="12" font="3"><b>ù</b></text>
<text top="270" left="2198" width="4" height="12" font="3"><b>°</b></text>
<text top="270" left="2194" width="4" height="12" font="3"><b>Ñ</b></text>
<text top="270" left="2184" width="10" height="12" font="3"><b>â</b></text>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <https://lists.freedesktop.org/archives/poppler/attachments/20190606/d2226ed8/attachment.html>


More information about the poppler mailing list