<html>
<head>
<base href="https://bugs.freedesktop.org/" />
</head>
<body><table border="1" cellspacing="0" cellpadding="8">
<tr>
<th>Priority</th>
<td>medium
</td>
</tr>
<tr>
<th>Bug ID</th>
<td><a class="bz_bug_link
bz_status_NEW "
title="NEW --- - Mojibake when converting a pdf created with miktex on win"
href="https://bugs.freedesktop.org/show_bug.cgi?id=72127">72127</a>
</td>
</tr>
<tr>
<th>Assignee</th>
<td>poppler-bugs@lists.freedesktop.org
</td>
</tr>
<tr>
<th>Summary</th>
<td>Mojibake when converting a pdf created with miktex on win
</td>
</tr>
<tr>
<th>Severity</th>
<td>normal
</td>
</tr>
<tr>
<th>Classification</th>
<td>Unclassified
</td>
</tr>
<tr>
<th>OS</th>
<td>Linux (All)
</td>
</tr>
<tr>
<th>Reporter</th>
<td>seinsvergessen@gmail.com
</td>
</tr>
<tr>
<th>Hardware</th>
<td>x86 (IA32)
</td>
</tr>
<tr>
<th>Status</th>
<td>NEW
</td>
</tr>
<tr>
<th>Version</th>
<td>unspecified
</td>
</tr>
<tr>
<th>Component</th>
<td>utils
</td>
</tr>
<tr>
<th>Product</th>
<td>poppler
</td>
</tr></table>
<p>
<div>
<pre>I received a pdf file created by pdflatex / miktex on windows.
Running pdftotext 0.24.3 on it produces very strange results:
original text:
Die Organisation der Flugrettung in
Österreich
pdftotext output:
❉✐❡ ❖r❣❛♥✐s❛t✐♦♥ ❞❡r ❋❧✉❣r❡tt✉♥❣ ✐♥
Öst❡rr❡✐❝❤
Some sort of glyph conversion problem?
The problem is reproducible. I installed miktex on my windows VM, ran pdflatex
on the tex file and got the same results.
I did a number of checks on the pdf with adobe acrobat and various tools and it
seems valid. Unfortunately, I do not own the file and cannot attach it here.
pdfinfo says:
Creator: LaTeX with hyperref package
Producer: pdfTeX-1.40.12
pdffonts output:
[none] Type 3 Custom yes no
no 303 0
[none] Type 3 Custom yes no
no 304 0
[none] Type 3 Custom yes no
no 305 0
[none] Type 3 Custom yes no
no 306 0
UEIZYW+CMSY10 Type 1 Builtin yes yes
no 307 0
FRNIHB+CMSY8 Type 1 Builtin yes yes
no 308 0
[none] Type 3 Custom yes no
no 348 0
[none] Type 3 Custom yes no
no 349 0
[none] Type 3 Custom yes no
no 350 0
[none] Type 3 Custom yes no
no 435 0
[none] Type 3 Custom yes no
no 436 0
[none] Type 3 Custom yes no
no 470 0
[none] Type 3 Custom yes no
no 480 0
Nothing unusual in the tex file either.
\usepackage[german]{babel}
\usepackage[utf8]{inputenc}
\usepackage[T1]{fontenc}
\usepackage{amssymb, amsmath} %arithmetic symbols, math enhancement package
\usepackage[round]{natbib} %bibliography
\usepackage{graphicx} %figures
\usepackage{hyperref} %hyperlinks
%% additional packages
\usepackage{tipa} %phonetics
Strangely this problem does not occur if I create the pdf on Linux.</pre>
</div>
</p>
<hr>
<span>You are receiving this mail because:</span>
<ul>
<li>You are the assignee for the bug.</li>
</ul>
</body>
</html>