<html>
<head>
<base href="https://bugs.freedesktop.org/" />
</head>
<body><table border="1" cellspacing="0" cellpadding="8">
<tr>
<th>Priority</th>
<td>medium
</td>
</tr>
<tr>
<th>Bug ID</th>
<td><a class="bz_bug_link
bz_status_NEW "
title="NEW --- - xpdf utilities cannot open files with UTF-8 characters in file path on Windows"
href="https://bugs.freedesktop.org/show_bug.cgi?id=60517">60517</a>
</td>
</tr>
<tr>
<th>Assignee</th>
<td>poppler-bugs@lists.freedesktop.org
</td>
</tr>
<tr>
<th>Summary</th>
<td>xpdf utilities cannot open files with UTF-8 characters in file path on Windows
</td>
</tr>
<tr>
<th>Severity</th>
<td>minor
</td>
</tr>
<tr>
<th>Classification</th>
<td>Unclassified
</td>
</tr>
<tr>
<th>OS</th>
<td>Windows (All)
</td>
</tr>
<tr>
<th>Reporter</th>
<td>aurimas.dev@gmail.com
</td>
</tr>
<tr>
<th>Hardware</th>
<td>x86-64 (AMD64)
</td>
</tr>
<tr>
<th>Status</th>
<td>NEW
</td>
</tr>
<tr>
<th>Version</th>
<td>unspecified
</td>
</tr>
<tr>
<th>Component</th>
<td>utils
</td>
</tr>
<tr>
<th>Product</th>
<td>poppler
</td>
</tr></table>
<p>
<div>
<pre>Created <span class=""><a href="attachment.cgi?id=74453" name="attach_74453" title="PDF test files">attachment 74453</a> <a href="attachment.cgi?id=74453&action=edit" title="PDF test files">[details]</a></span>
PDF test files
pdfinfo and pdftotext (have not tested others) cannot open PDF files with UTF-8
characters in file path.
Environment:
Windows 7 Pro (x64)
poppler.0.22.0_win32: I've been having trouble compiling poppler myself, so I
got poppler.0.22.0_win32 binaries from
<a href="http://blog.alivate.com.au/poppler-windows/">http://blog.alivate.com.au/poppler-windows/</a> (perhaps this is not a problem with
official binaries, which I could not find)
Steps to reproduce:
1. Download PDF test files (or create PDF file with UTF-8 character in the
name, e.g. testα)
2. Open command prompt and navigate to the directory with PDF file
3. Run `chcp 65001` to activate UTF-8 codepage
4. Run `pdfinfo.exe testα.pdf`
Outcome:
`I/O Error: Couldn't open file 'testa.pdf': No such file or directory.`
Note that "testα.pdf" is converted to "testa.pdf"
Using the command line to open that same file with Adobe Acrobat Reader 11.0
worked just fine, so the characters are being correctly passed from the
commandline to the program.
Here's a summary of my tests
<span class="quote">>poppler.0.22.0_win32\bin\pdfinfo.exe test.pdf</span >
Tagged: no
Form: none
Pages: 1
Encrypted: no
Page size: 612 x 792 pts (letter)
Page rot: 0
File size: 14622 bytes
Optimized: no
PDF version: 1.4
<span class="quote">>poppler.0.22.0_win32\bin\pdfinfo.exe testα.pdf</span >
I/O Error: Couldn't open file 'testa.pdf': No such file or directory.
<span class="quote">>poppler.0.22.0_win32\bin\pdftotext.exe test.pdf</span >
<span class="quote">>poppler.0.22.0_win32\bin\pdftotext.exe testα.pdf</span >
I/O Error: Couldn't open file 'testa.pdf': No such file or directory.
Looking at poppler's code, it looks like a win32 version of PDFDoc constructor
is defined in PDFDoc.cc
```
#ifdef _WIN32
PDFDoc::PDFDoc(wchar_t *fileNameA, int fileNameLen, GooString *ownerPassword,
GooString *userPassword, void *guiDataA) {
```
but LocalPDFDocBuilder.cc calls the general PDFDoc constructor no matter what
(by always passing a GooString* instead of wchar_t*). Can't test this though,
since I'm having trouble compiling poppler.</pre>
</div>
</p>
<hr>
<span>You are receiving this mail because:</span>
<ul>
<li>You are the assignee for the bug.</li>
</ul>
</body>
</html>