<html> <head> <base href="https://bugs.freedesktop.org/"> </head> <body> <div> <a class="bz_bug_link bz_status_NEEDINFO " title="NEEDINFO - pdfunite introduces syntax error / illegal characters (as per pdfinfo)" href="https://bugs.freedesktop.org/show_bug.cgi?id=103873#c10">Comment # 10</a> on <a class="bz_bug_link bz_status_NEEDINFO " title="NEEDINFO - pdfunite introduces syntax error / illegal characters (as per pdfinfo)" href="https://bugs.freedesktop.org/show_bug.cgi?id=103873">bug 103873</a> from <a class="email" href="mailto:ajohnson@redneon.com" title="Adrian Johnson <ajohnson@redneon.com>"> Adrian Johnson</a> <pre>(In reply to Albert Astals Cid from <a href="show_bug.cgi?id=103873#c9">comment #9</a>) > Before commiting this patch, anyone can explain the > > "printf didn't accept invalid UTF8 sequences in the format string" > > part a bit more? What's invalid in that code? Only the first byte in a UTF-8 sequence can start with '11' (in binary). The second and subsequent bytes must start with '10'. 0xE2, 0xE3 is 11100010, 11100011 I assume the printf implementation is trying to parse the format string one character at a time in the current locale in order to pick off the conversion specifiers. I'm not sure what the standards say about this but it is probably best to avoid putting arbitrary binary sequences in the format string.</pre> </div> <hr> You are receiving this mail because: <ul> <li>You are the assignee for the bug.</li> </ul> </body> </html>