<html>
<head>
<base href="https://bugs.freedesktop.org/" />
</head>
<body><table border="1" cellspacing="0" cellpadding="8">
<tr>
<th>Bug ID</th>
<td><a class="bz_bug_link
bz_status_NEW "
title="NEW - journalctl unicode message split/ellipsis cuts part of utf8 symbol"
href="https://bugs.freedesktop.org/show_bug.cgi?id=89392">89392</a>
</td>
</tr>
<tr>
<th>Summary</th>
<td>journalctl unicode message split/ellipsis cuts part of utf8 symbol
</td>
</tr>
<tr>
<th>Product</th>
<td>systemd
</td>
</tr>
<tr>
<th>Version</th>
<td>unspecified
</td>
</tr>
<tr>
<th>Hardware</th>
<td>x86-64 (AMD64)
</td>
</tr>
<tr>
<th>OS</th>
<td>Linux (All)
</td>
</tr>
<tr>
<th>Status</th>
<td>NEW
</td>
</tr>
<tr>
<th>Severity</th>
<td>normal
</td>
</tr>
<tr>
<th>Priority</th>
<td>medium
</td>
</tr>
<tr>
<th>Component</th>
<td>general
</td>
</tr>
<tr>
<th>Assignee</th>
<td>systemd-bugs@lists.freedesktop.org
</td>
</tr>
<tr>
<th>Reporter</th>
<td>grossws@gmail.com
</td>
</tr>
<tr>
<th>QA Contact</th>
<td>systemd-bugs@lists.freedesktop.org
</td>
</tr></table>
<p>
<div>
<pre>journalctl splits long messages (in utf8) without knowledge of utf8-encoded
codepoints boundary, so some messages are shown as `[2.0K blob data]` because
it starts with non-utf8 bytes.
Also journalctl ellipsis can cut line in the middle of utf8-encoded codepoint.
Env: CentOS 7.0, systemd 208, en_US.UTF-8 locale.
Sample from `journalctl -f -u ...` (contains both issues):
<span class="quote">> Mar 02 18:32:01 localhost systemd-docker[19345]: 509146375 [http-nio-8983-exec-98] INFO org.apache.solr.core.SolrCore – [core0] webapp=/solr path=/select params={facet=true&sort=pubdate+desc&fl=&facet.query=((...�о+продук
> Mar 02 18:32:01 localhost systemd-docker[19345]: [293B blob data]
> Mar 02 18:32:01 localhost systemd-docker[19345]: [293B blob data]</span ></pre>
</div>
</p>
<hr>
<span>You are receiving this mail because:</span>
<ul>
<li>You are the QA Contact for the bug.</li>
<li>You are the assignee for the bug.</li>
</ul>
</body>
</html>