[systemd-bugs] [Bug 66664] journalctl boot reporting inconsistent

bugzilla-daemon at freedesktop.org bugzilla-daemon at freedesktop.org
Sun Jul 14 08:16:01 PDT 2013


https://bugs.freedesktop.org/show_bug.cgi?id=66664

--- Comment #15 from Zbigniew Jedrzejewski-Szmek <zbyszek at in.waw.pl> ---
(In reply to comment #14)
> (In reply to comment #12)
> A few observations/comments:
> 
>  1. Obvious: When the per-socket sockopt capability is added, you'll
> probably  want to set the qlen to a value larger than 10. :) There's nothing
> especially unusual about my system, so I would have to guess that this
> problem is happening with qlen=10 for a fairly large fraction of users, but
> simply has not been noticed. (I didn't notice it myself until I began
> looking for a specific boot message of interest and noticed that it
> sometimes appeared and sometimes did not.)
> 
> 2.  I fully understand (as you point out) that here is no guarantee that
> journald can always keep up with message generation rate with probability =
> 1 in all situations, regardless of buffer sizes and queue length choices.
> It's not possible to close the source->sink loop without incurring a
> priority inversion between source and sink when the source rate becomes very
> high. I appreciate that and am all too familiar with it. But, I would opine,
> that it ought to be an explicit design requirement on journald going forward
> that it report with "very high" probability when messages have been dropped,
> rather than dropping them silently. That ought to be possible using mutexes
> without getting into priority inversion issues. IMO as an ordinary user,
> silently dropping log messages should be considered grossly unacceptable
> except under the most extreme message loading conditions (e.g. DoS attack
> :)). Silent drops should have a vanishingly small probability of occurrence
> during ordinary events such as boot-up.
> 
> IMO, if you don't adopt this as a basic design principle, you're shooting
> yourself in the foot, PR-wise, because sooner or later people are going to
> start whining about journald logging reliability vis a vis "good old sysV". 
A major source of slowdown are various /proc lookups. There are some patches on
the ML to reduce their number [1]... but also some patches to remove the need
for them [2]. We can't really allow PID 1 to pause waiting for journald, so
making journald more efficient is the way to go.

[1] http://lists.freedesktop.org/archives/systemd-devel/2013-June/011591.html
[2] http://people.apache.org/~jkaluza/patches/linux/

> > If you're using an initramfs with systemd, I think you'd need to put the
> > file also in the initramfs.
> >
> 
> I am using an initramfs, but tried it first without putting the new config
> file into it, and it worked. Curious as to why, if it "shouldn't have", but
> not going to look it in the mouth. :)
Thinking about it a bit more, I come to the conclusion that systemd must reopen
the sockets when systemd is executed, so please ignore the part about
initramfs.

> One request, if you have time: Could you possibly post some example output
> from "journalctl -a" from some (any) systems that you might have access to?
> I'd like to run them thru my little analysis tool and see whether the kinds
> of drops I was seeing (with qlen=10) are occurring. This will also have
> benefit for you as developers to perhaps get an idea of how often this sort
> of stuff is happening.
> 
> Thanks for your time and assistance.
Maybe you could post the tool. I'm not too keen on posting my logs publicly,
since like everybody I type my password instead of the login every once in a
while... I can post some logs from my machine at work later on.

-- 
You are receiving this mail because:
You are the QA Contact for the bug.
You are the assignee for the bug.
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.freedesktop.org/archives/systemd-bugs/attachments/20130714/2d02f09e/attachment.html>


More information about the systemd-bugs mailing list