[systemd-devel] Systemd Journald and audit logging causing journal issues
Brad Zynda
bradley.v.zynda at nasa.gov
Fri Dec 1 14:05:19 UTC 2017
Hey Lennart,
Just wanted to get your thoughts on this before we break the link..
also can you provide proper direction or a howto for breaking the link
between auditd and journald?
Thanks,
Brad
On Friday, December 1, 2017 8:17:58 AM EST Brad Zynda wrote:
> Hey Steve,
>
> Just wanted to follow up on this and say we are still seeing services
> across the board that have:
>
> Warning: Journal has been rotated since unit was started. Log output is
> incomplete or unavailable
>
> basically created a script to check all unit file services/targets and
> grep status -l for Journal, it is effecting a large range of
> service.target's, service.service's and service.socket's
>
> If we restart the service or reboot we no longer see those messages
> about journal and everything appears to run as expected.
I have never looked at journald code and have no idea how it works or
why it
cares about audit events. My advice last email was to break the link if its
causing problems.
-Steve
> On 10/19/2017 04:13 PM, Steve Grubb wrote:
>> On Thursday, October 19, 2017 1:08:22 PM EDT Brad Zynda wrote:
>>>>> grep perm_mod /etc/audit/audit.rules
>>>>> -a always,exit -F arch=b64 -S chmod -S fchmod -S fchmodat -F
auid>=1000
>>>>> -F auid!=4294967295 -k perm_mod
>>>>> -a always,exit -F arch=b32 -S chmod -S fchmod -S fchmodat -F
auid>=1000
>>>>> -F auid!=4294967295 -k perm_mod
>>>>> -a always,exit -F arch=b64 -S chown -S fchown -S fchownat -S lchown -F
>>>>> auid>=1000 -F auid!=4294967295 -k perm_mod
>>>>> -a always,exit -F arch=b32 -S chown -S fchown -S fchownat -S lchown -F
>>>>> auid>=1000 -F auid!=4294967295 -k perm_mod
>>>>> -a always,exit -F arch=b64 -S setxattr -S lsetxattr -S fsetxattr -S
>>>>> removexattr -S lremovexattr -S fremovexattr -F auid>=1000 -F
>>>>> auid!=4294967295 -k perm_mod
>>>>> -a always,exit -F arch=b32 -S setxattr -S lsetxattr -S fsetxattr -S
>>>>> removexattr -S lremovexattr -S fremovexattr -F auid>=1000 -F
>>>>> auid!=4294967295 -k perm_mod
>>>>>
>>>>> grep delete /etc/audit/audit.rules
>>>>> -a always,exit -F arch=b64 -S unlink -S unlinkat -S rename -S renameat
>>>>> -F auid>=1000 -F auid!=4294967295 -k delete
>>>>> -a always,exit -F arch=b32 -S unlink -S unlinkat -S rename -S renameat
>>>>> -F auid>=1000 -F auid!=4294967295 -k delete
>>>>> -a always,exit -F arch=b64 -S init_module -S delete_module -k modules
>>>>> -a always,exit -F arch=b32 -S init_module -S delete_module -k modules
>>>>
>>>> These rules are well formed. Meaning no obvious holes that would cause
>>>> unintended events. The other ausearch/aureport commands I gave you
>>>> should
>>>> show you what is causing the events and to which files. This
information
>>>> may be used to create some kind of "never" rule that limits what gets
>>>> audited. If you do create some exclusion rule, it needs to be above the
>>>> perm_mod rules because audit is a "first match wins" system.
>>>>
>>>> -Steve
>>>>
>>>> -Steve
>>>
>>> Here is a peak report (at least in the last 24 hours) didnt include the
>>
>>> file summaries as that would make this email HUGE:
>> Well, the idea is to look for something that's getting hit a lot. What it
>> sounds like is that things are getting deleted and modified quite a bit
>> all
>> over the place. Does the executable report show a pattern such as one
>> application doing a lot? For example, with the rule you have, doing a yum
>> update will delete a whole lot of stuff and modify a whole lot of stuff.
>>
>> Its possible to put exceptions in the rules so that one program does not
>> flood the logs. But looking at the quantities below, the audit system can
>> easily handle that.
>>
>> Its also possible to exclude directories from auditing if the pattern is
>> that you have a daemon receiving and modifying files and then deleting
>> them. You should look at the executables & files and see if you can do
>> with auditing what they are doing because its not interesting.
>>
>> If this is causing you problems on the journald side where its causing
>> other tasks to fail. Then I'd break the link between auditd and journald.
>> The amount of event you show is highish but well within range of what
>> auditd can do. Just make sure flush is set to incremental_async and freq
>> is 100 or 200. You should be OK.
>>
>> -Steve
>>
>>> Key Summary Report
>>> ===========================
>>> total key
>>> ===========================
>>> 8170 perm_mod
>>> 5390 delete
>>> 1073 access
>>> 56 time-change
>>> 26 session
>>> 12 privileged
>>> 7 logins
>>>
>>>
>>> Syscall Summary Report
>>> ==========================
>>> total syscall
>>> ==========================
>>> 4250 fchmodat
>>> 1613 chmod
>>> 831 fchmod
>>> 521 fchown
>>> 97 chown
>>> 12 setxattr
>>>
>>>
>>> Syscall Summary Report
>>> ==========================
>>> total syscall
>>> ==========================
>>> 2887 unlink
>>> 2189 rename
>>> 186 unlinkat
>>>
>>>
>>> so from the list the top 2 in each perm_mod and delete from the above
>>> list seem to be the culprits..
More information about the systemd-devel
mailing list