[systemd-devel] workaround for systemd-networkd-wait-online boot fail/delay on systems with bridge for v234? (fix @ systemd/issues/2154 requires v>242)

PGNet Dev pgnet.dev at gmail.com
Sun May 31 01:02:37 UTC 2020


 I've switched from distro networking stack (wicked) to systemd-network; generally, what a relief!

The distro (opensuse leap 15.1) has 'old' systemd -- v234

As packaged,

  'systemd-network.service' has "Also=systemd-networkd-wait-online.service".

&

  'systemd-networkd-wait-online.service' has "WantedBy=network-online.target"

A number of my services depend on 'network-online.target'.

All this^ is to say -- if 'network-online.target' is used, 'systemd-networkd-wait-online.service' gets launched.

My box ALSO has a bridge defined.

Which causes me to hit

	"systemd-networkd-wait-online fails with bridged interfaces"
	 https://github.com/systemd/systemd/issues/2154

on boot,

	[  151.868637] systemd-networkd-wait-online[1394]: Event loop failed: Connection timed out
	[  151.884034] systemd[1]: systemd-networkd-wait-online.service: Main process exited, code=exited, status=1/FAILURE
	[  151.885593] systemd[1]: Failed to start Wait for Network to be Configured.
	[  151.885714] systemd[1]: systemd-networkd-wait-online.service: Unit entered failed state.
	[  151.885741] systemd[1]: systemd-networkd-wait-online.service: Failed with result 'exit-code'.

which causes significant delay

	systemd-analyze critical-chain

		multi-user.target @2min 50.841s
        └─...
		  └─network-online.target @2min 19.876s
		    └─network.target @19.712s
		      └─systemd-networkd.service @19.268s +409ms
		        └─network-pre.target @19.249s
		          └─...

The solution @bug is -- upgrade to systemd v242.

On this distro, upstream refuses to upgrade systemd; apparently for them its 'non trivial' and they can't be bothered.  AND, they've a new 'stable' version coming out AnyDayNow(tm), Leap 15.2, for which apparently systemd will REMAIN at broken-like-this^^ v234.

So *my* solution has been to switch distros.  NOT just for this^^ reason, but it was the last straw ...
~ 600 machines have finished migrating to other distros with newer systemd installs that avoid this problem.

BUT -- I'm left with a few boxes that I can't move for awhile.

So ...

	IS there a backport of this^^ fix available for v234 that popped up in the meantime?

	If not, as is likely, is there a "safe" workaround for quieting the fail, and rm'ing the associated boot delay?  Is rm'ing either the "Also=" or "WantedBy=" a reasonable band-aid?

	Or, some other approach?



More information about the systemd-devel mailing list