<div dir="auto"><div>Hi Andrei,<br><br><div class="gmail_quote"><div dir="ltr" class="gmail_attr">On Thu, Nov 18, 2021, 12:13 AM Andrei Borzenkov <<a href="mailto:arvidjaar@gmail.com">arvidjaar@gmail.com</a>> wrote:<br></div><blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex">On 18.11.2021 03:20, Brian Hutchinson wrote:<br>
> Yet another update, I was able to get it working .. but feel like it is a<br>
> hack so comments welcome ... see below:<br>
> <br>
> On Wed, Nov 17, 2021 at 12:26 AM Brian Hutchinson <<a href="mailto:b.hutchman@gmail.com" target="_blank" rel="noreferrer">b.hutchman@gmail.com</a>><br>
> wrote:<br>
> <br>
>> Update below<br>
>><br>
>> On Tue, Nov 16, 2021 at 2:27 PM Brian Hutchinson <<a href="mailto:b.hutchman@gmail.com" target="_blank" rel="noreferrer">b.hutchman@gmail.com</a>><br>
>> wrote:<br>
>><br>
>>> Hi Mikulėnas,<br>
>>><br>
>>> On Tue, Nov 16, 2021, 3:12 AM Mantas Mikulėnas <<a href="mailto:grawity@gmail.com" target="_blank" rel="noreferrer">grawity@gmail.com</a>> wrote:<br>
>>><br>
>>>> Most of this looks like it could be done with systemd-networkd to create<br>
>>>> a bond .netdev, with a small oneshot service for i2c. (What's the exact<br>
>>>> criteria for when it should be run? Does it depend on bond0 being there,<br>
>>>> does it need to be last, etc?)<br>
>>>><br>
>>><br>
>>> It can be last in the startup chain I guess, don't know what other<br>
>>> systemd things that might need the network to be up before the last unit<br>
>>> file runs.<br>
>>><br>
>>> I start linuxptp too so I would have the unit file that starts ptp4l<br>
>>> start after the bond was created etc.<br>
>>><br>
>>> Same thing for the i2c command to enable the switch.<br>
>>><br>
>>> Regards,<br>
>>><br>
>>> Brian<br>
>>><br>
>>><br>
>>>> On Tue, Nov 16, 2021, 02:58 Brian Hutchinson <<a href="mailto:b.hutchman@gmail.com" target="_blank" rel="noreferrer">b.hutchman@gmail.com</a>><br>
>>>> wrote:<br>
>>>><br>
>>>>> Hi,<br>
>>>>><br>
>>>>> I'm on a IMX8 platform and have a Microchip KSZ9567 Ethernet switch. I<br>
>>>>> can use IP commands to manually bring lan1 and lan2 interfaces up and then<br>
>>>>> create a redundant/failover bond ... but I'm having difficulty figuring out<br>
>>>>> how to do this the "systemd" way.<br>
>>>>><br>
>>>>> My first attempt was to just have systemd run a script of all the<br>
>>>>> commands I do manually but during system startup there appears to be race<br>
>>>>> conditions so I have to set my service type to "Idle" and sometimes even<br>
>>>>> that doesn't work. So I want to exploit any systemd support for DSA and<br>
>>>>> bonding.<br>
>>>>><br>
>>>>> Here is script my manual steps which is what I want systemd to<br>
>>>>> ultimately do:<br>
>>>>><br>
>>>>> #!/bin/bash<br>
>>>>><br>
>>>>> # Create a redundant bond between ksz9567 DSA lan1 and lan2 interfaces<br>
>>>>><br>
>>>>> # Load bonding kernel module<br>
>>>>> modprobe bonding<br>
>>>>><br>
>>>>> # Bring up CPU interface (cpu to switch port 7 - the RGMII link)<br>
>>>>> ip link set eth0 up<br>
>>>>><br>
>>>>> # Create a bond<br>
>>>>> echo +bond0 > /sys/class/net/bonding_masters<br>
>>>>><br>
>>>>> # Set mode to active-backup (redundancy failover)<br>
>>>>> echo active-backup > /sys/class/net/bond0/bonding/mode<br>
>>>>><br>
>>>>> # Set time it takes (in ms) for slave to move when a link goes down<br>
>>>>> echo 1000 > /sys/class/net/bond0/bonding/miimon<br>
>>>>><br>
>>>>> # Add slaves to bond<br>
>>>>><br>
>>>>> echo +lan1 > /sys/class/net/bond0/bonding/slaves<br>
>>>>> echo +lan2 > /sys/class/net/bond0/bonding/slaves<br>
>>>>><br>
>>>>> # Set IP and netmask of the bond<br>
>>>>> ip addr add <a href="http://192.168.0.4/24" rel="noreferrer noreferrer" target="_blank">192.168.0.4/24</a> dev bond0<br>
>>>>><br>
>>>>> # And bring bond up. Pings and network connectivity should work now<br>
>>>>> ip link set bond0 up<br>
>>>>><br>
>>>>> # For a board that doesn't have Ethernet switch hardware strapped to<br>
>>>>> enable at boot .. enable it now<br>
>>>>> i2cset -f -y 0 0x5f 0x03 0x00 0x01 i<br>
>>>>><br>
>>>>> Thanks for any information, pointers etc.<br>
>>>>><br>
>>>>> Regards,<br>
>>>>><br>
>>>>> Brian<br>
>>>>><br>
>>>><br>
>> So not sure I'm doing this right. eth0 needs to be up before lan1 and<br>
>> lan2 can be added to the bond. Not quite sure how to do that with systemd<br>
>> but I made the following files and see some progress but ping doesn't work<br>
>> so appears I have no network connectivity:<br>
>><br>
>> root@imx8mmevk:/etc/systemd/network# cat 10-bond1.netdev<br>
>> [NetDev]<br>
>> Name=bond1<br>
>> Kind=bond<br>
>><br>
>> [Bond]<br>
>> Mode=active-backup<br>
>> PrimaryReselectPolicy=failure<br>
>> MIIMonitorSec=2s<br>
>><br>
>> root@imx8mmevk:/etc/systemd/network# cat 10-bond1.network<br>
>> [Match]<br>
>> Name=bond1<br>
>><br>
>> [Network]<br>
>> Address=<a href="http://192.168.0.4/24" rel="noreferrer noreferrer" target="_blank">192.168.0.4/24</a><br>
>><br>
>> root@imx8mmevk:/etc/systemd/network# cat 20-lan1.network<br>
>> [Match]<br>
>> Name=lan1<br>
>><br>
>> [Network]<br>
>> Bond=bond1<br>
>> PrimarySlave=true<br>
>><br>
>> root@imx8mmevk:/etc/systemd/network# cat 30-lan2.network<br>
>><br>
>> [Match]<br>
>> Name=lan2<br>
>><br>
>> [Network]<br>
>> Bond=bond<br>
>><br>
>> ip link list<br>
>> 1: lo: <LOOPBACK,UP,LOWER_UP> mtu 65536 qdisc noqueue state UNKNOWN mode<br>
>> DEFAULT group default qlen 1000<br>
>> link/loopback 00:00:00:00:00:00 brd 00:00:00:00:00:00<br>
>> 2: eth0: <BROADCAST,MULTICAST,UP,LOWER_UP> mtu 1506 qdisc mq state UP mode<br>
>> DEFAULT group default qlen 1000<br>
>> link/ether f0:1f:af:6b:b2:17 brd ff:ff:ff:ff:ff:ff<br>
>> 3: lan1@eth0: <BROADCAST,MULTICAST> mtu 1500 qdisc noop state DOWN mode<br>
>> DEFAULT group default qlen 1000<br>
>> link/ether f0:1f:af:6b:b2:17 brd ff:ff:ff:ff:ff:ff<br>
>> 4: lan2@eth0: <BROADCAST,MULTICAST> mtu 1500 qdisc noop state DOWN mode<br>
>> DEFAULT group default qlen 1000<br>
>> link/ether f0:1f:af:6b:b2:17 brd ff:ff:ff:ff:ff:ff<br>
>> 5: bond1: <NO-CARRIER,BROADCAST,MULTICAST,MASTER,UP> mtu 1500 qdisc<br>
>> noqueue state DOWN mode DEFAULT group default qlen 1000<br>
>> link/ether be:87:0a:9b:13:03 brd ff:ff:ff:ff:ff:ff<br>
>><br>
>> cat /proc/net/bonding/bond1<br>
>> Ethernet Channel Bonding Driver: v5.10.69<br>
>><br>
>> Bonding Mode: fault-tolerance (active-backup)<br>
>> Primary Slave: None<br>
>> Currently Active Slave: None<br>
>> MII Status: down<br>
>> MII Polling Interval (ms): 2000<br>
>> Up Delay (ms): 0<br>
>> Down Delay (ms): 0<br>
>> Peer Notification Delay (ms): 0<br>
>><br>
>> At this point there should be info on lan1 and lan2 status but don't see<br>
>> it.<br>
>><br>
>> ... so of course I can't ping.<br>
>><br>
>> Next I did systemctl restart systemd-networkd and saw the following:<br>
>><br>
>> systemctl restart systemd-networkd<br>
>> root@imx8mmevk:~# [ 33.816313] device eth0 entered promiscuous mode<br>
>> [ 33.821026] audit: type=1700 audit(1636550966.339:2): dev=eth0 prom=256<br>
>> old_prom=0 auid=4294967295 uid=995 gid=994 ses=4294967295<br>
>> [ 33.867164] ksz9477-switch 0-005f lan2: configuring for phy/gmii link<br>
>> mode<br>
>> [ 33.875066] bond1: (slave lan2): Enslaving as a backup interface with a<br>
>> down link<br>
>> [ 33.919055] ksz9477-switch 0-005f lan1: configuring for phy/gmii link<br>
>> mode<br>
>> [ 33.926683] bond1: (slave lan1): Enslaving as a backup interface with a<br>
>> down link<br>
>> [ 38.066148] ksz9477-switch 0-005f lan1: Link is Up - 1Gbps/Full - flow<br>
>> control rx/tx<br>
>> [ 39.472022] bond1: (slave lan1): link status definitely up, 1000 Mbps<br>
>> full duplex<br>
>> [ 39.479537] bond1: (slave lan1): making interface the new active one<br>
>> [ 39.486154] bond1: active interface up!<br>
>> [ 39.490071] IPv6: ADDRCONF(NETDEV_CHANGE): bond1: link becomes ready<br>
>><br>
>> At which point I can ping. Feels like there still might be some kind of<br>
>> race condition as things won't work unless I manually issue systemctl<br>
>> restart systemd-networkd command after logging in.<br>
>><br>
>> In kernel dmesg logs I see:<br>
>> [ 4.165940] bond1: (slave lan2): Opening slave failed<br>
>> [ 4.196834] bond1: (slave lan1): Opening slave failed<br>
>> [ 4.315588] Generic PHY fixed-0:00: attached PHY driver [Generic PHY]<br>
>> (mii_bus:phy_addr=fixed-0:00, irq=POLL)<br>
>> [ 4.326181] fec 30be0000.ethernet eth0: Link is Up - 1Gbps/Full - flow<br>
>> control off<br>
>> [ 4.561000] IPv6: ADDRCONF(NETDEV_CHANGE): eth0: link becomes ready<br>
>><br>
>> Loaded: loaded (/lib/systemd/system/systemd-networkd.service; enabled;<br>
>> vendor preset: enabled)<br>
>> Active: [[0;1;32mactive (running)[[0m since Sun 2020-09-20 10:43:59<br>
>> UTC; 1 years 1 months ago<br>
>> TriggeredBy: [[0;1;32m*[[0m systemd-networkd.socket<br>
>> Docs: man:systemd-networkd.service(8)<br>
>> Main PID: 251 (systemd-network)<br>
>> Status: "Processing requests..."<br>
>> Tasks: 1 (limit: 1574)<br>
>> Memory: 1.5M<br>
>> CGroup: /system.slice/systemd-networkd.service<br>
>> `-251 /lib/systemd/systemd-networkd<br>
>><br>
>> Nov 10 13:28:56 imx8mmevk systemd-networkd[251]: [[0;1;31m[[0;1;39m[[0;1;31mlan2:<br>
>> Could not join netdev: Network is down[[0m<br>
>> Nov 10 13:28:56 imx8mmevk systemd-networkd[251]: [[0;1;38;5;185m[[0;1;39m[[0;1;38;5;185mlan2:<br>
>> Failed[[0m<br>
>> Nov 10 13:28:56 imx8mmevk systemd-networkd[251]: [[0;1;31m[[0;1;39m[[0;1;31mlan1:<br>
>> Could not join netdev: Network is down[[0m<br>
>> Nov 10 13:28:56 imx8mmevk systemd-networkd[251]: [[0;1;38;5;185m[[0;1;39m[[0;1;38;5;185mlan1:<br>
>> Failed[[0m<br>
>> Nov 10 13:28:56 imx8mmevk systemd-networkd[251]: eth0: IPv6 successfully<br>
>> enabled<br>
>> Nov 10 13:28:56 imx8mmevk systemd-networkd[251]: eth0: Link UP<br>
>> Nov 10 13:28:57 imx8mmevk systemd-networkd[251]: eth0: Gained carrier<br>
>> Nov 10 13:28:57 imx8mmevk systemd-networkd[251]: bond1: IPv6 successfully<br>
>> enabled<br>
>> Nov 10 13:28:57 imx8mmevk systemd-networkd[251]: bond1: Link UP<br>
>> Nov 10 13:28:58 imx8mmevk systemd-networkd[251]: eth0: Gained IPv6LL<br>
>><br>
>> ... so looks like the bond stuff is trying to happen before eth0 (my DSA<br>
>> HOST/CPU interface to switch) is up. How can I make eth0 up with systemd?<br>
>> eth0 just needs to be up ... no IP etc., and the bond1 gets the IP etc.<br>
>><br>
>> For now I'm doing the i2c command to enable my switch in u-boot but still<br>
>> need to incorporate that into systemd somehow.<br>
>><br>
>> Any ideas as to what I'm doing wrong? I think at a minimum I need to<br>
>> bring eth0 up before the bonding stuff happens but not quite sure what that<br>
>> would look like using systemd.<br>
>><br>
>> Regards,<br>
>><br>
>> Brian<br>
>><br>
>><br>
> I tried and tried to get eth0 to come up before the bond was brought up. I<br>
> had everything named in lexical order but didn't appear to matter. I added<br>
> a eth0.network file and in it specified ActivationPolicy=always-up and<br>
> other things but could not get eth0 to come up.<br>
> <br>
> It was obvious the bond was trying to be established before eth0 was up and<br>
> since this is using DSA that just won't work. Dmesg logs would show slaves<br>
> being added before eth0 was up as in copy/paste from previous email.<br>
> <br>
> So I added a service to bring eth0 up:<br>
> <br>
> cat eth0-up.service<br>
> [Unit]<br>
> Description=Bring eth0 up before bonding<br>
> Before=network-pre.target<br>
> Wants=network-pre.target<br>
> <br>
> [Service]<br>
> ExecStart=/usr/local/bin/eth0-up.sh<br>
> RemainAfterExit=yes<br>
> <br>
> [Install]<br>
> WantedBy=multi-user.target<br>
> <br>
> cat /usr/local/bin/eth0-up.sh<br>
> #!/bin/bash<br>
> ip link set eth0 up<br>
> <br>
> ... I feel like this is a hack, that systemd can probably do this but<br>
> either I can't figure out how or there is a problem with the code.<br>
> <br>
<br>
How lan1 and lan2 are related to eth0? Your script never creates or sets up them.<br></blockquote></div></div><div dir="auto"><br></div><div dir="auto">... its a DSA (Distributed Switch Architecture) thing. Port 7 of my switch is a RGMII CPU/host link to my IMX8 which is eth0. When using DSA driver to bust switch up into individual ports (like individual NIC cards), eth0 cannot be used for anything (like assigned an IP address etc.) but the interface has to be up for DSA driver and also to add slaves to bond.</div><div dir="auto"><br></div><div dir="auto">Regards,</div><div dir="auto"><br></div><div dir="auto">Brian</div><div dir="auto"><div class="gmail_quote"><blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex"></blockquote></div></div></div>