xwayland + radeon = consistent filesystem corruption Re: I'm the only one getting hard drive errors, right?
darxus at chaosreigns.com
darxus at chaosreigns.com
Tue Sep 4 19:37:47 PDT 2012
I reproduced the problem with chromium under X, not running wayland. Woo.
Just by loading up chromium and playing back a youtube video for a while.
(After doing a long smartctl test and fsck of the partition.)
[ 4043.097706] EXT4-fs error (device sda1): ext4_ext_search_left:1275: inode #21374007: comm flush-8:0: ix (10742) != EXT_FIRST_INDEX (0) (depth 1)!
[ 4043.097711] Aborting journal on device sda1-8.
[ 4043.097873] EXT4-fs (sda1): Remounting filesystem read-only
[ 4043.097883] EXT4-fs error (device sda1) in ext4_da_writepages:3033: IO failure
[ 4043.098011] EXT4-fs (sda1): ext4_da_writepages: jbd2_start: 962 pages, ino 21374007; err -30
Same inode again. Which I guess I didn't mention on this list turned out
to be a file in chromium's cache:
$ sudo find . -inum 21374007 -print
./home/darxus/.cache/chromium/Default/Cache/data_3
Still curious... wtf... but at least it's not wayland. I'll continue this
on the ext4 list.
On 09/03, darxus at chaosreigns.com wrote:
> [732715.730069] EXT4-fs error (device sda1): ext4_ext_search_left:1275: inode #21374007: comm flush-8:0: ix (10742) != EXT_FIRST_INDEX (0) (depth 1)!
> [732715.730084] Aborting journal on device sda1-8.
> [732715.730269] EXT4-fs (sda1): Remounting filesystem read-only
> [732715.730278] EXT4-fs error (device sda1) in ext4_da_writepages:3033: IO failure
> [732715.730440] EXT4-fs (sda1): ext4_da_writepages: jbd2_start: 589 pages, ino 21374007; err -30
>
> This hasn't happened in three months. The last time I saw it was the
> last time I ran xwayland. While correlation does not imply causation,
> and it *could* be a coincidence, I'm really not willing to entertain
> that possibility as realistic.
>
> This time I used RAOF's X DDX (and updated the xwayland instructions
> for Radeon / ATI to use it). Last time I was using timon37's DDX. I don't
> know if they share code. I don't know if they're at fault.
>
> I was using the DRM backend. I ran "make install" as a non-root user, and
> then set weston-launch as owned by root and +s, and ran weston-launch. I
> did not have xserver set suid root.
>
> I was playing a video on youtube in chromium vix xwayland when X crashed
> (taking firefox, the only other X client I was running, out with it.)
> And was chatting with folks in IRC about X's failure to respawn when I
> realized my filesystem had been remounted readonly. Then dug the above
> output out of dmesg.
>
> I was working on updating my "state of wayland" page to say that wayland
> was looking pretty usable now :/
>
> fsck said lots of scary things after rebooting, I had to manually confirm
> it wanted to do many of them. I have photos if anyone is interested
> in details. Lots of "Free blocks count wrong for group... Fix(y)?" and
> "Free inodes count wrong for group... Fix(y)?" A "Block bitmap
> differences..."
>
> I don't know for sure if I lost anything, but have not yet
> seen evidence that I did. I have pretty good backups.
>
>
> 12:15 < pq> either xwayland triggers some fs bug, or triggers a gfx driver
> bug, which then scribbles over kernel memory - or faulty hw. Can't know.
> 12:16 < soreau> either way, it's a fairly serious problem
>
> I agree with this assessment.
>
>
> So far, I think it has only affected the filesystem I was using at the time
> (I basically only use one partition per linux install). So I may be
> willing to do more testing on a dedicated testing partition.
>
> This graphics card needed to go on ubuntu's grub gfxpayload blacklist,
> because for some reason retaining the graphics mode from grub to X breaks
> on some graphics cards, including this one. Seems unlikely to be directly
> related, just trying to provide all possibly relevant info I have. The bug
> for this was: https://bugs.launchpad.net/ubuntu/+source/linux/+bug/971204
>
> I was running an up to date ubuntu oneric install.
>
> lspci output:
>
> 05:00.0 VGA compatible controller: Advanced Micro Devices [AMD] nee ATI Barts XT [Radeon HD 6800 Series] (prog-if 00 [VGA controller])
> Subsystem: Hightech Information System Ltd. Device 2010
> Control: I/O+ Mem+ BusMaster+ SpecCycle- MemWINV- VGASnoop- ParErr- Stepping- SERR+ FastB2B- DisINTx+
> Status: Cap+ 66MHz- UDF- FastB2B- ParErr- DEVSEL=fast >TAbort- <TAbort- <MAbort- >SERR- <PERR- INTx-
> Latency: 0, Cache Line Size: 64 bytes
> Interrupt: pin A routed to IRQ 46
> Region 0: Memory at d0000000 (64-bit, prefetchable) [size=256M]
> Region 2: Memory at fbfc0000 (64-bit, non-prefetchable) [size=128K]
> Region 4: I/O ports at e000 [size=256]
> Expansion ROM at fbfa0000 [disabled] [size=128K]
> Capabilities: <access denied>
> Kernel driver in use: radeon
> Kernel modules: radeon
>
>
> I had the latest get masters of everything as of 2012-09-03 08:36 -0400.
> weston commit 8538b22ff4ad8879b4e3288be053508167562859
> wayland commit 2be6e0ed142bac669398a9ad26d336666fa53216
> raof's xf86-video-ati xwayland branch commit 8dc07e63eaf8909f7046bf746a119ec749352441
>
> On 05/30, darxus at chaosreigns.com wrote:
> > I was just playing with weston master under X, and timon37's radeon ddx,
> > weston crashed, when I tried to delete the wayland lock I got an error
> > that the filesystem was readonly, dmesg said:
> >
> > [ 496.347230] EXT4-fs error (device sda1): ext4_ext_search_left:1275: inode #21374007: comm flush-8:0:ix (10742) != EXT_FIRST_INDEX (0) (depth 1)!
> > [ 496.347236] Aborting journal on device sda1-8.
> > [ 496.347383] EXT4-fs (sda1): Remounting filesystem read-only
> >
> > Which is pretty scary.
> >
> > I feel like this *might* have happened before when I was playing with
> > weston, but I definitely don't have enough information to suggest there is
> > any real correlation. Or knowledge of what exactly is going on here to
> > know if a correlation is even possible.
> >
> > I had xserver set suid root, out of habit from before I fixed the bug
> > complaining about not getting master, which seems like it might have made
> > something like this possible.
> >
> > I do have good backups.
>
> --
> "If you would be a real seeker after truth, it is necessary that at
> least once in your life you doubt, as far as possible, all things."
> - Rene Descartes
> http://www.ChaosReigns.com
> _______________________________________________
> wayland-devel mailing list
> wayland-devel at lists.freedesktop.org
> http://lists.freedesktop.org/mailman/listinfo/wayland-devel
>
--
"When we remember we are all mad, the mysteries of life disappear and
life stands explained." - Mark Twain
http://www.ChaosReigns.com
More information about the wayland-devel
mailing list