[patch] filter invalid utf-8 characters from volume labels
David Zeuthen
david at fubar.dk
Thu Aug 5 13:24:44 PDT 2004
On Thu, 2004-08-05 at 15:54 -0400, Joe Shaw wrote:
> On Thu, 2004-08-05 at 21:46 +0200, Kay Sievers wrote:
> > Do I need a licence now? :)
>
> Yes. You can send your check to the following address...
>
> > Today I've found my first user of this feature. I just got a DVD with a
> > 'ä' character, stupidly encoded as ISO8859-1 (in 16-bit values):
> >
> > http://vrfy.org/projects/hal/invalid-unicode.png
>
> Not bad, although it'd be nice if it actually converted it. But like
> Sjoerd said, it's basically a guess.
>
> How do others feel about an encoding setting that we always try to
> convert from? It'd probably be ISO-8859-15 (Latin-9) for the majority of
> distros out there, but people could set it to whatever was most
> appropriate for them. Hrmmmm.
>
Sure, when we can't determine the encoding it's sane to fall back to
something. For UDF filesystems specifically, though, we may be in luck:
the spec
http://www.osta.org/specs/pdf/udf201.pdf
says that the disc itself records how its strings are encoded. I got the
link from this mail
http://mail.nl.linux.org/linux-utf8/2002-10/msg00063.html
when I Googled for it. I haven't looked at it in much detail yet.
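
A rough sketch of the two ideas discussed here: a configurable fallback
for labels whose encoding is unknown, and reading the encoding marker
that a UDF string carries. This is illustrative Python, not HAL code.
The function names are hypothetical, and the UDF details assumed below
(a leading compression ID byte of 8 or 16, and a length byte at the end
of a fixed-size dstring field) are one reading of the OSTA CS0 scheme
that should be checked against the spec before being relied on.

```python
def decode_label(raw: bytes, fallback: str = "iso-8859-15") -> str:
    """Decode a label of unknown encoding (hypothetical helper).

    Try strict UTF-8 first; if that fails, convert from the configured
    fallback encoding. The fallback is only a guess.
    """
    try:
        return raw.decode("utf-8")
    except UnicodeDecodeError:
        return raw.decode(fallback, errors="replace")


def decode_cs0(raw: bytes) -> str:
    """Decode an OSTA CS0 string: compression ID byte, then characters.

    Assumption: ID 8 means one byte per character, ID 16 means two
    bytes per character in big-endian order.
    """
    if not raw:
        return ""
    comp_id, payload = raw[0], raw[1:]
    if comp_id == 8:
        # Each byte is a code point in 0..255, which is exactly what
        # Latin-1 decoding produces.
        return payload.decode("latin-1")
    if comp_id == 16:
        return payload.decode("utf-16-be", errors="replace")
    raise ValueError(f"unknown CS0 compression ID: {comp_id}")


def decode_dstring(field: bytes) -> str:
    """Decode a fixed-size dstring field such as a volume identifier.

    Assumption: the last byte of the field holds the number of bytes
    in use, counting the compression ID byte.
    """
    if not field:
        return ""
    used = field[-1]
    return decode_cs0(field[:used])
```

With this, a label stored as 16-bit values whose code points happen to
be ISO8859-1 characters decodes cleanly, since the first 256 Unicode
code points match ISO8859-1; no guessing is needed in that case.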
Cheers,
David
_______________________________________________
hal mailing list
hal at freedesktop.org
http://freedesktop.org/mailman/listinfo/hal