[patch] filter invalid utf-8 characters from volume labels

David Zeuthen david at fubar.dk
Thu Aug 5 13:24:44 PDT 2004


On Thu, 2004-08-05 at 15:54 -0400, Joe Shaw wrote:
> On Thu, 2004-08-05 at 21:46 +0200, Kay Sievers wrote:
> > Do I need a licence now? :)
> 
> Yes.  You can send your check to the following address...
> 
> > Today I found my first user of this feature. I just got a DVD with an
> > 'ä' character, stupidly encoded as ISO-8859-1 (in 16-bit values):
> > 
> >   http://vrfy.org/projects/hal/invalid-unicode.png
> 
> Not bad, although it'd be nice if it actually converted it.  But like
> Sjoerd said, it's basically a guess.
> 
> How do others feel about an encoding setting that we always try to
> convert from?  It'd probably be Latin-15 for the majority of distros out
> there, but people could set it to whatever was most appropriate for
> them.  Hrmmmm.
> 
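
A rough sketch of the fallback idea above, for illustration only: try
UTF-8 first, and only when that fails decode with a configurable
encoding. The function name `decode_label` and the `fallback` parameter
are hypothetical and not part of HAL; the default of ISO-8859-1 is just a
placeholder for whatever a distro would pick.

```python
def decode_label(raw: bytes, fallback: str = "iso-8859-1") -> str:
    """Decode a raw volume label, preferring UTF-8.

    `fallback` is a hypothetical setting: the encoding we guess when the
    label is not valid UTF-8. It remains a guess, not a detection.
    """
    try:
        return raw.decode("utf-8")
    except UnicodeDecodeError:
        # Not valid UTF-8: decode with the configured guess and replace
        # anything that encoding cannot map.
        return raw.decode(fallback, errors="replace")
```

A label that is already valid UTF-8 passes through unchanged, so the
setting only affects labels that would otherwise be filtered.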

Sure, when dealing with stuff where we can't determine the encoding it's
sane to fall back to something. However, specifically for UDF
filesystems, we may be in luck as the spec

 http://www.osta.org/specs/pdf/udf201.pdf

says that the disc itself records how things are encoded. I got the link
from this mail

 http://mail.nl.linux.org/linux-utf8/2002-10/msg00063.html

when I Googled for it. I haven't looked too much at it.
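
For illustration, here is a minimal sketch of how such a self-describing
string could be decoded. It assumes the "OSTA compressed Unicode" layout
as I understand it from the UDF spec: the first byte of a fixed-size
dstring field is a compression ID (8 for one byte per character, 16 for
two bytes per character, big-endian) and the last byte holds the number
of bytes in use. The function name `decode_udf_dstring` is hypothetical,
and the layout should be checked against the spec before relying on it.

```python
def decode_udf_dstring(field: bytes) -> str:
    """Decode a fixed-size UDF dstring field (assumed layout, see above)."""
    if not field:
        return ""
    used = field[-1]  # bytes in use, counting the compression ID byte
    if used == 0:
        return ""     # empty string
    if used > len(field) - 1:
        raise ValueError("length byte exceeds field size")
    comp_id = field[0]
    payload = field[1:used]
    if comp_id == 8:
        # One byte per character; values are code points U+0000..U+00FF,
        # which is exactly what Latin-1 decoding yields.
        return payload.decode("latin-1")
    if comp_id == 16:
        # Two bytes per character, big-endian. UTF-16-BE is used here as
        # a close stand-in for 16-bit Unicode values.
        if len(payload) % 2:
            raise ValueError("odd payload length for 16-bit characters")
        return payload.decode("utf-16-be", errors="replace")
    raise ValueError(f"unsupported compression ID {comp_id}")
```

With this layout no guessing is needed for UDF labels: the compression ID
says how wide each character is, and the values are Unicode code points.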

Cheers,
David
_______________________________________________
hal mailing list
hal at freedesktop.org
http://freedesktop.org/mailman/listinfo/hal


