[Clipart] Roadmap

Jon Phillips jon at rejon.org
Sat Jul 30 13:41:04 PDT 2005


Whoa, this is a lot to read. Did you get enough feedback on the files
with hash problems? How is this problem? I feel like I have lost track
of it.

Jon


On Thu, 2005-07-21 at 13:12 -0400, Nathan Eady wrote:
> > > There are still a lot of images with HASH in them,
> > 
> > Ok, I checked through them and found that 3174 are effected.
> 
> Sounds about right.
> 
> > That is a lot of affected files!!!
> 
> Yes.
> 
> > We need to develop a strategy for dealing with this. Ok, 
> > Jonadab, could you advise us all as to the best way to conquer 
> > and divide this task.
> 
> My plan, such as it is, runs roughly as follows:
> 
> 1. I have one script (not sure if it's in CVS or not; I can
>    put it in tonight if it's not) that is designed to look
>    through all the .svg files in the current directory, find
>    ones that contain "HASH" in them, and for each one look
>    through upload.log and extract all the apparently-relevant
>    input log entries and write them to a file that's named
>    the same as the .svg file but with .log appended to the name.
>    Thus, if there's a broken file foo.svg, this script creates
>    foo.svg.log and writes in it all the relevant entries it can
>    find in the upload log.  I *think* this script works, at
>    least for the majority of cases, but there will be a few
>    cases where for some reason it cannot find the relevant
>    upload log entries, e.g. if the title and filename are
>    HASHsomething so that it doesn't know what original title
>    to look for in the log.  Those few can be left for now
>    until the majority are done, and then figured out by hand
>    subsequently; there won't be many, I think.
> 2. I have another script (not sure if this is in CVS either,
>    but again, if not, it can be) that is *mostly* done, that
>    is designed to read the .log files and trace the process
>    that upload_svg.cgi would have done with that input and
>    basically reproduce it, creating a repaired file, which
>    will have -repaired.svg appended to its name, to avoid
>    overwriting the original.  Thus, if there was foo.svg,
>    which was broken, and the first script created foo.svg.log,
>    this one creates foo.svg-repaired.svg or somesuch.
> 3. Someone will need to look over the repaired files created 
>    by the second script and verify that they seem okay.
> 4. Then the original, b0rkened files can be replaced with
>    the new repaired ones.
> 
> For files submitted anytime after 0.12 (i.e., any files submitted
> for 0.13, 0.14, 0.15, or 0.16 before I repaired the upload script),
> my plan is to re-process the incoming files, running the above
> steps over them before processing, so that we will re-do what was 
> done for those releases, only correctly.  Thus, release 0.16 will 
> be, like the last two releases, based on 0.12 but with new images
> added from all of the releases in the interim.  I hope that 0.16
> is the last release I have to do that way, as otherwise it's going 
> to get cumbersome reprocessing all those files every time :-)
> 
> Any images that were already corrupted with HASH keywords in
> the 0.12 release are another matter.  They may have to be 
> re-keyworded by someone.  However, the good news is, *only*
> keywords seemed to be impacted by the hash bug until more
> recently, so I don't think any of those are missing author
> or title information.
> 
> > Also, are you sure that the "hash bug" is squished? 
> 
> Yes, assuming everyone uses the version of Metadata.pm that is 
> in CVS, that particular bug is gone.  There may be other bugs,
> but that one is toast.
> 
> > Hopefully, we can fix these images once and then that will 
> > be that...
> 
> Yes, and then we can get on to worrying about the character set
> issues again...

Great!

> --
> This message was sent using 3wmail.
> Your fast free POP3 mail client at www.3wmail.com
> _______________________________________________
> clipart mailing list
> clipart at lists.freedesktop.org
> http://lists.freedesktop.org/mailman/listinfo/clipart
> 
-- 
Jon Phillips

San Francisco, CA
USA PH 510.499.0894
jon at rejon.org
http://www.rejon.org

MSN, AIM, Yahoo Chat: kidproto
Jabber Chat: rejon at gristle.org
IRC: rejon at irc.freenode.net

Inkscape (http://inkscape.org)
Open Clip Art Library (www.openclipart.org)




More information about the clipart mailing list