<div dir="ltr">All sounds good to me, before we imported tons of files manually under the account Anonymous when there was this issue so we could keep track. But, as long as these imports are tagged in a special way so librarians could help go through them manually, I think that is the crucial part.<div>
<br></div><div>Cheerz on the housecleaning!<br><br></div><div>Jon</div></div><div class="gmail_extra"><br><br><div class="gmail_quote">On Tue, Mar 12, 2013 at 10:23 AM, Wolfgang Spraul <span dir="ltr"><<a href="mailto:wolfgang@fabricatorz.com" target="_blank">wolfgang@fabricatorz.com</a>></span> wrote:<br>
<blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex">OK I have continued some cleanup, roughly summing up:<br>
<br>
*) deleted about 500 .svg files that had pd_issue set in the<br>
database, but were still present in the filesystem (and publicly<br>
accessible). Also deleted about 4200 derivative .pngs of those<br>
pd_issue graphics<br>
<br>
*) imported about 500 old cch graphics into the database, and<br>
deleted about 1000 duplicates of those graphics (see earlier<br>
mail)<br>
<br>
*) renamed about 200 files with .SVG (capitalized) extension<br>
that didn't show up in the database. The database had a few<br>
(13) .SVG graphics which I also renamed to .svg. Going forward,<br>
I will try to disable .SVG (capitalized) or auto-lowercase for<br>
new uploads.<br>
<br>
Now I am looking at 6308 .svg files that show up in the filesystem,<br>
but not in the database. They belong to the following users:<br>
<br>
962 ArtFavor<br>
785 Anonymous<br>
176 inky2010<br>
88 rejon<br>
84 gustavorezende<br>
80 pianoBrad<br>
73 Gerald_G<br>
69 brandynne<br>
68 slanteigne<br>
66 Merlin2525<br>
62 jiangyi_99<br>
57 cibo00<br>
54 gsagri04<br>
52 boobaloo<br>
51 10binary<br>
plus another 856 users with less than 50 graphics, for a total of 6308.<br>
<br>
Any idea what that might be? I think I will roughly do this:<br>
<br>
1. make sure that all those users exist in the database, if not<br>
create them.<br>
2. check some graphics manually to understand whether they are old<br>
'problem' uploads, spam, non-pd, duplicates, etc.<br>
3. import all those 6308 into the database same as I did for the 'cch'<br>
user this morning.<br>
<br>
The problem with #3 is that unlike the cch import, this one will be<br>
harder to reverse once I did it. In the cch case, it's all listed<br>
under that specific user. But for these 6308 graphics, they are spread<br>
out among hundreds of users. So once I import them, they will be<br>
everywhere and if there are problems with those graphics, they would<br>
need to be flagged pd_issue etc. and then deleted again.<br>
<br>
Feedback?<br>
_______________________________________________<br>
clipart mailing list<br>
<a href="mailto:clipart@lists.freedesktop.org">clipart@lists.freedesktop.org</a><br>
<a href="http://lists.freedesktop.org/mailman/listinfo/clipart" target="_blank">http://lists.freedesktop.org/mailman/listinfo/clipart</a><br>
</blockquote></div><br><br clear="all"><div><br></div>-- <br>Jon Phillips 王✳ <a href="http://fabricatorz.com" target="_blank">http://fabricatorz.com</a> ✳ skype: kidproto ✳ irc: rejon<br>+1.415.830.3884 (global) ✳ +86-187-1003-9974 (beijing)
</div>