<!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.0 Transitional//EN">
<HTML><HEAD>
<META http-equiv=Content-Type content="text/html; charset=iso-8859-1">
<META content="MSHTML 6.00.2900.2722" name=GENERATOR>
<STYLE></STYLE>
</HEAD>
<BODY bgColor=#ffffff>
<DIV><FONT face=Verdana size=2>Yes, the number of duplicates is huge, and I agree
with you that they should be deleted. The clipart collection must be cleaned of
them, even if this reduces the total number of items. But I think it would be
useful to check whether some duplicates were made deliberately in order to add
keywords (for example, somebody posted a drawing, then added keywords to it and
reposted the same drawing).</FONT></DIV>
<DIV><FONT face=Verdana size=2></FONT> </DIV>
<DIV><FONT face=Verdana size=2>Also, I have a question: how many drawings have
no keywords at all? And if there are any, how can we add keywords to them? I
ask because I'm worried about the poor (not to say nonexistent) search
abilities of the Openclipart website: are they caused by the fact that some
drawings lack keywords, or by a bug in the search engine?</FONT></DIV>
<DIV><FONT face=Verdana size=2></FONT><FONT face=Verdana
size=2></FONT> </DIV>
<DIV><FONT face=Verdana size=2>Mo.</FONT></DIV>
<DIV><FONT face=Verdana size=2></FONT> </DIV>
<BLOCKQUOTE
style="PADDING-RIGHT: 0px; PADDING-LEFT: 5px; MARGIN-LEFT: 5px; BORDER-LEFT: #000000 2px solid; MARGIN-RIGHT: 0px">
<DIV style="FONT: 10pt arial">----- Original Message ----- </DIV>
<DIV
style="BACKGROUND: #e4e4e4; FONT: 10pt arial; font-color: black"><B>From:</B>
<A title=tumaix@gmail.com href="mailto:tumaix@gmail.com">Tomaz Canabrava</A>
</DIV>
<DIV style="FONT: 10pt arial"><B>To:</B> <A
title=clipart@lists.freedesktop.org
href="mailto:clipart@lists.freedesktop.org">clipart@lists.freedesktop.org</A>
</DIV>
<DIV style="FONT: 10pt arial"><B>Sent:</B> Monday, September 19, 2005 11:16
PM</DIV>
<DIV style="FONT: 10pt arial"><B>Subject:</B> [Clipart] Duplicated items</DIV>
<DIV><BR></DIV>
<DIV><BR>I did a simple test of hashing every SVG in the .17 release, and
found that there were 1248 items with a duplicate entry (624 cliparts,
actually).</DIV>
<DIV>If they are removed, the entire package will shrink by 24
megabytes.</DIV>
<DIV>This is huge. I think that the next release should be made without these
duplicates,</DIV>
<DIV>even if this just reduces the number of items (but what counts: number
or quality?).</DIV>
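<DIV>A minimal sketch of such a hashing test, assuming the release is unpacked
into a directory tree of .svg files. This is not necessarily how the test
above was run: the function names, the choice of SHA-256, and the directory
layout are assumptions made for illustration.</DIV>
<PRE>
```python
import hashlib
from collections import defaultdict
from pathlib import Path


def find_duplicate_svgs(root):
    """Group .svg files under root by the SHA-256 digest of their bytes."""
    groups = defaultdict(list)
    for path in sorted(Path(root).rglob("*.svg")):
        digest = hashlib.sha256(path.read_bytes()).hexdigest()
        groups[digest].append(path)
    # Keep only the digests shared by two or more files.
    return {d: paths for d, paths in groups.items() if len(paths) > 1}


def summarize(duplicates):
    """Return (files involved, redundant copies, bytes saved).

    Bytes saved assumes one copy of each group is kept.
    """
    files = sum(len(paths) for paths in duplicates.values())
    redundant = sum(len(paths) - 1 for paths in duplicates.values())
    saved = sum(
        paths[0].stat().st_size * (len(paths) - 1)
        for paths in duplicates.values()
    )
    return files, redundant, saved
```
</PRE>
<DIV>Calling summarize(find_duplicate_svgs("path/to/release")) on a
hypothetical unpacked release would report the files involved, the redundant
copies, and the space saved. Hashing the bytes only finds exact copies: if
keywords are stored inside the file, a drawing reposted with added keywords
would hash differently and would not be reported.</DIV>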
<DIV> </DIV>
<DIV>Sorry for my English; I'm Brazilian, and I'm sleepy.</DIV>
<DIV>Tomaz Canabrava</DIV>
<P>
<HR>
<P></P>_______________________________________________<BR>clipart mailing
list<BR>clipart@lists.freedesktop.org<BR>http://lists.freedesktop.org/mailman/listinfo/clipart<BR></BLOCKQUOTE></BODY></HTML>