[Clipart] [Bug 19449] New: Ridiculous number of duplicates

bugzilla-daemon at freedesktop.org bugzilla-daemon at freedesktop.org
Wed Jan 7 13:43:22 PST 2009


http://bugs.freedesktop.org/show_bug.cgi?id=19449

           Summary: Ridiculous number of duplicates
           Product: openclipart.org
           Version: unspecified
          Platform: Other
        OS/Version: All
            Status: NEW
          Severity: normal
          Priority: medium
         Component: clipart
        AssignedTo: clipart at lists.freedesktop.org
        ReportedBy: jhansonxi at gmail.com


Created an attachment (id=21767)
 --> (http://bugs.freedesktop.org/attachment.cgi?id=21767)
Listing of cuplicate files

I installed the Windows version of the library and noticed a lot of duplicates
when adding them to the OpenOffice.org Gallery.  I then downloaded
openclipart-0.18-full.tar.bz2 and did a duplicate check (on Ubuntu) and found
thousands.  The commands I used are below and were run from the clipart
directory:

find . -type f -exec md5sum '{}' \; >md5_listing.txt
sort md5_listing.txt | uniq -d -w32 | cut -c 1-32 >md5_duplicates.txt
grep -f md5_duplicates.txt md5_listing.txt | sort >duplicates_listing.txt

My report is attached.


-- 
Configure bugmail: http://bugs.freedesktop.org/userprefs.cgi?tab=email
------- You are receiving this mail because: -------
You are the assignee for the bug.



More information about the clipart mailing list